1 / 19

ISQS 6339 Data Management and Business Intelligence Database Review

Zhangxi Lin Texas Tech University. ISQS 6339 Data Management and Business Intelligence Database Review. Attributes of Data. Sharable. Moveable. Secure. Accurate. Timely. Relevant. Data hierarchy. Bits. Characters. Fields (columns). Records (rows). Files (table). Database.

synclair
Télécharger la présentation

ISQS 6339 Data Management and Business Intelligence Database Review

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Zhangxi Lin Texas Tech University ISQS 6339 Data Management and Business IntelligenceDatabase Review ISQS 6347, Data & Text Mining

  2. Attributes of Data Sharable. Moveable. Secure. Accurate. Timely. Relevant.

  3. Data hierarchy Bits. Characters. Fields (columns). Records (rows). Files (table). Database.

  4. Why Build a Database? Handle large amounts of data. Satisfy multiple users. Make information retrieval faster. Make data input faster. Provide greater accuracy.

  5. Database versus Database Management System (DBMS) Database is a self-describing collection of integrated files. A DBMS is a complex computer program that acts as a data librarian, supervising the transfer of data between the end user and the database.

  6. Relational Model • Relation? Attribute?Tuple? • Keys. • Primary and foreign. • Referential integrity. • Relational algebra. • Relational Calculus.

  7. Relational DB Rules Every row must have exactly the same number of columns (fields or attributes). Each row can have only one value stored in each column (fields or attributes). A column must contain the same kind of value in every row of that column. No two rows can be exactly the same. The order of the rows or of the columns can’t be used to provide information.

  8. Terminology Data Processing Informal Relational DB Formal Relational DB File Table Relation Record Row Tuple Field Column Attribute

  9. Normalization • Purpose: • Avoid anomalies. • Not delete something you wish to keep while deleting something you do not want to keep. • Not having to add something that is unnecessary while adding something that is necessary. • Reduce redundancy. • Process: • Successive application of rules. • Bottom-up (data drives process). • Move from first through fifth normal form. • Does it make more or less tables?

  10. Entity Relationship Modeling • List the entities or objects in the environment. • People, things, transactions. • Describe the relationship between them • A single row in table A can be related to how many rows in table B (one or many). • A single row in table B can be related to how many rows in table A (one or many).

  11. Example E/RD

  12. SQL • Definition (DDL). • CREATE, ALTER, DROP. • Manipulation (DML). • SELECT, INSERT, UPDATE, DELETE. • The most used SQL command • SELECT

  13. General Format of SELECT SELECT [DISTINCT] item(s) FROM table(s) [WHERE condition] [GROUP BY columns] [HAVING condition] [ORDER BY row(s)]

  14. Case Study - IMW ISQS 6347, Data & Text Mining

  15. ISQS 6347, Data & Text Mining Want a house? Find one from the web • Up to 50% of prospective American homebuyers use the Internet to search for new homes, encompassing more than 9% of households online • Or about six million visitors, to various real estate sites. • These users have accessed the real estate sites on an average of 1.8 days per month or a total of 13.9 minutes each day. • From 1996 to 2000, more than 400 business models were created across the entire real estate spectrum.

  16. About IMW ISQS 6347, Data & Text Mining Based in Austin, Texas, IMW (Internet Media Works!) is an ASP, specialized mainly in web-based application development, database integration, and web development and hosting for all kinds of businesses. IMW has been more successful in selling its e-business services for commercial real estate. Its services include lead generation, real estate transaction management, property listing, realtor membership management, real estate indices, real estate auctions, etc., with COMMREX as a complete e-business solution. IMW used to have up to 6 full-time employees and a few part-time employees.

  17. IMW’s Services IMW’s Web-Based Application Services Website Hosting Services Optional Website Hosting Services Public User Application Services Core Membership Database Services Optional Membership Database Services Core Property Listing Database Services Optional Property Listing Database Services Public User Support Networking and System Operation Services Internet Service Provider’s Services ISQS 6347, Data & Text Mining

  18. IMW’s Data Model Property Listing Database Membership Database Property ID M:1 Listor ID Listor ID Listor Name    M:M Property Type Property Type    Type Name Address Office City    Subtype 1 Chapter    Subtype 2 Chapter M:M Functions    Feature Specializations Subtype n          Office M:1 Legends Company ID Company ID Primary Key Company Name    Address    Secondary Key    Telephone # Link to a table    ISQS 6347, Data & Text Mining

More Related