1 / 8

MIS 451 Building Business Intelligence Systems

MIS 451 Building Business Intelligence Systems. Clustering (1). Problem. Target Marketing Diaper, Baby food, Swiss cheese and Belgian Toys chocolate . French Wine. Clustering.

caspar
Télécharger la présentation

MIS 451 Building Business Intelligence Systems

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. MIS 451Building Business Intelligence Systems Clustering (1)

  2. Problem • Target Marketing Diaper, Baby food, Swiss cheese and Belgian Toys chocolate French Wine

  3. Clustering • Clustering is a data mining method for grouping data points such that data points within the same cluster are similar and data points in different clusters are dissimilar. • How to calculate similarity between data points??

  4. Measuring Similarity • Continuous variable • Use distance to measure dissimilarity between data points • For two data points, distance between them can be measured in two ways • Manhattan distance • Euclidean distance

  5. Measuring Similarity • For two continuous data points X, Y, Manhattan distance is defined as:

  6. Measuring Similarity • Example of Manhattan distance • NAME AGE SPENDING($) • Sue 21 2300 • Carl 27 2600 • TOM 45 5400 • JACK 52 6000

  7. Measuring Similarity • For two continuous data points X, Y, Euclidean distance is defined as:

  8. Measuring Similarity • Example of Euclidean distance • NAME AGE SPENDING($) • Sue 21 2300 • Carl 27 2600 • TOM 45 5400 • JACK 52 6000 • Reading: Data mining book PP335-341

More Related