
Presentation Transcript


  1. Non Parametric Methods Pattern Recognition and Machine Learning Debrup Chakraborty

  2. Nearest Neighbor classification Given: A labeled sample of n feature vectors (call it X) A distance measure (say the Euclidean distance) To find: The class label of a given feature vector x which is not in X

  3. Nearest Neighbor classification (contd.) The NN rule: Find the point y in X which is nearest to x. Assign the label of y to x.
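The rule above takes only a few lines; the sketch below is a minimal brute-force version, assuming the training set X, its labels y, and the query x are NumPy arrays (the function name nn_classify is illustrative).

import numpy as np

def nn_classify(X, y, x):
    # Euclidean distance from x to every training point in X
    distances = np.linalg.norm(X - x, axis=1)
    # Return the label of the closest training point
    return y[np.argmin(distances)]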

  4. Nearest Neighbor classification (contd.) This rule partitions the feature space into cells, each consisting of all points closer to a given training point than to any other training point. All points in such a cell are labeled by the class of that training point. This partitioning is called a Voronoi Tessellation.

  5. Nearest Neighbor classification (contd.) [Figure: Voronoi cells in 2-D]

  6. Nearest Neighbor classification Complexity of the NN rule: distance calculation (O(nd) for n training points in d dimensions) and finding the minimum distance (O(n)).

  7. Nearest Neighbor classification Nearest Neighbor Editing X = data set, n = number of training points, j = 0 Construct the full Voronoi diagram for X Do: j = j + 1; find the Voronoi neighbors of $x_j$; if any neighbor is not from the same class as $x_j$, then mark $x_j$ Until j == n Discard all points that are not marked.
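As a rough illustration of the editing step above, the sketch below uses SciPy's Delaunay triangulation (whose edges connect exactly the Voronoi neighbors) to mark boundary points; it assumes low-dimensional data where the triangulation is affordable, and the function name voronoi_edit is illustrative.

import numpy as np
from scipy.spatial import Delaunay

def voronoi_edit(X, y):
    # Delaunay edges connect the pairs of points whose Voronoi cells touch
    tri = Delaunay(X)
    indptr, indices = tri.vertex_neighbor_vertices
    marked = []
    for j in range(len(X)):
        neighbors = indices[indptr[j]:indptr[j + 1]]
        # Mark x_j if any Voronoi neighbor carries a different class label
        if np.any(y[neighbors] != y[j]):
            marked.append(j)
    # Discard all points that are not marked (interior points of each class)
    return X[marked], y[marked]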

  8. k nearest neighbor classification Given: A labeled sample of N feature vectors (call it X) A distance measure (say the Euclidean distance) An integer k (generally odd) To find: The class label of a given feature vector x which is not in X

  9. k-NN classification (contd.) Algorithm: Find the k nearest neighbors of x in X; call them $x_1, \dots, x_k$. Out of these k samples, let $k_i$ of them belong to class $c_i$. Choose that $c_i$ to be the class of x for which $k_i$ is maximum.
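A minimal sketch of this majority-vote rule, again assuming NumPy arrays for X and y; the helper name knn_classify and the default k=3 are illustrative.

import numpy as np
from collections import Counter

def knn_classify(X, y, x, k=3):
    distances = np.linalg.norm(X - x, axis=1)
    # Indices of the k training points closest to x
    nearest_k = np.argsort(distances)[:k]
    # k_i = number of the k neighbors belonging to class c_i; pick the largest
    return Counter(y[nearest_k]).most_common(1)[0][0]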

  10. k-NN Classification [Figure: a query point z among training samples from Class 1, Class 2, and Class 3]

  11. k-NN classification (contd.) Distance-weighted nearest neighbor Training set: $\{(x_i, f(x_i))\}_{i=1}^{n}$. Given an instance $x$ to be classified, let $x_1, \dots, x_k$ be the $k$ nearest neighbors of $x$. Return $\hat{f}(x) = \arg\max_{v} \sum_{i=1}^{k} w_i\,\delta(v, f(x_i))$, where $w_i = 1/d(x, x_i)^2$ and $\delta(a,b)=1$ if $a=b$, else $0$. In case $x = x_i$, return $f(x_i)$.
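A sketch of the distance-weighted vote, assuming the common weighting $w_i = 1/d(x, x_i)^2$ quoted above; the function name weighted_knn_classify and the default k=3 are illustrative.

import numpy as np
from collections import defaultdict

def weighted_knn_classify(X, y, x, k=3):
    distances = np.linalg.norm(X - x, axis=1)
    nearest_k = np.argsort(distances)[:k]
    scores = defaultdict(float)
    for i in nearest_k:
        if distances[i] == 0.0:
            # x coincides with training point x_i: return f(x_i) directly
            return y[i]
        # w_i = 1 / d(x, x_i)^2, so closer neighbors get a larger vote
        scores[y[i]] += 1.0 / distances[i] ** 2
    return max(scores, key=scores.get)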

  12. Remarks on k-NN classification • The distance-weighted kNN is robust to noisy training data and is quite effective when it is provided with a sufficiently large set of training examples. • One drawback of the kNN method is that it defers all computation until a new query point is presented. Various methods have been developed to index the training examples so that the nearest neighbor can be found with less search time; one such indexing method is the k-d tree developed by Bentley (1975). • kNN is a lazy learner.

  13. Locally Weighted Regression • In the linear regression problem, to find h(x) at a point x we would do the following: • Minimize $J(\theta) = \frac{1}{2}\sum_{i=1}^{n}\left(\theta^{T}x_i - y_i\right)^2$ over $\theta$ • Output $h(x) = \theta^{T}x$

  14. Locally Weighted Regression • In the locally weighted regression problem we would do the following: • Minimize $J(\theta) = \frac{1}{2}\sum_{i=1}^{n} w_i\left(\theta^{T}x_i - y_i\right)^2$ over $\theta$ • Output $h(x) = \theta^{T}x$ • A standard choice of weights is $w_i = \exp\left(-\frac{\|x_i - x\|^2}{2\tau^2}\right)$ • $\tau$ is called the bandwidth parameter
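Putting the two slides together, the sketch below solves the weighted least-squares problem with Gaussian weights via the normal equations; X, y, and the bandwidth tau follow the slide's notation, while the function name lwr_predict and the use of a pseudo-inverse are implementation assumptions.

import numpy as np

def lwr_predict(X, y, x_query, tau=1.0):
    # Augment the inputs with a constant 1 so theta includes an intercept
    X_aug = np.hstack([np.ones((len(X), 1)), X])
    xq = np.hstack([1.0, np.atleast_1d(x_query)])
    # Gaussian weights: points near x_query dominate the fit
    w = np.exp(-np.sum((X - x_query) ** 2, axis=1) / (2 * tau ** 2))
    W = np.diag(w)
    # Minimize sum_i w_i (theta^T x_i - y_i)^2 via the weighted normal equations
    theta = np.linalg.pinv(X_aug.T @ W @ X_aug) @ X_aug.T @ W @ y
    # Output theta^T x at the query point
    return xq @ theta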

  15. Clustering Is different from Classification: Classification partitions the feature space, whereas Clustering partitions the data into "homogeneous groups". Clustering is Unsupervised!!

  16. K-means Clustering Given: A data set $X = \{x_1, \dots, x_n\}$ Fix the number of clusters K Let $v_i^{(k)}$ represent the i-th cluster center (prototype) at the k-th iteration Let $S_j^{(k)}$ represent the j-th cluster at the k-th iteration

  17. K-means Clustering Steps • Choose the initial cluster centers $v_1^{(1)}, \dots, v_K^{(1)}$ • At the k-th iterative step, distribute the points of X among the K clusters using: $x \in S_j^{(k)}$ if $\|x - v_j^{(k)}\| \le \|x - v_i^{(k)}\|$ for all $i = 1, \dots, K$ • Compute the new centers as cluster means: $v_j^{(k+1)} = \frac{1}{|S_j^{(k)}|}\sum_{x \in S_j^{(k)}} x$ • If $v_j^{(k+1)} = v_j^{(k)}$ for all $j$, then the procedure has converged; else repeat from step 2
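The four steps above map directly onto a short loop; the sketch below assumes NumPy arrays, random initial centers drawn from the data, and a maximum iteration count as a safeguard (kmeans, max_iter, and seed are illustrative names).

import numpy as np

def kmeans(X, K, max_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    # Step 1: choose the initial cluster centers (here, K distinct training points)
    centers = X[rng.choice(len(X), size=K, replace=False)]
    for _ in range(max_iter):
        # Step 2: assign each point to the cluster with the nearest center
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = np.argmin(dists, axis=1)
        # Step 3: recompute each center as the mean of its cluster
        new_centers = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                                else centers[j] for j in range(K)])
        # Step 4: stop when the centers no longer change
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, labels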
