120 likes | 239 Vues
Explore challenges in mining large image datasets and advancements in perceptual classification, spatial data mining, and generalization algorithms by the Vision Research Lab at University of California, Santa Barbara. Discover their adaptive nearest neighbor search for relevance feedback and performance evaluation on extensive image collections.
E N D
Challenges in Mining Large Image Datasets Jelena Tešić, B.S. Manjunath University of California, Santa Barbara http://vision.ece.ucsb.edu
Introduction • Data and event representation • Meaningful data summarization • Modeling of high-level human concepts • Learning events • Feature space and perceptual relations • Mining image datasets • Feature set size and dimension • Size and nature of image dataset • Aerial Images of SB county • 54 images - 5428x5428 pixels • 177,174 tiles - 128x128 pixels Vision Research Lab
Visual Thesaurus • Perceptual Classification • T=1; SOM dim. red. of input training feature space • Assign labels to SOM output • LVQ finer tuning of class boundaries • It T< number_of_iterations { T=T+1; go back to step 2. } else END. Perceptual and feature space brought together: same class (16) and class 17 • Thesaurus Entries Generalized Lloyd Algorithm 330 codewords Vision Research Lab
x p distance q direction y Cρ(u,v) v u TEXTURE C O L O R SEC Spatial Event Cubes • Image tile raster space • Thesaurus entries • Spatial binary relation ρ • SEC face values • Multimode SEC Vision Research Lab
SEC 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1566 0 0 0 0 0 0 0 8 0 1 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0 1 0 1874 0 0 1 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 121 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 496 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 6 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 397 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 114 0 0 0 2 0 0 0 0 0 0 0 3825 2 0 0 0 0 0 0 8 0 0 0 0 5 3 50 0 0 0 72 0 0 0 2 0 0 0 1 4215 0 0 0 2 0 0 0 0 1 0 0 5 8 653 0 0 0 434 Cluster Analysis Visual Data Mining Vision Research Lab
Generalized Apriori Find all sets of tuples that have minimum support Use the frequent itemsets to generate the desired rules Low-level mining Occurrence of the ocean in the image dataset 2D 3D Spatial Data Mining Vision Research Lab
Ocean analysis Higher level Mining 890 434 653 Vision Research Lab
Conclusion • Visual mining framework • Spatial event representation • Image analysis at a conceptual level • Perceptual knowledge discovery • Demos: • http://vision.ece.ucsb.edu/texture/mpeg7/ • http://nayana.ece.ucsb.edu/registration/ • Amazon forest DV 40 hours – 5tbytes Mosaics from 2 h Vision Research Lab
Adaptive NN Search for Relevance Feedback • Relevance Feedback • learn user’s subjective similarity measures • Scalable solution • Explore the correlation of consecutive NN search • VA-file indexing • Feature space • Query • Distance Measure • - K nearest neighbors at iteration t • - distance between Q and the K-th farthest object • upper bound • - K-th largest upper bound of all approximations Vision Research Lab
Adaptive NN Search for Relevance Feedback • If is a qualified one in its lower bound must satisfy • When , it is guaranteed that more candidates can be excluded as compared with traditional search Vision Research Lab
vs. Their difference is larger at a coarser resolution vs. At coarser resolution, the estimate is better Performance Evaluation - 685,900 images Vision Research Lab
Performance Evaluation Adaptive NN search • Utilizing the correlation to confine the search space • The constraints can be computed efficiently • Significant savings on disk accesses Vision Research Lab