300 likes | 448 Vues
Indexing and Mining Biological Images. Christos Faloutsos CMU. THANKS. Outline. PART1: ViVo: Visual Vocabulary for cat retina images [PART2: other related work FALCON: relevance feedback for image by content Drosophila embryo image mining ]. PART1: ViVo.
E N D
Indexing and Mining Biological Images Christos Faloutsos CMU
Outline • PART1: ViVo: Visual Vocabulary for cat retina images • [PART2: other related work • FALCON: relevance feedback for image by content • Drosophila embryo image mining ]
PART1: ViVo with Ambuj Singh, Mark Verardo, Vebjorn Ljosa, Arnab Bhattacharya (UCSB) Jia-Yu Tim Pan, HJ Yang (CMU)
Detachment Development 1 day after detachment 3 days after detachment Normal 3 months after detachment 7 days after detachment 28 days after detachment
Data and Problem • (Data) Retinal images taken from cats • (Problem) What happens in retina after detachment? • What tissues (regions) are involved? • How do they change over time? • How will a program convey this info? • More than classification“we want to learn what classifier learned”
Main idea • extract characteristic visual ‘words’ • Equivalent to characteristic keywords, in a collection of text documents
V1 V2 Visual Vocabulary (ViVo) generation Visualvocabulary Step 3: ViVo generation Step 1: Tile image 8x12 tiles Step 2: Extract tile features Feature 2 Feature 1
skip ViVos
Evaluation of ViVo method • how meaningful are the discovered ViVos? • can they help in classification? • generality? • how else can they help biologists create hypotheses?
Goals: • how meaningful are the discovered ViVos? • can they help in classification? • generality? • how else can they help biologists create hypotheses?
Quality of ViVo – by classification Predicted Truth 86% accuracy 46 ViVos (90% energy)
Goals: • how meaningful are the discovered ViVos? • can they help in classification? • generality? • how else can they help biologists create hypotheses?
Protein images – MPEG7 CS Predicted Truth 84% accuracy 4 ViVos (93% energy) 1-NN classifier
Evaluation of ViVo method • how meaningful are the discovered ViVos? • can they help in classification? • generality? • how else can they help biologists create hypotheses? ‘ViVo-annotation’!
Automatic ViVo-annotation of images • A tile represents a ViVo vk if the largest coefficient of the tile is along the kth basis vector • A ViVo vk represents a class ci if the majority of its tiles are in that class • For each image, the representative ViVos for the class are automatically highlighted
Conclusions: • how meaningful are the discovered ViVos? • can they help in classification? • generality? • how else can they help biologists create hypotheses?
Outcome/status • ViVos: Automatic Visual Vocabulary generation for biomedical image mining, Bhattacharya, Ljosa, Pan, Verardo, Yang, Faloutsos, Singh; ICDM’05 (one of 5 best student paper award) • Software – MATLAB code • Tutorial in SIGMOD’05 (Murphy+Faloutsos)
Outline • PART1: ViVo: Visual Vocabulary for cat retina images • PART2: FALCON: relevance feedback for image by content: SEE DEMO, later • Ongoing work: Drosophila Fly Embryos
FALCON - Example query: Vs Inverted Vs Trader wants only ‘unstable’ stocks
Outline • PART1: ViVo: Visual Vocabulary for cat retina images • PART2: FALCON: relevance feedback for image by content: SEE DEMO, later • Ongoing work: Drosophila Fly Embryos
FEMine: Drosophila embryos • Feature extraction • ICA • query by image content, mining, clustering with Andre Balan, Eric Xing, Tim Pan
Conclusions Machine vision + Data mining + Data bases + Biology: => necessary partners