Biomedicine and Big Data

Biomedicine and Big Data Normal Analyzing spatio-temporal patterns in biomedical data Stiff Wavy

My Research Group Dr. Chakra Chennubhotla Ph.D. Computer Science University of Toronto Shannon Quinn B.S. Computer Science Georgia Tech Andrej Savol B.S. Applied Mathematics University of Pittsburgh Virginia Burger M.S. Mathematics University of Vienna

Our Mission • High-throughput biomedical data analysis

Problem and Solution • Biomedical and biological data are BIG • MapReduce! chunks C0 C1 C2 C3 Map Phase M0 M1 M2 M3 mappers IO0 IO1 IO2 IO3 Shuffling Data R0 R1 Reduce Phase Reducers FO0 FO1

Specifically… Clustering!

Requirements • Java • Apache Hadoop or Amazon EC2 • Apache Mahout • Comfortable with linear algebra • Ax = b • X = UΣUT • Hive, HBase, Giraph, GraphLab, etc optional but awesome

Final Thoughts • Distributed computing • Open source development • Programming at scale • Large project management • Software engineering principles, tools • Biomedical context • Biological data is huge • Diagnostics: helping people

Questions? Comments? Interested? • squinn@cmu.edu || spq1@pitt.edu

Biomedicine and Big Data

Biomedicine and Big Data

Presentation Transcript

Ontologies and Biomedicine

Big Data

Big Data and Data Mining

Big Data and NoSQL

Big Data, Big Knowledge, and Big Crowd

NLP and Big Data

Big Data

Big Data and Usability

Big Data and Analytics

Big Data

BIG Biomedicine and the Foundations of BIG Data Analysis

Big Questions, Big Data and Big Answers

HUMAN RIGHTS AND BIOMEDICINE

Biomedicine and Big Data

Big Data

Big Data

Big Data and Hadoop

Big Data and Hadoop

Big Data Training | Big Data Courses | Big Data Online Courses

Big Data Big Data

BIG Biomedicine and the Foundations of BIG Data Analysis

Ontologies and Biomedicine