360 likes | 533 Vues
The Initiative in Innovative Computing at Harvard. Alyssa A. Goodman IIC Director & Prof. of Astronomy. Agenda. What is IIC? (“Filling the Gap”) Where did it come from? (A Story) What have we done so far? (Startup Mode) What are we about to do? (Projects, Hiring Plans)
 
                
                E N D
The Initiative in Innovative Computing at Harvard Alyssa A. Goodman IIC Director & Prof. of Astronomy
Agenda What is IIC? (“Filling the Gap”) Where did it come from? (A Story) What have we done so far? (Startup Mode) What are we about to do? (Projects, Hiring Plans) What do we hope to do? (Long-term Goals)
Filling the “Gap” between Science and Computer Science Scientific disciplines Computer Science departments Increasingly, core problems in science require computational solution Typically hire/“home grow” computationalists, but often lack the expertise or funding to go beyond the immediate pressing need Focused on finding elegant solutions to basic computer science challenges Often see specific, “applied” problems as outside their interests
Where did IIC come from? Short Version:Response to Harvard’s “expansion” in Science, and into Allston. See IIC Whitepaper (2004) & Task Force on Science & Technology report (2005) for more. Long Version…
IIC & NCSA/IACAT’s Shared Focus “We must make the cyberinfrastructure as easy to use as Mosaic made the Internet to use…. Raw computing power is still very important, but many scientific and engineering problems require integration of simulation, data sources (sensor arrays, telescopes, etc.), databases, and analysis and visualization, all in a distributed environment. Cyberenvironments will allow the scientist or engineer to fashion her own course, knowing that all of the capabilities in the cyberinfrastructure are reliably behind her … The tools and services in the cyberenvironments will include scientific and engineering applications and web services, graphical user interfaces and portals for easy interaction with the applications, and workflow and collaboration software to support complex, collaborative projects. A major component of this effort will be an integrated data analysis and visualization capability, which is needed by an increasing number of scientific and engineering communities.” --Thom Dunning (in Will, Ingenuity, Skill & Planning, 2006)
Computational challenges are common across scientific disciplines How to: Acquire, transmit, organize, and query new kinds of data? Apply distributed computing resources to solve complex problems? Derive meaningful insight from large datasets? Share, integrate and analyze knowledge across geographically dispersed researchers? Visually represent scientific results so as to maximize understanding? Opportunity to collaborate and apply insights from one field to another Early IIC Slide (1 yr old!)
Real World Workflowe.g. Emergency Medicine in the Age of High-Speed Networks, Fast Processors, Mass Storage, and Miniature Devices IIC/Harvard contact: Matt Welsh, DEAS
Continuum “Computational Science” Missing at Most Universities “Pure” Computer Science (e.g. Turing) “Pure” Discipline Science (e.g. Galileo)
Filling the “computational science” gap: IIC Problem-driven approach …focusing effort on solving problems that will have greatest impact & educational value Collaborative projects …combining disciplinary knowledge with computer science expertise Interdisciplinary effort …to ensure that best practices are shared across fields and that new tools and methodologies will be broadly applicable Links with industry …to draw on and learn from experience in applied computation Institutional funding …to ensure effort is directed towards key needs and not driven solely by narrow priorities of funding agencies
Science Departments CS Departments What is the right shape for that boundary? Where are the optimal “IIC” problems? HIgh “Never Mind” Domain Science Payoff Computer Science Department Low Low High Computer Science Payoff
IIC Research Branches( and Projects Draw upon >1 ) V AS I DC DB/P Plus…Educational Programs that bring IIC Science to Harvard students, and to the public at large.
Education is central to IIC’s mission At Harvard: Undergraduate & graduate courses focused on “data-intensive science” New graduate certificate program, within existing Ph.D. programs Research opportunities at undergraduate, graduate, and postdoctoral levels Beyond Harvard: New museum, highlighting the kind of science done at the IIC
V V DC DC DB/P DB/P AS AS I I IIC’s First Activities(2005-) Image & Meaning Collaboration IIC Seminar Series at Harvard Astronomical Medicine (IIC/CfA/HMS/MGH/BWH-SPL) 1st Call for Ideas (deadline was 3/15/06) V I V DC DB/P AS
“Image and Meaning” “I-M”=Working group of scientists, computer scientists, graphic artists, writers, publishers, designers organized and led by Felice Frankel of MIT (soon to be at IIC!) Goal: To increase both scientists understanding of their own data, and the public’s understanding of scientists’ findings, through graphical display. Activities: Large conferences at MIT in 2001 and Getty Center in 2005. Smaller “IM2.x” local workshops throughout 2006-7, including @ IIC. Upcoming IM/SIGGRAPH, in conjunction with SIGGRAPH 2007. Online community to be hosted by IIC, beginning later this year. (Social Network model.)
Responses to 1st IIC Call for Ideas V DC DB/P AS V DC DB/P AS V DC DB/P AS DC DB/P I V DC DB/P AS I V DC V DC DB/P V DC DB/P AS I V DC DB/P AS I V DC DB/P AS I V DC V DC DB/P AS
Wiring Diagram for a Complete Brain Circuit (Connectional Analysis of Synaptic Circuitry in the Mammalian Nervous System) 3D images from electron-microsope images of serial sections (slices) • Large volumes studies: up to 500 mm cubes • High resolution: 5nm x-y; 50 nm in z(105 x 105 x 104=1014 voxels) • Large datasets: 10-100 TB Potentially intractable computationally w/o a hierarchical approach Start with the large, dominant pathways: The biggest wires and the biggest excitatory connections. Use this as scaffolding to then solve other pathways: inhibition, lateral connections, feedback. V DC DB/P AS I
Modeling Blood Flow(Multiscale Hemodynamics) V DC DB/P AS Develop parallelization, visualization tools, to scale up to real applications Ultimate goal is MULTISCALE HEMODYNAMICS • Movie: Multiscale approach for translocation of DNA through a nanopore • Molecular Dynamics for DNA • Lattice Boltzmann Equation for the solvent • S. Melchionna, S. Succi, M. Fyta, L. Stein ,E. Kaxiras, M. Seltzer
Modeling Blood Flow(Multiscale Hemodynamics) V DC DB/P AS Develop parallelization, visualization tools, to scale up to real applications Ultimate goal is MULTISCALE HEMODYNAMICS
Virtual Observatory Portal Virtual Observatory Portal Default values are shown in green Data on:One object One Region A list of objects A list of regions I want: Spectra Images Catalogs (click all that apply) I want to: Use VO tools to browse data Download data to local computer Would you like help writing a script to do your query? Yes or No Continue V DC
Family history pedigree software toolkit Core Imaging Methodologies Topology differences in cocaine addiction Computational Framework A Computational Framework for Multimodal Studies in GENETICS, BIOLOGY, AND THE MIND V DC DB/P AS I Cortical Thickness AD vs. Controls Lab 4 Lab 1 Lab 3 Lab 5 Histological Correlates of AD Lab 2
Computational Framework A Computational Framework for Multimodal Studies in GENETICS, BIOLOGY, AND THE MIND V DC DB/P AS I “An Entire Disease or Condition of the Brain”
“Astronomical Medicine” Brigham & Women’s Hospital, Surgical Planning Lab Michael Halle (IIC) Massachusetts General Hospital, Martinos Center David Kennedy (IIC) Harvard-Smithsonian Center for Astrophysics Alyssa Goodman (IIC) Pepi Fabbiano Martin Elvis Jonathan McDowell IIC Douglas Alan (Systems Engineer) Michelle Borkin (Harvard Senior) Demo Movie
Project 1 Building the Best (Startup) Program V AS I DC DB/P
Project 2 Project 3 Building the Best (Startup) Program Project 1 V AS I DC DB/P
Agenda What is IIC? (“Filling the Gap”) Where did it come from? (A Story) What have we done so far? (Startup Mode) What are we about to do? (Projects, Hiring Plans) What do we hope to do? (Long-term Goals)
IIC will evolve over three phases Phase I 2005-08 Phase II 2008-10 Phase III 2011+ • Timing • IIC staffing level, combo of • new faculty • senior scientists • admin staff • Number of projects • Educational mission • New courses offered • Outreach programs • Other key milestones Total ~25 to ~100 ~3-5 to ~15 New courses to museum Evaluation schedule (internal, external committees)
Challenges In “Phase I” Result of “Allston” Science & Technology Task Force IIC intended to be a “University” (not a single school) initiative FAS (Faculty of Arts & Science) Constraints Faculty Appointments Non-Faculty Appointments Startup Space “Chicken-and-Egg” Problem with Recruiting Good, but not certain, Funding Prospects Role of DEAS Computer Science
Science Departments CS Departments Will departments hire“computationalists” with regular slots? How big is this overlap? “Challenges” How do we give Senior non-faculty similar statureto faculty? (e.g. P.I. rights, job security) HIgh “Never Mind” Domain Science Payoff Will CS/DEAS use slots forthese people? How big is that overlap? Computer Science Department Low Low High Computer Science Payoff
Sample Long Term Goal “3D Data Desk” Demo, using data from http://www.electoral-vote.com/2004/info/president.csv) Perseus file
IIC: Mission The Institute for Innovative Computing (IIC) will make Harvard a world leader in the innovative and creative use of computational resources to address forefront scientific problems. We will focus on developing capabilities that are applicable to multiple disciplines, by undertaking specific, well-defined projects, thereby developing tools and approaches that can be generalized and shared. We will foster the flow of ideas and inventions along the continuum from basic science to scientific computation to computational science to computer science. We will train a next generation of creative and computationally capable scientists, build linkages to industry, and communicate with the public at large.
The Initiative in Innovative Computing at Harvard Alyssa A. Goodman IIC Director & Prof. of Astronomy