1 / 61

Directions in Open Science

Directions in Open Science. Mike Travers SRI Bioinformatics Research Group. For AIC Lunch and Learn, 30 Jan 2012. About this talk . Partly a trip report from Open Science Summit 2011 Partly an attempt to define open science and explore its impact

davin
Télécharger la présentation

Directions in Open Science

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Directions in Open Science Mike Travers SRI Bioinformatics Research Group For AIC Lunch and Learn, 30 Jan 2012

  2. About this talk • Partly a trip report from Open Science Summit 2011 • Partly an attempt to define open science and explore its impact • Partly an excuse to talk about some of my own vaguely related work • And partly some semi-crazy speculation about future projects in this space

  3. The Open Science Summit unites researchers, life science industry professionals, students, patients and other stakeholders to discuss the future of collaborative science and innovation. …in-depth sessions on new models for drug discovery and clinical trials, personal genomics, the patent system, the future of scientific publications, and more.

  4. What is Open Science? • Many different things, but boils down to: • Removing barriers to scientific communication and collaboration: • Social • Technical • Legal • Economic • Bureaucratic • To accelerate scientific progress • Utilizing modern technology

  5. Driven by technological change • The Internet has radically reduced communication costs • So old institutions of scientific communication are now obstacles • Closed academic publishers, notably: • Internet will transform scientific media just like it has newspapers, TV, social life…. • The difference is: science is more important than sharing cat pictures

  6. For-profit academic publishing is a racket A very lucrative one Starting to be rumbles of complaint (boycotts) from academics

  7. Open • Most visible and successful branchof open science • Articles are free to read, payto publish • Funders are starting to requiresome form of public access

  8. Gold: OA journal, Green: OA self-archivingOpen Access to the Scientific Journal Literature: Situation 2009, PLoS ONE, Bo-ChristerBjörk et al

  9. Research Works Act • H.R.3699 – “A bill to ensure the continued publication and integrity of the peer-reviewed research works by the private sector.” No Federal agency may adopt, implement, maintain, continue, or otherwise engage in any policy, program, or other activity that-- (1) causes, permits, or authorizes network dissemination of any private-sector research work without the prior consent of the publisher of such work; or (2) requires that any actual or prospective author, or the employer of such an actual or prospective author, assent to network dissemination of a private-sector research work.

  10. Myth 1: American consumers have a right to free access to articles their tax dollars fund. Fact American taxpayers do not fund peer reviewed research articles; they fund some of the research that is used in those articles…

  11. Beyond Open Access • Not going to say a whole lot about OA, because: • It’s easy to understand • It’s pretty clearly going to win in the long term • By itself, not a very radical change to how science is done: • Knowledge is still in paper-sized chunks • Papers are peer-reviewed prior to publication; • Once something is published, it’s static • All these parameters are being challenged in some way by other efforts • George Whitesides (Harvard chemist): “The concept of the scientific paper is eroding before our very eyes”

  12. Variations on publishing • “Peer review is broken” • Too slow • Too biased • Too rigid • May be “the worst system except for all the others” • Pre-peer-review publication • EgarXiv.org • Micropublication • Crowdsourcing, blogs, wikis…. • Open-notebook science • No gap at all between bench and publication • Database-linked publications • Dynamic Review Papers

  13. Biggest sequencing operation in the world Generating 6 terabytes/day of genomic data Open-Source Genomic Analysis of Shiga-Toxin–Producing E. coli O104:H4 Rohde et al 2011 (NEJM) Toxic E. coli outbreak in Germany May 2011: We released these data into the public domain… which elicited a burst of crowd-sourced, curiosity-driven analyses carried out by bioinformaticians on four continents. Twenty-four hours after the release of the genome, it had been assembled; … Five days after the release of the sequence data, we had designed and released strain-specific diagnostic primer sequences, and within a week, two dozen reports had been filed on an open-source wiki …dedicated to analysis of the strain Sequenced the rice genome https://github.com/ehec-outbreak-crowdsourced

  14. GigaScienceis a new integrated database and journal co-published in collaboration between BGI Shenzhen and BioMed Central, to meet the needs of a new generation of biological and biomedical research as it enters the era of "big-data."

  15. Dynamic Review Papers Conventional paper Paired withDynamically-updated,wiki-based paper/database/model

  16. Driving apps

  17. Who comes to Open Science Summits?

  18. Activist Organizations

  19. Participatory Medicine& Disease Foundations

  20. Startups Social paper and citation management Scientific servicesmarketplace Web-based moleculelibrary management

  21. Citizen Science

  22. Somewhat less garage-y • Independent research institute, started from data released by Merck • Repository of experimental data (Sage Commons) • Network of cooperating institutions • Starting to build a computational platform (Synapse)

  23. Synthetic Biology

  24. And some individual researchers • Peter Murray-Rust Chemist, Cambridge, promoter of Chemical Markup Language and semantic web“Closed science makes people die!” • Victoria StoddenStatistician, Columbia, reproducibility of computational science(cfClimateGate)

  25. Some open science success stories • Galaxy Zoo • FoldIt • Nutrient Network (NutNet) • Prazinquantel synthesis

  26. Galaxy Zoo • Citizen science (loosely) • Image classification task • Mechanical Turk-like approach (but unpaid) • About 200K participants • Discovered a whole new class of galaxies (“green pea”) and a quasar mirror • 22 published papers in 3 years

  27. Social sharing of algorithms (“recipes”) Descent with modification

  28. Matthew Todd, chemist at U of Syndney Schistosmiasis Looking for synthesis for known drug Prazinquantel (PZQ) in enantiopure form Open-notebook science (LabTrove)

  29. Nutrient Network (NutNet)

  30. What paper has the most authors? • NutNet paper:40 authors, 41 institutions • This one from SLAC and elsewhere:407 authors, but only 35 institutions

  31. Three variations on the scientific process • Automated Science • Distributed Science • Web-scale Intelligent Science • Open Science as the lubrication / accelerant that makes these feasible

  32. Afferent: Automation for Drug Discovery • Combinatorial Chemistry • Planning software to drive lab robots

  33. Distributed Science • Some science (eg evaluation of drug candidates) is highly parallelizable, • Hence distributable • CollabRx was initially supposed to support “virtual pharma companies” that would tie disparate academic research efforts into focused teams

  34. Web-scale Intelligent Science • Imagine all of science as a giant distributed computational process • Individual scientists are agents • working on a small part of the problem • Sharing their results • Getting feedback and funding dependent on success • Centralized data integration and decision tools used to help determine next useful experiment

  35. Steps towards distributed intelligence • Adaptive clinical trials • Rather than a classical trial with two arms run to completion • Change the distribution of test cases based on ongoing results • Now imagine this strategy applied more globally across all treatments for a disease • Credit for this slightly mad vision goes mainly to Marty Tenenbaum: • AI Meets Web 2.0 (2006) • Shrager, Tenenbaum, Travers, Cancer Commons: Biomedicine in the Internet Age (2011)

  36. What does all that have to do with Open Science? • Open Science is lowering barriers to collaboration • So it’s a necessary but not sufficient step towards this new kind of science • CollabRx may just have been too early: • the groundwork hasn’t been laid yet, • we are still working on basics • (eg standards for representation) • Reducing friction (or transaction costs) can be incredibly important

  37. “Changing the cost of innovation fundamentally changes the nature of innovation” – Joichi Ito TCP, HTTP etc are the containerization of data. So what’s the analog for scientific knowledge?

  38. Standardized Legal and Institutional Mechanisms

  39. A mix of technical, institutional, and legal standardization: • Standard licenses (parameterizable) • RDF representation for licenses. • Web Tools to generate these • Sites that collect and “market” available materials.

  40. BioBike, a platform for open science • Conceived of as a vehicle for getting biologists to do their own knowledge-based biocomputing. • Lisp + Frame system + Bioinformatics Tools • Through-the-web programmability • Community sharing of code and data • Visual Programming Language • Open Source  Jeff Elhai, Arnaud Taton, J. P. Massar, John K. Myers, Michael Travers, Johnny Casey, Mark Slupesky, Jeff Shrager. BioBIKE: A Web-based, programmable, integrated biological knowledge base. Nucleic Acids Research, 2009 

More Related