1 / 31

Accelerating Discovery in Science and Engineering

Accelerating Discovery in Science and Engineering. Fabrizio Gagliardi Director – EMEA & LATAM Technical Computing Microsoft Corporation. Introduction. Some personal introductory remarks Progress in grid computing Microsoft progress in HPC Microsoft technology for science

Télécharger la présentation

Accelerating Discovery in Science and Engineering

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Accelerating Discovery in Science and Engineering Fabrizio Gagliardi Director – EMEA & LATAM Technical Computing Microsoft Corporation

  2. Introduction • Some personal introductory remarks • Progress in grid computing • Microsoft progress in HPC • Microsoft technology for science • Engagements in science • Conclusions

  3. Some personal introductory remarks • I am again here: since 2001 I have not missed this event a single time! • Happy to be associated with the pioneering work of Poland in HPC, networking and Grid computing • Honoured to witness the present success • Good opportunity to review the progress of my activity since last year • Last year I spoke about e-infrastrcuture, Grids and Microsoft plans for Science • Let’s review the progress now

  4. Progress in grid computing • Microsoft has sponsored GGF16 and GGF17 and took the initiative of proposing a HPC profile within the OGSA WG; a Data Management profile is also being discussed • On the application side we were prime sponsor at HealthGrid in Valencia with a key note by David Heckerman (AIDS vaccine research) • Rapid adoption from IT industry is essential for the future of Grid technology : GGF and EGA have merged in the Open Grid Forum (OGF) and held the first conference in Washington early September this year • Industry is now represented in a board of directors: all major vendors including Microsoft (Tony Hey) • Microsoft is also participating in the AdCom (myself) and in some of the WGs (OGSA and Security)

  5. Progress in grid computing 2 • Major issues which still remain to bring grid computing from academy to industry and commerce are: • Security • Interoperability • Easy to integrate and use • Reliability of the infrastructure • Adequate new business models • Microsoft is now considering most of those issues in the context of OGF

  6. Microsoft progress in HPC • Windows Computer Cluster Software released • Microsoft HPC institutes successful experience around the world

  7. Head Node Active Directory Job Mgmt Cluster Mgmt Scheduling Resource Mgmt Desktop App Job Policy, reports Admin Console User User Console Admin Management Cmd line Input Cmd line Job Data DB/FS Node Manager High speed, low latency interconnect Job Execution User App MPI Microsoft Compute Cluster Server • What it does : • Solution for High-Performance Computing application at a medium-low range of the scale • Simplified administration and job management • Built-in job scheduler and MPI lib • Four basic job scheduling policies supported in V1 • Key advantages: • Fully integrated cluster solution • Interoperability with Unix systems • Leverages existing Windows infrastructure and security

  8. HPC Innovation Centers Nizhni Novgorod University Nizhni Novgorod, Russia University of VirginiaCharlottesville, VA USA Southampton UniversitySouthampton, UK TACC – University of TexasAustin, TX USA Tokyo Institute of TechnologyTokyo, Japan University of UtahSalt Lake City, UT USA Cornell Theory CenterIthaca, NY USA HLRS – University of StuttgartStuttgart, Germany Shanghai Jiao Tong UniversityShanghai, PRC University of TennesseeKnoxville, TN USA Institutes for High Performance Computing

  9. HPC Market Trends

  10. Top 500 Supercomputer Trends Clusters over 50% Industry usage rising GigE is gaining x86 is leading

  11. Supercomputing Goes Personal

  12. Technology challenges Moore’s law continues but power consumption and heat dissipation are reaching their limits Memory and data access gap widen Applications become more data intensive

  13. The Future: Supercomputing on a Chip IBM Cell processor 256 Gflops today 4 node personal cluster => 1 Tflops 32 node personal cluster => Top100 MS Xbox 3 custom PowerPCs + ATI graphics processor 1 Tflops today $300 8 node personal cluster => “Top100” for $2500 (ignoring all that you don’t get for $300) Intel many-core chips “100’s of cores on a chip in 2015” (Justin Rattner, Intel: http://www.hpcwire.com/hpc/629783.html) “4 cores”/Tflop => 25 Tflops/chip

  14. The Microsoft project in Barcelona Microsoft is interested in helping computer scientists to develop new computing architectures with a high level of parallelism Mateo Valero and his BSC centre in Barcelona are leaders in this field in Europe Microsoft will collaborate with BSC to research and develop an entirely new parallel computing ecosystem http://www.hpcwire.com/hpc/633342.html

  15. Microsoft Technical Computing: Radical Computing Research in potential breakthrough technologies Advanced Computing for Science and Engineering Application of new algorithms, tools and technologies to scientific and engineering problems High Performance Computing and tools Application of high performance clusters and database technologies to industrial applications Application of existing and new tools for science

  16. Can “Here and Now” technologies accelerate discovery? Can “Business” Tools and techniques for dealing with be used in scientific research to allow researchers to be scientists and not computer scientists…

  17. Real-world Data Persistent Distributed Data Workflow, Data Mining& Algorithms Interpretation & Insight ComputationalModeling

  18. Real-world Data Persistent Distributed Data Workflow, Data Mining& Algorithms Interpretation & Insight ComputationalModeling

  19. The Problem for the e-Scientist Data ingest Managing a petabyte Common schema How to organize it? How to reorganize it? How to coexist & cooperate with others? Experiments & Instruments facts questions facts ? Other Archives facts answers Literature facts Simulations • Data Query and Visualization tools • Support/training • Performance • Execute queries in a minute • Batch (big) query scheduling

  20. Persistent Distributed Storage Visual Programming

  21. Distributed Computation Interoperability & Legacy Support via Web Services

  22. Searching & Visualization Live Documents Reputation & Influence

  23. Windows Compute Cluster Server Faster Time to Insight Better integration to existing Windows infrastructure Integrated and familiar development environment

  24. Integrate Analyze Report Research • Data acquisition from source systems and integration • Data transformation and synthesis • Data enrichment, with business logic, hierarchical views • Data discovery via data mining • Data presentation and distribution • Data access for the masses

  25. Comparison of soil moisture Thanks to Gretchen Miller – UC Berkeley & Catharine Van Ingen (MSR)

  26. Platform ServicesWorkspaces, Mgmt, Security, Storage, Topology, Site Model SharePoint Products and TechnologiesMicrosoft Office SharePoint Server 2007 Server-based Excel spreadsheets and data visualization, Report Center, BI Web Parts, KPIs/Dashboards Docs/tasks/calendars, blogs, wikis, e-mail integration, project management “lite”, Outlook integration, offline docs/lists Business Intelligence Collaboration Enterprise Portal template, Site Directory, My Sites, social networking, privacy control Rich and Web forms based front-ends, LOB actions, enterprise SSO BusinessForms Portal Content Management Search Integrated document management, records management, and Web content management with policies and workflow Enterprise scalability, contextual relevance, rich people and business data search

  27. High quality web rending Zero-footprint Interactive: Set parameters, sort, filter, explore Limit to browser access Browser Excel 2007 View and Interact Publish Spreadsheets Design and author Export/Snapshot into Excel Excel 2007 Programmatic Access Open in Excel for rich exploration and analysis Open snapshots SharePoint platform and Excel services Spreadsheets stored in document libraries Spreadsheet calculation and rendering External data retrieval and caching 100% calculation fidelity Customapplications Set values, perform calculations, get updated values via web services Retrieve full workbook file Excel ServicesOverview

  28. .NET & Visual Studio • F# • Iron Python • SQL Sever • SQL Server analysis Services • Windows Workflow • SharePoint Server 2007 • Instant Messenger • ConferenceXP • Academic Live, Onfolio… Development: Data: Workflow: Collaboration: Publications:

  29. Questions to our scientist colleagues? • Can these tools/technologies provide value/insight to scientists? • What’s missing? • Ie. on HPC, analysis, etc? • How best to test/integrate these technologies? • How to communicate these ideas? • Conferences, Workshops, Website? • Sharecode, Samples

  30. Conclusions • Industry is moving HPC to commodity • Microsoft is world leader in commodity computing and will play a major role in scientific and technical computing solutions • Key figures in scientific computing such as Burton Smith, Tony Hey have recently joined the company in senior strategic positions • We are interested in getting your opinion and collaborating with you to develop the most productive computing environment for science • Thanks again for the invitation and see you next year!!!

  31. More info: • Windows HPwww.windowshpc.net • Data miningwww.sqlserverdatamining.com/ • Develop without Borders Challengewww.developwithoutborders.com • Technical Computing Blog http://blogs.msdn.com/eScience

More Related