talon
Uploaded by
6 SLIDES
256 VUES
80LIKES

High-Performance Computing in Bioinformatics: Supercomputers vs. Clusters

DESCRIPTION

Explore the nuances of high-performance computing (HPC) in bioinformatics as presented by Nick Lindberg at the MCW Bioinformatics User Group. Discover the differences between supercomputers, like IBM’s Blue Gene and NCSA’s Blue Waters, and HPC clusters made up of commodity servers. Supercomputers offer powerful, singular computing capabilities, while clusters excel in parallel tasks, suited for bioinformatics workloads. Learn about job submission, data processing, and how these systems transform raw data into actionable knowledge through advanced computing techniques.

1 / 6

Télécharger la présentation

High-Performance Computing in Bioinformatics: Supercomputers vs. Clusters

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. High Performance Computing & Bioinformatics PerformanceNick Lindberg MCW Bioinformatics User Group

  2. “If you were plowing a field, which would you rather use: two strong oxen or 1024 chickens?”-Seymour Cray MCW Bioinformatics User Group

  3. HPC: Computing Platforms • Supercomputer (Oxen) • Fast, expensive, custom CPU/memory architecture and high-speed interconnect facilitating parallelization of a single computation • Appears as a singular, large computer • IBM “Bluegene” • NCSA “Blue Waters” • HPC Cluster (Chickens) • A lot of cheaper, commodity servers/CPUs with loosely coupled, slower (but still fast) interconnect (Ethernet/Infiniband) • Excels at ‘embarrassingly parallel’ batch computing as well as MPI-enabled parallel computing • Marquette’s “Pere” • MKEI’s “hpc01” MCW Bioinformatics User Group

  4. Matrix Multiplication C[0,0] = A[0,0]*B[0,0] + A[0,1]*B[1,0] + A[0,2]*B[2,0] C[0,0] A[0,o] A[0,1] A[0,2] B[0,0] B[1,0] For (i=0; i<3; i++) { C[i, j] = A[i, k] * B[k, j] B[2,0] MCW Bioinformatics User Group

  5. HPC Cluster Job/Software Stack Internet (Remote Users) Head/Login Node [Job Submission] Scheduler (Torque/Moab) R Bowtie2 Matlab 1 GigE/Infiniband Internode Communication Compute Nodes /scratch Shared Storage: Home Directories, Software, Scratch /usr/home /scratch

  6. Data to Information: Where HPC Fits In Raw Data Target Data Processed Data Transformed Information Patterns Knowledge Pattern Recognition Data Processing Interpretation Sampling on customer devices, servers, end points Feature extraction and content resolution, database construction/storing Dimension reduction, matrix/vector building, pre-process formatting Association classifications, algorithm application, simulation/modeling Visualization and validation, model reconstruction, feedback

More Related
SlideServe
Audio
Live Player
Audio Wave
Play slide audio to activate visualizer