190 likes | 328 Vues
Join us for the 2011 Galaxy Research Computing Workshop led by Curtis Hendrickson at UAB. Learn about Galaxy, the open-source web application designed for data analysis and command-line-free cluster integration. This workshop covers multi-terabyte storage in UAB cloud, managing data workflows, and sharing protocols. We will focus on Next Generation Sequence analysis and explore EMBOSS tools, data upload methods, and handling large datasets. Ideal for those interested in improving their biomedical informatics skills.
E N D
Tutorial: Welcome to Galaxy Research Computing Workshop 2011 Curtis Hendrickson (curtish@uab.edu) CCTS Biomedical Informatics Service
What is Galaxy? • Open source web application for data analysis • Command-line free cluster integration • Multi-Terabyte “scratch” storage in UAB cloud • Galaxy manages data and workflows/protocol • Allows sharing of data & protocols • Plug-ins for each tool, can add your own tools • Today, we will discuss using it for Next Generation Sequence analysis • Note that EMBOSS sequence analysis tools installed
First Time Login • https://galaxy.uabgrid.uab.edu • Re-direction to BlazerID login
Homepage Tools History
5. Get Data – BIG FILES move to UAB cloud Get account on Cheaha cluster Secure Transfer: scp, SSH Secure File Transfer http://www.uab.edu/it/software Transfer to /lustre/scratch/user/projects_name Uncompress (/share/apps/galaxy/galaxy-tools/bin/q_gunzip) Get 10Gb network connection from IT! Desktop Cheaha
5. Get Data – BIG FILES Bring UAB cloud scratch files into Galaxy
Get Data – UCSC Annotation Click Dataset “title” to see metadata Click Eyeball to show data
Get Data – UCSC Annotation Click “pencil” to edit metadata Click “disk” to download
Get Data – UCSC Annotation Eyeball of large data shows a sample