80 likes | 194 Vues
This guide offers insights into cluster computing applications for bioinformatics, focusing on efficient process management and shell scripting. Learn how to access the cluster using SSH, manage processes with commands like `ps`, `top`, and `kill`, and automate tasks through scripting. Discover how to utilize Sun Grid Engine (SGE) for job scheduling and load balancing, and run parallel programs using MPI. Gain proficiency in setting up your shell environment and executing commands that enhance performance and streamline workflow in bioinformatics projects.
E N D
Cluster Computing Applications for Bioinformatics • Thurs., Sept. 20, 2007 • process management • shell scripting • Sun Grid Engine • running parallel programs
Accessing the Cluster • ssh username@server • -X to enable X forwarding • ssh compute-#-# to access specific node • qrsh to access the least busy node • cluster-fork command to run on every node
Managing Processes • ps – list your running processes • -f : show file information • -e : list everyone's processes • top – current top processes by CPU and memory use • kill – terminate a process by number • killall to kill by program name • command & - run in background • bg - show background tasks • nice / renice – set priority
cd ~ ls -a vi .bash_profile echo $PATH The Shell • Unix command interpreter • bash – Bourne Again Shell • .bashrc and .bash_profile • settings for your shell environment
mkdir ~/bin cd /share/bio/examples/ cp makeseqdir ~/bin cd TFL makeseqdir Shell Scripting • Automate common tasks • create directory structure required for sequence assembly
cd /share/bio/examples/ cp assemble ~/bin assemble Distributed Shell Scripts • Preface CPU intensive commands with qrsh -cwd • qtcsh • shell that does this automatically based on ~/.qtask file • Does not work
Sun Grid Engine - SGE • Job queue and load balancing • commands: • qrsh / qtcsh • qstat -f : show status of jobs / queues • qdel : delete a job from the queue • qmon : graphical interface • qsub : submit job
cd /share/bio/examples cp .ncbirc ~ cp mpiblast.sh ~ cd ~ qsub -pe mpich 8 mpiblast.sh Running Parallel Programs • MPI – Message Passing Interface • must be launched with mpirun or as a script with qsub • mpiblast - parallel version of BLAST • modify ~/.ncbirc • first run mpiformatdb –nfrags=n