Download
slide1 n.
Skip this Video
Loading SlideShow in 5 Seconds..
flow PowerPoint Presentation

flow

126 Vues Download Presentation
Télécharger la présentation

flow

- - - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript

  1. flow charts make science less hard

  2. Importing honey bee data into lightning 3 Click here to get started

  3. Once the information reloads based on your settings, click on your organism - Kingdom - Group - Subgroup

  4. species selection

  5. species selection assembly

  6. species selection assembly

  7. Download the file to your computer

  8. Checkpoint: data retrieved • Now it’s time for you to decide what you want to do with your data set. • You may want to • Alter its format (unzip, %GC, etc.) • Send it to a supercomputer

  9. Compressed data Uncompressed Lightning 3

  10. From here the file is recursively sent to the directory with the -r command: scp-r ~/Desktop/Primary_Assemblytut_user2@lightning3.its.iastate.edu:/data003/GIFTEACH/BCB660/foldername This command should be thought of as three commands into one: • scp –r ~/Desktop/Primary_Assembly • what you want to move • tut_user2@lightning3.its.iastate.edu • where you want to move it • :/data003/GIFTEACH/BCB660/foldername • where to go once it gets there You’re now ready to log into lightning3

  11. lightning3 address password change directory to folder name containing Primary assembly change directory to Primary_Assembly/ change directory to placed_scaffolds Type this long command This last command takes everything with gz in the name and decompresses each file

  12. Unzipping file at lightning3 • Permission may be denied, if so enter : • This should grant permission to each file chmod –R 777 Primary_Assembly/ re-enter long command

  13. We want to takeallthe individual scaffold files and put them into 1 file.Run the GC program on 1 file instead of 16 files • cat*.fa*>ApisMellifera_4.5.fasta • Here's the breakdown of this command • cat- concatenate, so take all these folders • * - wild card, around key letters • .fa– key letters >- sends command to a file • ApisMellifera_4.5.fasta - this is the name of our file

  14. to convert this to GC content • To convert this to GC content • ./percentGCApisMellifera_4.5fasta • the './percentGC' is a program turning ApisMellifera_4.5fasta into a table format.

  15. Open this file into R • > honeyBeeGC← read.table(“ApisMellifera_4.5gc”) • > ls(honeyBeeGC) • This should read • [1] “V1” “V2” “V3” • If you want a histogram • > hist(honeyBeeGC$V2), breaks=seq(0,100, len=1000))

  16. dfd

  17. use join if have a common field • use cat to glue fast afiles together in order cat file1 file2 file3 > redirectedOutput transliterating httyp://stackoverflow.com/questions

  18. list of species • unzipping gzips • current process to generate our .gc format file • percent GC Abdf.fasta >gctemp • seqlen.awkAbr.fasta > sitemp_tabs

  19. cut out first two fields of gctemp outputs to temp file • cut – f1-2 gctemp_tabs