Thursday Aug 16
Introduction to Genomic Annotation - Joshua Orvis (8:30 -11:00)
https://docs.google.com/presentation/d/1Z7P0qVuVEHQ0qwrVSdhJ_paumhsz7nT8I-fJC7FztAY/edit?usp=sharing
https://secure.join.me/733-561-924
JOIN.ME: https://secure.join.me/bioinfosu
Lunch (12:00-1:00)
Slides (PDF 4.4MB)
- Log into iPlant Atmosphere Home: https://atmo-beta.iplantcollaborative.org/login/
- Launch an instance of the Entangled Genomes virtual machine. We'll return to it later.
Links
- Amazon EC2
- Wikipedia on "Virtual Machine"
- iPlant Atmosphere Video Introduction
- GNOME desktop
- Wikipedia on "VNC"
Introduction: Metagenomic Analysis With MEGAN - Peter Hoyt (1:30-1:50)
- Slides (PDF)
Links
Hands-on: Metagenomic Analysis With MEGAN - Matt Vaughn - (1:50-4:00)
- Slides (PDF 3.4 MB)
- Connect to your 'Entangled Genomes' VM using VNC Viewer
- Load in the Phyllosphere data (from the Desktop folder on your VM desktop)
- Follow along with your presenter to learn how to access various functions in MEGAN
- Now, try to answer the study questions.
Study Questions
Taxonomic Composition
- Extract the sequence reads from the Malpighiales node
- Use NCBI BLAST to see if you can determine what other species was picked up in the metagenomics sample
- Hint: You can use BLAST right from the browser on your iPlant VM
- What is the common name for the most abundant species in Malpighiales?
- Why do you think this species was picked up from a leaf surface sample of soybean?
- Use NCBI BLAST to see if you can determine what other species was picked up in the metagenomics sample
- What other types of eukaryotic organisms appear to be present on the leaf of soybean in this sample?
- Do you see any fungi that might be pathogenic?
- Hint: Internet searches for species names may help
- Do you see any fungi that might be pathogenic?
- Can you use BLAST with reads from the dsDNA virus node to identify what may be trying to infect the plant used for the phyllosphere sample?
- Overall, how MEGAN and MetaPhlan taxonomic profiles do differ?
- Are the results for Bacteria comparable between MEGAN and MetaPhlan?
- Hint: Download a Krona-formatted file (taxonomy.krona.html) based on the MEGAN Bacteria taxonomic profile from the Community Data/osu-entangled-genomes folder in the DE and compare it with the one you or your group generated from MetaPhlan on Wednesday
- Which method appears to be more sensitive for classifying bacterial populations?
- Why do you say this?
- MEGAN+BLAST picks up a wider range of taxonomic categories (Eukarya, etc). How would you propose to extend/update MetaPhlan to make it sensitive to non-Bacteria taxa?
Functional Analysis
- Subselect the Bacteria node from the taxonomic tree
- Launch a SEED analysis and explore the various categories. Given that these bacteria are harvested from the aerial portion of soybean, a nitrogen-fixing species, please consider the following
- How do the majority of bacteria on these soybean leaf appear to move around?
- Do you think these species generally use aerobic or anaerobic respiration?
- If you were to examine the phyllosphere of Maize grown in the same field as these soybeans, do you think you would see a similar enrichment of Nitrogen metabolism genes?
- What kind of stresses do leaf surface bacteria appear to encounter, based on their SEED profile?
- Subselect the Bacteria node from the taxonomic tree
- Launch a KEGG analysis
- Can you find enriched KEGG categories that are consistent with the findings from SEED?
- Which type of analysis (KEGG or SEED) do you find more useful for understanding a metagenomic dataset?
- What is the habitat (at least according to MEGAN) for _Bradyrhizobium japonicus_?
- Are there any anaerobic bacteria present?
- Identify one bacterial genus that is facultatively aerobic.
Microbial Attributes
- Subselect the Bacteria node again
- Launch the Microbial Attributes analysis
- Are the majority of classified species Gram + or Gram -
- What is the habitat of Bradyrhizobium japnonicum?
- How do most species found on soybean leaf move around?
- Can you identify a genus that is facultatively aerobic?
- What pathogenic species has been identified from this metagenomic sample?
- Search the internet and determine whether it is able to infect soybean (Glycine max).
- What does a 'Mesophilic' temperature range mean?
Experimental strategies
- Design your own experimental and bioinformatics strategem for comparing the phyllosphere of a non-nitrogen fixing species to soybean
- What species would you examine?
- Are there other nitrogen fixing species you could examine to shore up some of your results?
- Describe the sequencing strategy you would use that will give you similar results to the soybean phyllosphere sample
- Design a bioinformatics strategy starting with acquisition of sequence and ending with import into MEGAN for this project.
- Enumerate the tools you will need, the sequence you will use them, and explain the purpose of each.
- Are there other phlyllosphere data sets at the NCBI SRA? Could any of them be useful for comparative analysis to the soybean phyllosphere?
Synthesis Period (4:00-5:00)
- In your groups, design a 5 minute presentation for tomorrow to address one section of the Study Questions (Taxonomic Composition, Functional Analysis, Microbial Attributes, or Experimental strategies)
- Try to send your presentation to Dana dana.s.brunson@gmail.com by the end of the day
, multiple selections available,