Megahit-1.0.6
The DE Quick Start tutorial provides an introduction to basic DE functionality and navigation.
Please work through the tutorial and add your comments on the bottom of this page, or send comments per email to upendra@cyverse.org. Thank you.
Rationale and background:
MEGAHIT is a single node assembler for large and complex metagenomics NGS reads, such as soil. It makes use of succinct de Bruijn graph (SdBG) to achieve low memory assembly. MEGAHIT can optionally utilize a CUDA-enabled GPU to accelerate its SdBG contstruction. The GPU-accelerated version of MEGAHIT has been tested on NVIDIA GTX680 (4G memory) and Tesla K40c (12G memory) with CUDA 5.5, 6.0 and 6.5. MEGAHIT v1.0 or greater also supports IBM Power PC and has been tested on IBM POWER8
Prerequisites
A CyVerse account. (Register for an CyVerse account here - user.cyverse.org.)
Input: Select one of the following. Accepted file formats include fasta (.fasta, .fa, .fna) or fastq (.fastq, .fq) or gzip files (with .gz extensions) and bzip2 files (with .bz2 extensions)
Single End Reads
Interleaved Paired End Reads
Paired End Reads (Read 1) - If you selected this option you should select the below option as well
Paired End Reads (Read 2)
Output options
Output folder Name: Name of the output folder
Minimum contig length: Default is 200bp
Advanced Parameters (Optional)
Presets parameters: The available options include meta, meta-sensitive, meta-large, bulk and single-cell
Test/sample data:
Use the following inputs/outputs and parameters for testing Megahit-1.0.6. The test data is available here: /iplant/home/shared/iplantcollaborative/example_data/megahit
Input
Paired End Reads (Read 1) - SRR341725_1.fastq.gz
Paired End Reads (Read 2) - SRR341725_2.fastq.gz
Output
Output File Name: megahit_out
intermediate_contigs folder
final.contigs.fa
opts.txt
Advanced Parameters (Optional): None
More information about Megahit-0.6.1 can be found at Megahit