rust-mdbg 0.1.0 multik
Quick Start
To use rust-mdbg 0.1.0 multik, import your data in FASTA/FASTQ format
- Resources: https://github.com/ekimb/rust-mdbg
rust-mdbg is a modular assembler. It consists of three components:
- rust-mdbg, to perform assembly in minimizer-space
- gfatools (external component), to perform graph simplifications
- to_basespace, to convert a minimizer-space assembly to base-space
(For convenience, components 2 and 3 are wrapped into a script called magic_simplify.)
For better contiguity, try the provided multi-k
assembly script. It performs assembly iteratively, starting with k
= 10, up to an automatically-determined largest k
. This comes at the expense of ~7x longer running time.
There are 4 rust-mdbg apps in the DE:
- rust-mdbg 0.1.0
- rust-mdbg 0.1.0 magic simplify
- rust-mdbg 0.1.0 pipeline (runs both apps 1 and 2)
- rust-mdbg 0.1.0 multik
Test Data
Test data for this app appears directly in the Discovery Environment in the Data window under Community Data -> iplantcollaborative -> example_data -> rust-mdbg
Input File(s)
Use reads-0.00.fa.gz from the directory above as test input.
Parameters Used in App
When the app is run in the Discovery Environment, use the following parameters with the above input file(s) to get the output provided in the next section below.
- Use these parameters within the DE app interface:
- Select a prefix for the output files
- All the other parameters may be left as default
Output File(s)
Expect the following as output.
- example-final.gfa
- example-final.msimpl.fa
- example-final.msimpl.gfa
- example-k10.140420614465280.sequences
- example-k10.140420616578816.sequences
- example-k10.140420618692352.sequences
- example-k10.140420620805888.sequences
- example-k10.140420622907136.sequences
- example-k10.140420625032960.sequences
- example-k10.140420627146496.sequences
- example-k10.140420629247744.sequences
- reads-0.00.fa.gz