rust-mdbg 0.1.0 multik
Quick Start
To use rust-mdbg 0.1.0 multik, import your data in FASTA or FASTQ format.
Resources: https://github.com/ekimb/rust-mdbg
rust-mdbg is a modular assembler. It consists of three components:
1. rust-mdbg, to perform assembly in minimizer-space
2. gfatools (external component), to perform graph simplifications
3. to_basespace, to convert a minimizer-space assembly to base-space
(For convenience, components 2 and 3 are wrapped into a script called magic_simplify.)
For better contiguity, try the provided multi-k assembly script. It performs assembly iteratively, starting with k = 10 and continuing up to an automatically determined largest k. This comes at the expense of a roughly 7x longer running time.
There are four rust-mdbg apps in the Discovery Environment (DE):
1. rust-mdbg 0.1.0
2. rust-mdbg 0.1.0 magic simplify
3. rust-mdbg 0.1.0 pipeline (runs both apps 1 and 2)
4. rust-mdbg 0.1.0 multik
Test Data
Test data for this app is available in the Discovery Environment, in the Data window under Community Data -> iplantcollaborative -> example_data -> rust-mdbg.
Input File(s)
Use reads-0.00.fa.gz from the directory above as test input.
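Before uploading your own data, it can help to confirm that a file really is FASTA or FASTQ, since the app expects one of those formats. The following is a minimal Python sketch, not part of rust-mdbg: the function name sniff_read_format is hypothetical, and the check relies only on the general convention that FASTA records begin with ">" and FASTQ records begin with "@". It handles both plain and gzip-compressed files.

```python
import gzip


def sniff_read_format(path):
    """Return 'fasta', 'fastq', or 'unknown' from the first non-empty line.

    Hypothetical helper for illustration; it is not part of rust-mdbg.
    """
    # Gzip files start with the two magic bytes 0x1f 0x8b.
    with open(path, "rb") as handle:
        is_gzip = handle.read(2) == b"\x1f\x8b"

    opener = gzip.open if is_gzip else open
    with opener(path, "rt") as handle:
        for line in handle:
            if not line.strip():
                continue  # skip leading blank lines
            if line.startswith(">"):
                return "fasta"
            if line.startswith("@"):
                return "fastq"
            return "unknown"
    return "unknown"  # empty file
```

For example, calling sniff_read_format("reads-0.00.fa.gz") on a local copy of the test input reports which of the two formats the file appears to use. This only inspects the first record, so it is a quick sanity check and not a full validation.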
Parameters Used in App
When running the app in the Discovery Environment, use the following parameters in the DE app interface with the input file above to reproduce the output listed in the next section:
Select a prefix for the output files (the example output below uses the prefix "example")
Leave all other parameters at their defaults
Output File(s)
Expect the following files as output:
example-final.gfa
example-final.msimpl.fa
example-final.msimpl.gfa
example-k10.140420614465280.sequences
example-k10.140420616578816.sequences
example-k10.140420618692352.sequences
example-k10.140420620805888.sequences
example-k10.140420622907136.sequences
example-k10.140420625032960.sequences
example-k10.140420627146496.sequences
example-k10.140420629247744.sequences
reads-0.00.fa.gz
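The listing above contains several .sequences files for the same k value. As a rough aid for checking a finished run, the Python sketch below groups such files by k. It assumes the naming pattern prefix-kK.NUMBER.sequences, which is inferred only from the file names listed above; the meaning of the numeric part is not documented here, and group_sequence_shards is a hypothetical helper name.

```python
import re
from collections import defaultdict

# Assumed pattern, inferred from the listing: <prefix>-k<k>.<number>.sequences
SHARD_PATTERN = re.compile(
    r"^(?P<prefix>.+)-k(?P<k>\d+)\.(?P<shard>\d+)\.sequences$"
)


def group_sequence_shards(filenames):
    """Map each k value to the sorted list of matching .sequences files."""
    shards_by_k = defaultdict(list)
    for name in filenames:
        match = SHARD_PATTERN.match(name)
        if match:  # other outputs (.gfa, .fa, input reads) are ignored
            shards_by_k[int(match.group("k"))].append(name)
    return {k: sorted(names) for k, names in sorted(shards_by_k.items())}


# A subset of the file names from the listing above.
outputs = [
    "example-final.gfa",
    "example-final.msimpl.fa",
    "example-k10.140420616578816.sequences",
    "example-k10.140420614465280.sequences",
    "reads-0.00.fa.gz",
]
grouped = group_sequence_shards(outputs)
print({k: len(names) for k, names in grouped.items()})  # prints {10: 2}
```

Applied to the full listing, the same function would place all eight example-k10 files under k = 10.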