Panel | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||
|
Panel | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||
For an introduction to using the DE, see Using the Discovery Environment. Please work through the tutorial and add your comments on the bottom of this page, or email comments to upendra@cyverse support@cyverse.org. Thank you. |
Rationale and background
...
Clean_fasta_header app removes everything after "|" in the fasta header of the fasta file. The special character "|" is not ideal with many of the bioinformatics tools and it is important to remove them in the fasta header. This app will help you remove one of the special character
Prerequisites
A CyVerse account (Register for a CyVerse account at https://user.cyverse.org/).
An up-to-date Java-enabled web browser. (Firefox recommended. If you wish to work with your own large datasets and upload them using iCommands, Chrome is not suitable due to its issues in utilizing 64-bit Java.)
- Input:
- Reference genome from DE
- Output Folder name: Name of the output folder (default "output")
Test/sample data
This tutorial uses the test data that is stored in the Data Store at Community Data > iplantcollaborative > example_data > bwa clean_fasta_header
- Use 10K_SRR192294_1.fastq for Left_Read file and 10K_SRR192294_2.fastq (2.4M) for Right_Read file
- Select a reference genome from the list - Brachypodium distachyon (line Bd21) v1
- All other parameters remain in their default state
Output
Expect a SAM file named after the input files as output. For the test case, the output file is 10K_SRR192294_1.fastq-10K_SRR192294_2.fastq.sam (4.8M)
Tool Source for App
...
- Input:
- Reference genomes: Acromyrmex_echinatior
- Output Folder name: Use default folder name - "output"
Output
- logs
- Output
- genome.cleaned.fas