Clean_fasta_header
Rationale and background
Clean_fasta_header app removes everything after "|" in the fasta header of the fasta file. The special character "|" is not ideal with many of the bioinformatics tools and it is important to remove them in the fasta header. This app will help you remove one of the special character
Prerequisites
A CyVerse account (Register for a CyVerse account at https://user.cyverse.org/).
An up-to-date Java-enabled web browser. (Firefox recommended. If you wish to work with your own large datasets and upload them using iCommands, Chrome is not suitable due to its issues in utilizing 64-bit Java.)
Input:
Either one of the below options should be selected for modifying the fasta header. Custom reference genome or any fasta sequence with "|" can be used here
Cutsom Reference genome
Reference genome from DE
Output Folder name: Name of the output folder (default "output")
Test/sample data
This tutorial uses the test data that is stored in the Data Store at Community Data > iplantcollaborative > example_data > clean_fasta_header
Input:
Reference genomes: Acromyrmex_echinatior
Output Folder name: Use default folder name - "output"
Output
logs
Output
genome.cleaned.fas