Panel | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||
|
Panel | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||
For an introduction to using the DE, see Using the Discovery Environment. Please work through the tutorial and add your comments on the bottom of this page, or email comments to support@cyverse.org. Thank you. |
Rationale and background
CleanModify_fastaGFF_header app removes everything after "|" in the fasta header of the fasta file. The special character "|" is not ideal with many of the bioinformatics tools and it is important to remove them in the fasta header. This app will help you remove one of the special character Coordinates app will add bp to either side of the start and stop coordinates of the gff file. The user can select the delimiter (eg, gene, exon etc) as well as the number of base pairs to add to the gff file
Prerequisites
A CyVerse account (Register for a CyVerse account at https://user.cyverse.org/).
An up-to-date Java-enabled web browser. (Firefox recommended. If you wish to work with your own large datasets and upload them using iCommands, Chrome is not suitable due to its issues in utilizing 64-bit Java.)
- Inputs:
- Reference genome from DE
- Delimiter: The feature of gff file that you want to add the bp to. Ex: gene)
- Base pairs: The number of base pairs to be added to either side of start and stop coordinates in the gff file
- Output File name: Name of the output folder (default "output")file
Test/sample data
This tutorial uses the test data that is stored in the Data Store at Community Data > iplantcollaborative > example_data > clean_fasta_header Evolinc-I
- Input:
- Reference genomes: Acromyrmex_echinatior
- Input gff file: TAIR10_genes.gff
- Delimiter: gene
- Base pairs: 500
- Output File name: output.gff
Output
- logs
- Output
- genomeoutput.cleaned.fasgff