RAxML-7.3.0
RAxML-7.3.0
RAxML implements a rapid maximum likelihood-based inference of phylogenetic trees from nucleotide or amino acid multiple sequence alignments. The phylip_interleaved.phy output of the Muscle-3.8.31 App in the Discovery Environment is suitable for this analysis.
Quick Start
- To use RAxML, upload your data in relaxed PHYLIP format (interleaved, no name length restrictions).
- Resources: documentation
Test Data
Use the protein alignment elong_alpha.phy as test data. It can be accessed by navigating to :
'Community Data'->iplantcollaborative>example_data->raxml-7.3.0
Parameters Used in App
When the app is run in the Discovery Environment, use the following parameters with the above input file(s) to get the output provided in the next section below.
- Run the test analysis with default parameters
- Setting the *Multiple Sequence Alignment* and *Sequence Type* Parameters is mandatory.
DNA-Specific Options
There is only one option for dna sequences, the 'Substitution Model'. If this option is not set, the DNACAT option will be used by default. The RAxML descriotion of the available options is shown below:
GTRCAT
GTR + Optimization of substitution rates + Optimization of site-specific evolutionary rates which are categorized into numberOfCategories distinct rate categories for greater computational efficiency. Final tree might be evaluated under GTRGAMMA, depending on the tree search option
GTRCATI
GTR + Optimization of substitution rates + Optimization of site-specific evolutionary rates which are categorized into numberOfCategories distinct rate categories for greater computational efficiency. Final tree might be evaluated under GTRGAMMAI, depending on the tree search option
GTRGAMMA
GTR + Optimization of substitution rates + GAMMA model of rate heterogeneity (alpha parameter will be estimated)
GTRGAMMAI
Same as GTRGAMMA, but with estimate of proportion of invariable sites
Protein-Specific Options
Components of substitution model can be selected. The amino acid substitution model has three parts, which are joined automatically into the model name used by RAxML:
ModelmatrixNameF
The substitutionmatrix can be either BLOSSUM62 or DAYHOFF. Emprirical base Frequences F is a shown as a checkbox (checked by default). The F option is either present or absent, depending on whether the box is checked.
By default, the PROTCAT substitution model, the BLOSSUM62 substitution matrix, and empirical amino acid frequencies (F) will be used to form the model: PROTCATBLOSSUM62F
PROTCATmatrixName(F)
specified AA matrix + Optimization of substitution rates + Optimization of site-specific evolutionary rates which are categorized into numberOfCategories distinct rate categories for greater computational efficiency. Final tree might be evaluated automatically under PROTGAMMAmatrixNamef, depending on the tree search option"PROTCATImatrixNameF" : specified AA matrix + Optimization of substitution rates + Optimization of site-specific evolutionary rates which are categorized into numberOfCategories distinct rate categories for greater computational efficiency. Final tree might be evaluated automatically under PROTGAMMAImatrixNamef, depending on the tree search option
PROTGAMMAmatrixName(F)
specified AA matrix + Optimization of substitution rates + GAMMA model of rate heterogeneity (alpha parameter will be estimated)
PROTGAMMAImatrixName(F)
Same as PROTGAMMAmatrixName(F), but with estimate of proportion of invariable sites
Output File(s)
- Expect the following output files:
- output.txt – the text output of the analysis
- treefile.newick – the tree file in Newick format
- treefile.svg – a SVG tree image generated by TreeVector
- treefile.png – a PNG version of the tree image
- TreeVector.html – a downloadable HTML form to render the tree in TreeVector using different options
- The raw RAxML output files are saved in the RAXML_out folder