Sonya Lowry/Working group
Data model and scripts
The TR database stores gene tree reconciliations as a mapping of the nodes of the gene tree (guest tree) onto the nodes and edges of a species tree (host tree). The topology of the gene tree and species tree involved in the reconciliation are stored separate from the mapping itself. The attributes of the reconciliation mapping as well as the components of the gene trees and species trees can be labeled using a controlled vocabulary. The genes contributing to the gene trees are stored as 'members' within the database.
The TR schema extends the preexisting Ensemble-Compara schema to include species trees, the mappings of reconciliations of gene trees onto these species trees, and a controlled vocabulary module to support the use of ontologies to tag attribute values within the database.The slides describing the initial thoughts on the development of the schema are available and would be helpful to view to see how the schema was designed.
Populating the Database
The tools to populate the database are written as Perl command line programs, and their use to populate the database are described in the populating the database section of the wiki.
Will be extracted as PerlDocs
Viewer-Editor Integration Framework