SDI R is for inferring gene duplications on a gene tree and for rooting by duplication minimization. For more information see Zmasek and Eddy (2001). A simple algorithm to infer gene duplication and speciation events on a gene tree. Bioinformatics, 17, 821-828.
SDI R has been implemented in Java as part of the
forester libraries.
Source code is available at [forester] at sourceforge.net
java -cp path\to\forester.jar org.forester.application.sdi_r
Options:
-ml to root by minimizing the mapping cost L (and also the sum of duplications)
-md to root by minimizing the sum of duplications
-mh to root by minimizing tree height (can be used together with -ml or -md)
Download a example species tree ("arthropoda_species_tree.xml") and a
example gene tree ("sample_unrooted_gene_tree.xml").
Then, run SDI R with the following command line:
% java -cp path\to\forester.jar org.forester.application.sdi_r -md sample_unrooted_gene_tree.xml arthropoda_species_tree.xml
The important point to keep in mind is that there must be at least one sub-element of the 'taxonomy' element which allows to
match the sequences in the gene tree with a taxonomy in the species tree. In this example this sub-element of the 'taxonomy' element
is 'code'.
Christian M Zmasek
phylosoft -at- gmail -dot- com
Last updated 2010.03.01
www.phyloxml.org | Archaeopteryx | www.phylosoft.org