The genealogical sorting index (gsi) quantifies the degree of exclusive ancestry of labeled groups on a rooted genealogy (Cummings, Neel and Shaw 2008). The statistic is useful for testing the differentiation of species and populations or subpopulations. The probability of an observed genealogical sorting index value is estimated by permutation.
The algorithm for calculating the genealogical sorting index was written in R, a statistical programming environment. We provide two ways to use the program. A parallel computing resource for running the analysis is available via a web interface (http://www.genealogicalsorting.org/) for users interested in quickly calculating the genealogical sorting index and performing permutation tests on phylogenetic trees. It will eventually be available as an open source package on the Comprehensive R Archive Network (http://cran.r-project.org/); in the meantime, the R package is available for download.
Two input files are needed –
- rooted tree file in NEXUS or Newick format containing only representatives for groups you wish to analyze (e.g., no outgroup)
- class (group) assignment file in plain text
Sample files
- simple_trees
- simple_assignments
Additional sample files are available. Alternatively, the assignment file can be created using the web interface.
To begin analysis, go to the web site: http://www.genealogicalsorting.org/.