Home > Phylogenetics > Modeltest
Updated 29th March 2010
This guide focuses on using Modeltest to find the best parameters to construct a maximum likelihood phylogenetic tree. This is not a substitute for a bioinformatics education, make sure you know what you are doing.
MODELTEST is a computer program by David Posada and Keith A Crondall. One use of this program is to calculate the best model for DNA evolution using Maximum Likelihood (ML). For more information read their publication (pdf) or visit the Modeltest website. When I started to use this program I found there was little detailed information about to help newbies use it, so I wrote this tutorial to remind me of the steps, and to help some graduate students in my lab.
The latest version of Modeltest is 3.7 released on July 29, 2005. However David Posada now recomends you use jModeltest
To use the Modeltest program you need the application itself (available from the authors website), and the phlyogenetics package PAUP* (this is not free unfortunately, hopefully your university or company has a license) [note jModeltest does not require PAUP]. I will be referring to the windows command line version in this document, but these files will work on all versions. Finally you will need a fast computer, I used a 2.4 GHz Pentium 4 laptop, on this it took about an hour to work through 50 taxa of 1300 bp. You also need two files: modelblockPAUPb10.txt and ML-search.txt which you can download from my website. modelblockPAUPb10.txt was written by David Posada and is a copy of the file found in the paupblock folder that is included in the download. ML-search was written by me, modify this for you own use.
I also assume that at this point you have a sequence alignment in the NEXUS format (this is critical). The easiest way to do this is use the "save as" command in ClustalX Open up this file in a text editor and replace the "-" (gap) symbol with the missing data symbol "?" where appropriate. It is getting out of the scope of this document to explain this, but typically this is at the ends of the alignment when you don't have full sequence for all taxa. One handy hint don't use the "-" symbol in taxa names, use "_" instead.
I use my own file-naming scheme to keep track of things. I suggest replacing the word "test" below with the name of your gene, i.e.: "p53.ML.search.nex"
Finally I highly recommend you read the documentation file: modeltest3.6.pdf found in the doc folder of the download
Bootstrapping a ML tree is possible, but very very time consuming. If you do wish to do this add: "bootstrap search=Heuristic nreps=100;" near the end of the file. (search=Heuristic is the default, other possibilites are: BandB, FastStep, NJ, and UPGMA — but don't use those).
Home | About Me | Site Map | Copyright Policy | Contact Me