Important Dates: Last day for Automatic Withdrawal: Wednesday, October 23. Project idea due on Wednesday, November 6. Prelimary project report due on Wednesday, December 4. Final Project due by 12:30 p.m., Wednesday December 18.
Religious Holiday Policy: If you will miss class, especially a class during which there will be an exam or other required work, for a religious holiday, you must inform me during the first two weeks of the semester.
Cheating: It is not possible to cheat on homeworks. However, all final project work must be your own and no cheating will be tolerated. Also, you cannot combine this project with that of another class (past, present, or future) unless you get the permission of both instructors. All suspected cases of cheating will be handled in accordance with University procedures found at http://dsa.indiana.edu/ethics.html . If you do cheat, you will receive an F* in the course - the star informs the registar that the F is due to cheating. Additional sanctions may be imposed by the Dean.
Homework: The homework is designed to help you learn how to use relevant software and to help you get a feel for the issues that come up when you conduct a phylogenetic analysis. However, by keeping the homework reasonable by limiting the number of species and by considering only one gene, there is no way that the homework reflects a deep phylogenetic analysis. You may work with each other on the homework assignments.
Projects: Half your grade will be based on a final project. I am imagining that your project will be of one of the following types:
(1) You can use original sequence data (yours or someone else's) to conduct an original, careful phylogenetic analysis. You should describe what you are looking for in the data and why it is important.
(2) You can discuss and critique two or more different approaches in the literature to an issue in phylogenetics. The topic can be biological (such as the placement of a group on a phylogeny) or theoretical (such as two different phylogenetic comparative methods).
If your imagination takes you elsewhere, just run your idea by me to get my approval.
You will turn in your project idea and a rough draft before you turn in your final project. This is designed to keep you on schedule and to give you feedback so that you can address any problems in a timely manner.
Project idea: Due Wednesday, November 6. Give a brief description of what you intend to do for your final project. If you have your own data to work with, provide a copy of your data. If you have data from another source provide the data or the reference. If you are comparing two different approaches to an issue in phylogenetics, briefly describe the topic and provide copies of the references you are using.
Preliminary report: Due Wednesday, December 4. Provide a rough draft of your final project report.
Grades: Grades are calculated as 20% participation (attendance and questions), 30% homeworks, and 50% the final project.
Database Links: NCBI GenBank UniProt (contains SwissProt) Ensembl European Nucleotide Archive DNA Databank of Japan
Software Links: Clustal Omega online from EMBL-EBI Coffee programs DNA sequence manipulation Tree View jModeltest PHYLIP MEGA Tree Puzzle MrBayes PAUP R (statistical software) PhyML Online SplitsTree4 Others (Felsenstein's list)
Links to other Phylogenetics Learning Resources: David Baum's Phylogenetics Course NCBI Phylogenetics Primer Software Learning Activities Welcome to a Little Book of R for Bioinformatics!
|Monday, August 26||Basic concepts of molecular evolution||Chapter 1|
|Wednesday, August 28||Getting data||Chapter 2|
|Friday, August 30||Aligning the data||Chapter 3|
|Homework 1 Due: Friday, September 6|
|Monday, September 2||Labor Day Holiday||--|
|Wednesday, September 4||Aligning the data||
BaliBASE Kinase alignment
|Friday, September 6||Genetic distances and Jukes Cantor||Chapter 4|
|Homework 2 Due: Friday, September 13|
|Monday, September 9||More complicated models of DNA evolution||Chapter 4|
|Wednesday, September 11||Even more complicated models||
Primate mitochondrial DNA [Hayasaka, K., T. Gojobori, and S. Horai. 1988. Molecular phylogeny and evolution of primate mitochondrial DNA. Mol. Biol. Evol. 5:626-644.]
R code for getting pairwise distances from 'ape'
|Friday, September 13||Log det distances||
Recovering Evolutionary Trees under a More Realistic Model of Sequence Evolution by Peter Lockhart, Michael Steel, Michael Hendy, and David Penny in Molecular Biology and Evolution (1994) 11:605-612.
Evolutionary Distance Estimation Under Heterogeneous Substitution Pattern Among Lineages by Koichiro Tamura and Sudhir Kumar in Molecular Biology and Evolution (2002) 19:1727-1736.
Algae data from the Lockhart et al. paper
Some R code for roughly checking the equidistribution of base frequencies
|Homework 3 Due: Friday, September 20|
|Monday, September 16||Model selection procedures||
Algae data from the Lockhart et al. paper in MEGA format
Some R code for roughly checking the equidistribution of base frequencies
|Wednesday, September 18||Distance based methods using clustering||Chapter 5|
|Friday, September 20||Minimum evolution and neighbor joining||Chapter 5|
|Homework 4 Due: Friday, September 27|
|Monday, September 23||Bootstrapping - nonphylogenetic examples||
Introduction to the Bootstrap Lecture Notes by Michael Eichler
Bootstrap Methods: Another Look at the Jackknife by Bradley Efron in The Annals of Statistics (1979) 7:1-26.
The bootstrap: a tutorial by Ron Wehrens, Hein Putter, and Lutgarde Buydens in Chemometrics and Intelligent Laboratory Systems (2000) 54:35-52.
Some simple R code demonstrating the bootstrap for the silly example given in class.
|Wednesday, September 25||Bootstrapping in phylogenetics||
Chapters 5 and 12
R code for bootstrapping and for Felsenstein's and Kishino's example.
|Friday, September 27||Correcting the bootstrap||
Bootstrap support is first-order correct (Efron, Halloran, and Holmes)
Bootstrap support is not first-order correct (Susko)
How to make phylogenetic bootstrap values first order correct (Susko)
The additional problem of post-hoc analyses.
|Homework 5 Due: Friday, October 4|
|Monday, September 30||Maximum likelihood theory||Chapter 6|
|Wednesday, October 2||Maximum likelihood in phylogenetics||
R code for phylogenetic maximum likelihood analyses
|Friday, October 4||Maximum likelihood in phylogenetics||Chapter 6|
|Homework 6 Due: Friday, October 11|
|Monday, October 7||Searching tree space||Chapter 6|
|Wednesday, October 9||Bayesian theory||
An Introduction to Bayesian Statistics Without Using Equations by Eguchi
Introduction to Bayesian Statistics (with some equations) by Brynjarsdottir and Li
|Friday, October 11||Bayesian phylogenetics||
Mr. Bayes User Manual
Primate Data Nexus file plus code
|Monday, October 14||Using Mr. Bayes||Chapter 7|
|Wednesday, October 16||Parsimony theory||Chapter 8|
|Friday, October 18||Fall Break||--|
|Homework 7 Due: Friday, October 25|
|Monday, October 21||Parsimony and phylogenetics||Chapter 8|
|Wednesday, October 23||Branch and bound||Chapter 8|
|Friday, October 25||Parsimony example||Chapter 8|
|Homework 8 Due: Friday, November 1|
|Monday, October 28||Consensus methods and total evidence||
Cladistics: Consensus trees by Peter Forey, posted by Thomas R. Holtz, from The Palaeontology Newsletter (2007) 64:28-34.
War and Peace in Phylogenetics: A Rejoinder on Total Evidence and Consensus by Claudine Levasseur and Francois-Joseph Lapointe in Systematic Biology (2001) 50:881-891.
R Code for creating a consensus phylogeny
A collection of Bayesian Posterior Primate Trees
| Wednesday, October 30
||Split networks||Chapter 21|
|Friday, November 1||Phylogenies and the coalescent process||
Gene Tree Distributions Under the Coalescent Process by James Degnan and Laura Salter in Evolution (2005) 59:24-37
|Homework 9 Due: Friday, November 8 (Skipped due to accidental deletion)|
|Monday, November 4||Phylogenetic networks||NETGEN: generating phylogenetic networks with diploid hybrids by M. M. Morin and B. M. E. Moret in Bioinformatics (2006) 22:1921-1923.|
| Wednesday, November 6
Project idea due
|Testing the molecular clock||Chapter 11|
|Friday, November 8||Testing the molecular clock||Chapter 11|
|Homework 10 Due: Friday, November 15|
|Monday, November 11||Synonomous and nonsynonymous substitutions||
Chapter 13 and
Handout on dN/dS ratios by Rausher
Li's method Unbiased estimation of the rates of synonymous and nonsynonymous substitution. Journal of Molecular Evolution (1993) 36:96-99.
R code for reverse.align and kaks.
Parul's DNA data set for input into R
Parul's protein alignment for input into R
|Wednesday, November 13||McDonald-Kreitman test||
Standard and generalized McDonald–Kreitman test: a website to detect selection by comparing different classes of DNA sites by
Raquel Egea, Sonia Casillas, and Antonio Barbadilla in Nucleic Acids Research (2008) 36(suppl. 2): W157-W162.
McDonald-Kreitman Test software
Partial gnd locus in strains of E. coli.
Frequent adaptation and the McDonald-Kreitman Test by Philipp Messer and Dmitri Petrov in PNAS (2013) 110:8615-8620.
|Friday, November 15||Other methods for detecting selection||
Chapter 14 and
Not So Different After All: A Comparison of Methods for Detecting Amino Acid Sites Under Selection by Sergei L. Kosakovsky Pond and Simon D. W. Frost in Molecular Biology and Evolution (2005) 22:1208-1222.
Inferring Selection and Mutation from DNA Sequences: The McDonald-Kreitman Test Revisited by Stanely Sawyer in Non-Neutral Evolution, Brian Golding, Editor, Springer (1994) 77-87.
|Homework 11 Due: Friday, November 22|
|Monday, November 18||Host-parasite coevolution tests||
Disparate Rates of Molecular Evolution in Cospeciating Hosts and Parasites
by Mark Hafner, Philip Sudman, Francis Villablanca, Theresa Spradling, James Demastes, and Steven Nadler in Science (1994) 265:1087-1090
Statistical Tests of Host-Parasite Cospeciation by John P. Huelsenbeck; Bruce Rannala; Ziheng Yang in Evolution (1997) 51:410-419.
A Statistical Test for Host-Parasite Coevolution by Pierre Legendre, Yves Desdevises, and Eric Bazin in Systematic Biology (2002) 51:217-234
|Wednesday, November 20||Distances between trees||Geometry of the Space of Phylogenetics Trees by Louis Billera, Susan Holmes, and Karen Vogtmann in Advances in Applied Mathematics (2001) 27:733-767|
|Friday, November 22||Experimental design issues: taxa and data||Experimental design criteria in phylogenetics: where to add taxa by K. Geuten, T. Massingham, P. Darius, E. Smets, and N. Goldman in Systematic Biology (2007) 56:609-22.|
|Monday, December 2||Phylogenetic Comparative Methods||
Phylogenies and the Comparative Method by
Joseph Felsenstein in The American Naturalist (1985) 125:1-15.
Salmon data from Crespi and Teo (2002)
Salmon phylogeny approximation from Crespi and Teo (2002)
Sample R code for conducting Independent Contrasts
| Wednesday, December 4
Rough draft of final project due
|Phylogenetic Comparative Methods||
Conducting Phylogenetic Comparative Studies When the Phylogeny is not Known by Emilia Martins in Evolution (1996) 50:12-22.
Sample R code for including phylogenetic uncertainty
|Friday, December 6||Phylogenetic Comparative Methods||
Phylogenetic Analysis and Comparative Data: A Test and Review of Evidence by R. P. Freckleton, P. H. Harvey, and M. Pagel in The American Naturalist (2002) 160:712-726.
The Phylogenetic Mixed Model by Elizabeth Housworth, Emilia Martins, and Michael Lynch in The American Naturalist (2004) 163:84-96.
|Monday, December 9||Phylogenetic comparative methods||--|
|Wednesday, December 11||Phylogenetic meta-analysis/Evaluations||
General quantitative genetic methods for comparative biology:
phylogenies, taxonomies and multi-trait models for continuous and
categorical characters by J. D. Hadfield and S. Nakagawa in Journal of Evolutionary Biology (2010) 23:494-508.
Sample R code
|Friday, December 13||Help with Projects Day||--|
|Final Project due by 12:30 p.m., Wednesday December 18. Final Project Grading Rubric|