QERM 598: Computational Methods in Quantitative Ecology

From QERM Wiki
(Difference between revisions)
Jump to: navigation, search
(Week by Week: Homeworks, Labs, Data and Notes for QERM 598)
m
 
(6 intermediate revisions by one user not shown)
Line 1: Line 1:
'''The mainpage for this course remains at its original site at http://qerm.pbwiki.com.  The materials are slowly being moved onto this page, but this is at the moment a low priority for the instructor.  Volunteers from the world wide community of CMQE lovers are welcome to contribute to this project!'''
+
[[Image:SuperMosaic.png|right|thumb|300px|Mosaic plot of Steller sea lion birth probabilities]]
  
 +
* This course is designed to introduce students to computational methods in '''Ecological Modeling'''.  Commonly used techniques in '''Applied Statistics''' will be covered, as well as some deterministic and stochastic models encountered in '''Mathematical Biology'''.  While we cover the mechanics of doing various statistical analyses, the goal of the course is to develop the ability to use computational tools creatively to help visualize and grasp statistical concepts and modeled processes.  There will be lots of emphasis on '''simulation''' and '''graphical visualization''', since these are such an important tools in practice but scantily touched upon in other courses. 
 +
 +
* Our primary tool will be [http://cran.r-project.org/ '''R''']. 
 +
 +
* Each week will consist of a lecture, an R lab and an assignment.  One of the practical goals is to create a body of labs, assignments and datasets that can be used as a basis for a continuation of this course in years to come.  All of these materials will be stored on this website.
 +
 +
* There will be a final project involving some combination of analysis and presentation using R. These will be presented on the last day of class, aka: [http://qerm.pbwiki.com/FinalProjects Annual QERM Miniconference on Computational Methods in Quantitative Ecology].
 +
 +
 +
=== Practical Details===
  
 
* '''Location:''' [http://www.washington.edu/home/maps/northcentral.html?OUG Odegaard Undergraduate Library] Computer Lab 102  
 
* '''Location:''' [http://www.washington.edu/home/maps/northcentral.html?OUG Odegaard Undergraduate Library] Computer Lab 102  
Line 9: Line 19:
 
* '''Prerequisites:''' Statistical Inference (STAT 512), Mathematical biology (AMATH 420)
 
* '''Prerequisites:''' Statistical Inference (STAT 512), Mathematical biology (AMATH 420)
  
 
=== The Course ===
 
[[Image:SuperMosaic.png|right|thumb|300px|Mosaic plot of Steller sea lion birth probabilities]]
 
 
* This course is designed to introduce students to computational methods in '''Ecological Modeling'''.  Commonly used techniques in '''Applied Statistics''' will be covered, as well as some deterministic and stochastic models encountered in '''Mathematical Biology'''.  While we cover the mechanics of doing various statistical analyses, the goal of the course is to develop the ability to use computational tools creatively to help visualize and grasp statistical concepts and modeled processes.  There will be lots of emphasis on '''simulation''' and '''graphical visualization''', since these are such an important tools in practice but scantily touched upon in other courses. 
 
 
* Our primary tool will be [http://cran.r-project.org/ '''R''']. 
 
 
* Each week will consist of a lecture, an R lab and an assignment.  One of the practical goals is to create a body of labs, assignments and datasets that can be used as a basis for a continuation of this course in years to come.  All of these materials will be stored on this website on [HomeWorksAndLabs this page].
 
 
* There will be a final project involving some combination of analysis and presentation using R. These will be presented on the last day of class, aka: [http://qerm.pbwiki.com/FinalProjects Annual QERM Miniconference on Computational Methods in Quantitative Ecology].
 
  
 
=== Week by Week: Homeworks, Labs, Data and Notes for QERM 598 ===
 
=== Week by Week: Homeworks, Labs, Data and Notes for QERM 598 ===
 
[[Image:boxplot.jpg|right|thumb|300px|Box-plots]]
 
[[Image:boxplot.jpg|right|thumb|300px|Box-plots]]
 +
  
 
===== Week 0: Basic Introduction to R =====
 
===== Week 0: Basic Introduction to R =====
Line 33: Line 33:
  
 
* {{Rcode | LabHomework0.txt | Lab/Homework 0}} (due Wednesday, January 9).
 
* {{Rcode | LabHomework0.txt | Lab/Homework 0}} (due Wednesday, January 9).
 +
  
 
===== Week 1: Distributions and the Central Limit Theorem =====
 
===== Week 1: Distributions and the Central Limit Theorem =====
Line 46: Line 47:
 
* {{Rcode| Week02_TwoSampleComparison.r | R Lab on comparing two samples}}   
 
* {{Rcode| Week02_TwoSampleComparison.r | R Lab on comparing two samples}}   
 
* {{Pdf|QERM598_Homework01.pdf|Homework 1}}
 
* {{Pdf|QERM598_Homework01.pdf|Homework 1}}
You will need to download the following data files to perform the lab and the homeworks.
+
You will need to download the following data files to perform the lab and the homeworks:
* Single column of weight data:  {{Data|Weight.txt|Ant Weight data}}  
+
** {{Data|Weight.txt|Ant Weight data}} (Single column of weight data)
* Comma-delimited data file:  {{Data| ants.csv | ants.csv}}
+
** {{Data| ants.csv | ants.csv}} (Comma-delimited data file)
*  Space-delimited data file{{Data| ants.dat | ants.dat}}
+
** {{Data| ants.dat | ants.dat}} (Space-delimited data file)
You will also need to read Mike Keim's {{Pdf | NoteOnNonparametricTests.pdf | notes on non-parametric tests}}.
+
You will also need to read Mike Keim's {{Pdf | NotesOnNonparametricTests.pdf | notes on non-parametric tests}}.
 +
 
  
 
===== Week 3: Introduction to Analysis of Variance =====
 
===== Week 3: Introduction to Analysis of Variance =====
Line 59: Line 61:
 
* {{Data | QERM598_SealionBirthData.dat | Sea Lion data for homework}}
 
* {{Data | QERM598_SealionBirthData.dat | Sea Lion data for homework}}
 
* {{Pdf | QERM598_MathFacts.pdf | Math Facts for ANOVA}}
 
* {{Pdf | QERM598_MathFacts.pdf | Math Facts for ANOVA}}
 +
  
 
===== Week 4: Linear Regression =====
 
===== Week 4: Linear Regression =====
Line 74: Line 77:
  
 
* {{Pdf | QERM598_Chi2.pdf | Lecture on contingency tables and Chi-squared tests}}  
 
* {{Pdf | QERM598_Chi2.pdf | Lecture on contingency tables and Chi-squared tests}}  
* {{R | | QERM598_Week5Chi2lab.r | In-class R lab}}
+
* {{Rcode| QERM598_Week5Chi2lab.r | In-class R lab}}
 
* Data files for lecture/lab:  
 
* Data files for lecture/lab:  
 
** {{Data | QERM598_Clams.csv | Clams.csv}}  
 
** {{Data | QERM598_Clams.csv | Clams.csv}}  
 
** {{Data | QERM598_KurilBirths.csv | SealionBirths.dat}}
 
** {{Data | QERM598_KurilBirths.csv | SealionBirths.dat}}
 
* {{Pdf | QERM598_HW5.pdf | Homework}}
 
* {{Pdf | QERM598_HW5.pdf | Homework}}
 +
 
There is, by the way, a great explanation/derivation of the theory behind Chi-Squared contingency tests and other applications of the central limit theorem [http://www.cefe.cnrs.fr/BIOM/pdf/mt2004-part2.pdf  here].
 
There is, by the way, a great explanation/derivation of the theory behind Chi-Squared contingency tests and other applications of the central limit theorem [http://www.cefe.cnrs.fr/BIOM/pdf/mt2004-part2.pdf  here].
  
 
Also, visit this link on making mosaic plots of categorical data: [[R tips/Contingency Tables]].
 
Also, visit this link on making mosaic plots of categorical data: [[R tips/Contingency Tables]].
 +
  
 
===== Week 6: Introduction to Stochastic Modelling =====
 
===== Week 6: Introduction to Stochastic Modelling =====
  
* Lecture presentation: {{Pdf | StochasticModelling.pdf | Introduction to Stochastic Modelling}}
+
* {{Pdf | StochasticModelling.pdf | Introductory lecture on Stochastic Modeling}}
* In-class R codes: {{Rcode| MarkovMatrix.r | MarkovMatrix.r}} and {{Rcode| RandomWalks.r |RandomWalks.r}}
+
* In-class R codes:  
* Homework: {{Pdf | QERM598_HW6.pdf | HW6.pdf}}
+
** {{Rcode| MarkovMatrix.r | MarkovMatrix.r}}  
 +
** {{Rcode| RandomWalks.r |RandomWalks.r}}
 +
* {{Pdf | QERM598_HW6.pdf | Homework}}
 +
 
  
 
===== Week 7: Population Dynamics Models and MCMC =====
 
===== Week 7: Population Dynamics Models and MCMC =====
  
* Lecture presentation: [http://qerm.pbwiki.com/f/Week7_Ian.pdf Week7_Ian.pdf]
+
* {{Pdf | Week7_PopulationsAndMCMC.pdf | Lecture presentation}}
* In-class R codes: [http://qerm.pbwiki.com/f/fibo.r fibo.r]
+
* {{Rcode | Week7_FibonacciPopulations.r | R code}}
 
* No homework
 
* No homework
 +
  
 
===== Week 8: Project Discussions =====
 
===== Week 8: Project Discussions =====
Line 102: Line 111:
 
===== Week 9: Fitting models and optimization =====
 
===== Week 9: Fitting models and optimization =====
  
* Lecture: [http://qerm.pbwiki.com/f/Fitting.pdf Fitting.pdf]
+
* {{Pdf| QERM598_OptimizationAndFitting.pdf | Lecture}}
* Cassava disease data for in-class lab: [http://qerm.pbwiki.com/f/CassavaDisease.csv CassavaDisease.csv]
+
* Data:
* White fly data for in-class lab: [http://qerm.pbwiki.com/f/WhiteFly.csv WhiteFly.csv]
+
** {{Data | QERM598_CassavaDisease.csv |Cassava disease data}}
* Rough R-code for Cassava analysis: [http://qerm.pbwiki.com/f/CassavaCode.r CassavaCode.r]
+
** {{Data | QERM598_WhiteFly.csv | White fly data}}
 +
* {{Rcode | QERM598_CassavaCode.r | Rough R-code for Cassava analysis}}
 +
 
  
 
===== Week 10: Final project presentations =====
 
===== Week 10: Final project presentations =====

Latest revision as of 19:35, 8 February 2011

Mosaic plot of Steller sea lion birth probabilities
  • This course is designed to introduce students to computational methods in Ecological Modeling. Commonly used techniques in Applied Statistics will be covered, as well as some deterministic and stochastic models encountered in Mathematical Biology. While we cover the mechanics of doing various statistical analyses, the goal of the course is to develop the ability to use computational tools creatively to help visualize and grasp statistical concepts and modeled processes. There will be lots of emphasis on simulation and graphical visualization, since these are such an important tools in practice but scantily touched upon in other courses.
  • Our primary tool will be R.
  • Each week will consist of a lecture, an R lab and an assignment. One of the practical goals is to create a body of labs, assignments and datasets that can be used as a basis for a continuation of this course in years to come. All of these materials will be stored on this website.


Contents

Practical Details

  • Location: Odegaard Undergraduate Library Computer Lab 102
  • Times: Wednesday - 1:30-3:30
  • Instructor: Eli Gurarie with the possible participation of guest lecturers, under the benevolent auspices of Dr. Tilmann Gneiting.
  • Office Hours: all day Wednesday in Loew 328 and by appointment.
  • Target Audience: First-year QERM students and other qualified students interested in Quantitative Ecology.
  • Prerequisites: Statistical Inference (STAT 512), Mathematical biology (AMATH 420)


Week by Week: Homeworks, Labs, Data and Notes for QERM 598

Box-plots


Week 0: Basic Introduction to R

If you don't have R already, you can download it from this website. You'll need to download the executable binaries: For Windows users they are here; for MacOS users they are here. Install R and run it.

R is an open source program, which means it's free, efficient and no-frills. It also means that documentation is patchy at best and learning how to use it involves poking around the internet, blindly groping around the internal help file, and, whenever possible, stealing other people's code.

Thankfully, there is lots of expertise at QERM! To start off, download or open the file below, print it out, read it, and walk through the examples. There is a brief assignment at the end of the lab for practice.


Week 1: Distributions and the Central Limit Theorem

Obtaining quantiles and p-values from distributions. Plotting cdfs and pdfs. Generating random data. Introducing plotting functions and loops. Illustrating the Central Limit Theorem numerically.


Week 2: Comparing two samples: Monte Carlo and t-tests

You will need to download the following data files to perform the lab and the homeworks:

You will also need to read Mike Keim's Pdf10.png notes on non-parametric tests.


Week 3: Introduction to Analysis of Variance


Week 4: Linear Regression


Week 5: Chi-squared Contingency Tables

There is, by the way, a great explanation/derivation of the theory behind Chi-Squared contingency tests and other applications of the central limit theorem here.

Also, visit this link on making mosaic plots of categorical data: R tips/Contingency Tables.


Week 6: Introduction to Stochastic Modelling


Week 7: Population Dynamics Models and MCMC


Week 8: Project Discussions


Week 9: Fitting models and optimization


Week 10: Final project presentations
  • Please visit the Final Projects page to browse through final project presentations and R code.
Personal tools