STAT 789

Advanced Topics in Statistics: Computer-intensive Methods for Classification and Regression

Summer 2005


STAT 789, Advanced Topics in Statistics, is offered from time to time, with different topics being the subject matter of the course. My course for the Summer of 2005 will deal with classification and regression, with an emphasis on newer computer-intensive methods.
Welcome (last updated: June 6, 2005)
page, on which I introduce the course and indicate what you can read prior to the first lecture.
Syllabus (last updated: June 6, 2005)
giving a description of the course, policies and procedures, information about books and software, and information about how to contact me.
Announcements (last updated: 10:01 AM, July 29, 2005)
about the course. (I'll update this section as needed during the summer, providing information about changes I make pertaining to this web site and to previously posted/announced policies and schedules.)
Lecture (last updated: July 21, 2005)
supplements.
Homework (last updated: July 18, 2005)
information (assignments, comments, and hints).
Project (last updated: July 18, 2005)
information.
Final exam review (last updated: July 24, 2005)
information.
Data web pages:
OzDASL - Australasian Data and Story Library
data sets from a Vanderbilt Medical Center web site (but not all data sets pertain to medicine and health)
UCI Machine Learning Repository
StatLib
CISER Internet Data Sources for Social Scientists
Fedstats
Software web pages:
home page for Salford Systems
(producers of CART, MARS, TreeNet, and RandomForests software),
home page for R
(to install, I clicked on CRAN under Download (on left), clicked on the Pittsburgh site under USA, clicked on Windows (95 and later) under Precompiled Binary Distributions, clicked on base under R for Windows, clicked on rw2010.exe under R-2.1.0 for Windows and run the Setup Wizard (clicking Next several times and accepting the defaults), but you may want to choose something different),
home page for Weka
(to install, I clicked on the first link under the downloading and installing information, clicked on the icon in the download column of the Atlanta row, and kept clicking to take the default settings, but you may want to chose something different),
Other web pages:
book chapter on Clasification and Regression Trees, Bagging, and Boosting.