Welcome to CSI 972 / STAT 972 and CSI 973 / STAT 973
Mathematical Statistics I and II
Instructor:
James Gentle
If you send email to the instructor,
please put "CSI 972" or "CSI 973" in the subject line.
This two-course sequence covers topics in statistical theory
essential for advanced work in statistics.
Course Objectives:
At the end of this two-course sequence the student
should be very familiar with the concepts of mathematical statistics, and
should have the ability to read the advanced literature in the area.
The student should learn a set of tools for doctoral research and should
have the confidence to embark on such research.
The prerequisites for the first course include a course in mathematical statistics
at the advanced calculus level, for example, at George Mason, CSI 672 / STAT 652,
"Statistical Inference", and a measure-theory-based course in probability, for example,
at George Mason, CSI 971 / STAT 971, "Probability Theory".
The first course begins with a brief overview of concepts and
results in measure-theoretic probability theory that are useful in statistics.
This is followed by discussion of some fundamental concepts
in statistical decision theory and inference.
The basic approaches and principles of estimation are explored,
including minimum risk methods with various restrictions such as
unbiasedness or equivariance, maximum likelihood,
and functional methods such as the method of moments and other plug-in
methods. Bayesian decision rules are then considered in some detail.
The methods of minimum variance unbiased estimation are covered in detail.
Topics include sufficiency and completeness of statistics, Fisher
information, bounds on variances of estimators, asymptotic
properties, and statistical decision theory,
including minimax and Bayesian decision rules.
The second course begins where the first ends. It covers the
principles of hypothesis testing and confidence sets in more
detail. We consider
characterization of the decision process, the Neyman-Pearson lemma and
uniformly most powerful tests, confidence sets, and unbiasedness
in inference procedures.
Additional topics include equivariance, robustness, and estimation of functions.
In addition to the classical results in mathematical statistics, the
theory underlying
Markov chain Monte Carlo, quasi-likelihood, empirical likelihood,
statistical functionals, generalized estimating equations, the jackknife,
and the bootstrap is addressed.
The use of computer software for symbolic computations and for Monte Carlo simulations
is encouraged throughout the courses.
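The following sketch, a hypothetical illustration rather than part of the course materials, shows the kind of small Monte Carlo experiment this encouragement has in mind: it estimates the sampling variance of the sample median for standard normal data and compares it with the asymptotic approximation pi/(2n) (which is 1/(4 n f(0)^2) for the N(0,1) density f). The sample size and replication count are arbitrary choices.

```python
# Hypothetical illustration: Monte Carlo estimate of the variance of the
# sample median for i.i.d. N(0,1) data, compared with the asymptotic
# approximation Var(median) ~ pi / (2 n).
import math
import random

random.seed(12345)

n = 51          # sample size (odd, so the median is a single order statistic)
reps = 20000    # number of Monte Carlo replications

medians = []
for _ in range(reps):
    sample = sorted(random.gauss(0.0, 1.0) for _ in range(n))
    medians.append(sample[n // 2])

mean_med = sum(medians) / reps
mc_var = sum((m - mean_med) ** 2 for m in medians) / (reps - 1)
asymptotic_var = math.pi / (2 * n)   # 1 / (4 n f(0)^2) with f the N(0,1) density

print(f"Monte Carlo variance of median: {mc_var:.5f}")
print(f"Asymptotic approximation:       {asymptotic_var:.5f}")
```

For moderate n the Monte Carlo estimate should land close to, but typically a little above, the asymptotic value, which is itself a useful lesson about the quality of asymptotic approximations.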
I have put together a set of
notes
to supplement the material in the text and the lectures. These notes
are in the form of a book and include a subject index that should be
useful. (I am continually working on these notes, so they may change
from week to week.)
The main ingredient for success in a course in mathematical statistics
is the ability to work problems. (It's harder to identify the "main
ingredient" for success in the field of statistics, but
even for that, the ability to work problems is an important component.)
The only way to enhance one's ability to work problems is to
work problems.
It is not sufficient to read, to watch, or to hear solutions to problems.
One of the most serious mistakes students make in courses in
mathematical statistics is to work through a solution that somebody else
has done and to think they have worked the problem.
Some problems, proofs, counterexamples, and derivations should become "easy pieces"
(see my other comments).
An easy piece is something that is important in its own right, but also
may serve as a model or template for many other problems. A student should
attempt to accumulate a large bag of easy pieces. If development of this
bag involves some memorization, that is OK, but things should just naturally
get into the bag in the process of working problems and observing similarities
among problems --- and by seeing the same problem over and over.
Student work in each course will consist of
- a number of homework assignments;
- a midterm exam consisting of an in-class component and, possibly, a
take-home component; and
- a final exam consisting of an in-class component and, possibly, a
take-home component.
Instantiations
Fall, 2012:
CSI 972;
Fall, 2011:
CSI 972;
Spring, 2012:
CSI 973.
Fall, 2010:
CSI 972;
Spring, 2011:
CSI 973.
Fall, 2009:
CSI 972;
Spring, 2010:
CSI 973.
Fall, 2008:
CSI 972.
Fall, 2007:
CSI 972;
Spring, 2008:
CSI 973.
Fall, 2005:
CSI 972;
Spring, 2006:
CSI 973.
Fall, 2003:
CSI 972;
Spring, 2004:
CSI 973.
Fall, 2001:
CSI 972;
Spring, 2002:
CSI 973.
Some Useful References for Mathematical Statistics
Texts on general mathematical statistics at the level of this course, more-or-less
- Lehmann, E. L., and George Casella (1998), Theory of Point Estimation,
second edition, Springer.
- Lehmann, E. L., and Joseph P. Romano (2005), Testing Statistical Hypotheses,
third edition, Springer.
There is a useful companion
book called Testing Statistical Hypotheses: Worked Solutions
by some people at CWI in Amsterdam that has solutions to the exercises
in the first edition. (Most of these are also in the third edition.)
- Schervish, Mark J. (1995), Theory of Statistics,
Springer.
This rigorous and quite comprehensive text has a Bayesian orientation.
- Shao, Jun (2003), Mathematical Statistics, second edition, Springer.
Comprehensive and rigorous; better than the first edition.
- Shao, Jun (2005), Mathematical Statistics:
Exercises and Solutions,
Springer.
Solutions (or partial solutions) to some exercises in Shao (2003), plus some
additional exercises and solutions.
Texts in probability and measure theory and linear spaces
roughly at the level of this course
- Ash, Robert B., and Catherine A. Doleans-Dade (1999),
Probability & Measure Theory, second edition, Academic Press.
Accessible and wide-ranging text; also covers stochastic calculus.
- Athreya, Krishna B., and Soumendra N. Lahiri (2006),
Measure Theory and Probability Theory, Springer.
A very solid book, but beware of typos in the first printing.
- Billingsley, Patrick (1995), Probability and Measure,
third edition, John Wiley & Sons.
In terms of coverage and rigor, this is one of the best books on
measure-theoretic probability.
No explicit coverage of linear spaces.
- Breiman, Leo (1968), Probability,
Addison-Wesley.
This is a classic book on measure-theoretic probability theory.
No explicit coverage of measure theory or linear spaces.
The book (with corrections) is available in the SIAM Classics in Applied Mathematics
series (1992).
- Dudley, R. M. (2002),
Real Analysis and Probability, second edition, Cambridge University Press.
Accessible and comprehensive.
Texts that provide good background for this course
- Berger, James O. (1985), Statistical Decision Theory and Bayesian
Analysis, second edition, Springer.
- Bickel, Peter, and Kjell A. Doksum (2001), Mathematical Statistics:
Basic Ideas and Selected Topics, Volume I, second edition,
Prentice Hall.
This book covers material from Chapters 1-6 and Chapter 10 of the first edition,
but with more emphasis on nonparametric and semiparametric models and on
function-valued parameters. It also includes more Bayesian perspectives. The second
volume will not appear for a couple of years. In the meantime, the first edition
remains a very useful text.
- Casella, George, and Roger L. Berger (2001), Statistical Inference,
second edition, Duxbury Press.
- Robert, Christian P. (1995), The Bayesian Choice,
Springer.
This is a carefully-written book with a somewhat odd title. This book
is at a slightly higher level than the others in this grouping.
- Hogg, Robert V., and Allen T. Craig (1994),
Introduction to Mathematical Statistics, fifth edition, Prentice-Hall.
This old standard
is at a slightly lower level than the others in this grouping.
Interesting monographs
- Barndorff-Nielsen, O. E., and D. R. Cox (1994), Inference and Asymptotics,
Chapman and Hall.
- Brown, Lawrence D. (1986), Fundamentals of Statistical Exponential
Families with Applications in Statistical Decision Theory,
Institute of Mathematical Statistics.
- Lehmann, E. L. (1999), Elements of Large-Sample Theory,
Springer.
- Serfling, Robert J. (1980), Approximation Theorems
of Mathematical Statistics, John Wiley & Sons.
Interesting compendia of counterexamples
An interesting kind of book is one with
the word ``counterexamples'' in its title. Counterexamples provide useful
limits on mathematical facts.
As Gelbaum and Olmsted observed in the preface to their 1964 book, which was the
first in this genre, ``At the risk of oversimplification, we might say that (aside
from definitions, statements, and hard work), mathematics consists of two classes ---
proofs and counterexamples, and that mathematical discovery is directed toward
two major goals --- the formulation of proofs and the construction of
counterexamples.''
- Gelbaum, Bernard R., and John M. H. Olmsted (1990), Theorems and
Counterexamples in Mathematics, Springer.
- Gelbaum, Bernard R., and John M. H. Olmsted (2003), Counterexamples in
Analysis,
(originally published in 1964; corrected reprint of the second printing published by
Holden-Day, Inc., San Francisco, 1965), Dover Publications, Inc., Mineola, New York.
- Romano, Joseph P., and Andrew F. Siegel (1986), Counterexamples in
Probability and Statistics, Chapman and Hall.
In the field of mathematical statistics, this is the most useful
of the ``counterexamples'' books.
It has been rumored that
course instructors get problems from this book. I can neither confirm
nor deny this rumor. I can report that I have the book.
- Stoyanov, Jordan M. (1987), Counterexamples in Probability,
John Wiley & Sons.
- Wise, Gary L., and Eric B. Hall (1993), Counterexamples in Probability
and Real Analysis, The Clarendon Press, Oxford University Press.
Interesting set of essays
- Various authors (2002), Chapter 4, Theory and Methods of Statistics, in
Statistics in the 21st Century, edited by Adrian E. Raftery, Martin A.
Tanner, and Martin T. Wells, Chapman and Hall.
The "golden age" of mathematical statistics was the middle third of the
twentieth century, and the books in the first grouping above
cover the developments of this period very well. The set of
essays in Chapter 4 reviews some of the more recent and ongoing work.
Good compendium on standard probability distributions
- Evans, Merran; Nicholas Hastings; and Brian Peacock (2000), Statistical
Distributions, third edition, John Wiley & Sons.
There is also a multi-volume/multi-edition set of books by Norman Johnson and
Sam Kotz and co-authors, published by Wiley.
The books have titles like "Discrete Multivariate Distributions".
(The series began with four volumes in the 1970s by Johnson and Kotz.
I have those, but over
the years they have been revised, co-authors have been added, and volumes
have been subdivided.
I am not sure what comprises the current set, but any or all of the books
are useful.)
Software for symbolic computation
There is, of course, no substitute for the ability to understand and
work through mathematical derivations and proofs, but just as data
analysis software aids in understanding statistical methods, software for
symbolic manipulation can aid in working through mathematical arguments.
In addition to the role played by data analysis
software in understanding applied statistics,
the software is a major tool of professionals who do data analysis.
Likewise, software for symbolic manipulation is becoming a major tool for
professionals working in mathematical statistics.
Some steps in work in "higher mathematics" depend on recognition of an
expression as a particular form of some well-known object.
This recognition is essentially a data-retrieval problem. The data
may be stored in one's brain, in a table of integrals, or in some other
place.
It is a
stretch to think of the recognition problem as one
that requires a "higher intelligence", although
certainly the ability to do it easily is an important component of
general mathematical ability. Software packages for symbolic computation
can sometimes help in the mechanical processes of solving
mathematical problems.
The main software packages for symbolic computation are Mathematica,
Maple, Macsyma, and Reduce. There are relatively inexpensive
student versions of all of these. Some university computer labs have
one or more of the packages installed. The SCS science cluster has
Mathematica.
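As a small illustration of what such software does, here is a sketch using the free Python library sympy; this is my own example and an assumption on my part, since sympy is not one of the commercial packages named above. It derives the mean and variance of the exponential distribution with rate lambda by symbolic integration of the density.

```python
# Hypothetical illustration with the free CAS sympy (not one of the
# commercial packages named above): derive the mean and variance of the
# exponential(lambda) distribution by symbolic integration.
import sympy as sp

x, lam = sp.symbols('x lambda', positive=True)
pdf = lam * sp.exp(-lam * x)                  # exponential density on (0, oo)

m1 = sp.integrate(x * pdf, (x, 0, sp.oo))     # first moment, E[X] = 1/lambda
m2 = sp.integrate(x**2 * pdf, (x, 0, sp.oo))  # second moment, E[X^2] = 2/lambda^2
var = sp.simplify(m2 - m1**2)                 # variance, 1/lambda^2

print("E[X]   =", m1)
print("Var(X) =", var)
```

The `positive=True` assumption on the symbols is what lets sympy evaluate the improper integrals in closed form; this kind of bookkeeping about conditions is exactly the part of a derivation that such software handles mechanically.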
Most of the books listed below
provide introductions to the software using relatively low-level
applications for illustration.
-
Abell, Martha L.; James P. Braselton; and John A. Rafter (1998),
Statistics with Mathematica, Academic Press.
Mostly devoted to elementary data analysis.
-
Andrews, D. F., and J. E. H. Stafford (2000),
Symbolic Computation for Statistical Inference,
Oxford University Press.
Covers mathematical statistics (as opposed to data analysis).
Emphasizes Mathematica.
- Hastings, Kevin J. (2000),
Introduction to Probability with Mathematica,
Lewis Publishers, Inc.
- Rafter, John A.; James P. Braselton; and Martha L. Abell (2002),
Statistics with Maple, Academic Press.
Mostly devoted to elementary data analysis.
- Rose, Colin; and Murray D. Smith (2002),
Mathematical Statistics with MATHEMATICA,
Springer.
Covers mathematical statistics (as opposed to data analysis). This book
includes a crippled version of a commercial product called
MathStatica.