AMS 572: Data Analysis
Introduction to basic statistical procedures. Survey of elementary statistical procedures such as the t-test and chi-square test. Procedures to verify that assumptions are satisfied. Extensions of simple procedures to more complex situations and introduction to one-way analysis of variance. Basic exploratory data analysis procedures (stem and leaf plots, straightening regression lines, and techniques to establish equal variance).
Required Textbook: Statistics and Data Analysis: From Elementary to Intermediate by Ajit C. Tamhane and Dorothy D. Dunlop, 1999, Pearson
Supplementary Textbook: Applied Statistics and the SAS® Programming Language by Ronald P. Cody and Jeffrey K. Smith, 5th edition, 2005, Pearson
Learning Outcomes
- Master the sampling distributions of statistics especially:
- Sampling from the normal populations;
- Sampling from the Bernoulli populations;
- Large sample distribution of sample mean;
- Distribution of order statistics.
- Master the basic concepts of statistical inference:
- Point estimators;
- Pivotal quantity;
- Maximum likelihood based methods;
- Confidence intervals;
- Hypothesis testing.
- Demonstrate skills for inference with one population mean (including derivation of the formulas using the pivotal quantity method):
- Inference on one population mean when the population is normal and the population variance is known;
- Inference on one population mean when the population is normal and the population variance is unknown;
- Inference on one population mean when the population distribution is unknown but the sample size is large;
- Normality test using the normal probability plot and the Shapiro-Wilk test.
- Demonstrate skills for inference with one population variance when the population is normal (including derivation of the formulas using the pivotal quantity method).
- Demonstrate skills for inference with two population means (including derivation of the formulas using the pivotal quantity method):
- Inference on two population means with paired samples – how to reduce that to inference on one population mean with the paired differences;
- Inference on two population means, two independent samples, when both populations are normal and the population variances are known;
- Inference on two population means, two independent samples, when both populations are normal and the population variances are unknown but equal;
- Inference on two population means, two independent samples, when at least one population distribution is not normal but both sample sizes are large.
- Demonstrate skills for inference with two population variances when both populations are normal (including derivation of the formulas using the pivotal quantity method) – especially the F-test for the equality of two population variances.
- Master the basic inference with proportions and count data (including derivation of the formulas using the pivotal quantity method for the inference on one-population proportion and two-population proportions):
- Inference on one population proportion – exact test and large sample inference;
- Inference on two population proportions, independent samples – exact test and large sample inference;
- Inference on two population proportions, paired samples – exact test;
- Inference with one-way contingency table, including the Chi-square goodness-of-fit test;
- Inference with two-way contingency table, test for homogeneity and test for independence.
- Master the basic inference with simple linear regression and correlation:
- Least squares method;
- Error in variable regression;
- Bivariate normal distribution;
- Pearson correlation;
- Spearman rank correlation.
- Demonstrate skills with inference on several population means, independent samples – One-Way ANOVA:
- Understanding of the assumptions, derivation, interpretation of results from statistical analysis;
- Post-hoc (pairwise) comparison of the population means.
- Master the related SAS® and R procedures for all materials covered in lectures.
- Group presentations covering some of the materials in both text books not covered in the regular lectures including:
- Multiple regression;
- One-way ANCOVA;
- Two-way ANOVA & ANCOVA;
- Repeated measures ANOVA;
- Nonparametric methods: Rank based methods;
- Nonparametric methods: Permutation based (permutation test, Jackknife, Bootstrap).