Differences between revisions 6 and 7
Revision 6 as of 2008-06-04 20:54:15
Size: 1963
Editor: static-74-41-89-18
Comment:
Revision 7 as of 2008-06-04 20:58:20
Size: 2301
Editor: static-74-41-89-18
Comment:
Deletions are marked like this. Additions are marked like this.
Line 48: Line 48:
       robust regression (based on t-distribution): {{{tlm()}}} (in package {{{hatt}}}))[[BR]]        robust regression (based on t-distribution): {{{tlm()}}} (in package {{{hatt}}})[[BR]]
Line 51: Line 51:
 
 * Collinearity:
   * tests: {{{vif(), kappa()}}}, summary of correlations between fixed effects in {{{lmer()}}} [[BR]]
   * countermeasures:
     * centering and/or standardizing {{{scale()}}}} [[BR]]
     * use of residuals {{{resid(lm(x1 ~ x2, data))}}} [[BR]]
     * principal component analysis (PCA) {{{princomp()}}} [[BR]]
Line 55: Line 62:
   * overall: {{{validate()))[[BR]]    * overall: {{{validate()}}}[[BR]]

Session 2: Issues in linear regression

June 3 2008

Reading

G&H07

Chapter 4 (pp. 53-74)

Linear regression: before and after fitting the model

Baa08

Sections 6.2.2-6.2.4 (pp. 198-212)

Collinearity, Model criticism, and Validation

Section 6.4 (pp. 234-239)

Regression with breakpoints

Notes on the readings

Additional terminology

Feel free to add terms you want clarified in class:

Questions

  • Q:

Suggested topics

If you have any material that you would like to cover that isn't included in the list below, please make note of it here.

Anchor(assignments)

Assignments

Send your solutions to Andrew Watts, who will upload them here. Please send them by Wednesday 3:30pm or upload them yourself to this page by 10pm.

G&H07

Section 4.9 (p.76)

Exercise 4

Baa08

Section 6.7 (p. 260)

Exercise 1, 8

In addition to the book problems, we will distribute a data set from the ongoing ngrams project.

Anchor(Topics)

Topics

  • More on outliers
    • detect outliers
      • boxplot(), scatterplots plot(), identify()BR

    • dealing with outliers
      • exclusion subset()BR robust regression (based on t-distribution): tlm() (in package hatt)BR

  • Overly influential cases (can be, but don't have to be outliers)
    • lm.influence(), also library(Rcmdr)BR

  • Collinearity:
    • tests: vif(), kappa(), summary of correlations between fixed effects in lmer() BR

    • countermeasures:
      • centering and/or standardizing scale()} BR

      • use of residuals resid(lm(x1 ~ x2, data)) BR

      • principal component analysis (PCA) princomp() BR

  • Model evaluation: Where is the model off?
    • case-by-case: residuals(), predict()BR

    • based on predictor: residuals() against predictors, calibrate()BR

    • overall: validate()BR

  • Corrections:
    • correcting for clusters (violation of assumption of independence): bootcov()BR

HLPMiniCourseSession2 (last edited 2008-11-09 02:03:35 by cpe-67-240-134-21)

MoinMoin Appliance - Powered by TurnKey Linux