Differences between revisions 4 and 5
Revision 4 as of 2008-05-28 14:42:15
Size: 832
Editor: static-69-95-227-215
Comment:
Revision 5 as of 2008-06-04 20:53:46
Size: 1962
Editor: static-74-41-89-18
Comment:
Deletions are marked like this. Additions are marked like this.
Line 14: Line 14:
=== Notes on the readings ===

=== Additional terminology ===
Feel free to add terms you want clarified in class:
 
 *
 *

=== Questions ===
 * Q:

== Suggested topics ==
If you have any material that you would like to cover that isn't included in the list below, please make note of it here.

[[Anchor(assignments)]]
Line 21: Line 36:




[[Anchor(Topics)]]
== Topics ==

 * More on outliers
   * detect outliers
       {{{boxplot()}}}, scatterplots {{{plot(), identify()}}[[BR]]
   * dealing with outliers
       exclusion {{{subset()}}}[[BR]]
       robust regression (based on t-distribution): {{{tlm()}}} (in package {{{hatt}}}))[[BR]]
 * Overly influential cases (can be, but don't have to be outliers)
       {{{lm.influence()}}}, also {{{library(Rcmdr)}}}[[BR]]
 
 * Model evaluation: Where is the model off?
   * case-by-case: {{{residuals(), predict()}}}[[BR]]
   * based on predictor: {{{residuals()}}} against predictors, {{{calibrate()}}}[[BR]]
   * overall: {{{validate()))[[BR]]

 * Corrections:
   * correcting for clusters (violation of assumption of independence): {{{bootcov()}}}[[BR]]

Session 2: Issues in linear regression

June 3 2008

Reading

G&H07

Chapter 4 (pp. 53-74)

Linear regression: before and after fitting the model

Baa08

Sections 6.2.2-6.2.4 (pp. 198-212)

Collinearity, Model criticism, and Validation

Section 6.4 (pp. 234-239)

Regression with breakpoints

Notes on the readings

Additional terminology

Feel free to add terms you want clarified in class:

Questions

  • Q:

Suggested topics

If you have any material that you would like to cover that isn't included in the list below, please make note of it here.

Anchor(assignments)

Assignments

Send your solutions to Andrew Watts, who will upload them here. Please send them by Wednesday 3:30pm or upload them yourself to this page by 10pm.

G&H07

Section 4.9 (p.76)

Exercise 4

Baa08

Section 6.7 (p. 260)

Exercise 1, 8

In addition to the book problems, we will distribute a data set from the ongoing ngrams project.

Anchor(Topics)

Topics

  • More on outliers
    • detect outliers
      • boxplot(), scatterplots {plot(), identify()BR

    • dealing with outliers
      • exclusion subset()BR robust regression (based on t-distribution): tlm() (in package hatt))BR

  • Overly influential cases (can be, but don't have to be outliers)
    • lm.influence(), also library(Rcmdr)BR

  • Model evaluation: Where is the model off?
    • case-by-case: residuals(), predict()BR

    • based on predictor: residuals() against predictors, calibrate()BR

    • overall: {{{validate()))BR

  • Corrections:
    • correcting for clusters (violation of assumption of independence): bootcov()BR

HLPMiniCourseSession2 (last edited 2008-11-09 02:03:35 by cpe-67-240-134-21)

MoinMoin Appliance - Powered by TurnKey Linux