Session 1: Linear and logistic regression

This session will cover the basics of linear and logistic regression. See below for a [#Topics list of topics]. Please make sure to do the readings and let me know in advance if there is terminology that you would like to be clarified. The goal of this session is to go through the basic steps of building linear or logistic regression model, understanding the output, and validating how good this model is.

I've also posted some [#assignments assignments] below. There is only one way to learn how to use the methods we will talk about and that is to apply them yourself to a data set that you understand. The tutorial is intended to get you to the level where you can do that.

Materials

Reading

G&H07

Chapter 3 (pp. 29-49)

Linear regression: the basics

Baa08

Section 4.3.2 (pp. 91 - 105)

Functional relations: linear regression

Sections 6 - 6.2.1 (pp. 181-198)

Regression Modeling (Introduction and Ordinary Least Squares Regression)

Section 6.6 (pp. 258-259)

General considerations

Notes on the readings

If you'd like to follow along, the dataset used in the G&H07 reading can be found here: [http://www.stat.columbia.edu/~gelman/arm/examples/child.iq/]. To use the file, you will need to load the "foreign" package, then use the read.dta() function. Eg:

Additional terminology

Feel free to add terms you want clarified in class:

Questions

Anchor(assignments)

Assignments

Please upload your solutions by Friday 3:30pm.

G&H07

Section 3.9 (pp. 50-51)

Exercises 3 and 5

Baa08

Section 4.7 (p. 126)

Exercises 3 and 7*

* (for Exercise 7, Baayen treats linear regression using lm or ols as the same as analysis of covariance (see section 4.4.1 (pp. 117-119))).

Anchor(Topics)

Topics

MoinMoin Appliance - Powered by TurnKey Linux