Differences between revisions 23 and 24
Revision 23 as of 2008-05-21 22:35:17
Size: 5683
Editor: cpe-74-65-13-211
Comment:
Revision 24 as of 2008-05-21 22:36:04
Size: 5722
Editor: cpe-74-65-13-211
Comment:
Deletions are marked like this. Additions are marked like this.
Line 27: Line 27:
|| Wiki |||| [http://en.wikipedia.org/wiki/Probability_distribution Probability distributions] ||
Line 28: Line 29:
|| Wiki |||| [http://en.wikipedia.org/wiki/Probability_distribution Probability distributions] ||
Line 39: Line 39:
Absolute novices to R should start by reading at least the parts of Dalgaard's Ch1 indicated above (but really Baayen is pretty easy even without that). Everyone should read Baayen's R intro chapter, followed by a quick walk through Dalgaard's Ch2 on probability distributions in R. Absolute novices to R should start by reading at least the parts of Dalgaard's Ch1 indicated above (but really Baayen is pretty easy even without that). Everyone should read Baayen's R intro chapter. Then read the wiki entrie givne above, followed by a quick walk through Dalgaard's Ch2 on probability distributions in R.

I'm going to be running this section of the course. All questions and comments should either be posted here, or sent directly to me. -- AustinFrank DateTime(2008-05-20T18:19:39Z)

Make sure to read the main page for this tutorial prior to reading this one. In particular, please read the "How to read" section. In a crash course approach like the one we will be taking here, it is inevitable that you will encounter terminology that you are unfamiliar with. Please collect those and feel free to post on this wiki. Each session page has a section with Notes on the readings (by Austin and me) and Additional terminology (where you can add terms you want clarified - just edit the page). -- Florian Jaeger DateTime(2008-05-21T17:21:00Z)

Session 0: Basics (with optional R primer)

11:00, May 27 2008

This optional meeting of the course will be an R primer. We'll be focusing on obtaining a basic level of familiarity required to participate in the course. We won't cover everything you'll need to know for the course, but hopefully we'll cover enough that you will be able to learn new material on your own.

Whether you attend the R primer or not, you are responsible for understanding the content of these readings before the first session.

If there are things you would like to cover, please make note of them below. NB: As of now there are no plans to teach graphing functions during the primer. It's possible that we could have a special add-on session to discuss different graphics packages in R if there's sufficient interest. Let AustinFrank know if you want to participate in such an event.

This page has three sections: Readings and notes on them, Assignments, and a list of topics that will be covered in class (to be edited by you).

Reading

Understanding of this material will be assumed throughout the course. Please read these introductory materials and make sure you understand them before beginning the readings for the first session. Here and for the other sessions, we sometimes assign parts of chapters for reading, so please check the page numbers for the suggested readings (or risk being confused).

Baa08

Chapter 1 (pp. 1-20)

Intro to R

Wiki

[http://en.wikipedia.org/wiki/Probability_distribution Probability distributions]

Dal04

Chapter 2 (pp. 45-55)

Probability distributions in R

G&H07

Chapter 2 (pp. 13-26)

Terminological convention and intro to probability theory

For absolute beginners, this is also very useful:

Dal04

Chapter 1.1 - 1.2

Basics of R

Chapter 1.5 - 1.5

Additionally, feel free to download and print out this [attachment:R-Refcard.pdf reference card] for R. While it's a few years old, the basics it covers have not changed.

Notes on the readings

Absolute novices to R should start by reading at least the parts of Dalgaard's Ch1 indicated above (but really Baayen is pretty easy even without that). Everyone should read Baayen's R intro chapter. Then read the wiki entrie givne above, followed by a quick walk through Dalgaard's Ch2 on probability distributions in R.

The G&H chapter is not an introduction to probability theory, but rather a summary of notational conventions for that book with brief explanations of the concepts. This chapter is definitely not the strong part of the book and not very insightful (and sometimes close to wrong in its simplification), but you should read through it, being fine with the fact that probably it will somewhat confuse you. Later you can go back to this chapter to review conventions used in G&H.

Additional terminology

Feel free to add terms you want clarified in class:

  • term1
  • term2
  • ...

Assignments

  • Make sure you have the latest version of R (version 2.7) installed on a laptop that you can use during class. AndrewWatts wrote some very useful [http://linginst07.stanford.edu/florianR/software/ instructions for installing R] last summer. You should still be able to follow those steps, but make sure you download and install version 2.7.

R Primer

Suggested topics

If you have any material that you would like to cover that isn't included in the list below, please make note of it here.

Topics

Interacting with R and R files

  • Using a command line
    • command history, continuation lines, stopping execution BR defining variables BR calling functions

  • Installing packages
    • install.package(), update.package()

  • Using the R workspace
    • ls(), rm(), setwd(), getwd(), library()

  • Using an R script file
  • Saving R objects
    • save(), save.image()

Getting help

  • Function-specific help
    • ?(), help()

  • Searching the help
    • apropos(), help.search(), RSiteSearch()

Loading data

  • general purpose functions
    • scan()

  • specific formats
    • read.csv(), read.delim(), library(foreign)

General data structures

  • vectors / arrays
    • c()

  • matrices
    • cbind(), rbind(), table()

  • lists
    • list(), unlist()

  • data frames
    • data.frame()

  • general structure manipulation and interaction
    • [], [[]], $, subset(), str(), cut(), repr()

Basic descriptive statistics

  • summary stats
    • mean(), sd(), var(), quantile()

Probability distributions

  • random sampling
    • runif(), rnorm(), rbinom()

HLPMiniCourseSession0 (last edited 2008-11-09 04:14:14 by cpe-69-207-169-204)

MoinMoin Appliance - Powered by TurnKey Linux