Differences between revisions 7 and 8
Revision 7 as of 2014-02-05 16:39:35
Size: 3503
Editor: wireless
Comment:
Revision 8 as of 2014-02-05 16:50:58
Size: 3931
Editor: wireless
Comment:
Deletions are marked like this. Additions are marked like this.
Line 4: Line 4:
#pragma section-numbers 3 #pragma section-numbers 4
Line 19: Line 19:
=== Lectures ===
Line 28: Line 29:
{{{#!wiki comment/dashed
2nd group: aux lectures
3rd group (one bullet -- there's only one problem set) : problem set [pdf | script | data zip]
4th group (one bullet): assorted additional scripts [script zip]
}}}
=== Auxiliary lectures ===
 * Auxiliary lecture 1: Coding Categorical Predictors & Auxiliary lecture 2: Interactions [[attachment:LSA13-AuxLecture 1&2 - Coding.pdf|PDF]]
 * Auxiliary lecture 5: Time Series Data [[attachment:LSA13-AuxLecture5-TimeSeriesData.pdf|PDF]] [[attachment:LSA13-AuxLecture5-TimeSeriesData-eye-tracking-analysis.R|Script]]

=== Problem Set ===
 * Problem set [[attachment:LSA13-PreLecture2-ProblemSet.pdf|PDF]] [[attachment:LSA13-PreLecture2-ProblemSet.R|Script ]] [[attachment:data.zip|Data zip]]

=== Assorted additional scripts ===
 * Additional scripts [[attachment:scripts.zip|scripts zip]]

2013 LSA Summer Institute class on ''Mixed effect regression''

The slides provided below can be used in teaching with proper acknowledgment of the source. If you need any of the source code, let me know. All slides were created with latex and knitr in RStudio, so that the code shown is also the code used to generate the content of the slides. The R scripts extracted from .Rnw files are provided along with the slides. By downloading the slides you agree to inform me of any errors you find on them ;). Thank you.

This http://lsa2013.lsa.umich.edu/2012/05/mixed-effect-models/ was held at the 2013 LSA Summer Institute.

Class descriptions

With increasing use of quantitative behavioral data, statistical data analysis has rapidly become a crucial part of linguistic training. Linguistic data analysis is often particularly challenging because (i) the relevant data are often sparse, (ii) the data sets are often unbalanced with regard to the variables of interest, and (iii) data points are typically not sampled independently of each other, making it necessary to account for—possibly hierarchical—grouping structures (clusters) in the data. This course provides an introduction to several advanced data analyses techniques that help us to address these challenges. We will focus on the Generalized Linear Model (GLM) and Generalized Linear Mixed Model (GLMM) – what they are, how to fit them, what common ‘traps’ to be aware of, how to interpret them, and how to report and visualize results obtained from these models. GLMs and GLMMs are a powerful tool to understand complex data, including not only whether effects are significant but also what direction and shape they have. GLMs have been used in corpus and sociolinguistics since at least the 60s. GLMMs have recently been introduced to language research through corpus- and psycholinguistics. They are rapidly becoming a popular data analysis techniques in these and other fields (e.g. sociolinguistics).

In this course, I will assume a basic statistical background and a conceptual understanding of at least linear regression.

Materials

Lectures

  • Course overview PDF

  • Day 1: The Generalized Linear Model PDF Script

  • Day 2: The Generalized Linear Mixed Model PDF Script

  • Day 3: Beyond Linear Models PDF

  • Day 4: Tools for data analysis, exploration, and transformation: plyr and reshape2 PDF Script

  • Day 5: Using ggplot2 for data visualization PDF Script

  • Day 6: Common Issues and Solutions in GLM/GLMM modeling PDF

  • Day 6: Reporting Results PDF

Auxiliary lectures

  • Auxiliary lecture 1: Coding Categorical Predictors & Auxiliary lecture 2: Interactions PDF

  • Auxiliary lecture 5: Time Series Data PDF Script

Problem Set

Assorted additional scripts

LSA2013Regression (last edited 2014-02-05 16:50:58 by wireless)

MoinMoin Appliance - Powered by TurnKey Linux