Nominal Variables

  • Things like gender or color (what we would have as factors in ANOVA)
    • We will assume (for now) that a subject can be a member of only one level of a factor a variable (i.e., you can blue or red, but not both)
  • Decisions have to made on how to treat these variables in a regression
  • There are three basic methods we will cover and how to interpret their effects and interactions
  • Key to remember you are basically doing to t-tests, so you are always just comparing two things, but what those two things are will change as you change your coding (think the concept of contrasts in ANOVA from last semester)

Models with no Interactions (1 Nominal variable)

Dummy Coding

  • The most common and basic type used (default in R if it senses a categorical variable)
  • For each variable, you must assign a reference group (a baseline) for each variable
  • Each group is compared to the reference group
  • We ask the question, how much does each group deviate from the reference
  • Here are two levels of one factor
Variable C1
Happy 0
Sad 1
  • Here are Three levels, but now you have to make 2 new variables
  • Happy is the reference group
Variable C1 C2
Happy 0 0
Sad 0 1
Mad 1 0
  • Sad is the reference group
Variable C1 C2
Happy 0 1
Sad 0 0
Mad 1 0
  • Here are Four levels, but now you have to make 3 new variables
  • Happy is the reference group
Variable C1 C2 C3
Happy 0 0 0
Sad 0 0 1
Mad 0 1 0
Grumpy 1 0 0

Regression equation

  • Two Levels: \(Y = B_1C1 + B_0\)
  • Three Levels: \(Y = B_1C1 +B_2C2+ B_0\)
  • Four Levels: \(Y = B_1C1 + B_2C2 + B_2C3 + B_0\)

Creating Dummy Variables

  • In SPSS you would have to create dummy variables using Recode, but it R its a little easier
  • In R, we have to convert our variable into a factor
    • R will default to alphabetical order
      • Sometimes easier to convert all your words into numbers (start with 0)
    • Next, convert your variable into a factor (as.factor)
      • Note: R calls dummy coding as contr.treatment
  • First, let’s simulate a simple factor to work with
    • Emotion rating ~ Expressive intentions of the actor (0 = Flat, 1 = Normal, 2 = Exaggerated )
    • Three Levels: \(Emotion Rating = 0*Flat + 5*Normal +10*Exaggerated + 50 + \epsilon\)
Variable C1 C2
Flat 0 0
Normal 1 0
Exaggerated 0 1
library(ggplot2)
#Set up simulation
set.seed(42)
N <- 200
X<- sample(rep(c(0,1,2),N),N,replace = FALSE)
Y <- 5*X + 50 + rnorm(N, sd=10)
Emotion.Data<-data.frame(Emotion=Y,Style=X)
qplot(Style,Emotion, data = Emotion.Data, colour = I("red"))+theme_bw()+geom_smooth(method='lm')

  • If the data is already coded as 0, 1, 2 all we have to do is:
Emotion.Data$StyleF <- factor(Emotion.Data$Style,
                              level=c(0,1,2),
                              labels=c("Flat", "Normal", "Exaggerated"))
  • We can re-plot it as violin/box plot to see our distributions (its a double-sided density plot [smoothed histogram])
ggplot(data = Emotion.Data,aes(StyleF,Emotion))+
  xlab("")+
  geom_violin(aes(fill=StyleF), trim = FALSE,adjust=1.5)+
  geom_boxplot(width=.1)+
  theme_bw()+
  theme(legend.position = "none")

Interpret Regression

  • Order of the factors matters
  • Remember you are making each level of the variable a term in the equation as you have made a new variable for each level (except baseline)
  • R make these dummy coded automatically so its easy to forget this
    • for 3 levels we will have 3 terms: intercept and StyleFNormal and StyleFExaggerated.
Model.1<-lm(Emotion ~ StyleF, data = Emotion.Data)
Dependent variable:
Emotion
Constant 50.798*** (1.273)
StyleFNormal 2.182 (1.828)
StyleFExaggerated 7.961*** (1.780)
Observations 200
R2 0.098
Adjusted R2 0.089
F Statistic 10.705*** (df = 2; 197)
Note: p<0.05; p<0.01; p<0.001

Coefficients

  • Intercept = Intercept of the equation (Which is the mean of Flat)
  • StyleFNormal = coefficient (slope of Normal) from baseline (Flat)
  • StyleFExaggerated = coefficient (slope of Exaggerated) from baseline (Flat)

Means per Condition

  • Flat = Intercept
  • Normal = Intercept + StyleFNormal
  • Exaggerated = Intercept + StyleFExaggerated

Pvalues

  • The pvalue on the intercept asks if the baseline (Flat) condition different from zero
  • The pvalue on the slopes asks if each of the other levels is different from baseline (Flat)

Rotate the matrix

  • What if we want to know if Normal is different from Exaggerated?
  • We need to relevel (aka change what is zero)
Variable C1 C2
Flat 1 0
Normal 0 0
Exaggerated 0 1
  • We can fix that manually change what the baseline line level is with the relevel command in car package
Emotion.Data$StyleN<- relevel(Emotion.Data$StyleF, ref = "Normal")
Model.1.N<-lm(Emotion ~ StyleN, data = Emotion.Data)
  • Notice in the table the labels and values have changed
Dependent variable:
Emotion
Constant 52.979*** (1.312)
StyleNFlat -2.182 (1.828)
StyleNExaggerated 5.779** (1.809)
Observations 200
R2 0.098
Adjusted R2 0.089
F Statistic 10.705*** (df = 2; 197)
Note: p<0.05; p<0.01; p<0.001

Contrast Coding

Deviation Coding

  • Note: R calls them Contrast Sums
  • Let’s make the Flat the reference again
Variable C1 C2
Normal 1 0
Exaggerated 0 1
Flat -1 -1
  • For deviatation code, R will use the LAST variable as references so we need to recode it by hand to be first as to match the order we want in the above table
Emotion.Data$Style.C.S <- factor(Emotion.Data$Style,
                              level=c(1,2,0),
                              labels=c("Normal", "Exaggerated", "Flat"))
  • Then we will create our new contrast contr.sum with three levels (3)
contr.sum(3)
##   [,1] [,2]
## 1    1    0
## 2    0    1
## 3   -1   -1
  • We will add that code to our lm function, by adding the parameters list(Style.C.S=contr.sum(3))
Model.1.CS<-lm(Emotion ~ Style.C.S, data = Emotion.Data,
               contrasts=list(Style.C.S=contr.sum(3)))
  • Let’s look at how things differ from the mean of all conditions (grand mean)
Dependent variable:
Emotion
Constant 54.178*** (0.737)
Style.C.S1 -1.199 (1.057)
Style.C.S2 4.580*** (1.030)
Observations 200
R2 0.098
Adjusted R2 0.089
F Statistic 10.705*** (df = 2; 197)
Note: p<0.05; p<0.01; p<0.001
  • Intercept = Mean of means (grand mean)
  • Mean of each condition calculated below
Mean.Interaction<-aggregate(Emotion~StyleF,data = Emotion.Data, FUN=mean)
StyleF Emotion
Flat 50.79758
Normal 52.97930
Exaggerated 58.75829
  • Grand mean calculated below
sum(aggregate(Emotion~StyleF,data = Emotion.Data, FUN=mean)[2])/3
## [1] 54.17839

Coefficients

  • Style.C.S1 = coefficient on Normal as it differs from grand mean
  • Style.C.S2 = coefficient on Exaggerated as it differs from grand mean
  • To get the means for each condition:

Means per Condition

  • Flat = Cannot get from this model
  • Normal = Intercept + Style.C.S1
  • Exaggerated = Intercept + Style.C.S2

Pvalues

  • The pvalue on the intercept asks if the grand mean different from zero
  • The pvalue on the slopes asks if each of the other levels is different from grand mean
  • Note: We have lost the ability to ask about Flat condition, to get it back we would have to rotate the baseline and do it again

Simple Coding

  • They must sum to zero, and the abs(values) must sum to 1
  • Like in ANOVA you can design these to ask specific questions by merging conditions (or not)
  • We will use the contr.treatment treatment (which R is using to automatically convert your categorical variables already)
  • Just like dummy coding but we change the meaning of the intercept
  • The reference level is First again
Variable C1 C2
Flat -1/3 -1/3
Normal 2/3 -1/3
Exaggerated -1/3 2/3
Emotion.Data$Style.Simple <- factor(Emotion.Data$Style,
                              level=c(0,1,2),
                              labels=c("Flat", "Normal", "Exaggerated"))

Levels<-3
Simple.1<-contr.treatment(Levels) # Dummy code
#Make your codes
my.coding<-matrix(rep(1/Levels, Levels*(Levels-1)), ncol=Levels-1)
my.simple<-Simple.1-my.coding
my.simple
##            2          3
## 1 -0.3333333 -0.3333333
## 2  0.6666667 -0.3333333
## 3 -0.3333333  0.6666667
  • We will add that code to our lm function, by adding the parameters list(Style.Simple=my.simple)
Model.1.Simple<-lm(Emotion ~ Style.Simple, data = Emotion.Data, 
                   contrasts=list(Style.Simple=my.simple))
Dependent variable:
Emotion
Constant 54.178*** (0.737)
Style.Simple2 2.182 (1.828)
Style.Simple3 7.961*** (1.780)
Observations 200
R2 0.098
Adjusted R2 0.089
F Statistic 10.705*** (df = 2; 197)
Note: p<0.05; p<0.01; p<0.001

Coefficients

  • This merges the dummy and deviance codings thus:
  • Intercept = Mean of means (grand mean)
  • StyleFNormal = slope of Normal from baseline (Flat)
  • StyleFExaggerated = slope of Exaggerated from baseline (Flat)

Means per Condition

  • You cannot get means of each condition from this model

Pvalues

  • The pvalue on the intercept asks if the grand mean different from zero
  • The pvalue on the slopes asks if each of the other levels is different from baseline (Flat)

Other types of contrast coding

  • Helmert: compares each level to the mean of the subsequent levels [can be reversed]
  • Forward Difference coding: one level is compared to the next (adjacent) level. (level 1 vs. 2. level 2 vs. 3) [can be reversed]
  • Custom: Merge and compare levels at will

Center variables and test slopes

  • There is no reason why you cannot treat the variables as continuous is they are ordinal
  • just center them and treat like a continuous variable

Models with Interactions (2 Nominal variables)

  • Interactions can be tricky as the interpretation depends on how you code IVs
    • First, let’s try some categorical vs. categorical interactions
  • We will add a variable to our experiment from above; we will add the location where the actors work (Movie vs. Theatre)
  • Emotion rating ~ Expressive intentions*location of the actor

\(Emotion = -3*Flat + 0*Normal +3*Exag +0*Movie - 1.5*Theatre - 0*Flat*Movie + 0*Normal*Movie +0*Exag*Movie - 10*Flat*Theatre +0*Normal*Theatre +10*Exag*Theatre + 50 + \epsilon\)

set.seed(42)
N <- 200
X <- sample(rep(c(-1,0,1),N),N,replace = FALSE)
Z <- sample(rep(c(0,1),N*3/2),N,replace = FALSE)

# Our equation to create Y
Y <- 3*X -1.5*Z+10*X*Z+ 50 + rnorm(N, sd=10)
#Built our data frame
Emotion.Data.2<-data.frame(Emotion=Y,Style=X,Location=Z)
Emotion.Data.2$Style<-Emotion.Data.2$Style+1

Dummy coding

  • Lets dummy code all of them: convert them to a label in the other we want
# Convert all our factors
Emotion.Data.2$StyleF <- factor(Emotion.Data.2$Style,
                              level=c(0,1,2),
                              labels=c("Flat", "Normal", "Exaggerated"))

Emotion.Data.2$LocationF <- factor(Emotion.Data.2$Location,
                              level=c(0,1),
                              labels=c("Movie Set", "Theatre"))

-Let’s look at the collapsed means and the means per cell

Mean.Style<-aggregate(Emotion~StyleF,data = Emotion.Data.2, FUN=mean)
kable(Mean.Style)
StyleF Emotion
Flat 43.13791
Normal 47.56490
Exaggerated 58.39412
Mean.Location<-aggregate(Emotion~LocationF,data = Emotion.Data.2, FUN=mean)
kable(Mean.Location)
LocationF Emotion
Movie Set 49.75709
Theatre 49.99666
Mean.Interaction<-aggregate(Emotion~StyleF*LocationF,data = Emotion.Data.2, FUN=mean)
kable(Mean.Interaction)
StyleF LocationF Emotion
Flat Movie Set 49.69693
Normal Movie Set 45.80954
Exaggerated Movie Set 53.42534
Flat Theatre 34.00214
Normal Theatre 49.26539
Exaggerated Theatre 63.08686
  • Let examine our regression, but we need to examine a main effects and interaction model
Model.I1.Dummy<-lm(Emotion ~ StyleF+LocationF, data = Emotion.Data.2)
Model.I2.Dummy<-lm(Emotion ~ StyleF*LocationF, data = Emotion.Data.2)
Dependent variable:
Emotion
Main Effects Interaction
(1) (2)
Constant 43.412*** (1.534) 49.697*** (1.592)
StyleFNormal 4.486* (1.987) -3.887 (2.392)
StyleFExaggerated 15.319*** (1.936) 3.728 (2.333)
LocationFTheatre -0.655 (1.604) -15.695*** (2.462)
StyleFNormal:LocationFTheatre 19.151*** (3.513)
StyleFExaggerated:LocationFTheatre 25.356*** (3.423)
Observations 200 200
R2 0.253 0.427
Adjusted R2 0.242 0.412
F Statistic 22.170*** (df = 3; 196) 28.924*** (df = 5; 194)
Note: p<0.05; p<0.01; p<0.001
  • You will notice a large difference between the main effects and interaction model
  • This is because the meaning coefficients in the interaction differ from the main effects model
  • The main effects were explained above, but they change meaning a little because there is second variable

Main Effects

Coefficients

  • These do not connect back to the individual cells as you are assuming the two factors have independent effects

    • Thus these are hypothetical values (not connected to the individual cells)
  • Intercept = Estimate of the mean, assuming the two factors have independent effects at their 0 points [baseline]

  • StyleFNormal = Normal - Flat (assuming the two factors are independent)

  • StyleFExaggerated = Exaggerated - Flat (assuming the two factors are independent)

  • LocationFTheatre = Theatre - Movie(assuming the two factors are independent)

  • We can see the main effect models results

library(effects)
allEffects(Model.I1.Dummy)
##  model: Emotion ~ StyleF + LocationF
## 
##  StyleF effect
## StyleF
##        Flat      Normal Exaggerated 
##    43.09724    47.58320    58.41658 
## 
##  LocationF effect
## LocationF
## Movie Set   Theatre 
##  50.18653  49.53144

Means per Condition

  • You cannot get them

Pvalues

  • The pvalue on the intercept asks if the baseline is different from zero
  • The pvalue on the differences asks if they are from different from baseline

Interactions

Coefficients

  • These cells connect back to experimental cells
  • Intercept = Location(0) @ Style(0) aka Movie @ Flat [real value]
  • StyleFNormal = Differnce of [Location(0) @ Style(1)] - [Location(0) @ Style(0)]
  • StyleFExaggerated = Differnce of [Location(0) @ Style(2)] - [Location(0) @ Style(0)]
  • LocationFTheatre = Differnce of [Location(1) @ Style(0)] - [Location(0) @ Style(0)]
  • StyleFNormal:LocationFTheatre = Differnce of [Location(1) @ Style(1)] - [Location(0) @ Style(1) - Location(0) @ Style(0) + [Location(1) @ Style(0)] - [Location(0) @ Style(0)]
  • StyleFExaggerated:LocationFTheatre = Differnce of [Location(1) @ Style(2)] - [Location(0) @ Style(2) - Location(0) @ Style(0) + [Location(2) @ Style(0)] - [Location(0) @ Style(0)]

Means per Condition

  • You can find all means for all cells!
  • Flat @ Movie = Intercept
  • Normal @ Movie = Intercept + StyleFNormal
  • Exaggerated @ Movie = Intercept + StyleFExaggerated
  • Flat @ Theatre = Intercept + LocationFTheatre
  • Normal @ Theatre = Intercept + LocationFTheatre+StyleFNormal+StyleFNormal:LocationFTheatre
  • Exaggerated @ Theatre = Intercept + LocationFTheatre + StyleFExaggerated + StyleFExaggerated:LocationFTheatre

Pvalues

  • The pvalue on the intercept asks if the baseline is different from zero
  • The pvalue on the differences asks if they are from different from baseline
  • But these are now are now simple effects

Main effect model vs. Interaction model

  • Since the results are not additive, the main effect model is useful only in which to understand how the terms have changed
  • Gives some insight into the interaction

Graph Model

Inter.1<-effect("StyleF*LocationF", Model.I2.Dummy)
plot(Inter.1, multiline = TRUE)

Connection to between-subject ANOVA

  • Regression with only categorical variables can be converted into an ANOVA very easily
  • Thus it can be treated just like ANOVA
  • but the regression makes the ANOVA process moot as you can now the regression can be coded in such a way that you can test follow up tests in a way you want to test them all
library(car)
Anova(Model.I2.Dummy, type="III")
## Anova Table (Type III tests)
## 
## Response: Emotion
##                  Sum Sq  Df  F value    Pr(>F)    
## (Intercept)       96322   1 974.6213 < 2.2e-16 ***
## StyleF              941   2   4.7593  0.009598 ** 
## LocationF          4015   1  40.6229 1.316e-09 ***
## StyleF:LocationF   5814   2  29.4130 6.967e-12 ***
## Residuals         19173 194                       
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
  • Note, we have to call for a Type III sum of squares to match SPSS output as R would default to Type I

Contrast Coding

Deviation Coding

  • Lets code each variable as we did above
Emotion.Data.2$Style.C.S <- factor(Emotion.Data.2$Style,
                              level=c(1,2,0),
                              labels=c("Normal", "Exaggerated", "Flat"))
Emotion.Data.2$Location.C.S <- factor(Emotion.Data.2$Location,
                              level=c(1,0),
                              labels=c("Theatre", "Movie Set"))

Model.I1.C.S<-lm(Emotion ~ Style.C.S+Location.C.S, data = Emotion.Data.2, 
                 contrasts=list(Style.C.S=contr.sum(3),Location.C.S=contr.sum(2)))
Model.I2.C.S<-lm(Emotion ~ Style.C.S*Location.C.S, data = Emotion.Data.2,
                contrasts=list(Style.C.S=contr.sum(3),Location.C.S=contr.sum(2)))
  • Let’s look at how things differ from the mean of all conditions (grand mean)
Dependent variable:
Emotion
Main Effects Interaction
(1) (2)
Constant 49.686*** (0.800) 49.214*** (0.707)
Style.C.S1 -2.116 (1.147) -1.677 (1.011)
Style.C.S2 8.718*** (1.117) 9.042*** (0.985)
Location.C.S1 -0.328 (0.802) -0.430 (0.707)
Style.C.S1:Location.C.S1 2.157* (1.011)
Style.C.S2:Location.C.S1 5.260*** (0.985)
Observations 200 200
R2 0.253 0.427
Adjusted R2 0.242 0.412
F Statistic 22.170*** (df = 3; 196) 28.924*** (df = 5; 194)
Note: p<0.05; p<0.01; p<0.001
  • Again we are testing between the grand mean and the individual terms

Main Effects

Coefficients
  • These do not connect back to the individual cells as you are assuming the two factors have independent effects
  • Thus these are hypothetical Grand mean values
  • Intercept = Grand Mean, but assuming the two factors have independent effects
  • Style.C.S1 = Normal (averaged over location) - Grand Mean (assuming the two factors are independent)
  • Style.C.S2 = Exaggerated (averaged over location) - Grand Mean (assuming the two factors are independent)
  • Location.C.S1 = Theatre (averaged over Style) - Grand Mean (assuming the two factors are independent)
Means per Condition
  • Cannot get from this model
Pvalues
  • The pvalue on the intercept asks if the baseline (Grand Mean) is different from zero
  • The pvalue on the slopes asks if each of the other levels is different from the baseline (Grand Mean)
  • Note we are missing tests on Flat and Movie cell, we would need to rotate the matrix and run again

Interactions

Coefficients
  • Intercept = Grand Mean (all 6 cells mean averaged)
  • Style.C.S1 = Normal (averaged over location) from Grand Mean
  • Style.C.S2 = Exaggerated (averaged over location) from Grand Mean
  • Location.C.S1 = Theatre (averaged over Style) from Grand Mean
  • Style.C.S1:Location.C.S1 = difference of Normal @ Theatre from Grand Mean
  • Style.C.S2:Location.C.S1 = difference of Exaggerated @ Theatre from Grand Mean
Means per Condition
  • Cannot get from this model
Pvalues
  • The pvalue on the intercept asks if the baseline (Grand Mean) is different from zero
  • The pvalue on the differences asks if each of the other levels is from the (Grand Mean)
  • Note we are missing tests on Flat and movie cell, we would need to rotate the matrix and run again

Models with Interactions (1 Nominal and 1 Continuous Variable)

  • We will change our experiment from above, we will add a likert scale which the actor rates how “good” they think the performance was [-3 terrible to 3 excellent]
  • Emotion rating ~ Quality Rating X location of the actor (i.e., Movie Set, Theatre)
#Set up simulation
set.seed(42)
N <- 200
X <- runif(N,-3,3)
Z <- sample(rep(c(0,1),N/2),N,replace = FALSE)
# Our equation to create Y
Y <- 4*X +2*Z+8*X*Z+ 50 + rnorm(N, sd=10)
#Built our data frame
Emotion.Data.3<-data.frame(Emotion=Y,Quality=X,Location=Z)

Center the continuous variable

  • Best to center as its will help us interpret
# Convert all our factors
Emotion.Data.3$Quality.C <- scale(Emotion.Data.3$Quality, scale = FALSE)[,]

Dummy coding of our nominal variable

  • Make Location into a factor
# Convert all our factors
Emotion.Data.3$LocationF <- factor(Emotion.Data.3$Location,
                                   level=c(0,1),
                                   labels=c("Movie Set","Theatre"))
  • First to understand what the interaction will yield lets us run two regressions where we seperate our factor to see what the slopes would be just for actors at Movie Set vs at the Theatre

Movie Set ONLY vs Theatre Only

  • We will subset the data and test for the Quality slopes
Model.Movie<-lm(Emotion ~ Quality.C, 
                data = subset(Emotion.Data.3,LocationF=="Movie Set"))
Model.Theatre<-lm(Emotion ~ Quality.C, 
                data = subset(Emotion.Data.3,LocationF=="Theatre"))
Dependent variable:
Emotion
Movie Set ONLY Theatre Only
(1) (2)
Constant 50.468*** (0.976) 52.426*** (1.051)
Quality.C 4.551*** (0.593) 12.083*** (0.569)
Observations 100 100
R2 0.375 0.821
Adjusted R2 0.369 0.819
F Statistic (df = 1; 98) 58.842*** 450.430***
Note: p<0.05; p<0.01; p<0.001
  • Ignore the intercepts and pay attention to the slopes:
    • Quality @ the Movie Set ONLY is \(b\)= 4.551
      • The pvalues tells you if the slope of quality is differnet from a slope of 0
    • Quality @ the Theatre Only ONLY is \(b\)= 12.083
      • The pvalues tells you if the slope of quality is differnet from a slope of 0
  • What we really want to test is if there is a difference in slopes between quality at Movies sets VS Theatre

Testing the Interaction

  • We will test a main effects model, Quality+Location and compare it to an interaction model Quality*Location
Model.E3.1<-lm(Emotion ~ Quality.C+LocationF, data = Emotion.Data.3)
Model.E3.2<-lm(Emotion ~ Quality.C*LocationF, data = Emotion.Data.3)
  • Does model interaction improve the model fit? (Just like we did with hierarchical testing)
Model.Fit.E3<-anova(Model.E3.1,Model.E3.2)
library(knitr)
kable(Model.Fit.E3)
Res.Df RSS Df Sum of Sq F Pr(>F)
197 28691.61 NA NA NA NA
196 20141.73 1 8549.886 83.19931 0
  • Yes! The interaction is significant improvement
  • Let’s go examine the individual regression paramaters:

Interpret Regression

Dependent variable:
Emotion
Main Effects Interaction
(1) (2)
Constant 50.513*** (1.207) 50.468*** (1.014)
Quality.C 8.749*** (0.488) 4.551*** (0.616)
LocationFTheatre 1.948 (1.707) 1.958 (1.434)
Quality.C:LocationFTheatre 7.532*** (0.826)
Observations 200 200
R2 0.621 0.734
Adjusted R2 0.617 0.730
F Statistic 161.308*** (df = 2; 197) 180.143*** (df = 3; 196)
Note: p<0.05; p<0.01; p<0.001

Main Effects Model

  • Intercept = Mean of Quality @ Movies [p = Is the intercept different from 0]
  • Quality.C = Quality.C slope averaged across Movies & * Threatre*
    • This is a Main Effect [p = Is slope different from 0 slope]
      • This would almost be like averaging the slopes from our Movie Set ONLY and Theatre Only Models
        • \(b\) = (4.551 + 12.083)/2 = 8.317
  • LocationFTheatre = Threatre difference from intercept [p = Is mean of Theatre different from Movies @ mean of quality]
    • This is a Simple Effect

Interaction Model

  • Intercept = Mean of Quality @ Movies [p = Is the intercept different from 0]
  • Quality.C = Quality.C slope @ Movies
    • This is a Simple Slope [p = Is slope different from 0 slope]
      • To prove it we can see it matches the slopes from our Movie Set ONLY Model, \(b\)= 4.551
  • LocationFTheatre = Threatre difference from intercept [p = Is mean of Theatre different from movies @ mean of quality]
    • This is a Simple Effect
  • Quality.C:LocationFTheatre= slope difference between Movie and Theatre [p = Is the slope of quality @ Theatre different from slope at ]
    • This is an Interaction
      • To prove it we can see it matches the slopes of Theatre Only - Movie Set ONLY Model
        • \(b\)= (12.083 - 4.551) = 7.532
  • If you wanted to know the simple slope of Theatre Only you would have to relevel Location and make Theatre = 0 (i.e., the reference)
Emotion.Data.3$LocationT<- relevel(Emotion.Data.3$LocationF, ref = "Theatre")
Model.E3.2.ReLevel<-lm(Emotion ~ Quality.C*LocationT, data = Emotion.Data.3)
Dependent variable:
Emotion
Theatre is Ref
Constant 52.426*** (1.014)
Quality.C 12.083*** (0.549)
LocationTMovie Set -1.958 (1.434)
Quality.C:LocationTMovie Set -7.532*** (0.826)
Observations 200
R2 0.734
Adjusted R2 0.730
F Statistic 180.143*** (df = 3; 196)
Note: p<0.05; p<0.01; p<0.001
  • You will notice the interaction flipped direction and now the Quality.C coefficient \(b\) = 12.083 = the Theatre Only model \(b\) = 12.083

Plotting Interaction

  • Often times you will need to use the effects package and plot by hand
  • The rockchalk package will work for simple models like this one
library(rockchalk)
plotSlopes(Model.E3.2, plotx = "Quality.C", modx = "LocationF")

Deviation coding of our nominal variable

  • We will still center quality
  • This creates a more ANOVA like interpretation of the interaction
Emotion.Data.3$LocationD <- factor(Emotion.Data.3$Location,
                                   level=c(1,0),
                                   labels=c("Theatre","Movie Set"))
Model.E3.D1<-lm(Emotion ~ Quality.C+LocationD, data = Emotion.Data.3,
                 contrasts=list(LocationD=contr.sum(2)))
Model.E3.D2<-lm(Emotion ~ Quality.C*LocationD, data = Emotion.Data.3,
                 contrasts=list(LocationD=contr.sum(2)))

Interpret Regression

Dependent variable:
Emotion
Main Effects Interaction
(1) (2)
Constant 51.487*** (0.853) 51.447*** (0.717)
Quality.C 8.749*** (0.488) 8.317*** (0.413)
LocationD1 0.974 (0.853) 0.979 (0.717)
Quality.C:LocationD1 3.766*** (0.413)
Observations 200 200
R2 0.621 0.734
Adjusted R2 0.617 0.730
F Statistic 161.308*** (df = 2; 197) 180.143*** (df = 3; 196)
Note: p<0.05; p<0.01; p<0.001

Main Effects

Coefficients & Pvalues

  • Intercept = Mean of Quality @ mean of Movie & Theatre (imaginary) [p = Is the intercept different from 0]
  • Quality.C = Quality.C slope [p = Is slope different from 0]
  • LocationFTheatre = Threatre difference from 0 (not - 1) thus is 1/2 the slope of the dummy code [p = Is mean of Theatre different from movies @ mean of quality]

Interactions

Coefficients & Pvalues

  • Intercept = Mean of Quality @ mean of Movie & Theatre (imaginary) [p = Is the intercept different from 0]
  • Quality.C = Quality.C slope @ mean of Movie & Theatre (imaginary) [p = Is the Main effect of quality @ mean of Movie & Theatre]
  • LocationFTheatre = Threatre difference from intercept [p = Is the Main effect of Location @ mean of quality]
  • Quality.C:LocationFTheatre= Quality.C slope difference between Movie and theatre [p = Is the interaction]
  • Note all coefficients are 1/2 of the size they should be!

Simple coding of our nominal variable

  • We will still center quality
  • This creates a more ANOVA-like interpretation of the interaction,
    • Most importantly it does not screw up our coefficients as they match the dummy code values (which make the most sense)
  • This time I will hand code it into a factor (easy with only 2 levels)
Emotion.Data.3$LocationS<-as.numeric(Emotion.Data.3$Location)-.5
Model.E3.S1<-lm(Emotion ~ Quality.C+LocationS, data = Emotion.Data.3)
Model.E3.S2<-lm(Emotion ~ Quality.C*LocationS, data = Emotion.Data.3)

Interpret Regression

Dependent variable:
Emotion
Main Effects Interaction
(1) (2)
Constant 51.487*** (0.853) 51.447*** (0.717)
Quality.C 8.749*** (0.488) 8.317*** (0.413)
LocationS 1.948 (1.707) 1.958 (1.434)
Quality.C:LocationS 7.532*** (0.826)
Observations 200 200
R2 0.621 0.734
Adjusted R2 0.617 0.730
F Statistic 161.308*** (df = 2; 197) 180.143*** (df = 3; 196)
Note: p<0.05; p<0.01; p<0.001

Main Effects

Coefficients & Pvalues

  • Intercept = Mean of Quality @ mean of Movie & Theatre (imaginary thing) [p = Is the intercept different from 0]
  • Quality.C = Quality.C slope [p = Is slope different from 0]
  • LocationFTheatre = Threatre difference from 0 [p = Is mean of Theatre different from movies @ mean of quality]

Interactions

Coefficients & Pvalues

  • Intercept = Mean of Quality @ mean of Movie & Theatre (imaginary thing) [p = Is the intercept different from 0]
  • Quality.C = Quality.C slope @ mean of Movie & Theatre (imaginary thing) [p = Is the Main effect of quality @ mean of Movie & Theatre]
  • LocationS = Threatre difference from intercept [p = Is the Main effect of Location @ mean of quality]
  • Quality.C:LocationS= Quality.C slope difference between Movie and theatre [p = Is their an interaction]
  • Note all coefficients are their proper size now (to match true differences)
  • Best option is often dummy or simple, deviation here is weird
---
title: 'Categorical Variables'
output:
  html_document:
    code_download: yes
    fontsize: 8pt
    highlight: textmate
    number_sections: no
    theme: flatly
    toc: yes
    toc_float:
      collapsed: no
---

```{r setup, include=FALSE}
knitr::opts_chunk$set(cache=TRUE)
knitr::opts_chunk$set(echo = TRUE)
knitr::opts_chunk$set(message = FALSE)
knitr::opts_chunk$set(warning =  FALSE)
knitr::opts_chunk$set(fig.width=4.25)
knitr::opts_chunk$set(fig.height=4.0)
knitr::opts_chunk$set(fig.align='center') 
knitr::opts_chunk$set(results='hold') 
```


# Nominal Variables 
- Things like gender or color (what we would have as factors in ANOVA)
    - We will assume (for now) that a subject can be a member of *only one* level of a factor a variable (i.e., you can blue or red, but not both)
- Decisions have to made on how to treat these variables in a regression
- There are three basic methods we will cover and how to interpret their effects and interactions
- Key to remember you are basically doing to t-tests, so you are always just comparing two things, but what those two things are will change as you change your coding (think the concept of contrasts in ANOVA from last semester)


# Models with no Interactions (1 Nominal variable)

## Dummy Coding
- The most common and basic type used (default in R if it senses a categorical variable)
- For each variable, you must assign a reference group (a baseline) for each variable
- Each group is compared to the reference group
- We ask the question, *how much does each group deviate from the reference*
- Here are two levels of one factor


Variable | C1 
---------| ---
Happy    | 0
Sad      | 1

- Here are Three levels, but now you have to make 2 new variables 
- Happy is the reference group

Variable| C1| C2
--------| --| --
Happy   | 0 | 0
Sad     | 0 | 1
Mad     | 1 | 0

- Sad is the reference group

Variable| C1| C2
--------| --| --
Happy   | 0 | 1
Sad     | 0 | 0
Mad     | 1 | 0

- Here are Four levels, but now you have to make 3 new variables 
- Happy is the reference group

Variable     | C1 | C2 | C3
-------------| -- | -- | -- 
Happy        | 0  | 0  | 0
Sad          | 0  | 0  | 1
Mad          | 0  | 1  | 0
Grumpy       | 1  | 0  | 0


### Regression equation
- Two Levels: $Y = B_1C1 + B_0$
- Three Levels: $Y = B_1C1 +B_2C2+ B_0$
- Four Levels: $Y = B_1C1 + B_2C2 + B_2C3 + B_0$

### Creating Dummy Variables
- In SPSS you would have to create dummy variables using Recode, but it R its a little easier
- In R, we have to convert our variable into a factor
    - R will default to alphabetical order
        - Sometimes easier to convert all your words into numbers (start with 0)
    - Next, convert your variable into a factor (as.factor)
        - Note: R calls dummy coding as contr.treatment
- First, let's simulate a simple factor to work with
    - Emotion rating ~ Expressive intentions of the actor (0 = Flat, 1 = Normal, 2 = Exaggerated )
    - Three Levels: $Emotion Rating = 0*Flat + 5*Normal +10*Exaggerated + 50 + \epsilon$


Variable    | C1| C2
------------| --| ---
Flat        | 0 | 0
Normal      | 1 | 0
Exaggerated | 0 | 1


```{r}
library(ggplot2)
#Set up simulation
set.seed(42)
N <- 200
X<- sample(rep(c(0,1,2),N),N,replace = FALSE)
Y <- 5*X + 50 + rnorm(N, sd=10)
Emotion.Data<-data.frame(Emotion=Y,Style=X)
qplot(Style,Emotion, data = Emotion.Data, colour = I("red"))+theme_bw()+geom_smooth(method='lm')
```

- If the data is already coded as 0, 1, 2 all we have to do is: 

```{r}
Emotion.Data$StyleF <- factor(Emotion.Data$Style,
                              level=c(0,1,2),
                              labels=c("Flat", "Normal", "Exaggerated"))
```

- We can re-plot it as violin/box plot to see our distributions (its a double-sided density plot [smoothed histogram])

```{r}
ggplot(data = Emotion.Data,aes(StyleF,Emotion))+
  xlab("")+
  geom_violin(aes(fill=StyleF), trim = FALSE,adjust=1.5)+
  geom_boxplot(width=.1)+
  theme_bw()+
  theme(legend.position = "none")
```


### Interpret Regression
- Order of the factors matters
- Remember you are making each level of the variable a term in the equation as you have made a new variable for each level (except baseline)
- *R make these dummy coded automatically so its easy to forget this*
    - for 3 levels we will have 3 terms: intercept and StyleFNormal and StyleFExaggerated.

```{r}
Model.1<-lm(Emotion ~ StyleF, data = Emotion.Data)
```


```{r, echo=FALSE,results='asis'}
library(stargazer)
stargazer(Model.1,type="html",
          intercept.bottom = FALSE,  single.row=TRUE, notes.append = FALSE,
          omit.stat=c("ser"), star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```

#### Coefficients
- Intercept = Intercept of the equation (Which is the mean of Flat)
- StyleFNormal = coefficient (slope of Normal) from baseline (Flat)
- StyleFExaggerated = coefficient (slope of Exaggerated) from baseline (Flat)

#### Means per Condition

- *Flat = Intercept*
- *Normal = Intercept + StyleFNormal*
- *Exaggerated = Intercept + StyleFExaggerated*

#### Pvalues
- The pvalue on the intercept asks if the baseline (*Flat*) condition different from zero
- The pvalue on the slopes asks if each of the other levels is different from baseline (*Flat*)


### Rotate the matrix
- What if we want to know if Normal is different from Exaggerated?
- We need to relevel (aka change what is **zero**)

Variable    | C1| C2
------------| --| ---
Flat        | 1 | 0
Normal      | 0 | 0
Exaggerated | 0 | 1

- We can fix that manually change what the baseline line level is with the `relevel` command in `car` package

```{r}
Emotion.Data$StyleN<- relevel(Emotion.Data$StyleF, ref = "Normal")
Model.1.N<-lm(Emotion ~ StyleN, data = Emotion.Data)
```

- Notice in the table the labels and values have changed

```{r, echo=FALSE, results='asis'}
stargazer(Model.1.N,type="html",
          intercept.bottom = FALSE,single.row=TRUE, notes.append = FALSE, 
          omit.stat=c("ser"), star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```


## Contrast Coding
- Same contrasts as in our ANOVAs but there are two versions (unweighted and weighted) and both need to sum to zero
- We ask now a different question: *How much does each group deviate from the mean of all groups*
- I will review only the most common see for the others:
    - http://stats.idre.ucla.edu/r/library/r-library-contrast-coding-systems-for-categorical-variables/
- I will not show you the weighted version as not often used

### Deviation Coding 
- Note: R calls them Contrast Sums
- Let's make the Flat the reference again

Variable    | C1| C2
------------| --| ---
Normal      | 1 | 0
Exaggerated | 0 | 1
Flat        |-1 | -1

- For deviatation code, R will use the **LAST** variable as references so we need to recode it by hand to be first as to match the order we want in the above table

```{r, echo=TRUE, warning=FALSE}
Emotion.Data$Style.C.S <- factor(Emotion.Data$Style,
                              level=c(1,2,0),
                              labels=c("Normal", "Exaggerated", "Flat"))
```

- Then we will create our new contrast `contr.sum` with three levels `(3)`  

```{r}
contr.sum(3)
```

- We will add that code to our `lm` function, by adding the parameters `list(Style.C.S=contr.sum(3))`  

```{r, echo=TRUE, warning=FALSE}
Model.1.CS<-lm(Emotion ~ Style.C.S, data = Emotion.Data,
               contrasts=list(Style.C.S=contr.sum(3)))
```

- Let's look at how things differ from the mean of all conditions (grand mean)

```{r, echo=FALSE, results='asis'}
stargazer(Model.1.CS,type="html",
          intercept.bottom = FALSE,
          single.row=TRUE, notes.append = FALSE,
          omit.stat=c("ser"), star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```

- Intercept = Mean of means (grand mean)
- Mean of each condition calculated below

```{r}
Mean.Interaction<-aggregate(Emotion~StyleF,data = Emotion.Data, FUN=mean)
```

```{r, echo=FALSE, results='asis'}
library(knitr)
kable(Mean.Interaction)
```

- Grand mean calculated below

```{r}
sum(aggregate(Emotion~StyleF,data = Emotion.Data, FUN=mean)[2])/3
```

#### Coefficients
- Style.C.S1 = coefficient on Normal as it differs from grand mean
- Style.C.S2 = coefficient on Exaggerated as it differs from grand mean
- To get the means for each condition: 

#### Means per Condition
- *Flat = Cannot get from this model*
- *Normal = Intercept + Style.C.S1*
- *Exaggerated = Intercept + Style.C.S2*

#### Pvalues
- The pvalue on the intercept asks if the *grand mean* different from zero
- The pvalue on the slopes asks if each of the other levels is different from *grand mean*
- *Note*: We have lost the ability to ask about Flat condition, to get it back we would have to rotate the baseline and do it again

###  Simple Coding
- They must sum to zero, and the abs(values) must sum to 1
- Like in ANOVA you can design these to ask specific questions by merging conditions (or not)
- We will use the contr.treatment treatment (which R is using to automatically convert your categorical variables already)
- **Just like dummy coding but we change the meaning of the intercept**
- The reference level is **First** again

Variable    | C1   | C2
------------| ---- | ---
Flat        | -1/3 | -1/3
Normal      |  2/3 | -1/3
Exaggerated | -1/3 |  2/3

```{r, echo=TRUE, warning=FALSE,message=FALSE}
Emotion.Data$Style.Simple <- factor(Emotion.Data$Style,
                              level=c(0,1,2),
                              labels=c("Flat", "Normal", "Exaggerated"))

Levels<-3
Simple.1<-contr.treatment(Levels) # Dummy code
#Make your codes
my.coding<-matrix(rep(1/Levels, Levels*(Levels-1)), ncol=Levels-1)
my.simple<-Simple.1-my.coding
my.simple
```

- We will add that code to our `lm` function, by adding the parameters `list(Style.Simple=my.simple)`  

```{r}
Model.1.Simple<-lm(Emotion ~ Style.Simple, data = Emotion.Data, 
                   contrasts=list(Style.Simple=my.simple))
```

```{r, echo=FALSE, results='asis'}
stargazer(Model.1.Simple,type="html",
          intercept.bottom = FALSE,
          single.row=TRUE,  notes.append = FALSE,
          omit.stat=c("ser"), star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```

#### Coefficients
- This merges the dummy and deviance codings thus: 
- Intercept = Mean of means (grand mean)
- StyleFNormal =  slope of Normal from baseline (Flat)
- StyleFExaggerated = slope of Exaggerated from baseline (Flat)

#### Means per Condition
- *You cannot get means of each condition from this model*

#### Pvalues
- The pvalue on the intercept asks if the *grand mean* different from zero
- The pvalue on the slopes asks if each of the other levels is different from baseline (*Flat*)

### Other types of contrast coding
- Helmert: compares each level to the mean of the subsequent levels [can be reversed]
- Forward Difference coding: one level is compared to the next (adjacent) level. (level 1 vs. 2. level 2 vs. 3) [can be reversed]
- Custom: Merge and compare levels at will

#### Center variables and test slopes
- There is no reason why you cannot treat the variables as continuous is they are *ordinal*
- just center them and treat like a continuous variable


# Models with Interactions (2 Nominal variables)
- Interactions can be tricky as the interpretation depends on how you code IVs
    - First, let's try some categorical vs. categorical interactions
- We will add a variable to our experiment from above; we will add the location where the actors work (Movie vs. Theatre)
- Emotion rating ~ Expressive intentions*location of the actor 

$Emotion = -3*Flat + 0*Normal +3*Exag +0*Movie - 1.5*Theatre - 0*Flat*Movie + 0*Normal*Movie +0*Exag*Movie - 10*Flat*Theatre +0*Normal*Theatre +10*Exag*Theatre + 50 + \epsilon$


```{r}
set.seed(42)
N <- 200
X <- sample(rep(c(-1,0,1),N),N,replace = FALSE)
Z <- sample(rep(c(0,1),N*3/2),N,replace = FALSE)

# Our equation to create Y
Y <- 3*X -1.5*Z+10*X*Z+ 50 + rnorm(N, sd=10)
#Built our data frame
Emotion.Data.2<-data.frame(Emotion=Y,Style=X,Location=Z)
Emotion.Data.2$Style<-Emotion.Data.2$Style+1
```


## Dummy coding
- Lets dummy code all of them: convert them to a label in the other we want

```{r, echo=TRUE, warning=FALSE}
# Convert all our factors
Emotion.Data.2$StyleF <- factor(Emotion.Data.2$Style,
                              level=c(0,1,2),
                              labels=c("Flat", "Normal", "Exaggerated"))

Emotion.Data.2$LocationF <- factor(Emotion.Data.2$Location,
                              level=c(0,1),
                              labels=c("Movie Set", "Theatre"))

```

-Let's look at the collapsed means and the means per cell

```{r, echo=TRUE, warning=FALSE,message=FALSE,results='asis'}
Mean.Style<-aggregate(Emotion~StyleF,data = Emotion.Data.2, FUN=mean)
kable(Mean.Style)
Mean.Location<-aggregate(Emotion~LocationF,data = Emotion.Data.2, FUN=mean)
kable(Mean.Location)
Mean.Interaction<-aggregate(Emotion~StyleF*LocationF,data = Emotion.Data.2, FUN=mean)
kable(Mean.Interaction)
```

- Let examine our regression, but we need to examine a main effects and interaction model

```{r}
Model.I1.Dummy<-lm(Emotion ~ StyleF+LocationF, data = Emotion.Data.2)
Model.I2.Dummy<-lm(Emotion ~ StyleF*LocationF, data = Emotion.Data.2)
```


```{r, echo=FALSE,results='asis'}
stargazer(Model.I1.Dummy,Model.I2.Dummy,type="html",
          column.labels = c("Main Effects", "Interaction"),
          intercept.bottom = FALSE,
          single.row=TRUE,notes.append = FALSE,
          omit.stat=c("ser"), star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```


- You will notice a large difference between the main effects and interaction model
- This is because the meaning coefficients in the interaction differ from the main effects model
- The main effects were explained above, but they change meaning a little because there is second variable

### Main Effects

#### Coefficients
- These do not connect back to the individual cells as you are assuming the two factors have independent effects
    - Thus these are hypothetical values (not connected to the individual cells)
- Intercept = Estimate of the mean, assuming the two factors have independent effects at their 0 points [baseline]
- StyleFNormal = Normal - Flat (assuming the two factors are independent)
- StyleFExaggerated = Exaggerated - Flat (assuming the two factors are independent)
- LocationFTheatre = Theatre - Movie(assuming the two factors are independent)

- We can see the main effect models results

```{r}
library(effects)
allEffects(Model.I1.Dummy)
```

#### Means per Condition 
- You cannot get them

#### Pvalues
- The pvalue on the intercept asks if the baseline is different from zero
- The pvalue on the differences asks if they are from different from baseline 

### Interactions

#### Coefficients
- These cells connect back to experimental cells
- Intercept = Location(0) @ Style(0) aka Movie @ Flat [real value]
- StyleFNormal = Differnce of [Location(0) @ Style(1)] - [Location(0) @ Style(0)]
- StyleFExaggerated = Differnce of [Location(0) @ Style(2)] - [Location(0) @ Style(0)]
- LocationFTheatre = Differnce of [Location(1) @ Style(0)] - [Location(0) @ Style(0)]
- StyleFNormal:LocationFTheatre = Differnce of [Location(1) @ Style(1)] - [Location(0) @ Style(1) - Location(0) @ Style(0) + [Location(1) @ Style(0)] - [Location(0) @ Style(0)]
- StyleFExaggerated:LocationFTheatre = Differnce of [Location(1) @ Style(2)] - [Location(0) @ Style(2) - Location(0) @ Style(0) + [Location(2) @ Style(0)] - [Location(0) @ Style(0)]


#### Means per Condition
- You can find all means for all cells! 
- *Flat @ Movie = Intercept*
- *Normal @ Movie = Intercept + StyleFNormal* 
- *Exaggerated @ Movie = Intercept + StyleFExaggerated* 
- *Flat @ Theatre = Intercept + LocationFTheatre*
- *Normal @ Theatre = Intercept + LocationFTheatre+StyleFNormal+StyleFNormal:LocationFTheatre* 
- *Exaggerated @ Theatre =  Intercept + LocationFTheatre + StyleFExaggerated + StyleFExaggerated:LocationFTheatre * 

#### Pvalues
- The pvalue on the intercept asks if the baseline is different from zero
- The pvalue on the differences asks if they are from different from baseline 
- But these are now are now simple effects 


### Main effect model vs. Interaction model
- Since the results are not additive, the main effect model is useful only in which to understand how the terms have changed
- Gives some insight into the interaction

### Graph Model

```{r}
Inter.1<-effect("StyleF*LocationF", Model.I2.Dummy)
plot(Inter.1, multiline = TRUE)
```

### Connection to between-subject ANOVA
- Regression with only categorical variables can be converted into an ANOVA very easily 
- Thus it can be treated just like ANOVA
- but the regression makes the ANOVA process moot as you can now the regression can be coded in such a way that you can test follow up tests in a way you want to test them all 

```{r}
library(car)
Anova(Model.I2.Dummy, type="III")
```

- Note, we have to call for a Type III sum of squares to match SPSS output as R would default to Type I

## Contrast Coding

### Deviation Coding
- Lets code each variable as we did above

```{r, echo=TRUE, warning=FALSE}
Emotion.Data.2$Style.C.S <- factor(Emotion.Data.2$Style,
                              level=c(1,2,0),
                              labels=c("Normal", "Exaggerated", "Flat"))
Emotion.Data.2$Location.C.S <- factor(Emotion.Data.2$Location,
                              level=c(1,0),
                              labels=c("Theatre", "Movie Set"))

Model.I1.C.S<-lm(Emotion ~ Style.C.S+Location.C.S, data = Emotion.Data.2, 
                 contrasts=list(Style.C.S=contr.sum(3),Location.C.S=contr.sum(2)))
Model.I2.C.S<-lm(Emotion ~ Style.C.S*Location.C.S, data = Emotion.Data.2,
                contrasts=list(Style.C.S=contr.sum(3),Location.C.S=contr.sum(2)))
```

- Let's look at how things differ from the mean of all conditions (grand mean)

```{r, echo=FALSE, results='asis'}
stargazer(Model.I1.C.S,Model.I2.C.S,type="html",
          column.labels = c("Main Effects", "Interaction"),
          intercept.bottom = FALSE,
          single.row=TRUE, notes.append = FALSE,
          omit.stat=c("ser"), star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```

- Again we are testing between the grand mean and the individual terms

#### Main Effects

##### Coefficients
- These do not connect back to the individual cells as you are assuming the two factors have independent effects
- Thus these are hypothetical *Grand mean* values 
- Intercept = *Grand Mean*, but assuming the two factors have independent effects
- Style.C.S1  = Normal (averaged over location) - *Grand Mean* (assuming the two factors are independent)
- Style.C.S2 = Exaggerated (averaged over location) - *Grand Mean* (assuming the two factors are independent)
- Location.C.S1 = Theatre (averaged over Style) -  *Grand Mean* (assuming the two factors are independent)

##### Means per Condition
- *Cannot get from this model*

##### Pvalues
- The pvalue on the intercept asks if the baseline (*Grand Mean*) is different from zero
- The pvalue on the slopes asks if each of the other levels is different from the baseline (*Grand Mean*)
- Note we are missing tests on Flat and Movie cell, we would need to rotate the matrix and run again

#### Interactions
##### Coefficients
- Intercept =  *Grand Mean* (all 6 cells mean averaged)
- Style.C.S1 = Normal (averaged over location) from *Grand Mean*
- Style.C.S2 = Exaggerated (averaged over location) from *Grand Mean*
- Location.C.S1 = Theatre (averaged over Style) from *Grand Mean*
- Style.C.S1:Location.C.S1 = difference of Normal @ Theatre from *Grand Mean*
- Style.C.S2:Location.C.S1 = difference of Exaggerated @ Theatre from *Grand Mean*
 
##### Means per Condition
- *Cannot get from this model*

##### Pvalues
- The pvalue on the intercept asks if the baseline (*Grand Mean*) is different from zero
- The pvalue on the differences asks if each of the other levels is from the (*Grand Mean*)
- Note we are missing tests on Flat and movie cell, we would need to rotate the matrix and run again


# Models with Interactions (1 Nominal and 1 Continuous Variable)
- We will change our experiment from above, we will add a likert scale which the actor rates how "good" they think the performance was [-3 terrible to 3 excellent]
- *Emotion rating ~ Quality Rating X location of the actor (i.e., Movie Set, Theatre)* 

```{r, echo=TRUE, warning=FALSE}
#Set up simulation
set.seed(42)
N <- 200
X <- runif(N,-3,3)
Z <- sample(rep(c(0,1),N/2),N,replace = FALSE)
# Our equation to create Y
Y <- 4*X +2*Z+8*X*Z+ 50 + rnorm(N, sd=10)
#Built our data frame
Emotion.Data.3<-data.frame(Emotion=Y,Quality=X,Location=Z)
```

## Center the continuous variable
- Best to center as its will help us interpret 

```{r, echo=TRUE, warning=FALSE}
# Convert all our factors
Emotion.Data.3$Quality.C <- scale(Emotion.Data.3$Quality, scale = FALSE)[,]
```

## Dummy coding of our nominal variable
- Make Location into a factor

```{r, echo=TRUE, warning=FALSE}
# Convert all our factors
Emotion.Data.3$LocationF <- factor(Emotion.Data.3$Location,
                                   level=c(0,1),
                                   labels=c("Movie Set","Theatre"))
```

- First to understand what the interaction will yield lets us run two regressions where we seperate our factor to see what the slopes would be just for actors at *Movie Set* vs at the *Theatre*

### Movie Set ONLY vs Theatre Only
- We will subset the data and test for the Quality slopes

```{r}
Model.Movie<-lm(Emotion ~ Quality.C, 
                data = subset(Emotion.Data.3,LocationF=="Movie Set"))
Model.Theatre<-lm(Emotion ~ Quality.C, 
                data = subset(Emotion.Data.3,LocationF=="Theatre"))
```

```{r, echo=FALSE,results='asis'}
library(stargazer)
stargazer(Model.Movie,Model.Theatre,type="html",
          column.labels = c("Movie Set ONLY", "Theatre Only"),
          intercept.bottom = FALSE,
          single.row=TRUE,  notes.append = FALSE,
          omit.stat=c("ser"),
          star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```

- Ignore the intercepts and pay attention to the slopes: 
    - Quality @ the *Movie Set ONLY* is $b$= `r round(Model.Movie$coefficients[2],3)`
        - The pvalues tells you if the slope of quality is differnet from a slope of 0   
    - Quality @ the *Theatre Only* ONLY is $b$= `r round(Model.Theatre$coefficients[2],3)`
        - The pvalues tells you if the slope of quality is differnet from a slope of 0         
- What we really want to test is if there is a difference in slopes between quality at *Movies sets* VS *Theatre*

### Testing the Interaction
- We will test a main effects model, `Quality+Location` and compare it to an interaction model `Quality*Location`

```{r}
Model.E3.1<-lm(Emotion ~ Quality.C+LocationF, data = Emotion.Data.3)
Model.E3.2<-lm(Emotion ~ Quality.C*LocationF, data = Emotion.Data.3)
```

- Does model interaction improve the model fit? (Just like we did with hierarchical testing)
```{r}
Model.Fit.E3<-anova(Model.E3.1,Model.E3.2)
```

```{r, ech=FALSE}
library(knitr)
kable(Model.Fit.E3)
```

- Yes! The interaction is significant improvement
- Let's go examine the individual regression paramaters:

### Interpret Regression

```{r, echo=FALSE,results='asis'}
stargazer(Model.E3.1,Model.E3.2,type="html",
          column.labels = c("Main Effects", "Interaction"),
          intercept.bottom = FALSE,
          single.row=TRUE,  notes.append = FALSE,
          omit.stat=c("ser"),
          star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```



#### Main Effects Model
- Intercept = Mean of Quality @ Movies [*p* = Is the intercept different from 0]
- Quality.C  = Quality.C slope averaged across *Movies* & * Threatre*   
    - This is a **Main Effect** [*p* = Is slope different from 0 slope] 
        - This would almost be like averaging the slopes from our *Movie Set ONLY* and *Theatre Only* Models
            - $b$ = (`r round(Model.Movie$coefficients[2],3)` +  `r round(Model.Theatre$coefficients[2],3)`)/2 = `r round(((Model.Movie$coefficients[2]+Model.Theatre$coefficients[2])/2),3)` 
- LocationFTheatre = Threatre difference from intercept [p = Is mean of *Theatre* different from *Movies* @ mean of quality] 
    - This is a **Simple Effect**

#### Interaction Model
- Intercept = Mean of Quality @ Movies [*p* = Is the intercept different from 0]
- Quality.C  = Quality.C slope @ Movies 
    - This is a **Simple Slope** [*p* = Is slope different from 0 slope] 
        - To prove it we can see it matches the slopes from our *Movie Set ONLY* Model, $b$= `r round(Model.Movie$coefficients[2],3)`
- LocationFTheatre = Threatre difference from intercept [*p* = Is mean of *Theatre* different from movies @ mean of quality]
    - This is a **Simple Effect**
- Quality.C:LocationFTheatre= slope difference between *Movie* and *Theatre* [p = Is the slope of quality @ Theatre different from slope at ]
    - This is an **Interaction**
        - To prove it we can see it matches the slopes of *Theatre Only* - *Movie Set ONLY* Model
            - $b$= (`r round(Model.Theatre$coefficients[2],3)` -  `r round(Model.Movie$coefficients[2],3)`) = `r round((Model.Theatre$coefficients[2]-Model.Movie$coefficients[2]),3)`
            
- If you wanted to know the simple slope of *Theatre Only* you would have to relevel Location and make Theatre = 0 (i.e., the reference)

```{r}
Emotion.Data.3$LocationT<- relevel(Emotion.Data.3$LocationF, ref = "Theatre")
Model.E3.2.ReLevel<-lm(Emotion ~ Quality.C*LocationT, data = Emotion.Data.3)
```
   
```{r, echo=FALSE,results='asis'}
stargazer(Model.E3.2.ReLevel,type="html",
          column.labels = c("Theatre is Ref"),
          intercept.bottom = FALSE,
          single.row=TRUE,  notes.append = FALSE,
          omit.stat=c("ser"),
          star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```   

- You will notice the interaction flipped direction and now the Quality.C coefficient $b$ = `r round(Model.E3.2.ReLevel$coefficients[2],3)` =  the *Theatre Only* model $b$ = `r round(Model.Theatre$coefficients[2],3)`           

### Plotting Interaction
- Often times you will need to use the effects package and plot by hand
- The rockchalk package will work for simple models like this one

```{r}
library(rockchalk)
plotSlopes(Model.E3.2, plotx = "Quality.C", modx = "LocationF")
```

## Deviation coding of our nominal variable
- We will still center quality
- This creates a more ANOVA like interpretation of the interaction 
```{r, echo=TRUE, warning=FALSE}
Emotion.Data.3$LocationD <- factor(Emotion.Data.3$Location,
                                   level=c(1,0),
                                   labels=c("Theatre","Movie Set"))
Model.E3.D1<-lm(Emotion ~ Quality.C+LocationD, data = Emotion.Data.3,
                 contrasts=list(LocationD=contr.sum(2)))
Model.E3.D2<-lm(Emotion ~ Quality.C*LocationD, data = Emotion.Data.3,
                 contrasts=list(LocationD=contr.sum(2)))
```

### Interpret Regression

```{r, echo=FALSE, results='asis'}
stargazer(Model.E3.D1,Model.E3.D2,type="html",
          column.labels = c("Main Effects", "Interaction"),
          intercept.bottom = FALSE,
          single.row=TRUE,notes.append = FALSE,  omit.stat=c("ser"),
          star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```


### Main Effects

#### Coefficients & Pvalues
- Intercept = Mean of Quality @ mean of Movie & Theatre (imaginary) [p = Is the intercept different from 0]
- Quality.C  = Quality.C slope [p = Is slope different from 0]
- LocationFTheatre = Threatre difference from 0 (not - 1) thus is 1/2 the slope of the dummy code [p = Is mean of Theatre different from movies @ mean of quality]

### Interactions
#### Coefficients & Pvalues 
- Intercept = Mean of Quality @ mean of Movie & Theatre (imaginary) [p = Is the intercept different from 0]
- Quality.C  = Quality.C slope @ mean of Movie & Theatre (imaginary) [p = Is the Main effect of quality @ mean of Movie & Theatre]
- LocationFTheatre = Threatre difference from intercept [*p* = Is the Main effect of Location @ mean of quality]
- Quality.C:LocationFTheatre= Quality.C slope difference between Movie and theatre [*p* = Is the interaction]
- Note all coefficients are 1/2 of the size they should be! 
 
 
## Simple coding of our nominal variable
- We will still center quality
- This creates a more ANOVA-like interpretation of the interaction,
    - Most importantly it does not screw up our coefficients as they match the dummy code values (which make the most sense)
- This time I will hand code it into a factor (easy with only 2 levels)

```{r}
Emotion.Data.3$LocationS<-as.numeric(Emotion.Data.3$Location)-.5
Model.E3.S1<-lm(Emotion ~ Quality.C+LocationS, data = Emotion.Data.3)
Model.E3.S2<-lm(Emotion ~ Quality.C*LocationS, data = Emotion.Data.3)
```

### Interpret Regression

```{r, echo=FALSE,results='asis'}
stargazer(Model.E3.S1,Model.E3.S2,type="html",
          column.labels = c("Main Effects", "Interaction"),
          intercept.bottom = FALSE,
          single.row=TRUE, 
          notes.append = FALSE,
          omit.stat=c("ser"),
          star.cutoffs = c(0.05, 0.01, 0.001),
          header=FALSE)
```


### Main Effects

#### Coefficients & Pvalues
- Intercept = Mean of Quality @ mean of Movie & Theatre (imaginary thing) [p = Is the intercept different from 0]
- Quality.C  = Quality.C slope [p = Is slope different from 0]
- LocationFTheatre = Threatre difference from 0 [p = Is mean of Theatre different from movies @ mean of quality]

### Interactions

#### Coefficients & Pvalues 
- Intercept = Mean of Quality @ mean of Movie & Theatre (imaginary thing) [p = Is the intercept different from 0]
- Quality.C  = Quality.C slope @ mean of Movie & Theatre (imaginary thing) [p = Is the Main effect of quality @ mean of Movie & Theatre]
- LocationS = Threatre difference from intercept [p = Is the Main effect of Location @ mean of quality]
- Quality.C:LocationS= Quality.C slope difference between Movie and theatre [p = Is their an interaction]
- Note all coefficients are their proper size now (to match true differences)
- Best option is often dummy or simple, deviation here is weird 

