Longitudinal Designs
- Longitudinal generally means you measure a person over time. Thus,
the person is the level 2 (the cluster) and Time is the level 1.
- At least 3 measurements for that person: Longitudinal can be defined
in seconds, minutes, days, weeks, years, or decades.
- Time is repeated measure on the subject, but you are interested in
charting how they change over time
Mixed Model framework for Longitudinal
- Mixed models have some major advantages over RM ANOVA approach to
dealing with time
- Intercept differences: do different treatments have different
baselines
- Slope differences between treatments
- Growth curves: Are slopes linear or curvilinear
- 3a) Each subject can have a different slope and slope shape
- Time measurements do not need to be the same for each subject:
Subject 1: Month 1,2,4,6, Subject 2: Month 2,3,5,7
- 4a) This because you can have missing data in your measurements:
in RM ANOVA this would mean you lose the whole subject
- You test for discontinuities in time: For example, treatment changes
during the measurements (think elbow shaped slope)
- Subjects have 3 baselines: Month 1, 2, 3 and at Month 4 they start
treatment, and you have two more measurements. Thus you want to see if
the slope during no treatment for that subject changes after
treatment
- We can measure Time Fixed Covariates: Covariate
does not change over the measurement period (DV = Depression score at
each treatment, Covariate = Trait Anxiety level)
- We can measure Time Variable Covariates: Covariate
that change over the measurement period (DV = Depression score at each
treatment, Covariate = State Anxiety level during a therapy
session)
- Interact Time Fixed and Time Variable covariates
- You can control the covariance structure and use autoregressive
structures (but not in lme4)
- Deal with nested forms of Time: For example, month to month change
in Anxiety levels, but you can also examine hour to hour Anxiety levels
(nested in the month they were collected)
Simplest Type of Longitudinal Model
Children with ASD typically have difficulty with motor skill and poor
interpersonal coordination. Their reduced ability to coordinate with
people maybe be causing them to be left out of play with other children
isolating them. Past efforts to improve their interpersonal coordination
have often failed because children with ASD are overwhelmed by people.
So training sessions often fail. However, children with ASD gravitate
towards electronics and have been observed to be interested and
comfortable with robots. Srinivasan et al., (2015) hypothesized that
children with ASD might be better at coordinating with robots because a)
children with ASD love robots, and b) robots are less threatening as
they convey less information (not overwhelming the child). Srinivasan et
al., (2015) found that having children “dance” with dancing robots
improved their interpersonal coordination ability.
Today we will examine a simulated study of interpersonal coordination
on 12 children with ASD. Each child underwent 6 training sessions
(including the baseline where they had an ASD assessment to see where
they were on the spectrum; Chlebowski et al. 2010). The children were
asked to “dance” with an instructor and their parent in each of the 6
sessions. In the last part of the session their degree of coordination
(from 0 - 1) with the new instructor who was in the room during the
session, but did not dance. For half the children the instructor was
human, and for the other half, the instructor was a robot.
Data
- As with all our other analysis, the data is in long format (for
Longitudinal you might see it called period-period data)
- Time is coded to start at 0 (baseline or first measurement).
- Instructor (0,1): 0: Human; 1: Robot
- Coordination (0-1): Degree of coordination (DV)
- ASD: ASD spectrum score (possible covariate)
- Download data
LongSim<-read.csv("Mixed/LongData1.csv")
Subject
|
Time
|
Instructor
|
Coordination
|
ASD
|
1
|
0
|
0
|
0.2220880
|
26.07486
|
1
|
1
|
0
|
0.2780595
|
26.07486
|
1
|
2
|
0
|
0.2701741
|
26.07486
|
1
|
3
|
0
|
0.1431913
|
26.07486
|
1
|
4
|
0
|
0.4037957
|
26.07486
|
1
|
5
|
0
|
0.3261772
|
26.07486
|
Coding
- Decisions have to be made about how to code Instructor
- Dummy or effects code it? For today lets just dummy code our
treatment
#Dummy
LongSim$Instructor.D<-factor(LongSim$Instructor,
levels = c(0,1),
labels = c("Human","Robot"))
- Decisions have to be made about how to code Time. Each
again will change how we interpret our model. We will code it out now
for later:
- We can leave Time as is: it starts at 0 and progresses in
the units of measurements
- Center time Time: So there are 6 sessions (0-5), thus
center is 2.5: So Session # - 2.5
- (Note: yes you can Zscore time or Zscore and NOT center as
well)
# Centered
LongSim$Time.C<-scale(LongSim$Time, scale=F, center=T)[,]
Spaghetti Plots
- Longitudinal Models are best assessed with spaghetti plots. When you
have a small number of subjects, you can plot it as “facets.”
- We will plot each person in a facet and add their linear model for
each subject
# Make sure that subject is a factor
LongSim$Subject<-as.factor(LongSim$Subject)
Speg.1<-ggplot(data = LongSim, aes(x=Time,y=Coordination))+
facet_wrap(~Subject)+
geom_point(aes(color=Instructor.D))+
geom_smooth(method='lm', se=FALSE,aes(color=Instructor.D))+
xlab("Time")+ylab("Coordination")+
theme_bw()+
theme(legend.position = "top")
Speg.1
- If you have too many subjects, you can plot the results like this
instead
- On the left, you can fit a regression per subject, on the right you
plot lines instead of the slope
theme_set(theme_bw(base_size = 7, base_family = ""))
Speg.2<-ggplot(data = LongSim, aes(x=Time,y=Coordination, group=Subject, color=Subject))+
facet_wrap(~Instructor.D)+
geom_point()+
geom_smooth(method='lm', se=FALSE)+
xlab("Time")+ylab("Coordination")+
theme(legend.position = "none")
Speg.2
Speg.3<-ggplot(data = LongSim, aes(x=Time,y=Coordination, group=Subject, color=Subject))+
facet_wrap(~Instructor.D)+
geom_point()+
geom_line()+
xlab("Time")+ylab("Coordination")+
theme(legend.position = "none")
Speg.3
Modeling to Examine Random Structure
- The subject is the Level 2 variable (just like in our repeated
measures mixed models)
- We can show that via our ICC analysis
Intercepts only model
Null<-lmer(Coordination ~ 1+(1|Subject), data=LongSim, REML=FALSE)
sjstats::icc(Null)
## # Intraclass Correlation Coefficient
##
## Adjusted ICC: 0.420
## Unadjusted ICC: 0.420
Intercepts only plot
- Predict out the model results
- Dots = Real data
- Line = Predicted result
theme_set(theme_bw(base_size = 7, base_family = ""))
LongSim$Null<-predict(Null, newdata=LongSim)
Speg.Null.1<-ggplot(data = LongSim)+
facet_wrap(~Subject)+
geom_point(aes(x=Time,y=Coordination,color=Instructor.D))+
geom_line(aes(x=Time,y=Null, color=Instructor.D))+
xlab("Time")+ylab("Coordination")+
theme(legend.position = "top")
Speg.Null.1
Speg.Null.2<-ggplot(data = LongSim)+
facet_wrap(~Instructor.D)+
geom_point(aes(x=Time,y=Coordination, group=Subject,color=Subject))+
geom_line(aes(x=Time,y=Null,group=Subject,color=Subject))+
xlab("Time")+ylab("Coordination")+
theme(legend.position = "none")
Speg.Null.2
- Notice that each subject has their own intercept, but no slope
Random Intercepts and Slopes Model
- We will let the intercept and Time vary relative to each subject:
(1+Time|Subject)
- Note you can block the correlation between the slope/intercept like
before if the correlation is -1/1:
(1+Time||Subject)
RISlope<-lmer(Coordination ~ 1+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(RISlope)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ 1 + (1 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -102.0 -90.6 56.0 -112.0 67
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.21989 -0.52703 0.01245 0.55970 2.00199
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## Subject (Intercept) 0.004804 0.06931
## Time 0.004751 0.06893 0.42
## Residual 0.005488 0.07408
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.26005 0.02481 12.00035 10.48 2.15e-07 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Random Intercept and Slopes Plot
- Predict out the model results
- Dots = Real data
- Line = Predicted result
theme_set(theme_bw(base_size = 7, base_family = ""))
LongSim$RISlope<-predict(RISlope, newdata=LongSim)
Speg.R.I.Slope.1<-ggplot(data = LongSim)+
facet_wrap(~Subject)+
geom_point(aes(x=Time,y=Coordination, color=Instructor.D))+
geom_line(aes(x=Time,y=RISlope, color=Instructor.D))+
xlab("Time")+ylab("Coordination")+
theme(legend.position = "top")
Speg.R.I.Slope.1
Speg.R.I.Slope.2<-ggplot(data = LongSim)+
facet_wrap(~Instructor.D)+
geom_point(aes(x=Time,y=Coordination, group=Subject,color=Subject))+
geom_line(aes(x=Time,y=RISlope,group=Subject,color=Subject))+
xlab("Time")+ylab("Coordination")+
theme(legend.position = "none")
Speg.R.I.Slope.2
Random Slopes Only Model
- It could be there is no random intercept, so you can remove it (but
you need to have a reason)
- We will let Time vary relative to each subject:
(0+Time|Subject)
RSlope<-lmer(Coordination ~ 1+(0+Time|Subject), data=LongSim, REML=FALSE)
summary(RSlope)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ 1 + (0 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -99.3 -92.5 52.6 -105.3 69
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.54802 -0.53178 -0.00811 0.59543 2.92944
##
## Random effects:
## Groups Name Variance Std.Dev.
## Subject Time 0.005376 0.07332
## Residual 0.007289 0.08538
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.28463 0.01739 65.05840 16.36 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Random Slopes Only Plot
- Predict out the model results
- Dots = Real data
- Line = Predicted result
theme_set(theme_bw(base_size = 7, base_family = ""))
LongSim$RSlope<-predict(RSlope, newdata=LongSim)
Speg.R.Slope.1<-ggplot(data = LongSim)+
facet_wrap(Instructor.D~Subject)+
geom_point(aes(x=Time,y=Coordination, color=Instructor.D))+
geom_line(aes(x=Time,y=RSlope, color=Instructor.D))+
xlab("Time")+ylab("Coordination")+
theme(legend.position = "top")
Speg.R.Slope.1
Speg.R.Slope.2<-ggplot(data = LongSim)+
facet_wrap(~Instructor.D)+
geom_point(aes(x=Time,y=Coordination, group=Subject,color=Subject))+
geom_line(aes(x=Time,y=RSlope,group=Subject,color=Subject))+
xlab("Time")+ylab("Coordination")+
theme(legend.position = "none")
Speg.R.Slope.2
- Note: The intercepts for each subject are the same.
- You can also guess that this model will not be a good fit as the
last one, which we can test:
anova(RSlope,RISlope)
## Data: LongSim
## Models:
## RSlope: Coordination ~ 1 + (0 + Time | Subject)
## RISlope: Coordination ~ 1 + (1 + Time | Subject)
## npar AIC BIC logLik deviance Chisq Df Pr(>Chisq)
## RSlope 3 -99.287 -92.457 52.643 -105.29
## RISlope 5 -102.000 -90.617 56.000 -112.00 6.7134 2 0.03485 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Modeling for Results
- The goal above was to understand the random effects. This is useful
if you want to understand your data, but when you move to modeling for
publication you will want to set your random effects and move
to fixed effect testing
- The most obvious structure for most longitudinal (2 level) data is
to use random slopes and intercepts (watch that correlation between
them)
- We will now also explore the different coding schemes to understand
the models
Non-Centered Time & Dummy coded Instructor
- Note: because Instructor is 0/1 coded you can also enter that into
the model instead of it as a factor (you will get the same answer)
Main effects
MainEffect.1<-lmer(Coordination ~ Time+Instructor.D+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(MainEffect.1, correlation=FALSE)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ Time + Instructor.D + (1 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -113.8 -97.9 63.9 -127.8 65
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.15774 -0.56020 0.04558 0.61775 1.91569
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## Subject (Intercept) 0.003360 0.05796
## Time 0.001327 0.03642 0.15
## Residual 0.005488 0.07408
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.23232 0.03206 12.08500 7.247 9.8e-06 ***
## Time 0.05852 0.01169 11.99995 5.005 0.000307 ***
## Instructor.DRobot 0.08358 0.04509 11.99994 1.854 0.088498 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Interpretation
- Time: After each session, the kids get are better
at coordinating regardless of Instruction (the estimate is the
slope over sessions)
- Instructor.DRobot: t < 1.96 (so not
significant). Had it been significant it would mean that kids who had
the robot instructor had overall higher coordination [averaged
over all session] (the estimate reported is the difference
between human and robot)
Interaction
Interaction.1<-lmer(Coordination ~ Time*Instructor.D+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Interaction.1, correlation=F)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ Time * Instructor.D + (1 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -115.5 -97.3 65.7 -131.5 64
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.17344 -0.56178 0.04976 0.64212 2.08058
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## Subject (Intercept) 0.0033241 0.05766
## Time 0.0008965 0.02994 0.25
## Residual 0.0054877 0.07408
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.23829 0.03214 12.00007 7.414 8.12e-06 ***
## Time 0.03777 0.01420 11.99972 2.660 0.0208 *
## Instructor.DRobot 0.07163 0.04546 12.00007 1.576 0.1411
## Time:Instructor.DRobot 0.04148 0.02008 11.99972 2.065 0.0612 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Interpretation
- Time: slope over sessions for kids who had the
human instructor.
- Instructor.DRobot: It is the difference between
human and robot when Time = 0 [Baseline].
- To prove this we can fit the model and subtract the fitted values:
Robot @ Time 0 - Human @ Time 0
library(effects)
effect(c("Time*Instructor.D"),Interaction.1)
Time
|
Instructor.D
|
fit
|
se
|
lower
|
upper
|
0
|
Human
|
0.2383
|
0.0321
|
0.1742
|
0.3024
|
1
|
Human
|
0.2761
|
0.0334
|
0.2093
|
0.3428
|
2
|
Human
|
0.3138
|
0.0401
|
0.2338
|
0.3938
|
4
|
Human
|
0.3894
|
0.0616
|
0.2665
|
0.5123
|
5
|
Human
|
0.4272
|
0.0741
|
0.2793
|
0.5751
|
0
|
Robot
|
0.3099
|
0.0321
|
0.2458
|
0.3741
|
1
|
Robot
|
0.3892
|
0.0334
|
0.3224
|
0.4559
|
2
|
Robot
|
0.4684
|
0.0401
|
0.3884
|
0.5484
|
4
|
Robot
|
0.6269
|
0.0616
|
0.5040
|
0.7499
|
5
|
Robot
|
0.7062
|
0.0741
|
0.5583
|
0.8541
|
- Estimate of **Instructor.DRobot**: 0.3099211 - 0.2382928 = 0.0716283
- Note: *You are missing the simple effect of Human @ Time 0*. Relevel instructor to test if you need to know.
- Time:Instructor.DRobot: slope over sessions
difference = slope @ robot instructor - slope @ human
instructor
Centered Time & Dummy coded Instructor
- Note: Because Instructor is 0/1 coded you can also enter that into
the model instead of it as factor (you will get the same answer)
Main effects
MainEffect.2<-lmer(Coordination ~ Time.C+Instructor.D+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(MainEffect.2, correlation=F)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ Time.C + Instructor.D + (1 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -113.8 -97.9 63.9 -127.8 65
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.15774 -0.56020 0.04558 0.61775 1.91569
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## Subject (Intercept) 0.003360 0.05796
## Time 0.001327 0.03642 0.15
## Residual 0.005488 0.07408
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.37861 0.04105 14.18874 9.223 2.26e-07 ***
## Time.C 0.05852 0.01169 11.99995 5.005 0.000307 ***
## Instructor.DRobot 0.08358 0.04509 11.99994 1.854 0.088498 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Interaction
- Now things will change because the zero-point in time is now the
middle of the session (not the baseline)
Interaction.2<-lmer(Coordination ~ Time.C*Instructor.D+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Interaction.2, correlation=F)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ Time.C * Instructor.D + (1 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -115.5 -97.3 65.7 -131.5 64
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.17344 -0.56178 0.04976 0.64212 2.08058
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## Subject (Intercept) 0.0033241 0.05766
## Time 0.0008965 0.02994 0.25
## Residual 0.0054877 0.07408
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.33273 0.04476 11.99999 7.434 7.9e-06 ***
## Time.C 0.03777 0.01420 11.99972 2.660 0.0208 *
## Instructor.DRobot 0.17533 0.06329 11.99999 2.770 0.0170 *
## Time.C:Instructor.DRobot 0.04148 0.02008 11.99972 2.065 0.0612 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Interpretation
- Time: slope for kids who had the human
instructor.
- Instructor.DRobot: It is the difference between
human and robot when Time = 0. But remember Time 0 equals
the Middle Session!
- To prove this, we can fit the model and subtract the fitted values:
Robot @ Time 0 - Human @ Time 0 [Middle
session]
effect(c("Time.C*Instructor.D"),Interaction.2)
Time.C
|
Instructor.D
|
fit
|
se
|
lower
|
upper
|
-2
|
Human
|
0.2572
|
0.0320
|
0.1933
|
0.3211
|
-1
|
Human
|
0.2950
|
0.0362
|
0.2227
|
0.3672
|
0
|
Human
|
0.3327
|
0.0448
|
0.2434
|
0.4220
|
1
|
Human
|
0.3705
|
0.0556
|
0.2595
|
0.4816
|
2
|
Human
|
0.4083
|
0.0678
|
0.2730
|
0.5435
|
-2
|
Robot
|
0.3495
|
0.0320
|
0.2857
|
0.4134
|
-1
|
Robot
|
0.4288
|
0.0362
|
0.3565
|
0.5011
|
0
|
Robot
|
0.5081
|
0.0448
|
0.4188
|
0.5974
|
1
|
Robot
|
0.5873
|
0.0556
|
0.4763
|
0.6984
|
2
|
Robot
|
0.6666
|
0.0678
|
0.5313
|
0.8018
|
- Estimate of **Instructor.DRobot**: 0.5080641 - 0.3327298 = 0.1753344
- Time:Instructor.DRobot: slope over sessions
difference = slope @ robot instructor - slope @ human
instructor
Summary
- To center time or not center time?
- This all depends on what story you need to tell
- In the interaction model:
- if you don’t center time, you are talking about the baseline. In
this case, it seems very useful to be able to say the kids with ASD
started out at the same level of coordination and over time their slopes
changed
- if you center time you are talking about the middle session, I find
this less useful for the question at hand, but keep it in mind that you
can examine it this way
Plot our Final Model
- We can use the effects package and ggplot to plot the fitted
model
library(effects)
Fitted.I1<-effect(c("Time*Instructor.D"),Interaction.1)
Fitted.I1<-as.data.frame(Fitted.I1)
Fitted.Plot<-ggplot(data = Fitted.I1, aes(x=Time,y=fit, group=Instructor.D))+
geom_line(aes(color=Instructor.D))+
geom_ribbon(aes(ymin=lower,ymax=upper,fill=Instructor.D),alpha=.2)+
xlab("Session")+ylab("Coordination")+
scale_y_continuous(lim=c(0,1),breaks = seq(0, 1,.2)) +
theme_bw()+
theme(legend.position = c(.85,.875),
legend.title = element_blank(),
panel.grid.major = element_line(linetype = "blank"),
panel.grid.minor = element_line(linetype = "blank"))
Fitted.Plot
Fixed Covariates
- We had a covariate: each kids autism spectrum score.
- We can simply add it to our model if we want to control for it, but
you really should center it so you intercept does not change.
LongSim$ASD.C<-scale(LongSim$ASD, scale=F, center=T)[,]
- You can add it to your model just as any regression
Cov1.Model<-lmer(Coordination ~ Time*Instructor.D+ASD.C
+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Cov1.Model, correlation=F)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ Time * Instructor.D + ASD.C + (1 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -125.0 -104.5 71.5 -143.0 63
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.13957 -0.60777 0.05259 0.66668 1.77972
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## Subject (Intercept) 0.001085 0.03294
## Time 0.001052 0.03244 1.00
## Residual 0.004844 0.06960
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.2739315 0.0261886 18.4814776 10.460 3.39e-09 ***
## Time 0.0377748 0.0148838 12.3504929 2.538 0.025558 *
## Instructor.DRobot 0.0003509 0.0391905 19.1578812 0.009 0.992948
## ASD.C -0.0657266 0.0167110 25.5959845 -3.933 0.000569 ***
## Time:Instructor.DRobot 0.0414824 0.0210488 12.3504929 1.971 0.071583 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## optimizer (nloptwrap) convergence code: 0 (OK)
## boundary (singular) fit: see help('isSingular')
Note: the correlation between the intercept and slope = 1, so
we should block that moving forward and we would go back and block it in
prior models to they are all have the same random structure. For now we
will ignore it and move on
anova(Interaction.1,Cov1.Model)
## Data: LongSim
## Models:
## Interaction.1: Coordination ~ Time * Instructor.D + (1 + Time | Subject)
## Cov1.Model: Coordination ~ Time * Instructor.D + ASD.C + (1 + Time | Subject)
## npar AIC BIC logLik deviance Chisq Df Pr(>Chisq)
## Interaction.1 8 -115.48 -97.265 65.739 -131.48
## Cov1.Model 9 -124.95 -104.462 71.476 -142.95 11.474 1 0.0007059 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
- Our slope is no longer significant, meaning the covariate might be
explaining some of our interaction at the mean level of ASD score. This
is suggestive of 3-way interaction.
Cov2.Model<-lmer(Coordination ~ Time*Instructor.D*ASD.C
+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Cov2.Model, correlation=F)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ Time * Instructor.D * ASD.C + (1 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -127.7 -100.4 75.9 -151.7 60
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.25728 -0.53957 -0.00054 0.73048 1.68226
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## Subject (Intercept) 0.0005812 0.02411
## Time 0.0004871 0.02207 1.00
## Residual 0.0047984 0.06927
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.290973 0.027451 25.063026 10.600 9.49e-11
## Time 0.042820 0.013615 13.221798 3.145 0.00761
## Instructor.DRobot -0.005657 0.037056 25.063026 -0.153 0.87989
## ASD.C -0.097154 0.028438 25.063026 -3.416 0.00217
## Time:Instructor.DRobot 0.051314 0.018379 13.221798 2.792 0.01506
## Time:ASD.C -0.009305 0.014105 13.221798 -0.660 0.52075
## Instructor.DRobot:ASD.C 0.051776 0.034086 25.063026 1.519 0.14128
## Time:Instructor.DRobot:ASD.C 0.036742 0.016906 13.221798 2.173 0.04849
##
## (Intercept) ***
## Time **
## Instructor.DRobot
## ASD.C **
## Time:Instructor.DRobot *
## Time:ASD.C
## Instructor.DRobot:ASD.C
## Time:Instructor.DRobot:ASD.C *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## optimizer (nloptwrap) convergence code: 0 (OK)
## boundary (singular) fit: see help('isSingular')
anova(Cov1.Model,Cov2.Model)
## Data: LongSim
## Models:
## Cov1.Model: Coordination ~ Time * Instructor.D + ASD.C + (1 + Time | Subject)
## Cov2.Model: Coordination ~ Time * Instructor.D * ASD.C + (1 + Time | Subject)
## npar AIC BIC logLik deviance Chisq Df Pr(>Chisq)
## Cov1.Model 9 -124.95 -104.46 71.476 -142.95
## Cov2.Model 12 -127.74 -100.42 75.872 -151.74 8.7914 3 0.0322 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
- The three way improved the model fit.
- Time:Instructor.DRobot: is the slope over sessions
difference = slope @ robot instructor - slope @ human
instructor [@ at the average ASD level]
- Time:Instructor.DRobot:ASD.C: is the slope over
sessions difference = slope @ robot instructor - slope @
human instructor [as a function of ASD score]
- So the slope difference between robot and human instructor is
getting larger as ASD level increases. See the plot below to understand
this:
Plot 3-Way
- We will plot it like any other interaction in a regression involving
two continuous variables
- We will need to find 1 SD above/below the mean of the covariate
(which was mean centered)
SDN1<- -sd(LongSim$ASD.C)
SDP1<- +sd(LongSim$ASD.C)
Fitted.Cov2<-effect(c("Time*Instructor.D*ASD.C"),Cov2.Model,
xlevel=list(Time=seq(0,5,1),
ASD.C=c(SDN1,0,SDP1)))
Fitted.Cov2<-as.data.frame(Fitted.Cov2)
Fitted.Cov2$ASD<-factor(Fitted.Cov2$ASD.C,
levels=c(SDN1,0,SDP1),
labels=c("-1 SD ASD","Mean ASD","+1 SD ASD"))
Fitted.Plot.2<-ggplot(data = Fitted.Cov2, aes(x=Time,y=fit, group=Instructor.D))+
facet_grid(~ASD)+
geom_line(aes(color=Instructor.D))+
geom_ribbon(aes(ymin=lower,ymax=upper,fill=Instructor.D),alpha=.2)+
xlab("Session")+ylab("Coordination")+
scale_y_continuous(lim=c(0,1),breaks = seq(0, 1,.2)) +
theme_bw()+
theme(legend.position = c(.15,.85),
legend.title = element_blank(),
panel.grid.major = element_line(linetype = "blank"),
panel.grid.minor = element_line(linetype = "blank"))
Fitted.Plot.2
Test of Simple Interactions
- If you want to know if the interaction is significant at each level
of ASD score, you simply have to re-center the ASD. We already know the
middle plot was significant. We are going to watch the
Time:Instructor.DRobot terms as we change the center
point at ASD.C
Test -1 SD of ASD
- We move the center to be -1 SD by adding 1 SD
LongSim$ASD.N1sd<-LongSim$ASD.C+sd(LongSim$ASD.C)
Cov2.Model.Simple.N1sd<-lmer(Coordination ~ Time*Instructor.D*ASD.N1sd
+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Cov2.Model.Simple.N1sd, correlation=F)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ Time * Instructor.D * ASD.N1sd + (1 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -127.7 -100.4 75.9 -151.7 60
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.25728 -0.53957 -0.00054 0.73048 1.68226
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## Subject (Intercept) 0.0005812 0.02411
## Time 0.0004871 0.02207 1.00
## Residual 0.0047984 0.06927
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.404358 0.053652 25.063026 7.537 6.74e-08
## Time 0.053680 0.026610 13.221798 2.017 0.06444
## Instructor.DRobot -0.066083 0.059432 25.063026 -1.112 0.27673
## ASD.N1sd -0.097154 0.028438 25.063026 -3.416 0.00217
## Time:Instructor.DRobot 0.008433 0.029477 13.221798 0.286 0.77924
## Time:ASD.N1sd -0.009305 0.014105 13.221798 -0.660 0.52075
## Instructor.DRobot:ASD.N1sd 0.051776 0.034086 25.063026 1.519 0.14128
## Time:Instructor.DRobot:ASD.N1sd 0.036742 0.016906 13.221798 2.173 0.04849
##
## (Intercept) ***
## Time .
## Instructor.DRobot
## ASD.N1sd **
## Time:Instructor.DRobot
## Time:ASD.N1sd
## Instructor.DRobot:ASD.N1sd
## Time:Instructor.DRobot:ASD.N1sd *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## optimizer (nloptwrap) convergence code: 0 (OK)
## boundary (singular) fit: see help('isSingular')
- Time:Instructor.DRobot is not significant. So the
left panel is our graph is not an interaction
Test +1 SD of ASD
- We move the center to be +1 SD by subtracting 1 SD
LongSim$ASD.P1sd<-LongSim$ASD.C-sd(LongSim$ASD.C)
Cov2.Model.Simple.P1sd<-lmer(Coordination ~ Time*Instructor.D*ASD.P1sd
+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Cov2.Model.Simple.P1sd, correlation=F)
## Linear mixed model fit by maximum likelihood . t-tests use Satterthwaite's
## method [lmerModLmerTest]
## Formula: Coordination ~ Time * Instructor.D * ASD.P1sd + (1 + Time | Subject)
## Data: LongSim
##
## AIC BIC logLik deviance df.resid
## -127.7 -100.4 75.9 -151.7 60
##
## Scaled residuals:
## Min 1Q Median 3Q Max
## -2.25728 -0.53957 -0.00054 0.73048 1.68226
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## Subject (Intercept) 0.0005812 0.02411
## Time 0.0004871 0.02207 1.00
## Residual 0.0047984 0.06927
## Number of obs: 72, groups: Subject, 12
##
## Fixed effects:
## Estimate Std. Error df t value Pr(>|t|)
## (Intercept) 0.177587 0.028836 25.063027 6.159 1.92e-06
## Time 0.031961 0.014302 13.221798 2.235 0.04330
## Instructor.DRobot 0.054770 0.048776 25.063027 1.123 0.27213
## ASD.P1sd -0.097154 0.028438 25.063027 -3.416 0.00217
## Time:Instructor.DRobot 0.094195 0.024192 13.221798 3.894 0.00179
## Time:ASD.P1sd -0.009305 0.014105 13.221798 -0.660 0.52075
## Instructor.DRobot:ASD.P1sd 0.051776 0.034086 25.063027 1.519 0.14128
## Time:Instructor.DRobot:ASD.P1sd 0.036742 0.016906 13.221798 2.173 0.04849
##
## (Intercept) ***
## Time *
## Instructor.DRobot
## ASD.P1sd **
## Time:Instructor.DRobot **
## Time:ASD.P1sd
## Instructor.DRobot:ASD.P1sd
## Time:Instructor.DRobot:ASD.P1sd *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## optimizer (nloptwrap) convergence code: 0 (OK)
## boundary (singular) fit: see help('isSingular')
- Time:Instructor.DRobot is significant (as it has to
be). So the right panel in our graph is an interaction.
- In summary, our two way interaction changes as function of ASD
score
References
Chlebowski, C., Green, J. A., Barton, M. L., & Fein, D. (2010).
Using the childhood autism rating scale to diagnose autism spectrum
disorders. Journal of autism and developmental disorders,
40(7), 787-799.
Srinivasan, S. M., Kaur, M., Park, I. K., Gifford, T. D., Marsh, K.
L., & Bhat, A. N. (2015). The effects of rhythm and robotic
interventions on the imitation/praxis, interpersonal synchrony, and
motor performance of children with autism spectrum disorder (ASD): a
pilot randomized controlled trial. Autism research and
treatment.
---
title: 'Longitudinal'
output:
  html_document:
    code_download: yes
    fontsize: 8pt
    highlight: textmate
    number_sections: no
    theme: flatly
    toc: yes
    toc_float:
      collapsed: no
---


```{r setup, include=FALSE}
knitr::opts_chunk$set(cache = TRUE)
knitr::opts_chunk$set(echo = TRUE)
knitr::opts_chunk$set(message = FALSE)
knitr::opts_chunk$set(warning =  FALSE)
knitr::opts_chunk$set(fig.width=4.25)
knitr::opts_chunk$set(fig.height=4.0)
knitr::opts_chunk$set(fig.align='center') 
knitr::opts_chunk$set(results='hold') 
```

```{r, echo=FALSE, warning=FALSE}
library(ggplot2)
library(lme4)
```

# Longitudinal Designs
- Longitudinal generally means you measure a person over time. Thus, the person is the level 2 (the cluster) and Time is the level 1.
    - At least 3 measurements for that person: Longitudinal can be defined in seconds, minutes, days, weeks, years, or decades.
    - Time is repeated measure on the subject, but you are interested in charting how they change over time
    
# Mixed Model framework for Longitudinal
- Mixed models have some major advantages over RM ANOVA approach to dealing with time
    - 1) Intercept differences: do different treatments have different baselines 
    - 2) Slope differences between treatments
    - 3) Growth curves: Are slopes linear or curvilinear
        - 3a) Each subject can have a different slope and slope shape 
    - 4) Time measurements do not need to be the same for each subject: Subject 1: Month 1,2,4,6, Subject 2: Month 2,3,5,7
        - 4a) *This because you can have missing data in your measurements: in RM ANOVA this would mean you lose the whole subject* 
    - 5) You test for discontinuities in time: For example, treatment changes during the measurements (think elbow shaped slope) 
        - Subjects have 3 baselines: Month 1, 2, 3 and at Month 4 they start treatment, and you have two more measurements. Thus you want to see if the slope during no treatment for that subject changes after treatment         
    - 6) We can measure **Time Fixed Covariates**: Covariate does not change over the measurement period (DV = Depression score at each treatment, Covariate = Trait Anxiety level)
    - 7) We can measure **Time Variable Covariates**: Covariate that change over the measurement period (DV = Depression score at each treatment, Covariate = State Anxiety level during a therapy session)
    - 8) Interact Time Fixed and Time Variable covariates
    - 9) You can control the covariance structure and use autoregressive structures (but not in lme4)
    - 10) Deal with nested forms of Time: For example, month to month change in Anxiety levels, but you can also examine hour to hour Anxiety levels (nested in the month they were collected)  

# Simplest Type of Longitudinal Model
Children with ASD typically have difficulty with motor skill and poor interpersonal coordination. Their reduced ability to coordinate with people maybe be causing them to be left out of play with other children isolating them. Past efforts to improve their interpersonal coordination have often failed because children with ASD are overwhelmed by people. So training sessions often fail. However, children with ASD gravitate towards electronics and have been observed to be interested and comfortable with robots. Srinivasan et al., (2015) hypothesized that children with ASD might be better at coordinating with robots because a) children with ASD love robots, and b) robots are less threatening as they convey less information (not overwhelming the child).  Srinivasan et al., (2015) found that having children "dance" with dancing robots improved their interpersonal coordination ability.  

Today we will examine a simulated study of interpersonal coordination on 12 children with ASD. Each child underwent 6 training sessions (including the baseline where they had an ASD assessment to see where they were on the spectrum; Chlebowski et al. 2010). The children were asked to "dance" with an instructor and their parent in each of the 6 sessions. In the last part of the session their degree of coordination (from 0 - 1) with the new instructor who was in the room during the session, but did not dance. For half the children the instructor was human, and for the other half, the instructor was a robot.

## Data
- As with all our other analysis, the data is in long format (for Longitudinal you might see it called period-period data)
    - Time is coded to start at 0 (baseline or first measurement). 
    - Instructor (0,1): 0: Human;  1: Robot 
    - Coordination (0-1): Degree of coordination (DV)
    - ASD: ASD spectrum score (possible covariate)
    - [Download data](/Mixed/LongData1.csv)
    
```{r}
LongSim<-read.csv("Mixed/LongData1.csv")
```

```{r, echo=FALSE}
library(knitr); library(kableExtra) 
kable(head(LongSim), "html", booktabs = T) %>%
  kable_styling(bootstrap_options = "striped", full_width = F)
```

### Coding
- Decisions have to be made about how to code *Instructor*
    - Dummy or effects code it? For today lets just dummy code our treatment
    
```{r}
#Dummy
LongSim$Instructor.D<-factor(LongSim$Instructor,
                             levels = c(0,1),
                             labels = c("Human","Robot"))
```

- Decisions have to be made about how to code *Time*. Each again will change how we interpret our model. We will code it out now for later:
    - We can leave *Time* as is: it starts at 0 and progresses in the units of measurements
    - Center time *Time*: So there are 6 sessions (0-5), thus center is 2.5: So Session # - 2.5
    - (Note: *yes you can Zscore time or Zscore and NOT center as well*)

```{r}
# Centered
LongSim$Time.C<-scale(LongSim$Time, scale=F, center=T)[,]
```        
  
## Spaghetti Plots
- Longitudinal Models are best assessed with spaghetti plots. When you have a small number of subjects, you can plot it as "facets."
- We will plot each person in a facet and add their linear model for each subject
```{r,fig.width=3.5, fig.height=3.5}
# Make sure that subject is a factor
LongSim$Subject<-as.factor(LongSim$Subject)

Speg.1<-ggplot(data = LongSim, aes(x=Time,y=Coordination))+
  facet_wrap(~Subject)+
  geom_point(aes(color=Instructor.D))+
  geom_smooth(method='lm', se=FALSE,aes(color=Instructor.D))+
  xlab("Time")+ylab("Coordination")+
  theme_bw()+
  theme(legend.position = "top")
Speg.1
```

- If you have too many subjects, you can plot the results like this instead
- On the left, you can fit a regression per subject, on the right you plot lines instead of the slope

```{r, echo=TRUE, out.width='50%', fig.height=2.5,fig.show='hold',fig.align='center'}
theme_set(theme_bw(base_size = 7, base_family = ""))

Speg.2<-ggplot(data = LongSim, aes(x=Time,y=Coordination, group=Subject, color=Subject))+
  facet_wrap(~Instructor.D)+
  geom_point()+
  geom_smooth(method='lm', se=FALSE)+
  xlab("Time")+ylab("Coordination")+
  theme(legend.position = "none")
Speg.2

Speg.3<-ggplot(data = LongSim, aes(x=Time,y=Coordination, group=Subject, color=Subject))+
  facet_wrap(~Instructor.D)+
  geom_point()+
  geom_line()+
  xlab("Time")+ylab("Coordination")+
  theme(legend.position = "none")
Speg.3
```

## Modeling to Examine Random Structure 
- The subject is the Level 2 variable (just like in our repeated measures mixed models)
- We can show that via our ICC analysis 

### Intercepts only model 
```{r}
Null<-lmer(Coordination ~ 1+(1|Subject), data=LongSim, REML=FALSE)
sjstats::icc(Null)
```

#### Intercepts only plot
- Predict out the model results 
- Dots = Real data
- Line = Predicted result
```{r, echo=TRUE,out.width='50%', fig.height=2.5,fig.show='hold',fig.align='center'}
theme_set(theme_bw(base_size = 7, base_family = ""))

LongSim$Null<-predict(Null, newdata=LongSim)

Speg.Null.1<-ggplot(data = LongSim)+
  facet_wrap(~Subject)+
  geom_point(aes(x=Time,y=Coordination,color=Instructor.D))+
  geom_line(aes(x=Time,y=Null, color=Instructor.D))+
  xlab("Time")+ylab("Coordination")+
  theme(legend.position = "top")
Speg.Null.1

Speg.Null.2<-ggplot(data = LongSim)+
  facet_wrap(~Instructor.D)+
  geom_point(aes(x=Time,y=Coordination, group=Subject,color=Subject))+
  geom_line(aes(x=Time,y=Null,group=Subject,color=Subject))+
  xlab("Time")+ylab("Coordination")+
  theme(legend.position = "none")
Speg.Null.2

```
- Notice that each subject has their own intercept, but no slope

### Random Intercepts and Slopes Model 
- We will let the intercept and Time vary relative to each subject: `(1+Time|Subject)`
    - Note you can block the correlation between the slope/intercept like before if the correlation is -1/1: `(1+Time||Subject)`
```{r}
RISlope<-lmer(Coordination ~ 1+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(RISlope)
```

#### Random Intercept and Slopes Plot
- Predict out the model results 
- Dots = Real data
- Line = Predicted result

```{r, echo=TRUE,out.width='50%', fig.height=2.5,fig.show='hold',fig.align='center'}
theme_set(theme_bw(base_size = 7, base_family = ""))

LongSim$RISlope<-predict(RISlope, newdata=LongSim)
Speg.R.I.Slope.1<-ggplot(data = LongSim)+
  facet_wrap(~Subject)+
  geom_point(aes(x=Time,y=Coordination, color=Instructor.D))+
  geom_line(aes(x=Time,y=RISlope, color=Instructor.D))+
  xlab("Time")+ylab("Coordination")+
  theme(legend.position = "top")
Speg.R.I.Slope.1

Speg.R.I.Slope.2<-ggplot(data = LongSim)+
  facet_wrap(~Instructor.D)+
  geom_point(aes(x=Time,y=Coordination, group=Subject,color=Subject))+
  geom_line(aes(x=Time,y=RISlope,group=Subject,color=Subject))+
  xlab("Time")+ylab("Coordination")+
  theme(legend.position = "none")
Speg.R.I.Slope.2
```

### Random Slopes Only Model 
- It could be there is no random intercept, so you can remove it (but you need to have a reason)
- We will let Time vary relative to each subject: `(0+Time|Subject)`
```{r}
RSlope<-lmer(Coordination ~ 1+(0+Time|Subject), data=LongSim, REML=FALSE)
summary(RSlope)
```

#### Random Slopes Only Plot
- Predict out the model results 
- Dots = Real data
- Line = Predicted result

```{r, echo=TRUE,out.width='50%', fig.height=2.5,fig.show='hold',fig.align='center'}
theme_set(theme_bw(base_size = 7, base_family = ""))

LongSim$RSlope<-predict(RSlope, newdata=LongSim)

Speg.R.Slope.1<-ggplot(data = LongSim)+
  facet_wrap(Instructor.D~Subject)+
  geom_point(aes(x=Time,y=Coordination, color=Instructor.D))+
  geom_line(aes(x=Time,y=RSlope, color=Instructor.D))+
  xlab("Time")+ylab("Coordination")+
  theme(legend.position = "top")
Speg.R.Slope.1

Speg.R.Slope.2<-ggplot(data = LongSim)+
  facet_wrap(~Instructor.D)+
  geom_point(aes(x=Time,y=Coordination, group=Subject,color=Subject))+
  geom_line(aes(x=Time,y=RSlope,group=Subject,color=Subject))+
  xlab("Time")+ylab("Coordination")+
  theme(legend.position = "none")
Speg.R.Slope.2
```

- Note: The intercepts for each subject are the same.
- You can also guess that this model will not be a good fit as the last one, which we can test: 

```{r, eval=TRUE}
anova(RSlope,RISlope)
```

## Modeling for Results
- The goal above was to understand the random effects. This is useful if you want to understand your data, but when you move to modeling for publication you will want to *set* your random effects and move to fixed effect testing
- The most obvious structure for most longitudinal (2 level) data is to use random slopes and intercepts (watch that correlation between them)
- We will now also explore the different coding schemes to understand the models

### Non-Centered Time & Dummy coded Instructor 
- Note: because Instructor is 0/1 coded you can also enter that into the model instead of it as a factor (you will get the same answer)

#### Main effects
```{r}
MainEffect.1<-lmer(Coordination ~ Time+Instructor.D+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(MainEffect.1, correlation=FALSE)
```

##### Interpretation
- **Time**: After each session, the kids get are better at coordinating _regardless_ of Instruction (the estimate is the slope over sessions)
- **Instructor.DRobot**: t < 1.96 (so not significant). Had it been significant it would mean that kids who had the robot instructor had overall higher coordination [**averaged over all session**] (the estimate reported is the difference between human and robot)

#### Interaction 
```{r}
Interaction.1<-lmer(Coordination ~ Time*Instructor.D+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Interaction.1, correlation=F)
```

##### Interpretation
- **Time**: slope over sessions for kids who had the *human instructor*.  
- **Instructor.DRobot**: It is the difference between human and robot when Time = 0 [**Baseline**]. 
    - To prove this we can fit the model and subtract the fitted values: *Robot @ Time 0 - Human @ Time 0 *

```{r, eval=FALSE}
library(effects)
effect(c("Time*Instructor.D"),Interaction.1)
```

```{r, echo=FALSE}
library(effects)
# Fancy version of above: Ignore me!
kable(as.data.frame(effect(c("Time*Instructor.D"),Interaction.1)), digits=4,"html", booktabs = T) %>%
  kable_styling(bootstrap_options = "striped", full_width = F)
Inter.1.Effects<-as.data.frame(effect(c("Time*Instructor.D"),Interaction.1))
```
    - Estimate of **Instructor.DRobot**: `r Inter.1.Effects$fit[6]` - `r Inter.1.Effects$fit[1]` = `r Inter.1.Effects$fit[6] - Inter.1.Effects$fit[1]`
        - Note: *You are missing the simple effect of Human @ Time 0*. Relevel instructor to test if you need to know.
- **Time:Instructor.DRobot**: slope over sessions difference =  slope @ *robot instructor* - slope @ *human instructor*

### Centered Time & Dummy coded Instructor 
- Note: Because Instructor is 0/1 coded you can also enter that into the model instead of it as factor (you will get the same answer)

#### Main effects
```{r}
MainEffect.2<-lmer(Coordination ~ Time.C+Instructor.D+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(MainEffect.2, correlation=F)
```

##### Interpretation
- Same as non-centered

#### Interaction 
- Now things will change because the zero-point in time is now the middle of the session (not the baseline)
```{r}
Interaction.2<-lmer(Coordination ~ Time.C*Instructor.D+(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Interaction.2, correlation=F)
```

##### Interpretation
- **Time**: slope for kids who had the human instructor.  
- **Instructor.DRobot**: It is the difference between human and robot when Time = 0. _**But remember Time 0 equals the Middle Session!**_ 
    - To prove this, we can fit the model and subtract the fitted values: *Robot @ Time 0 - Human @ Time 0* [**Middle session**]

```{r, eval=FALSE}
effect(c("Time.C*Instructor.D"),Interaction.2)
```
```{r, echo=FALSE}
# Fancy version of above: Ignore me!
kable(as.data.frame(effect(c("Time.C*Instructor.D"),Interaction.2)), digits=4,"html", booktabs = T) %>%
  kable_styling(bootstrap_options = "striped", full_width = F)
Inter.2.Effects<-as.data.frame(effect(c("Time.C*Instructor.D"),Interaction.2))
```
    - Estimate of **Instructor.DRobot**: `r Inter.2.Effects$fit[8]` - `r Inter.2.Effects$fit[3]` = `r Inter.2.Effects$fit[8] - Inter.2.Effects$fit[3]`
- **Time:Instructor.DRobot**: slope over sessions difference =  slope @ *robot instructor* - slope @ *human instructor*


### Summary
- To center time or not center time? 
- This all depends on what story you need to tell
- In the interaction model:
    - if you don't center time, you are talking about the baseline. In this case, it seems very useful to be able to say the kids with ASD started out at the same level of coordination and over time their slopes changed
     - if you center time you are talking about the middle session, I find this less useful for the question at hand, but keep it in mind that you can examine it this way
     
     
# Plot our Final Model
- We can use the effects package and ggplot to plot the fitted model
```{r,fig.width=3.5, fig.height=3.5}
library(effects)
Fitted.I1<-effect(c("Time*Instructor.D"),Interaction.1)
Fitted.I1<-as.data.frame(Fitted.I1)

Fitted.Plot<-ggplot(data = Fitted.I1, aes(x=Time,y=fit, group=Instructor.D))+
  geom_line(aes(color=Instructor.D))+
  geom_ribbon(aes(ymin=lower,ymax=upper,fill=Instructor.D),alpha=.2)+
  xlab("Session")+ylab("Coordination")+
  scale_y_continuous(lim=c(0,1),breaks = seq(0, 1,.2)) +
  theme_bw()+
  theme(legend.position = c(.85,.875),
        legend.title = element_blank(),
        panel.grid.major = element_line(linetype = "blank"),
        panel.grid.minor = element_line(linetype = "blank"))
Fitted.Plot
```

# Fixed Covariates
- We had a covariate: each kids autism spectrum score. 
- We can simply add it to our model if we want to control for it, but you really should center it so you intercept does not change. 

```{r}
LongSim$ASD.C<-scale(LongSim$ASD, scale=F, center=T)[,]
```

- You can add it to your model just as any regression
```{r}
Cov1.Model<-lmer(Coordination ~ Time*Instructor.D+ASD.C
                +(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Cov1.Model, correlation=F)
```

**Note: the correlation between the intercept and slope = 1, so we should block that moving forward and we would go back and block it in prior models to they are all have the same random structure. For now we will ignore it and move on**

```{r, eval=TRUE}
anova(Interaction.1,Cov1.Model)
```


- Our slope is no longer significant, meaning the covariate might be explaining some of our interaction at the mean level of ASD score. This is suggestive of 3-way interaction.

```{r}
Cov2.Model<-lmer(Coordination ~ Time*Instructor.D*ASD.C
                +(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Cov2.Model, correlation=F)
```

```{r}
anova(Cov1.Model,Cov2.Model)
```


- The three way improved the model fit. 
- **Time:Instructor.DRobot**: is the slope over sessions difference =  slope @ *robot instructor* - slope @ *human instructor* [@ at the average ASD level]
- **Time:Instructor.DRobot:ASD.C**: is the slope over sessions difference =  slope @ *robot instructor* - slope @ *human instructor* [as a function of ASD score]
    - So the slope difference between robot and human instructor is getting larger as ASD level increases. See the plot below to understand this: 

## Plot 3-Way
- We will plot it like any other interaction in a regression involving two continuous variables
- We will need to find 1 SD above/below the mean of the covariate (which was mean centered)

```{r,fig.width=5.5, fig.height=3.5}
SDN1<- -sd(LongSim$ASD.C)
SDP1<- +sd(LongSim$ASD.C)

Fitted.Cov2<-effect(c("Time*Instructor.D*ASD.C"),Cov2.Model, 
                    xlevel=list(Time=seq(0,5,1),
                                ASD.C=c(SDN1,0,SDP1)))
Fitted.Cov2<-as.data.frame(Fitted.Cov2)

Fitted.Cov2$ASD<-factor(Fitted.Cov2$ASD.C, 
                           levels=c(SDN1,0,SDP1),
                           labels=c("-1 SD ASD","Mean ASD","+1 SD ASD"))

Fitted.Plot.2<-ggplot(data = Fitted.Cov2, aes(x=Time,y=fit, group=Instructor.D))+
  facet_grid(~ASD)+
  geom_line(aes(color=Instructor.D))+
  geom_ribbon(aes(ymin=lower,ymax=upper,fill=Instructor.D),alpha=.2)+
  xlab("Session")+ylab("Coordination")+
  scale_y_continuous(lim=c(0,1),breaks = seq(0, 1,.2)) +
  theme_bw()+
  theme(legend.position = c(.15,.85),
        legend.title = element_blank(),
        panel.grid.major = element_line(linetype = "blank"),
        panel.grid.minor = element_line(linetype = "blank"))
Fitted.Plot.2
```


## Test of Simple Interactions
- If you want to know if the interaction is significant at each level of ASD score, you simply have to re-center the ASD.  We already know the middle plot was significant. We are going to watch the **Time:Instructor.DRobot** terms as we change the center point at ASD.C

### Test -1 SD of ASD
- We move the center to be -1 SD by adding 1 SD
```{r}
LongSim$ASD.N1sd<-LongSim$ASD.C+sd(LongSim$ASD.C)
Cov2.Model.Simple.N1sd<-lmer(Coordination ~ Time*Instructor.D*ASD.N1sd
                +(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Cov2.Model.Simple.N1sd, correlation=F)
```

- **Time:Instructor.DRobot** is not significant. So the left panel is our graph is not an interaction

### Test +1 SD of ASD
- We move the center to be +1 SD by subtracting 1 SD
```{r}
LongSim$ASD.P1sd<-LongSim$ASD.C-sd(LongSim$ASD.C)
Cov2.Model.Simple.P1sd<-lmer(Coordination ~ Time*Instructor.D*ASD.P1sd
                +(1+Time|Subject), data=LongSim, REML=FALSE)
summary(Cov2.Model.Simple.P1sd, correlation=F)
```

- **Time:Instructor.DRobot** is significant (as it has to be). So the right panel in our graph is an interaction. 
- In summary, our two way interaction changes as function of ASD score

# References
Chlebowski, C., Green, J. A., Barton, M. L., & Fein, D. (2010). Using the childhood autism rating scale to diagnose autism spectrum disorders. *Journal of autism and developmental disorders*, 40(7), 787-799.

Srinivasan, S. M., Kaur, M., Park, I. K., Gifford, T. D., Marsh, K. L., & Bhat, A. N. (2015). The effects of rhythm and robotic interventions on the imitation/praxis, interpersonal synchrony, and motor performance of children with autism spectrum disorder (ASD): a pilot randomized controlled trial. *Autism research and treatment*.

<script>
  (function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
  (i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
  m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
  })(window,document,'script','https://www.google-analytics.com/analytics.js','ga');

  ga('create', 'UA-90415160-1', 'auto');
  ga('send', 'pageview');

</script>
