Dominant Philosophy in Experimental Psychology
Declared | Null is True (no difference) | Null is False (difference) | Total |
---|---|---|---|
Sig | Type 1 (V) | True Positives (S) | R |
Non-Sig | True Negative (U) | Type II (T) | M-R |
Total | Mo | M-Mo | M |
Familywise error = V / Mo = Probability of getting at least one test wrong
Let’s imagine that 25% of the brain region are different from each other at a set effect size. We will run a Monte-Carlo Simulation: 50,000 t-tests, \(N=10\) per group, \(\alpha_{pc}=.05\), which will yield Power = .50. Our goal is to find those 25 regions and we will assume each test is independent and we sample with replacement. Further, we will know which regions were actually different (We know the “truth”).
Declared | Null is True (no difference) | Null is False (difference) | Total |
---|---|---|---|
Sig | 1829 | 6281 | 8110 |
Non-Sig | 35671 | 6219 | 41890 |
Total | 37500 | 12500 | 50000 |
FWER = V / Mo = 0.049
Type II = T/M-Mo = 0.498, which is correct since power = .50
If you ran 10K studies 16.22% of them would have been declared significant (R/M), but in reality 25% of the studies had a true effect.
Here is another way to view it: Of the tests we declared significant a large percentage of them of were not real (V / R = 22.55%).
\[\alpha_{pc} = \frac{\alpha_{EW}}{j} = \frac{.05}{50,000} = 1X10^{-6} \]
We will keep everything else the same from the last simulation.
Declared | Null is True (no difference) | Null is False (difference) | Total |
---|---|---|---|
Sig | 0 | 3 | 3 |
Non-Sig | 37500 | 12497 | 49997 |
Total | 37500 | 12500 | 50000 |
FWER = V / Mo = 0
Type II = T/M-Mo = 1. So we missed everything (power is basically 0)
So we made no mistakes, but we found nothing. NIH never gives you another penny.
Given our original d = 0.926, we would recalculate the sample size we would need per group (62) given our new alpha \(\alpha_{pc} = 5X10^{-6}\). That is a 520% increase in sample size!
Let’s try to increase our sample size to 62 per group and try again
Declared | Null is True (no difference) | Null is False (difference) | Total |
---|---|---|---|
Sig | 0 | 6339 | 6339 |
Non-Sig | 37500 | 6161 | 43661 |
Total | 37500 | 12500 | 50000 |
FWER = V / Mo = 0
Type II = T/M-Mo = 0.493. So we fixed our power problem and made no FWER! BUT:
650 Per scan * N = 10 * 2 groups = 13,000 dollars
650 Per scan * N = 62 * 2 groups = 80,600 dollars
And yet we still miss 50% of the results because we are powered at .50. If we go to power .95, as we don’t want to miss 50% of the results.
N = 106
650 Per scan * N = 106 * 2 groups = 137,800 dollars
Declared | Null is True (no difference) | Null is False (difference) | Total |
---|---|---|---|
Sig | 0 | 11906 | 11906 |
Non-Sig | 37500 | 594 | 38094 |
Total | 37500 | 12500 | 50000 |
FWER = V / Mo = 0
Type II = T/M-Mo = 0.048.
All of the low low price of 137,800 dollars for 1 study!
FWER is appropriate when you want NO false positives
Dominant Philosophy in Neuroscience, genomics, etc. Any field where they are not as worried about false positives but interested in making discoveries.
Declared | Null is True (no difference) | Null is False (difference) | Total |
---|---|---|---|
Sig | Type 1 (V) | True Positives (S) | R |
Non-Sig | True Negative (U) | Type II (T) | M-R |
Total | Mo | M-Mo | M |
FDR error = V / R
For example 5 t-tests only:
Test | Pvalue | Rank | BH Critical Value (Rank/m)*q [.1] | Significant? |
---|---|---|---|---|
Test 5 | .001 | 1 | 0.02 | Yes |
Test 1 | .030 | 2 | 0.04 | Yes |
Test 3 | .045 | 3 | 0.06 | Yes |
Test 2 | .056 | 4 | 0.08 | Yes |
Test 4 | .230 | 4 | 0.1 | No |
Repeat our original simulation but correct the pvalues using FDR correction. Note we will set FDR level to be about .05 to compare to Bonferroni, but people could change their alpha to be higher (.1 or .25 as in the Dead Salmon Paper).
Declared | Null is True (no difference) | Null is False (difference) | Total |
---|---|---|---|
Sig | 21 | 633 | 654 |
Non-Sig | 37479 | 11867 | 49346 |
Total | 37500 | 12500 | 50000 |
We can compare this to our Bonferroni Correction Table.
FDR = V / R = 0.0321
Type II = T/M-Mo = 0.949
Declared | Null is True (no difference) | Null is False (difference) | Total |
---|---|---|---|
Sig | 152 | 1958 | 2110 |
Non-Sig | 37348 | 10542 | 47890 |
Total | 37500 | 12500 | 50000 |
We can compare this to our Bonferroni Correction Table.
FDR = V / R = 0.072
Type II = T/M-Mo = 0.843
Correct.Pvalues<-p.adjust(Pvector, method = 'fdr')
. You
can decide on what to call significant (alpha .05 or .10 depending on
how exploratory you want to be).