afex package & bootstrapping

Wed, Jul 6, 2016 5:04 AM

Hi Dan,

Your code looks reasonable to me. details = 2 is probably responsible 
for the following message which shows you the time it took to calculate 
the reference distribution for the first p-value:
"Reference distribution with   996 samples; computing time: 159891.87 secs."

The one period in the output also shows you that the first of 15 
p-values has been successfully calculated and the next is being calculating.

So it seems that calculating the reference distribution for the first 
p-values took around 44 hours on your 12 cores.

Unfortunately, the speed up by using parallel computation is not linear 
when calculating the reference distribution using pbkrtest::PBmodcomp 
(which mixed uses as for the computation). This is most likely due to 
the fact that some models sometimes are faster than others and then the 
cores idle before starting with the next model until all cores can start 
the next model (I didn't look at the code of PBmodcomp, but that seems 
somewhat likely).

On my 8 cores one can nevertheless see a considerable speed up when 
using multicore compared to when not, using an example model (16 samples 
is obviously a stupid example):

require(afex)
require("mlmRev")
require(parallel)
(nc <- detectCores()) # 8 in my case
cl <- makeCluster(rep("localhost", nc))

mixed(use ~ age + I(age^2) + urban + livch + (1 | district), family = 
binomial, method = "PB", data = Contraception, args.test = list(nsim = 
16, cl = cl, details = 2))

## Contrasts set to contr.sum for the following variables: use, urban, 
livch, district
## Numerical variables NOT centered on 0 (i.e., interpretation of all 
main effects might be difficult if in interactions): age
## Fitting 5 (g)lmer() models:
## [.....]
## Obtaining 4 p-values:
## [Reference distribution with    16 samples; computing time: 33.29 secs.
## .Reference distribution with    16 samples; computing time: 37.25 secs.
## .Reference distribution with    16 samples; computing time: 41.06 secs.
## .Reference distribution with    16 samples; computing time: 27.92 secs.
## .]

mixed(use ~ age + I(age^2) + urban + livch + (1 | district), family = 
binomial, method = "PB", data = Contraception, args.test = list(nsim = 
16, details = 2))
## Contrasts set to contr.sum for the following variables: use, urban, 
livch, district
## Numerical variables NOT centered on 0 (i.e., interpretation of all 
main effects might be difficult if in interactions): age
## Fitting 5 (g)lmer() models:
## [.....]
## Obtaining 4 p-values:
## [Reference distribution with    16 samples; computing time: 143.49 secs.
## .Reference distribution with    16 samples; computing time: 201.51 secs.
## .Reference distribution with    16 samples; computing time: 185.76 secs.
## .Reference distribution with    16 samples; computing time: 120.80 secs.
## .]


You can see a speed up from around 150 to 35 seconds per p-value which 
corresponds to around a fourfold increase in speed, but not anywhere 
near an eightfold increase. So you cannot do much other than wait if you 
want to use parametric bootstrap and not e.g., "LRT" (or use a machine 
with more cores, but given the assumed problem, the speed up will most 
likely be less and less).

Hope that helps,
Henrik  (author of afex)



Am 05.07.2016 um 22:59 schrieb Daniel McCloy:

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

I'm using afex to get bootstrapped p-values. The model fit is quite
fast, but the bootstrapped p-values are taking forever and I wonder if
I'm doing something stupid.  Relevant details from the model summary:

Generalized linear mixed model fit by maximum likelihood (Laplace
Approximation) ['glmerMod']
Family: binomial  ( probit )
Formula: press ~ truth * revb * gend * attn + (1 | subj)
Number of obs: 20480, groups:  subj, 16

I set the script to use all 12 cores of my local machine and run over
the weekend:

library(afex)
library(parallel)
cl <- makeForkCluster(nnodes=12, outfile="cluster-log.txt")
form <- formula(press ~ truth*revb*gend*attn + (1|subj))
rev_mod <- mixed(form, data=revb_df, family=binomial(link="probit"),
                 method="PB", check.contrasts=FALSE, cl=cl,
                 args.test=list(nsim=1000, cl=cl, seed=1234, details=2),
                 control=glmerControl(optCtrl=list(maxfun=30000)))

After running for 3.5 days, all 12 cores are still maxed out, RAM usage
looks fine (around 16GB in use of the 32GB total), and the afex output
looks like this:

Fitting 16 (g)lmer() models.
Obtaining 15 p-values:
[Reference distribution with   996 samples; computing time: 159891.87 secs.
.

(the single period on the 4th line is part of the output).  The first 2
lines appeared before I left work on friday (i.e., within about 15
minutes of starting the script).  So presumably it has been working on
the p-values for the remaining 80+ hours.  I don't know exactly when the
"computing time" line appeared, but its estimate works out to about 44
hours...  is that just for the first of the 15 p-values, or for all of
them?  Is creating a cluster with all 12 cores a foolish idea?  (Am I
thereby strangling the multithreading abilities of some underlying
linalg library, effectively slowing things down?)  The dataset and
number of iterations just don't seem all that huge to me but maybe I'm
being naive.

- -- dan

Daniel McCloy
http://dan.mccloy.info
Postdoctoral Research Associate
Institute for Learning & Brain Sciences
University of Washington
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2

iQEcBAEBCAAGBQJXfB+cAAoJEEzlbMQqUVLOIuUH/Rb920Pcbedb6WaudG0KyifM
T3hQEALxFLyHttrbRAC1CrHgD4XClrTNQJGs3jcr+F23T/JYaR1jizFJ6SOBLi5i
niB+F5z2nD5yLsYSAHvvvfkcHTGCsFmUwp2tvThxYySAjehpzKzcm30ZSVcZH3qN
VpaWR0fBH3rmqwiUqE6IT3Nbt+wV9F2G/SK+wyX9waSwRN1SpiiCwQ1AIxU5s6SI
BSRL96nBLf93OLUgGZ22Lvu9w/AR7xNT6uNE2R9XdSMWF7/u+/eEgw/GVDWe+1l+
NYKN28qoynBeEagMfeZt+JCq6OG9yIWv4z6cE7EpPtTbxMqebrIQUrFU4uIGoAI=
=nuKw
-----END PGP SIGNATURE-----

afex package & bootstrapping

Thread (2 messages)