[R-meta] Calculation of p values in selmodel
Thanks for the suggestion of a Bayesian approach, James. I want to avoid priors, if possible, and go as far as I can with the selmodel approaches for now. And I don't want to move the p-value threshold around, since it's the studies with p > 0.05 that are less likely to get published.

The 3-parameter selection model, with one step at 0.025, works brilliantly in the simulations when there isn't too much publication bias, including, importantly, when there is none, where it works much better than the PEESE method. Of course, you don't know how much publication bias there is, so it's important to use a method that works across the possible range of none through lots, including 100% failure to publish non-significant effects. That's why it's so disappointing that the 3PSM doesn't work with no non-significant effects.

When I looked at the data I showed in my last message, you could get the impression that the problem is simply that selmodel needs at least one non-significant study estimate for each level of the factor Sex in the model. But it isn't so. There are plenty of sims where there are no non-significant estimates for the females and just one for the males. For example, one sim has 11 study estimates, consisting of 5 significant females, 5 significant males, and one non-significant male (p = 0.58). No problem. So maybe the error message is misleading.

For about 5% of the simulations I get the warning message "Error when trying to invert Hessian", but selmodel still produces adjusted point estimates for the fixed effects and tau2, so that's not the problem. The problem is the occasional sim (about 1 in 300, with the current simulation) where the error message "One or more intervals do not contain any observed p-values" is wrong, and where it then crashes out of the list processing.
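To stop that occasional error from killing the whole run, one option is to wrap the selmodel() call in tryCatch() so a failed sim returns NULL instead of aborting the list processing. This is only a sketch, not tested on your data; `fit_selmodel_safe` is a hypothetical helper name.

```r
# Sketch: wrap selmodel() in tryCatch() so the rare
# "One or more intervals do not contain any observed p-values" error
# returns NULL for that sim instead of crashing the whole run.
library(metafor)

fit_selmodel_safe <- function(res, steps = 0.025) {
  tryCatch(
    selmodel(res, type = "step", steps = steps),
    error = function(e) {
      message("selmodel failed for this sim: ", conditionMessage(e))
      NULL  # lapply() carries on with the next sim
    }
  )
}

# e.g. sel_fits <- lapply(rma_fits, fit_selmodel_safe)
#      failed   <- vapply(sel_fits, is.null, logical(1))
```

The failed sims can then be counted or inspected afterwards rather than lost mid-loop.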
Will

-----Original Message-----
From: R-sig-meta-analysis <r-sig-meta-analysis-bounces at r-project.org> On Behalf Of James Pustejovsky via R-sig-meta-analysis
Sent: Monday, March 18, 2024 5:47 AM
To: R Special Interest Group for Meta-Analysis <r-sig-meta-analysis at r-project.org>
Cc: James Pustejovsky <jepusto at gmail.com>
Subject: Re: [R-meta] Calculation of p values in selmodel

This is an issue with maximum likelihood estimation of the step function selection model generally (rather than a problem with the software implementation). The step function model assumes that there are different selection probabilities for effect size estimates with p-values that fall into different intervals. For a 3-parameter model, the intervals are [0, .025] and (.025, 1], with the first interval fixed to have selection probability 1 and the second interval having selection probability lambda > 0 (an unknown parameter of the model). If there are no observed ES estimates in the first interval, then the ML estimate of lambda is infinite. If there are no observed ES estimates in the second interval, then the ML estimate of lambda is zero, outside of the parameter space.

In some of my work, I've implemented an ad hoc fix for the issue by moving the p-value threshold around so that there are at least three ES estimates in each interval. This isn't based on any principle in particular, although Jack Vevea once suggested to me that this might be the sort of thing an analyst might do just to get the model to converge. A more principled way to fix the issue would be to use penalized likelihood or Bayesian methods with an informative prior on lambda. See the publipha package (https://cran.r-project.org/package=publipha) for one implementation.

James
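The degenerate cases James describes can be anticipated before fitting: count how many one-sided p-values, computed with the normal approximation selmodel uses, fall in each interval. A minimal base-R sketch, with illustrative yi and vi:

```r
# Sketch: check the 3PSM preconditions before calling selmodel().
# If either interval count is zero, the ML estimate of lambda
# degenerates (infinite or on the boundary), as described above.
yi <- c(0.8, 0.6, 0.9, 0.2)      # illustrative effect size estimates
vi <- c(0.04, 0.05, 0.06, 0.05)  # illustrative sampling variances

p1 <- pnorm(yi / sqrt(vi), lower.tail = FALSE)  # one-sided p, normal approx
table(cut(p1, breaks = c(0, 0.025, 1), include.lowest = TRUE))
```

A sim could then be flagged or skipped whenever one of the two counts is zero.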
On Sat, Mar 16, 2024 at 10:23 PM Will Hopkins via R-sig-meta-analysis <r-sig-meta-analysis at r-project.org> wrote:
No-one has responded to this issue. It's now causing a problem in my simulations when I am analyzing for publication bias arising from deletion of 90% of non-significant study estimates and ending up with small numbers (10-30) of included studies. See below (and attached as an easier-to-read text file) for an example. Two of the 14 study estimates (Rows 8 and 9) were non-significant, but the original t value (tOrig) would have made them significant in selmodel(x, type = "step", steps = (0.025)). So I processed any observations with non-significant p values and t > 1.96 by replacing the standard error (YdelSE) with Ydelta/1.95. The resulting new t values (tNew) are 1.95 for both those observations, whereas all the other t values are unchanged. So they should be non-significant in selmodel, right? But I still get this error message:
Error in selmodel.rma.uni(x, type = "step", steps = (0.025)) :
One or more intervals do not contain any observed p-values (use
'verbose=TRUE' to see which).
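For reference, the SE replacement described above can be sketched as follows. The example rows are Rows 8 and 9 of the table below, with the original SEs back-calculated from tOrig (so the exact values are a reconstruction); `dat` is a hypothetical data frame with the same column names.

```r
# Sketch of the SE adjustment: for any observation that was
# non-significant in the original study (pValue > 0.05) but has
# Ydelta/YdelSE > 1.96, rescale the SE so the ratio becomes 1.95.
dat <- data.frame(Ydelta = c(4.46, 3.2),
                  YdelSE = c(2.20, 1.62),   # original SEs, back-calculated from tOrig
                  pValue = c(0.0578, 0.0795))

nonsig <- dat$pValue > 0.05 & abs(dat$Ydelta / dat$YdelSE) > 1.96
dat$YdelSE[nonsig] <- abs(dat$Ydelta[nonsig]) / 1.95

dat$Ydelta / dat$YdelSE  # both ratios are now 1.95
```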
I must be doing something idiotic, but what? Help, please!

Oh, and thanks again to Tobias Saueressig for his help with list-processing of the objects created by rma, selmodel and confint. My original for-loop approach fell over when the values of the Sim variable were not consecutive integers (for example, when I had generated the sims and then deleted any lacking non-significant study estimates), but separate processing of the lists as suggested by Tobias worked perfectly. It stops working when it crashes out with the above error, but hopefully someone will solve that problem.
Will
    Sim StudID Sex    SSize Ydelta YdelSE tOrig  tNew      pValue
 1  448      1 Female    10   3.72  0.684  5.44  5.44 0.000413
 2  448      6 Female    10   3.08  0.901  3.42  3.42 0.00766
 3  448     11 Female    10   4.49  0.926  4.85  4.85 0.000906
 4  448     21 Female    28   4.95  0.777  6.37  6.37 0.000000808
 5  448     26 Female    12   3.82  1.25   3.06  3.06 0.0109
 6  448     31 Female    22   2.13  0.991  2.15  2.15 0.0433
 7  448     36 Female    10   3.27  1.13   2.89  2.89 0.0177
 8  448     10 Male      18   4.46  2.29   2.03  1.95 0.0578
 9  448     14 Male      10   3.2   1.64   1.98  1.95 0.0795
10  448     17 Male      13   4.32  1.97   2.19  2.19 0.049
11  448     30 Male      10   1.16  0.467  2.48  2.48 0.0348
12  448     38 Male      10   3.61  1.24   2.91  2.91 0.0175
13  448     39 Male      10   2.49  0.828  3.01  3.01 0.0148
14  448     40 Male      28   1.92  0.602  3.19  3.19 0.0036
From: Will Hopkins <willthekiwi at gmail.com>
Sent: Friday, March 15, 2024 8:39 AM
To: 'R Special Interest Group for Meta-Analysis' <r-sig-meta-analysis at r-project.org>
Subject: Calculation of p values in selmodel
According to your documentation, Wolfgang, the selection models in selmodel are based on the p values of the study estimates, but these are computed by assuming the study estimate divided by its standard error has a normal distribution, whereas significance in the original studies of mean effects of continuous variables would have been based on a t distribution. It could make a difference when sample sizes in the original studies are ~10 or so, because some originally non-significant effects would be treated as significant by selmodel. For example, with a sample size of 10, a mean change has 9 degrees of freedom, so a p value of 0.080 (i.e., non-significant, p > 0.05) in the original study will be given a p value of 0.049 (i.e., significant, p < 0.05) by selmodel.

Is this issue likely to make any real difference to the performance of selmodel with meta-analyses of realistic small-sample studies? I guess that only a small (negligible?) proportion of p values will fall between 0.05 and 0.08, in the worst-case scenario of a true effect close to the critical value and with only 9 degrees of freedom for the SE. If it is an issue, you could include the SE's degrees of freedom in the rma object that gets passed to selmodel.
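The worst-case example above can be checked directly in base R; this is just a sketch of the arithmetic, nothing more.

```r
# A t statistic whose two-sided p-value is 0.080 on 9 df (as in the
# original study) falls below 0.05 under the normal approximation
# that selmodel applies to the estimate/SE ratio.
tstat  <- qt(0.96, df = 9)                           # chosen so 2-sided t p-value = 0.08
p_t    <- 2 * pt(tstat, df = 9, lower.tail = FALSE)  # 0.08 by construction
p_norm <- 2 * pnorm(tstat, lower.tail = FALSE)       # ~0.049, i.e. "significant"
```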
Will
_______________________________________________ R-sig-meta-analysis mailing list @ R-sig-meta-analysis at r-project.org To manage your subscription to this mailing list, go to: https://stat.ethz.ch/mailman/listinfo/r-sig-meta-analysis