mcmcglmm Priors: Auto correlation and extreme post mean values

I forgot to copy the group on my original reply. Here is the text of my
notes:

Thanks for sharing the relevant data, model details, and output. My first
and biggest concern is the use of the Site variable as a random intercept.
There are only 175 observations, but you have 86 different sites. Without
seeing the actual data, that works out to about 2 observations per site on
average. That is not enough data to support a random intercept for Site as
it stands.

You should use Site only after you have reduced the 86 sites to a few
meaningful groupings (3 or 4 groups) based on criteria you use in Ecology.
Additionally, the remaining factors create 2 * 5 * 4 = 40 cells, and you
have only 175 observations, or about 4.4 observations per cell. This number
is also too low for you to use all of these factors in your analysis.
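
The arithmetic above can be checked directly, and the same counts can be
read off the data. A rough sketch in R: the data frame `dat` below is
simulated placeholder data, not the real data, and the column names (`Site`,
`Human_presence`, `Landuse`, `Habitat`) as well as which factor has 2, 5, or
4 levels are assumptions made only for illustration.

```r
# Counts quoted in this thread
n_obs   <- 175
n_sites <- 86
n_cells <- 2 * 5 * 4              # 40 factor combinations

obs_per_site <- n_obs / n_sites   # roughly 2 per site
obs_per_cell <- n_obs / n_cells   # roughly 4.4 per cell

# Simulated placeholder data (NOT the real data); column names are assumed
set.seed(1)
hp_levels <- c("no", "yes")
lu_levels <- paste0("L", 1:5)
hb_levels <- paste0("H", 1:4)
dat <- data.frame(
  Site           = factor(sample(seq_len(n_sites), n_obs, replace = TRUE)),
  Human_presence = factor(sample(hp_levels, n_obs, replace = TRUE), levels = hp_levels),
  Landuse        = factor(sample(lu_levels, n_obs, replace = TRUE), levels = lu_levels),
  Habitat        = factor(sample(hb_levels, n_obs, replace = TRUE), levels = hb_levels)
)

# How many sites have 1, 2, 3, ... observations
site_counts <- table(dat$Site)
table(site_counts)

# Observations in each Human_presence x Landuse x Habitat cell
cell_counts <- with(dat, table(Human_presence, Landuse, Habitat))
summary(as.vector(cell_counts))
```

On the real data, the two `table()` calls show how thin the individual sites
and cells are, rather than only the averages.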

All of the problems you are facing most likely arise from applying these
techniques incorrectly to your data. I would take the following
steps:
1. Run a bivariate means comparison using ANOVA after you declare your
Landuse, Human_presence etc. as factors, using aov(trappings ~ landuse).
2. In doing so you are treating trappings as continuous rather than as a
count, but I think for exploration that is okay.
3. Make sure you examine the means of the trappings for each level and
combine levels within a factor, when the sub-groups are not different from
each other.
4. This will reduce the number of groups you are working with since your
records are limited.
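
The steps above could look roughly like this in R. This is only a sketch on
simulated placeholder data: the level names, the Poisson draw for
`trappings`, and the choice of which levels to merge are all invented for
illustration. On the real data, the merge should follow what the level means
and pairwise comparisons actually show.

```r
# Simulated placeholder data (NOT the real data); names are assumed
set.seed(2)
lu_levels <- c("crop", "forest", "pasture", "urban", "wetland")
dat <- data.frame(
  landuse   = factor(sample(lu_levels, 175, replace = TRUE), levels = lu_levels),
  trappings = rpois(175, lambda = 3)
)

# Step 1: one-way ANOVA of trappings on a single factor
fit <- aov(trappings ~ landuse, data = dat)
summary(fit)

# Step 3: level means and pairwise comparisons between levels
tapply(dat$trappings, dat$landuse, mean)
TukeyHSD(fit)

# Steps 3-4: merge levels that do not differ from each other
# (this particular merge is purely illustrative)
dat$landuse2 <- dat$landuse
levels(dat$landuse2) <- list(
  open   = c("crop", "pasture"),
  wooded = c("forest", "wetland"),
  urban  = "urban"
)
table(dat$landuse2)
```

The same pattern would be repeated for each factor in turn, one factor per
`aov()` call.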

Only after undertaking the steps above should you consider running any
models in lme4 or nlme. Ideally, if you could reduce those factors that
have 4 and 5 groups to having 2 each (if the differences are not
significant - see # 3) then you could do meaningful analyses. Similarly,
with Site, the 86 levels should be reduced to either 3 or 4 for you to use
them even for a descriptive analysis. Without adding in more data or
reducing your number of levels, you should not undertake any further
modeling.

I also wanted to add that for the Site variable, you should first find a
theoretical way to reduce the number of sites to 3 or 4 broad groups. You
cannot use a means comparison to do that.
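
One way to do this in R is to keep the grouping in a separate lookup table
that is built from ecological criteria and never from the response. A minimal
sketch, where the site IDs, the group names, and the tiny data frame are all
hypothetical:

```r
# Hypothetical site-to-group lookup, defined from ecological criteria only.
# Site IDs and group names are invented for illustration.
site_groups <- data.frame(
  Site       = paste0("S", 1:6),
  Site_group = c("lowland", "lowland", "upland", "upland", "coastal", "coastal")
)

# Placeholder observations (NOT the real data)
dat <- data.frame(
  Site      = c("S1", "S1", "S2", "S3", "S4", "S5", "S6", "S6"),
  trappings = c(2, 0, 1, 4, 3, 0, 5, 2)
)

# Attach the broad group to each observation by matching on Site
dat$Site_group <- factor(site_groups$Site_group[match(dat$Site, site_groups$Site)])

# Every site must map to a group; then check observations per group
stopifnot(!anyNA(dat$Site_group))
table(dat$Site_group)
```

Keeping the lookup separate from the data makes the grouping criteria
explicit and easy to revise.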
On Fri, May 15, 2020 at 12:08 PM sree datta <sreedta8 at gmail.com> wrote:

            

  
  
Message-ID: <CAHftDbhQoa5JTpo+-FT3Y4wqtBHSgWKD=ykvSDJg74G_+r4oxg@mail.gmail.com>
In-Reply-To: <CAHftDbj8MX_ivViFaq-y88a7-u1ME31_mafs=b0UoX4RqWFy=w@mail.gmail.com>