Modelling with uncertain (but not missing) categorical random effect values
Dear Michael, Maybe something like (0 + w_1 | dad_1) + (0 + w_2 | dad_2) + (0 + w_3 | dad_3). Where w_1 is the probability of dad_1. Make sure that dad_1, dad_2 and dad_3 are factors with the same levels. Then INLA allows you to add this as f(dad_1, w_1, model = "iid") + f(dad_2, w_2, copy = "dad_"1) + f(dad_3, w_3, copy = "dad_1"). So you end up with a single random intercept for every dad (dad_2 and dad_3 share their estimates with dad_1). mum_id mum_sp dad_sp dad_id con dad_1 w_1 dad_2 w_ 2 dad_3 w_3 Af1 A A Am1 / Am2 1 Am1 0.6 Am2 0.4 NA 0 Af1 A A Am2 1 Am2 1 NA 0 NA 0 Bf1 B A Am1 / Am2 / Am4 0 Am1 0.4 Am2 0.3 Am4 0.3 Bf2 B B Bm1 / Bm3 1 Bm1 0.5 Bm2 0.5 NA 0 Best regards, ir. Thierry Onkelinx Statisticus / Statistician Vlaamse Overheid / Government of Flanders INSTITUUT VOOR NATUUR- EN BOSONDERZOEK / RESEARCH INSTITUTE FOR NATURE AND FOREST Team Biometrie & Kwaliteitszorg / Team Biometrics & Quality Assurance thierry.onkelinx at inbo.be Havenlaan 88 bus 73, 1000 Brussel www.inbo.be /////////////////////////////////////////////////////////////////////////////////////////// To call in the statistician after the experiment is done may be no more than asking him to perform a post-mortem examination: he may be able to say what the experiment died of. ~ Sir Ronald Aylmer Fisher The plural of anecdote is not data. ~ Roger Brinner The combination of some data and an aching desire for an answer does not ensure that a reasonable answer can be extracted from a given body of data. ~ John Tukey /////////////////////////////////////////////////////////////////////////////////////////// <https://www.inbo.be> Op di 13 jul. 2021 om 12:30 schreef Michael Lawson via R-sig-mixed-models < r-sig-mixed-models at r-project.org>:
I have a dataset where I have offspring paternity of females with males of different species. However, many of the offspring have ambiguous paternity - where I know the offspring must be from particular fathers, but not from others. The data currently looks a bit like this (but with many more rows per mum_id): mum_id mum_sp dad_sp dad_id con Af1 A A Am1 / Am2 1 Af1 A A Am2 1 Bf1 B A Am1 / Am2 / Am4 0 Bf2 B B Bm1 / Bm3 1 Which I have so far run as a binomial GLMM with con (conspecific mating) as a binary response, mum_sp and dad_sp (species) as fixed factors and mum_id as a random factor - and have just not included dad_id as a random factor. The ambiguously assigned fathers in dad_id is also non-random, i.e. certain individuals are more likely to be ambiguously assigned than others, so just leaving these cases as NA is problematic. For some of the ambiguous assignments, I can also extract probabilities that a possible male is the father of the offspring, e.g. for the first row, father Am1 is 60% likely to be the father and Am2 40% likely. Are there any approaches where I can include the ambiguous dad_id in a GLMM framework? - where the uncertainty of the assignment contributes to the overall uncertainty in the tested relationship. Thank you for any suggestions, Mike
_______________________________________________ R-sig-mixed-models at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models