Skip to content
Prev 215 / 490 Next

Improves haplotype() function in pegas 0.13

Hi Jarrett,

I'm Cc'ing to r-sig-genetics since we had a recent discussion on a similar topic (see below).

----- Le 14 Mai 20, ? 9:59, Jarrett Phillips phillipsjarrett1 at gmail.com a ?crit :
Base ambiguities were already considered in previous versions of pegas, but it was not explicit (or flexible).
Yes. 'strict' means "strict interpretation of the characters without interpreting them as base ambiguities". For instance, consider the following 9 aligned sequences with 2 sites (without labels for simplicity):

AA
AR
AM
AW
AV
AH
AD
AN
A-

By default (and with the last version of pegas), haplotype() will return a single haplotype because it cannot be inferred whether any of the sequences 2-9 is different from the first one. If strict = TRUE, nine haplotypes will be returned.
Yes. I've added a line in the help page of haplotype() to say that 'trailingGapsAsN' has no effect if 'strict = TRUE'.
Maybe the discussion we had on r-sig-genetics could be relevant here. There doesn't seem to be an easy answer to these questions.

Also, the coming version of ape will include the new function latag2n (Leading and Trailing Alignment Gaps to N) which changes sequences such as "A-C-" into "A-CN".

Cheers,

Emmanuel
Message-ID: <786746508.68578.1589437533839.JavaMail.zimbra@ird.fr>
In-Reply-To: <CAMVrHdCOXZGetiABEpz80KHZCnUjRf3H7tF8_RShCXCAJP+Ybg@mail.gmail.com>