Skip to content
Prev 170899 / 398506 Next

Counting/processing a character vector

Apologies, Jim Holtman has pointed out a couple of problems/queries with
my original email that I would like to make clear.

Firstly, I introduced a typo when trying to be helpful. In my email
below, I had incorrectly typed out one of the species codes I would
count:

10000000
16220602
20110000
24000000
40320203 ## This should have been 40210102
45140000
45630600 == 7 "species" present.

Secondly, the criteria I laid out might suggest that in the 10 rows of
example I quoted, I would count both:

45630000
45630600

This is not what I wanted and apologies that this was not clear. I only
want to count 45630600 because this is more "specific" in terms of what
creature this is than 45630000. I don't know that 45630000 is not
45630600, so I should not count both 45630000 and 45630600, as this
could be double accounting.

These data are species counts and sometimes it is not possible to
identify an individual to species level. Sometime we can't even get the
genera, or even family, hence why sometimes we have a count for the
family (45630000) as well as for the genus (45630600) in the same
sample/site. It depends on how much of the individual there is to
identify it from as to how precise the identification is.

So I only want to count a higher level category only if I have not
counted a lower level category contained within this higher level.

I hope this is a little bit clearer? And no, I did not come up with this
coding system nor the idea to use "counts" of "species" in this
way... ;-)

Apologies if my original email caused unnecessary confusion.

All the best,

G
On Wed, 2009-02-18 at 13:37 +0000, Gavin Simpson wrote: