SUM,COUNT,AVG

Hadley Wickham · 2009-04-06T14:56:05Z

On Mon, Apr 6, 2009 at 9:34 AM, Stavros Macrakis wrote:
> There are various ways to do this in R.
>
> # sample data
> dd
>
> Using the standard built-in functions, you can use:
>
> *** aggregate ***
>
> aggregate(dd,list(b=dd$b,c=dd$c),sum)
>   b c  a b c
> 1 1 1 10 2 2
> 2 2 1  3 2 1
> ....
>
> *** tapply ***
>
> tapply(dd$a,interaction(dd$b,dd$c),sum)
>     1.1     2.1     3.1     1.2



That's because ddply applies the function to the whole data frame, not
just the columns that aren't participating in the split.  One way
around it is:

ddply(dd, ~ b + c, function(df) each(length, sum, mean)(df$a))

I haven't figured out a more elegant way to specify this yet.

Hadley

http://had.co.nz/
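
A minimal sketch of the workaround above, for anyone who wants to try it. The sample data here is hypothetical (the `dd` from the thread is not reproduced), and the helper name `stats` is only for illustration. The idea is that each() bundles length, sum and mean into one function, and the anonymous function passes it just the column `a` from each b/c group rather than the whole data frame.

```r
library(plyr)

# Hypothetical sample data; the thread's own `dd` is not shown here.
dd <- data.frame(a = 1:6,
                 b = c(1, 1, 2, 2, 3, 3),
                 c = c(1, 1, 1, 2, 2, 2))

# each() combines several functions into one that returns a named vector.
stats <- each(length, sum, mean)

# Split by b and c, then summarise column `a` only within each piece.
res <- ddply(dd, ~ b + c, function(df) stats(df$a))
print(res)
```

The result should have one row per b/c combination, with the grouping columns followed by `length`, `sum` and `mean`.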
