Skip to content

extracting data from a dataframe

5 messages · Tim Wilson, Thomas Lumley, A.J. Rossini +2 more

#
Hi everyone,

Here's a question about extracting data from a dataframe:

Let's say I've got a dataframe with two vectors, TEST and GENDER. GENDER
contains a 1 for males and 2 for females. I want to separate the results
of TEST by GENDER so I can compare their means.

What's the most efficient way to do this with R?

-Tim
#
On Fri, 2 Aug 2002, Tim Wilson wrote:

            
No guarantees of efficiency, but the easiest is
  split(df$TEST,df$GENDER)

	-thomas


-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
#
tim> Hi everyone,
    tim> Here's a question about extracting data from a dataframe:

    tim> Let's say I've got a dataframe with two vectors, TEST and GENDER. GENDER
    tim> contains a 1 for males and 2 for females. I want to separate the results
    tim> of TEST by GENDER so I can compare their means.

    tim> What's the most efficient way to do this with R?

something like:

lapply(split(df$TEST,df$GENDER),mean)

?
(where "df" is the data.frame).
#
results
Presuming that you wish to perform a two-sample t-test on the means, you
don't need to actually "split" the dataframe. You could use the formula
method for t.test() by the following:

t.test(df$TEST ~ df$GENDER)

where df is the name of your dataframe.

If you actually want to split the dataframe, see ?split.

HTH.

Marc


-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
2 days later
#
Hi, dear R

I am dealing with my dataset with strong collinearity. I want to use the
Variance Decomposition (Variance Proportion) method. There is a process in
SAS called PROC REG to do it. I wonder if I can find a function in R or
Splus to carry out as well.

Thanks a lot!

Huan

-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._