Hi,

imagine the following matrix/data.frame:

Letter Number
a 1
a 1
b 1
b 0
c 0
c 1
d 0
d 0

If the numbers for two identical letters are also identical, then I want to remove either the first or the second row of that letter. If for a letter the numbers are 1 and 0, I want to remove the row with the 0.

That means if the code works I would end up with the following matrix/data.frame:

Letter Number
a 1
b 1
c 1
d 1

Many thanks,
Syrvn

--
View this message in context: http://r.789695.n4.nabble.com/Remove-specific-rows-in-a-matrix-data-frame-tp3902149p3902149.html
Sent from the R help mailing list archive at Nabble.com.
Remove specific rows in a matrix/data.frame
9 messages · syrvn, David Winsemius, Vining, Kelly +3 more
On Oct 13, 2011, at 12:42 PM, syrvn wrote:
> Hi, imagine the following matrix/data.frame
>
> Letter Number
> a 1
> a 1
> b 1
> b 0
> c 0
> c 1
> d 0
> d 0
>
> If the numbers for two identical letters are also identical, then I want to remove either the first or the second row of that letter. If for a letter the numbers are 1 and 0, I want to remove the row with the 0.

?duplicated

> That means if the code works I would end up with the following matrix/data.frame
>
> Letter Number
> a 1
> b 1
> c 1
> d 1

But with no 1's associated with 'd' this does not make sense.

> Many thanks, Syrvn
David Winsemius, MD West Hartford, CT
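The pointer to `duplicated` can be turned into a short sketch. What follows is an illustration, not code posted in the thread: it assumes that "keep the 1 when a letter has both 1 and 0" amounts to keeping the row with the largest Number per letter. The names `ord` and `res` are made up for this example.

```r
# Example data from the original post
dtf <- data.frame(Letter = c("a", "a", "b", "b", "c", "c", "d", "d"),
                  Number = c(1, 1, 1, 0, 0, 1, 0, 0),
                  stringsAsFactors = FALSE)

# Sort by Letter, and within each Letter put the largest Number first
ord <- dtf[order(dtf$Letter, -dtf$Number), ]

# duplicated() flags every repeat of a Letter after its first occurrence,
# so negating it keeps exactly one row (the largest Number) per Letter
res <- ord[!duplicated(ord$Letter), ]
res
```

For 'd', where both rows are 0, this keeps a single row with 0 rather than producing a 1.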
Hi Syrvn,
how about this
dtf<-read.table(textConnection("Letter Number
a 1
a 1
b 1
b 0
c 0
c 1
d 0
d 0"),header=T)
aggregate(Number~Letter,data=dtf,max)
cheers.
______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Eik Vettorazzi
Department of Medical Biometry and Epidemiology
University Medical Center Hamburg-Eppendorf
Martinistr. 52
20246 Hamburg
T ++49/40/7410-58243
F ++49/40/7410-57790
Why would you end up with d1 in your output if you don't have a d1 in your original data frame? Are you saying that, when both letters have a zero after them, you want to replace one of them with a 1?
try this:
x <- read.table(textConnection("Letter Number
a 1
a 1
b 1
b 0
c 0
c 1
d 0
d 0"), as.is = TRUE, header = TRUE)
closeAllConnections()
# following assumes that there are pairs of numbers
result <- do.call(rbind, lapply(split(x, x$Letter), function(.pair){
    if (all(.pair$Number[1L] == .pair$Number)) return(.pair[1L, ])
    is1 <- which(.pair$Number == 1)
    if (length(is1) == 1) return(.pair[is1, ])
    else return(NULL)
}))
result
  Letter Number
a      a      1
b      b      1
c      c      1
d      d      0
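The code above is written for exactly two rows per letter. As a hedged variation (not part of the original reply), the same split/lapply pattern can pick the row with the largest Number in each group, which also covers letters with one row or more than two rows. The helper name `pick_max` is hypothetical.

```r
x <- data.frame(Letter = c("a", "a", "b", "b", "c", "c", "d", "d"),
                Number = c(1, 1, 1, 0, 0, 1, 0, 0),
                stringsAsFactors = FALSE)

# Hypothetical helper: which.max() returns the index of the first maximum,
# so ties (two 1s or two 0s) resolve to the first row of the group
pick_max <- function(grp) grp[which.max(grp$Number), , drop = FALSE]

result <- do.call(rbind, lapply(split(x, x$Letter), pick_max))
result
```

Note that `which.max` ignores NA values, so this sketch assumes Number has at least one non-missing value per letter.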
Jim Holtman
Data Munger Guru

What is the problem that you are trying to solve?
Thanks for your answers! Will check them now :)

Yes, sorry, I was wrong.

Letter Number
d 0
d 0

should be:

Letter Number
d 0

after applying the algorithm!
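With this correction, the `aggregate` suggestion posted earlier in the thread appears to give the desired result directly. The sketch below assumes Number only ever holds 0 or 1, so the per-letter maximum is 1 whenever a 1 is present and 0 otherwise; the data frame is built with `data.frame` here only to keep the example self-contained.

```r
dtf <- data.frame(Letter = c("a", "a", "b", "b", "c", "c", "d", "d"),
                  Number = c(1, 1, 1, 0, 0, 1, 0, 0),
                  stringsAsFactors = FALSE)

# One row per Letter, holding the largest Number seen for that Letter
res <- aggregate(Number ~ Letter, data = dtf, FUN = max)
res
#   Letter Number
# 1      a      1
# 2      b      1
# 3      c      1
# 4      d      0
```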
Hello again,
dtf<-read.table(textConnection("Letter Test Number
a b 1
a b 1
b b 1
b b 0
c b 0
c b 1
d b 0
d b 0"),header=T)
aggregate(Number ~ Letter,data=dtf,max)
How can I adjust this solution so that the result also includes "Test"?
I tried:
aggregate(Number ~ Letter,data=dtf,max,by=list("Letter", "Test", "Number"))
But it breaks with the following error message:
Error in aggregate.data.frame(mf[1L], mf[-1L], FUN = FUN, ...) :
arguments must have same length
Hi,

just put it in the formula:

aggregate(Number ~ Letter+Test,data=dtf,max)

cheers
Eik Vettorazzi
Institut für Medizinische Biometrie und Epidemiologie
Universitätsklinikum Hamburg-Eppendorf
Martinistr. 52
20246 Hamburg
T ++49/40/7410-58243
F ++49/40/7410-57790
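To make the formula answer concrete, here is a small self-contained sketch using the three-column data from the question. In the formula interface the grouping variables go on the right-hand side of `~`, so no `by` argument is needed. The second call, with `.` standing for all remaining columns, is an addition for illustration and was not suggested in the thread.

```r
dtf <- data.frame(Letter = c("a", "a", "b", "b", "c", "c", "d", "d"),
                  Test   = "b",
                  Number = c(1, 1, 1, 0, 0, 1, 0, 0),
                  stringsAsFactors = FALSE)

# Group by both Letter and Test; both appear as columns in the result
res <- aggregate(Number ~ Letter + Test, data = dtf, FUN = max)
res

# Illustrative alternative: '.' groups by every column other than Number
res_dot <- aggregate(Number ~ ., data = dtf, FUN = max)
res_dot
```

Grouping by an extra column only preserves one row per letter when that column is constant within each letter, as Test is here; otherwise each Letter/Test combination gets its own row.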