
Advice on obscuring unique IDs in R

On Jan 5, 2011, at 3:19 PM, Anthony Staines wrote:

            
Try using sapply():


L <- replicate(60000, paste(sample(letters, 10, replace = TRUE), collapse = ""))

> str(L)
 chr [1:60000] "dfederergw" "nwphehurvb" "avzmvltrhn" ...

> head(L)
[1] "dfederergw" "nwphehurvb" "avzmvltrhn" "ecmeiasmbk" "kmlcxydygl"
[6] "wpftnyrzwe"


# Use sapply() to run digest() over each element of L
# (digest() comes from the CRAN package 'digest'; the result is stored
# here under the name L.Digest)
> library(digest)
> system.time(L.Digest <- sapply(L, digest))
   user  system elapsed 
  6.920   0.031   7.361

> str(L.Digest)
 Named chr [1:60000] "6d5861904ee004d251504cb0f731a69a" ...
 - attr(*, "names")= chr [1:60000] "dfederergw" "nwphehurvb" "avzmvltrhn" "ecmeiasmbk" ...

> head(L.Digest)
                        dfederergw                         nwphehurvb 
"6d5861904ee004d251504cb0f731a69a" "bf8ee61f69c83468988cad681a9f7ad0" 
                        avzmvltrhn                         ecmeiasmbk 
"ba1c66af41359cf1a3f5e91f22c6dfe5" "95ca2deaa6c1118852c9ffed71994a7f" 
                        kmlcxydygl                         wpftnyrzwe 
"f3647a7937a2c484123ef33bb52a27ac" "e84f17180703e4805493d88a760be682" 
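
Note that the result above keeps the original IDs as the names of the vector, which you probably do not want if the goal is to hide them. A rough sketch of one way to tidy that up follows. The helper name obscure_ids() and the salt argument are my own illustrative assumptions, not part of the digest package. It hashes each distinct ID only once, drops the names, and passes serialize = FALSE so that the string itself is hashed rather than R's serialized form of it (so the hashes will differ from those shown above).

```r
library(digest)

# Hypothetical helper (not part of 'digest'): hash each distinct ID once,
# then map the hashes back onto the full vector.
obscure_ids <- function(ids, salt = "") {
  ids <- as.character(ids)
  u <- unique(ids)
  # serialize = FALSE hashes the raw string, not its serialized R object;
  # USE.NAMES = FALSE keeps the original IDs out of the result's names.
  h <- vapply(paste0(salt, u), digest, character(1),
              algo = "md5", serialize = FALSE, USE.NAMES = FALSE)
  h[match(ids, u)]
}

ids <- c("dfederergw", "nwphehurvb", "dfederergw")
hashed <- obscure_ids(ids, salt = "some-secret")  # salt value is a placeholder
```

Identical IDs still map to identical hashes, so merges on the obscured column keep working. The salt is there because unsalted hashes of short IDs can be recovered by simply hashing every candidate ID; whether that matters depends on your data.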


HTH,

Marc Schwartz