[R-pkg-devel] Writing a data package with large files
I have been working on making a data package. The goal is to one day push
it to CRAN,
but I am having 2 problems (one warning and one note) from R CMD. I think
the problems
are due to having large files (a 453M csv.7z raw file and a 75M .rda file)
Below is my R CMD check.
?? R CMD check results ????????????????????????????????? cfsales 0.0.0.9000
????
Duration: 2m 0.9s
? checking data for ASCII and uncompressed saves ... WARNING
Warning: package needs dependence on R (>= 2.10)
? checking installed package size ... NOTE
installed size is 133.1Mb
sub-directories of 1Mb or more:
data 133.0Mb
0 errors ? | 1 warning ? | 1 note ?
Error: R CMD check found WARNINGs
Execution halted
Exited with status 1.
1. WARNING - Warning: package needs dependence on R (>= 2.10). I am not
sure where to start looking to fix this problem.
2. NOTE - I have two big files that I think are flagging this note.
train.rda and train.csv.7z. Is there any guidance on how
to deal with large files?
Additionally, train.rda is just a sample of the full data. The original
data has 52 stores and I only
take 13 stores. If I were to take the full set then R CMD check throws an
error as opposed the the current note I am getting now. If there is a way
to use the full data set without getting an error I would love to hear it.
The link to this package is here
https://github.com/alexhallam/cfsales
The location of the problem files are here
https://github.com/alexhallam/cfsales/tree/master/data-raw
Thanks! -Alex [[alternative HTML version deleted]]