tladeras’s Twitter Archive—№ 2,734

Bryan Mayer talking about DataPackageR and the trials of sharing data. Lots of issues, hard to notate final final versions of data... #Cascadiarconf
Permalink On twitter.com ♻️ 3 Retweets ❤️ 4 Favorites 2019 Jun 8 Mood -1 🙁

…in reply to @tladeras
DataPackageR: Versionable data, with processing scripts, and processing data. An @rOpenSci joint. #Cascadiarconf
Permalink On twitter.com ❤️ 2 Favorites 2019 Jun 8 Mood 0

…in reply to @tladeras
@rOpenSci Good for mature workflows using version control, and packaging multiple datasets. Not for really large data. Data is shared as R package, easy to load in data as you need it. #Cascadiarconf
Permalink On twitter.com 2019 Jun 8 Mood +7 🙂

…in reply to @tladeras
@rOpenSci Configuration using datapackage_skeleton() - YAML file controls package building process. data-raw/ folder houses user code for data. #Cascadiarconf
Permalink On twitter.com 2019 Jun 8 Mood 0

…in reply to @tladeras
[editorial note: I was a reviewer on DataPackageR for Gates open sci, and we use DataPackageR for our work]
Permalink On twitter.com 2019 Jun 8 Mood 0

…in reply to @tladeras
Convenience functions exist for adding/removing data objects in the package. Store data processing code in data-raw/ and access using project_extdata_path() #Cascadiarconf
Permalink On twitter.com 2019 Jun 8 Mood 0

…in reply to @tladeras
Once everything is set, build package and fill out documentation. Goes through datasets and creates documentation. Fill out documentation, and then use devtools::document(). Then you can build as a normal R package. #Cascadiarconf
On twitter.com 2019 Jun 8 Mood 0