tladeras’s Twitter Archive, № 2,728

  1. Bryan Mayer is talking about DataPackageR and the trials of sharing data. Lots of issues; it's hard to mark the "final final" version of a dataset... #Cascadiarconf
    1. …in reply to @tladeras
      DataPackageR: Versionable data, packaged with its processing scripts and the processed data. An @rOpenSci joint. #Cascadiarconf
      1. …in reply to @tladeras
        @rOpenSci Good for mature workflows that use version control and package multiple datasets. Not for really large data. Data is shared as an R package, so it's easy to load data as you need it. #Cascadiarconf
        1. …in reply to @tladeras
          @rOpenSci Set up with datapackage_skeleton(); a YAML file controls the package build process. The data-raw/ folder houses the user's data processing code. #Cascadiarconf
          1. …in reply to @tladeras
            [editorial note: I was a reviewer on DataPackageR for Gates open sci, and we use DataPackageR for our work]
            1. …in reply to @tladeras
              Convenience functions exist for adding/removing data objects in the package. Store data processing code in data-raw/; from those scripts, reach raw files under inst/extdata/ using project_extdata_path() #Cascadiarconf
              1. …in reply to @tladeras
                Once everything is set, build the package. The build goes through the datasets and creates documentation stubs. Fill those out, then run devtools::document(). After that you can build it as a normal R package. #Cascadiarconf
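
The workflow in this thread can be sketched roughly as below. This is a minimal sketch modeled on the package README, not the speaker's own code. It assumes DataPackageR (plus rmarkdown and pandoc) is installed, and it uses the example script subsetCars.Rmd that ships with the package. The package name mtcars20 and the object name cars_over_20 are purely illustrative.

```r
library(DataPackageR)

# Example processing script bundled with DataPackageR (illustrative input).
processing_code <- system.file(
  "extdata", "tests", "subsetCars.Rmd",
  package = "DataPackageR"
)

# Create the package skeleton in a temporary directory. This writes the
# YAML configuration (datapackage.yml) and copies the script into data-raw/.
datapackage_skeleton(
  name = "mtcars20",               # illustrative package name
  path = tempdir(),
  code_files = processing_code,
  r_object_names = "cars_over_20", # object the script is expected to create
  force = TRUE
)

pkg_dir <- file.path(tempdir(), "mtcars20")

# Run the processing code and build the data package.
package_build(pkg_dir)
```

After the build, the generated documentation stubs still need to be filled in by hand before running devtools::document(), as the thread describes.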