In collaboration with ESIP and participants at the 2016 Data Fair during the AGU Fall Meeting, listed below are several resources, tools, and guidelines to help researchers, authors, and journals in managing data throughout the data lifecycle.
Guiding principles for data and software citation:
- FORCE11 FAIR Data https://www.force11.org/group/fairgroup
- ESIP Data Citation Guidelines http://commons.esipfed.org/node/308 They include implementation examples
- Force11 Software Citation Principles https://www.force11.org/software-citation-principles We should soon see guidelines coming from this effort. Meanwhile, the principles will help with the type of information to include in a citation.
- COPDESS additional reading: http://www.copdess.org/discussions-about-copdess/ References to articles about data and software management and best practices.
COPDESS Best Practices for Journals: http://www.copdess.org/copdess-suggested-author-instructions-and-best-practices-for-journals/
Including data citations in talks: http://www.copdess.org/agu-best-practices-for-data-in-oral-presentations/
Best Practices For Repositories: http://biorxiv.org/content/early/2016/12/28/097196 A new article on “A Data Citation Roadmap for Scholarly Data Repositories” by Martin Fenner et al.
Guidelines and help for creating reproducible papers
- Towards the Geoscience Paper of the Future: Best Practices for Documenting and Sharing Research from Data to Software to Provenance. Yolanda Gil, Cédric H. David, Ibrahim Demir, Bakinam T. Essawy, Robinson W. Fulweiler, Jonathan L. Goodall, Leif Karlstrom, Huikyo Lee, Heath J. Mills, Ji‐Hyun Oh, Suzanne A. Pierce, Allen Pope, Mimi W. Tzeng, Sandra R. Villamizar, and Xuan Yu. http://dx.doi.org/10.1002/2015EA000136
- The Geoscience Paper of the Future: OntoSoft Training. Yolanda Gil. 2016. https://dx.doi.org/10.6084/m9.figshare.1586773.v9
- The Geoscience Paper of the Future Initiative: http://www.scientificpaperofthefuture.org/gpf/
- https://drclimate.wordpress.com/2016/06/16/how-to-write-a-reproducible-paper/
- https://speakerdeck.com/damienirving/how-to-write-a-reproducible-paper-2
Data Management Training
- ESIP Data Management Short Courses
- ESIP Data Management Training Clearinghouse
- DataONE Education Modules
- USGS Data Management Training
Data Management Plan information and tools
- DMPTool (US Based)
- DMPOnline (UK Based)
- http://www.iedadata.org/compliance/plan IEDA DMP tool and information
- MOOC – Coursera Course on Research Data Management https://www.coursera.org/learn/data-management
Podcasts
- Don’t Panic Geocast (com)
- Undersampled Radio (https://undersampledrad.io).
In person courses and training resources:
- Data Carpentry http://www.datacarpentry.org/
- Software Carpentry https://software-carpentry.org/
Tools to organize and link your data, code, notes, and more across your research workflow:
- Recordr – tracks provenance and publishes software, inputs, and outputs–package for R https://github.com/NCEAS/recordr and matlab https://github.com/DataONEorg/matlab-dataone
- Project Jupyter – The Jupyter Notebook is a web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, machine learning and much more. Works across 40 programming languages.
- org – Turn a GitHub repo into a collection of interactive notebooks
- Open Science Framework: https://osf.io/ and http://help.osf.io/ – A scholarly commons to connect the entire research cycle
Information on ID’s
- ORCID https://orcid.org/about/what-is-orcid/mission
- IGSN – Samples http://www.geosamples.org/igsnabout
- DOI’s for Samples – DataCite https://www.datacite.org/