Skip to content
/ dataset-template Public template

A reusable, FAIR-aligned dataset template designed for interdisciplinary scientific datasets. Includes metadata standards, licensing, documentation guides, and FAIR statement scaffolds.

License

Notifications You must be signed in to change notification settings

RYFoR/dataset-template

Repository files navigation

📊 [Dataset Title Here]

License: CC BY 4.0 DOI: 10.5281/zenodo.XXXXXXX FAIR Data Status Made with Jupyter Open Science

⚠️ This is a template file. Please customize each section with your own dataset-specific details. Placeholder fields are marked clearly.

🧭 Overview

This dataset contains [briefly describe dataset contents: e.g. "survey responses from 1500 households across three regions in Colombia, collected during 2024"]. It was prepared using a standardized FAIR-aligned data template and includes all relevant documentation for reuse, citation, and integration into scientific workflows.

It is intended for use in [e.g., social sciences, policy research, urban planning] and is distributed under an open license to encourage transparency and collaboration.


📚 Table of Contents

  1. Overview
  2. File and Structture
  3. Key Files
  4. Usage
  5. FAIR Statement
  6. Citation
  7. Contributing
  8. License
  9. Contact

📁 Files and Structure

📦 dataset-name/
├── data/raw/                  # Original data files
├── docs/                      # Documentation and codebook
├── notebooks/                 # Data exploration and analysis notebooks
├── catalog/                   # Metadata catalog and schema
├── FAIR/                      # FAIR machine-readable metadata (optional)
├── LICENSE
├── CITATION.cff
└── README.md

📂 Key Files

  • data/raw/*.csv: Raw dataset files (e.g., survey_data.csv)
  • docs/codebook.md: Definitions and value domains for variables
  • docs/data_dictionary.md: Column-by-column descriptions
  • docs/methodology.md: Description of how the data was collected or generated
  • docs/quality_report.md: Summary of quality and completeness checks
  • catalog/catalog.json: Machine-readable metadata following FAIR principles
  • catalog/schema.yaml: Data structure specification
  • notebooks/: Jupyter notebooks for exploration and analysis

🔍 Usage

You can explore the data using the provided Jupyter notebooks. To get started:

# Clone the repository
git clone https://github.com/ryfor/dataset-name.git
cd dataset-name

# Open Jupyter
jupyter notebook notebooks/01-explore_raw_data.ipynb

Alternatively, load the CSV into your tool of choice (Python, R, Excel, etc.)

This dataset can be integrated into institutional repositories, linked data portals, or educational platforms. The structure is designed for maximum compatibility with data catalogs, FAIR APIs, and data governance workflows


🧬 FAIR Statement

This dataset follows the FAIR data principles:

  • Findable: It is indexed and published with a DOI.
  • Accessible: All files are publicly available in open formats.
  • Interoperable: Metadata and structure follow community standards.
  • Reusable: Documentation and licensing enable long-term reuse.

Machine-readable metadata is available in the FAIR/ and catalog/ folders.

Note: Adapt to all your FAIR compliants statements to this list


📖 Citation

If you use this dataset, please cite:

@dataset{dataset_author_2025,
  author       = {Farnwick, Alberic and Kozlova, Nadezhka-Elis and Oluremi, Tundé-Maxon},
  title        = {Household Survey Dataset: Rural Infrastructure and Wellbeing, 2024},
  year         = {2025},
  publisher    = {RYFoR Research Commons},
  doi          = {10.5281/zenodo.XXXXXXX},
  url          = {https://doi.org/10.5281/zenodo.XXXXXXX}
}

📜 License

This dataset is shared under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. You are free to reuse, adapt, and distribute it with proper attribution.

This project is licensed under the CC BY 4.0 License.

🤝 Contributing

Please read CONTRIBUTING.md for details on how to contribute to this project.

🧾 Code of Conduct

This project adheres to a Code of Conduct. By participating, you are expected to uphold this standard.

📬 Contact

For questions or collaboration inquiries, contact the dataset curators listed in the CITATION.cff.

About

A reusable, FAIR-aligned dataset template designed for interdisciplinary scientific datasets. Includes metadata standards, licensing, documentation guides, and FAIR statement scaffolds.

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Packages

No packages published