Skip to content

Commit c7c213b

Browse files
authored
Merge pull request #4168 from ELIXIR-UK-DaSH/ELIXIR-UK-DaSH
Expand the FAIR Training Topic
2 parents ee4fc15 + 4f54e98 commit c7c213b

File tree

15 files changed

+389
-11
lines changed

15 files changed

+389
-11
lines changed

CONTRIBUTORS.yaml

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -913,11 +913,13 @@ kilpert:
913913
name: Fabian Kilpert
914914
joined: 2017-09
915915

916-
kpbioteam:
917-
name: Katarzyna Murat
916+
kkamieniecka:
917+
name: Katarzyna Kamieniecka
918+
orcid: 0009-0004-2454-5950
918919
joined: 2018-08
920+
elixir_node: uk
919921

920-
kpoterlowicz:
922+
poterlowicz-lab:
921923
name: Krzysztof Poterlowicz
922924
twitter: bioinfbrad
923925
orcid: 0000-0001-6173-5674

topics/epigenetics/tutorials/ewas-suite/slides.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,8 +3,8 @@
33
logo: "GTN"
44
title: EWAS Epigenome-Wide Association Studies Introduction
55
contributors:
6-
- kpoterlowicz
7-
- kpbioteam
6+
- kkamieniecka
7+
- poterlowicz-lab
88
---
99

1010

topics/epigenetics/tutorials/ewas-suite/tutorial.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -14,8 +14,8 @@ key_points:
1414
- "Infinium Human Methylation BeadChip is an array based technology to generate DNA methylation profiling at individual CpG loci in the human genome based on Illumina’s bead technology."
1515
- "Time and cost efficiency followed by high sample output, and overall quantitative accuracy and reproducibility made Infinium Human Methylation BeadChip one of the most widely used arrays on the market."
1616
contributors:
17-
- kpbioteam
18-
- kpoterlowicz
17+
- kkamieniecka
18+
- poterlowicz-lab
1919
---
2020
> <agenda-title></agenda-title>
2121
> In this tutorial we will do:
@@ -26,7 +26,7 @@ contributors:
2626
>
2727
{: .agenda}
2828

29-
This tutorial is based on [Hugo W, Shi H, Sun L, Piva M et al.: Non-genomic and Immune Evolution of Melanoma Acquiring MAPKi Resistance.](https://doi.org/10.1016/j.cell.2015.07.061).
29+
This tutorial is based on Hugo W, Shi H, Sun L, Piva M et al.: Non-genomic and Immune Evolution of Melanoma Acquiring MAPKi Resistance {% cite Hugo2015 %}.
3030

3131
The data we use in this tutorial are available at [Zenodo](https://zenodo.org/record/1251211).
3232

topics/fair/images/fair_gtn.png

766 KB
Loading

topics/fair/images/fair_open.png

30.8 KB
Loading

topics/fair/metadata.yaml

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,12 +5,17 @@ title: "FAIR Data, Workflows, and Research"
55
summary: "These lessons will teach you how to make your research objects more FAIR with practical, hands-on advice."
66

77
subtopics:
8+
- id: fair-data
9+
title: "FAIR Data Management"
10+
description: "The FAIR (Findable, Accessible, Interoperable, Reusable) data stewardship created the foundation for sharing and publishing digital assets. These lessons will apply to machine accessibility and emphasize that all digital assets should share data in a way that will enable maximum use and reuse."
811
- id: ro-crate
9-
title: "Make workflows fair with RO-Crate"
12+
title: "Make workflows FAIR with RO-Crate"
1013
description: "This section brought to you by the BY-COVID project, and will teach you how to make your research objects more FAIR with practical, hands-on advice."
1114

1215
maintainers:
1316
- simleo
1417
- ilveroluca
1518
- stain
1619
- pauldg
20+
- kkamieniecka
21+
- poterlowicz-lab
Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
---
2+
layout: faq-page
3+
---
Lines changed: 97 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,97 @@
1+
---
2+
layout: tutorial_hands_on
3+
title: FAIR data management solutions
4+
abbreviations:
5+
FAIR: Findable, Accessible, Interoperable, Reusable
6+
DMP: Data Management Plan
7+
PID: Persistent Identifier
8+
9+
zenodo_link: ''
10+
questions:
11+
- Is there a reproducibility crisis?
12+
- What can go wrong with data analysis?
13+
objectives:
14+
- Learn best practices in data management
15+
- Learn how to introduce computational reproducibility in your research
16+
time_estimation: "10M"
17+
key_points:
18+
- FAIR data management allows machines to automatically find and use the data accordingly.
19+
tags:
20+
- fair
21+
- dmp
22+
- data stewardship
23+
priority: 2
24+
contributions:
25+
authorship:
26+
- kkamieniecka
27+
- poterlowicz-lab
28+
editing:
29+
- hexylena
30+
subtopic: fair-data
31+
32+
requirements:
33+
- type: "internal"
34+
topic_name: fair
35+
tutorials:
36+
- fair-intro
37+
38+
---
39+
40+
41+
# Introduction
42+
43+
The FAIR (Findable, Accessible, Interoperable, Reusable) data stewardship created the foundation for sharing and publishing digital assets, especially data. This apply to machine accessibility and emphasize that all digital assets should share data in a way that will enable maximum use and reuse.
44+
45+
This tutorial is a short introduction to the FAIR data management framework. You can find out more at the [FAIR Pointers](https://elixir-uk-dash.github.io/FAIR-Pointers/ep1/index.html) online course.
46+
47+
> <agenda-title></agenda-title>
48+
>
49+
> In this tutorial, we will cover:
50+
>
51+
> 1. TOC
52+
> {:toc}
53+
>
54+
{: .agenda}
55+
56+
# Data management planning
57+
In recent years we have notice a data explosion. Number of sequence records in each release of [GenBank](https://www.ncbi.nlm.nih.gov/genbank/statistics/), from 1982 to the present, doubled in size approximately every 18 months. Great amounts of data available followed by expanding range of tools and computational solutions result in reproducibility crisis. Having **data management plan (DMP)** in place is essential to achieve FAIR data management. DMPs are often described as living documents and should be updated according to changing circumstances.
58+
59+
There are several ways to set up FAIR (Findable, Accessible, Interoperable, Reusable) data management plans (DMPs) :
60+
- Findable (F): Data description and collection or reuse of existing data
61+
- Accessible (A): Standardised authentication or authorisation (e.g. HTTP, HTTPS)
62+
- Interoperable (I): Documentation and data quality
63+
- Reusable (R): Storage and backup supported by legal and ethical requirements
64+
65+
## Data description and collection or reuse of existing data
66+
Reusing legacy datasets from institutional repositories or the digital libraries data collections can be FAIRified retrospectively. Support for data collection and development, throughout the life cycle can be provided and followed by change management and capacity improvement.
67+
68+
Multi-part FAIR research need a way of wrapping up, describing and sharing to promote the reuse of data. **Data sharing agreements** define the purpose of data sharing. Reference roles and responsibilities; specifies the purpose and legal requirements, e.g. for data security.
69+
70+
An institutional aim should be to create an integrated view and context over fragmented resources using their **persistent identifiers (PIDs)** and **metadata**. To make datasets findable, these metadata need to be as widely available as possible.
71+
72+
Enhancing reproducibility, quality and transparency by ensuring information flow and showcasing secondary use is also a part of data management. Promoting hands-on data experience and events activities built an collaborative environment for reproducible science.
73+
74+
75+
## Documentation and data quality
76+
Having access to local knowledge and encouraging best practises at the departmental level is a smart way to offer direction on a variety of standards and methods. In order to implement FAIR data practises within an institution, resources and infrastructure are needed. To increase the possibility of data reuse, several FAIR requirements can be satisfied using freely available guidelines e.g. [RDMkit](https://rdmkit.elixir-europe.org/), [FAIR Cookbook](https://faircookbook.elixir-europe.org/content/home.html), [ELIXIR-UK DaSH Fellowship](https://sites.google.com/view/navigation-portal-fellowship/home?authuser=0) initiative and repositories e.g. [Zenodo](https://zenodo.org/), [Harvard Dataverse](https://dataverse.harvard.edu/) and [figshare](https://figshare.com/).
77+
78+
## Storage and backup
79+
Systems for storage, backup and collaboration depend upon technical infrastructure.The **'3-2-1 rule'**, a recommendation for saving three copies of the research data—two locally and one off-site—is a standard backup strategy for research data. Data, metadata, and other research artefacts, such as ontologies, software, documentation, and papers, must all be kept in locations where they are adequately safeguarded, backed up, and accessible to maximise their potential for reuse. Appropriate access management is essential, in addition to backup and restoration services that protect researchers against data loss, theft, malfunctioning computers or storage media, and accidental deletion or inadvertent alterations to the data.
80+
81+
The fundamental component of infrastructure required for the FAIR research data lifecycle are repository services. They allow access to the data, a persistent identifier, and the descriptive metadata that support interoperability. Repositories can include basic data storage, resource finding, managing access and use of private information, facilitating peer review of information related to publications or services requiring digital preservation, and more.
82+
83+
The [OpenAIRE](https://www.openaire.eu/opendatapilot-repository-guide) repository guide advises users to check the availability of a suitable repository in following order:
84+
85+
1. The most effective option (if available) is to maintain the data in accordance with acknowledged discipline-specific criteria using an established, dedicated (external) data archive or repository that caters specifically to the study domain.
86+
2. Making use of institutional data repositories is the second-best option.
87+
3. If none of those options is practical, a free data repository should be used.
88+
89+
Up-to-date lists of available registered data repositories can be found at [re3data](https://www.re3data.org/) and [FAIRsharing](https://fairsharing.org/search?fairsharingRegistry=Database).
90+
91+
## Legal and ethical requirements
92+
Institutional support network (data stewards, ethics boards, IP, legal and financial offices) need to guide researchers in safeguarding data management responsibilities and resources.
93+
94+
# Conclusion
95+
You will have the advantage of saving time and resources by planning how to FAIRify your data in the early phases of your research endeavour. To put this into action, a data management strategy, or DMP, must be written. A DMP is also where you outline your data collection, storage, processing, sharing, and disposal procedures. Planning the management and FAIRification of your data reduces the possibility of issues down the road, whether they be practical, legal, or technical.
96+
97+
Keep in mind that creating FAIR data is a complex process. Consider how you can make your data FAIR one step at a time at each stage of the creation, collection, documentation, storage, sharing, archiving, and preservation processes. The framework for the rest of your study planning is laid by incorporating your data documentation. Imagine you want to use a dataset that was generated by another researcher and how you would like it to be found. Hope this quick introduction to FAIR data management solutions will help you improve not only your experience with data but also influence others by using your guidance and FAIRified resources.
Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
---
2+
layout: faq-page
3+
---
Lines changed: 69 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,69 @@
1+
@misc{wiegers_luc_2019_3593258,
2+
author = {Wiegers, Luc and
3+
van Gelder, Celia W. G.},
4+
title = {{Illustration for "Ten simple rules for making
5+
training materials FAIR"}},
6+
month = dec,
7+
year = 2019,
8+
publisher = {Zenodo},
9+
version = {1.0},
10+
doi = {10.5281/zenodo.3593258},
11+
url = {https://doi.org/10.5281/zenodo.3593258}
12+
}
13+
14+
@article{chevron2014metacognitive,
15+
title={A metacognitive tool: Theoretical and operational analysis of skills exercised in structured concept maps},
16+
author={Chevron, Marie-Pierre},
17+
journal={Perspectives in Science},
18+
doi = {10.1016/j.pisc.2014.07.001},
19+
volume={2},
20+
number={1-4},
21+
pages={46--54},
22+
year={2014},
23+
publisher={Elsevier}
24+
}
25+
26+
@article{mcmurry2017identifiers,
27+
title={Identifiers for the 21st century: How to design, provision, and reuse persistent identifiers to maximize utility and impact of life science data},
28+
author={McMurry, Julie A and Juty, Nick and Blomberg, Niklas and Burdett, Tony and Conlin, Tom and Conte, Nathalie and Courtot, M{\'e}lanie and Deck, John and Dumontier, Michel and Fellows, Donal K and others},
29+
journal={PLoS biology},
30+
doi = {10.1371/journal.pbio.2001414},
31+
volume={15},
32+
number={6},
33+
pages={e2001414},
34+
year={2017},
35+
publisher={Public Library of Science San Francisco, CA USA}
36+
}
37+
38+
@article{hiltemann2023galaxy,
39+
title={Galaxy Training: A powerful framework for teaching!},
40+
author={Hiltemann, Saskia and Rasche, Helena and Gladman, Simon and Hotz, Hans-Rudolf and Larivi{\`e}re, Delphine and Blankenberg, Daniel and Jagtap, Pratik D and Wollmann, Thomas and Bretaudeau, Anthony and Gou{\'e}, Nadia and others},
41+
journal={PLOS Computational Biology},
42+
doi = {10.1371/journal.pcbi.1010752},
43+
volume={19},
44+
number={1},
45+
pages={e1010752},
46+
year={2023},
47+
publisher={Public Library of Science San Francisco, CA USA}
48+
}
49+
50+
@misc{fair-training-materials,
51+
url = {https://training.galaxyproject.org/training-material/faqs/gtn/fair_training.html},
52+
note = {Accessed 2023-05-23},
53+
title = {How does the GTN ensure our training materials are FAIR?}
54+
}
55+
56+
@article{Garcia2020,
57+
doi = {10.1371/journal.pcbi.1007854},
58+
url = {https://doi.org/10.1371/journal.pcbi.1007854},
59+
year = {2020},
60+
month = may,
61+
publisher = {Public Library of Science ({PLoS})},
62+
volume = {16},
63+
number = {5},
64+
pages = {e1007854},
65+
author = {Leyla Garcia and B{\'{e}}r{\'{e}}nice Batut and Melissa L. Burke and Mateusz Kuzak and Fotis Psomopoulos and Ricardo Arcila and Teresa K. Attwood and Niall Beard and Denise Carvalho-Silva and Alexandros C. Dimopoulos and Victoria Dominguez del Angel and Michel Dumontier and Kim T. Gurwitz and Roland Krause and Peter McQuilton and Loredana Le Pera and Sarah L. Morgan and P\"{a}ivi Rauste and Allegra Via and Pascal Kahlem and Gabriella Rustici and Celia W. G. van Gelder and Patricia M. Palagi},
66+
editor = {Scott Markel},
67+
title = {Ten simple rules for making training materials {FAIR}},
68+
journal = {{PLOS} Computational Biology}
69+
}

0 commit comments

Comments
 (0)