TY - JOUR
T1 - Towards FAIR protocols and workflows
T2 - The OpenPREDICT use case
AU - Celebi, Remzi
AU - Moreira, Joao Rebelo
AU - Hassan, Ahmed A.
AU - Ayyar, Sandeep
AU - Ridder, Lars
AU - Kuhn, Tobias
AU - Dumontier, Michel
PY - 2020/9/21
Y1 - 2020/9/21
N2 - It is essential for the advancement of science that researchers share, reuse and reproduce each other's workflows and protocols. The FAIR principles are a set of guidelines that aim to maximize the value and usefulness of research data, and emphasize the importance of making digital objects findable and reusable by others. The question of how to apply these principles not just to data but also to the workflows and protocols that consume and produce them is still under debate and poses a number of challenges. In this paper we describe a two-fold approach of simultaneously applying the FAIR principles to scientific workflows as well as the involved data. We apply and evaluate our approach on the case of the PREDICT workflow, a highly cited drug repurposing workflow. This includes FAIRification of the involved datasets, as well as applying semantic technologies to represent and store data about the detailed versions of the general protocol, of the concrete workflow instructions, and of their execution traces. We propose a semantic model to address these specific requirements and was evaluated by answering competency questions. This semantic model consists of classes and relations from a number of existing ontologies, including Workflow4ever, PROV, EDAM, and BPMN. This allowed us then to formulate and answer new kinds of competency questions. Our evaluation shows the high degree to which our FAIRified OpenPREDICT workflow now adheres to the FAIR principles and the practicality and usefulness of being able to answer our new competency questions.
AB - It is essential for the advancement of science that researchers share, reuse and reproduce each other's workflows and protocols. The FAIR principles are a set of guidelines that aim to maximize the value and usefulness of research data, and emphasize the importance of making digital objects findable and reusable by others. The question of how to apply these principles not just to data but also to the workflows and protocols that consume and produce them is still under debate and poses a number of challenges. In this paper we describe a two-fold approach of simultaneously applying the FAIR principles to scientific workflows as well as the involved data. We apply and evaluate our approach on the case of the PREDICT workflow, a highly cited drug repurposing workflow. This includes FAIRification of the involved datasets, as well as applying semantic technologies to represent and store data about the detailed versions of the general protocol, of the concrete workflow instructions, and of their execution traces. We propose a semantic model to address these specific requirements and was evaluated by answering competency questions. This semantic model consists of classes and relations from a number of existing ontologies, including Workflow4ever, PROV, EDAM, and BPMN. This allowed us then to formulate and answer new kinds of competency questions. Our evaluation shows the high degree to which our FAIRified OpenPREDICT workflow now adheres to the FAIR principles and the practicality and usefulness of being able to answer our new competency questions.
KW - Drug repurposing
KW - FAIR data principles
KW - FAIR workflows
KW - Ontology-driven healthcare
KW - Reproducibility
KW - Research object
KW - Scientific workflows and protocols
KW - Semantic web
UR - http://www.scopus.com/inward/record.url?scp=85092652339&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85092652339&partnerID=8YFLogxK
U2 - 10.7717/PEERJ-CS.281
DO - 10.7717/PEERJ-CS.281
M3 - Article
AN - SCOPUS:85092652339
SN - 2376-5992
VL - 6
SP - 1
EP - 29
JO - PeerJ Computer Science
JF - PeerJ Computer Science
M1 - 281
ER -