Abstract
Introduction: Sharing microbiome data among researchers fosters new innovations and reduces cost for research. Practically, this means that the (meta)data will have to be standardized, transparent and readily available for researchers. The microbiome data and associated metadata will then be described with regards to composition and origin, in order to maximize the possibilities for application in various contexts of research. Here, we propose a set of tools and protocols to develop a real-time FAIR (Findable. Accessible, Interoperable and Reusable) compliant database for the handling and storage of human microbiome and host-associated data. Methods: The conflicts arising from privacy laws with respect to metadata, possible human genome sequences in the metagenome shotgun data and FAIR implementations are discussed. Alternate pathways for achieving compliance in such conflicts are analyzed. Sample traceable and sensitive microbiome data, such as DNA sequences or geolocalized metadata are identified, and the role of the GDPR (General Data Protection Regulation) data regulations are considered. For the construction of the database, procedures have been realized to make data FAIR compliant, while preserving privacy of the participants providing the data. Results and discussion: An open-source development platform, Supabase, was used to implement the microbiome database. Researchers can deploy this real-time database to access, upload, download and interact with human microbiome data in a FAIR complaint manner. In addition, a large language model (LLM) powered by ChatGPT is developed and deployed to enable knowledge dissemination and non-expert usage of the database.
| Original language | English |
|---|---|
| Article number | 1384809 |
| Pages (from-to) | 1-11 |
| Number of pages | 11 |
| Journal | Frontiers in Cellular and Infection Microbiology |
| Volume | 14 |
| Early online date | 7 May 2024 |
| DOIs | |
| Publication status | Published - 2024 |
Bibliographical note
Publisher Copyright:Copyright © 2024 Dorst, Zeevenhooven, Wilding, Mende, Brandt, Zaura, Hoekstra and Sheraton.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This publication is part of the project \u201CMETAHEALTH: Health in a microbial, sociocultural and care context in the first 1000 days of life\u201D (project number NWA.1389.20.080) of the research program Dutch Research Agenda - Research along Routes by Consortia (NWA-ORC) 2020/21 which is (partly) financed by the Dutch Research Council (NWO).
| Funders | Funder number |
|---|---|
| Dutch Research Agenda - Research along Routes by Consortia | |
| NWA-ORC | |
| ???publication-publication-funding-organisation-not-added??? | NWA.1389.20.080 |
Keywords
- (meta)data
- database
- fair principles
- general data protection regulation (GDPR)
- microbiome
- pseudonymize
- real-time
Fingerprint
Dive into the research topics of 'FAIR compliant database development for human microbiome data samples'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver