TY - JOUR
T1 - Towards a database for genotype-phenotype association research: mining data from encyclopaedia
AU - Pajić, V.S.
AU - Pavlović-Lažetić, G.M.
AU - Beljanski, M.V.
AU - Brandt, B.W.
AU - Pajić, M.B.
PY - 2013
Y1 - 2013
N2 - To associate phenotypic characteristics of an organism with molecules encoded by its genome, there is a need for well-structured genotype and phenotype data. We use a novel method for extracting data on phenotype and genotype characteristics of microorganisms from text. As a resource, we use an encyclopedia of microorganisms, which holds phenotypic and genotypic data, and create a structured, flexible data resource containing genotype and phenotype data for 2412 species and 873 genera of microbes, which can be exported to a range of database formats. This data source has great potential as a resource for future biological research on genotype-phenotype associations. In this paper, we focus on describing the structure and content of the resulting database and on evaluating the method used for extracting the data. We conclude that the resulting database can be used as a reliable complementary resource for research into genotype-phenotype association.
AB - To associate phenotypic characteristics of an organism with molecules encoded by its genome, there is a need for well-structured genotype and phenotype data. We use a novel method for extracting data on phenotype and genotype characteristics of microorganisms from text. As a resource, we use an encyclopedia of microorganisms, which holds phenotypic and genotypic data, and create a structured, flexible data resource containing genotype and phenotype data for 2412 species and 873 genera of microbes, which can be exported to a range of database formats. This data source has great potential as a resource for future biological research on genotype-phenotype associations. In this paper, we focus on describing the structure and content of the resulting database and on evaluating the method used for extracting the data. We conclude that the resulting database can be used as a reliable complementary resource for research into genotype-phenotype association.
U2 - 10.1504/IJDMB.2013.053196
DO - 10.1504/IJDMB.2013.053196
M3 - Article
SN - 1748-5673
VL - 7
SP - 196
EP - 213
JO - International Journal of Data Mining and Bioinformatics
JF - International Journal of Data Mining and Bioinformatics
IS - 2
ER -