TY - GEN
T1 - The Effect of Knowledge Graph Schema on Classifying Future Research Suggestions
AU - Alivanistos, Dimitrios
AU - van der Bijl, Seth
AU - Cochez, Michael
AU - van Harmelen, Frank
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024
Y1 - 2024
N2 - The output of research doubles at least every 20 years and in most research fields the number of research papers has become overwhelming. A critical task for researchers is to find promising future directions and interesting scientific challenges in the literature. To tackle this problem, we hypothesize that structured representations of information in the literature can be used to identify these elements. Specifically, we look at structured representations in the form of Knowledge Graphs (KGs) and we investigate how using different input schemas for extraction impacts the performance on the tasks of classifying sentences as future directions. Our results show that the MECHANIC-Granular schema yields the best performance across different settings and achieves state of the art performance when combined with pretrained embeddings. Overall, we observe that schemas with limited variation in the resulting node degrees and significant interconnectedness lead to the best downstream classification performance.
AB - The output of research doubles at least every 20 years and in most research fields the number of research papers has become overwhelming. A critical task for researchers is to find promising future directions and interesting scientific challenges in the literature. To tackle this problem, we hypothesize that structured representations of information in the literature can be used to identify these elements. Specifically, we look at structured representations in the form of Knowledge Graphs (KGs) and we investigate how using different input schemas for extraction impacts the performance on the tasks of classifying sentences as future directions. Our results show that the MECHANIC-Granular schema yields the best performance across different settings and achieves state of the art performance when combined with pretrained embeddings. Overall, we observe that schemas with limited variation in the resulting node degrees and significant interconnectedness lead to the best downstream classification performance.
KW - Classification
KW - Information Extraction
KW - Scientific Discourse
KW - Scientific Knowledge Graphs
UR - http://www.scopus.com/inward/record.url?scp=85202208404&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85202208404&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-65794-8_10
DO - 10.1007/978-3-031-65794-8_10
M3 - Conference contribution
AN - SCOPUS:85202208404
SN - 9783031657931
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 149
EP - 170
BT - Natural Scientific Language Processing and Research Knowledge Graphs
A2 - Rehm, Georg
A2 - Dietze, Stefan
A2 - Schimmler, Sonja
A2 - Krüger, Frank
PB - Springer Science and Business Media Deutschland GmbH
T2 - 1st International Workshop on Natural Scientific Language Processing and Research Knowledge Graphs, NSLP 2024
Y2 - 27 May 2024 through 27 May 2024
ER -