TY - JOUR
T1 - A Heuristic Approach to Author Name Disambiguation in Bibliometrics Databases for Large-scale Research Assessments
AU - D'Angelo, C.A.
AU - Giuffrida, C.
AU - Abramo, G.
PY - 2011
Y1 - 2011
N2 - National exercises for the evaluation of research activity by universities are becoming regular practice in ever more countries. These exercises have mainly been conducted through the application of peer-review methods. Bibliometrics has not been able to offer a valid large-scale alternative because of almost overwhelming difficulties in identifying the true author of each publication. We will address this problem by presenting a heuristic approach to author name disambiguation in bibliometric datasets for large-scale research assessments. The application proposed concerns the Italian university system, comprising 80 universities and a research staff of over 60,000 scientists. The key advantage of the proposed approach is the ease of implementation. The algorithms are of practical application and have considerably better scalability and expandability properties than state-of-the-art unsupervised approaches. Moreover, the performance in terms of precision and recall, which can be further improved, seems thoroughly adequate for the typical needs of large-scale bibliometric research assessments. © 2010 ASIS&T.
AB - National exercises for the evaluation of research activity by universities are becoming regular practice in ever more countries. These exercises have mainly been conducted through the application of peer-review methods. Bibliometrics has not been able to offer a valid large-scale alternative because of almost overwhelming difficulties in identifying the true author of each publication. We will address this problem by presenting a heuristic approach to author name disambiguation in bibliometric datasets for large-scale research assessments. The application proposed concerns the Italian university system, comprising 80 universities and a research staff of over 60,000 scientists. The key advantage of the proposed approach is the ease of implementation. The algorithms are of practical application and have considerably better scalability and expandability properties than state-of-the-art unsupervised approaches. Moreover, the performance in terms of precision and recall, which can be further improved, seems thoroughly adequate for the typical needs of large-scale bibliometric research assessments. © 2010 ASIS&T.
UR - https://www.scopus.com/pages/publications/78951494661
UR - https://www.scopus.com/inward/citedby.url?scp=78951494661&partnerID=8YFLogxK
U2 - 10.1002/asi.21460
DO - 10.1002/asi.21460
M3 - Article
SN - 1532-2882
VL - 62
JO - Journal of the American Society for Information Science and Technology
JF - Journal of the American Society for Information Science and Technology
IS - 2
ER -