CIC-GIL approach to author profiling in Spanish tweets: Location and occupation

I. Markov, H. Gómez-Adorno, M. Jasso-Rosales, G. Sidorov

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

© 2018 CEUR-WS. All Rights Reserved.We present the CIC-GIL approach to the author profiling (AP) task at MEX-A3T 2018. The task consists of two subtasks: identification of authors’ location (6-way) and occupation (8-way) in a corpus of Mexican Spanish tweets. We used the logistic regression algorithm trained on typed character n-grams, function-word n-grams, and regionalisms for location identification, and typed character n-grams with several modifications for occupation identification. Our best run showed F1-macro score of 73.63% for location and 48.94% for occupation identification. The results are competitive with other participating teams; in particular, our best run was ranked fourth in the shared task.
Original languageEnglish
Title of host publicationProceedings of the 3rd Workshop on Evaluation of Human Language Technologies for Iberian Languages, IberEval 2018 - co-located with 34th Conference of the Spanish Society for Natural Language Processing, SEPLN 2018
EditorsP. Rosso, J. Carrillo-de-Albornoz, J. Gonzalo, R. Martinez, S. Montalvo
PublisherCEUR-WS
Pages97-101
Volume2150
Publication statusPublished - 2018
Externally publishedYes
Event3rd Workshop on Evaluation of Human Language Technologies for Iberian Languages, IberEval 2018 - Sevilla, Spain
Duration: 18 Sept 2018 → …

Publication series

NameCEUR Workshop Proceedings
ISSN (Print)1613-0073

Conference

Conference3rd Workshop on Evaluation of Human Language Technologies for Iberian Languages, IberEval 2018
Country/TerritorySpain
CitySevilla
Period18/09/18 → …

Fingerprint

Dive into the research topics of 'CIC-GIL approach to author profiling in Spanish tweets: Location and occupation'. Together they form a unique fingerprint.

Cite this