TY - GEN
T1 - Estimating the time between twitter messages and future events
AU - Hürriyetoʇlu, Ali
AU - Kunneman, Florian
AU - Van Den Bosch, Antal
PY - 2013/1/1
Y1 - 2013/1/1
N2 - We describe and test three methods to estimate the remain-ing time between a series of microtexts (tweets) and the future event they refer to via a hashtag. Our system gener-ates hourly forecasts. A linear and a local regression-based approach are applied to map hourly clusters of tweets directly onto time-to-event. To take changes over time into account, we develop a novel time series analysis approach that first derives word frequency time series from sets of tweets and then performs local regression to predict time- to-event from nearest-neighbor time series. We train and test on a single type of event, Dutch premier league foot- ball matches. Our results indicate that in an 'early' stage, four days or more before the event, the time series analysis produces time-to-event predictions that are about one day off; closer to the event, local regression attains a similar ac-curacy. Local regression also outperforms both mean and median-based baselines, but on average none of the tested system has a consistently strong performance through time.
AB - We describe and test three methods to estimate the remain-ing time between a series of microtexts (tweets) and the future event they refer to via a hashtag. Our system gener-ates hourly forecasts. A linear and a local regression-based approach are applied to map hourly clusters of tweets directly onto time-to-event. To take changes over time into account, we develop a novel time series analysis approach that first derives word frequency time series from sets of tweets and then performs local regression to predict time- to-event from nearest-neighbor time series. We train and test on a single type of event, Dutch premier league foot- ball matches. Our results indicate that in an 'early' stage, four days or more before the event, the time series analysis produces time-to-event predictions that are about one day off; closer to the event, local regression attains a similar ac-curacy. Local regression also outperforms both mean and median-based baselines, but on average none of the tested system has a consistently strong performance through time.
KW - Event prediction
KW - Time series analysis
KW - Twitter
UR - http://www.scopus.com/inward/record.url?scp=84921902139&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84921902139&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84921902139
VL - 986
T3 - CEUR Workshop Proceedings
SP - 20
EP - 23
BT - 13th Dutch-Belgian Workshop on Information Retrieval, DIR 2013
T2 - 13th Dutch-Belgian Workshop on Information Retrieval, DIR 2013
Y2 - 26 April 2013
ER -