Improving the classification of flood tweets with contextual hydrological information in a multimodal neural network

Jens A. de Bruijn*, Hans de Moel, Albrecht H. Weerts, Marleen C. de Ruiter, Erkan Basar, Dirk Eilander, Jeroen C.J.H. Aerts

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

While text classification can classify tweets, assessing whether a tweet is related to an ongoing flood event or not, based on its text, remains difficult. Inclusion of contextual hydrological information could improve the performance of such algorithms. Here, a multilingual multimodal neural network is designed that can effectively use both textual and hydrological information. The classification data was obtained from Twitter using flood-related keywords in English, French, Spanish and Indonesian. Subsequently, hydrological information was extracted from a global precipitation dataset based on the tweet's timestamp and locations mentioned in its text. Three experiments were performed analyzing precision, recall and F1-scores while comparing a neural network that uses hydrological information against a neural network that does not. Results showed that F1-scores improved significantly across all experiments. Most notably, when optimizing for precision the neural network with hydrological information could achieve a precision of 0.91 while the neural network without hydrological information failed to effectively optimize. Moreover, this study shows that including hydrological information can assist in the translation of the classification algorithm to unseen languages.

Original languageEnglish
Article number104485
Pages (from-to)1-9
Number of pages9
JournalComputers and Geosciences
Volume140
Early online date3 Apr 2020
DOIs
Publication statusPublished - Jul 2020

Funding

Our research was funded by an NWO-Vici grant from the Netherlands Organisation for Scientific Research (NWO; grant number 453-14-006 ) and an EU-ENHANCE grant from the European Community's Seventh Framework Programme (FP7; grant number 308438 ). The model's development was also supported by the Department for International Development and by the Global Facility for Disaster Reduction and Recovery through the Challenge Fund .

FundersFunder number
EU-ENHANCE
NWO-VICI
Netherlands Organisation for Scienti?c Research
Seventh Framework Programme308438
Department for International Development, UK Government
Department for International Development
Nederlandse Organisatie voor Wetenschappelijk Onderzoek453-14-006
Seventh Framework Programme

    Keywords

    • Authoritative data
    • Early warning
    • Floods
    • Multimodal fusion
    • Natural hazards
    • Neural networks
    • Text classification
    • Word embeddings

    Fingerprint

    Dive into the research topics of 'Improving the classification of flood tweets with contextual hydrological information in a multimodal neural network'. Together they form a unique fingerprint.

    Cite this