Abstract
While text classification can classify tweets, assessing whether a tweet is related to an ongoing flood event or not, based on its text, remains difficult. Inclusion of contextual hydrological information could improve the performance of such algorithms. Here, a multilingual multimodal neural network is designed that can effectively use both textual and hydrological information. The classification data was obtained from Twitter using flood-related keywords in English, French, Spanish and Indonesian. Subsequently, hydrological information was extracted from a global precipitation dataset based on the tweet's timestamp and locations mentioned in its text. Three experiments were performed analyzing precision, recall and F1-scores while comparing a neural network that uses hydrological information against a neural network that does not. Results showed that F1-scores improved significantly across all experiments. Most notably, when optimizing for precision the neural network with hydrological information could achieve a precision of 0.91 while the neural network without hydrological information failed to effectively optimize. Moreover, this study shows that including hydrological information can assist in the translation of the classification algorithm to unseen languages.
Original language | English |
---|---|
Article number | 104485 |
Pages (from-to) | 1-9 |
Number of pages | 9 |
Journal | Computers and Geosciences |
Volume | 140 |
Early online date | 3 Apr 2020 |
DOIs | |
Publication status | Published - Jul 2020 |
Funding
Our research was funded by an NWO-Vici grant from the Netherlands Organisation for Scientific Research (NWO; grant number 453-14-006 ) and an EU-ENHANCE grant from the European Community's Seventh Framework Programme (FP7; grant number 308438 ). The model's development was also supported by the Department for International Development and by the Global Facility for Disaster Reduction and Recovery through the Challenge Fund .
Funders | Funder number |
---|---|
EU-ENHANCE | |
NWO-VICI | |
Netherlands Organisation for Scienti?c Research | |
Seventh Framework Programme | 308438 |
Department for International Development, UK Government | |
Department for International Development | |
Nederlandse Organisatie voor Wetenschappelijk Onderzoek | 453-14-006 |
Seventh Framework Programme |
Keywords
- Authoritative data
- Early warning
- Floods
- Multimodal fusion
- Natural hazards
- Neural networks
- Text classification
- Word embeddings