Abstract
We propose a method to simplify textual Twitter data into understandable networks of terms that can signify important events and their possible changes over time. The method allows for common characteristics of the networks across time periods and each period can comprise multiple unknown sub-networks. The networks are described by Gaussian graphical models and their parameter values are estimated through a Bayesian approach with a fused lasso-type prior on the precision matrices of the underlying mixtures of the sub-models. A flexible data allocation scheme is at the heart of an MCMC algorithm to recover mean and covariance parameters of the mixture components. Several implementations of the outlined estimation procedure are studied and compared based on simulated data. The procedure with the highest predictive power is used for mining tweets regarding the 2009 Iranian presidential election.
| Original language | English |
|---|---|
| Article number | e0235596 |
| Pages (from-to) | 1-28 |
| Number of pages | 28 |
| Journal | PLoS ONE |
| Volume | 15 |
| Issue number | 7 |
| Early online date | 27 Jul 2020 |
| DOIs | |
| Publication status | Published - Jul 2020 |
Funding
This research was supported by NWO-STAR grant 613.009.014 from the Netherlands Organization for Scientific Research.
| Funders | Funder number |
|---|---|
| NWO-STAR | |
| ???publication-publication-funding-organisation-not-added??? | 613.009.014 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Fingerprint
Dive into the research topics of 'Analysis of Twitter data with the Bayesian fused graphical lasso'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver