Abstract
Machine learning applications in the chemical sciences, especially when based on neural networks, critically depend on the availability of large quantities of high-quality data. As they provide excellent accuracy for both charged and neutral excitations, a large dataset containing quasiparticle self-consistent GW (qsGW) and Bethe-Salpeter equation (BSE) data would be highly desirable to model excited state energies and properties. In this work, we introduce a dataset for qsGW-BSE excitation energies and qsGW quasiparticle energies of unprecedented size. Our dataset, denoted QM9GWBSE, supplies GW-BSE singlet-singlet and singlet-triplet excitation energies, corresponding transition dipole moments and oscillator strengths as well as qsGW quasiparticle energies for all molecules from the popular QM9 dataset. We anticipate that QM9GWBSE will provide a solid foundation to train highly accurate machine learning models for the prediction of molecular excited state properties.
| Original language | English |
|---|---|
| Article number | 643 |
| Pages (from-to) | 1-8 |
| Number of pages | 8 |
| Journal | Scientific Data |
| Volume | 13 |
| Issue number | 1 |
| Early online date | 10 Mar 2026 |
| DOIs | |
| Publication status | Published - 2026 |
Bibliographical note
Publisher Copyright:© The Author(s) 2026.
Fingerprint
Dive into the research topics of 'qsGW quasiparticle and GW-BSE excitation energies of 133,885 molecules'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver