Abstract
Most common methods for automatic text analysis in communication science ignore syntactic information, focusing on the occurrence and co-occurrence of individual words, and sometimes n-grams. This is remarkably effective for some purposes, but poses a limitation for fine-grained analyses into semantic relations such as who does what to whom and according to what source. One tested, effective method for moving beyond this bag-of-words assumption is to use a rule-based approach for labeling and extracting syntactic patterns in dependency trees. Although this method can be used for a variety of purposes, its application is hindered by the lack of dedicated and accessible tools. In this paper we introduce the rsyntax R package, which is designed to make working with dependency trees easier and more intuitive for R users, and provides a framework for combining multiple rules for reliably extracting useful semantic relations.
| Original language | English |
|---|---|
| Pages (from-to) | 180-194 |
| Number of pages | 15 |
| Journal | Computational Communication Research |
| Volume | 3 |
| Issue number | 2 |
| Early online date | 1 Oct 2021 |
| DOIs | |
| Publication status | Published - Oct 2021 |
Fingerprint
Dive into the research topics of 'Extracting semantic relations using syntax: An R package for querying and reshaping dependency trees'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver