We present INCA (short for INfrastructure for Content Analysis), a Python module for collecting, storing, processing, and analyzing a wide variety of media content, including but not limited to news, political debates, social media, forums, and customer reviews. Using Elasticsearch as a database backend and Celery for task management, it makes automated content analysis scalable. INCA's main objective is to enable and promote an integrated workflow. INCA focuses on re-usability of data, processors, and analyses; making all steps of automated content analysis (ACA) accessible to social scientists, without requiring advanced programming skills. Here, we present the aim, implementation and recommended workflow for INCA.
|Name||Proceedings - IEEE 14th International Conference on eScience, e-Science 2018|
|Conference||14th IEEE International Conference on eScience, e-Science 2018|
|Period||29/10/18 → 1/11/18|
- Automated content analysis
- Communication science
- Python module
- Social science