Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data

Artur Rocha*, Rui Camacho, Jeroen Ruwaard, Heleen Riper

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review


Introduction: Clinical trials of blended Internet-based treatments deliver a wealth of data from various sources, such as self-report questionnaires, diagnostic interviews, treatment platform log files and Ecological Momentary Assessments (EMA). Mining these complex data for clinically relevant patterns is a daunting task for which no definitive best method exists. In this paper, we explore the expressive power of the multi-relational Inductive Logic Programming (ILP) data mining approach, using combined trial data of the EU E-COMPARED depression trial. Methods: We explored the capability of ILP to handle and combine (implicit) multiple relationships in the E-COMPARED data. This data set has the following features that favor ILP analysis: 1) Time reasoning is involved; 2) there is a reasonable amount of explicit useful relations to be analyzed; 3) ILP is capable of building comprehensible models that might be perceived as putative explanations by domain experts; 4) both numerical and statistical models may coexist within ILP models if necessary. In our analyses, we focused on scores of the PHQ-8 self-report questionnaire (which taps depressive symptom severity), and on EMA of mood and various other clinically relevant factors. Both measures were administered during treatment, which lasted between 9 to 16 weeks. Results: E-COMPARED trial data revealed different individual improvement patterns: PHQ-8 scores suggested that some individuals improved quickly during the first weeks of the treatment, while others improved at a (much) slower pace, or not at all. Combining self-reported Ecological Momentary Assessments (EMA), PHQ-8 scores and log data about the usage of the ICT4D platform in the context of blended care, we set out to unveil possible causes for these different trajectories. Discussion: This work complements other studies into alternative data mining approaches to E-COMPARED trial data analysis, which are all aimed to identify clinically meaningful predictors of system use and treatment outcome. Strengths and limitations of the ILP approach given this objective will be discussed.

Original languageEnglish
Pages (from-to)176-180
Number of pages5
JournalInternet Interventions
Early online date13 Mar 2018
Publication statusPublished - Jun 2018


Publication costs of this article are financed by the ERDF – European Regional Development Fund through the Operational Programme for Competitiveness and Internationalisation – COMPETE 2020 Programme within project « POCI-01-0145-FEDER-006961 », and by National Funds through the FCT – Fundação para a Ciência e a Tecnologia (Portuguese Foundation for Science and Technology) as part of project UID/EEA/50014/2013 .

FundersFunder number
Seventh Framework Programme603098
Fundação para a Ciência e a TecnologiaUID/EEA/50014/2013
European Regional Development FundPOCI-01-0145-FEDER-006961
Programa Operacional Temático Factores de Competitividade


    • Ecological momentary assessment
    • Internet intervention
    • Log data
    • Moodbuster
    • Multi-relational data mining


    Dive into the research topics of 'Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data'. Together they form a unique fingerprint.

    Cite this