Bayesian mixture regression analysis for regulation of Pluripotency in ES cells

Mehran Aflakparast*, Geert Geeven, Mathisca C.M. De Gunst

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Background: Observed levels of gene expression strongly depend on both activity of DNA binding transcription factors (TFs) and chromatin state through different histone modifications (HMs). In order to recover the functional relationship between local chromatin state, TF binding and observed levels of gene expression, regression methods have proven to be useful tools. They have been successfully applied to predict mRNA levels from genome-wide experimental data and they provide insight into context-dependent gene regulatory mechanisms. However, heterogeneity arising from gene-set specific regulatory interactions is often overlooked. Results: We show that regression models that predict gene expression by using experimentally derived ChIP-seq profiles of TFs can be significantly improved by mixture modelling. In order to find biologically relevant gene clusters, we employ a Bayesian allocation procedure which allows us to integrate additional biological information such as three-dimensional nuclear organization of chromosomes and gene function. The data integration procedure involves transforming the additional data into gene similarity values. We propose a generic similarity measure that is especially suitable for situations where the additional data are of both continuous and discrete type, and compare its performance with similar measures in the context of mixture modelling. Conclusions: We applied the proposed method on a data from mouse embryonic stem cells (ESC). We find that including additional data results in mixture components that exhibit biologically meaningful gene clusters, and provides valuable insight into the heterogeneity of the regulatory interactions.

Original languageEnglish
Article number3
Pages (from-to)1-13
Number of pages13
JournalBMC Bioinformatics
Volume21
DOIs
Publication statusPublished - 2 Jan 2020

Keywords

  • Bayesian analysis
  • Data integration
  • Mixture regression
  • Pluripotency
  • Transcription regulation

Fingerprint

Dive into the research topics of 'Bayesian mixture regression analysis for regulation of Pluripotency in ES cells'. Together they form a unique fingerprint.

Cite this