Classification of cytochrome P450 1A2 inhibitors and noninhibitors by machine learning techniques

P. Vasanthanathan, O. Taboureau, C. Oostenbrink, N.P. Vermeulen, L. Olsen, F.S. Jorgensen

Research output: Contribution to Journal › Article › Academic › peer-review


The cytochrome P450 (P450) superfamily plays an important role in the metabolism of drug compounds, and it is therefore highly desirable to have models that can predict whether a compound interacts with a specific P450 isoform. In this work, we provide in silico models for classification of CYP1A2 inhibitors and noninhibitors. Training and test sets consisted of approximately 400 and 7000 compounds, respectively. Various machine learning techniques, such as binary quantitative structure-activity relationship, support vector machine (SVM), random forest, k-nearest neighbor (kNN), and decision tree methods, were used to develop in silico models based on VolSurf and Molecular Operating Environment descriptors. The best models were obtained using the SVM, random forest, and kNN methods in combination with the BestFirst variable selection method, resulting in models with 73 to 76% accuracy on the test set (Matthews correlation coefficients of 0.51 and 0.52). Finally, a decision tree model based on Lipinski's Rule-of-Five descriptors was also developed. This model predicts 67% of the compounds correctly and gives a simple and interesting insight into the issue of classification. All of the models developed in this work are fast and precise enough to be applicable for virtual screening of CYP1A2 inhibitors or noninhibitors, or can be used as simple filters in the drug discovery process. Copyright © 2009 by The American Society for Pharmacology and Experimental Therapeutics.
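The abstract reports both accuracy and the Matthews correlation coefficient (MCC) for the binary inhibitor/noninhibitor classification. As a minimal sketch of how these two metrics relate to a confusion matrix, the following Python example computes them from scratch. The labels are a made-up toy example, not data from the study, and the function names are illustrative assumptions rather than anything used by the authors.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (1 = inhibitor, 0 = noninhibitor)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of compounds classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_corrcoef(tp, tn, fp, fn):
    """MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Convention: MCC is taken as 0 when any marginal count is zero.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical toy labels, NOT taken from the paper.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]

tp, tn, fp, fn = confusion_counts(y_true, y_pred)
acc = accuracy(tp, tn, fp, fn)
mcc = matthews_corrcoef(tp, tn, fp, fn)
print(f"accuracy = {acc:.2f}, MCC = {mcc:.2f}")
```

Unlike accuracy, the MCC uses all four cells of the confusion matrix, which makes it a useful companion metric when the two classes are unevenly represented.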
Original language: English
Pages (from-to): 658-64
Journal: Drug Metabolism and Disposition
Issue number: 3
Publication status: Published - 2009

