TY - JOUR
T1 - SeRenDIP: SEquential REmasteriNg to DerIve profiles for fast and accurate predictions of PPI interface positions
AU - Hou, Qingzhen
AU - De Geest, Paul F. G.
AU - Griffioen, Christian J.
AU - Abeln, Sanne
AU - Heringa, Jaap
AU - Feenstra, K. Anton
N1 - © The Author(s) (2019). Published by Oxford University Press. All rights reserved. For Permissions, please email: [email protected].
PY - 2019/11
Y1 - 2019/11
N2 - MOTIVATION: Interpretation of ubiquitous protein sequence data has become a bottleneck in biomolecular research, due to a lack of structural and other experimental annotation data for these proteins. Prediction of protein interaction sites from sequence may be a viable substitute. We therefore recently developed a sequence-based random-forest method for protein-protein interface prediction, which yielded a significantly increased performance than other methods on both homomeric and heteromeric protein-protein interactions. Here we present a webserver that implements this method efficiently.RESULTS: With the aim of accelerating our previous approach, we obtained sequence conservation profiles by re-mastering the alignment of homologous sequences found by PSI-BLAST. This yielded a more than ten-fold speedup and at least the same accuracy, as reported previously for our method; these results allowed us to offer the method as a webserver. The web-server interface is targeted to the non-expert user. The input is simply a sequence of the protein of interest, and the output a table with scores indicating the likelihood of having an interaction interface at a certain position. As the method is sequence-based and not sensitive to the type of protein interaction, we expect this webserver to be of interest to many biological researchers in academia and in industry.AVAILABILITY: Webserver, source code and datasets are available at www.ibi.vu.nl/programs/serendipwww/.
AB - MOTIVATION: Interpretation of ubiquitous protein sequence data has become a bottleneck in biomolecular research, due to a lack of structural and other experimental annotation data for these proteins. Prediction of protein interaction sites from sequence may be a viable substitute. We therefore recently developed a sequence-based random-forest method for protein-protein interface prediction, which yielded a significantly increased performance than other methods on both homomeric and heteromeric protein-protein interactions. Here we present a webserver that implements this method efficiently.RESULTS: With the aim of accelerating our previous approach, we obtained sequence conservation profiles by re-mastering the alignment of homologous sequences found by PSI-BLAST. This yielded a more than ten-fold speedup and at least the same accuracy, as reported previously for our method; these results allowed us to offer the method as a webserver. The web-server interface is targeted to the non-expert user. The input is simply a sequence of the protein of interest, and the output a table with scores indicating the likelihood of having an interaction interface at a certain position. As the method is sequence-based and not sensitive to the type of protein interaction, we expect this webserver to be of interest to many biological researchers in academia and in industry.AVAILABILITY: Webserver, source code and datasets are available at www.ibi.vu.nl/programs/serendipwww/.
UR - http://www.scopus.com/inward/record.url?scp=85074965924&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074965924&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btz428
DO - 10.1093/bioinformatics/btz428
M3 - Article
C2 - 31116381
SN - 1367-4803
VL - 35
SP - 4794
EP - 4796
JO - Bioinformatics
JF - Bioinformatics
IS - 22
ER -