A protein-protein interaction classifier based on a probabilistic neural network (Predicting type of protein-protein interaction based on probabilistic neural network)
Zhang Wei (张伟); Hao Jiangfeng (郝江锋); Xiang Junping (项俊平); Hu Maolin (胡茂林)
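Abstract:
The structural features of protein-protein interfaces are important for studying protein function. We propose a new method that extracts interface features using statistical histograms and combines the extracted features with a probabilistic neural network (PNN) to classify and predict the structural type of an interface. The prediction results show that the histogram-derived features discriminate well between interface structures. By adjusting the number of histogram bins and the way nodes are selected, the interface structure can be described at different levels of granularity to suit different research purposes, which may be instructive for other structure-related problems in bioinformatics. Classifying interface structures with a PNN avoids time-consuming structural alignment and database searches; the network trains quickly, extends easily, and is accurate, reaching an apparent accuracy of 90.67% on an independent test set of 911 protein complexes. A MATLAB classifier based on this algorithm can be obtained by contacting the authors by e-mail.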
Keywords: statistical histogram; protein interface; probabilistic neural network; structural alignment; classifier
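The pipeline summarized in the abstract (histogram features fed to a probabilistic neural network) can be illustrated with a minimal sketch. The abstract does not say which quantity is histogrammed, so this example assumes pairwise distances between interface node coordinates. The names `histogram_features` and `ProbabilisticNeuralNetwork`, the parameters `n_bins`, `max_dist`, and `sigma`, and the synthetic data are all hypothetical. It is written in Python with NumPy and is not the authors' MATLAB classifier.

```python
import numpy as np


def histogram_features(coords, n_bins=8, max_dist=20.0):
    """Normalized histogram of pairwise distances between nodes.

    Assumption: the histogrammed quantity is the inter-node distance;
    the abstract does not specify it. Distances above max_dist are ignored.
    """
    coords = np.asarray(coords, dtype=float)
    diffs = coords[:, None, :] - coords[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=-1))
    upper = np.triu_indices(len(coords), k=1)  # each pair counted once
    counts, _ = np.histogram(dists[upper], bins=n_bins, range=(0.0, max_dist))
    total = counts.sum()
    return counts / total if total > 0 else counts.astype(float)


class ProbabilisticNeuralNetwork:
    """Parzen-window classifier with a Gaussian kernel (a basic PNN)."""

    def __init__(self, sigma=0.1):
        self.sigma = sigma  # kernel width (hypothetical default)

    def fit(self, X, y):
        # "Training" only stores the patterns, which is why a PNN trains fast.
        self.X_ = np.asarray(X, dtype=float)
        self.y_ = np.asarray(y)
        self.classes_ = np.unique(self.y_)
        return self

    def predict(self, X):
        X = np.asarray(X, dtype=float)
        # Pattern layer: Gaussian kernel between each input and each stored pattern.
        sq = ((X[:, None, :] - self.X_[None, :, :]) ** 2).sum(axis=-1)
        kernel = np.exp(-sq / (2.0 * self.sigma ** 2))
        # Summation layer: average kernel response per class.
        scores = np.stack(
            [kernel[:, self.y_ == c].mean(axis=1) for c in self.classes_], axis=1
        )
        # Decision layer: pick the class with the largest estimated density.
        return self.classes_[scores.argmax(axis=1)]


if __name__ == "__main__":
    # Synthetic stand-in data: compact vs. extended point clouds.
    rng = np.random.default_rng(0)

    def make_set(n_per_class):
        feats, labels = [], []
        for label, scale in enumerate((2.0, 5.0)):
            for _ in range(n_per_class):
                cloud = rng.normal(0.0, scale, size=(30, 3))
                feats.append(histogram_features(cloud))
                labels.append(label)
        return np.array(feats), np.array(labels)

    X_train, y_train = make_set(20)
    X_test, y_test = make_set(10)
    model = ProbabilisticNeuralNetwork(sigma=0.1).fit(X_train, y_train)
    print("accuracy on synthetic data:", (model.predict(X_test) == y_test).mean())
```

Changing `n_bins` coarsens or refines the description of the structure, which mirrors the granularity control mentioned in the abstract. Because fitting only stores the training patterns, adding new classes or examples does not require retraining.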