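System ID U0026-0812200915375894
Title (Chinese) 以溶劑可接觸面積預測結果為基礎分析蛋白質之間的交互作用
Title (English) Analyzing Protein-Protein Interactions based on Predicted Accessible Surface Area
Institution National Cheng Kung University
Department (Chinese) 電機工程學系碩博士班
Department (English) Department of Electrical Engineering
Academic year 97 (ROC calendar; 2008-2009)
Semester 2
Year of publication 98 (ROC calendar; 2009)
Student (Chinese) 許育堂
Student (English) Yu-Tang Syu
Student ID N2696195
Degree Master's
Language Chinese
Pages 44
Oral defense committee Advisor - 張天豪
Committee member - 吳謂勝
Committee member - 魏嘉玲
Committee member - 陳倩瑜
Committee member - 黃乾綱
Committee member - 蘇中才
Keywords (Chinese) 蛋白質之間交互作用 (protein-protein interaction)  溶劑可接觸面積 (solvent accessible surface area)
Keywords (English) accessible surface area (ASA)  protein-protein interaction (PPI)  solvent accessibility
Subject classification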
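Abstract (translated from Chinese) Understanding the mechanisms of protein-protein interaction helps us construct protein interaction networks and clarify how biological systems operate. Previous studies have shown that protein interactions can be predicted effectively by analyzing the primary structure of proteins, that is, the amino acid sequence. Compared with methods that require additional information (such as protein structure or gene expression), these sequence-based methods apply to a broader range of cases.
This thesis proposes a new sequence-based method for predicting protein-protein interactions. The method rests on the observation that protein-protein interactions are more strongly related to amino acids at the surface than to those in the core, so it takes protein surface information into account during prediction. To retain the advantage of using sequence data only, this study uses a sequence-based accessible surface area (ASA) predictor to determine the protein surface. Experimental results show that surface information does help predict protein-protein interactions. This study also analyzes the difference between the protein surface predicted by the ASA predictor and the protein surface obtained from structures. The results show that, when predicting protein interactions, the accuracy obtained with the surface from the ASA predictor is only slightly lower than that obtained with the surface derived from structures.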
Abstract (English) This study demonstrates a method for predicting protein-protein interactions, a task that is essential for constructing protein interaction networks and that helps researchers understand the general principles of biological systems. Previous studies have shown that interacting protein pairs can be predicted from their primary structures. These sequence-based methods apply more broadly than methods that require additional information beyond protein sequences.
This work presents a novel sequence-based method built on the assumption that protein-protein interactions are more closely related to amino acids at the surface than to those in the core. It uses an accessible surface area (ASA) predictor to determine the protein surface. The predicted surface information helps to predict protein-protein interactions. This study also compares the performance obtained with the surface predicted by the ASA predictor against that obtained with the surface derived from structures.
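The surface-filtering idea in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the thesis's actual pipeline: the 0.25 relative-ASA cutoff, the amino acid composition encoding, and the names `surface_residues`, `composition`, and `encode_pair` are all hypothetical, and the ASA values are made up rather than produced by a real predictor.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def surface_residues(sequence, relative_asa, threshold=0.25):
    """Keep residues whose predicted relative ASA is at least `threshold`.

    The threshold is an assumed value chosen for illustration only.
    """
    if len(sequence) != len(relative_asa):
        raise ValueError("sequence and ASA profile must have the same length")
    return "".join(aa for aa, asa in zip(sequence, relative_asa) if asa >= threshold)


def composition(residues):
    """Encode a residue string as a 20-dimensional frequency vector."""
    total = len(residues)
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    counts = Counter(residues)
    return [counts[aa] / total for aa in AMINO_ACIDS]


def encode_pair(seq_a, asa_a, seq_b, asa_b, threshold=0.25):
    """Concatenate the surface-only composition vectors of two proteins."""
    return (composition(surface_residues(seq_a, asa_a, threshold))
            + composition(surface_residues(seq_b, asa_b, threshold)))


# Toy input: a five-residue sequence with made-up relative ASA values.
seq = "MKTAY"
asa = [0.80, 0.10, 0.30, 0.05, 0.60]
print(surface_residues(seq, asa))  # keeps positions 0, 2 and 4
```

A vector produced by `encode_pair` could then be passed to any binary classifier trained on interacting and non-interacting pairs. Restricting the encoding to predicted surface residues is the only part taken from the abstract.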
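Table of contents Abstract
Table of Contents
List of Tables
List of Figures
Chapter 1 Introduction
Chapter 2 Related Work
2.1 Proteins
2.2 ASA
2.3 Methods for Predicting Protein-Protein Interactions
Chapter 3 Datasets and Experimental Methods
3.1 Datasets
3.2 Prediction Workflow
3.2.1 ASA Prediction
3.2.2 Protein Surface Prediction
3.2.3 Feature Encoding
3.3 Classification Tool
Chapter 4 Experimental Results and Discussion
4.1 Performance Evaluation Criteria
4.2 Effect of Different Protein Surface Definitions on Protein-Protein Interaction Prediction
4.3 Effect of Estimated versus Actual ASA Values on Protein-Protein Interaction Prediction
4.4 Evaluating the Quality of Protein Surface Region Prediction
4.5 Evaluating the Quality of Protein-Protein Interaction Interface Region Prediction
4.6 Effect of Different Data Ratios
Chapter 5 Conclusions and Future Work
5.1 Conclusions
5.2 Future Work
References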
References 1. Pellegrini, M., et al., Assigning protein functions by comparative genome analysis protein phylogenetic profiles. 2003, Google Patents.
2. Aloy, P. and R.B. Russell, InterPreTS: protein Interaction Prediction through Tertiary Structure. 2003, Oxford Univ Press. p. 161-162.
3. Aloy, P. and R.B. Russell, Interrogating protein interaction networks through structural biology. Proceedings of the National Academy of Sciences, 2002. 99(9): p. 5896.
4. Ogmen, U., et al., PRISM: protein interactions by structural matching. Nucleic acids research, 2005. 33(Web Server Issue): p. W331.
5. Marcotte, E.M., et al., Detecting protein function and protein-protein interactions from genome sequences. Science, 1999. 285(5428): p. 751.
6. Soong, T., K.O. Wrzeszczynski, and B. Rost, Physical protein-protein interactions predicted from microarrays. Bioinformatics, 2008. 24(22): p. 2608.
7. Enright, A.J., et al., Protein interaction maps for complete genomes based on gene fusion events. Nature, 1999. 402: p. 86-90.
8. Guo, Y., et al., Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences. Nucleic acids research, 2008.
9. Chen, X.W. and M. Liu, Prediction of protein-protein interactions using random decision forest framework. Bioinformatics, 2005. 21(24): p. 4394-4400.
10. Martin, S., D. Roe, and J.L. Faulon, Predicting protein-protein interactions using signature products. Bioinformatics, 2005. 21(2): p. 218-226.
11. Chou, K.C. and Y.D. Cai, Predicting protein-protein interactions from sequences in a hybridization space. Journal of proteome research, 2006. 5(2): p. 316-322.
12. Bock, J.R. and D.A. Gough, Predicting protein-protein interactions from primary structure. 2001, Oxford Univ Press. p. 455-460.
13. Shen, J., et al., Predicting protein–protein interactions based only on sequences information. Proceedings of the National Academy of Sciences, 2007. 104(11): p. 4337.
14. Pitre, S., et al., PIPE: a protein-protein interaction prediction engine based on the re-occurring short polypeptide sequences between known interacting protein pairs. BMC bioinformatics, 2006. 7(1): p. 365.
15. Ben-Hur, A. and W.S. Noble, Kernel methods for predicting protein-protein interactions. Bioinformatics, 2005. 21(Suppl 1): p. i38-i46.
16. Lo, S.L., et al., Effect of training datasets on support vector machine prediction of protein-protein interactions. Proteomics, 2005. 5(4).
17. Oyang, Y.J., et al., Data classification with radial basis function networks based on a novel kernel density estimation algorithm. IEEE transactions on neural networks, 2005. 16(1): p. 225-236.
18. Bourne, P.E. and H. Weissig, Structural bioinformatics. 2003: Wiley-liss Hoboken, NJ.
19. Jacob, E. and R. Unger, A tale of two tails: why are terminal residues of proteins exposed? Bioinformatics, 2007. 23(2): p. e225.
20. Petsko, G.A. and D. Ringe, Protein structure and function. 2008: New Science Press in association with BioMed Central.
21. Shrake, A. and J.A. Rupley, Environment and exposure to solvent of protein atoms. Lysozyme and insulin. Journal of Molecular Biology, 1973. 79(2): p. 351.
22. Bruce, A., et al., Molecular biology of the cell. New York: Garland Science, 2002.
23. Braig, K., et al., The crystal structure of the bacterial chaperonin GroEL at 2.8 Å. 1994.
24. Yusupov, M.M., et al., Crystal structure of the ribosome at 5.5 A resolution. Science, 2001. 292(5518): p. 883-896.
25. Zuiderweg, E.R.P., Mapping Protein Protein Interactions in Solution by NMR Spectroscopy. Biochemistry, 2002. 41(1): p. 1-7.
26. Gaasterland, T. and M.A. Ragan, Microbial genescapes: phyletic and functional patterns of ORF distribution among prokaryotes. Microbial and Comparative Genomics, 1998. 3(4): p. 199-217.
27. Dandekar, T., et al., Conservation of gene order: a fingerprint of proteins that physically interact. Trends in biochemical sciences, 1998. 23(9): p. 324-328.
28. Overbeek, R., et al., The use of gene clusters to infer functional coupling. 1999, National Acad Sciences. p. 2896-2901.
29. Tsoka, S. and C.A. Ouzounis, Prediction of protein interactions: metabolic enzymes are frequently involved in gene fusion. Nature genetics, 2000. 26(2): p. 141-141.
30. Pellegrini, M., et al., Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proceedings of the National Academy of Sciences, 1999. 96(8): p. 4285-8.
31. Pazos, F. and A. Valencia, Similarity of phylogenetic trees as indicator of protein-protein interaction. 2001, Oxford Univ Press. p. 609-614.
32. Gobel, U., et al., Correlated mutations and residue contacts in proteins. Proteins: Structure, Function, and Genetics, 1994. 18(4): p. 309-317.
33. Olmea, O. and A. Valencia, Improving contact predictions by the combination of correlated mutations and other sources of sequence information. Folding and Design, 1997. 2: p. 25-32.
34. Pazos, F., et al., Correlated mutations contain information about protein-protein interaction. Journal of Molecular Biology, 1997. 271(4): p. 511-523.
35. Pazos, F. and A. Valencia, In silico two-hybrid system for the selection of physically interacting protein pairs. Proteins: Structure, Function, and Genetics, 2002. 47(2): p. 219-227.
36. Nielsen, J. and S. Oliver, The next wave in metabolome analysis. Trends in Biotechnology, 2005. 23(11): p. 544-546.
37. Rajagopalan, D. and P. Agarwal, Inferring pathways from gene lists using a literature-derived network of biological relationships. Bioinformatics, 2005. 21(6): p. 788-793.
38. Zhu, J., et al., Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks. 2008.
39. Cherry, J.M., et al., SGD: Saccharomyces genome database. Nucleic acids research, 1998. 26(1): p. 73.
40. Zhu, J. and M.Q. Zhang, SCPD: a promoter database of the yeast Saccharomyces cerevisiae. Bioinformatics, 1999. 15(7): p. 607-611.
41. Wingender, E., et al., TRANSFAC: an integrated system for gene expression regulation. Nucleic acids research, 2000. 28(1): p. 316.
42. Mewes, H.W., et al., MIPS: a database for genomes and protein sequences. Nucleic acids research, 1999. 27(1): p. 44.
43. Bairoch, A., et al., The universal protein resource (UniProt). Nucleic acids research, 2005. 33(Database Issue): p. D154.
44. Artin, E., The gamma function. 1964: Holt, Rinehart and Winston.
45. Kabsch, W. and C. Sander, Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers, 1983. 22(12).
46. Kim, W.K. and J.C. Ison, Survey of the geometric association of domain-domain interfaces. Proteins: Structure, Function, and Bioinformatics, 2005. 61(4).
47. Kim, W.K., et al., The many faces of protein-protein interactions: a compendium of interface geometry. PLoS Comput Biol, 2006. 2(9): p. e124.
48. Lise, S., A. Walker-Taylor, and D.T. Jones, Docking protein domains in contact space. BMC bioinformatics, 2006. 7(1): p. 310.
49. Jones, S. and J.M. Thornton, Principles of protein-protein interactions. Proceedings of the National Academy of Sciences, 1996. 93(1): p. 13-20.
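Full-text access rights
  • On-campus browsing and printing of the electronic full text is authorized; publicly available from 2019-08-28.
  • Off-campus browsing and printing of the electronic full text is authorized; publicly available from 2019-08-28.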

