Uncovering the Statistical Trends of Protein Evolution with AlphaFold Database
Author:
Affiliation:

Funding:

This work is supported by Natural Science Foundation of the Jiangsu Higher Education Institutions of China (22KJD14005) and Early Career Scheme (22302723) from Research Grants Council of Hong Kong

Ethical statement:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
    Abstract:

    AlphaFold, which is developed by DeepMind, has made amazing advances in predicting protein structures for life sciences research. Using the vast structural predictions made possible by AlphaFold, a database of over 200 million proteins has been established. Such a database covers the complete proteomes of many organisms. This review outlines the most recent progresses in exploring protein evolution using statistical physical methods based on the AlphaFold database. Traditional protein evolution research often concentrates on the sequences or structures of proteins within the same family, using a narrow microscopic approach. With the new emergence of extensive protein structure predictions by AlphaFold, whereas scientists can expand their horizons to include vast assortments of proteins to make parallels with all proteins in different species and extract statistical trends through macroscopic observation. By comparing the proteins with similar chain lengths in over 40 model organisms, the statistical trends in protein evolution are discovered. For organisms with higher complexity, their constituent proteins present larger radii of gyration, higher flexibility, and higher segregation of hydrophobic and hydrophilic residues in both spatial and sequence. It is also validated by statistical physics analysis that higher organismal complexity correlates with higher functional specialization of constituent proteins. The findings in these studies connect molecular evolution to organism evolution, contributing to the understanding of the origin and evolution of lives.

    Reference
    Related
    Cited by
Get Citation

XIA Chenliang, TANG Qianyuan. Uncovering the Statistical Trends of Protein Evolution with AlphaFold Database[J]. Journal of Integration Technology,2024,13(2):74-88

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
History
  • Received:September 12,2023
  • Revised:September 12,2023
  • Adopted:November 23,2023
  • Online: November 23,2023
  • Published: