Proteome Amino Acid Compositional Variation is Uniform Across 85 Species from the Three Domains of Life
DOI:
https://doi.org/10.51094/jxiv.2928キーワード:
Proteome、 Amino acid composition、 Intra-specific variation、 Jensen-Shannon distance、 Mutual Constraint Model抄録
Amino acid composition is a primary determinant of protein function; thus, the compositional variation within a proteome inherently defines its functional diversity. Quantifying this intra-proteomic variation and comparing it across species therefore addresses a fundamental question in genomic biology. Here, we evaluated this variation using the Jensen-Shannon (JS) distance across 85 species, comprising 81 species from the Reference Proteomes list and four additional species selected for their extreme genomic GC content. We calculated the JS distance between the amino acid composition of individual proteins and the corresponding proteome-wide average. Our results reveal that the distribution of this variation is remarkably uniform in both size and shape across all three domains of life. Given that a parallel analysis of gene nucleotide compositional variation showed no such uniformity, our findings suggest a universal selective pressure acting to govern amino acid compositional diversity within individual proteomes.
利益相反に関する開示
The author declare no conflicts of interest associated with this manuscript.ダウンロード *前日までの集計結果を表示します
引用文献
James, J. E., & Lascoux, M. (2025). Amino Acid Properties, Substitution Rates, and the Nearly Neutral Theory. Genome Biology and Evolution, 17(3). https://doi.org/10.1093/gbe/evaf025
Esumi, G. (2022). Proteome and cellular amino acid compositions may be mutually constrained and in a state of narrow convergence [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.95
Esumi, G. (2023). The distributions of amino acid compositions of proteins in an organism’s proteome uniformly approximate binomial distributions [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.408
Lin, J. (1991). Divergence measures based on the Shannon entropy. IEEE Transactions on Information Theory, 37(1), 145–151. https://doi.org/10.1109/18.61115
Endres, D. M., & Schindelin, J. E. (2003). A new metric for probability distributions. IEEE Transactions on Information Theory, 49(7), 1858–1860. https://doi.org/10.1109/TIT.2003.813506
Doud, M. B., Ashenberg, O., & Bloom, J. D. (2015). Site-Specific Amino Acid Preferences Are Mostly Conserved in Two Closely Related Protein Homologs. Molecular Biology and Evolution, 32(11), 2944–2960. https://doi.org/10.1093/molbev/msv167
Sueoka, N. (1961). Compositional Correlation between Deoxyribonucleic Acid and Protein. Cold Spring Harbor Symposia on Quantitative Biology, 26, 35–43. https://doi.org/10.1101/SQB.1961.026.01.009
EMBL-EBI. (2024). Reference Proteomes (Release 2024_02) [Database]. Retrieved January 20, 2026, from https://www.ebi.ac.uk/reference_proteomes/
National Center for Biotechnology Information (n.d.). National Center for Biotechnology Information. U.S. National Library of Medicine. Retrieved January 20, 2026, from https://www.ncbi.nlm.nih.gov/
Esumi, G. (2025). Chicken Eggs Are a Practical and Common Exome-Matched Diet for Multicellular Eukaryotic Organisms [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.1056
Esumi, G. (2023). The Total Body Amino Acid Composition of an Animal Could Be Explained by a Mixture of the Average Composition of Its Proteome as an Intracellular Composition Estimate and the Type I Collagen Composition as an Extracellular Composition Estimate [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.424
ダウンロード
公開済
投稿日時: 2026-02-03 03:25:57 UTC
公開日時: 2026-03-27 06:08:18 UTC
ライセンス
Copyright(c)2026
Genshiro Esumi
この作品は、Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Licenseの下でライセンスされています。
