プレプリント / バージョン1

Proteome Amino Acid Compositional Variation is Uniform Across 85 Species from the Three Domains of Life

##article.authors##

DOI:

https://doi.org/10.51094/jxiv.2928

キーワード:

Proteome、 Amino acid composition、 Intra-specific variation、 Jensen-Shannon distance、 Mutual Constraint Model

抄録

Amino acid composition is a primary determinant of protein function; thus, the compositional variation within a proteome inherently defines its functional diversity. Quantifying this intra-proteomic variation and comparing it across species therefore addresses a fundamental question in genomic biology. Here, we evaluated this variation using the Jensen-Shannon (JS) distance across 85 species, comprising 81 species from the Reference Proteomes list and four additional species selected for their extreme genomic GC content. We calculated the JS distance between the amino acid composition of individual proteins and the corresponding proteome-wide average. Our results reveal that the distribution of this variation is remarkably uniform in both size and shape across all three domains of life. Given that a parallel analysis of gene nucleotide compositional variation showed no such uniformity, our findings suggest a universal selective pressure acting to govern amino acid compositional diversity within individual proteomes.

利益相反に関する開示

The author declare no conflicts of interest associated with this manuscript.

ダウンロード *前日までの集計結果を表示します

ダウンロード実績データは、公開の翌日以降に作成されます。

引用文献

James, J. E., & Lascoux, M. (2025). Amino Acid Properties, Substitution Rates, and the Nearly Neutral Theory. Genome Biology and Evolution, 17(3). https://doi.org/10.1093/gbe/evaf025

Esumi, G. (2022). Proteome and cellular amino acid compositions may be mutually constrained and in a state of narrow convergence [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.95

Esumi, G. (2023). The distributions of amino acid compositions of proteins in an organism’s proteome uniformly approximate binomial distributions [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.408

Lin, J. (1991). Divergence measures based on the Shannon entropy. IEEE Transactions on Information Theory, 37(1), 145–151. https://doi.org/10.1109/18.61115

Endres, D. M., & Schindelin, J. E. (2003). A new metric for probability distributions. IEEE Transactions on Information Theory, 49(7), 1858–1860. https://doi.org/10.1109/TIT.2003.813506

Doud, M. B., Ashenberg, O., & Bloom, J. D. (2015). Site-Specific Amino Acid Preferences Are Mostly Conserved in Two Closely Related Protein Homologs. Molecular Biology and Evolution, 32(11), 2944–2960. https://doi.org/10.1093/molbev/msv167

Sueoka, N. (1961). Compositional Correlation between Deoxyribonucleic Acid and Protein. Cold Spring Harbor Symposia on Quantitative Biology, 26, 35–43. https://doi.org/10.1101/SQB.1961.026.01.009

EMBL-EBI. (2024). Reference Proteomes (Release 2024_02) [Database]. Retrieved January 20, 2026, from https://www.ebi.ac.uk/reference_proteomes/

National Center for Biotechnology Information (n.d.). National Center for Biotechnology Information. U.S. National Library of Medicine. Retrieved January 20, 2026, from https://www.ncbi.nlm.nih.gov/

Esumi, G. (2025). Chicken Eggs Are a Practical and Common Exome-Matched Diet for Multicellular Eukaryotic Organisms [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.1056

Esumi, G. (2023). The Total Body Amino Acid Composition of an Animal Could Be Explained by a Mixture of the Average Composition of Its Proteome as an Intracellular Composition Estimate and the Type I Collagen Composition as an Extracellular Composition Estimate [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.424

ダウンロード

公開済


投稿日時: 2026-02-03 03:25:57 UTC

公開日時: 2026-03-27 06:08:18 UTC
研究分野
生物学・生命科学・基礎医学