The Distributions of Amino Acid Compositions of Proteins in an Organism's Proteome Uniformly Approximate Binomial Distributions
DOI:
https://doi.org/10.51094/jxiv.408Keywords:
proteome, amino acid composition, distribution, binomial distribution, resourceAbstract
Advances in the life science technologies have made it possible to access the genomic information of organisms, providing a bird's eye view of the "proteome," the entire set of proteins encoded by each genome. There have been many reports and discoveries in this research field. However, the shape of the distribution of the amino acid composition of the proteins in each proteome has not been reported.
In this study, I used NCBI proteome data. I calculated the amino acid compositions from the downloaded protein amino acid sequences. Then, I examined their distributions within each species and found that they all have bell-shaped distributions without exception.
Assuming that binomial distributions could explain these distribution shapes, I compared these proteome distributions with their adjusted binomial distributions, adjusted for lengths and means. These pairs of distributions were in fairly good agreement.
From these results, I speculated that the amino acid compositions of the proteins in each organism's proteome are in a state of mutual convergence with the amino acid compositions of the organism's cell bodies, which are composed of the proteome proteins, and this is the reason why their proteome distributions approximate binomial distributions.
Conflicts of Interest Disclosure
The author declare no conflicts of interest associated with this manuscript.Downloads *Displays the aggregated results up to the previous day.
References
Sawada, R., Ke, R., Tsuji, T., Sonoyama, M., & Mitaku, S. (2007). Ratio of membrane proteins in total proteomes of prokaryota. BIOPHYSICS, 3, 37–45. https://doi.org/10.2142/biophysics.3.37
Esumi, G. (2022). Proteome and cellular amino acid compositions may be mutually constrained and in a state of narrow convergence. Jxiv. https://doi.org/10.51094/jxiv.95
Genome dataset, “Homo sapiens (human) / Genome assembly T2T-CHM13v2.0", NCBI website, https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_009914755.1/
Genome dataset, "Escherichia coli str. K-12 substr. MG1655 / Genome assembly ASM584v2", NCBI website, https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_000005845.2/
Genome dataset, "Methanocaldococcus jannaschii DSM 2661 / Genome assembly ASM9166v1", NCBI website, https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000091665.1/
Esumi, G. (2020). Autophagy: Possible origin of essential amino acids. Cambridge Open Engage. https://doi.org/10.33774/coe-2020-lll03
Downloads
Posted
Submitted: 2023-06-09 06:46:03 UTC
Published: 2023-06-13 01:42:18 UTC
License
Copyright (c) 2023
Genshiro Esumi
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.