Proteome-wide Convergence of Amino Acid Composition in Phage–Host Systems
DOI:
https://doi.org/10.51094/jxiv.2374キーワード:
Bacteriophage、 Temperate phages、 Amino acid composition、 Jensen-Shannon distance、 Mutual constraint model、 Molecular evolution抄録
Background: Understanding the selective pressures that shape amino acid usage is a fundamental challenge in evolutionary biology. We previously proposed a "mutual constraint" model, suggesting that the cytoplasmic protein pool serves as both a product of the genome and a resource for new protein synthesis, creating a feedback loop that stabilizes species-specific proteome profiles.
Methods: To test this hypothesis, we evaluated the compositional alignment between bacteriophages and their hosts using a comprehensive dataset of 117 phage species and 67 bacterial host species. We calculated the species-specific proteome-wide mean amino acid composition for each organism and quantified the differences using the Jensen-Shannon (JS) distance.
Results: Our analysis revealed that specific phage–host pairs exhibit significantly closer compositional proximity (median JS distance = 0.076) compared to random inter-species combinations (median = 0.110). This alignment was strikingly lifestyle-dependent: temperate phages showed a highly concentrated reciprocal convergence, with approximately 34% of pairs occupying the highest proximity bin in a two-dimensional distance map. In contrast, virulent phages displayed a broader and more scattered distribution.
Discussion: Furthermore, computational frame-shift experiments on Escherichia coli K-12 and lambda phage demonstrated that genomic signatures alone cannot dictate amino acid profiles, suggesting that selective pressure acts directly on the amino acid composition to minimize metabolic friction within the host’s resource environment.
Conclusion: These findings demonstrate that the host's nutritional landscape is a primary determinant of viral proteome evolution. Our study supports the existence of universal metabolic constraints governing the evolution of obligate intracellular entities and provides a new quantitative framework for investigating host–parasite co-evolution.
利益相反に関する開示
The author declare no conflicts of interest associated with this manuscript.ダウンロード *前日までの集計結果を表示します
引用文献
Zuckerkandl, E., & Pauling, L. (1965). Molecules as documents of evolutionary history. Journal of Theoretical Biology, 8(2), 357–366. https://doi.org/10.1016/0022-5193(65)90083-4
Esumi, G. (2022). Proteome and cellular amino acid compositions may be mutually constrained and in a state of narrow convergence [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.95
Esumi, G. (2023). The distributions of amino acid compositions of proteins in an organism’s proteome uniformly approximate binomial distributions [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.408
Esumi, G. (2025). Chicken Eggs Are a Practical and Common Exome-Matched Diet for Multicellular Eukaryotic Organisms [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.1056
Alberts, B., Heald, R., Johnson, A., Morgan, D., Raff, M., Roberts, K., & Walter, P. (2022). Molecular biology of the cell (7th ed., pp. 33–35). W. W. Norton & Company.
McNair, K., Bailey, B. A., & Edwards, R. A. (2012). PHACTS, a computational approach to classifying the lifestyle of phages. Bioinformatics, 28(5), 614–618. https://doi.org/10.1093/bioinformatics/bts014
Mihara, T., Nishimura, Y., Shimizu, Y., Nishiyama, H., Yoshikawa, G., Uehara, H., Hingamp, P., Goto, S., & Ogata, H. (2016). Linking virus genomes with host taxonomy. Scientific Reports, 6, 34588. https://doi.org/10.1038/srep34588 (Retrieved December 12, 2025, from https://www.genome.jp/virushostdb/)
National Center for Biotechnology Information (n.d.). National Center for Biotechnology Information. U.S. National Library of Medicine. Retrieved December 12, 2025, from https://www.ncbi.nlm.nih.gov/
Lin, J. (1991). Divergence measures based on the Shannon entropy. IEEE Transactions on Information Theory, 37(1), 145–151. https://doi.org/10.1109/18.61115
Endres, D. M., & Schindelin, J. E. (2003). A new metric for probability distributions. IEEE Transactions on Information Theory, 49(7), 1858–1860. https://doi.org/10.1109/TIT.2003.813506
Mavrich, T. N., & Hatfull, G. F. (2017). Bacteriophage evolution differs by host, lifestyle and genome. Nature Microbiology, 2. https://doi.org/10.1038/nmicrobiol.2017.112
Wu, S., Fang, Z., Tan, J., Li, M., Wang, C., Guo, Q., Xu, C., Jiang, X., & Zhu, H. (2021). DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach. GigaScience, 10(9). https://doi.org/10.1093/gigascience/giab056
Zhang, Y., Mao, M., Zhang, R., Liao, Y.-T., & Wu, V. C. H. (2024). DeepPL: A deep-learning-based tool for the prediction of bacteriophage lifecycle. PLOS Computational Biology, 20(10), e1012525. https://doi.org/10.1371/journal.pcbi.1012525
Esumi, G. (2025). Amino‑acid composition is a selection target for coding‑sequence retention: evidence from out‑of‑frame translation comparisons in Escherichia coli [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.1565
公開済
投稿日時: 2025-12-23 07:06:23 UTC
公開日時: 2026-01-14 01:16:53 UTC
ライセンス
Copyright(c)2026
Genshiro Esumi
この作品は、Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Licenseの下でライセンスされています。
