プレプリント / バージョン1

Proteome-wide Convergence of Amino Acid Composition in Phage–Host Systems

##article.authors##

DOI:

https://doi.org/10.51094/jxiv.2374

キーワード:

Bacteriophage、 Temperate phages、 Amino acid composition、 Jensen-Shannon distance、 Mutual constraint model、 Molecular evolution

抄録

Background: Understanding the selective pressures that shape amino acid usage is a fundamental challenge in evolutionary biology. We previously proposed a "mutual constraint" model, suggesting that the cytoplasmic protein pool serves as both a product of the genome and a resource for new protein synthesis, creating a feedback loop that stabilizes species-specific proteome profiles.

Methods: To test this hypothesis, we evaluated the compositional alignment between bacteriophages and their hosts using a comprehensive dataset of 117 phage species and 67 bacterial host species. We calculated the species-specific proteome-wide mean amino acid composition for each organism and quantified the differences using the Jensen-Shannon (JS) distance.

Results: Our analysis revealed that specific phage–host pairs exhibit significantly closer compositional proximity (median JS distance = 0.076) compared to random inter-species combinations (median = 0.110). This alignment was strikingly lifestyle-dependent: temperate phages showed a highly concentrated reciprocal convergence, with approximately 34% of pairs occupying the highest proximity bin in a two-dimensional distance map. In contrast, virulent phages displayed a broader and more scattered distribution.

Discussion: Furthermore, computational frame-shift experiments on Escherichia coli K-12 and lambda phage demonstrated that genomic signatures alone cannot dictate amino acid profiles, suggesting that selective pressure acts directly on the amino acid composition to minimize metabolic friction within the host’s resource environment.

Conclusion: These findings demonstrate that the host's nutritional landscape is a primary determinant of viral proteome evolution. Our study supports the existence of universal metabolic constraints governing the evolution of obligate intracellular entities and provides a new quantitative framework for investigating host–parasite co-evolution.

利益相反に関する開示

The author declare no conflicts of interest associated with this manuscript.

ダウンロード *前日までの集計結果を表示します

ダウンロード実績データは、公開の翌日以降に作成されます。

引用文献

Zuckerkandl, E., & Pauling, L. (1965). Molecules as documents of evolutionary history. Journal of Theoretical Biology, 8(2), 357–366. https://doi.org/10.1016/0022-5193(65)90083-4

Esumi, G. (2022). Proteome and cellular amino acid compositions may be mutually constrained and in a state of narrow convergence [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.95

Esumi, G. (2023). The distributions of amino acid compositions of proteins in an organism’s proteome uniformly approximate binomial distributions [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.408

Esumi, G. (2025). Chicken Eggs Are a Practical and Common Exome-Matched Diet for Multicellular Eukaryotic Organisms [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.1056

Alberts, B., Heald, R., Johnson, A., Morgan, D., Raff, M., Roberts, K., & Walter, P. (2022). Molecular biology of the cell (7th ed., pp. 33–35). W. W. Norton & Company.

McNair, K., Bailey, B. A., & Edwards, R. A. (2012). PHACTS, a computational approach to classifying the lifestyle of phages. Bioinformatics, 28(5), 614–618. https://doi.org/10.1093/bioinformatics/bts014

Mihara, T., Nishimura, Y., Shimizu, Y., Nishiyama, H., Yoshikawa, G., Uehara, H., Hingamp, P., Goto, S., & Ogata, H. (2016). Linking virus genomes with host taxonomy. Scientific Reports, 6, 34588. https://doi.org/10.1038/srep34588 (Retrieved December 12, 2025, from https://www.genome.jp/virushostdb/)

National Center for Biotechnology Information (n.d.). National Center for Biotechnology Information. U.S. National Library of Medicine. Retrieved December 12, 2025, from https://www.ncbi.nlm.nih.gov/

Lin, J. (1991). Divergence measures based on the Shannon entropy. IEEE Transactions on Information Theory, 37(1), 145–151. https://doi.org/10.1109/18.61115

Endres, D. M., & Schindelin, J. E. (2003). A new metric for probability distributions. IEEE Transactions on Information Theory, 49(7), 1858–1860. https://doi.org/10.1109/TIT.2003.813506

Mavrich, T. N., & Hatfull, G. F. (2017). Bacteriophage evolution differs by host, lifestyle and genome. Nature Microbiology, 2. https://doi.org/10.1038/nmicrobiol.2017.112

Wu, S., Fang, Z., Tan, J., Li, M., Wang, C., Guo, Q., Xu, C., Jiang, X., & Zhu, H. (2021). DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach. GigaScience, 10(9). https://doi.org/10.1093/gigascience/giab056

Zhang, Y., Mao, M., Zhang, R., Liao, Y.-T., & Wu, V. C. H. (2024). DeepPL: A deep-learning-based tool for the prediction of bacteriophage lifecycle. PLOS Computational Biology, 20(10), e1012525. https://doi.org/10.1371/journal.pcbi.1012525

Esumi, G. (2025). Amino‑acid composition is a selection target for coding‑sequence retention: evidence from out‑of‑frame translation comparisons in Escherichia coli [Preprint]. Jxiv. https://doi.org/10.51094/jxiv.1565

公開済


投稿日時: 2025-12-23 07:06:23 UTC

公開日時: 2026-01-14 01:16:53 UTC
研究分野
生物学・生命科学・基礎医学