Collagens achieve highly biased amino acid composition not only by random mutation but also by slanted assignment of the genetic code table
DOI:
https://doi.org/10.51094/jxiv.127Keywords:
collagen, amino acid composition, GC content, synonymous codon, genetic code tableAbstract
Collagens are proteins that are ubiquitous in animal bodies. They have unique triple-helix domains mainly consisting of glycines, prolines, and hydroxyprolines, resulting in collagens being highly biased amino acid compositions. Traditionally, we have considered that only the accumulation of random mutations develops the structures and compositions of proteins. However, the results of the present study suggest that it is not only random mutations but also the slanted assignment of the genetic code table that assisted in forming the biased amino acid composition of collagens.
In my previous paper, I showed that the synonymous codon usage selections in all proteins of various 23 bacteria species primarily offset the influences of guanine and cytosine (GC) content variation on their amino acid compositions. In this report, I did the same analysis on all the human proteins and found two things. First, the human proteins' coding sequences have a broader GC content range than each bacteria species. Second, most human proteins' synonymous codon usage selections offset the influences of GC content variation like bacteria, but those of some proteins like collagens did not. Instead, synonymous codon usage selections of these proteins emphasized the feature of their high GC content.
These findings suggest that the slanted genetic code table assignment assisted animals in forming collagens, highly biased amino acid composition proteins, and assisted us be multicellular organisms in our evolutions.
Downloads *Displays the aggregated results up to the previous day.
References
National Center for Biotechnology Information (NCBI). (2022). Genome assembly GRCh38.p14. National Library of Medicine (NIH) website. https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_000001405.40/
Esumi, G. (2022). Synonymous codon usage and its bias in the bacterial proteomes primarily offset GC content variation to maintain optimal amino acid compositions. Jxiv. https://doi.org/10.51094/jxiv.99
Esumi, G. (2022). Proteome and cellular amino acid compositions may be mutually constrained and in a state of narrow convergence. Jxiv. https://doi.org/10.51094/jxiv.95
Costantini, M., Cammarano, R., & Bernardi, G. (2009). The evolution of isochore patterns in vertebrate genomes. BMC Genomics, 10, 146. https://doi.org/10.1186/1471-2164-10-146
Bernardi, G. (1993). The vertebrate genome: isochores and evolution. Molecular Biology and Evolution, 10(1), 186–204. https://doi.org/10.1093/oxfordjournals.molbev.a039994
Kiktev, D. A., Sheng, Z., Lobachev, K. S., & Petes, T. D. (2018). GC content elevates mutation and recombination rates in the yeast Saccharomyces cerevisiae. Proceedings of the National Academy of Sciences of the United States of America, 115(30), E7109–E7118. https://doi.org/10.1073/pnas.1807334115
UniProt consortium. (2022). P02452 · CO1A1_HUMAN. Entry version 250. UniProtKB website. https://www.uniprot.org/uniprotkb/P02452/entry
UniProt consortium. (2022). P08123 · CO1A2_HUMAN. Entry version 237. UniProtKB website. https://www.uniprot.org/uniprotkb/P08123/entry
Zhang, Y., Shimizu, K., Shiomi, K., Kajiura, Z., & Nakagaki, M. (2008). cDNA cloning of Nephila clavata dragline silk (MaSp1) gene and comparison with the sequence of Bombyx mori fibroin heavy chain. Sanshi-Konchu Biotec, 77(1), 39–46. https://doi.org/10.11416/konchubiotec.77.1_39
Downloads
Posted
Submitted: 2022-08-01 00:51:09 UTC
Published: 2022-08-03 07:19:26 UTC
License
Copyright (c) 2022
Genshiro Esumi
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.