Preprint / Version 1

Collagens achieve highly biased amino acid composition not only by random mutation but also by slanted assignment of the genetic code table

##article.authors##

DOI:

https://doi.org/10.51094/jxiv.127

Keywords:

collagen, amino acid composition, GC content, synonymous codon, genetic code table

Abstract

Collagens are proteins that are ubiquitous in animal bodies. They have unique triple-helix domains mainly consisting of glycines, prolines, and hydroxyprolines, resulting in collagens being highly biased amino acid compositions. Traditionally, we have considered that only the accumulation of random mutations develops the structures and compositions of proteins. However, the results of the present study suggest that it is not only random mutations but also the slanted assignment of the genetic code table that assisted in forming the biased amino acid composition of collagens.

In my previous paper, I showed that the synonymous codon usage selections in all proteins of various 23 bacteria species primarily offset the influences of guanine and cytosine (GC) content variation on their amino acid compositions. In this report, I did the same analysis on all the human proteins and found two things. First, the human proteins' coding sequences have a broader GC content range than each bacteria species. Second, most human proteins' synonymous codon usage selections offset the influences of GC content variation like bacteria, but those of some proteins like collagens did not. Instead, synonymous codon usage selections of these proteins emphasized the feature of their high GC content.

These findings suggest that the slanted genetic code table assignment assisted animals in forming collagens, highly biased amino acid composition proteins, and assisted us be multicellular organisms in our evolutions.

Downloads *Displays the aggregated results up to the previous day.

Download data is not yet available.

References

National Center for Biotechnology Information (NCBI). (2022). Genome assembly GRCh38.p14. National Library of Medicine (NIH) website. https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_000001405.40/

Esumi, G. (2022). Synonymous codon usage and its bias in the bacterial proteomes primarily offset GC content variation to maintain optimal amino acid compositions. Jxiv. https://doi.org/10.51094/jxiv.99

Esumi, G. (2022). Proteome and cellular amino acid compositions may be mutually constrained and in a state of narrow convergence. Jxiv. https://doi.org/10.51094/jxiv.95

Costantini, M., Cammarano, R., & Bernardi, G. (2009). The evolution of isochore patterns in vertebrate genomes. BMC Genomics, 10, 146. https://doi.org/10.1186/1471-2164-10-146

Bernardi, G. (1993). The vertebrate genome: isochores and evolution. Molecular Biology and Evolution, 10(1), 186–204. https://doi.org/10.1093/oxfordjournals.molbev.a039994

Kiktev, D. A., Sheng, Z., Lobachev, K. S., & Petes, T. D. (2018). GC content elevates mutation and recombination rates in the yeast Saccharomyces cerevisiae. Proceedings of the National Academy of Sciences of the United States of America, 115(30), E7109–E7118. https://doi.org/10.1073/pnas.1807334115

UniProt consortium. (2022). P02452 · CO1A1_HUMAN. Entry version 250. UniProtKB website. https://www.uniprot.org/uniprotkb/P02452/entry

UniProt consortium. (2022). P08123 · CO1A2_HUMAN. Entry version 237. UniProtKB website. https://www.uniprot.org/uniprotkb/P08123/entry

Zhang, Y., Shimizu, K., Shiomi, K., Kajiura, Z., & Nakagaki, M. (2008). cDNA cloning of Nephila clavata dragline silk (MaSp1) gene and comparison with the sequence of Bombyx mori fibroin heavy chain. Sanshi-Konchu Biotec, 77(1), 39–46. https://doi.org/10.11416/konchubiotec.77.1_39

Downloads

Posted


Submitted: 2022-08-01 00:51:09 UTC

Published: 2022-08-03 07:19:26 UTC
Section
Biology, Life Sciences & Basic Medicine