The Synonymous Codon Usage of a Protein Gene Is Primarily Determined by the Guanine + Cytosine Content of the Individual Gene Rather Than the Species to Which It Belongs To Synthesize Proteins With a Balanced Amino Acid Composition
DOI:
https://doi.org/10.51094/jxiv.561Keywords:
codon usage bias, standard genetic code, GC content, amino acid composition, proteomeAbstract
In the genetic code, most amino acids have multiple corresponding codons, and codons corresponding to the same amino acid are called synonymous codons. The use of synonymous codons in protein genes is known to be biased rather than random, and such bias is often explained by species differences in the guanine + cytosine (GC) content of their genomes, or in the abundance of tRNAs that intervene between codons and amino acids in the translation process.
In this study, I statistically analyzed the synonymous codon usage in protein genes of the proteomes of 79 species from 3 domains, published as Reference Proteome, and found that the GC content of the individual gene, rather than the species to which it belongs, primarily determines its synonymous codon usage.
Why then does the GC content of the individual gene determine its codon usage selection? Some papers have already mentioned that the GC content of the third letters of codons is even higher in genes with high total GC content and even lower in genes with low total GC content, and these were usually explained by evolutionary pressure on their genomic GC content and its subsequent shift. However, while this explanation explained the behavior of the third letter of the codon, it did not explain the behavior of the first and second letters. To provide a new explanation for the overall behavior of synonymous codon usage, I added an analysis. Since in previous work the amino acid composition distributions of organisms appeared to be in a state of narrow convergence, and since we know that most organisms share some highly conserved proteins across species boundaries, an additional analysis, based on the assumption that the organism maintains a proteome close to the amino acid composition of a particular conserved protein, suggests that this codon selection counteracts the effect of the GC content of the gene on the amino acid composition of the protein and behaves in the direction of counterbalancing and maintaining a constant and balanced amino acid composition.
From the results of this study, I concluded that synonymous codon selection in a protein gene primarily counterbalances its GC content to maintain a balanced amino acid composition for the proteome. The ability to generate proteins with balanced amino acid composition from genes with different ranges of GC content is considered to be one of the basic functions achieved by the genetic code itself.
Conflicts of Interest Disclosure
The author declare no conflicts of interest associated with this manuscript.Downloads *Displays the aggregated results up to the previous day.
References
Hershberg, R., & Petrov, D. A. (2009). General Rules for Optimal Codon Choice. In M. W. Nachman (Ed.), PLoS Genetics (Vol. 5, Issue 7, p. e1000556). Public Library of Science (PLoS). https://doi.org/10.1371/journal.pgen.1000556
Dong, H., Nilsson, L., & Kurland, C. G. (1996). Co-variation of tRNA Abundance and Codon Usage inEscherichia coliat Different Growth Rates. In Journal of Molecular Biology (Vol. 260, Issue 5, pp. 649–663). Elsevier BV. https://doi.org/10.1006/jmbi.1996.0428
Esumi, G. (2022). Synonymous codon usage and its bias in the bacterial proteomes primarily offset guanine and cytosine content variation to maintain optimal amino acid compositions. Jxiv. https://doi.org/10.51094/jxiv.99
"Quest for Orthologs" group. (2023) Reference proteomes - Primary proteome sets for the Quest For Orthologs, RELEASE 2023_03. https://www.ebi.ac.uk/reference_proteomes/ Accessed 1 Sep 2023
Bateman, A., Martin, M.-J., Orchard, S., Magrane, M., Ahmad, S., Alpi, E., Bowler-Barnett, E. H., Britto, R., Bye-A-Jee, H., Cukura, A., Denny, P., Dogan, T., Ebenezer, T., Fan, J., Garmiri, P., da Costa Gonzales, L. J., Hatton-Ellis, E., Hussein, A., … Zhang, J. (2022). UniProt: the Universal Protein Knowledgebase in 2023. In Nucleic Acids Research (Vol. 51, Issue D1, pp. D523–D531). Oxford University Press (OUP). https://doi.org/10.1093/nar/gkac1052
Hamashima, K., & Kanai, A. (2013). Alternative genetic code for amino acids and transfer RNA revisited. In BioMolecular Concepts (Vol. 4, Issue 3, pp. 309–318). Walter de Gruyter GmbH. https://doi.org/10.1515/bmc-2013-0002
Esumi, G. (2022). Proteome and cellular amino acid compositions may be mutually constrained and in a state of narrow convergence. Jxiv. https://doi.org/10.51094/jxiv.95
Esumi, G. (2023). The Distributions of Amino Acid Compositions of Proteins in an Organism's Proteome Uniformly Approximate Binomial Distributions. Jxiv. https://doi.org/10.51094/jxiv.408
Esumi, G. (2023). The standard genetic code is designed to generate transmembrane domains and intrinsically disordered regions as projections of the thymine density on the gene. Jxiv. https://doi.org/10.51094/jxiv.533
Homma, K., Noguchi, T., & Fukuchi, S. (2016). Codon usage is less optimized in eukaryotic gene segments encoding intrinsically disordered regions than in those encoding structural domains. In Nucleic Acids Research (Vol. 44, Issue 21, pp. 10051-10061). Oxford University Press (OUP). https://doi.org/10.1093/nar/gkw899
Masłowska-Górnicz, A., van den Bosch, M. R. M., Saccenti, E., & Suarez-Diez, M. (2022). A large-scale analysis of codon usage bias in 4868 bacterial genomes shows association of codon adaptation index with GC content, protein functional domains and bacterial phenotypes. In Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms (Vol. 1865, Issue 6, p. 194826). Elsevier BV. https://doi.org/10.1016/j.bbagrm.2022.194826
Behura, S. K., & Severson, D. W. (2012). Codon usage bias: causative factors, quantification methods and genome‐wide patterns: with emphasis on insect genomes. In Biological Reviews (Vol. 88, Issue 1, pp. 49–61). Wiley. https://doi.org/10.1111/j.1469-185x.2012.00242.x
Parvathy, S. T., Udayasuriyan, V., & Bhadana, V. (2021). Codon usage bias. In Molecular Biology Reports (Vol. 49, Issue 1, pp. 539–565). Springer Science and Business Media LLC. https://doi.org/10.1007/s11033-021-06749-4
Posted
Submitted: 2023-12-01 03:10:20 UTC
Published: 2023-12-05 04:18:09 UTC
License
Copyright (c) 2023
Genshiro Esumi
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.