Interpreting mammalian synonymous site conservation in light of the unwanted transcript hypothesis.
Authors | |
Abstract | Mammalian genomes are biased towards GC bases at third codon positions, likely due to a GC-biased ancestral genome and the selectively neutral recombination-related process of GC-biased gene conversion. The unwanted transcript hypothesis posits that this high GC content at synonymous sites may be beneficial for protecting against spurious transcripts, particularly in species with low effective population sizes. Utilising a 240 placental mammal genome alignment and single-base resolution conservation scores, we interpret sequence conservation at mammalian four-fold degenerate sites in this context and find evidence in support of the unwanted transcript hypothesis, including a strong GC bias, high conservation at sites relating to exon splicing, less human genetic variation at conserved four-fold degenerate sites, and conservation of sites important for epigenetic regulation of developmental genes. Additionally, we show that high conservation of four-fold degenerate sites in essential developmental genes, including homeobox genes, likely relates to the low mutation rates experienced by these genes. |
Year of Publication | 2025 |
Journal | Nature Communications |
Volume | 16 |
Issue | 1 |
Pages | 2007 |
Date Published | 02/2025 |
ISSN | 2041-1723 |
DOI | 10.1038/s41467-025-57179-w |
PubMed ID | 40011430 |