Mining regulatory 5'UTRs from cDNA deep sequencing datasets.

Nucleic Acids Res
Authors
Keywords
Abstract

Regulatory 5' untranslated regions (r5'UTRs) of mRNAs such as riboswitches modulate the expression of genes involved in varied biological processes in both bacteria and eukaryotes. New high-throughput sequencing technologies could provide powerful tools for discovery of novel r5'UTRs, but the size and complexity of the datasets generated by these technologies makes it difficult to differentiate r5'UTRs from the multitude of other types of RNAs detected. Here, we developed and implemented a bioinformatic approach to identify putative r5'UTRs from within large datasets of RNAs recently identified by pyrosequencing of the Vibrio cholerae small transcriptome. This screen yielded only approximately 1% of all non-overlapping RNAs along with 75% of previously annotated r5'UTRs and 69 candidate V. cholerae r5'UTRs. These candidates include several putative functional homologues of diverse r5'UTRs characterized in other species as well as numerous candidates upstream of genes involved in pathways not known to be regulated by r5'UTRs, such as fatty acid oxidation and peptidoglycan catabolism. Two of these novel r5'UTRs were experimentally validated using a GFP reporter-based approach. Our findings suggest that the number and diversity of pathways regulated by r5'UTRs has been underestimated and that deep sequencing-based transcriptomics will be extremely valuable in the search for novel r5'UTRs.

Year of Publication
2010
Journal
Nucleic Acids Res
Volume
38
Issue
5
Pages
1504-14
Date Published
2010 Mar
ISSN
1362-4962
URL
DOI
10.1093/nar/gkp1121
PubMed ID
19969537
PubMed Central ID
PMC2836559
Links
Grant list
R37 AI042347 / AI / NIAID NIH HHS / United States
AI-076608 / AI / NIAID NIH HHS / United States
R37-AI-42347 / AI / NIAID NIH HHS / United States
Howard Hughes Medical Institute / United States