PRADA: pipeline for RNA sequencing data analysis.

Bioinformatics
Abstract

SUMMARY: Technological advances in high-throughput sequencing necessitate improved computational tools for processing and analyzing large-scale datasets in a systematic, automated manner. For that purpose, we have developed PRADA (Pipeline for RNA-Sequencing Data Analysis), a flexible, modular and highly scalable software platform that performs a multifaceted analysis starting from raw paired-end RNA-seq data and reports gene expression levels, quality metrics, unsupervised and supervised detection of fusion transcripts, detection of intragenic fusion variants, homology scores and fusion frame classification. PRADA uses a dual-mapping strategy that increases sensitivity and refines the analytical endpoints. PRADA has been used extensively and successfully in the glioblastoma and renal clear cell projects of The Cancer Genome Atlas program.

AVAILABILITY AND IMPLEMENTATION:  

CONTACT: gadgetz@broadinstitute.org or rverhaak@mdanderson.org

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Year of Publication
2014
Journal
Bioinformatics
Volume
30
Issue
15
Pages
2224-2226
Date Published
2014 Aug 01
ISSN
1367-4811
DOI
10.1093/bioinformatics/btu169
PubMed ID
24695405
PubMed Central ID
PMC4103589
Grant list
P30 CA016672 / CA / NCI NIH HHS / United States
CA143883 / CA / NCI NIH HHS / United States