ABOUT   |   AIMS and SCOPE   |   AUTHOR INSTRUCTIONS   |   EDITORIAL BOARD   |   ARCHIVE   |   CURRENT ISSUE   |   SUBMIT MANUSCRIPT ONLINE  |
bioinfo issue

This Article

Copyright © 2013 | AIZEON publishers | All rights reserved

 Open Access
 Abstract
 Full Text PDF Free

 

Services

Articles by corresponding author in PubMed

Articles by corresponding author in Google Scholar

 

 

 

aizeon

 



 

International Journal of Computational Bioinformatics and In Silico Modeling
2013: Volume-2 Issue-6
ISSN: 2320-0634

>>Back to Issues

ABSTRACT   REFERENCES  
International Journal of Computational Bioinformatics and In Silico Modeling 2(6) 2013: 285-292

Optimal Calculation of RNA-Seq Fold-Change Values


Charles D. Warden1*, Yate-Ching Yuan2, and Xiwei Wu1

1Integrative Genomics Core, Department of Molecular and Cellular Biology, City of Hope National Medical Center, Duarte, California.
2Bioinformatics Core, Department of Molecular Medicine, City of Hope National Medical Center, Duarte, California.

* Corresponding Author

ABSTRACT

Biologists often use RNA-Sequencing (RNA-Seq) to identify a limited number of genes for subsequent validation, and one important factor for candidate gene selection is the fold-change in expression between two groups. However, RNA-Seq produces a wide range of read counts per gene, and genes with a low coverage of reads can produce artificially high fold-change values. In this paper, we present a solution to this problem: adding a factor between 0.01 and 1 to normalized expression values. This conclusion is based upon analysis of a large patient cohort of paired tumor and normal samples from patients with lung adenocarcinomas as well as a small, two-group cell line dataset. The optimal factor to add to normalized expression values is chosen based upon testing a range of factors on: the number of genes or transcripts whose expression is effectively censored (using three different alignment algorithms) and 2) the potential level of bias introduced by the factor (defined by comparing unadjusted gene lists). The robustness of these trends is also tested by comparing multiple mRNA quantification and differential expression algorithms. The relationship between RPKM cutoff and concordance between gene lists produced using different statistical methods can be complicated, but this study emphasizes that simple statistical analysis (amendable to the use of rounded RPKM values) at least provides equal quality results as popular algorithms for RNA-Seq differential expression.


Copyright © 2013 | AIZEON publishers | All rights reserved

..........................................................................................................................................................................................................

Citation: Charles D. Warden et al. (2013). Optimal Calculation of RNA-Seq Fold-Change Values Int J Comput Bioinfo In Silico Model 2(6): 285-292

..........................................................................................................................................................................................................