Heading Down the Wrong Pathway: on the Influence of Correlation within Gene Sets

dc.contributor.author

Gatti, Daniel M

dc.contributor.author

Barry, William T

dc.contributor.author

Nobel, Andrew B

dc.contributor.author

Rusyn, Ivan

dc.contributor.author

Wright, Fred A

dc.date.accessioned

2011-06-21T17:29:33Z

dc.date.available

2011-06-21T17:29:33Z

dc.date.issued

2010

dc.description.abstract

Background: Analysis of microarray experiments often involves testing for the overrepresentation of pre-defined sets of genes among lists of genes deemed individually significant. Most popular gene set testing methods assume the independence of genes within each set, an assumption that is seriously violated, as extensive correlation between genes is a well-documented phenomenon. Results: We conducted a meta-analysis of over 200 datasets from the Gene Expression Omnibus in order to demonstrate the practical impact of strong gene correlation patterns that are highly consistent across experiments. We show that a common independence assumption-based gene set testing procedure produces very high false positive rates when applied to data sets for which treatment groups have been randomized, and that gene sets with high internal correlation are more likely to be declared significant. A reanalysis of the same datasets using an array resampling approach properly controls false positive rates, leading to more parsimonious and high-confidence gene set findings, which should facilitate pathway-based interpretation of the microarray data. Conclusions: These findings call into question many of the gene set testing results in the literature and argue strongly for the adoption of resampling based gene set testing criteria in the peer reviewed biomedical literature.

dc.description.version

Version of Record

dc.identifier.citation

Gatti,Daniel M.;Barry,William T.;Nobel,Andrew B.;Rusyn,Ivan;Wright,Fred A.. 2010. Heading Down the Wrong Pathway: on the Influence of Correlation within Gene Sets. Bmc Genomics 11( ): 574-574.

dc.identifier.issn

1471-2164

dc.identifier.uri

https://hdl.handle.net/10161/4348

dc.language.iso

en_US

dc.publisher

Springer Science and Business Media LLC

dc.relation.isversionof

10.1186/1471-2164-11-574

dc.relation.journal

Bmc Genomics

dc.subject

expression data

dc.subject

microarray data

dc.subject

functional categories

dc.subject

enrichment

dc.subject

omnibus

dc.subject

bioinformatics

dc.subject

bioconductor

dc.subject

carcinomas

dc.subject

knowledge

dc.subject

patterns

dc.subject

biotechnology & applied microbiology

dc.subject

genetics & heredity

dc.title

Heading Down the Wrong Pathway: on the Influence of Correlation within Gene Sets

dc.title.alternative
dc.type

Other article

duke.date.pubdate

2010-10-18

duke.description.issue
duke.description.volume

11

pubs.begin-page

574

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
283853100001.pdf
Size:
2.43 MB
Format:
Adobe Portable Document Format