A mixture model approach to sample size estimation in two- sample comparative microarray experiments

2008

BMC Bioinformatics 2008, 9:117 doi:10.1186/1471-2105-9-117

Background: Choosing the appropriate sample size is an important step in the design of a

microarray experiment, and recently methods have been proposed that estimate sample sizes for

control of the False Discovery Rate (FDR). Many of these methods require knowledge of the

distribution of effect sizes among the differentially expressed genes. If this distribution can be

determined then accurate sample size requirements can be calculated.

Results: We present a mixture model approach to estimating the distribution of effect sizes in data

from two-sample comparative studies. Specifically, we present a novel, closed form, algorithm for

estimating the noncentrality parameters in the test statistic distributions of differentially expressed

genes. We then show how our model can be used to estimate sample sizes that control the FDR

together with other statistical measures like average power or the false nondiscovery rate. Method

performance is evaluated through a comparison with existing methods for sample size estimation,

and is found to be very good.

Conclusion: A novel method for estimating the appropriate sample size for a two-sample

comparative microarray study is presented. The method is shown to perform very well when

compared to existing methods.

BioMed Central

BMC Bioinformatics

Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 3.0 Norge