John D. Storey
John D. Storey is the William R. Harman '63 and Mary-Love Harman Professor in Genomics at Princeton University.[1] His research is focused on statistical inference of high-dimensional data, particularly genomic data. Storey was the founding director of the Princeton University Center for Statistics and Machine Learning.[2]
John D. Storey | |
---|---|
Nationality | American |
Alma mater | Stanford University Ph.D. (2002) |
Known for | Q-value |
Awards | COPSS Presidents' Award (2015) Mortimer Spiegelman Award (2015) |
Scientific career | |
Fields | Statistics Statistical genetics Genomics |
Institutions | Princeton University |
Doctoral advisor | Robert Tibshirani |
Doctoral students | Jeffrey T. Leek |
Website | storeylab |
Research
Storey's early research focused on the false discovery rate. At the time the false discovery rate had only been studied in the context of sequential p-value methods and it was not yet in widespread use. However, Storey showed that false discovery rates can be approached through point estimation[3] opening up this very active branch of statistics to false discovery rates. He simultaneously proved a result showing that the positive false discovery rate (pFDR) is exactly equal to a Bayesian posterior probability, thereby providing the first direct connection between false discovery rates and Bayesian theory.[4] In these works, he also invented the q-value, which is a false discovery rate analogue of the p-value. Storey then introduced false discovery rates and q-values as widely applicable measures of statistical significance in genomics, shifting the focus from false positive control to false discovery rate control.[5] With Jeff Leek, Storey discovered that "expression heterogeneity", or unmodeled sources of systematic variation in gene expression data, are very prevalent and need to be modeled and corrected when analyzing genome-wide gene expression data.[6] Leek and Storey introduced "surrogate variable analysis", which is a high-dimensional regression model that includes both known and unknown covariates. He has developed a number of methods for estimating this model. Recently, Storey has shifted his focus to population genomics, where he has introduced genome-wide models of allele frequencies, Hardy–Weinberg equilibrium, and F-statistics that hold under arbitrary population structures.
Honors and awards
- Fellow of the American Association for the Advancement of Science 2011[7]
- Fellow of the Institute of Mathematical Statistics 2012[8]
- COPSS Presidents' Award 2015[9]
- Mortimer Spiegelman Award 2015[10]
References
- "Faculty chosen for endowed professorships". News, Office of Communications, Princeton University. October 8, 2014.
- "Storey to head new Center for Statistics and Machine Learning".
- Storey, John D. (2002). "A direct approach to false discovery rates". Journal of the Royal Statistical Society, Series B (Statistical Methodology). 64 (3): 479–498. CiteSeerX 10.1.1.320.7131. doi:10.1111/1467-9868.00346. S2CID 122987911.
- Storey, John D. (2003). "The positive false discovery rate: a Bayesian interpretation and the q-value". The Annals of Statistics. 31 (6): 2013–2035. doi:10.1214/aos/1074290335.
- Storey, John D.; Tibshirani, Robert (2003). "Statistical significance for genomewide studies". PNAS. 100 (16): 9440–9445. Bibcode:2003PNAS..100.9440S. doi:10.1073/pnas.1530509100. PMC 170937. PMID 12883005.
- Leek, Jeff; Storey, John (2007-09-28). "Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis". PLOS Genetics. 3 (9): 1724–35. doi:10.1371/journal.pgen.0030161. PMC 1994707. PMID 17907809.
- "FACULTY AWARD: Six professors named 2011 AAAS fellows".
- "IMS Fellows announced « IMS Bulletin".
- "Storey receives COPSS Presidents' Award for outstanding statisticians 40 or younger".
- "FACULTY AWARD: Storey receives Mortimer Spiegelman Award for health statisticians under 40".