MDL | David Bickel, statistician

Local FDR estimation for low-dimensional data

18 October 2012 DRB Leave a comment

M. Padilla and D. R. Bickel, “Estimators of the local false discovery rate designed for small numbers of tests,” Statistical Applications in Genetics and Molecular Biology 11 (5), art. 4 (2012). Full article | 2010 & 2012 preprints

This article describes estimators of local false discovery rates, compares their biases for small-scale inference, and illustrates the methods using a quantitative proteomics data set. In addition, theoretical results are presented in the appendices.

Categories: complexity, empirical Bayes, fiducial inference, MDL, proteomics, publications, statistical evidence, trainee author

How to combine statistical methods

29 August 2012 DRB 1 comment

D. R. Bickel, “Game-theoretic probability combination with applications to resolving conflicts between statistical methods,” International Journal of Approximate Reasoning 53, 880-891 (2012). Full article | 2011 preprint | Slides | Simple explanation

Cover image

This paper proposes both a novel solution to the problem of combining probability distributions and a framework for using the new method to combine the results of differing statistical methods that may legitimately be used to analyze the same data set. While the paper emphasizes theoretical development, it is motivated by the need to combine two conflicting estimators of the probability of differential gene expression.

Categories: complexity, empirical Bayes, fiducial inference, imprecise probability, MDL, publications, simply explained, slides

Estimating probabilities of enrichment

4 January 2012 DRB Leave a comment

Z. Yang, Z. Li, and D. R. Bickel, “Empirical Bayes estimation of posterior probabilities of enrichment,” Technical Report, Ottawa Institute of Systems Biology, Technical Report, Ottawa Institute of Systems Biology, arXiv:1201.0153 (2011). Full preprint | 2010 seed

This paper adapts novel empirical Bayes methods for the problem of detecting enrichment in the form of differential representation of genes associated with a biological category with respect to a list of genes identified as differentially expressed. A microarray case study illustrates the methods using Gene Ontology (GO) terms, and a simulation study compares their performance. We report that which enrichment methods work best depends strongly on how many GO terms or other biological categories are of interest.

Categories: empirical Bayes, MDL, ontology, preprints, statistical evidence

Combining inferences from different methods

28 November 2011 DRB Leave a comment

D. R. Bickel, “Resolving conflicts between statistical methods by probability combination: Application to empirical Bayes analyses of genomic data,” Technical Report, Ottawa Institute of Systems Biology, arXiv:1111.6174 (2011). Full preprint

This paper proposes a solution to the problem of combining the results of differing statistical methods that may legitimately be used to analyze the same data set. The motivating application is the combination of two estimators of the probability of differential gene expression: one uses an empirical null distribution, and the other uses the theoretical null distribution. Since there is usually not any reliable way to predict which null distribution will perform better for a given data set and since the choice between them often has a large impact on the conclusions, the proposed hedging strategy addresses a pressing need in statistical genomics. Many other applications are also mentioned in the abstract and described in the introduction.

Categories: empirical Bayes, fiducial inference, imprecise probability, MDL, preprints

Minimax strength of statistical evidence

24 November 2011 DRB Leave a comment

D. R. Bickel, “A predictive approach to measuring the strength of statistical evidence for single and multiple comparisons,” Canadian Journal of Statistics 39, 610–631 (2011). Full text | Revised preprint | 2010 draft

93663f53-d2d9-49de-b379-0cb0b7b566d7

This paper introduces a novel approach to the multiple comparisons problem by generalizing a promising method of model selection developed by information theorists. The first two sections present that method and its main advantages over conventional approaches without burdening statisticians with unfamiliar terms from coding theory. A quantitative proteomics case study facilitates application of the new method to the analysis of data sets involving multiple biological features. The theorems describe its operating characteristics.

The cited medium-scale paper presented previous minimum description length (MDL) methods. Unlike those methods, the new MDL methods of the current paper are based on a conflation of the normalized maximum likelihood (NML) with the weighted likelihood (WL). The previous MDL methods are used in the CJS article for comparison with its NML/WL methods.

Categories: complexity, MDL, proteomics, publications, statistical evidence

Software for local false discovery rate estimation

15 August 2011 DRB Leave a comment

LFDR-MLE is a suite of R functions for the estimation of local false discovery rates by maximum likelihood under a two-group parametric mixture model of test statistics.

Categories: empirical Bayes, gene expression, genetic association, MDL, proteomics, software, trainee author

Unknown Bayes factor approximation

5 April 2011 DRB Leave a comment

D. R. Bickel, “Measuring support for a hypothesis about a random parameter without estimating its unknown prior,” Technical Report, Ottawa Institute of Systems Biology, arXiv:1101.0305 (2011). Full preprint

Categories: empirical Bayes, MDL, preprints, proteomics, statistical evidence

Quantifying evidence for enrichment

7 January 2011 DRB Leave a comment

Z. Yang and D. R. Bickel, “Minimum description length measures of evidence for enrichment,” Technical Report, Ottawa Institute of Systems Biology, COBRA Preprint Series, Article 76, available at biostats.bepress.com/cobra/ps/art76 (2010). Full preprint

Categories: MDL, ontology, preprints, statistical evidence

Quantifying evidence for genetic association

30 November 2010 DRB Leave a comment

Y. Yang and D. R. Bickel, “Minimum description length and empirical Bayes methods of identifying SNPs associated with disease,” Technical Report, Ottawa Institute of Systems Biology, COBRA Preprint Series, Article 74, available at biostats.bepress.com/cobra/ps/art74 (2010).

This manuscript adapts two new evidential, information-theoretic methods to the problem of detecting SNPs associated with disease on the basis of genome-wide association data. Both an application to coronary artery disease and an extensive set of simulation studies indicate that these parametric methods tend to be more reliable than a popular semi-parametric approach to estimating local false discovery rates. In addition, the paper reports that one of the two novel methods performs better than the other.

The abstract and the discussion section of the preprint provide more detailed summaries.

Categories: empirical Bayes, genetic association, MDL, preprints, statistical evidence

Normalized maximum weighted likelihood

8 October 2010 DRB 1 comment

D. R. Bickel, “Statistical inference optimized with respect to the observed sample for single or multiple comparisons,” Technical Report, Ottawa Institute of Systems Biology, arXiv:1010.0694 (2010). Full preprint

Categories: MDL, preprints, proteomics, statistical evidence

Newer Entries Older Entries

David Bickel, statistician

Archive

Local FDR estimation for low-dimensional data

How to combine statistical methods

Estimating probabilities of enrichment

Combining inferences from different methods

Minimax strength of statistical evidence

Software for local false discovery rate estimation

Unknown Bayes factor approximation

Quantifying evidence for enrichment

Quantifying evidence for genetic association

Normalized maximum weighted likelihood

News

Pre-2020 content by topic

Pre-2020 content by date

Slideshow

Facebook page

Archive

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

News

Pre-2020 content by topic

Pre-2020 content by date

Slideshow