Unbiased estimation of odds ratios: combining genomewide association scans with replication studies

Abstract

Abstract Odds ratios or other effect sizes estimated from genome scans are upwardly biased, because only the top‐ranking associations are reported, and moreover only if they reach a defined level of significance. No unbiased estimate exists based on data selected in this fashion, but replication studies are routinely performed that allow unbiased estimation of the effect sizes. Estimation based on replication data alone is inefficient in the sense that the initial scan could, in principle, contribute information on the effect size. We propose an unbiased estimator combining information from both the initial scan and the replication study, which is more efficient than that based just on the replication. Specifically, we adjust the standard combined estimate to allow for selection by rank and significance in the initial scan. Our approach explicitly allows for multiple associations arising from a scan, and is robust to mis‐specification of a significance threshold. We require replication data to be available but argue that, in most applications, estimates of effect sizes are only useful when associations have been replicated. We illustrate our approach on some recently completed scans and explore its efficiency by simulation. Genet. Epidemiol . 33:406–418, 2009. © 2009 Wiley‐Liss, Inc.

Keywords

Replication (statistics)EstimatorRanking (information retrieval)Computer scienceStatisticsOddsRank (graph theory)BiologyMathematicsArtificial intelligence

Affiliated Institutions

MRC Biostatistics Unit GB

Related Publications

The many weak instruments problem and Mendelian randomization

Neil M Davies , Stephanie von Hinke , Helmut Farbmacher +3 more

Instrumental variable estimates of causal effects can be biased when using many instruments that are only weakly associated with the exposure. We describe several techniques to ...

2014 Statistics in Medicine 145 citations

Very early scans for demonstrating dissemination in time in multiple sclerosis

Carmen Tur , Mar Tintoré , Àlex Rovira +8 more

Objective To evaluate the clinical significance of the 2005 modified imaging criteria for dissemination in time in multiple sclerosis stating that detection of a new T2 lesion a...

2008 Multiple Sclerosis Journal 25 citations

Explaining heterogeneity in meta-analysis: a comparison of methods

Simon G. Thompson , Stephen J. Sharp

Exploring the possible reasons for heterogeneity between studies is an important aspect of conducting a meta-analysis. This paper compares a number of methods which can be used ...

1999 Statistics in Medicine 1702 citations

Effect size, confidence interval and statistical significance: a practical guide for biologists

Shinichi Nakagawa , Innes C. Cuthill

Abstract Null hypothesis significance testing (NHST) is the dominant statistical approach in biology, although it has many, frequently unappreciated, problems. Most importantly,...

2007 Biological reviews/Biological reviews... 3646 citations

Learning to Count: Robust Estimates for Labeled Distances between Molecular Sequences

John O’Brien , Vladimir N. Minin , Marc A. Suchard

Researchers routinely estimate distances between molecular sequences using continuous-time Markov chain models. We present a new method, robust counting, that protects against t...

2009 Molecular Biology and Evolution 113 citations

Publication Info

Year: 2009
Type: article
Volume: 33
Issue: 5
Pages: 406-418
Citations: 65
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Unbiased estimation of odds ratios: combining genomewide association scans with replication studies

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

OpenAlex

Cite This

APA Style

                            
                                    Jack Bowden, 
                                
                                    Frank Dudbridge
                                
                            (2009). 
                            Unbiased estimation of odds ratios: combining genomewide association scans with replication studies. 
                            Genetic Epidemiology
                            , 33
                            (5)
                            , 406-418.
                            https://doi.org/10.1002/gepi.20394

Identifiers

DOI: 10.1002/gepi.20394