Unsupervised empirical Bayesian multiple testing with external covariates

The Annals of Applie d Statistics 2008, V ol. 2, No. 2, 714–735 DOI: 10.1214 /08-A OAS158 c  Institute of Mathematical Statistics , 2 008 UNSUPER VISED EMPIRICAL BA YE SIAN MUL TIPLE TESTING WITH EXTERNAL CO V ARIA TES By Egil Ferk ingst ad, 1 Arnoldo Frigessi, H ˚ av ar d R ue, Gudmar Thorleifs s on and A ugustine Kong University of Oslo and Centr e for Inte gr ative Genetics , ( sﬁ ) 2 —Statistics for Innovation , N orwe gian University of Sci enc e and T e ch nolo gy , De c o de Genetics and D e c o de Genetics In an empirical Bay esian setting, w e provide a new multiple test- ing metho d, useful when an additional co v ariate is av ailable, that inﬂuences the probability of eac h null h y p othesis being true. W e mea- sure the posterior signiﬁcance of eac h test conditionally on the co- v ariate and the data, leading to greater p o w er. Using cov ariate-based prior information in an unsup erv ised fashion, we pro duce a list of signiﬁcan t hyp oth eses which diﬀers in length and order from the list obtained by method s not taking co v ariate-information into account. Co va riate-modu lated p osterior probabilities of eac h null h yp oth esis are estimated using a fast approximate algorithm. The new metho d is applied to ex pression quantitativ e trait loci (eQTL) data. 1. In tro d uction. Science, indu stry and b usiness p ossess the tec hnology to collect , store and distribute h u ge amoun ts of data eﬃcien tly and often at lo w cost. Sensors and instru men tation, data logging capaci t y and com- m u nication p o wer hav e increased the br eadth and d epth of data. Systems are measured more in detail, giving a more complete but complex picture of pro cesses and phenomena. Also, it is necessary to int egrate many sources of data of diﬀeren t t y p e and qualit y . In high-through p ut genomics, large n u m b ers of sim ultaneous comparisons are necessary to d isco v er diﬀeren tially expressed genes among thirty thous and measured on es. S im ilarly , in ﬁnance, one wishes to monitor prices of thousands of pro ducts and deriv ativ es sim u l- taneously to detect abnormal b eh a vior, or in geoph ysics or brain imaging, questioning thous an d s of 3D v oxel s ab out their prop er ties. Such tests are Received March 2007; revised January 2008. 1 Supp orted by the National program for researc h in functional genomics in N orw ay from the Research Council of Norwa y . Key wor ds and phr ases. Bioinformatics, multiple hyp othesis testing, false disco very rates, data integration, empirical Bay es. This is an electro nic r eprint of the origina l article published by the Institute of Mathematica l Statistics in The Annals of A pplie d S tatistics , 2008, V ol. 2, No. 2, 71 4–735 . This reprint diﬀers from the o riginal in paginatio n and t ypo graphic detail. 1 2 FERKINGST AD ET AL. often dep endent, and the dep endency s tructure is ill sp eciﬁed, so that the ef- fectiv e num b er of indep en d en t tests is unknown. Sometimes, we exp ect that only a small subset of decisions will hav e a p ositiv e r esult: the solution is then sparse in the huge parameter space. T o d isco v er signiﬁ can t cases, it is neces- sary to dev elop new metho ds that either exploit a v ailable a p riori kno wledge on th e structure of the solution, or merge d iﬀeren t data s ets, eac h addin g information. Benjamini and Ho ch b erg ( 1995 ) prop osed the false disco v ery rate (FDR), wh ic h can adapt automatically to spars ity and h as b een shown to b e asymptotically optimal in a certain minimax sense [Abramo vich et al. ( 2006 )]. FDR adju s tmen ts of p-v alues are n o wada ys routinely p erformed on large scale multiple stud ies in man y sciences and applied areas, from astronom y [Miller et al. ( 2001 )] to genomics [T u sher et al. ( 2001 ); from neu- roimaging [Geno v ese, Lazar and Nic hols ( 2002 )] to ind ustrial organization [Bro wn et al. ( 2005 )]. Ba yesian approac hes are based on the estimation of the p osterior p robabilit y of the null hyp othesis. Efron et al. ( 2001 ) ha v e de- v elop ed the theory of the lo c al false disc overy r ate , b ased on an estimation pro cedure originally deve lop ed by Anderson and Blair ( 1982 ). As the FDR pro v id es a pr obabilit y of misclassiﬁcation f or sets of tests called signiﬁ can t, the p osterior probability that the null hyp othesis is true pro vid es a similar measure, bu t for a lo cal set ab out the particular v alue of the test statistic. Instead of summarizing the data by a test statistic, hierarc hical Ba y esian ap- proac hes h a ve b een dev elop ed that mo d el p arametrically th e full measur ed data [ Baldi and Long ( 20 01 ), Do, M ¨ uller an d T a n g ( 2005 ), Kend ziorski et al. ( 2006 ), Lonnstedt and Sp eed ( 2002 ), Newton et al. ( 200 4 ), and Storey ( 2007 ) also mak es full use of the data in a h yp othesis testing setting. Both approac hes h a ve their strengths and we aknesses, in terms of v alidit y of the distributional assumptions un der the alternativ e h yp othesis, actual av ailabil- it y of the fu ll d ata, computational sp eed and s im p licit y of the metho dology . This pap er assu mes access to sum mary test statistics for ev ery hyp othesis to b e tested. W e p rop ose a sim p le metho dology wh ic h allo ws mo d ulating the p osterior probabilit y of eac h n u ll h yp othesis based on a p r iori additional external in- formation. Assu m e that for eac h null h yp othesis H 0 i a kno w n co v ariate x i is a v ailable whic h could inﬂuence the pr ior probabilit y that H 0 i is true. T yp- ically , the co v ariates x i represent a v ailable kno wledge d eriving from other data and measurement tec hnologies. Our metho d is unsu p ervised and semi- parametric, in the sens e that it is not n ecessary to m o del th e joint d istribu- tion of the test statistics with the co v ariate. By means of empirical Bay es, w e tak e adv an tage of p rior information on th e probabilit y of eac h null h yp othe- sis b eing true, based on such add itional d ata av ailable for eac h single test, to pro du ce a more precise list of rejected n ull hyp otheses. This leads to a mea- sure of the p osterior signiﬁcance of eac h test i , conditional on the co v ariate x i and the data, often leading to greater p o w er. F urthermore, th e ranking UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 3 of the hyp otheses based on th e co v ariate mo dulated p osterior probability is diﬀeren t from the order pro vid ed in absence of the external co v ariate. W e estimate the co v ariate mo dulated p osterior prob ab ility of eac h h y- p othesis by bin ning the data according to the external co v ariate, w ith the help of an appro ximate mixtur e mo d el on p-v alues (Section 2 ). In ference is then based on approxima te Ba yesian inferen ce [ Rue and Martino ( 2007 )], in order to redu ce computational time (Section 3 ). In Section 4 w e present the analysis of exp r ession quan titativ e trait loci (eQTL) data, where gene ex- pression measuremen ts obtained b y m icroarra ys are com b ined with genetic link age analysis. Ou r example explicitly considers an imp ortan t case where (1) th e test is not a comparison, (2) the co v ariate cannot b e used f or testing alone, and (3) there is no w ay (that we kno w of ) to pro d uce a full mo del. W e then compare our approac h w ith the recen tly prop osed Optimal Disco ve ry Pro cedure (ODP) [ Storey ( 2007 )], and sho w that our metho d is op timal in the sen s e of Storey , wh en data are av ailable only at test statistic lev el, within eac h co v ariate-determined bin (Sectio n 5 ). S ection 6 includes a sim ulation study , whic h illustrates the eﬀect of external inform ation on the p o we r of the test. Section 7 present s some discussion and extensions. Our metho d pro v id es a simp le unsup ervised wa y of in corp orating diﬀeren t information in to th e Ba ye sian hyp othesis testing framew ork, w h en fu ll mo deling of the complete data is not p ractical or reliable. 2. Co v ariate-mo dulated multi ple testing. W e consider the s im ultaneous testing of m h yp otheses: H 0 i vs. H 1 i , i = 1 , . . . , m. F or eac h i , a test statistic Z i = z i is calculated. If H 0 i is tru e, then Z i has a kno wn distribution F 0 [e.g., F 0 could b e N (0 , 1) or a t-distribution]. Here, large Z i are evidence against the null hypothesis. Also, for eac h i , we ha ve a co v ariate x i whic h m a y in ﬂ uence th e prior probability that the null hypoth- esis is true. W e consider the x i , i = 1 , . . . , m , to b e kno wn co v ariates. W e can reasonably assu m e that x i do es not inﬂuence the p robabilit y distributions of Z i under H 0 i , so Z i ∼ F 0 under H 0 i no m atter what the v alue of x i is. Ho we v er, for eac h i , the distribution of Z i under H 1 i dep end s on x i . Assume that, f or eac h i , H 1 i = H c 0 i in some space of in terest. Let g ( z i | x i ) b e the d ensit y of Z i giv en x i . F rom the assumptions ab ov e, the follo w ing basic mo d el is deriv ed: g ( z i | x i ) = π 0 ( x i ) g ( z i | H 0 i ) + (1 − π 0 ( x i )) g ( z i | H 1 i , x i ) , (1) where π 0 ( x i ) = P ( H 0 i | x i ). Here, π 0 ( x i ) and g ( z i | H 1 i , x i ), the densit y of Z i giv en x i under H 1 i , are u nkno wn, while the distribution function corresp ond- ing to g ( z i | H 0 i ) is kno w n to b e F 0 . 4 FERKINGST A D ET AL. F or eac h test statistic Z i , th e corresp onding p-v alue P i is giv en by P i = 1 − F 0 ( Z i ). Hence, based on the observe d z i , i = 1 , . . . , m , w e can calculate observ ed p-v alues as p i = 1 − F 0 ( z i ) , i = 1 , . . . , m. W e can wr ite the basic mo del ( 1 ) in terms of p-v alues instead of z-sco res. Let f ( p i | x i ) b e the densit y of p -v alue p i giv en co v ariate x i . Clearly , the distribution of the p-v alue P i under H 0 i is uniform on [0 , 1] and do es not dep end on x i . Accordingly , th e mo del in terms of p-v alues is f ( p i | x i ) = π 0 ( x i ) + (1 − π 0 ( x i )) f ( p i | H 1 i , x i ) . (2) W e d eﬁne the co v ariate-mo dulated p osterior probability of H 0 i as P ( H 0 i | p i , x i ). W e hav e data ( p i , x i ) and need to estimate π 0 ( x i ) and f ( p i | H 1 i , x i ). Di- aconis and Ylvisak er [( 1985 ), Th eorem 1] sho w ed that an y d istr ibution on [0 , 1] can b e we ll appro ximated b y a mixture of b eta d istributions. Allison et al. ( 2002 ) in v estigated this approac h furth er and applied it to estimat- ing the densit y f und er lyin g a sample of p-v alues. In their exp erience with sev eral sets of data, the simplest p ossible mo del, whic h is a mixture of a standard un iform, U [0 , 1], corr esp onding to the tru e null h yp otheses and one b eta comp onen t corresp onding to the alternativ e hypotheses, seemed to b e suﬃcient ; fur thermore, Park er and Rothenber g [( 198 8 ), Section 3] noted that addin g more b eta comp onen ts will in cr ease the num b er of observed r e- jections. T h erefore, usin g only one single b eta comp onen t corr esp onding to the f alse null h y p otheses can b e seen as a c onservative c hoice, as w e a v oid o ve restimating the prop ortion of false null hyp otheses. W e h a ve c h osen to use a m ixture of a uniform d ensit y and a b eta dens ity as our mo del. T o accoun t for the dep endence on x i in f ( p i | x i ), w e bin the p -v alues int o B sets B 1 , B 2 , . . . , B B increasing in x . W e assume th at bins are small enough so that π 0 ( x ) is n early constant in x for eac h b in B j , that is, π 0 ( x ) = π 0 j for all x ∈ B j , and similarly for the parameters of the b eta d istribution. Thus, w e drop the d ep endence on x within eac h bin. W e assu m e that within eac h bin B j , the u niform-b eta mixtur e mo del holds, so f j ( p i ) = π 0 j + (1 − π 0 j ) Γ( ξ j + θ j ) Γ( ξ j )Γ( θ j ) p ξ j − 1 i (1 − p i ) θ j − 1 (3) for p i ∈ B j , where Γ denotes the gamma function, and ξ j > 0 and θ j > 0 are the p arameters of the b eta density in bin B j . The co v ariate-mod ulated p osterior probabilit y of H 0 i corresp ondin g to a p-v alue p i in bin B j is then P ( H 0 i | p i , x i ) = P ( H 0 i | x i ) f ( p i | x i ) = π 0 j f j ( p i ) . UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 5 Since small (la rge) p-v alues should corresp ond to false (tru e) null hy- p otheses, p-v alue den s ities should b e nonincreasing. Hence, we assume th at, in eac h bin j , f j ( p ) is a n on in creasing f unction of p for p ∈ [0 , 1]. [See also W u, Guan and Zhao ( 2006 ), who show that f j ( p ) is alwa ys decreasing for p-v alues from a lik eliho o d ratio test.] In addition, w e mak e the assumption that f j ( p ) is con vex. Th is is done to av oid und erestimation of f j ( p ) near p = 1, whic h w ould lead to und er estimation of π 0 j [since π 0 j = f j (1)]. Without the con- v exit y assump tion, the assum ption of n on-increasingness for a densit y with b ound ed sup p ort may lead to und er estimation du e to a “drop-down eﬀect” near the right en d p oint. S ee Lan gaas, Lindqvist and F erkingstad ( 2005 ) for further d iscussion of the con vexi t y assu mption. It can b e sh o wn that f j ( p ) is n on in creasing an d con vex if and only if ξ j ≤ 1 and θ j ≥ 2. A t this p oin t, we h a ve a separate mod el for eac h bin . The next step is to smo oth o ver the diﬀerent bins and b orro w strength b et ween neighb orin g bins, leading to imp ro ved estimates. T his can b e done u sing a Ba yesia n approac h, as f ollo ws. Deﬁning a smo othness pr ior on the sequ ence of π 01 , . . . , π 0 B is complicated b y the fact that eac h π 0 j is limited to the interv al [0 , 1] . W e add ress this issu e using reparametrization e π 0 j = log π 0 j 1 − π 0 j , so that e π 0 j is deﬁned on the whole real line. Th e sm o othness pr ior is no w tak en to b e f ( e π 01 , . . . , e π 0 B ) ∝ exp − λ 1 2 B X j =2 ( e π 0 j − e π 0( j − 1) ) 2 ! (4) to encourage the parameter v alues in neigh b oring bins to b e similar. Here, λ 1 is the smo othing parameter. Note that ( 4 ) is imp r op er as it is inv arian t to adding any constan t to its arguments. The remaining t wo p arameter sequences, ξ 1 , . . . , ξ B , and θ 1 , . . . , θ B ha ve similar restrictions, as ξ j ∈ [0 , 1] and θ j > 2 , for eac h j . T he sm o othness pri- ors for { ξ j } an d { θ j } are deﬁned similarly as for { π 0 j } , u sing the reparametriza- tion e ξ j = log ξ j 1 − ξ j and e θ j = log( θ j − 2) . Denote the smo othing parameters as λ 2 and λ 3 , resp ectiv ely . No w, denoting the p -v alues in bin j b y p j 1 , p j 2 , . . . , p j m j , the sim u ltaneous p osterior of in terest is th en f ( { e π 0 j } , { e ξ j } , { e λ j } | λ 1 , λ 2 , λ 3 , data) ∝ f ( { e π 0 j } ) f ( { e ξ j } ) f ( { e θ j } ) 6 FERKINGST A D ET AL. (5) × m j Y h =1  π 0 j ( e π 0 j ) + (1 − π 0 j ( e π 0 j )) × Γ( ξ j ( e ξ j ) + θ j ( e θ j )) Γ( ξ j ( e ξ j ))Γ( θ j ( e θ j )) p ξ j ( e ξ j ) − 1 j h (1 − p j h ) θ j ( e θ j ) − 1  . W e p erform inferen ce on the transformed p arameters as this is an uncon- strained parametrization, bu t can easily transform bac k to the original pa- rameters. F or nearest-neigh b orho o d improp er priors, smo othing is regulated b y the smo othing parameters ( λ 1 , λ 2 , λ 3 ). These n eed to b e estimated or tuned in calibration exp eriments. Among the v arious app roac hes for estimating smo othing p arameters, we mentio n cross-v alidation [Thompson et al. ( 1991 )], estimates based on appro ximate mo d els, and fu lly Ba y esian inference [ Kunsch ( 1994 ), Heikkinen and P en ttinen ( 1999 )]. W e ﬁ nd preliminary estimates of the parameters e π 0 j , e ξ j , e θ j ; j = 1 , . . . , B , withou t smo othing; then we ﬁt a Gaussian mo del to eac h parameter and estimate ( λ 1 , λ 2 , λ 3 ) based on the estimated inv erse v ariances. Thus, ˆ λ 1 = B . B X j =2 ( ˆ e π 0 j − ˆ e π 0( j − 1) ) 2 , and ˆ λ 2 and ˆ λ 3 are deﬁned similarly . In addition, w e consider scaling eac h smo othing parameter by a tuning parameter c > 0, leading to c ˆ λ 1 , c ˆ λ 2 , c ˆ λ 3 . The eQTL d ata analysis in S ection 4 is p erformed for b oth c = 1 and c = 5 for comparison. It is easy to estimate the b eta -mixture mo del from equation ( 3 ) with only one bin , B = [0 , 1], thus d iscarding the inform ation in the co v ariate. W e will refer to this as the “one-bin mo del.” The estimated p osterior probabilities from the one-bin mo del are usefu l for assessing th e u s efulness of the infor- mation con tained in the co v ariate, by comparing the num b er of rejectio ns giv en by the one-bin and B -bin m o dels. Finally , note th at the ordering of tests by signiﬁcance will often b e dif- feren t for the co v ariate-mo dulated p osterior pr obabilit y metho d than for metho ds n ot taking th e co v ariate in to accoun t, suc h as Ef ron’s lo cal FDR and the one-bin m o del. If, f or example (as in the data set considered in Section 4 ), the co v ariate-mo dulation function π 0 ( x ) is d ecreasing in x , then tests corresp onding to a low v alue of x ma y mo ve d o wn in the s igniﬁ cance list, and tests corresp onding to a high x may mo ve up in the list. Th is re- ordering of signiﬁcance will b e illustrated for the eQTL d ata set analyzed in Section 4 . UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 7 W e estimate the π 0 j , ξ j , θ j and the co v ariate-mo dulated p osterior prob- abilit y of the H 0 i sim u ltaneously using the app ro ximate Ba yesian metho d describ ed in Section 3 . 3. Computational strategy . This sect ion discusses ho w to devel op an eﬃcien t strategy for doing inf erence from the full p osterior of in terest ( 5 ). Although inference is p ossib le usin g Mark ov c hain Mon te Carlo metho ds, w e need a computationally faster approac h for our pr oblem. T he amount of data in eac h bin, com bined with the Gaussian s m o othing, ju stify to app ro ximate the joint p osterior of { e π 0 j } , { e ξ j } , { e θ j } as Gauss ian. Th e p osterior mean is the mo dal conﬁguration, and the p osterior co v ariance matrix is the inv ers e of the negativ e Hessian at the mo de. Note that the Gaussian app ro ximation is lik ely to b e less accurate without reparametrization, due to the constraints of the p arameters. T o compu te the p osterior mean an d the co v ariance matrix, we need to optimize the log p osterior, that is, the log of ( 5 ). W e d o this in t wo steps; in the ﬁr st step w e compu te reasonable initial v alues for th e parameters in eac h of the B b ins, while in the second step w e start from these initial v alues and optimize the log p osterior with resp ect to all the parameters to lo cate the mo d e: 1. The in itial v alues for th e parameters of int erest are d etermined sequen - tially . W e start with the ﬁr st bin. W e use only th e data in the ﬁrst b in to optimize the log lik eliho o d with resp ect to e π 01 , e ξ 1 , e θ 1 . Then we go on to the second b in usin g the initial v alues found from th e ﬁrst bin to initialize the optimization in b in 2. This pro cess contin ues until all th e B bins are pro cessed. 2. The log p osterior is then optimized with resp ect to all the parameters starting from the initial v alues found in the pr evious step. Man y numerical op timization s c hemes can b e u s ed in eac h of the tw o steps, and w e use the classical Newton–Raphson algorithm. The main motiv ation is that b oth the gradient and th e Hessian of the log p osterior (b oth in the fu ll mo del and for eac h b in separately) are r elativ ely easy to compute. F urther, the Hessian matrix of the log p osterior will b e spars e in the full mo del since the parameters in eac h bin are only linked to the previous and follo wing bin. As a side eﬀect, we can inv ert the negativ e Hessian used at the last iteration to obtain the co v ariance matrix. As all th e parameters are (appro ximately) join tly Gaussian, then so are the p osterior marginals for eac h e π 0 j , e ξ 0 j and e θ 0 j , and they can b e found analyticall y . The p osterior densit y of P ( H 0 i | p i , x i ) for bin j can b e computed from the join t p osterior for e π 0 j , e ξ j , e θ j . Such an approac h is feasible bu t cumb er- some using numerical int egration. F or this reason, we attac k this p roblem 8 FERKINGST A D ET AL. using the delta-metho d: Expand P ( H 0 i | p i , x i ) around the p osterior mean of ( e π 0 j , e ξ j , e θ j ), ( e π ∗ 0 j , e ξ ∗ j , e θ ∗ j ), sa y , to obtain P ( H 0 i | p i , x i ) = a + b T (( e π 0 j , e ξ j , e θ j ) − ( e π ∗ 0 j , e ξ ∗ j , e θ ∗ j )) + · · · . W e now app r o ximate the p osterior of P ( H 0 i | p i , x i ) at bin j , by a Gaussian with mean a and v ariance b T Σ j b , wh ere Σ j is the p osterio r co v ariance of ( e π 0 j , e ξ j , e θ j ). W e ha ve ve riﬁed our Gaussian app ro ximations using long runs of an MCMC algorithm without b eing able to detect any relev ant diﬀerences. 4. eQTL data. Th rough the collection of phenot ypic and genetic d ata for individuals in family clusters, link age analysis is a standard m etho d to map genetic v ariant s that can in ﬂuence a trait to sp eciﬁc regions of the genome [ Ott ( 1999 )]. Recen tly , there was the recognition that the magni- tude of gene expr ession, m easur ed u sing mo d ern microarray tec hnology for man y genes sim u ltaneously , v aries among ind ividuals and often has a ge- netic comp onen t [ Jansen and Nap ( 2001 ), Sc hadt et al. ( 2003 ), Bry s trykh et al. ( 2005 )]. It hence can b e treated as a quan titativ e genetic trait. The lo ci aﬀecting gene expression are referred to as expression quant itativ e trait lo ci (eQTLs). While the same link age m etho dolog y applies, there are sp e- ciﬁc c haracteristics that distinguish the study of eQTLs fr om th e stud y of other more tr ad itional traits. First, often the expressions of tens of thou - sands of genes are studied simultaneously . Second, a v ariant that aﬀects the expression of a certain gene that is lo cated in the imm ediate neighborh o o d of where the gene resides is called a cis-v arian t, whereas a v ariant that can aﬀect the expr ession of the gene, directly or indirectly , b ut is lo cated f ar a wa y (e.g., on a diﬀerent chromosome), is a trans-v arian t. In general, giv en the phenotypic and genetic data, for eac h trait, a link age score can b e calc u- lated f or ev ery p osition on th e genome (referred to as a genome-scan). Here w e fo cus on the test of whether a cis-v ariant exists so that only the link age score ev aluated at the known lo cation of the gene is used . Th e data in volv e the expr essions of 22317 exp r essed sequence tags (ESTs) for 370 Icelandic individuals clustered into 85 families. Th e data stru cture is s imilar to that of Morley et al. ( 2004 ) and Monks et al. ( 2004 ), the d iﬀerence b eing that we ha ve a larger sample s ize and th e gene expressions are m easured in blo o d instead of lym p hoblastoid cell lines. Denote the gene expression (p henot yp e) data by E an d the genot yp e data b y G . Using link age analysis, the join t data ( E , G ) is used to test the null h y p othesis H 0 : No cis-v ariant is aﬀecting E , UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 9 through the cond itional distribution P ( G | E ), for eac h ES T (index drop p ed here for simplicit y). In a link age analysis, as opp osed to an asso ciation anal- ysis, we do not u se the correlation b et w een th e genot yp e data G and E directly . Rather, the genot yp e data are used to trac k the segmen ts of c hr o- mosomes that are sh ared b y relativ es identica l by d escen t. Evidence of link- age of a phenot yp e to a genomic region sho w s u p when relativ es h aving similar p henot yp es s hare a r egion b y descent in excess of what is exp ected based on their kno wn relationships, and relativ es with phenot yp es that are substanti ally diﬀerent ha ve a deﬁcit of sharing. T he test w e u sed is partic- ularly simple. Un der H 0 , G is indep enden t of E , or P 0 ( G | E ) = P ( G ), and the distribu tion of any test statistic has a k n o wn d istribution under H 0 with no nuisance parameters. S p eciﬁcally , what we used h er e is an allele-sharing score, an extension of that describ ed in Kong and C ox ( 1997 ) and closely re- lated to the metho d of S h am et al. ( 2002 ). Through exp onenti al tilting which leads to a one-parameter alternativ e distribu tion for the allele-sharing s core, a simple lik eliho o d ratio test is u sed for eac h E ST. The link age score is cal- culated u sing th e p rogram Allegro [Gudb jartsson et al. ( 2000 )]. Sim u lation sho w s th at the p -v alues calculated using the c hi-squ are distr ib ution as the reference are v ery wel l calibrated, as exp ected. While lin k age analysis inv olv es the stud y of the co-segregatio n of ph eno- t yp es and genetic material, the phenot yp e data alone can b e used to estimate heritabilit y , that is, the strength of the correlation of ph enot yp es among rel- ativ es can b e used to estimate the fraction of the phenot yp e v ariance that is p oten tially accoun ted for by genetic v ariants. Denote the estimated h er- itabilit y b y H ( E ), wh ic h we compute from E using the program SOLAR [ Almasy and Blangero ( 1998 )] for eac h of th e 22317 EST s. It might ap p ear that w e ha ve artiﬁcially partitioned the ov erall informa- tion captured by the join t distrib ution P ( G, E ) int o t wo p arts, one b ased on the conditional distr ib ution P ( G | E ) , and one based on P ( E ) . Ho w ever, in this case, as in m an y other r eal data problems, there is su bstan tial asym- metry b etw een the information captured by P ( G | E ) and P ( E ). As noted ab o ve, the test we used to d irectly test H 0 and compute p-v alues is simple and straigh tforward. It is v ery d iﬀeren t f or H ( E ). S p eciﬁcally , H ( E ) can b e signiﬁcan tly and su bstan tially diﬀerent fr om zero for at least four diﬀeren t reasons: (A) There exists cis-v ariant s that aﬀect E . (B) There exist trans-v arian ts (v arian ts that are lo cated in regions of the genome that are aw a y from the gene for which the expression is mea- sured) that, d ir ectly or in directly , aﬀect E . (C) H ( E ) is capturin g familial clustering/similarities that result from shared en viron m en t among relativ es in stead of genetic factors. 10 FERKINGST A D ET AL. (D) Sub jects and families are often not collected completel y at rand om. Nonrandom ascertainmen t with resp ect to traits such as ob esity , which are associated with some of the expression ph enot yp es, could lead to bias in the estimate of h eritabilit y . Reason (A) ab o v e is the alternativ e for H 0 as sp eciﬁed ab ov e, but E or H ( E ) cannot b e used to d irectly test H 0 against th e tru th of (A). T h erefore, we are not able to use H ( E ) directly to compute p-v alues. Ho wev er, there are ob vious reasons to b eliev e that the pr obabilit y for (A) b eing true is correlated with H ( E ). Making as little assump tions as p ossible, our approac h is to use H ( E ) thr ough a semi-parametric empirical-Ba y es pro cedur e to provide a prior distr ib ution for H 0 and (A). Co v ariate-mo dulated p osterior pr obabilities w ere calculate d as d escrib ed in Sections 2 and 3 . Twen t y bins w ere used , with bin 1 con taining the p- v alues with corresp onding heritabilit y estimated exactly equ al to zero, and the other b in s chosen to con tain an appro ximately equal n u m b er of p-v alues. Smo othing scaling f actors c = 1 and c = 5 were used. R esu lting estimates in four of the bins, for c = 5 , with 0.95 p oint wise s ymmetric cred ib ilit y inter- v als, are sho wn in Figure 1 . Here w e see that, for any giv en v alue of p , the co v ariate-modu lated p osterior pr obabilit y of H 0 decreases for increasing bin index (increasing heritabilit y). Th is eﬀect seems to b e qu ite strong. The one-bin mo del was used to assess the eﬀects of co v ariate-mo dulation. Figure 2 sho ws a comparison b etw een the results of the co v ariate-mod ulated p osterior probab ility metho d and the one-bin, n o-co v ariate metho d. This plot sh ows a 2-dimensional h istogram of the (co v ariate, test statistic) p air for eac h of th e 22317 ES Ts. A higher density of tests (ESTs ) is indicated by a darker gra y tone in the histogram. Ass u me that a test is called signiﬁcan t if the p osterior prob ab ility of the null hypothesis is smaller than 0.05. T he step-lik e s olid curv e is the signiﬁ cance threshold at lev el 0.05 for the 20- bin co v ariate-mo dulated p osterior probabilit y based metho d. The d ashed line sho ws the signiﬁcance th reshold, also at lev el 0.05, for the one-bin , nonco v ariate based, p osterior p robabilit y . A total of 818 tests w er e called signiﬁcan t by b oth metho d s, while 704 tests were called signiﬁcan t b y th e 20-bin metho d only . 53 tests (corresp ond in g to EST ’s w ith low heritabilit y) w ere calle d signiﬁcan t by th e one-bin metho d, but d iscarded as suc h by the co v ariate-mo dulated 20-bin method . T hese results are for smo othing scale c = 5 . Results for smo othing s cale c = 1 are similar, with 817 tests called signiﬁcan t for b oth method s, 738 tests called signiﬁcan t for the 20- bin metho d only , and 54 tests called signiﬁcant for the one-bin metho d only . The imp act of using her itabilit y information is huge. Lo oking at Fig- ure 2 , it is eviden t that there is a large d iﬀerence when u sing th e co v ariate- mo dulated metho d compared to the no-co v ariate m etho d. In Figure 2 the 53 tests in the r egion b elo w the 20-bin threshold and ab o v e the one-bin UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 11 Fig. 1. Estimate d c ovariate-mo dulate d p osterior pr ob abilities of H 0 in bins 1 , 7 , 13 and 20 . The solid line shows the estimate, whil e the dashe d l ines ar e appr oximate p ointwise 95% symmetric cr e dibility intervals. threshold are considered b y the co v ariate-based m etho d as false discov er- ies, obtained err oneously b y a no-co v ariate metho d . T h ese p oin ts are in a region of low er heritabilit y , wh ile th e 704 p oin ts b elo w the one-bin signiﬁ- cance line and ab o v e the 20-bin signiﬁcance curve (whic h are “gained” b y in tro ducing co v ariate-mo dulation) are in the r egion w ith higher heritabilit y . As exp ecte d, th e tests with lo w h er itabilit y are eﬀectiv ely d own-w eigh ted b y the 20-bin metho d , wh ile tests with high heritabilit y are up-weigh ted, in a nonsup ervised f ash ion b y the Ba yesia n rule. Thus, there are t wo p oten- tial gains of using our metho d: Higher o ve rall p o w er , and b etter fo cus on individual ﬁ ndings supp orted by add itional information. It may also b e of inte rest to study the estimated co v ariate-mo dulated functions (i.e., the estimated π 01 , π 02 , . . . , π 0 B ) directly . A plot of π 0 j vs. co- v ariate x for is shown in Figure 3 . Clearly , the dep enden ce on the co v ariate, heritabilit y , is very stron g. The co v ariate-modu lation function also illustrates the eﬀect of smo othing. The ab o v e plot sho ws the case c = 1, while the b elo w plot shows the case c = 5. In this case, smo othing c = 5 s eems most appro- priate, based on the degree of smo othness and the a pr iori exp ectation th at the co v ariate-mo dulation fu nction should b e decreasing in x . 12 FERKINGST A D ET AL. Fig. 2. Two-dimensional histo gr am of p ai rs of c ovariate (heritability) and test statistic (z-sc or e) for e ach of the 22317 tests (EST ’ s). A darker gr ay tone i ndic ates a higher density of p oi nts (tests ) in e ach pixel. T he solid step-like curve shows the signiﬁc anc e thr eshold using the c ovariate-mo dulate d, 20-bin p osterior pr ob abil ity of H 0 . The dashe d line shows the signiﬁc anc e thr eshold for the no-c ovariate, one-bin p osterior pr ob ability. Both thr esholds ar e at p osterior pr ob abili ty level 0.05. 53 tests fal l in the signiﬁc anc e r e gion of the one-bin metho d, but outside the signiﬁc anc e r e gion of the 20-bin metho d, 704 tests fal l i n the signiﬁc anc e r e gion of the 20-bin metho d, but outside the signiﬁc anc e r e gion of the one-bin metho d, while 818 tests ar e within the signiﬁc anc e r e gi on of b oth metho ds. Using heritability has a very lar ge i mp act. The one-bin mo d el giv es the p osterior estimate ˆ π 0 = 0 . 701, with approx- imate symmetric 95% credibilit y inte r v al (0 . 687 , 0 . 714), for the o v erall pro- p ortion of tr ue null hyp otheses π 0 . By comparison, the estimates ˆ π s 0 of Storey ( 2002 ) and ˆ π c 0 of Langaas et al. ( 2005 ) are ˆ π s 0 = 0 . 601 and ˆ π c 0 = 0 . 615, resp ec- tiv ely , so it is p ossible that the one-bin u niform-b eta mixture mo del giv es a conserv ative estimate in this case. As m entioned in Section 2 , the ranking of genes according to the p osterior probabilit y of the alternativ e h yp othesis may b e diﬀeren t for the co v ariate- mo dulated p osterior p robabilit y and for the one-bin, no-co v ariate p osterior UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 13 Fig. 3. Covariate- mo dulation functions for the eQTL data: Estimate d π 0 j versus c ovari- ate (heritability) x . The steps c orr esp ond to bins. Two de gr e es of smo othing ar e shown (sc ali ng factors c = 1 and c = 5 , r esp.). Pointwise 95% symmetric cr e dibi lity intervals ar e shown as dashe d lines. probabilit y . T his is illustrated in Figure 4 , w hic h sho ws the change in rank- ings for the top 100 genes for the 20-bin, co v ariate b ased metho d . Reshuf- ﬂing is quite evident. Rank information is imp ortan t for fur ther analysis 14 FERKINGST A D ET AL. and exp erimen ts on the most promising ES T ’s, with p oten tial impact on drug-disco very plans. 5. Connections to other metho d s and optimalit y . Sev eral multiple test- ing metho ds use, for eac h test, the full lik eliho o d of the d ata rather than a one-dimensional p-v alue/statistic su mmary . Use of the full lik eliho o d im- plies that tests are r eordered with resp ect to plain p -v alue ranking, thanks to in formation not conv ey ed by th e corresp onding un iv ariate p -v alues. T h is happ ens also in our approac h, b ut th e additional information comes from Fig. 4. Comp arison of r anks b ase d on no-c ovariate (1-bin) p osterior pr ob abili ty versus c ovariate-mo dulate d (20-bin) p osterior pr ob abil ity for the eQTL data. The EST ’ s with low- est c ovariate-mo dulate d p osterior pr ob ability of H 0 ar e r anke d along the left y-axis. Each se gm ent ends on the right y-axis of the plot on the r ank given to that EST by the one-bin metho d. Se gments moving down indic ate EST ’ s which ar e mor e str ongly cis-r e gulate d ac- c or ding to the c ovariate-mo dulate d p osterior pr ob ability, whi l e se gments m oving up i ndic ate the opp osite. R eshuﬄing of r anks is due to the m er gi ng of expr ession data with heritability know le dge. Some lines esc ap e the plot; for these the ﬁnal r ank is written on the se gment. UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 15 external co v ariates, wh ile in the full-lik eliho o d (FL) metho ds there is a b or- ro win g of strength b etw een tests (in ternally). In some cases our “external co v ariate” may b e incorp orated into the like- liho o d mo del (and thus b ecome “in tern al” in our terminology), and the FL metho ds describ ed b elo w may b e used directly . How ev er, in many cases this will b e imp ossible, impr actical or unnatural. Our eQT L application b elo w is an example wh er e it wo u ld b e inapp ropriate to use the cov ariate in this w ay , as wa s explained in Section 4 . Our method has the adv ant age that since ev erything is done conditionally on the cov ariate, no mo del for the co v ariate is needed. In this sense, our approac h uses a min im al set of assumptions to pro vide a generally applicable w ay of us in g co v ariate information. FL m etho ds are cur r en tly implement ed for k -class comparisons (like the d iﬀeren tial expr ession problem), while our metho dology may readily b e applied for any m ultiple testing p roblem wh ere external cov ariate information is av ailable. Recen tly , Storey ( 2007 ) and Storey , Dai and Leek ( 2007 ) in tro duced a fre- quen tist FL metho d called the Optimal Disco v ery Pro cedure (ODP). The ODP is a theoretically optimal p ro cedure b ased on a generalized lik eliho o d ratio thresholdin g function. Th e optimalit y goal is that of maximizing the exp ected n u m b er of true p ositiv es for eac h ﬁ xed exp ected n u mb er of f alse p ositiv es, where the former exp ectation is calculat ed u nder the s ubset where the alternativ e is true and the latter exp ectation is calculated und er th e su b - set wher e the null is true. In the O DP metho d, t wo main steps are distinguished: 1. ordering tests b y signiﬁcance, and 2. c ho osing a cutoﬀ to obtain a sp eciﬁed err or rate. The pr o cedure is d eﬁned as follo ws. Supp ose that we ha v e m s igniﬁ cance tests p erformed on observ ed data sets y 1 , y 2 , . . . , y m , where y i is the v ector of data for test i . T est i has densit y f i under H 0 and density g i under H 1 . As- sume that H 0 i is true for i = 1 , . . . , m 0 and H 1 i is true for i = m 0 + 1 , . . . , m . The ODP for test i is then giv en by the follo win g signiﬁcance thresholding function: S ODP ( y i ) = f 1 ( y i ) + · · · + f m 0 ( y i ) + g m 0 +1 ( y i ) + · · · + g m ( y i ) f 1 ( y i ) + f 2 ( y i ) + · · · + f m 0 ( y i ) , (6) rejecting null hyp othesis H 0 i if and only if S ODP ( y i ) ≥ λ . Notice that step 1 ab o ve is here accomplished b y the sp eciﬁcatio n of th e thresholding fu n c- tion ( 6 ), while step 2 is done by c ho osing λ . In our app roac h the cut- oﬀ is immediate to ﬁnd, sin ce w e rank according to p osterior probabili- ties. Since the expression ( 6 ) for S ODP ( y i ) in v olv es seve ral unknown quant i- ties, the actual pro cedure is based on an estimated thresholding function 16 FERKINGST A D ET AL. ˆ S ODP ( y i ). Storey , Dai and Leek ( 2007 ) prop ose to assume norm ality for the y i , and plug the m axim um lik eliho o d estimat e in to th e Gaussian den- sit y functions to estimate g i . [Note th at ˆ g m 0 + j ( y i ) is estimated using data y m 0 + j .] In addition, in the p resence of n uisance parameters, it is necessary to pro v id e a p reliminary estimate of the summation of the tru e null densities. This ma y b e done by ranking the tests b y ord inary univ ariate signiﬁcance criteria (e.g., p-v alues), estimating the prop ortion π 0 of tr u e null hypothe- ses, and then deciding that the m (1 − ˆ π 0 ) tests with sm allest p-v alues hav e a true H 1 , wh ile the rest ha ve a true H 0 . Op timalit y is lost at this p oin t, but satisfactory r esults are rep orted in pr actice . Ch oice of λ is d one using a b ootstrappin g pro cedur e, describ ed in the Onlin e Supp lemen t of Storey , Dai and Leek ( 2007 ). The ODP p ro cedure is closely r elated to our metho d . T o see this, ﬁrst disregard the presence of the external co v ariate, and assu m e that eac h null h y p othesis H 0 i has the same p r obabilit y π 0 of b eing tru e. In this case, it is sho wn in Storey ( 2007 ) (p age 364) that the ODP thresholding function is equiv alent to the th r esholding f unction given by the p osterior probabil- it y that the alte rnativ e hypothesis is tru e giv en th e data y i . This implies that the ranking usin g this p osterior pr obabilit y is optimal in the ODP sense. No w, consider the use of external co v ariates. Our method is based on the p osterior probability of H 1 i giv en a one-dimensional (p-v alue/test statistic) summary of the data and the co v ariate x i . Thus, our metho d may b e seen as a version of the ODP metho d in the setting where we only us e a p-v alue/test statistic sum mary . T h erefore, b y optimalit y of the ODP , our metho d is optimal within e ach bin among m etho ds based on one-dimensional summary statistics, bu t optimalit y o ver all bins is n ot guarante ed . It should b e noted that al l metho d s that are based on the p osterior probabilit y of the H 0 i ( H 1 i ) can b e seen as v arian ts of the ODP method in this general sense. It is a disadv an tage of the ODP m etho d to assume n ormalit y (or an y other sp eciﬁc parametric mo del und er H 1 ) of th e measurement s y and mak e an ad ho c preliminary guess of the status of eac h n ull hyp otheses. In addition, the use of p lug-in estimation is an iss ue. O u r metho d is based on m uc h fewer assumptions, basically only that v alid p-v alues may b e calculated, and that p-v alues/test statistics are indep enden t of the co v ariate under the n ull. In this sense, our approac h is u nsup ervised and data driv en, and allo ws to incorp orate co v ariates in a very simple and intuitiv e w a y . F urth er m ore, our co v ariate-mo dulation id eas may b e combined with the full-lik eliho o d version of the ODP in at least t wo w a ys. On e approac h is to bin the data y i b y the co v ariate and estimate the ODP separately in eac h bin , extending a suggestion in Storey ( 2007 ) (p age 353 ) to an exter- nal co v ariate. A problem with this approac h is that is not clear how one could smo oth b et we en bins, wh ic h should b e b eneﬁcial. Another approac h UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 17 is to ﬁrst apply the ODP metho d to the data ignoring the co v ariate, cal- culate ODP-based adjusted p -v alues [as describ ed in the Su p plemen tary Material of Storey , Dai and Leek ( 2007 ), equation (10), page 5], and ﬁn ally apply our metho d using the ODP-based p-v alues and the external co v ari- ate. Other FL metho ds are based on hierarchical Ba yesia n mixtu r e mo d eling. Relev ant references include Baldi and Long ( 2001 ), Do, M ¨ uller and T ang ( 2005 ), Kendziorski et al. ( 2006 ), Lonnstedt and Sp eed ( 200 2 ), Newton et al. ( 2004 ). As in our approac h , inference is b ased on p osterior probabilities of the null and alternativ e hypotheses. These metho ds require fu ll Ba y esian mo deling and distribu tional assumptions, which are lo ok ed at with skep- ticism b y practitioners used to un sup er v ised testing. As for the ODP , the hierarc h ical Ba y esian m ixture m o dels m ay b e combined w ith our approac h. This can agai n b e done by b in ning, and in the Ba y esian approac h s m o othing b et ween bin s is no w more immediately applicable. Ho wev er, the use of our approac h in a fully Ba yesian setting m a y b ecome computationally demand - ing. The one-bin mo del d iﬀers from Efron’s lo cal FDR in that the one-bin mo del assumes a sp eciﬁc parametric mixtur e mo del for the p-v alues, whereas in the lo cal FDR metho d the densit y of the p -v alues is estimated using para- metric smo othing of the p-v alue histogram. A b y-pro duct of the estimation of the one-bin mo del is an estimate of π 0 , the probabilit y that a giv en null h y p othesis is tru e (or, equiv alen tly , the p r op ortion of tru e null hyp othesis). This is an inte resting quan tit y in its o wn righ t, and several metho ds ha ve b een prop osed for its estimatio n , suc h as th e estimato r of Storey ( 200 2 ), and the con v ex d ecreasing estimator of Langaas, Lind qvist and F erkingstad ( 2005 ). When estimating th e full B -bin mo d el, similarly an estimate of π 0 j can readily b e found using the method s of Storey ( 2002 ) or Langaas, Lind qvist and F erkin gstad ( 2005 ) in eac h bin j , and this ma y b e compared to the estimated π 0 j from the B -bin mo del. 6. Sim u lation exp eriment. This sim ulation exp erimen t sho ws that the estimated co v ariate-mo dulated p osterior probabilities of the null hyp othe- ses are close to the true v alues in a con trolled setting, and visualizes the eﬀect of v arying degrees of co v ariate mo d u lation. Simulati ons we re d one for v aryin g degrees of co v ariate-dep en dence and v arying ov erall prop ortions of true null h yp otheses. The main pro cedure for the generation of eac h sim u lated d ata set is as follo ws: First, co v ariates x i , i = 1 , . . . , m , are sampled iid from Unif [0 , 1], m = 30000. A cov ariate-modulation function π 0 ( x ) is sp eciﬁed (see b elo w). F or eac h i , we let H 0 i b e tr u e with probabilit y π 0 ( x i ), and let H 1 i b e true with pr obabilit y 1 − π 0 ( x i ). W e sample z i from N (0 , 1) if H 0 i is true, and 18 FERKINGST A D ET AL. sample z i from N (2 , 1) otherwise. The samp led z i are then test statistics f or m tests of H 0 i : θ = 0 vs. H 1 i : θ = 2, where Z i ∼ N ( θ , 1). As a ﬁrst step in c ho osing a suitable cov ariate-modu lation function, w e decide on an ov erall lev el for the p r op ortion of tru e null hyp otheses, that is, w e set ¯ π 0 ≡ R 1 0 π 0 ( x ) dx to some ﬁ xed num b er. Th e strength of the d ep en- dence on the cov ariate is determined by the steepness of th e function π 0 ( x ). W e ha ve c hosen the follo wing parametric f orm for π 0 ( x ): π 0 ( x ) = exp( − α − ( β − α ) x γ ) , (7) where α , β and γ are p ositiv e constan ts, and π 0 ( x ) is d ecreasing if and only if α > β . W e ha ve conﬁned our sim ulation exp eriment to decreasing π 0 ( x ). F or eac h v alue of ¯ π 0 , w e set the strength of the co v ariate-dep endence by c ho osing appropriate α and β , implicitly setting π 0 (0) and π 0 (1) since α = − log( π 0 (0)) and β = − log( π 0 (1)), an d then c h o osing γ s uc h that R 1 0 exp( − α − ( β − α ) x γ ) dx = ¯ π 0 . The sim ulation results rep orted b elow are all from 100 0 run s of the al- gorithm d escrib ed in S ection 3 . Po in twise medians, 5% qu an tiles and 95% quan tiles were calculated based on the 1000 run s. In eac h run a total num- b er of 30000 test statistics were simulat ed. In eac h case, one-bin no-co v ariate and 10-bin co v ariate-mo dulated p osterior probabilities were calculated for the same data sets. T rue p osterior p r obabilities w ere calculated as d escrib ed in the S upplementary Material [F erkingstad, F rigessi, Rue, Th orleifsson and Kong ( 2008 )]. W e ﬁrst consider tw o sim ulated data sets with relativ ely wea k co v ariate- mo dulation. W e c ho ose α , β and γ in ( 7 ) such th at ¯ π 0 = 0 . 5, π 0 (0) = 0 . 55 and π 0 (1) = 0 . 45. Th e top ro w in Figur e 5 show the estimated p osterior probabilities for the 10-bin and one-bin metho ds, compared to the true p os- terior probabilities, in b ins 1, 3, 5, 7 and 9. Th e red cu rv es sh ow the median estimated co v ariate-modu lated p osterior p robabilities (solid lines) together with 0.05 and 0.95 quan tiles (d ashed lines) f rom the 1000 run s. The green curv es sho w the tru e p osterior probabilities. On ly p osterior p robabilities for p-v alues in th e range [0 , 0 . 1] are sho w n, sin ce th e p osterior probabilities are to o high to b e of interest outside th is ran ge. Notice ﬁrst th at the one-bin p osterior probability is inv arian t o ve r diﬀerent bins, since this d o es not de- p end on co v ariate-information. Ho we v er, the estimated co v ariate-mo dulated p osterior pr obabilit y and the tru e p osterior probabilit y v ary o v er diﬀerent bins. F rom the t wo top ro ws of Figure 5 , we see that the estimated 10-bin p osterior probability is v ery close to the truth. The one-bin estimated p os- terior pr obabilit y is a go o d estimate for bin s corresp on d ing to a mo d er ate v alue of the co v ariate (bin 5), but deteriorates for more extreme v alues of the cov ariate (other bins). This is as exp ecte d : Since co v ariate mo d ulation is quite wea k, the in formation in the co v ariate should not c h ange the dis- co v eries. This result is consisten t with w hat we sa w for a second real data UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 19 Fig. 5. Simulate d data sets. The top r ow shows estimate d p osterior pr ob abil ities for ¯ π 0 = 0 . 5 wi th we ak c ovariate-mo dulation, the m i dd le r ow shows r esults for ¯ π 0 = 0 . 9 wi th we ak c ovariate-mo dulation and the b ottom r ow shows r esults for ¯ π 0 = 0 . 5 with str ong c o- variate-mo dulation. T en bins wer e use d i n the c alculations, but only bins 1, 3, 5, 7 and 9 ar e shown her e. F or e ach bin, the b old gr e en curve shows the true p osterior pr ob abili ti es. The r e d curves show estimate d 10-bin c ovariate-mo dulate d p osterior pr ob abili ties, wher e the soli d curve is the me dian of the 1000 sim ulations, and the dashe d curves ar e 0.05 and 0.95 quantiles. Simi larly, the blue curves show estimate d no-c ovariate (one-bin) p osterior pr ob abilities (me dian with 0.05 and 0. 95 quantiles). set, as r ep orted in the Sup plemen tary Material [F erkingstad, F rigessi, Rue, Thorleifsson and Kong ( 2008 )]. As the second case of we ak cov ariate-modu lation, we c ho ose ¯ π 0 = 0 . 9, π 0 (0) = 0 . 95 and π 0 (1) = 0 . 85. Results, shown in the middle ro w of Fig- ure 5 , are similar to what we sa w in the ﬁrst case. Again, it is seen that the co v ariate-based estimation metho d give s a very close appro ximation to the truth, while the nonco v ariate-based m etho d misses oﬀ sligh tly , particularly for th e bins corresp onding to extreme (particularly small or large) v alues of the co v ariate. 20 FERKINGST A D ET AL. Finally , we consider a sim ulated data set where there is strong co v ariate- mo dulation. Here, w e ha v e chosen ¯ π 0 = 0 . 5, π 0 (0) = 0 . 9 and π 0 (1) = 0 . 1. The results are sh o wn in the b ottom ro w of Figure 5 . W e again see that the es- timated 10-bin co v ariate-mod ulated p osterior probabilities are v ery close to the true v alues. Clearly , the co v ariate-mo dulation metho d do es we ll in cap- turing the information in the co v ariate in this case. Th ere is a big d iﬀerence b et ween the 10-bin p osterio r probab ility an d the no-co v ariate metho d. The no-co v ariate metho d d o es n ot work well in any b ins (except p ossibly b in 5). When co v ariate information is a v ailable, using it, ev en in an un s up ervised fashion, h as a p oten tially imp ortan t impact. 7. Discussion. W e deﬁn ed the co v ariate-modu lated p osterior probabil- it y as P ( H 0 i | p i , x i ), where p i is the i th p-v alue and x i the corresp onding co v ariate for i = 1 , . . . , m hypothesis tests. More generally , we can consider P ( H 0 i | p i , x ), where x = ( x 1 , x 2 , . . . , x m ) is a vec tor of co v ariates. W e use the assump tion that g ( z i | H 0 i , x i ) = g ( z i | H 0 i ), wh ere g denotes the densit y of th e test statistics. Ho we v er, th e metho d could also b e adapted to the case w here g ( z | H 0 i , x i ) is known as a function of x i . F urth er m ore, additional b eta comp onen ts can b e ad d ed to the mixture ( 3 ). Also, it is p ossible to incorp orate m easuremen t error in the co v ariate, by adding a further lev el to the Ba yesian hierarchical mo del. In the Supplement ary Material [F erkingstad, F rigessi, R u e, Th orleifsson and Kong ( 2008 )] w e test for d iﬀeren tially expr essed genes, b et w een breast cancer tumors with and w ithout a mutation in the gene TP53. Here, we use genome-wide CGH copy num b er alteration as the co v ariate mo dulating the a priori b elief in eac h null hyp othesis. This d ata set is c hosen to illustrate a case wh ere the information con tained in the co v ariate is limited, but its use do es not corrupt the r esults. An imp ortan t application w here a co v ariate can pla y an imp ortan t r ole is genome-wide asso ciation stud ies where h u ndreds of thousands of SNPs (sin- gle nucleo tide p olymorph isms) are tested for asso ciation to quanti tativ e or qualitativ e traits. With gene expression traits, p-v alues calculated for SNPs residing in the n eigh b orho o d of the corresp ond ing genes (cis-v ariants) could again b e ev aluated u sing our app roac h. Here b oth the heritabilit y and the cis link age score could b e u sed as co v ariates, either individually or join tly . A r e- lated application is an asso ciation study for a disease trait (e.g., diab etes). Here the notion of a cis v ariant do es n ot apply and it is exp ected that a large n u m b er of SNPs, but n onetheless a very small fraction of all the SNPs typ ed, could b e associated to the disease. A common design is a case-con trol stu d y , and the asso ciation scores for the hundreds of thousands of SNPs genot yp ed and tested ha ve to b e ev aluated taking multiple comparisons into accoun t. Indeed, it has b een prop osed that the link age scores resulting from a family study of the disease could b e used to assist the analysis through d iﬀeren tial UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 21 w eighting of the p-v alues [Ro ed er et al. ( 2006 ), Genov ese et al. ( 2006 )] with the wei gh ts working like prior probabilities. The metho d we p rop osed here is an alternativ e to the weigh ting metho d. This application also raises th e p ossibilit y that instead of mo deling the distr ibution of th e p-v alues und er H 1 directly , one can mo del the eﬀect sizes of the asso ciation; giv en a d istribu- tion of the eﬀect sizes, there is a corresp onding distribution for the p-v alues. Mo deling this w a y can in theory pr o vide for eac h SNP , at the end of the analysis, not only the p robabilit y th at the alternativ e hyp othesis is tru e, but also the p osterior d istribution of its eﬀect size. Th e shrink age c haracteristics of such metho d s could b e a p oten tial solution to the w in ner’s curse, wher e the observe d eﬀect sizes for the SNPs that sho wed the strongest associations tend to b e biased upw ard ev en when they are tru e p ositiv es. Ac kn o wledgment s. W e than k the Editor, an Asso ciate E ditor and an anon ymou s referee for h elpful commen ts that h a ve led to an impro ved pap er. SUPPLEMENT AR Y MA TERIAL Unsup ervised empirical Bay esian m ultiple testing with external co v ari- ates (doi: 10.121 4/08-A OAS158 S UPP ; .p df ). REFERENCES Abramovich, F. , B enjamini , Y. , Donoho, D. L. and Johnstone, I. M. ( 2006). Adapt - ing to un known sparsit y by controlling the false disco very rate. Ann. Statist. 34 584–653. MR2281879 Allison, D. B. , Ga dbur y, G . L. , Heo, M. , Fern ´ andez, J. R. , Lee, C. , Proll a, T. A. and Weind ruch, R. (2002). A mixture mo del approac h for the analysis of microarray gene expression data. Comput. Statist. Data Anal. 39 1–20. MR1895555 Almasy, L. and Blangero, J. (1998). Multip oint quantitativ e-trait link age analysis in general p edigrees. Am. J. Hum. Genet. 62 1198–1211. Anderson, J. A. and Blair, V. (1982). P enalized maximum likelihood estimation in logistic regression and discrimination. Bi om etrika 69 123–136. MR0655677 Baldi, P. and Long, A. ( 2001). A Ba yesian framew ork for t he analysis of microarra y expression d ata: Regularized t-test and statistical inferences of gene changes. Bioinf or- matics 17 509–519. Benjami ni, Y. an d Hochberg, Y. (1995). Con t rolling t he false discov ery rate: A practical and p ow erful approach to multiple testing. J. R oy. Statist. So c. Ser. B 57 289–300. MR1325392 Bro wn, L. , Gan s, N., Ma ndelbaum, N. G. A. , Sak ov, A. , Shen, H . , Zel tyn, S . and Zhao, L. (2005). Statistical analysis of a telephone call center: A queueing-science p erspective. J. Am. Statist. Asso c. 100 36–50. MR2166068 Bystr y kh, L. , W eersing, E. , Dontje, B. , Sutton, S. , Pletcher, M. T. , W il tshir e , T. , Su, A. I. , Vellenga, E. , W an g, J. , Manl y, K. F. , Lu, L. , Chesler, E. J. , Alber ts, R. , Jansen, R. C. , Wi lliams, R. W. , Cooke, M. P. and de H aan, G. (2005). U ncov ering regulatory pathw ays that aﬀect hematop oietic stem cell function using “genetical genomics.” Nat. Genet. 37 225–232. 22 FERKINGST A D ET AL. Diaconis, P. and Yl visaker, D. (1985). Quantifying prior opinion (with discus- sion). In Bayesian Statistics 2 ( V alencia , 1983 ) 133–156. North- Holland, Amsterd am. MR0862488 Do, K. , M ¨ uller, P. and T ang, F. (2005). A Bay esian mix ture mo d el for diﬀerential gene expression. J. Roy . Statist. So c. Ser. C 54 627–644. MR2137258 Efr on, B. , Tibshirani, R. , Storey, J. and Tush e r, V. (2001). Empirical Ba yes analysis of a microarray exp erimen t. J. Amer. Statist. Asso c. 96 1151–1160. MR1946571 Ferkingst ad , E., Frige ssi, A . , Rue, H., Thorleifsson, G. and K ong, A. (2008). Supplement to “Unsup ervised empirical Bay esian multiple testing with external co v ari- ates.” DOI: 10.1214/08-A OAS158SUPP . Genovese, C. R. , Laz a r, N. A . and Nichols, T. (2002). Thresholding of statistical maps in functional neuroimaging using the false disco very rate. Neur oImage 15 870– 878. Genovese, C. R. , Roeder, K. and W asserman, L. (2006). F alse disco very rate con trol with p -v alue weigh ting. Biometrika 93 509–524. MR2261439 Gudbjar tsson, D. F. , Jonasson, K. , Frigge , M. L. and Kong, A. (2000). Allegro, a new computer p rogram for multipoint link age analysis. Nat. Genet. 25 12–13. Heikkinen, J. and Pentti nen, A. (1999). Bay esian smo othing in the estimation of the pair p otential function of gibbs p oint processes. Bernoul li 5 1119–113 6. MR1735787 Jansen, R. C. and N ap, J. P. (2001). Genetical genomics: The added v alue from segre- gation. T r ends Genet 17 388–91. Kendziorski, C. , Che n, M . , Yuan, M. , Lan, H. and A ttie, A. (2006). S t atistical metho d s for exp ression quantitativ e trait lo ci (eQTL) m ap p ing. Biometrics 62 19–27. MR2226552 Ko n g, A . and Co x, N. J. (1997). Allele-sharing mo dels: LO D scores and accurate link age tests. Am. J. H um . Genet. 61 1179–11 88. Kunsch, H . (1994). R obust priors for smo othing and image restoration. Ann. Inst. Statist. Math. 46 1–19. MR1272743 Langaas, M. , Lindqvist, B. H. and Ferkingst ad, E. (2005). Estimating th e proportion of true null hyp otheses, with application to DNA microarra y data. J. R. Stat. So c. Ser. B Stat. Metho dol. 67 555–572. MR2168204 Lonnstedt, I. and Speed , T. (2002). Replicated microarray data. Statist. Sinic a 12 31–46. MR1894187 Miller, C. J. , Genovese, C. , Ni chol, R. C. , W a sse rm an, L. , Connoll y, A. , Re- ichar t, D . , H opkins, A . , Schneider, J. and Moore, A. (2001). Control ling the false disco very rate in astrophysi cal data analysis. Astr on. J. 122 3492–3505 . Monks, S. A. , Leonardson, A. , Zhu, H. , Cundiff, P. , Pietrusiak, P. , Edw ards, S. , Phillips, J. W. , Sachs, A. and Schadt, E. E. (2004). Genetic inheritance of gene expression in human cell lines. Am. J. Hum. Genet. 75 1094–1105. Morley, M. , Molony, C. M. , Weber, T. M. , Devlin, J. L. , Ewens, K. G. , Spie lma n , R. S. and C heung, V. G. (2004). Genetic analysis of genome-wide v ariation in human gene expression. Natur e 430 743–747 . Newton, M. , Noueyri, A. , Sarkar, D. and Ahlquist, P. (2004). Detecting d iﬀerenti al gene expression with a semiparametric h ierarc hical mixt u re meth o d. Biostatistics 5 155–176 . Ott, J. (1999). A nalysis of Human Genetic Linkage , 3rd ed . Johns Hopk ins Un iv. Press, Baltimore. P arker, R. A. and Rothenber g, R. B. ( 1988). Id entifying imp ortant results from mul- tiple statistical t ests. Stat. Me d. 7 1031–10 43. UNSUPER V ISED MUL TIPLE TESTING WITH EXTERN AL COV ARIA TES 23 Ro eder, K. , B acanu, S.-A. , W a sserman, L. and Devlin, B. (2006). Using link age genome scans to improv e p ow er of association in genome scans. Am. J. Hum. Genet. 78 243–252. Rue, H. and Mar tino, S . ( 2007). Approximate-Ba yesian inference for hierarc hical Gaussian Marko v random ﬁeld mo dels. J. Statist. Plann. Infer enc e 137 3177 –3192. MR2365120 Schadt, E. E. , M onks, S. A. , Drake, T . A. , Lusis, A. J. , C he, N. , Colina yo, V. , Ruff, T. G. , Milligan, S. B . , Lamb, J. R . , Ca ve t, G. , Linsley , P. S. , Mao, M. , Stoughton, R. B. and Fri end, S. H. (2003). Gen etics of gene expression survey ed in maize, mouse and man. Natur e 422 297–302. Sham, P. C. , Purcell, S. , C herny, S. S. and Abecasi s, G. R. (2002). Po werful regression-based q uantitativ e-trait link age analysis of general p edigrees. Am. J. Hum. Genet. 71 238–253. Storey, J. D. (2002). A direct approach to false disco very rates. J. R. Stat. So c. Ser. B Stat. Metho dol. 64 479–498 . MR1924302 Storey, J. D. (2007). The optimal disco very p rocedu re: A new approach to simultaneo us signiﬁcance testing. J. R. Stat. So c. Ser. B Stat. Metho dol . 69 347–368. MR2323757 Storey, J. D. , Dai, J. Y. and Le ek, J. T. (2007). The optimal disco very p rocedu re for large-scale signiﬁcance testing, with applications to comparative microarra y exp er- iments. Bi ostatistics 8 414–432. Thompson, A. , Bro wn, J. , Ka y, J. and Titterington, D. ( 1991). A study of meth- ods of choosing the smooth in g parameter in imagerestoration by regularization. IEEE T r ans. Pattern Anal. Machine Intel l. 13 326–339. Tusher, V. G. , T ibshirani, R. and Chu, G. (2001). Signiﬁcance analysis of microarra ys applied to th e ionizing radiation resp onse. PNAS 98 5116– 5121. Wu, B. , G uan, Z. and Zh ao, H. (2006). Para metric and nonparametric FDR estimatio n revisited. Biom etrics 62 735–744. MR2247201 E. Ferkingst ad Dep ar tment of Biost atis tics University of Oslo PO box 1 122 Blindern N-0317 Oslo Nor w a y E-mail: egil.ferkingstad@medisin.uio.no A. Frigessi (sfi) 2 —St a tistics for Inn ov at ion The Nor wegian Computing Centre PO box 1 14 Blindern N-0314 Oslo Nor w a y E-mail: arnoldo.frigessi@medisi n.uio.no H. Rue Dep ar tment of Ma thematic al S ciences Nor wegian University of S cience and Technology N-7194 Trondheim Nor w a y E-mail: ha v ard.rue@math.nt n u. no G. Thorleifsson A. Kong Decode Genetics Sturluga t a 8 IS-101 Reykja vik Iceland E-mail: gudmar.thorleifsson@decode.is augustine.k ong@decode.is

Unsupervised empirical Bayesian multiple testing with external covariates

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment