Optimal inference via confidence distributions for two-by-two tables modelled as Poisson pairs: fixed and random effects

Optimal inference via conﬁdence distributions for t w o-b y-tw o tables mo delled as P oisson pairs: ﬁxed and random eﬀects C ´ eline Cunen* Univ ersity of Oslo, Oslo, Norw a y - cmlcunen@math.uio.no Nils Lid Hjort Univ ersity of Oslo, Oslo, Norw a y - nils@math.uio.no Abstract This pap er presents metho ds for meta-analysis of 2 × 2 tables, b oth with and without allowing heterogeneity in the treatmen t eﬀects. Meta-analysis is common in medical researc h, but most existing me thods are un- suited for 2 × 2 tables with rare ev ents. Usually the tables are modelled as pairs of binomial v ariables, but w e will mo del them as Poisson pairs. The metho ds presented here are based on conﬁdence distributions, and oﬀer optimal inference for the treatment eﬀect parameter. W e also prop ose an optimal metho d for inference on the ratio b et ween treatment eﬀects, and illustrate our metho ds on a real dataset. Keyw ords : combining information; meta-analysis; rare even ts. 1. In tro duction Meta-analysis is a class of widely used methods for com bination of information from independent studies. In medicine, meta-analyses of 2 × 2 tables are esp ecially common. There are for example many studies in vestigating whether a certain drug is related to harmful side-eﬀects. The usual set-up is to divide patients in to a treatment and con trol group, and coun t the n umber of side-eﬀect ev ents in both groups. If the side- eﬀect in question is v ery rare, individual studies can hav e lo w pow er, and meta-analysis is therefore esp ecially imp ortan t (Liu et al., 2014). The most common methods for meta-analysis of 2 × 2 tables rely on large sample appro ximations, whic h mak es them unsuited for analysis of tables with rare ev en ts (Bradburn et al., 2007). This applies for example to the well-kno wn Mantel-Haenszel metho d, the Peto metho d and the DerSimonian and Laird metho d (Bradburn et al. (2007) and Liu et al. (2014)). Recen tly Liu et al. (2014) introduced certain conﬁdence distributions (CDs) based metho ds for this t yp e of analysis. In this pap er w e present diﬀeren t CD-based methodology . Our methods utilise certain optimality theorems which in particular lead to CDs with a uniformly higher conﬁdence pow er than for CDs constructed via the methods of Liu et al. (2014). The n um be r of even ts (cell-counts) in k 2 × 2 tables are often mo delled as pairs of independent binomial random v ariables ( Y 0 ,i , Y 1 ,i ), with sample sizes ( m 0 ,i , m 1 ,i ) and ev ent probabilities ( p 0 ,i , p 1 ,i ). Here Y 0 ,i is the num b er of ev en ts among the con trol group in study i and Y 1 ,i is the n um b er of ev en ts in the treatment group in study i . Usually , the primary interest is not in the even t probabilities themselv es, but rather in the p oten tial discrepancy betw een the ev ent probabilities in the tw o groups. Whereas suc h a discrepancy can b e deﬁned in diﬀeren t fashions, the usual wa y in biostatistics is via the logistic transform: p 0 ,i = e θ i 1 + e θ i and p 1 ,i = e θ i + ψ i 1 + e θ i + ψ i . In particular, the o dds ratio is OR i = p 1 ,i / (1 − p 1 ,i ) p 0 ,i / (1 − p 0 ,i ) = e θ i + ψ i e θ i = e ψ i . In the case of small even t probabilities, the cell-counts can alternatively b e mo delled as pairs of indep endent P oisson v ariables, Y 0 ,i ∼ P ois( e 0 ,i λ i ) and Y 1 ,i ∼ P ois( e 1 ,i λ i γ i ) . (1) The argument for this is that binomial v ariables con v erge in distribution to P oisson when the sample size is large and the ev en t probabilities are small. Here e 0 ,i and e 1 ,i are kno wn exp osure w eigh ts reﬂecting the sample sizes, i.e. w e will exp ect more cases from a larger study , than a smaller one, indep endent of the ev ent probabilities. W e choose e 0 ,i = m 0 ,i / 100 and e 1 ,i = m 1 ,i / 100 . Th us, λ i can be interpreted as the risk of ev ent p er 100 patien ts in the con trol group, and γ i as the m ultiplica- tiv e parameter asso ciated with the p otentially diﬀerent risk (p er 100 patients) for patien ts in the treatment group compared to the con trol group (henceforth called the treatmen t eﬀect). Inference concerning odds ratio in the binomial model and γ i in the Poisson mo del is similar when ev en t probabilities are small and samples sizes are large. In fact, OR i = p 1 ,i / (1 − p 1 ,i ) p 0 ,i / (1 − p 0 ,i ) ≈ p 1 ,i p 0 ,i ≈ e 1 ,i λ i γ i /m 1 ,i e 0 ,i λ i /m 0 ,i = λ i γ i λ i = γ i . The λ i parameters v ary from table to table. In our pap er we shall consider the ﬁxed eﬀect mo del, where the γ i are equal to a common γ , as well as random eﬀect mo dels, where the γ i are allo wed to diﬀer. Our modus of inference will be that of CDs. A CD C ( θ ) for a parameter θ is a ‘distribution estimator’, i.e. a sample-dep endent cumulativ e distribution function ov er the parameter space summarising all asp ects of frequentist inference for θ (for a thorough introduction to inference with CDs see Sch w eder & Hjort (2015) and Xie & Singh (2013)). In addition to presenting the results as a cumulativ e distribution function, it is fruitful to represen t the distribution of conﬁdence as a conﬁdence curve, deﬁned as cc( θ ) = | 1 − 2 C ( θ ) | . The conﬁdence curve is a funnel plot pointing to the median conﬁdence estimate ˆ θ = C − 1 ( 1 2 ); also, conﬁdence in terv als for all levels α can b e read oﬀ, see for example Figure 1. F or parameters from mo dels of the exp onential class, there exist optimal conﬁdence distributions (Sc hw eder & Hjort, 2015, Ch. 5). These are uniformly most p o werful, in the sense of having low er exp ected conﬁdence loss than all other CDs for a large class of p enalty functions and all parameter v alues (see Sch w eder & Hjort (2002, 2015)). F or b oth the binomial and Poisson mo del these optimal CDs are given in Sch w eder & Hjort (2015, Ch. 14). If the optimal (or other exact) CDs are hard or time-consuming to compute, approximate CDs can b e constructed. There are several metho ds for this, and we will present some examples in section 3. In section 2 the optimal CD for the ﬁxed eﬀect Poisson mo del will b e pro vided, while the Poisson model with heterogeneous treatmen t eﬀects (random eﬀects) is treated in section 4. Section 5 oﬀers some concluding remarks. 2. Optimal conﬁdence distribution for the ﬁxed eﬀect mo del The P oisson ﬁxed eﬀect mo del has the following log-likelihoo d, writing z i = y 0 ,i + y 1 ,i , ℓ ( γ , λ 1 , . . . , λ k ) = k X i =1 { y 1 ,i log γ + z i log λ i − ( e 0 ,i + e 1 ,i γ ) λ i } . (2) This is a log-lik eliho o d of the exp onen tial family class. By theorems in Sch weder & Hjort (2015, Ch. 5) optimal inference for γ is based on its suﬃcien t statistic ( P Y 1 ,i ) given the ancillary statistics ( Z 1 , . . . , Z k ). The conditional distribution of Y 1 ,i | Z i is seen to b e binomial, Y 1 ,i | ( Z i = z i ) ∼ Bin  z i , e 1 ,i γ e 0 ,i + e 1 ,i γ  . (3) F rom this distribution we can construct CDs for γ from each study (with half-correction for discreteness) C i ( γ ) = P γ ( Y 1 ,i > Y 1 ,i, obs | z i, obs ) + 1 2 P γ ( Y 1 ,i = Y 1 ,i, obs | z i, obs ) . (4) Similarly , w e can construct a com bined CD for γ for all the studies, which is uniformly most p ow erful according to Sc hw eder & Hjort (2015, Ch. 5), C ∗ ( γ ) = P γ ( B > B obs | z 1 , obs , . . . , z k, obs ) + 1 2 P γ ( B = B obs | z 1 , obs , . . . , z k, obs ) . where B = P k i =1 Y 1 ,i , and B obs is the observed v alue of B . This is a sum of binomially distributed v ariables with diﬀeren t probability parameters ( e 1 ,i γ / ( e 0 ,i + e 1 ,i γ )), and its distribution might easily be sim ulated. As an illustration of this method, we ha v e applied it to a medical datasets with six studies in v estigating death rates among heart attac k patien ts (Normand, 1999). The treatment group received the drug Lido caine and the con trol group did not. The sample sizes and num b er of deaths in the six studies are listed in T able 1. Study Sample size con trol Even ts control Sample size treatment Even ts treatment 1 39 1 43 2 2 44 4 44 4 3 107 4 110 6 4 103 5 100 7 5 110 3 106 7 6 154 4 146 11 T able 1: Sample sizes and num ber of deaths for the six Lido caine studies. F or the Lido caine data, the combined CD for γ is shown in Figure 1, along with CDs from each study . 0 1 2 3 4 5 0.0 0.2 0.4 0.6 0.8 1.0 γ confidence cur v es Figure 1: Conﬁdence curves for γ for six Lido caine studies (in colour) and combined conﬁdence curve for all the studies (in black). A 95% conﬁdence interv al for γ can b e read oﬀ (dashed line), [1.01,3.01]. 3. Appro ximate conﬁdence distributions for the ﬁxed eﬀects mo del The optimal CD can be slightly time-consuming to compute, but we can construct appro ximate CDs, oﬀering a fast and often accurate alternative. Consider the log-lik eliho o d function of the conditional mo del following from (3). W e learn that the maxim um conditional lik eliho od (MCL) estimator is a monotone function of B = P Y 1 ,i . Since CDs are inv ariant with resp ect to monotone transformations, the optimal CD in the previous section is actually the same as the CD based on the MCL estimator ˆ γ mcl . Ignoring half-corrections, w e get C ∗ ( γ ) = P γ ( B > B obs | z 1 , obs , . . . , z k, obs ) = P γ ( ˆ γ mcl > ˆ γ mcl , obs ) . F or the Poisson mo del the MCL estimator of γ is the solution of the following equation, k X i =1 y 1 ,i ( e 0 ,i + e 1 ,i γ ) − z i e 1 ,i γ γ ( e 0 ,i + e 1 ,i γ ) = 0 . Inciden tally , the solution to the equation ab ov e is also the maximum likelihoo d (ML) estimator. Thus the unconditional and conditional likelihoo ds lead to the same estimators for this Poisson mo del. This pans out diﬀeren tly for other mo dels, as for example the mo del of binomial pairs (Breslow, 1981). Classical large samples metho ds for ML giv e us an appro ximate CD for γ , C approx ( γ ). The distribution of ˆ γ mcl is approximately a normal ( γ , ˆ J − 1 ), with ˆ J the Hessian matrix asso ciated with the log-likelihoo d function. This leads to the follo wing approximate CD, C approx ( γ ) = Φ(( γ − ˆ γ mcl ) / p ˆ J − 1 ) . Alternativ ely an approximate conﬁdence distributions for γ can b e constructed b y χ 2 1 -transforming the de- viance function of the proﬁled likelihoo d (see Sch weder & Hjort (2015, Ch. 2-3)). The k + 1-dimensional log-lik eliho o d function in (2) is proﬁled by maximising o ver λ 1 , . . . , λ k , leading (in this case) to an explicit proﬁle log-lik eliho o d ℓ prof ( γ ) = k X i =1 { y 1 ,i log γ − z i log( e 0 ,i + e 1 ,i γ ) } . The corresp onding deviance function and the resulting conﬁdence curv e are D ( γ ) = 2 { ℓ prof ( ˆ γ ml ) − ℓ prof ( γ ) } and cc a ( γ ) = Γ 1 ( D ( γ )) , where Γ 1 () is the cum ulative distribution function of the χ 2 1 . Both appro ximations w ork reasonably well, esp ecially when there are man y studies (large k ), see Figure 2. 0 1 2 3 4 0.0 0.2 0.4 0.6 0.8 1.0 γ confidence cur v es Figure 2: Optimal conﬁdence curv e for γ (in blac k) for the Lidocaine data, conﬁdence curve based on the appro ximate distribution of the MCL (in red) and conﬁdence curve based on proﬁle deviance (in blue). 4. Optimal and approximate inference with heterogeneous treatment eﬀects In meta-analysis of 2 × 2 tables, the o dds ratio from the binomial mo del is sometimes treated as a random eﬀect (for example in the DerSimonian and Laird metho d). Similarly , cell-counts from 2 × 2 tables can b e mo delled with a Poisson mo del with a non-constant γ parameter. This allows the treatment eﬀect γ i to v ary , as in (1). There are several p ossibilities regarding such mo dels. Our ﬁrst suggestion is based on pairwise comparisons. Consider tw o studies at the time, for example study 1 and 2, with Y 0 , 1 ∼ Pois( e 0 , 1 λ 1 ), Y 1 , 1 ∼ Pois( e 1 , 1 λ 1 γ 1 ) and with Y 0 , 2 ∼ Pois( e 0 , 2 λ 2 ), Y 1 , 2 ∼ Pois( e 1 , 2 λ 2 γ 2 ). W e can construct a CD for the ratio b etw een the tw o treatmen t eﬀects, δ = γ 2 /γ 1 . By arguments similar to those w orked with in section 2, a p ow er optimal CD for δ is found. It takes the form (with half-correction) C 1 , 2 ( δ ) = P δ ( Y 1 , 2 > Y 1 , 2 , obs | w obs , z 1 , obs , z 2 , obs ) + 1 2 P δ ( Y 1 , 2 = Y 1 , 2 , obs | w obs , z 1 , obs , z 2 , obs ) , where W = Y 1 , 1 + Y 1 , 2 , Z 1 = Y 0 , 1 + Y 1 , 1 and Z 2 = Y 0 , 2 + Y 1 , 2 . Thus we need the distribution of Y 1 , 2 | ( W, Z 1 , Z 2 ), and w e ﬁnd that it is an eccentric hypergeometric distribution, which may b e expressed as f ( y 1 , 2 | ( w , z 1 , z 2 )) =  z 1 w − y 1 , 2  z 2 y 1 , 2   e 0 , 1 e 1 , 2 e 1 , 1 e 0 , 2  y 1 , 2 δ y 1 , 2 P w u =0  z 1 w − u  z 2 u   e 0 , 1 e 1 , 2 e 1 , 1 e 0 , 2  u δ u , for y 1 , 2 from 0 to min( w, z 2 ) . In this wa y we can construct CDs for δ i,j for all pairs among the six Lido caine studies. In Figure 3 the CD for δ 2 , 6 is display ed. These are the tw o studies with the most diﬀerent median conﬁdence estimates for γ (see Figure 1). The CD for this δ indicates that 1 is a lik ely v alue (the 95% conﬁdence in terv al is [0 . 37 , 17 . 81]). Hence the γ s in the Lido caine data do not seem suﬃciently diﬀerent to justify the use of a random eﬀects mo del. 0 5 10 15 20 0.0 0.2 0.4 0.6 0.8 1.0 δ confidence curve 0 1 2 3 4 0.0 0.2 0.4 0.6 0.8 1.0 γ 0 0.0 0.1 0.2 0.3 0.4 0.0 0.2 0.4 0.6 0.8 1.0 κ Figure 3: Three conﬁdence curves are sho wn. In panel 1, for δ b etw een the tw o most diﬀerent Lido caine studies (in red); in panel 2 for γ 0 ; in panel 3 for κ . The 95% conﬁdence interv als can b e read oﬀ, [0 . 37 , 17 . 81] for δ , [1 . 02 , 3 . 00] for γ 0 and [0 , 0 . 072] for κ . The v ariation among the treatment eﬀects from diﬀerent studies can also b e inv estigated in other wa ys. A natural suggestion is to mo del the v ariation of the γ i with a b eta distribution for the probabilities π i from the conditional distribution of Y 1 ,i | ( Z i = z i ), Y 1 ,i | ( Z i = z i , γ i ) ∼ Bin( z i , π i ) with π i = e 1 ,i γ i e 0 ,i + e 1 ,i γ i , where in addition π i ∼ Beta { τ π 0 ,i , τ (1 − π 0 ,i ) } and π 0 ,i = e 1 ,i γ 0 / ( e 0 ,i + e 1 ,i γ 0 ). Letting κ = ( τ + 1) − 1 , we ha ve E( π i ) = π 0 ,i and V ar( π i ) = κπ 0 ,i (1 − π 0 ,i ) . If τ = ∞ , then κ is equal to zero and w e are bac k in the Poisson mo del with a common γ . W e can construct appro ximate conﬁdence curves for γ 0 b y using proﬁle deviance metho ds as describ ed in section 3, cc( γ 0 ) = Γ 1 ( D 1 ( γ 0 )) , D 1 ( γ 0 ) = 2 { ℓ 1 , prof ( ˆ γ 0 ) − ℓ 1 , prof ( γ 0 ) } . F or κ , cc( κ ) = | 1 − 2 P κ ( Q min ≥ Q min , obs ) | is a v alid conﬁdence curve, since it is (b y construction) nearly uniformly distributed at the true parameter v alue. W e use the natural statis tic Q min = min γ 0 k X i =1 { y 1 ,i − z i f i ( γ 0 ) } 2 z i f i ( γ 0 ) { 1 − f i ( γ 0 ) } , where f i ( γ 0 ) = e 1 ,i γ 0 e 0 ,i + e 1 ,i γ 0 , and calculate it for the observed data (ﬁnding Q min , obs ) and for a num b er of y 1 ,i v ectors, simulated under diﬀeren t κ v alues, with ﬁxed z i and ˆ γ 0 . The CD for γ 0 allo ws us to mak e inference ab out the increase in risk related to the the treatment in question. While the CD for κ is used to assess the v ariation in the treatment eﬀect. If the estimate for κ is small and the CD for κ has a signiﬁcan t point mass at zero, then th e v ariation among the treatmen t eﬀect is small and we can choose to use the ﬁxed eﬀect mo del from section 2. Otherwise, the random eﬀect mo del may b e preferable. F or the Lido caine example, the CD estimate for κ is zero and the p oint mass at zero is 0.85 (Figure 3). This supp orts our conclusion for the δ analysis ab ov e, that there is little v ariation among the γ s for the Lido caine studies, and that the ﬁxed eﬀect mo del is the most appropriate. 5. Concluding remarks There are many other asp ects of meta-analysis of 2 × 2 tables (in the CD framework) worth considering. First, w e hav e applied our CD-methods to several other datasets, including a small dataset from Narum et al. (2014), with ﬁve studies inv estigating whether drugs of a certain type (corticosteroids) are related to gastroin testinal bleeding for patients in am bulatory care. These data p oses some c hallenges for man y tra- ditional meta-analysis metho ds b ecause most of the studies hav e zero ev ent entries, either in the treatment group or the control group. Many existing metho ds demand 0.5 corrections to zero even ts in such situations, but these kinds of corrections can giv e large biases and are not recommended (Bradburn et al., 2007). One of the strengths of our metho d is that it can pro duce a combined conﬁdence distribution in this case, without needing to add anything to the data. Secondly , w e ha ve implemented the alternativ e CD based com bination metho d from Liu et al. (2014), and compared it to our metho ds. The t wo classes of methods can yield similar results, but ours yield stronger conﬁdence. Lastly we hav e considered extensions to 2 × 3 tables (which the P oisson model can handle without problems) and alternativ e w a ys to mo del the heterogeneous treatment eﬀects (for example with a P oisson-gamma mo del). References Bradburn, M. J., Deeks, J. J., Berlin, J. A., & Russell Lo calio, A. (2007). Muc h ado ab out nothing: a comparison of the p erformance of meta-analytical metho ds with rare ev en ts. Statistics in Me dicine , 26, 53–77. Breslo w, N. (1981). Odds ratio estimators when the data are sparse. Biometrika , 68, 73–84. Liu, D., Liu, R. Y., & Xie, M. (2014). Exact meta-analysis approac h for discrete data and its application to 2 × 2 tables with rare ev ents. Journal of the Americ an Statistic al Asso ciation , 109, 1450–1465. Narum, S., W estergren, T., & Klemp, M. (2014). Corticosteroids and risk of gastrointestinal bleeding: a systematic review and meta-analysis. BMJ op en , 4. Normand, S.-L. T. (1999). T utorial in biostatistics meta-analysis: formulating, ev aluating, combining, and rep orting. Statistics in Me dicine , 18, 321–359. Sc hw eder, T. & Hjort, N. L. (2002). Conﬁdence and lik eliho o d. Sc andinavian Journal of Statistics , 29, 309–332. Sc hw eder, T. & Hjort, N. L. (2015). Conﬁdenc e, Likeliho o d, Pr ob ability: Infer enc e With Conﬁdenc e Distri- butions . Cambridge Universit y Press. Xie, M. & Singh, K. (2013). Conﬁdence distribution, the frequen tist distribution estimator of a parameter: a review [with discussion and a rejoinder]. International Statistic al R eview , 81, 3–39.

Optimal inference via confidence distributions for two-by-two tables modelled as Poisson pairs: fixed and random effects

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment