Cognitive OFDM network sensing: a free probability approach

1 Cogniti v e OFDM network sensing: a free probability approach Romain Couillet ST -NXP W ireless - Supelec 505 Route des Lucioles 06560 Soph ia Anti polis, Franc e Email: romain.couillet@nxp.com M ´ erouane Debbah Alcatel-Lucent Chair , Supe lec Plateau de Moulon, 3 rue Joliot-Curie 91192 Gif sur Yvette, France Email: merouan e.debba h@supe lec.fr Abstract — In th is paper , a practical power detection scheme fo r OFDM terminals, based on recent fre e probability tools, is proposed. The objective is fo r the receiving terminal to determine the transmission power and the nu mber of the surrounding base stations in the network. Howev er , the system dimensions of the network model turn energ y detection in to an under-determined problem. The focus of this p aper is t hen twof old: (i ) discuss th e maximum amount of in for mation th at an OFDM terminal can gather from the surrounding base stations in the network, (ii) propose a practical solution for blind cell detection using th e free decon volution tool. The efﬁciency of this solution is measured through simulations, which show better performance than the classical p ower detection methods. I . I N T RO D U C T I O N The ever increasing deman d of hig h d ata rate has pushe d system designers to exploit the wireless channel medium to the smallest granular ity . In this respect, the ortho gonal frequ ency division multiplexing (OFDM) m odulation h as been c hosen as the next com mon standard in mo st wireless communication systems, e.g. WiMax [5], 3GPP-L TE [4 ]. OFDM converts a frequen cy selecti ve fading channel in to a set of ﬂat fad- ing chann els [ 20], therefo re pr oviding a high ﬂexibility in terms of p ower an d rate allocatio n. Future wireless networks therefor e tend to be based on highly load ed OFDM cells. Howe ver, in multiple cell en v ironme nts, inte r-cell in terferen ce is still the bottleneck factor whic h con siderably re duces th e network-wide capacity . Cooperation between base stations are envisioned to reach the capacity perfor mance o f the so- called br oa dcast chann el [13], but many problems (essen- tially of power allocation and user scheduling ) prevent those solutions to appe ar soon in practical standar ds. The refore, it is essential fo r mobile termin als to be ab le to d etermine which n eighbo ring cell provides the best quality of serv ice, so th at the termin al quick ly hands over this best perfor mance base station. Classically , only scarce and n arrow-band pilot sequences allow the ter minals to estimate the tra nsmission power o f the main surrou nding base stations, e.g. in 3 GPP- L TE, two sequen ces of the 0 . 7 MHz ba nd are available every 5 ms . Those synchr onization sequences are usually affected by fast channel fading and overlap data fr om other base stations; as a consequen ce numer ous occu rrences of those pilots need be accumulated to achieve a satisfying estimation of the base stations transmission p ower . The classical alterna ti ve to the pilot-aid ed ( also referred to as data-a ided ) p ower detection is to p erform a blind estimation fro m the in coming interferin g signals. This raises the fund amental cognitive radio que stion [2 3], [ 12], wh ich will be a n importan t top ic of th e presen t work: “ho w mu ch informa tion can a cogniti ve recei ver recover from the incoming signals?”. Th e response to th is q uestion answers two clas- sical concerns of eng ineers a nd system designers: (i) is the additional information brou ght by blind detection worth the computatio nal effort?, (ii) i s some gi ven blind detector solution far from providin g all the accessible informatio n?. It is clear in par ticular, from an in formatio n th eoretic v iewpoint, that the information rece i ved on the N OFDM subcarriers mu st ideally not be ﬁltere d in order to pr ovide as much inform ation as p ossible on the problem a t hand, i. e. any ﬁltering pro cess diminishes the a vailable inform ation in the Shannon’ s sense [1]. Theref ore, if as many as L consecutive OFDM symbols are received, the available inf ormation is c ontained in the received N × L matrix Y , with N typ ically large. As a consequen ce, since L cannot b e taken inﬁn itely large, N / L is non tri vial. This leads to the study of large ran dom matrices problem s, which is currently a hot to pic in the wir eless commun ication co mmun ity [ 8]. This is in sharp con trast with classical p ower detection meth ods [10], [11] which are only asymptotically unb iased, i. e. these meth ods assume that on e of th e system d imensions is large with respec t to the others and this co ndition is n ecessary to en sure the c onv ergence of the u nderly ing algorith ms. Our purpo se is to r etrieve r elev an t infor mation on the base station transmission powers. It will be shown herea fter th at, depend ing on th e a priori knowledge of the receiv er , the essential part of the p ower inf ormation is, in most pr actical situations, contained in the eigenvalue distribution of the 2 matrix 1 L YY H . This naturally leads to the consideration of recent r esearch on random matrix theory (RMT) [8] and more speciﬁcally o n fre e deconv olution [1 6]. In particular, in [7], a similar stud y of terminal p ower detection in code division multiple access (CDMA) networks is derived from these to ols. Howe ver, the mo del in [7] only c onsiders ﬂat fading ch annels and dod ges the difﬁculty of multi-p ath chann els; m oreover the structure of the CDMA encod ing matrix allows to easily recover the tr ansmitted signal variances, which is not the case of m ulti-cell OFDM in which m ultiple streams overlap with no dedicated code to separate them. W e p ropo se here ﬁrst to discuss the o ptimal amou nt of informa tion which th e receiver can extract from the incoming data to blind ly retrieve the values of the powers transmitted b y all surround ing base stations, wh en the rece i ver’ s prior state of knowledge about the environment is very limited. From th is analysis, it will be observed th at the general problem is not very tractab le b oth in term s o f mathematical derivation an d therefor e in ter ms of practical implem entation. Seco ndly , we propo se a suboptimal but im plementable approa ch to so lve the cell detection prob lem, ba sed o n the free pr obability framework, fo r which we derive a novel sig nal detectio n algorithm for OFDM. The re mainder of the pape r is structu red as fo llows: In Section II, we in troduce the mod el of the mu ltiple cell OFDM network. I n Section III, we discuss the amount of infor mation about tran smitted p owers which can be collected by the terminal from the received matrix Y . The ob servation that all the n ecessary inform ation is contained in the eig en values of 1 L YY H leads to Section I V, in which we evaluate the classical energy detection meth ods, which assume L/ N → ∞ . In Sectio n V, we intr oduce some basic concep ts of random matrix theo ry [8], which a re n eeded to the u nderstand ing o f the subsequent sections. In Sec tion VI, we provide a novel algorithm to detect th e cell transmission powers. Simu lation results are th en provided in Section VII. A d iscussion on th e gains an d limitations of this n ovel meth od is carried out in Section VIII. Finally , in Section I X we draw our conclusion s. Notations: In the following, bo ldface lower case sym bols represent vector s, capital bold face characters den ote matrices ( I N is the size- N identity ma trix). The spaces M ( A , i, j ) and M ( A , i ) are the sets of i × j and i × i matr ices over the algebra A , respectively . The transpose and Herm itian transpose operator s are den oted ( · ) T and ( · ) H , respectively . The oper ator diag( x ) tur ns the v ector x in to a diagonal matrix. The function 1 ( · ) denotes th e indicator function . I I . D O W N L I N K M O D E L Consider a network of N B base stations and on e terminal equippe d with a single receiving antenn a. Th is scena rio is depicted in Figure 1. Assume m oreover th at the receiver is already connected to o ne serving base station , which we purpo sely exclude fr om the set of the N B surroun ding base stations since we already kn ow all informa tion abou t it. The network is assumed to be sync hronized in time and fr equency and to u se OFDM m odulation with a size N discr ete Fourier transform (DFT). Denote then M the m aximum number of P S f r a g r e p l a c e m e n t s s ( l ) 1 s ( l ) 2 s ( l ) 3 s ( l ) M Fig. 1. System Model base stations the receiver expects to de tect. Ideally N B ≤ M . In the following, for simpliﬁcation, we assume that this condition is always fulﬁlled and we will only co nsider the parameter M , consider ing then a n etwork of M base stations, of which some co uld be of n ull power . Th e lin k between the terminal and th e base station k is modelled as a fast-fading complex f requen cy domain chann el h k = [ h k 1 . . . h kN ] T ∈ C N , coupled to a slow-fading path loss L k = 1 /P k with P k the mean receiv ed p ower origin ating from base station k . Th e terminal also receives additi ve white Gau ssian noise σ n ∈ C N with e ntries of variance σ 2 . T he base station k sends at time l the frequ ency-dom ain OFDM symb ol s ( l ) k = [ s ( l ) 1 k , . . . , s ( l ) N k ] T . Therefo re, the recei ved signal vector y ( l ) = [ y ( l ) 1 , . . . , y ( l ) N ] T at time l read s y ( l ) = M − 1 X k =0 P 1 2 k D k s ( l ) k + σ n ( l ) (1) with D k = dia g ( h k ) . This sum mation over the M cells can be rewritten y ( l ) = HP 1 2 θ ( l ) + σ n ( l ) (2) with θ ( l ) ∈ C M N the concatenated vector θ ( l ) = [ s ( l ) 1 , . . . , s ( l ) M ] T , H ∈ M ( C , N , M N ) the conc atenated m atrix of the D k , k ∈ { 1 , . . . , M } H =    h 11 · · · 0 · · · h M 1 · · · 0 . . . . . . . . . · · · . . . . . . . . . 0 · · · h 2 N · · · 0 · · · h M N    (3) and P ∈ M ( R , N ) th e diagon al matrix P =       P 1 0 · · · 0 0 P 2 . . . . . . . . . . . . . . . 0 0 · · · 0 P M       ⊗ I N (4) Now assum e th at the M ch annels ha ve a coher ence tim e of order (or more than) L times the OFDM symbol duration. The L samples y ( l ) , l = 1 , . . . , L , c an be concatenated into an N × L matrix Y = [ y ( 1 ) · · · y ( L ) ] to lead to the more general matrix expression of the r eceiv ed sign al Y Y = HP 1 2 Θ + σ N (5) where Θ ∈ M ( C , M N , L ) and N ∈ M ( C , N , L ) are the con- catenation matrices of th e L vectors θ ( l ) and n ( l ) , respectively . 3 I I I . P RO B L E M S TA T E M E N T The power detection pro blem consists in th e pre sent situa- tion to retrie ve the en tries of th e ma trix P fr om the rece iv e matrix Y . T he prior information at the receiver is no t s ufﬁcient to ﬁnd P in a straigh tforward manner . Inde ed, the chann el matrix H and the tr ansmitted signal Θ are un known and, worse, e ven th eir stochastic distribution are usually unkn own; in p articular, rec ent ﬂexible multiple acce ss OFDM standar ds adapt their transmission rates to the channel quality so that the termina l ca nnot a priori assume to receive eithe r QPSK, 16 -QAM, 6 4 -QAM or any other type of modu lation. The a priori kn owledge I at the terminal is therefore limited to: ( i) the appr oximated back groun d covariance E[ nn H ] = σ 2 I , (ii) the fast-fading chan nel p ower E[ h H k h k ] = 1 , (iii) th e chan nel delay spread known to be lesser than th e cyclic preﬁx len gth, (iv) the transmitted signal covariance E[ θ H k θ k ] = I N . From this amou nt of prio r inform ation I , the most reli- able chan nel model 1 is obtain ed from the ma ximum entropy principle [14]. The latter states that the transmitted data Θ must be mode lled as a Ga ussian indep endent an d identically distributed process (i.i.d. ). As fo r the sh ort-term chann els h k , giv en a delay spread τ d (counted in integer n umber of time- domain samples), the time- domain representatio n of h k must be mo delled as a Gau ssian i.i.d. vector of length τ d ; ther efore, h k is to b e modelled as a Gaussian vector with cov ariance matrix the DFT of I τ d . Since little information about τ d is initially kn own to the receiver , the chan nels must be modelled as the marginal distribution of those Ga ussian p rocesses with τ d varying from 1 to the cyclic preﬁx length. Of course, this mo del migh t be very different fr om r eality and mig ht provide totally wrong results, as lon gly discussed in [1 5]. Howev er , this is the best on e can b lindly infer on the transmission scheme from the av ailable informa tion. The objective now is to determine what is the proba bility p ( P | I ) that a sequence of transmitted powers { P 1 , . . . , P M } ﬁts the previous model kn owing I . From tho se probabilities, computed for all vectors in ( R + ) M , an estimate ˆ P of P can be designed which minimizes some erro r measure, e .g. ˆ P = E[ P | Y , I ] would be the min imum mean square erro r estimate of P . The prob ability p ( P | Y , I ) assigned to the info rmation ( P | Y , I ) can be written, thanks to Bayes’ rule p ( P | Y , I ) = p ( Y | P , I ) · p ( P | I ) p ( Y | I ) (6) in which ( P | I ) is th e a priori knowledge a bout P . It is classically assign ed a u niform d istribution over so me sub- space [0 , P max ] M for a m aximum rec eiv e power P max . As fo r p ( Y | P , I ) , it can b e expanded as p ( Y | P , I ) = Z Θ , H p ( Y | P , H , Θ , I ) p ( H | I ) p ( Θ | I )d H d Θ (7) in which all integrands are known from the max imum entropy model aforem entioned , 1 by “most reliable model”, we mean the model which satisﬁes the con- straints imposed by I and which is the most noncommittal regardi ng unknown system informati on. • p ( Θ | I ) is standard mu ltiv ariate i.i.d. Gaussian. • the compou nd chann el H is assigned a distribution p ( H | I ) = τ max X k =1 p ( H | τ d , I ) · p ( τ d | I ) (8) with τ d the channel delay spread, τ max the cyclic pr eﬁx length, p ( τ d | I ) = 1 /τ max and p ( H | τ d , I ) with stan dard i.i.d. Gaussian diagonals. Howe ver, th e explicit com putation of ( 7) is very in volved and req uires advanced tools from ran dom matrix theor y . A similar calculus was p erform ed by the autho rs f or the simpler single-cell MIMO energy detector [31]. In the latter it was shown that, surprisingly , the standard i.i.d . Gau ssian model assigned to the main system pa rameters m akes the energy detection depen d only on the eig en value distribution of the receive ma trix Y . The multi-cell detection problem at han d is very similar in conﬁgu ration, a part for the chan nel marginal- ization of equa tion (8) whic h is not i.i.d. Gaussian. Since we cannot provide an optimal info rmation theoretical solu tion to our problem and since both aforem entioned pr oblems are very similar , it seems relev an t to concentrate on the close-to-o ptimal random m atrix theoretical appro ach. Some important infor mation can noneth eless be alre ady deduced from the integral form of equation (7). If the tran s- mission ch annels are extrem ely freq uency ﬂat, i.e . for all k , h k 1 ≃ h k 2 . . . ≃ h kN , then { HP 1 2 Θ } ij = P k √ P k h k θ kj . Therefo re, even if the realizations o f N a nd Θ wer e perfectly known, o ne will have access at best to the variables √ P k h k , k = 1 , . . . , M , from which no reliable estimation of P k be drawn; in such a situation, the p osterior probab ility p ( P | Y , I ) is very b road and is m aximized on a large continuo us set of P 1 , . . . , P M . On an inf ormation theoretical viewpoint, this means th at the o ptimal in ference on P given Y and I cannot lead to any valuable information . In the random m atrix approa ch, the situatio n is even worse. If on e knew perf ectly the entries of HPH H , then nothin g at all can b e said ab out P 1 , . . . , P M . Indeed , { HPH H } ij = P k P k | h k | 2 (see Section VI fo r details) and the o nly piec e of informa tion which on e has about P 1 , . . . , P M is the sum P k P k | h k | 2 ; the l atter cannot lead to any estimate o f P when M > 1 an d the problem cannot be solved. This means that, given the limited prior informa tion of the terminal, it is impossible to come up with a reliable estimate of P when the channe ls are f requen cy ﬂat. In the rem ainder o f this paper , we shall theref ore consider that the OFDM ch annels are very freq uency selecti ve 2 . In the following, we investigate the classical power detection techniqu es, which shall prove inefﬁcient in this large n on-trivial matrix problem . I V . C L A S S I C A L P O W E R D E T E C T I O N Usual p ower detection conside rs the second o rder statistics of th e received signals. I n th e scalar case, i.e. y ( l ) reduces to a single value y ( l ) , it was proved [10] that the o ptimal detector with the aforemention ed state of knowledge at the terminal 2 note that this assumption ensures high efﬁci ency of the network in terms of per -user outage capa city , which is very desirab le in the current trend for pack et-switch ed communications. 4 consists in evaluating 1 L ([ y (1) , . . . , y ( L ) ][ y (1) , . . . , y ( L ) ] H ) − σ 2 , with L > > 1 . W e show in what follows that this classical scheme can be simply extended to our network situation but that it is very in efﬁcient for small L/ N ratios. Assuming that L/ N is very large, the expression of the normalized Gram matrix associated to Y reads YY H L − → L →∞ E  1 L ( HP 1 2 Θ + σ N )( HP 1 2 Θ + σ N ) H  − → L →∞ HP 1 2 E  ΘΘ H L  P 1 2 H H + HP 1 2 L E h ΘN H i + 1 L E h NΘ H i P 1 2 H + E  1 L NN H  − → L →∞ HPH H + σ 2 I N (9) the last line comes from th e f act that, N being ﬁnite, the N × N matrices 1 L NN H and 1 L ΘΘ H conv erge in distribution to an identity matrix, and the cross produ cts to null matrices. As a consequen ce, as will be de tailed in Section VI-B, one can estimate the values P k , k ∈ { 1 , . . . , M } from the moments { ( 1 L YY H − σ 2 I N ) k } , k ∈ { 1 , . . . , M } , wh en L is large co mpared with N . Our situa tion does not fall into this asymptotic L / N → ∞ co ntext. In present a nd f uture OFDM techn ologies, the number N of av ailable subcarr iers is large, e.g . of ord er a thousand subcarr iers, while L is limited in o ur model to the channel coh erence tim e or in general to the number of OFDM symbols the terminal is willing to memorize b efore treatin g informa tion. The refore, even if N and L ar e large, their ratio N / L is not in general close to zero. The fund amental asymptotic assumption is therefor e no long er satisﬁed. W e show in T able I that the non -trivial ratio N/ L im pairs signiﬁcantly th e p erform ance of the classical power detection. The latter is the result of a simulation in which we applied the algo rithm that will be described in Section VI-B, based on th e sample mo ments o f 1 L ( YY H − σ 2 I N ) ( instead o f the sample mom ents of 1 L HPH H , whose cor respond ing results are shown between b rackets). Th e typical situation consid ered in this examp le is a thre e-base station scenario o f respective powers P 1 = 4 , P 2 = 2 an d P 3 = 1 an d a n oise lev el σ 2 = 0 . 1 , i.e. SNR = 10 dB , N = 2 56 and L is taken in a range from 256 to 3 2 , 7 68 . It turns out indeed that L need s to be large for this method to be satisfyin g. In this pr ecise example, this compels L/ N to be of o rder 6 4 , which is not acceptab le in our current system settings. Such problems inv olv ing large matrices with no n-trivial N /L r atios a re at the hear t o f a re cent ﬁeld of research , known as rando m matrix theory (RMT), wh ich is a particular case of the more gen eral free pro bability theory intr oduced by V oiculescu [2 2]. In the subsequen t section, we pr ovide a quick introdu ction to impo rtant notions o f RMT which ar e necessary to handle the rest of the multiple cell detection study . V . R M T A N D F R E E D E C O N VO L U T I O N A. Ra ndom ma trix theory Deﬁnition 1: A ran dom matrix is a multi-variate ran - dom v ariable X = { X 11 , X 12 , . . . , X M N } f or a given N = 256 , P = { P 1 , P 2 , P 3 } = { 4 , 2 , 1 } L Estimated ˜ P [our algorithm] k P − ˜ P k 2 256 { 7 . 93 , 1 . 62 , − 2 . 5 } [ { 4 . 28 , 1 . 35 , 1 . 35 } ] 27 . 63 512 { 5 . 82 , 2 . 90 , − 1 . 7 } [ { 3 . 80 , 2 . 16 , 1 . 00 } ] 11 . 42 1024 { 4 . 26 , 3 . 60 , − 0 . 8 } [ { 3 . 62 , 2 . 37 , 0 . 94 } ] 5 . 87 2048 { 4 . 52 , 2 . 69 , − 0 . 2 } [ { 4 . 22 , 1 . 55 , 1 . 12 } ] 1 . 77 4096 { 4 . 20 , 2 . 65 , 0 . 18 } [ { 4 . 09 , 2 . 06 , 0 . 78 } ] 1 . 41 8192 { 4 . 10 , 2 . 28 , 0 . 58 } [ { 4 . 05 , 1 . 89 , 0 . 92 } ] 0 . 27 16384 { 3 . 97 , 2 . 42 , 0 . 89 } [ { 3 . 95 , 2 . 24 , 0 . 99 } ] 0 . 19 32768 { 4 . 07 , 1 . 95 , 0 . 99 } [ { 4 . 03 , 1 . 95 , 0 . 98 } ] 0 . 01 T ABL E I C L A S S I C A L M O M E N T - B A S E D M E T H O D ( M , N ) ∈ N 2 couple. As such, X is a m atrix who se entries X ij ∈ C M × N are ruled b y a joint pro bability distribution p ( X 11 , X 12 , . . . , X M N ) . Free pr obability is the study of rando m variables in non- commutative algeb ras, i.e. algebras in which the prod uct operation is non-c ommutative. The algeb ra of large He rmitian random ma trices is a particu lar case of those n on-com mutative algebras. In the f ollowing, we shall qualify fr ee any notion attached to the free prob ability (or RMT) framework while we shall q ualify c lassical any notion attached to th e cla ssical probab ility framework of commuta ti ve algebras. Similarly to the classical theory of pro bability , a free ex- pectation fun ctional φ can be d eﬁned. For a given Her mitian random m atrix X , the free expectation read s φ ( X ) = lim N →∞ E [tr N X ] (10) and we can similarly deﬁne free moments m k , k ∈ N of a random m atrix. Those are m k = φ  X k  (11) Thanks to the trace pro perties, note that the free m oments are strongly linked to the eig en values λ i , i ∈ { 1 , N } of X , since m k can also be written m k = lim N →∞ 1 N N X i =1 λ k i (12) Indeed , denote X = Q Λ ΛQ Λ H with Λ = diag( { λ 1 , . . . , λ N } ) an d Q Λ unitary , X k = Q Λ Λ k Q Λ H . T aking the trace of X k leads to equation (12). The asy mptotic ( N , L → ∞ with N /L constant) marginal distribution o f the eigen values of X is called the empirical distribution of X and will b e de noted µ X . Its associated cumulative d istribution fun ction F X reads [8] F X ( λ ) = lim n →∞ 1 N N X i =1 1 ( λ i ≤ λ ) (13) Therefo re the free m oments are directly linked to the empirical d istribution of the matrix X , m k = lim N →∞ Z R + λ k µ X ( λ )d λ (14) which is the classical deﬁnitio n of mo ments associated to the distribution µ X . 5 0 0 . 5 1 1 . 5 2 2 . 5 3 0 0 . 2 0 . 4 0 . 6 0 . 8 1 1 . 2 x Densit y µ η c ( x ) c = 0 . 1 c = 0 . 2 c = 0 . 5 Fig. 2. Marchenk o-Pastur la w µ η c Interestingly , for mo st usua l 3 random ma trices A of large dimensions N , L → ∞ with N/L = c constant, the eigenv alu e density of X = 1 L AA H conv erges to a deﬁnite distribution. For instance, in our cu rrent p roblem, since the inpu t signals Θ and noise N ar e modelled as stand ard i. i.d. Gaussian, w e are interested in the so-called W ishart matrice s that we deﬁne hereafter Deﬁnition 2: An N × N ran dom matrix X = AA H , with A a rand om N × L matrix whose co lumns are zero mean Gaussian vectors with covariance matr ix Σ , is called a generalized Wis hart matrix of L degre es of freedom. This is denoted X ∼ W N ( L, Σ) (15) W ishar t matrices W N ( L, I N ) are known to h av e an e igen- value distribution which conv erges, whe n ( N , L ) grows to in- ﬁnity with a constant ra tio c = N /L , tow ards the Marchenko- Pastur law µ η c [8]. T he Marchenko-Pastur law is deﬁned by µ η c =  1 − 1 c  + δ ( x ) + p ( x − a ) + ( b − x ) + 2 π cx (16) with ( a, b ) =  (1 − √ c ) 2 , (1 + √ c ) 2  . In Figure 2 we p rovide the distribution of µ η c for d ifferent values of c . Note that when c tends to 0 , i.e. when L/ N → ∞ , the Marchenko- Pastur law con verges to a sing le Dirac in 1 and we recover the classical law o f large num bers. Equiv alently to classical probab ility theory , many results of free pro bability in volve the distribution of sum , difference, produ ct and inverse of ran dom matr ices. The ch aracteristic function of a distribution, used to de riv e the distribution o f the sum of independen t co mmutative ran dom variables, has a free cou nterpar t called the R -Transform. The Mellin trans- form, used to derive the p roduct of indepen dent co mmutative random variables, also has a fre e counterp art, kn own as th e 3 by usual, we qualify matrices found in common wirel ess communication problems. S -Transform 4 . Giv en two large rand om Hermitian m atrices A an d B , whose sum is C = A + B and whose prod uct is D = AB , one can then deri ve the empirical distributions of C and D from th e empirical distributions of A and B , w hich we d enote µ C = µ A ⊞ µ B (17) µ D = µ A ⊠ µ B (18) Equation (17) is called a dditive free con volution and equation (18) is called multip licativ e free conv olution . Similarly , giv en only the d istributions of B , C and D , one can recover the distribution of A µ A = µ C ⊟ µ B (19) µ A = µ D  µ B (20) in which equation (19) is called additive free dec onv olu tion and equation (20) is called multiplicative free d econv olution. B. F ree deco n volution for informatio n plus n oise mo del Our interest is to trea t a particu lar comm unication model, known as the information plus noise model , Deﬁnition 3: Given two N × L ( N / L → c ) large r andom matrices R (standing for an informative signal) and X (stand - ing for a n oise ad ditive signal) an d a scalar σ (the standa rd deviation of the noise proc ess), the mo del given by W = 1 L ( R + σ X )( R + σ X ) H (21) is called the information plus noise model. W e shall therefore call W an informa tion plu s noise matrix. It has been re cently shown [7] that the empirical distribution of 1 L RR H in the pr evious deﬁnition can be recovered from the empirical distributions of the matrices W an d XX H when X is Gaussian with i.i.d. en tries of zero m ean and v ariance 1 /L . This is giv en by µ 1 L RR H = (( µ W  µ η c ) ⊟ δ σ 2 ) ⊠ µ η c (22) For more details about the demonstration of formula (22), refer to [ 16]. Thanks to th is RMT fr amew ork and th e f ree con volution tools, we can now address ou r multiple cell detectio n problem. V I . A P P L I C A T I O N O F F R E E D E C O N VO L U T I O N T O M U LT I P L E C E L L D E T E C T I O N A. Sig nal an d noise deconvolution In the m odel (5), th e N × N matrix 1 L YY H is an in- formation plu s noise matrix with N a Gaussian ran dom matrix. Th erefore, a ccording to eq uation (2 2), when N , L are sufﬁciently large, one ca n derive th e empir ical distribution 4 note that independe nce in the classical probability s ense is not enough in free probabilit y to deriv e the empiric al distribu tion of the sum , dif ference , product and in verse product of two random matri ces. This independe nce notion is ext ended to the asymptotic fr eeness concept [8] w hich is not a techni cal issue in our study . 6 µ 1 L HP 1 2 ΘΘ H P 1 2 H H from the emp irical d istribution µ 1 L YY H as follows µ 1 L HP 1 2 ΘΘ H P 1 2 H H =  ( µ 1 L YY H  µ η c ) ⊟ δ σ 2  ⊠ µ η c (23) where c = N / L , for N is of size N × L . Also, the m atrix Θ in equ ation (5) is modelled as standard i.i.d. Gaussian. Therefo re 1 L P 1 2 H H HP 1 2 ΘΘ H is a g eneralized W ishart matrix with covariance P 1 2 H H HP 1 2 . An alogou sly to (15), this can be written 1 L P 1 2 H H HP 1 2 ΘΘ H ∼ W N ( L, P 1 2 H H HP 1 2 ) (24 ) Then, the emp irical distribution of the covariance matrix P 1 2 H H HP 1 2 of the W ishart matrix 1 L P 1 2 H H HP 1 2 ΘΘ H can be recovered from the empirical distribution µ 1 L P 1 2 H H HP 1 2 ΘΘ H when the couple ( N , L ) tend s to inﬁnity with a constant r atio c ′ = M N /L ( M is constant). Consequently , similarly to (20), µ P 1 2 H H HP 1 2 = µ 1 L P 1 2 H H HP 1 2 ΘΘ H  µ η c ′ (25) The left side of equation (23) is slightly different fr om the desired expression in the rig ht side of equation (25). Still, thanks to the trace comm utativity p roperty , we have the link [8] µ 1 L P 1 2 H H HP 1 2 ΘΘ H = 1 M µ 1 L HP 1 2 ΘΘ H P 1 2 H H +  1 − 1 M  δ 0 (26) This relation is due to the fact that the positive eigen- values o f 1 L HP 1 2 ΘΘ H P 1 2 H H are the same as those of 1 L P 1 2 H H HP 1 2 ΘΘ H (since their traces are identical and then all their m oments match). But th e rank o f both matrices differ an d then a cer tain amoun t of null eigenv alues mu st be intr oduced . Here, 1 L HP 1 2 ΘΘ H P 1 2 H H is o f full r ank ( N ) while 1 L P 1 2 H H HP 1 2 ΘΘ H is on ly o f rank N fo r a matrix size M N , h ence the M factor in eq uation ( 26). Finally , we similarly c onnect the left side of eq uation (25) to µ HPH H throug h µ P 1 2 H H HP 1 2 = 1 M µ HPH H +  1 − 1 M  δ 0 (27) The emp irical distribution of HPH H was then der iv ed from the em pirical distribution of 1 L YY H . As a con sequence, the free moments d k = E[tr N ( HPH H ) k ] can be retrieved from the free momen ts m k = E[tr N ( 1 L YY H ) k ] . Surp risingly , it is shown in [16] that fo r all aforemen tioned fre e conv o lution operation s, the set o f the ﬁrst k momen ts o f the [de]co n volved distributions can be re covered fro m the set of the k ﬁrst moments of the operand s. This sub stantially reduces the com- putational effort, as is describe d in the following. Let us work in d etail all the steps to derive the mo ments d k from the momen ts m k . 1) ﬁrst, the noise contribution to the signal Y is dec on- volved thanks to formula (23). The m ultiplicative conv o - lution (resp. d econv o lution) µ ( out ) of a distribution µ ( in ) and the March enko-Pastur law µ η c can be comp uted from all the m oments m ( in ) k . It is shown in [ 7] tha t c · m ( out ) k can be compu ted from th e moments/cu mulants transform (resp . cumu lants/moments transfor m) [8] o f the coefﬁcients c · m ( in ) k , with c = N /L . As fo r additive conv olu tion (resp. deco n volution) µ ( add ) of two distribu- tions µ ( a ) and µ ( b ) , the f ree cumulan ts of µ ( add ) are the sum ( resp. difference) of th e cumulants o f µ ( a ) and µ ( b ) . From (23), the mo ments m ′ k of µ 1 L HP 1 2 ΘΘ H P 1 2 H H are obtained from the mo ments m k of µ 1 L YY H ; th erefore, in m athematical terms, this reads ( m ′ 1 , . . . , m ′ M ) = S 1 ( m 1 , . . . , m M , σ 2 ) (28) with S 1 = 1 c M  c · M  C  1 c C ( cm 1 , . . . , cm M )  − C  σ 2 , . . . , σ 2 M  (29) where the functio ns M ( · ) and C ( · ) stand respectively for the cumu lants/moments transform and the mo- ments/cumulan ts tr ansform th at b oth take as argument a vecto r of size k (the ﬁr st k mo ments or cumulants, respectively) a nd outpu t a size k vector (th e ﬁr st k cumulants or moments, respectively). Th e inner 1 c C ( c · ) operation multiplicatively deco n volve the Marc henko- Pastur law µ η c . T hen th e next C application tu rns the resulting mo ments into cumulants. T he ad ditiv e decon - volution o f δ σ 2 is then p erform ed thro ugh the cum ulant difference and the outp ut is turned back into the mo ment space throu gh the M app lication. The outer 1 c M ( c · ) is ﬁnally perfo rmed to m ultiplicatively c onv olve the result with µ η c . 2) the mo ments m ′′ k of µ 1 L P 1 2 H H HP 1 2 ΘΘ H are then gi ven throug h eq uation (26) by a simple scaling of the mo - ments m ′ k by 1 / M . This reads ( m ′′ 1 , . . . , m ′′ M ) = 1 M ( m ′ 1 , . . . , m ′ M ) (30) 3) the Marchenko-Pastur law µ η c ′ , c ′ = M N /L , is then deconv olved fr om µ 1 L P 1 2 H H HP 1 2 ΘΘ H accordin g to equation (25). This leads then to the mo ments m ′′′ k of µ P 1 2 H H HP 1 2 , ( m ′′′ 1 , . . . , m ′′′ M ) = 1 c ′ C ( c ′ d ′ 1 , . . . , c ′ d ′ M ) (31 ) 4) ﬁnally , th e resulting momen ts m ′′′ k are scaled by M to obtain th e d k coefﬁcients, ( d 1 , . . . , d M ) = M ( m ′′′ 1 , . . . , m ′′′ M ) (32) Figure 3 summarizes these steps. Our ﬁnal interest th ough is to ﬁnd the d iagonal values of P . The d istribution of the cha nnel matrix H was mo delled in Section II by a m ixture of corr elated Gaussian sub channels. It is difﬁcult, at this p oint o f kn owledge in free probability theory , to deconv olve the ef fect of H from the random ma trix HPH H . Only classical meth ods can help in this situation . Remarkably , it tu rns out that the matrix HPH H is diagonal HPH H =    P k P k | h k 1 | 2 · · · 0 . . . . . . . . . 0 · · · P k P k | h kN | 2    (33) 7 For k ∈ { 1 , . . . , M } , c omputatio n of m k = E h tr L ( YY H ) k i Deconv o lution of the additive noise ( m ′ 1 , . . . , m ′ M ) = 1 M S 1 ( { m k } , σ 2 ) Retriev al o f th e d k ’ s ( d 1 , . . . , d M ) = M c ′ C ( c ′ m ′ 1 , . . . , c ′ m ′ M ) Fig. 3. m k to d k Block Diagram Therefo re the theoretical mome nts d k of µ HPH H are th e normalized traces of th e asympto tic dia gonal m atrices of entries, n ( HPH H ) k o ij = M X k =1 P k | h ki | 2 ! k δ j i (34) and then the p th order mom ent d p = E[tr N ( HPH H ) p ] of HPH H can then be equated for large N to d p = lim N →∞ 1 N N X j =1 M X k =1 P k | h kj | 2 ! p (35) In the f ollowing we provide a meth od to estimate the entries P k under the assump tion, which is nev er veriﬁed in practice, that the channels a re extremely frequ ency selecti ve, i.e. such that, fo r any k and any coup le j 6 = j ′ , the entries h k,j and h k,j ′ are independ ent. B. Estimation of th e p owers P k As alre ady co ncluded in Section III, if the channel delay spreads are very short, then the channel frequen cy resp onses { h k 1 , . . . , h kN } , k ∈ { 1 , . . . , M } , ar e stro ngly correlated and almost nothin g can be ded uced o n the entries of P . On the contrary , assume that the chann el delay spread is very large, and rewrite equatio n (35) as d p = ¯ d p + w p (36) = 1 N N X j =1 M X k =1 P k | h kj | 2 ! p + w p (37) with ¯ d p the p th sample order mome nt ( N < ∞ ) and w p some noise pro cess. The latter conver ges to 0 when N → ∞ and the channel frequ ency response is i.i.d. Gaussian. Howe ver, w hen N is ﬁnite or when the channel is less frequency selective, then w p is a bit more difﬁcult to hand le. T o push th e computatio n for ward, we need ﬁrst to de riv e the classical order p moment m ( h ) p of the v ariables | h kj | 2 , for any co uple ( k , j ) , in the complex ca se. This g iv es m ( h ) p = E [ | h kj | 2 p ] = 1 2 2 p p X i =0 C i p (2 i )!(2[ p − i ])! i !( p − 1)! (38) One can th en derive the g eneral expression for d p as d p = p ! 2 2 p X k 1 ,...,k M P i k i = p M Y i =1 ( k i X k =0 (2 k )!(2[ k i − k ])! ( k !) 2 ([ k i − k ]!) 2 ) P k i i (39) The complete deriv atio n of fo rmula (39) is provided in the append ix. 1) Bayesian a ppr o ach: Let K be some integer greater than or equa l to M . The no ise process w = [ w 1 , . . . , w K ] T , as already men tioned, is in gen eral difﬁcult to analyze. W e shall consider in the following that one actually has a limited knowledge abou t w which reduc es to the covariance matrix C = E[ ww T ] gathered from p revious simulations 5 . Now , consider that th e set of prior informatio n I con tains the following elements: • the values o f P 1 , . . . , P M are real p ositiv e an d k nown not to b e larger than some value P max . • the typ ical err or variance C in the observed mom ents { d k } , k ∈ { 1 , . . . , M } , is k nown. In fact, th e exact covariance matrix C canno t be known since its computatio n req uires the exact k nowledge of P . Indeed , a few lines of calculu s of C = E h  d − ¯ d  H  d − ¯ d  i (40) with ¯ d = [ ¯ d 1 , . . . , ¯ d K ] T , lead to equ ation (41), which depends on P 1 , . . . , P M . Howe ver, let us ﬁrst co nsider C known before we intro duce alternative solutions wh en C is u nknown. The objective is to infer on the set { P 1 , . . . , P M } giv en I and the ob served sample moments ¯ d . An err or measu re must be con sidered to come up with an estimate o f { P 1 , . . . , P M } . W e co nsider here th e estimate of P which minimizes th e mean quadra tic err or (MMSE). Th is MMSE estimate ˜ P is given by ˜ P = E  P | ¯ d  (42) = Z P P p ( P | ¯ d , I )d P (43) = Z P P p ( ¯ d | P , I ) p ( P | I ) R P p ( ¯ d | P , I ) p ( P | I )d P d P (44) Since the prio r information ( P | I ) is limited to the fact that all entries are upper-boun ded b y P max , p ( P | I ) sh ould be set unifor m on the space [0 , P max ] M accordin g to the max imum entropy principle. However , n ote that if { ˜ P 1 , . . . , ˜ P M } mini- mizes the M MSE then also does any p ermutation of this set. Therefo re, to have a cor rectly d eﬁned problem (with a un ique solution), the set { P 1 , . . . , P M } must be order ed; we will then state in the following that P 1 ≤ P 2 ≤ P M . Ther efore the p rior p ( P k | I ) is taken un iformly on the set [0 , P k − 1 ] when P k − 1 is set, wh ich leads to p ( P | I ) = P max Q M − 1 i =1 P − 1 i . A lso, since only the error cov ariance matrix C in the observed sample moments ¯ d is known, the maxim um entropy princ iple requests that the p rocess w is assigned a Gau ssian distrib ution with 5 honesty woul d require tha t we actu ally deriv e the maximum ent ropy distrib ution of w but this would lead to in volv ed computation. 8 C a,b = − ¯ d a ¯ d b + X k 1 ,...,k M k ′ 1 ,...,k ′ M P i k i = a P j k j = b                  ( N − 1 ) a ! b ! N Y i,j 1 ≤ i ≤ a 1 ≤ j ≤ b P k i k i P k ′ j k ′ j m ( h ) k i m ( h ) k ′ j k i ! k ′ j ! + a ! b ! N Y i,j 1 ≤ i ≤ a 1 ≤ j ≤ b k i = k ′ j P 2 k i k i m ( h ) 2 k i ( k i !) 2 × Y i,j 1 ≤ i ≤ a 1 ≤ j ≤ b k i 6 = k ′ j P k i k i P k ′ j k ′ j m ( h ) k i m ( h ) k i k i ! k ′ j !                  (41) variance C . Theref ore, equ ation (4 4) bec omes ˜ P = Z P 1 ≤ ... ≤ P M P e − w ( P ) H C − 1 w ( P ) R P 1 ≤ ... ≤ P M e − w ( P ) H C − 1 w ( P ) d P d P (4 5) where we denoted w = w ( P ) to remind the actual dependence of w in th e p owers P 1 , . . . , P M (throu gh the expression of d in equation (39)). Unfortu nately , the in tegration space of equ ation ( 45) m akes both integrals rather in volved to com pute. A way to prac tically computed ˜ P co nsists in turning the integrals into ﬁnite sums over thin sliced versions of the integration space. Also, as previously m entioned, the covariance matrix C is obviously unknown when trying to decipher the cell powers P 1 , . . . , P M . Howe ver, iterative m ethods can be con sidered in which C is initially deﬁned as th e cov ariance matrix C init of an hyp othetic set o f p owers, say P 1 = P 2 = . . . = P M = P max / 2 . Then, runnin g the MMSE estimator with C init returns a ﬁrst set P (1) 1 , . . . , P (1) M from which a r eﬁned version of C can be ev alu ated ( from formu la (4 1)). Note that, in runn ing k instances o f the algor ithm, the sample mo ments ˜ d p can b e accumulated a nd the covariance matrix C has to be compute d as if as many as k N subcar riers we re actually used in the free deconv olution algorith m. This proc ess can be processed in a loop for a satisfying number of iterations. 2) Altern ative estimators: Other estimator s th an MMSE, such as m aximum- likelihood (ML), migh t b e conside red which take as an estimate the set P which minimizes w ( P ) H C − 1 w ( P ) . Howe ver , the measure associate d to the ML estimator does not suit the broa d a posteriori d istribution p ( P | ¯ d , I ) as will b e shown in simulation s. Indeed , a large estimation error is a s bad as a small estimation erro r in the ML context; theref ore, when the posterior p ( P | d , I ) is not peaky , large estimatio n errors are expected. T he MMSE estimator is, in this scenario, more appro priate. A zero-fo rcing method can also b e de riv ed. Fro m equ ation (39), if no thing wer e known abou t th e noise pro cess w p , one might naively consider solvin g th e system of M e quations (39), with p = 1 , . . . , M in the M un knowns P 1 , . . . , P M , in which d p is set equ al to the ob served ˜ d p . This can be solved by tur ning this system of equ ations into the eq uiv a lent system M X k =1 P k = Q 1 ( d 1 ) M X k =1 P 2 k = Q 2 ( d 1 , d 2 ) . . . = . . . M X k =1 P M k = Q M ( d 1 , . . . , d M ) (46) with Q k ∈ R [ d 1 , . . . , d k ] . This system can be so lved using the Newton-Girar d for - mulas [18]; the solutions P 1 , . . . , P M are foun d to be th e M roots o f an M th degree po lynom ial. This solution does not require an y kno wledge on the c ovariance matrix C , which does not h av e any signiﬁcatio n in this context. Howev er , it often turns out that the roots P k are not all real, which makes the solution u seless in pr actice. Also, this zero- forcing metho d is very a wkward as it stron gly suffers from the presence of no ise. Especially , the large variance E[ | w M | 2 ] is consider ed equally to the typica lly very small variance E[ | w 1 | 2 ] , and therefo re the ﬁrst sample mom ent ¯ d 1 is no t more impo rtant in the ﬁnal computatio n than the last sample momen t ¯ d M 6 . V I I . S I M U L A T I O N A N D R E S U LT S In the follo wing, we use th e results that wer e p reviously derived in the case of a th ree-cell network that the ter minal wishes to track . The set o f cells studied alo ng this part are of relativ e powers P 1 = 4 , P 2 = 2 , P 3 = 1 . Before perfor ming the ﬁrst simulation of the com plete algorithm , we present in Figure 4 the relative erro r in the estimation o f the momen ts ν k = tr N ( HPH H ) k from the free deco n volved sample mom ents ¯ d k . It is observed that ev en for N = 256 , L = 51 2 the m ean relative erro r in the computed third moments is of o rder 1% . This suggests that the free deconv olution techniqu e is very a ccurate ev en f or non-in ﬁnite values of N and L . The bottleneck approx imation in the estimation of P 1 , . . . , P M , as will be o bserved in the coming plots, therefor e lies in th e convergence of the entries h kj tow ards a Gaussian i.i.d. process, and not in the inﬁnite matrix size assumption. In a ﬁrst simulation, we stud y th e conv ergence properties of the p roposed scheme. W e consider a large OFDM sy stem with 6 this would be here again a dishonest considerati on as one at least knows that d M is m ore uncert ain than d 1 . 9 256 512 1 , 024 2 , 048 0 0 . 2 0 . 4 0 . 6 0 . 8 1 · 10 − 2 Num b er of sub carriers N relativ e error | ¯ d 1 − ν 1 | ν 1 | ¯ d 2 − ν 2 | ν 2 | ¯ d 3 − ν 3 | ν 3 Fig. 4. Relati ve error on recovere d moments ν k = tr N ( HPH H ) k from the free decon vol ution algor ithm, L = 2 N 1 2 4 0 0 . 02 0 . 04 0 . 06 0 . 08 0 . 10 Estimated p o w ers Densit y Fig. 5. Cell power detect ion, N = 2048 , L = 4096 , MMSE estimat e, Perfect kno wledge of C 1 2 4 0 0 . 02 0 . 04 0 . 06 0 . 08 Estimated p o w ers Densit y 1 2 4 0 0 . 02 0 . 04 0 . 06 0 . 08 Estimated p o w ers Densit y Fig. 6. Cell power detecti on, N = 512 , L = 1024 , MMSE estimate (lef t) and ML estimat e (right ), Perfect knowledg e of C 1 2 4 0 0 . 05 0 . 10 0 . 15 0 . 20 Estimated p o w ers Densit y Fig. 7. Cell po wer detection, N = 512 , L = 1024 , 10 a ccumulat ions, MMSE estimate , Perfec t knowle dge of C 1 2 4 0 1 3 2 3 1 Estimated p o w ers Cum ulativ e distribution No accum ulation 10 accum ulations Optimal estimation Fig. 8. Ce ll powe r detect ion CDF , N = 512 , L = 1024 , Perfect knowl edge of C N = 2048 subcarr iers an d L = 4096 sampling per iods under ideal Gaussian i.i.d . input symbo ls, un correlated Gaussian channel freq uency respo nses and Gau ssian add itiv e wh ite noise with signal to n oise ratio SNR = 20 dB . T he covariance matrix C is the exact covariance matrix. Figure 5 provid es the results o f this simu lation for thousan d channel realisations. It is observed that the distribution o f the eigenv alues is largely spread over the expected eigenv alu es. This is explained by the slow conver gence natu re of the compu ted d p tow ards the correspo nding m oments when N → ∞ . Also, it is ob served that th e peak ce nters ar e offset from the expected p owers. This is main ly due to th e fact that the MMSE estimator is meant to minimize th e mea n quadratic error averaged over all possible sets P 1 , P 2 , P 3 and not for the particular set selected here 7 . Also, the actually n on-Gaussian prop erty of the noise p rocess 7 it was observ ed in particula r from other simulations that cells of equal po wers are generally better estimat ed. 10 1 2 4 0 0 . 05 0 . 10 0 . 15 0 . 20 Estimated p o w ers Densit y Fig. 9. Cell po wer detecti on, N = 512 , L = 1024 , MMSE estimate, Recursi ve update of C , 10 steps w as well as the inexact results fro m the free deconv olution process contribute to the offset. Figure 6 provide s a comp arison between the results g iv en by the MMSE estimator and the ML e stimator when N = 512 an d L = 1 0 24 . It tu rns out, as discussed ear lier , that th e ML estimate is mo re largely spread ar ound the expected cell powers than the MMSE estimate. In the following, we perform more r ealistic simu lations in which input signals are QPSK modulated instead of co mplex Gaussian, an d with more realistic channels of length varying from 1 to N/ 4 symbols. T o red uce the v ariance o f the estimates, we also average the sample d p over ten channel realizations, which in practice requir es aro und 1 ms o f data to proc ess. I n Figure 7, we took N = 512 , L = 1024 and a Rayleigh ch annel o f le ngth N / 8 . The cov ariance matrix C is still the exact covariance matrix. Then a hund red r ealisations of this proc ess are run. Th e SNR is still SNR = 20 dB in this secon d experiment. The cumulative distribution f unction (CDF) of the detected p owers is presented in Figure 8 and compare d to th e CDF when no accum ulation is performed . The three thresholds correspond ing to the three d etected c ell powers ca n be observed, with a slight shift fro m the expected cell powers caused both by the non-exact Gaussian assumption on w and o n the n on-exact Gau ssian i.i. d. assumption on the channel frequen cy responses. Now we propose to examine the performance of the iterativ e cell p ower recovery . W e initially set C init to the cov ariance matrix of a set of cells of powers P 1 = P 2 = P 3 = 2 . 5 . Then ten iterations of the cell-po wer detector are run, with at each step a reﬁnement of C . Note th at, since a t each step the sample moments ¯ d p are ac cumulated , th e en tries of the covariance matrix C are comp uted accordin gly , i.e. k accumulatio ns demand that C is compu ted from an effectiv e number k N sub carriers, as presented in Section VI. Figure 9 provides the results of the recursive algorithm, which p roves to perform s very accurately , with surpr isingly little dispersion around the detected powers com pared to Figu re 7. 1 2 4 0 1 3 1 6 1 Estimated p o w ers Cum ulativ e distribution i.i.d. Gaussian L TE-ETU L TE-EV A Optimal estimation Fig. 10. Cell po wer detection CDF in L T E channe ls, N = 512 , L = 1024 , 10 accumula tions, preset C Channel T ype RMS dela y spread Channel length EV A 357 ns N / 27 ETU 991 ns N/ 13 T ABL E II 3 G P P - LT E S TA N D A R D I Z E D S H O RT D E L AY C H A N N E L S Also, we test the r obustness of our algo rithm ag ainst prac- tical shor t channels, instead of high de lay spread c hannels. This is shown in Figure 10 which prop oses a compar ison between the id eal long chann el situation and the 3GPP-L TE [4] stand ardized E xtended V ehicular A (EV A) and E xtended T ypical Urb an (ETU) c hannels with para meters g iv en in T able II. Here we considered a m obile handset with N r = 2 anten nas, 256 subc arriers p er an tenna (an d then in total an effectiv e number of N = 512 subcarr iers), L = 1024 , SNR = 20 dB . The c ovariance m atrix C of the n oise pr ocess w is an approx imated matrix ob tained from intensive simulations on short chann els. Indeed, the covariance pa ttern is very different from the m atrices used in the Gaussian i.i.d. scenario and is difﬁcult to derive an alytically; especially we noticed that the smaller the delay sprea d, the larger the un certainty on the higher moments, whic h tu rns C into an ill-condition ed matrix. The results are averaged over ten chan nel realizations. The CDF of the detected power d istribution f or those channels is provide d in Figure 10. The latter shows a r ather good behaviour in both ETU and E V A chan nels. Nonetheless their short delay spreads lead to a larger variance in the mean power estimation. Surprisingly , it turn s out that th e distribution of the in put signals s ( l ) does not impa ct the system p erforma nce as lo ng as it is i.i.d . with zero m ean and unit variance. T his is a known result in free deconv o lution which has no t been proven yet. Therefo re in our simulation s, QPSK modu lations showed the exact same behaviour as purely Gau ssian d istributed input signals. 11 V I I I . D I S C U S S I O N A. Data , p rior and c onver gence W e previously d escribed and simulated a recursive algo- rithm m eant to con verge to an accurate estimate of th e cell powers P 1 , . . . , P M , whose conver gence we did not prove yet. Actually , a m athematical as well a s a philosoph ical reasons for this convergence can be ad vanced. First, note that a large num- ber of iteration s lead to a smaller variance of w = d − ˆ d ( P ) , therefor e to smaller entries of the noise c ovariance matrix C . As a con sequence C − 1 has large entries and the exponential terms e − 1 2 ( d − ˆ d ( P )) T C − 1 ( d − ˆ d ( P )) in the MMSE integrals (45) are rele vant on ly when th e differences d − ˆ d ( P ) a re very small; this is, when ˆ d ( P ) ≃ d . Therefo re the accuracy in the entries of C − 1 is n ot of fun damental importan ce, as lon g as they are large en ough . Th e conver gence of the itera ti ve process is ensured by the accum ulated d ata d themselves; but of cou rse this convergence is accelerated with good appr oximation s of C . This obser vation is very gen eral in the Bayesian prob ability theory con text and h as been observed and analyzed tho roug hly by Jaynes [ 15]. Bayesian pr obabilities rely on a balance between priors and d ata . If the available data is scarce, then prior infor mation is very valuable; in o ur situation, as shown by simulation, if C wer e known, then e ven a sin gle chann el realization allows to app roxima tely recover the tran smitted powers. On th e contrar y , large amounts of d ata prev ail over prior information s o that even u nfortu nate prio rs may not badly alter the a posteriori pr obability; this explains why a precise estimation of C is n ot ma ndatory when large accum ulations of d ata ar e consider ed. Note th at in this case, ML estimates and MMSE e stimates show similar perform ance, since the posterior d istribution p ( P | Y , I ) is very peaky in the correct value for P . Howe ver, it is importan t to und erline the fact th at the offset problem, observed in simulatio n, in the estimation of P 1 , . . . , P M will not be redu ced by m ere data accum ulations. Indeed , the issue co mes here from the free deco n volution process which is not a ccurate for ﬁ nite N . Therefo re, an appro- priate trade-off between large N an d many accumulations mu st be foun d: large N entail mo re accu rate f ree deconv olution processes at the expense o f compu tationally dema nding large matrix p rodu cts, while many accumulatio ns ensures a faster conv ergence ( to offset cell p owers) at a lower computation cost. Ano ther app roach consists in compu ting higher o rder moments to strengthen the cell power estimation. Howe ver, high order sam ple m oments have a large variance for ﬁnite N . Their impact on the ﬁna l estimation might then be very limited since they are very un reliable. For instance we o bserved in simulations th at, for N = 51 2 , typical matrices C verify C 11 ≃ 1 e − 2 and C 33 ≃ 1 e 4 , which gives a million times more credit to the ﬁrst o rder sample moment than to th e th ird order samp le m oment. Bring ing in the compu tation f ourth order mom ents with N = 512 would turn ou t n ot worth the computatio n increase. B. Ap plicability As reminded in the intro duction of this paper, usual cell power detection tech niques u se scar ce an d largely interfered synchro nization sequen ces. M uch time, but low compu tation, is then requ ired to detect cells with high efﬁciency . From the autho rs’ experience in the co ntext of 3GPP-L TE, many problem s arise when those p ilot seq uences are syn chron ized and the emitting cells ha ve equal p owers since both signals interfere to the detrimen t of th e decode r . In the previously derived sch eme, even cells of equal power can be counted and isolated . Indeed, if M cells are expected and the CDF of the estimated powers shows a large jump of 2 / M for a g iv en power P , then we can deduce that two cells have eq ual power P . It is th erefor e n ot necessary to enhanc e th e qua lity of the estimation to sep arate tho se two cells. Howe ver, some limitations can be f ound in th e applicab ility of this cog nitive scheme . Firstly , load ed cells are req uired to ensu re reliability of th e estimates; this r equires that the underly ing standards o ptimally r euse the allocated bandwidth . Secondly , if many cells are desired to b e de tected, th en very high ord er mome nts must be computed which , as discussed earlier are on ly re liable if the num ber of subcarrie rs N and the n umber of accumulation s are very large. High order moments also require very large matrix prod ucts, which migh t be too d emandin g to the embedd ed system processors. Also , under the limited state of k nowledge of the system mod el, neighbo ring cells can only b e de tected if the propag ation channels ar e very freq uency selectiv e. The ﬂatter th e ch annels, the more num erous the accumulations requir ed to come up with a reliable estimation . The latter limitation is obviously the m ost constrainin g factor . I X . C O N C L U S I O N In this pape r , we pr ovided a practical way to blindly detect neighbo ring cells in a distributed OFDM network. Assumin g constant tran smission in those cells on a fairly large band - width, we showed that one can d etermine the ind ividual SNR of ev ery surrou nding cell provided that th e chann el delay spread is sufﬁciently large. This scheme is particularly suited for the future co gnitive OFDM systems which aim to r educe the amou nt of synchr onization sequences while keepin g track of the neighb oring cells. X . A C K N O W L E D G E M E N T This work was partially supporte d by the Europ ean Com- mission in the framework of the FP7 Network of Excellence in Wireless Com municatio ns NEWCOM++. A P P E N D I X Consider Gau ssian channels with independ ent fr equency responses h ij ( i ∈ { 1 , . . . , M } , j ∈ [1 , N ] ). Th en, for a g iv en 12 j , noise taken ap art, we denote d ( j ) p = M X i =1 P i | h ij | 2 ! p = X k 1 ,k 2 ,...,k M P m k m = p C k 1 p C k 2 p − k 1 . . . C k M p − k 1 − ... − k M − 1 M Y l =1 ( P i | h ij | 2 ) k l (47) where the pr oduct of the binom ial coefﬁcients C k 1 p C k 2 p − k 1 . . . C k M p − k 1 − ... − k M − 1 is the multinomial coefﬁcient p ! k 1 ! k 2 ! ...k M ! . Hence the simpliﬁed expression, when averaging d ( j ) p over j ∈ [1 , N ] ( with N → ∞ to ensure 1 N P j | h ij | 2 → E j [ | h ij | 2 ] ) d p = p ! X k 1 ,k 2 ,...,k M P m k m = p E j h Q M i =1 ( P i | h ij | 2 ) k i i Q M i =1 k i ! (48) The m oments of the com plex variable | h ij | 2 are deduced from the mome nts of a G aussian real variable by considering | h ij | 2 = ( h ℜ ij + i · h ℑ ij )( h ℜ ij − i · h ℑ ij ) (49) where the re al and imaginary parts h ℜ ij and h ℑ ij of h ij are standard Gaussian variables whose even moments are known E [( h ℜ ij ) 2 p ] = E [( h ℑ ij ) 2 p ] = (2 p )! 2 p p ! (50) hence equation (38). This results in d p = p ! 2 2 p X k 1 ,...,k M P i k i = p M Y i =1 ( k i X k =0 (2 k )!(2[ k i − k ])! ( k !) 2 ([ k i − k ]!) 2 ) P k i i (51) R E F E R E N C E S [1] C. E. Shannon, “ A m athemat ical theory of communications”, Bell System T echnica l Journal , vo l. 27, no. 7, pp. 379-423, 1948. [2] E. Bigli eri, J. Proakis, and S. Shamai, “Fa ding chann els: Information- theoret ic and communica tions aspects, ” IEEE Trans. on Inf. Theory , vol. 44, no. 6, pp. 2619-2692, Oct 1998. [3] J. H. W inters, “Smart Antennas for W irele ss Systems”, IEEE Personnal Communicat ions, vol . 5, no. 1, pp.23-27, Feb . 1998. [4] http://www .3gpp.org/ Highlight s/L T E/L TE.htm [5] http://wire lessman.org/ [6] K. Karakayal i, G. J. Foschini , R. A. V alenzue la, “Net work Coordinati on for Spectra lly Efﬁc ient Communicat ions in Cellul ar Systems”, August 2006, IEEE W irele ss Communica tions Magazine (in vited). [7] Ø. Ryan, M. Debbah, “Free decon voluti on for signal processing applica- tions”, under re view , http://a rxi v .org/abs/c s.IT/0701025 [8] A. M. Tulino , S. V erd ´ u, “Rando m Matrix Theory and Wire less Commu- nicat ions”, No w Publishe rs, vol. 1, Issue 1, 2004. [9] P . Biane, “Free Probability for Probabili sts”, Quantum Probability Com- municati ons, W orld Scientist, 2003. [10] H. U rko witz, “Energy dete ction of unknown dete rministic signals, ” Proc. of the IEEE, vol. 55, no. 4, pp. 523-531, Apr . 1967 [11] V . I. Kostyle v , “Energy detection of a signal with Random Am plitude ”, Proc IEE E Int. Conf. on Communication s (ICC’02). New Y ork City , pp. 1606-1610, May 2002. [12] S. Haykin, “Cogniti ve Radio: Brai n-Empowe red W ireless Communica- tions”, IEEE Journal on Selected areas in Comm. , vol 23., no. 2, Feb 2005. [13] G. Caire and S. Shamai, “On the achie vable throughput of a mul tiante nna Gaussian broadcast channel” , IEEE Trans. on Information T heory , vol. 49, no. 7, pp. 1691-1706, 2003. [14] E. T . Jaynes, “Information Theory and Statist ical Mechanics”, Physical Re vie w , APS, vol. 106, no. 4, pp. 620-630, 1957. [15] E. T . Jaynes, “Probability Theory: The Logic of Science ”, Cambridge Uni versi ty Press, 2003. [16] Ø. Ryan, M. Debbah, “Multiplica ti ve free con voluti on and information- plus-noise type matrices”, Submit ted to J ournal Of Functional Analysis , 2008, http://www .iﬁ.uio.no/ ˜ oyvindry/ multfreecon v .pdf [17] Ø. Ryan, M. Debbah, “Channe l Capac ity E stimation using Free Prob- abili ty Theory”, IEEE Trans. on Signal Proce ssing, vol. 56, no. 11, pp. 5654-5667, Nov . 2008. [18] R. Seroul, D. O’Shea, “Programming for Mathema tician s”, Spri nger , 2000 [19] M. Hall J r . “Combinato rial Theory”, John Wi ley & Sons, Inc. New Y ork, NY , US A, 1998 [20] J.A. Bingham, “Multicarr ier modulati on for data transmission: An idea whose time has come, ” IEE E Comm. Mag., vol. 28, pp. 5-14, May 1990. [21] P . V iswanath , D. Tse and R. Laroia, “Opportunistic beamforming using dumb antennas”, IEEE Internationa l Symposium on Information Theory . IEEE. 2002, pp.449 [22] D. V . V oiculesc u and K. J. Dykema and A. Nica, “Free Random V ariable s”, Americ an Mathematica l Society , 1992. [23] J. Mitola, and Jr GQ Maguire, “Cognit i ve radio: making s oftwa re radios more personal ”, Personal Communications, IEE E [see also IEEE W ireless Communicat ions] 6(4), 13-18, 1999. [24] C.B. Papadia s, “Globally con ver gent blind s ource separat ion based on a multiuse r kurtosis maximizatio n criterion” , IEE E T rans. on Signal Processing, Dec. 2000, pp. 3508-3519. [25] J. V illares, “Symbol-rate Second-Order Blind Cha nnel Estimation in Coded Transmissions”, in Proc. ICASSP , Honolulu, Hawai i, April 2007. [26] C. Herzet , et al. , “Code-Aided Turb o Synchronizati on”, IEEE Proceed- ings, Special issue on turbo techniques, 2007. [27] H. Tri gui, D. T .M. Slock, “Cocha nnel interferen ce cancel lation within the current GSM standard”, IEEE ICUPC’98, Florence, Italy , Oct. 1998. [28] R. W . Thomas, L. A. DaSilva , A.B. MacK enzie,“Cogni ti ve networks”, DySP AN 2005 [29] E. V . Belmega, S. Lasaulce and M. Debbah, ”Decentra lized Hando vers in ce llular network s with cognit i ve terminals”, ISCCSP-2008 Malt a, 2008. [30] E. V . Belmega, S. Lasaulce and M. Debbah, ”Po wer Control in Dis- trib uted Multiple Access Channels with Coordina tion”, WNC3, 2008. [31] R. Couillet , M. Debbah, “Multi ple Antenna Cogniti ve Recei vers and Signal Detecti on”, Planned for s ubmission to IE EE T rans. on Information Theory .

Cognitive OFDM network sensing: a free probability approach

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment