An integral formula for large random rectangular matrices and its application to analysis of linear vector channels
A statistical mechanical framework for analyzing random linear vector channels is presented in a large system limit. The framework is based on the assumptions that the left and right singular value bases of the rectangular channel matrix $\bH$ are ge…
Authors: Yoshiyuki Kabashima
An inte gral formula for lar ge random recta ng ular matric es and its application to analysis of linear v ector channels Y oshiyuki Kabash ima Departmen t of Co mputation al Intelligenc e and Systems Science T okyo Institute of T echn ology , Y okohama 22 6–850 2, Japan Email: k aba@dis.titech.ac.jp Abstract —A statistical mechanical framew ork for analyzing random linear ve ctor channels is presented in a lar ge system limit. Th e framework is based on th e assumptions t hat the l eft and right singular value bases of the rectangular channel matrix H are generated indep endently from un ifo rm distributions over Haar measures and the eigen values of H T H asymptotically fo ll ow a certain specific distribution. Th ese assump tions make it possible to characterize the communication perfo rmance of the channel utilizing a n in tegral f ormula with r espect to H , which is analogous to the one introduced by Marinari et. al. in J. P hys. A 27, 7647 (1994) fo r large rand om square (symmetric) matrices. A computationally feasible algorithm for approximately decoding recei ved signals b ased on the integral fo rmula is also prov id ed. I . I N T RO D U C T I O N In a genera l s cen ario for linear vector channels, multiple messages ar e tran smitted to the receiver , being linearly trans- formed to multip le output signals by a r andom matrix and degraded by channel n oises. Th is yields a complicated depe n- dence on message variables, which en sures that the prob lem of inferrin g the tra nsmitted messages from the received outpu t signals is non -trivial. In general, inf erence pro blems of this kind can be m apped to vir tual magnetic systems governed by random interaction s [1]. This similarity has promoted a sequence o f statistical mechanical analy ses of linear vector channels in a large system lim it from the beginning of this century [2], [3], [4], [5], [ 6], [7], [8]. In the simplest ana lysis, each e ntry o f the channel matrix is regarde d as an in depend ent an d identica lly distributed (IID) random variable. Howe ver , suc h a tre atment is not nec essar - ily ad equate for de scribing r ealistic systems, in which non - negligible statistical correlations acro ss th e matrix entries are created by spatial/time pro ximity of messages/antennas o r ma - trix design for enhancem ent of co mmunica tion per forman ce. Therefo re, the dev elo pment o f methodolo gies th at can deal with correla tions in the chan nel m atrix is o f great impo rtance to research in th e area o f line ar vector channels. It is intended that the present article s h ould contribute such a methodo logy for applicatio n to these commu nication channels. More precisely , we develop a statistical mechanical fram ew ork for analyzing linea r vector channels so th at the in fluence o f the corr elations across the matrix e ntries can be taken into account. Th e developed fr amew or k is applicab le not only to Gaussian ch annels of Gaussian inpu ts [9], but also general memory -less channels of continuo us/discrete inputs, which are characterized b y a factorizab le prior distribution. This article is organized a s follows: In the next section , the model of linear vector chann els that we fo cus on here in is defined. In section III, which is the main part of th e current article, an inte gr al formu la with respect to large rando m rectangu lar matrices is introdu ced. A scheme to assess the p er- forman ce of the linear vector channel an d a computationally feasible appro ximate deco ding alg orithm are developed on the basis of this for mula. The utility of th e developed schem es is examined in sectio n I V by application to an example system. The final section sum marizes th e pr esent stud y’ s finding s. I I . M O D E L D E FI N I T I O N For simplicity , we here assume that all the variables rele vant to the com municatio n are real; but extending th e following framework to complex variables is straightforward [ 10]. Let us suppose a line ar vector chann el in which an input m essage vector of K com ponents, x = ( x k ) , is linearly transformed to an M dimensional sequence, ∆ = (∆ µ ) , by a K × N channel matrix , H = ( H µk ) , as ∆ = H x . For gen erality and simplicity , we assume a g eneral memo ry-less ch annel, which implies that an N dimensio nal o utput signal vector, y = ( y µ ) , follows a certain factorizable con ditional distribution as P ( y | x ; H ) = P ( y | H x ) = N Y µ =1 P ( y µ | ∆ µ ) . (1) In a ddition, w e assume a factorizab le prior distribution P ( x ) = K Y k =1 P ( x k ) , (2) for x , wh ich ma y b e continuo us or discre te. An expr ession of the singular value decomp osition of H H = U D V T , (3) is th e basis of our f ramework. Here, the super script T de- notes the tran spose of the m atrix to which it is attach ed, D = diag ( d k ) is an N × K diagonal matrix compo sed of the singular values d k ( k = 1 , 2 , . . . , min( N , K ) ) , where min( N , K ) den otes the lesser value of N and K . The values d k are linked to th e eigenv alues of H T H , λ k , as λ k = d 2 k for k = 1 , 2 , . . . , min( N , K ) . U and V ar e orthogo nal matrices of o rder N × N and K × K , re spectiv ely . In order to han dle correlation s in H analytically , we assum e that U and V are independ ently gener ated fro m unif orm d istributions of the Haar measures o f N × N a nd K × K ortho gonal matr ices, respectively , a nd that the empir ical eigenv alue distribution o f H T H , K − 1 P K k =1 δ ( λ − λ k ) = (1 − min( N , K ) /K ) δ ( λ ) + K − 1 P K k =1 δ ( λ − d 2 k ) conv erges to a certain specific distri- bution ρ ( λ ) in the limit as N and K tend to in finity while keeping the load β = K / N ∼ O (1) . Con trolling ρ ( λ ) allows us to express various second -order correla tions in H . I I I . A N A L Y S I S A. An integr al formula for lar ge random r ectangu lar matrices W ith knowledge of H , the receiver decodes y in orde r to infer x , wh ich is perf ormed on the b asis of the Bayes formula P ( x | y ; H ) = Q N µ =1 P ( y µ | ∆ µ ) Q K k =1 P ( x k ) P ( y ; H ) . (4) Here, the probab ility P ( y ; H ) = T r x N Y µ =1 P ( y µ | ∆ µ ) K Y k =1 P ( x k ) , (5) expresses the margin al p robability with re spect to y , wher e T r x denotes su mmation or integration over the all possible states of x . Eq. (5) also serves a s the p artition func tion concern ing the m essage vector x in statistical mechanics. Let us examine statistical pro perties of Eq . (5) pr ior to analyzing Eq. (4). The expr ession P ( y ; H ) = T r u , x exp i u T H x N Y µ =1 b P ( y µ | u µ ) K Y k =1 P ( x k ) , (6) is useful fo r this purp ose, w here i = √ − 1 , u = ( u µ ) and b P ( y µ | u µ ) = (2 π ) − 1 R d ∆ µ exp ( − i u µ ∆ µ ) P ( y µ | ∆ µ ) de notes the Fourier transfor mation of likelihood P ( y µ | ∆ µ ) . W e su bsti- tute H in Eq. ( 6) by Eq. (3) and take an a verage with r espect to U and V . For this a ssessment, it is no tew or thy that fo r any fixed set of u and x , e u = U T u and e x = V T x behave as co ntinuou s random variables that satisfy strict constrain ts N − 1 | e u | 2 = N − 1 | u | 2 = T u and K − 1 | e x | 2 = K − 1 | x | 2 = T x . In th e limit N , K → ∞ keeping β = K/ N ∼ O (1 ) , which we will h ereafter a ssume if necessary , this yields an expre ssion 1 N ln exp ( i u T H x ) = F ( T x , T u ) , (7) where · · · den otes the average with respect to U and V , an d F ( ξ , η ) = Extr Λ ξ , Λ η − β 2 h ln(Λ ξ Λ η + λ ) i ρ − 1 − β 2 ln Λ ξ + β Λ ξ ξ 2 + Λ η η 2 − β 2 ln ξ − 1 2 ln η − 1 + β 2 , (8 ) where h· · ·i ρ denotes the average with respect to ρ ( λ ) , while Extr θ {· · ·} r epresents extremization with re spect to θ [11]. This formula is analogou s to the one known for en sembles of random squar e (symmetric) matrices [1 2], [13], [1 4], which is closely related to the R -tran sformation dev elo ped in fr ee probab ility th eory [15], [ 9], [16]. Se veral integral f ormulae for large rando m matrices related to Eq. (7) , but fo r d ifferent large system limits, are p resented in [ 17]. Eq. (7) implies 1 N ln T r y P ( y ; H ) = Extr T x ,T u { F ( T x , T u ) + β A x ( T x ) + A u ( T u ) } , (9) where A x ( T x ) = Extr b T x n b T x T x / 2 + ln T r x P ( x ) e − b T x x 2 / 2 o and A u ( T u ) = Extr b T u n b T u T u / 2 + ln T r y ,u b P ( y | u ) e − b T u u 2 / 2 o . The norm alization co nstraint T r y P ( y ; H ) = 1 , in con- junction with the extrem ization in Eq . (9), yield s T x = T r x x 2 P ( x ) , b T x = 0 , T u = 0 an d b T u = β h λ i ρ T x . The physical implication of these results is tha t compon ents o f ∆ = H x behave as IID Gau ssian variables of zero m ean and variance b T u in the large system limit when x is drawn fro m Eq. (2), while U and V are independen tly gener ated f rom the Haar m easures. B. P erformance assessment Now , we are read y to analy ze th e typical com munication perfor mance of the cu rrent ch annel mo del. This is perfo rmed by assessing the typ ical mutual in formatio n (per outpu t signal) between x an d y , I ( X, Y ) , based on Eqs. (4) and (5) a s I ( X, Y ) = 1 N T r y , x P ( y | x ; H ) P ( x ) ln P ( y | x ; H ) P ( y ; H ) = F + T r y Z D z P y | q b T u z ln P y | q b T u z , (10) where F = − 1 N T r y P ( y ; H ) ln P ( y ; H ) , (11) represents the cond itional entropy of y , and serves as the av- erage free energy with respect to x . D z = (2 π ) − 1 / 2 dz e − z 2 / 2 denotes the Gaussian measure. The statistical pro perties o f ∆ ev aluated in the la st parag raph are emp loyed to assess the second term on the right-h and side of the last line of Eq. (1 0). F can be e valuated by means of the r eplica method. Namely , we ev alua te the n -th mom ent of th e par tition function P ( y ; H ) for n ∈ N as T r y P n +1 ( y ; H ) = T r y , { x a } , { u a } exp i n X a =1 ( u a ) T H x a ! × n +1 Y a =1 N Y µ =1 b P ( y µ | u a µ ) × n +1 Y a =1 K Y k =1 P ( x a k ) , (12) and assess F as F = − lim n → 0 ∂ ∂ n 1 N ln T r y P n +1 ( y ; H ) , (13) analytically continuin g expressions obtained for Eq. (1 2) from n ∈ N to n ∈ R . Here, { x a } denotes a set of n + 1 replicated vectors x 0 , x 1 , · · · , x n , with { u a } defined similar ly . Eq. (13) is gene rally expressed using F ( ξ , η ) , an d the deriv ation of the expression can b e foun d in [11]. In particu lar , the expression obtaine d under the re plica symme tric ansatz, which is belie ved to be correct for th e curr ent case since the inference is perfo rmed o n th e basis of the cor rect p osterior (4) [18], is giv en in a co mpact form as F = − Extr q x ,q u {A xu ( q x , q u ) + β A x ( q x ) + A u ( q u ) } , (14) where A xu ( q x , q u ) = F ( T x − q x , q u ) + b T u q u 2 , (15) A x ( q x ) = E xtr b q x − b q x q x 2 + Z D z P ( z ; b q x ) ln P ( z ; b q x ) , (16) and A u ( q u ) = E xtr b q u − b q u q u 2 + T r y Z D z P ( y | z ; b q u ) ln P ( y | z ; b q u ) , (17) in which P ( z ; b q x ) = T r x P ( x ) e − b q x x 2 / 2+ √ b q x z x and P ( y | z ; b q u ) = R D sP y | q b T u − b q u s + p b q u z . The q x and q u determined by Eq . (14) repre sent K − 1 | h x i | 2 and − N − 1 | h u i | 2 , r espectiv ely , where h· · ·i denotes averaging over the posterior d istribution (4) while [ · · · ] indic ates th e av era ge with r espect to y , U and V . These av erag es, h· · ·i and [ · · · ] , corr espond to the therm al and qu enched averages in statistical mech anics, respectively . The quantities b q x and b q u appearin g in Eqs. (16) an d (17) can be used f or a ssessing perfor mance mea sures other th an Eq. (10), such as the mean square error ( MSE) an d th e bit err or rate (BER). C. Compu tationally feasible appr oxima te decodin g Let us su ppose a situation which requires ev alua tion of the posterior average m x = T r x x P ( x | y ; H ) , (18) where m x = ( m xk ) , with similar notatio n used fo r other vectors below . Eq. (18) serves as the estimator that min imizes the MSE in gen eral, and can be used to m inimize the BER for binary messages. Ex act assessment o f such averages is, howe ver, comp utationally difficult for large system s, which motiv ates us to develop compu tationally feasible appro xima- tion algorithms [19], [3], [20]. A generalized Gibbs free energy e Φ( m x , m u ; l ) = Extr h x , h u { h x · m x + h u · m u − ln ( Z ( h x , h u ; l )) o , (19) where Z ( h x , h u ; l ) = T r x , u Q N µ =1 b P ( y µ | u µ ) × Q K k =1 P ( x k ) × exp h x · x + h u · (i u ) + (i u ) T ( l H ) x , offers a useful basis for this pur pose since Eq . (18) is characterized as the unique saddle p oint o f E q. ( 19) for l = 1 [2 1], [2 2]. MPforP erceptron { Perform Initialization ; Iterate H-Step and V -S tep alternately sufficient times; } Initialization { χ x ← 1 K K X k =1 x 2 k P ( x k ); b χ x ← 0; Λ x ← 1 χ x − b χ x ; m xk ← T r x k x k P ( x k ) ( k = 1 , 2 , . . . , K ); h u ← H m x ; m u ← 0 ; } H-Step { Search ( χ u , Λ u ) for given ( χ x , Λ x ) to satisfy conditions χ x = fi Λ u Λ x Λ u + λ fl ρ and χ u = 1 − β Λ u + fi β Λ x Λ x Λ u + λ fl ρ ; b χ u ← 1 χ u − Λ u ; h u ← h u − b χ u m u ; m uµ ← ∂ ∂ h uµ ln „ Z Dx P ( y µ | p b χ u x + h uµ ) « ( µ = 1 , 2 , . . . , N ); h x ← H T m u ; χ u ← − 1 N N X µ =1 ∂ 2 ∂ h 2 uµ ln „ Z DsP ( y µ | p b χ u s + h uµ ) « ; Λ u ← 1 χ u − b χ u ; } V -Step { Search ( χ x , Λ x ) for given ( χ u , Λ u ) to satisfy conditions χ x = fi Λ u Λ x Λ u + λ fl ρ and χ u = 1 − β Λ u + fi β Λ x Λ x Λ u + λ fl ρ ; b χ w ← 1 χ x − Λ x ; h x ← h x + b χ x m x ; m xk ← ∂ ∂ h xk ln “ T r x P ( x ) e − 1 2 b χ x x 2 + h xk x ” ( k = 1 , 2 , . . . , K ); h u ← H m x ; χ x ← 1 K K X k =1 ∂ 2 ∂ h 2 xk ln “ T r x P ( x ) e − 1 2 b χ x x 2 + h xk x ” ; Λ x ← 1 χ x − b χ x ; } Fig. 1. Pseudocode of the m essage-passi ng algori thm M Pfor Perceptr on [23]. T he symbols “;” and “ ← ” represent the end of a command line and the operation of substitut ion, respecti ve ly . The quanti ties Λ x and Λ u are the counter parts of Λ ξ and Λ η in Eq. (8) for ξ = χ x and η = χ u , respecti vely . Unfortu nately , the ev aluation of Eq. (19) is also com puta- tionally difficult. One approa ch to overcoming this difficulty is to perform a T ay lor expansion aroun d l = 0 , for which e Φ( m x , m u ; l ) can be analytically calculated as an excep tional case, and su bstitute l = 1 in th e expression obtained [ 21]. Howe ver , the ev alu ation of h igher-order terms, which are not negligible in general, requ ires a comp licated calculation in this expansion, which sometimes prevents the sch eme f rom being practically feasible. In or der to av oid such d ifficulty , we take an a lternative appr oach here, which is inspired by a deri vativ e of E q. ( 19), ∂ e Φ( m x , m u ; l ) ∂ l = − (i u ) T H x l , (20) following a st r ategy proposed by Opper and W inthe r [22]. Here, h· · ·i l represents the a verage with r espect to the generalized weight Q N µ =1 b P ( y µ | u µ ) × Q K k =1 P ( x k ) × exp h x · x + h u · (i u ) + (i u ) T ( l H ) x , in which h x and h u are deter mined so as to satisfy h x i l = m x and h (i u ) i l = m u , respectively . The right-hand side of this equation is an average of a qua dratic f orm compo sed of many rand om v ariab les. The cen tral limit theore m implies that such an average does not d epend on the details of the o bjective distribution, but is determined only by the values of the fi rst and second moments. In order to construct a simple approx imation scheme, let us assume th at the seco nd mo ments are c haracterized mac roscop- ically by | x | 2 l − | h x i l | 2 = K χ x and | u | 2 l − | h u i l | 2 = N χ u . Evaluating the right-han d side o f Eq. (2 0) u sing a Gaussian d istribution, the first and second m oments of which are co nstrained to be identical to those of the gener alized weight, and integrating from l = 0 to l = 1 , we have e Φ( χ x , χ u , m x , m u ; 1) − e Φ( χ x , χ u , m x , m u ; 0) ≃ − m T u H m x − N F ( χ x , χ u ) , (21) where the f unction F ( ξ , η ) is provided as in Eq. (8) by the empir ical eigenv alue spectrum of H T H , ρ ( λ ) = K − 1 P K k =1 δ ( λ − λ k ) and the macrosco pic secon d m oments χ x and χ u are included in argum ents of the Gibbs free energy b ecause the rig ht-hand side of Eq. (2 0) depen ds o n these mo ments. Eq. ( 21) offers a comp utationally feasible approx imation o f Eq. (1 9) for l = 1 , since assessment of e Φ( χ x , χ u , m x , m u ; 0) , in which o ne can perfor m summa- tions with respect to relevant variables independ ently , can be achieved at a reason able com putational cost. Although evaluation of E q. ( 21) is comp utationally feasible, searching f or saddle p oints o f this fu nction within a practica l time is still a non -trivial p roblem. In Fig. 1, we presen t a message-passing type algor ithm, which was recently prop osed for a classification prob lem of single lay er percep trons [23], as a p romising heuristic solu tion for this pro blem. The efficacy of this method u nder appro priate cond itions was experime ntally confirmed fo r th e per ceptron pr oblem, and to the extent to which it has been a pplied to se veral ensembles of lin ear vector ch annels, this algo rithm has also b een shown to exhibit a reasonable perfo rmance for the current inferen ce task as well. Howe ver , its prop erties in cluding convergence condition s h av e n ot y et been fu lly clarified , and, th erefore further in vestigation is ne cessary f or the theoretical validation and improvement of the pe rforman ce of th is m ethod. I V . E X A M P L E : W E L C H B O U N D E Q U A L I T Y S E Q U E N C E S In ord er to demo nstrate the utility of the proposed appro ach, let us app ly th e curren t meth odolog ies to the analy sis of the m atrix e nsemble that is ch aracterized by ρ ( λ ) = (1 − β − 1 ) δ ( λ ) + β − 1 δ ( λ − β ) u nder the assumption β > 1 , which correspo nds to the case o f W elch bound equality sequen ces (WBES) [24]. W e fo cus o n the case of th e Gaussian chan nel P ( y | ∆) = (2 π σ 2 ) − 1 / 2 exp − ( y − ∆) 2 / (2 σ 2 ) and binary inputs x ∈ { +1 , − 1 } K , since this constitutes a simple, yet non-tr i v ial prob lem. Under these assumptions, the developed framework has a highe r cap ability than is requ ired f or the assessment of the ty pical co mmunica tion perfor mance with respect to the matrix ensemble, which can be carried out by a simpler method dev elo ped by the author an d his colleagues [25], [1 0], as was recen tly shown by Kitagawa and T an aka [26]. Nevertheless, the framework is still useful as one can derive a compu tationally f easible appr oximate d ecoding algo- rithm of good c on vergence pr operties ba sed on the pro cedure shown in Fig . 1. For Gaussian channels, Λ u in Fig. 1 can be fixed as Λ u = σ 2 in g eneral. This yie lds an algo rithm m t +1 u = 1 σ 2 + b χ t u y − H m t x + b χ t u m t u , (22) m t +1 xk = tanh N X µ =1 H µk m t +1 uµ + b χ t +1 x m t xk ! (23) ( k = 1 , 2 , . . . , K ) , for WBES, where t den otes the numbe r of iterations. b χ t u in Eq. (22) is pr ovided as b χ t u = β / Λ t x , where Λ t x is determin ed so as to satisfy χ t x = (1 − β − 1 ) / Λ t x + β − 1 σ 2 / ( σ 2 Λ t x + β ) fo r giv en χ t x = 1 − K − 1 | m t x | 2 . Utilizin g the identical Λ t x , b χ t +1 x in Eq . (2 3) is evaluated as b χ t +1 x = 1 / χ t x − Λ t x . Fig. 2 compares th e BER for the theoretical a ssessment by the replica method with the experime ntal ev aluation obtained by the algorith m of Eq s. (22) and (23). In the exper iments, the estimates of the bin ary messages are comp uted as b x k = sign( m xk ) fo r k = 1 , 2 , . . . , K , where sign( a ) = a/ | a | fo r a 6 = 0 . This deco ding scheme is optimal f or minim izing BER if m x represents the correct p osterior av era ge (1 8) [27]. The excellent ag reement between the curves and markers in this plot vali d ates both the perform ance assessment based on the replica meth od and that based on the developed algorithm. A characteristic feature of Eqs. (2 2) and (2 3) is the inclusion of macroscop ic variables b χ t u and b χ t +1 x , which are expecte d to act to cancel the self-reactions from previous states. [28]. Fig. 3 plots the influ ence of this operation , ind icating that the can cellation acts to maintain the q uality o f the c on verged solution u p to larger β un der a con dition of fixed SNR. V . S U M M A RY In summa ry , we h av e developed a framework to analyze linear vector ch annels in a large system limit. The frame- work is based on the assump tions that the left an d right singular value b ases of the channel matrix can b e regard ed as indepe ndently drawn fro m Haa r measures over or thogon al (unitary , if the n umber field is define d over the complex variables) groups, and that the e igenv alues of the cross cor re- lation matrix of the channel matrix asy mptotically appro ach 0.001 0.01 4 4.5 5 5.5 6 6.5 7 SNR[dB] P b Fig. 2. BER vs. signal-to-no ise ratio (SNR) for binary inputs for the case β = 1 . 1 . The SNR plotted on the horizontal axis is giv en by − 10 log 10 (2 σ 2 ) while the vertica l axis denotes the BER. The curves indicat e theoreti cal predict ions, which correspond to the s calar Gaussian channe l, WBES and the basic matrix ensemble (BASIC ) from the bottom. Sample matric es of B ASIC are composed of IID entries of zero mean and 1 / N va riance Gaussian random varia bles. V alues for WBES and B ASIC are assessed by the replica method. The m arke rs indica te expe rimental estimate s of the BER obtain ed from 500 sample s ystems with K = 2048 and N = 1862 on the basis of the algorithm shown in Fig. 1. Excelle nt agreeme nt betwee n the curves and marker s vali dates both the performance analysis based on the replica method and that of the dev eloped appro ximation algorithm. a certain specific distribution in the limit of large matrix size. These m odeling assumptio ns allow a characterizatio n of the system in terms of an in tegral formula in two variables, which is fully d etermined by the eige n value distrib utio n. Upon applying this form ula in conjunction with the replica method , we h av e deriv ed a general expression fo r the typical mu tual informa tion of general memory -less channels with factorizable priors of con tinuous/discrete inp uts. W e hav e fu rther proposed a co mputation ally feasible decodin g algorithm based on the formu la, an d have fo und that numer ical results obtained fro m this algor ithm ar e in excellent agreemen t with the theoretical prediction s ev aluated b y th e replica method. Future research d irections include the application of the developed fram ew ork to various mod els o f linear vector ch an- nels, and further imp rovement o f the co mputation ally feasible decodin g algorithm. A C K N O W L E D G M E N T S The auth or thank s Jean-Bernar d Z uber fo r useful discus- sions concer ning Eq. (8). Th is research was suppo rted in part by Gr ants-in-Aid M EXT/JSPS, Jap an, No s. 17340 116 and 18079 006. R E F E R E N C E S [1] H. Nishimori, Statistical Physics of Spin Glasses and Information Pr ocessing (Oxford: Oxford Uni versity Press), 2001. [2] T . T anaka , Euro phys. Lett. 54 , 540, 2001; IEEE T rans. on Infor . Theory 48 , 2888, 2002. [3] Y . Kabashima, J. Phys. A 36 , 11111, 2003. [4] R. R. M ¨ uller, IEEE T rans. on Signal Proc essing 51 2821, 2003. [5] A. L. Moustaka s IEEE T rans. on Infor . Theory 49 , 2545, 2003. [6] D. Guo and S. V erd ´ u IEEE T rans. on Infor . Theory 51 , 1983, 2005. 0.001 0.01 0.1 0 10 20 30 40 50 t P b β=1.6 β=1.6 (No SRC) β=1.5 β=1.5 (No SRC) Fig. 3. Influence of the cancellat ion of self-reac tions in the approximate decodin g for WBES. E xperiment ally assessed trajec tories are plotted for two approximat ion algorithms for the cases β = 1 . 5 and 1 . 6 , where SNR is fixed to 6.0. The horizontal axis represents the number of itera tions, while the vert ical axis denotes the BER. The data are obta ined from 100 e xperiments of K = 512 systems. The first algorithm used in these exp eriments is that presente d by Eqs. (22) and (23) while the second algorithm, the results of which are denote d by “No SRC” in the figure, is found by consideri ng v anishing valu es of the m acroscop ic v ariables b χ t u and b χ t +1 x in E qs. (22) and (23). For both algorit hms, initia l states in the experi ments were set as m xk = tanh( θ k /σ 2 ) , where, in contrast to Fig. 1, θ k = P N µ =1 H µk y µ ( k = 1 , 2 , . . . , K ) . F or β = 1 . 5 , both algorit hms con verge to almost identi cal solutions, although the con vergenc e of the first algorithm is slowe r . Howe ver , for β = 1 . 6 , the second algo rithm con verge s to solutions of significantly higher BER while solution s found by the first algorithm still exhibi t rela tiv ely lo w values of the BER. T his indicate s that the cance llation of the self-reac tion terms in the approx imation algorithm acts to maintain the quality of the con ver gent solut ions for larger va lues of β . [7] J. P . Neirotti and D. Saad, Europh ys. Lett. 71 , 866, 2005. [8] A. Montanari and D. Tse, arXiv: cs/0602028 , 2006. [9] A. M. T ulino and S. V erd ´ u, Random Matri x Theory and W irele ss Communicat ions (Hanov er , MA: No w Publishers), 2004. [10] K. T ak eda, A. Hatabu and Y . Kabashima, J . Phys. A 40 , 14085, 2007. [11] Y . Kabashima, J. Phys.: Conf. Ser . 95 , 012001, 2008. [12] E. Marina ri, G. Pari si and F . Ritort, J. Phys. A 27 , 7647, 1994. [13] G. Parisi and M. Potters, J. Phys. A 28 , 5267, 1995. [14] C. Itzykson and J . B. Z uber, J . Math. Phys. 21 , 411, 1980. [15] D. V . V oiculescu, K. J. Dyke m a and A. Nica, F ree R andom V ariables (Provid ence, R.I.: American Mathematic al S ociet y), 1992. [16] T . T anaka, J . P hys.: Conf. Ser . 95 , 012002, 2008. [17] P . Z inn-Justin and J. B. Zuber, J. Phys. A 36 , 3173, 2003. [18] H. Nishimori, Eur ophys. Lett. 57 , 302, 2002. [19] M. K. V aranasi and B. Aazha ng, IEE E T rans. on Commun. 38 , 509, 1990; ibid. . 39 , 725, 1991. [20] T . T anaka and M. Okada, IEE E T rans. on Infor . Theory 51 700, 2005. [21] T . Plefka, J. Phys. A 15 , 1971, 1982. [22] M. Opper and O. Win ther, Jo urnal of Machi ne Learning Resear ch 6 , 2177, 2005. [23] T . Shinzato and Y . Kabashi ma, arXi v:0712.4050, 2007. [24] L. R. W elch , IEE E T rans. on Infor . Theory . IT -20 , 397, 1974. [25] K. T ake da K, S. Uda and Y . Kabashima, Europ hys. Lett. 76 , 1193, 2006. [26] K. Kitagawa and T . T anaka , Proc. The 30th Symposium on Information Theory and Its A pplicat ion (SIT A2007) (in Japanese ), 799, 2007. [27] Y . Iba, J. Phys. A 32 , 3875, 1999. [28] D. J. Thouless, P . W . Anderson and R. G. Pal m er, Phil. Mag. 35 , 593, 1977.
Original Paper
Loading high-quality paper...
Comments & Academic Discussion
Loading comments...
Leave a Comment