Almost-iid information theory

Almost-iid information theory Giulia Mazzola 1 , Da vid Sutter 2 , and Renato Renner 1 1 Institute for The or etic al Physics, ETH Zurich 2 IBM Rese ar ch Eur op e – Zurich Abstract Information-theoretic tec hniques are based on the assumption that resources are w ell c haracter- ized by indep enden t and identically distributed (iid) states. This assumption cannot b e justiﬁed op erationally , since, for example, correlations betw een subsequen t systems emitted b y a source cannot be detected b y any practical tomographic proto col. Op erationally motiv ated symmetry assumptions still imply , via de Finetti theorems, that the resources are describ ed by almost-iid states. This raises the question: Are almost-iid resources as eﬀectiv e as p erfect iid resources for information-pro cessing tasks? Here w e address this question and prov e that the conditional en- trop y of almost-iid states asymptotically coincides with that of iid states. As an application, this implies that squashed entanglemen t is robust for almost-iid states, asymptotically matc hing its v alue on iid states. 1 In tro duction A common assumption in physics is that the same exp eriment can b e rep eated many times indep en- den tly . More precisely , one often assumes that the outcomes X 1 , . . . , X n + k of running the exp eriment n + k times are described b y indep endent and iden tically distributed (iid) random v ariables. In tech- nical terms, this means that their joint distribution P X n + k 1 is factorized, i.e., P X n + k 1 = ( P X ) n + k . The Italian mathematician Bruno de Finetti cautioned that the “most common and misleading error” in probabilit y theory is treating the iid assumption as fundamen tal [ 19 ]. He pro vided a mathematically con vincing argument to op erationally justify the iid assumption, relating it to the p erm utation inv ari- ance of the outcomes [ 18 , 30 ]. F or many realistic systems, p ermutation in v ariance follows from natural assumptions suc h as the indistinguishabilit y of subsystems (see [ 34 ] for a more detailed discussion). 1 Supp ose that we hav e a source that generates random v ariables X n + k 1 , where we assume that the underlying joint distribution P X n + k 1 is p ermutation-in v arian t and that each random v ariable tak es v alues in the set X . When selecting n of these n + k random v ariables, i.e. ignoring k random v ariables, de Finetti’s theorem [ 20 ] shows that there exists a probability measure µ on the set of distributions on X such that     P X n 1 − Z ( Q X ) n µ (d Q )     1 ≤ 2 dn n + k , (1) where ∥·∥ 1 denotes the ℓ 1 -norm and d = |X | . This result justiﬁes that when ignoring k random v ariables the remaining ones are approximately a conv ex combination of iid random v ariables. De Finetti theorems hav e b een generalized to the quan tum case. In [ 12 ] a quantum de Finetti theorem has b een obtained as an implication of the classical result in the case where n = ∞ . This yields a result that is applicable under the assumption of having an inﬁnite num ber of samples. Later in [ 25 , 13 ] quantum de Finetti theorems hav e b een deriv ed that work for a ﬁnite num ber of samples — analogous to Equation (1) . F or an y state | Φ ( n + k ) ⟩ on H ⊗ n + k that is symmetric (i.e. inv ariant under 1 Alternatively , p ermutation inv ariance can also b e enforced by randomly p erm uting the subsystems. 1 p erm utations of the subsystems), there exists a probabilit y measure µ on the unit sphere B ( H ) := {| θ ⟩ ∈ H : ∥ θ ∥ = 1 } suc h that     tr k [ | Φ ( n + k ) ⟩ ⟨ Φ ( n + k ) | ] − Z | θ ⟩ ⟨ θ | ⊗ n µ (d θ )     1 ≤ 2 d 2 n n + k =: ε d ( n, k ) , (2) where ∥·∥ 1 is the trace norm, tr k [ · ] denotes the partial trace of k subsystems, and d = dim( H ). It is w orth men tioning that the results stated in Equations (1) and (2) are essentially tight [ 20 , 13 ]. The main obstacle with these de Finetti results (classical and quan tum) is that they require k to b e large. Sp eciﬁcally , if we w an t the error term ε d ( n, k ) to v anish in the limit n → ∞ , w e need k to b e sup erlinear in n , i.e. k = ω ( n ). 2 Therefore, w e need to ignore a large fraction of our data. This ma y be justiﬁed in certain scenarios where the num b er of data is by default h uge 3 , how ever, it can b e prohibitiv e in other settings. 4 A second, but often less drastic, disadv antage is that, by c hoosing k = p oly( n ), the error term ε d ( n, k ) is v anishing at a slow rate of order 1 / p oly ( n ). The insigh t of [ 33 , 34 ] w as that w e can o v ercome these limitations when relaxing the iid assumption. More precisely , the exp onential de Finetti the or em [ 33 , 34 ] shows that there exist a probabilit y measure ν on the unit sphere B ( H ) and a family {| Ψ ( n ) r,θ ⟩} θ of almost-iid states in | θ ⟩ with a defect of size r such that     tr k [ | Φ ( n + k ) ⟩ ⟨ Φ ( n + k ) | ] − Z | Ψ ( n ) r,θ ⟩ ⟨ Ψ ( n ) r,θ | ν (d θ )     1 ≤ 3 k d exp  − k ( r + 1) n + k  =: ε ′ d ( n, k ) . (3) The precise statement is given in Theorem 3.1 . Almost-iid states in | θ ⟩ are sup erp ositions of states that are equal to | θ ⟩ ⊗ n − r on n − r subsystems and arbitrary on the remaining r subsystems. Note that in each elemen t of the sup erp osition the p ositions of the defects ma y b e diﬀerent. Usually , the n um b er of defects is sublinear in n , i.e. r = o ( n ). The precise deﬁnition of almost-iid states is giv en in Section 2 . Equation (3) has a drastically b etter parameter scaling compared to Equation (2) . It allo ws us to c hoose k sublinear in n , i.e. k = o ( n ), and still ha v e an error term that v anishes in the limit n → ∞ — even at an exp onen tial rate. T o see this, note that, for example, by choosing k = n 3 4 and r = √ n , w e obtain ε ′ d ( n, k ) = 3 n 3 d 4 exp( − n 5 / 4 + n 3 / 4 n + n 3 / 4 ) = O (exp( − n 1 4 )). While the exp onential de Finetti theorem justiﬁes the imp ortance of almost-iid states, it is natural to ask if these states are also relev an t in a purely classical scenario. One may deﬁne almost-iid distributions as a conv ex combination of an iid distribution on n − r subsystems and an arbitrary distribution on the remaining r subsystems, where the p osition of the defects can b e diﬀeren t in eac h element of the con v ex combination. F or such distributions, w e show that no classical equiv alen t to Equation (3) exists. In other w ords, it is not p ossible to appro ximate a classical permutation- in v ariant distribution by a conv ex combination of almost-iid distributions for k sublinear in n . This is made precise in Section 3 . The lack of a classical exponential de Finetti theorem shows that almost-iid states are considerably more p ow erful than classical almost-iid distributions. This is due to the allow ed sup erp ositions which can contain long-range entanglemen t. F or that reason, almost-iid states are more complicated to ana- lyze. In particular, it is unclear if certain (robust) applications could b ehav e diﬀerently for almost-iid and iid states. How ever, it is exp ected that they should not b ehav e diﬀerently for practical purp oses. Quan tum tomography , which is used to infer the characteristics of a system, cannot distinguish b e- t w een almost-iid and iid states for practically feasible scenarios. W e refer to Prop osition 2.7 for a mathematically rigorous statemen t. Giv en the operational relev ance of almost-iid states (ensured by the exp onential de Finetti theorem), it is crucial to understand which applications b ehav e asymptotically equally under almost-iid and iid states. In this work, we introduce the deﬁnition of mixed almost-iid states and pro v e that for 2 Note that, by deﬁnition, f ( n ) = ω ( g ( n )) ⇐ ⇒ g ( n ) = o ( f ( n )). Hence, k = ω ( n ) ⇐ ⇒ n = o ( k ), or in other words, lim n →∞ n k = 0. 3 F or example, random samples of a coin toss, which in principle can be repeated arbitrarily many times. 4 F or example, in the setting of quan tum key distribution. 2 conditional en tropies, there is no asymptotic diﬀerence betw een almost-iid and iid states. F or n ∈ N and ρ A n 1 B n 1 a mixed almost-iid state in σ AB with a defect of size r = o ( n ), w e sho w that 1 n H ( A n 1 | B n 1 ) ρ = H ( A | B ) σ + o ( n ) n , (4) where H ( A | B ) σ := H ( AB ) σ − H ( B ) σ is the conditional en trop y and H ( B ) σ := − tr[ σ B log σ B ] for σ B = tr A [ σ AB ] is the von Neumann entrop y . The exact result is giv en in Theorem 4.1 . T o prov e our results, we use information-theoretic tools that ha ve been developed to go b eyond the traditionally assumed iid structure [ 33 , 17 , 36 ]. W e conclude in Section 5 with a discussion of whether p opular entanglemen t measures asymptot- ically coincide for almost-iid and iid states. W e prov e that this holds for the squashed entanglemen t (see Corollary 5.1 ); ho w ev er, it remains an op en question for other entanglemen t measures such as the distillable en tanglemen t, the en tanglemen t cost, and the relative en trop y of entanglemen t. 2 Almost-iid states In this section, we formally deﬁne almost-iid states and discuss some of their prop erties. Histori- cally [ 33 ], almost-iid states w ere introduced as pure and symmetric states with additional structure, as explained in the following. This family of states app ears, for example, in exp onential de Finetti theorems suc h as those presented in Equation (3) . Ho w ever, we will relax the deﬁnition from [ 33 ] to capture more general families of states, including mixed ones, that hav e a similar almost-iid structure. Let S n b e the set of p ermutations on { 1 , ..., n } and let S( H ) denote the set of density matrices on a Hilb ert space H whose dimension is denoted by d . The symmetric subspace is given by Sym n ( H ) := span {| ϕ ⟩ ⊗ n : | ϕ ⟩ ∈ H} . F or a ﬁxed | θ ⟩ , let V ( H ⊗ n , | θ ⟩ ⊗ m ) := { π ( | θ ⟩ ⊗ m ⊗| Ω ( n − m ) ⟩ ) : π ∈ S n , | Ω ( n − m ) ⟩ ∈ H ⊗ n − m } and Sym n ( H , | θ ⟩ ⊗ m ) := Sym n ( H ) ∩ span V ( H ⊗ n , | θ ⟩ ⊗ m ) . (5) Let n, r ∈ N such that r ≤ n and | θ ⟩ ∈ H . In [ 33 ], an  n r  - almost-iid state in | θ ⟩ was deﬁned as a pure state | Ψ ( n ) ⟩ ∈ Sym n ( H , | θ ⟩ ⊗ n − r ). Ho w ev er, for pure states outside the symmetric subspace or mixed states, this deﬁnition needs to b e relaxed in order to capture states that intuitiv ely ha v e an almost-iid structure suc h as those men tioned in Example 2.2 b elo w. W e therefore next in tro duce a no v el deﬁnition for mixed almost-iid states. Deﬁnition 2.1 (Almost-iid states) . Let H A b e a Hilbert space, σ A ∈ S( H A ), and n, r ∈ N suc h that r ≤ n . Then, ρ A n 1 ∈ S( H ⊗ n A ) is called a  n r  - almost-iid state in σ A if there exists a puriﬁcation | θ ⟩ AE of σ A and an extension ρ A n 1 E n 1 of ρ A n 1 suc h that (i) ρ A n 1 E n 1 is p erm utation-in v ariant with resp ect to ( A i , E i ) ↔ ( A j , E j ); (ii) supp( ρ A n 1 E n 1 ) ⊆ span V ( H ⊗ n AE , | θ ⟩ ⊗ n − r AE ). W e then write ρ A n 1 ∈ S n ( H A , σ ⊗ n − r A ). Here, supp( X ) denotes the supp ort of a linear operator X and is deﬁned as supp( X ) = k er( X ) ⊥ . W e next discuss a few p edagogical examples of mixed almost-iid states. 3 Example 2.2. A few simple instances of almost-iid states include: (a) T ensor p ow er states ρ A n 1 = σ ⊗ n A . These are almost-iid states with defect size r = 0, i.e. ρ A n 1 ∈ S n ( H A , σ ⊗ n A ). (b) Conv ex combinations of tensor p ow er states with a small n um ber of defects, i.e., states of the form ρ A n 1 = 1 n ! P π ∈S n π ( σ ⊗ n − r A ⊗ ω A r 1 ) π † , where ω A r 1 denotes an arbitrary densit y op erator of size r representing the defects. W e hav e ρ A n 1 ∈ S n ( H A , σ ⊗ n − r A ). T o see this, let | θ ⟩ AE and | ω ⟩ A r 1 E r 1 b e puriﬁcations of θ A and ω A r 1 , resp ectiv ely . Then the extension ρ A n 1 E n 1 = 1 n ! X π ∈S n π  | θ ⟩ ⟨ θ | ⊗ n − r AE ⊗ | ω ⟩ ⟨ ω | A r 1 E r 1  π † (6) of ρ A n 1 clearly satisﬁes prop erty ( i ) since it is permutation-in v ariant. It also satisﬁes the prop ert y ( ii ) as it can b e written as ρ A n 1 E n 1 = P i,j ∈T β i,j | Ψ i ⟩⟨ Ψ j | for β i,j = 1 n ! δ i,j and | Ψ i ⟩ ∈ V ( H ⊗ n AE , | θ ⟩ ⊗ n − r AE ) for all i ∈ T . (c) The an tisymmetric Bell state ρ A 2 1 = | Ψ − ⟩ ⟨ Ψ − | A 2 1 with | Ψ − ⟩ A 2 1 := 1 √ 2 ( | 0 ⟩| 1 ⟩ A 2 1 − | 1 ⟩| 0 ⟩ A 2 1 ) is a  2 1  -almost-iid state in | 0 ⟩ A . W e wan t to emphasize that the class of almost-iid states contains man y more families than the ones presented in Example 2.2 . In particular, it allows for sup erp ositions of pro duct states with a small n um ber of defects. These sup erp ositions are crucial for an exp onential de Finetti theorem as in Equation (3) to hold. On the other hand, classical almost-iid distributions ha v e a simpler structure as they do not hav e superp ositions and hence are just con v ex combinations of almost-pro duct distributions. This is discussed in Section 3 . Remark 2.3. A more restrictive deﬁnition of mixed almost-iid states was in tro duced in [ 9 ] by calling ρ A n 1 ∈ S( H ⊗ n A ) an  n r  - almost-iid state in σ A if there exists a puriﬁcation | Ψ ( n ) ⟩ A n 1 E n 1 suc h that | Ψ ( n ) ⟩ ∈ Sym n ( H A ⊗ H E , | θ ⟩ ⊗ n − r AE ) for some puriﬁcation | θ ⟩ AE of σ A . Such states clearly satisfy the assumptions of Deﬁnition 2.1 . Ho w ev er, Deﬁnition 2.1 is strictly broader as it includes states that do not meet the deﬁnition giv en in [ 9 ]. One such example is giv en b y Example 2.2 ( c ) as explained in App endix A . W e call ρ A n 1 a  n r  - gener alize d-almost-iid-state in σ A if it satisﬁes ( ii ) but not necessarily ( i ). W e then write ρ A n 1 ∈ ¯ S n ( H A , σ ⊗ n − r A ). T rivially , we hav e S n ( H A , σ ⊗ n − r A ) ⊆ ¯ S n ( H A , σ ⊗ n − r A ). With the follo wing trace-preserving completely p ositiv e map PERM : X 7→ 1 n ! X π ∈S n π X π † , (7) whic h symmetrizes the input, it is p ossible to conv ert a generalized-almost-iid-state into an almost-iid state in the sense that ρ A n 1 ∈ ¯ S n ( H A , σ ⊗ n − r A ) = ⇒ PERM( ρ A n 1 ) ∈ S n ( H A , σ ⊗ n − r A ) . (8) 4 Remark 2.4 (Properties of almost-iid states) . Let ρ A n 1 ∈ S n ( H A , σ ⊗ n − r A ) b e a mixed almost- iid state according to Deﬁnition 2.1 . Then it satisﬁes the following prop erties: (a) F or any puriﬁcation | θ ⟩ AE of σ A there exists an extension ρ A n 1 E n 1 of ρ A n 1 that satisﬁes ( i ) and ( ii ). See Lemma B.1 . (b) ρ A n 1 is p erm utation-in v ariant. (c) There exists an orthonormal basis {| Ψ t ⟩} t ∈T of span V ( H ⊗ n AE , | θ ⟩ ⊗ n − r ) with vectors | Ψ t ⟩ ∈ V ( H ⊗ n AE , | θ ⟩ ⊗ n − r ) and with |T | ≤  n r  d r AE ≤ 2 nh ( r/n ) d r AE , (9) where h ( x ) := − x log x − (1 − x ) log (1 − x ) for x ∈ [0 , 1] is the binary entrop y function. With resp ect to this basis, condition ( ii ) can b e rewritten as ρ A n 1 E n 1 = X i,j ∈T β i,j | Ψ i ⟩⟨ Ψ j | , (10) for co eﬃcien ts β i,j ∈ C with β i,i ∈ [0 , 1] and P i ∈T β i,i = 1. See Lemma B.2 . (d) ρ A n + m 1 ∈ S n + m ( H A , σ ⊗ n + m − r A ) implies ρ A n 1 ∈ S n ( H A , σ ⊗ n − r A ) for any m, n ∈ N . See Lemma B.3 . (e) ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) implies ρ A n 1 ∈ S n ( H A , σ ⊗ n − r A ). This follows directly from the deﬁnition of mixed almost-iid states. (f ) F or any ρ A n 1 ∈ ¯ S n ( H A , σ ⊗ n − r A ) we hav e for any s ∈ N that ( ρ A n 1 ) ⊗ s ∈ ¯ S ns ( H A , σ ⊗ ns − rs A ). See Lemma B.4 . (g) The set of almost-iid states is con v ex. This follo ws directly from the deﬁnition of mixed almost-iid states. In the analysis of almost-iid states, the represen tation in Equation (10) is particularly useful b ecause it exploits the pro duct structure of the space V ( H ⊗ n C E , | θ ⟩ ⊗ n − r C E ). How ev er, it is sometimes preferable to go to a puriﬁed description, i.e., to w ork with a puriﬁcation of Equation (10) . The following lemma establishes that the underlying structure of V ( H ⊗ n C E , | θ ⟩ ⊗ n − r C E ) p ersists even in this puriﬁed form ulation. Lemma 2.5 (Puriﬁcation in a preferred basis) . L et | Θ ⟩ AH ∈ H AH for some c omp osite Hilb ert sp ac e H AH , and let V A ⊆ H A b e any subset of ve ctors such that span V A admits an orthonormal b asis  | Ψ j ⟩ A  j ∈T with ve ctors | Ψ j ⟩ A ∈ V A ∀ j ∈ T . Then, the fol lowing ar e e quivalent: (i) supp  tr H  | Θ ⟩ ⟨ Θ | AH  ⊆ span V A (ii) | Θ ⟩ AH = P j ∈T α j | Ψ j ⟩ A | ˜ h j ⟩ H for c o eﬃcients α j ∈ C and normalize d ve ctors | ˜ h j ⟩ H ∈ H H for al l j ∈ T . In addition, if | Θ ⟩ AH is normalize d and satisﬁes ( ii ) , we have 1 = ⟨ Θ | Θ ⟩ AH = P j ∈T | α j | 2 . Pr o of. ( i ) = ⇒ ( ii ): By the Schmidt decomposition, there exist orthonormal sets of v ectors  | χ k ⟩ A  k ⊂ H A and  | ϕ k ⟩ H  k ⊂ H H , and scalars γ k ≥ 0 suc h that | Θ ⟩ AH = P k γ k | χ k ⟩ A | ϕ k ⟩ H . T aking the partial 5 trace, w e obtain tr H  | Θ ⟩ ⟨ Θ | AH  = X m X k,l γ k γ l | χ k ⟩⟨ χ l | A ⟨ ϕ m | ϕ k ⟩ H ⟨ ϕ l | ϕ m ⟩ H = X k γ 2 k | χ k ⟩ ⟨ χ k | A . (11) By assumption ( i ), it follo ws that | χ k ⟩ ∈ span V A for all k such that γ k  = 0. (If, b y contradiction, there w ere | χ l ⟩ / ∈ span V A for some l with γ l  = 0, then w e would hav e tr H  | Θ ⟩ ⟨ Θ | AH  | χ l ⟩ = γ 2 l | χ l ⟩  = 0 = ⇒ supp( . . . ) ⊆ span V A . ) Therefore, w e may write | χ k ⟩ A = P j ∈T c ( k ) j | Ψ j ⟩ A for some co eﬃcients c ( k ) j ∈ C ∀ j dep ending on k . With that, we ﬁnd | Θ ⟩ AH = X k γ k | χ k ⟩ A | ϕ k ⟩ H = X k γ k X j ∈T c ( k ) j | Ψ j ⟩ A | ϕ k ⟩ H = X j ∈T | Ψ j ⟩ A  X k c ( k ) j γ k | ϕ k ⟩ H  (12) = X j ∈T | Ψ j ⟩ A | h j ⟩ H = X j ∈T α j | Ψ j ⟩ A | ˜ h j ⟩ H , (13) where we hav e introduced co eﬃcien ts α j ≥ 0 to normalize the vector | h j ⟩ H := P k c ( k ) j γ k | ϕ k ⟩ H (with the con v en tion that α j = 0 and | ˜ h j ⟩ H are normalized but arbitrary if | h j ⟩ H = 0). ( ii ) = ⇒ ( i ): F or any orthonormal basis  | ϕ m ⟩ H  k of H H , w e simply ev aluate tr H  | Θ ⟩ ⟨ Θ | AH  = X i,j ∈T α i α ∗ j X m ⟨ ϕ m | ˜ h i ⟩ H ⟨ ˜ h j | ϕ m ⟩ H | Ψ i ⟩⟨ Ψ j | A , (14) whic h implies supp  tr H  | Θ ⟩ ⟨ Θ | AH  ⊆ span V A since | Ψ j ⟩ A ∈ V A ∀ j ∈ T . In addition, if | Θ ⟩ AH is normalized, w e ﬁnd 1 = ⟨ Θ | Θ ⟩ AH = P j ∈T | α j | 2 since the v ectors  | Ψ j ⟩ A  j ∈T are orthonormal and the v ectors | ˜ h j ⟩ H are normalized for eac h j ∈ T . ■ An imp ortant prop erty of almost-iid states is that when tracing out man y subsystems, we obtain a state that is close to an iid state. Prop osition 2.6. L et n, r , s ∈ N such that r, s ≤ n , σ A ∈ S( H A ) , and ρ A n 1 ∈ S n ( H A , σ ⊗ n − r A ) . Then   tr n − s [ ρ A n 1 ] − σ ⊗ s A   1 ≤ 4 r r s n . (15) Pr o of. Let ˜ n := ⌊ n s ⌋ s ≤ n and deﬁne ρ A ˜ n 1 := tr n − ˜ n [ ρ A n 1 ]. Let | θ ⟩ AE b e a puriﬁcation of σ A . By assumption ρ A n 1 ∈ S n ( H A , σ ⊗ n − r A ) which according to Remark 2.4 implies ρ A ˜ n 1 ∈ S ˜ n ( H A , σ ⊗ ˜ n − r A ). Hence, due to the deﬁnition of this vector space, there exists a family {| Ψ t ⟩} t ∈T of orthonormal v ectors from V ( H ⊗ ˜ n AE , | θ ⟩ ⊗ ˜ n − r AE ) suc h that ρ A ˜ n 1 E ˜ n 1 = X t,t ′ ∈T β t,t ′ | Ψ t ⟩⟨ Ψ t ′ | , (16) with P t ∈T β t,t = 1, where ρ A ˜ n 1 E ˜ n 1 is an extension of ρ A ˜ n 1 . Now, we may in terpret | Ψ t ⟩ as a state on the ⌊ n s ⌋ blo cks. Since | Ψ t ⟩ ∈ V ( H ⊗ ˜ n AE , | θ ⟩ ⊗ ˜ n − r AE ), we know that | Ψ t ⟩ is at least in ⌊ n s ⌋ − r blo cks of the form | θ ⟩ ⊗ s and in the remaining r blocks arbitrary . F or any t ∈ T , let F t denote the set of block indices i ∈ { 1 , . . . , ⌊ n s ⌋} on whic h | Ψ t ⟩ is not of the form | θ ⟩ ⊗ s and note that |F t | ≤ r . (17) 6 Then, for an y blo ck sp eciﬁed by i , the total weigh t of the vectors | Ψ t ⟩ that deviate from | θ ⟩ ⊗ s for that blo c k is giv en b y w i := X t ∈T β t,t δ i ∈F t . (18) Summing o v er all blo c ks yields ⌊ n s ⌋ X i =1 w i Eq. (18) = ⌊ n s ⌋ X i =1 X t ∈T β t,t δ i ∈F t = X t ∈T β t,t ⌊ n s ⌋ X i =1 δ i ∈F t = X t ∈T β t,t |F t | Eq. (17) ≤ X t ∈T β t,t r = r , (19) where the ﬁnal step uses P t ∈T β t,t = 1. Equation (19) implies that w j ≤ r ⌊ n s ⌋ for some j ∈ n 1 , . . . , j n s ko . (20) F or a ﬁxed j ∈  1 , . . . ,  n s  , let Π b e the pro jector onto the subspace spanned by {| Ψ t ⟩ : j ∈ F t } , i.e. Π = X t ∈T s.t. j ∈F t | Ψ t ⟩ ⟨ Ψ t | and Π ⊥ = X t ∈T s.t. j ∈F t | Ψ t ⟩ ⟨ Ψ t | . (21) Note that Π Π ⊥ = 0 and Π + Π ⊥ = id span V ( H ⊗ ˜ n AE , | θ ⟩ ⊗ ˜ n − r AE ) . The op erator ρ ′ A ˜ n 1 E ˜ n 1 := Π ρ A ˜ n 1 E ˜ n 1 Π † is subnormalized, i.e., tr[ ρ ′ ] = tr[Π ρ Π † ] ≤ 1 , (22) whic h can b e seen, for example, via H¨ older’s inequality [ 35 , Prop osition 2.5]. The ﬁdelity b etw een tw o densit y op erators is deﬁned as F ( ρ, σ ) :=   √ ρ √ σ   2 1 . Hence, for any ω ≥ 0 we hav e F ( ω , ρ ′ ) =    √ ω p ρ ′    2 1 = tr[ ρ ′ ]      √ ω s ρ ′ tr[ ρ ′ ]      2 1 Eq. (22) ≤      √ ω s ρ ′ tr[ ρ ′ ]      2 1 = F  ω , ρ ′ tr[ ρ ′ ]  . (23) Using the p erm utation in v ariance of ρ A ˜ n 1 E ˜ n 1 w e obtain F ( ρ A s 1 E s 1 , | θ ⟩ ⟨ θ | ⊗ s ) Eq. (23) ≥ F ( ρ A j s ( j − 1) s +1 E j s ( j − 1) s +1 , ρ ′ A j s ( j − 1) s +1 E j s ( j − 1) s +1 ) DPI ≥ F ( ρ A ˜ n 1 E ˜ n 1 , ρ ′ A ˜ n 1 E ˜ n 1 ) , (24) where the ﬁnal step uses the data-pro cessing inequality for the ﬁdelity [ 21 , Lemma B.4]. By deﬁnition of the ﬁdelit y w e ha v e q F ( ρ A ˜ n 1 E ˜ n 1 , ρ ′ A ˜ n 1 E ˜ n 1 ) =    √ ρ p ρ ′    1 = tr h q √ ρρ ′ √ ρ i = tr h q √ ρ Π ρ Π † √ ρ i = tr[Π ρ ] . (25) By deﬁnition of Π w e ha v e tr[Π ρ ] = 1 − tr[Π ⊥ ρ ] Eq. (21) = 1 − X t ∈T ,j ∈F t ⟨ Ψ t | ρ | Ψ t ⟩ Eq. (16) = 1 − X t ∈T ,j ∈F t β t,t Eq. (18) = 1 − w j . (26) The F uchs-v an der Graaf inequality [ 22 ] then gives   ρ A s 1 E s 1 − | θ ⟩ ⟨ θ | ⊗ s   1 ≤ 2 q 1 − F ( ρ A s 1 E s 1 , | θ ⟩ ⟨ θ | ⊗ s ) (27) Eqs. (24) to (26) ≤ 2 q 1 − (1 − w j ) 2 (28) 7 Eq. (20) ≤ 2 √ 2 r r ⌊ n s ⌋ . (29) Recalling that the trace distance is con tractive under the partial trace [ 43 , Theorem 8.16] implies   ρ A s 1 − σ ⊗ s A   1 ≤   ρ A s 1 E s 1 − | θ ⟩ ⟨ θ | ⊗ s   1 Eq. (29) ≤ 2 √ 2 r r ⌊ n s ⌋ . (30) T o conclude the pro of of the assertion, note that for q = ⌊ n s ⌋ w e ha v e sq = s j n s k ≤ n ≤ s ( q + 1) . (31) Hence p r /q p r s/n = r n q s Eq. (31) ≤ s s ( q + 1) q s = r 1 + 1 q ≤ √ 2 , (32) where the ﬁnal step uses q ≥ 1. Putting everything together yields   ρ A s 1 − σ ⊗ s A   1 Eq. (30) ≤ 2 √ 2 r r ⌊ n s ⌋ Eq. (32) ≤ 4 r r s n . (33) ■ Another prop erty concerns the statistics of almost-iid and iid states when performing an iid mea- suremen t (as is used in a tomography pro cedure, for example) [ 33 , Theorem 4.5.2]. Let σ ∈ S( H ) and assume that n indep endent measuremen ts with resp ect to a POVM M = { M x } x ∈X are p erformed on n iid copies of σ , giving the outcomes x = ( x 1 , . . . , x n ) with x i ∈ X ∀ i . The outcomes x can b e c haracterized b y a fr e quency distribution (or typ e ) λ x , which is deﬁned as the follo wing probabilit y distribution on X , λ x ( y ) := 1 n   { i : x i = y }   , (34) for all y ∈ X . In tuitiv ely , λ x ( y ) corresp onds to the relativ e num ber of o ccurrences of y in the sequence of outcomes x = ( x 1 , . . . , x n ). F or large v alues of n , it follows by the la w of large num bers that the frequency distribution λ x is close to the probability distribution P X deﬁned b y P X ( y ) := tr( M y σ ). Prop osition 2.7 sho ws that a similar statemen t holds when performing n indep endent measuremen ts on a  n r  -almost-iid state in σ for small enough v alues of r . This prov es that almost-iid and iid states cannot be distinguished b y any practically feasible measurement. This is discussed in more detail in [ 29 ]. Prop osition 2.7 (Statistics of almost-iid states) . L et n, r ∈ N such that r ≤ 1 2 n , σ ∈ S( H ) , and ρ ( n ) ∈ S n ( H , σ ⊗ n − r ) . L et M = { M x } x ∈X b e a POVM on H , and P X ( x ) = tr[ M x σ ] for al l x ∈ X . Then, for any ε > 0 , we have P x  ∥ λ x − P X ∥ 1 > f ( ε, r, n )  ≤ ε , (35) wher e the pr ob ability is taken over the outc omes x = ( x 1 , . . . , x n ) of the pr o duct me asur ement M ⊗ n applie d to ρ ( n ) , and f ( ε, r, n ) = 2 s log  1 ε  n + |X | n log  n 2 + 1  + h  r n  + 2 r n log( d ) + 2 r n , (36) wher e h ( · ) is the binary entr opy and d = dim H . F urthermor e, if r = o ( n ) , then lim n →∞ f ( ε, r, n ) = 0 . 8 The pro of is giv en in App endix C . 3 Quan tum vs. classical exp onen tial de Finetti theorem In this section, we formally state the quantum exp onential de Finetti theorem [ 34 ] whic h justiﬁes the deﬁnition of almost-iid states. W e then sho w that for classical almost-iid distributions, deﬁned as con v ex combinations of pro duct distributions with a small num ber of defects, no classical exp onen tial de Finetti theorem can hold. Theorem 3.1 (Exp onen tial de Finetti [ 34 , Theorem 1]) . L et n, k , r ∈ N and let H b e a d - dimensional Hilb ert sp ac e. F or any | Φ ( n + k ) ⟩ ∈ Sym n + k ( H ) ther e exists a pr ob ability me asur e ν on the unit spher e B ( H ) and a family {| Ψ ( n ) r,θ ⟩} θ of states such that | Ψ ( n ) r,θ ⟩ ∈ Sym n ( H , | θ ⟩ ⊗ n − r ) and     tr k [ | Φ ( n + k ) ⟩ ⟨ Φ ( n + k ) | ] − Z | Ψ ( n ) r,θ ⟩ ⟨ Ψ ( n ) r,θ | ν (d θ )     1 ≤ 3 k d e − k ( r +1) n + k . (37) W e next deﬁne almost-iid distributions which is the classical counterpart to almost-iid states. F or a ﬁnite alphab et X , let Sym( X n ) denote the set of p ermutation-in v ariant distributions on X n . F urthermore, for a distribution q on X let V ( X n , q m ) := { π ( q m × R X n − m 1 ) : π ∈ S n , R X n − m 1 distribution on X n − m } (38) and Sym( X n , q m ) := Sym( X n ) ∩ conv V ( X n , q m ) . (39) Note that the set Sym( X n , q m ) is considerably simpler compared to Sym n ( H , | θ ⟩ ⊗ m ) since the former set consists of conv ex com binations of pro duct distributions, whereas the latter set con tains sup erp o- sitions of pro duct states. The follo wing prop osition states that no classical v ersion of the quan tum de Finetti theorem can exist where almost-iid states are replaced with almost-iid distributions. Prop osition 3.1 (No classical exponential de Finetti result) . L et n ∈ N , k = o ( n ) , r = o ( n ) , X b e a ﬁnite alphab et of size d . Ther e exists P X n + k 1 ∈ Sym( X n + k ) such that it is not p ossible to ap- pr oximate P X n 1 with a pr ob abilistic mixtur e of distributions { Q ( q ) X n 1 } q with Q ( q ) X n 1 ∈ Sym( X n , q n − r ) in the ℓ 1 -norm up to ε d ( n ) such that lim n →∞ ε d ( n ) = 0 . The proof of Prop osition 3.1 is given in App endix D . As men tioned in Equation (1) , if w e are willing to c ho ose k large, more precisely k = ω ( n ), then a classical de Finetti theorem holds. Ho w- ev er, Proposition 3.1 states that this is no longer the case for k = o ( n ), ev en if the error term would decrease at a non-exp onential rate. F urther note that Prop osition 3.1 do es not prohibit the existence of a classical exp onen tial de Finetti theorem if the deﬁnition of almost-iid distributions, Equation (39) , is relaxed. F or example, one could deﬁne the set V ( X n , q m ) in Equation (38) by only requiring the marginals of the distributions to yield an iid state instead of imp osing a pro duct structure b etw een the iid part and the defects. How ev er, suc h a structure w ould no longer be cov ered by the quan tum v ersion in Deﬁnition 2.1 and therefore, ma y p oten tially b e diﬃcult to w ork with. One may still wonder what Theorem 3.1 pro duces if it is applied to a classical symmetric distribu- tion. Can the resulting family of almost-iid states that appro ximates the classical distribution b ecome classical as w ell? T o analyze this, let P X n + k 1 ∈ Sym( X n + k ) b e a symmetric classical distribution. W e 9 can apply Theorem 3.1 to P X n + k 1 b y em b edding it in to a p erm utation-in v ariant density matrix, ρ A n + k 1 = X x n + k 1 P X n + k 1 ( x n + k 1 ) | x 1 ⟩ ⟨ x 1 | · · · | x n + k ⟩ ⟨ x n + k | . (40) Let | Φ ( n + k ) ⟩ A n + k 1 E n + k 1 b e a symmetric puriﬁcation of ρ A n + k 1 . Consider the measurement channel on the A -system M : Y AE 7→ X x | x ⟩ ⟨ x | A Y AE | x ⟩ ⟨ x | A . (41) Because ρ A n 1 is classical, w e ha v e ρ A n 1 = M ⊗ n ( ρ A n 1 ) = M ⊗ n  tr E n 1 [ ρ A n 1 E n 1 ]  = tr E n 1  M ⊗ n ( ρ A n 1 E n 1 )  (42) = tr E n 1  M ⊗ n (tr k [ | Φ ( n + k ) ⟩ ⟨ Φ ( n + k ) | ])  , (43) where ρ A n 1 E n 1 := tr k [ | Φ ( n + k ) ⟩ ⟨ Φ ( n + k ) | ], and the penultimate step uses that M only acts on the A -system and hence commutes with the partial trace ov er the E -system. Theorem 3.1 sho ws that there exist a probabilit y measure ν on the unit sphere B ( H ) and, for each | θ ⟩ ∈ B ( H ), a family {| Ψ ( n ) r,θ ⟩} θ of states such that | Ψ ( n ) r,θ ⟩ ∈ Sym n ( H , | θ ⟩ ⊗ n − r ), and such that for σ ( n ) θ := tr E n 1 [ M ⊗ n ( | Ψ ( n ) r,θ ⟩ ⟨ Ψ ( n ) r,θ | )] and ¯ ν denoting the induced measure obtained b y taking the partial trace, we ha v e     ρ A n 1 − Z σ ( n ) θ ¯ ν (d θ )     1 Eq. (43) =     tr E n 1  M ⊗ n (tr k [ | Φ ( n + k ) ⟩ ⟨ Φ ( n + k ) | ])  − tr E n 1 h M ⊗ n  Z | Ψ ( n ) r,θ ⟩ ⟨ Ψ ( n ) r,θ | ν (d θ ) i     1 ≤     tr k [ | Φ ( n + k ) ⟩ ⟨ Φ ( n + k ) | ]) − Z | Ψ ( n ) r,θ ⟩ ⟨ Ψ ( n ) r,θ | ν (d θ )     1 (44) Theorem 3.1 ≤ 3 k d e − k ( r +1) n + k , (45) where the second step uses the fact that the trace distance is con tractiv e under trace-preserving completely p ositive maps [ 43 , Theorem 8.16]. Prop osition 3.1 no w implies that the density op erators σ ( n ) θ cannot be given almost-iid distributions in the sense of Equation (39) . Intuitiv ely , this happ ens since the states | θ ⟩ appearing in Equation (44) are not necessarily classical and therefore, the almost-iid structure is not preserv ed after the measurement. 4 Conditional en trop y of almost-iid states In this section, we prov e that the conditional en trop y of almost-iid states asymptotically coincides with the conditional en tropy of iid states. In the pro of, w e develop tec hnical to ols that may be of indep enden t in terest. F or ρ AB ∈ S( H AB ) the conditional en trop y is deﬁned as H ( A | B ) ρ = H ( AB ) ρ − H ( B ) ρ , where H ( B ) ρ := − tr[ ρ B log ρ B ] is the von Neumann entrop y . It is straightforw ard to see that the conditional en trop y is additiv e for iid states, i.e., 1 n H ( A n 1 | B n 1 ) ρ ⊗ n = H ( A | B ) ρ . (46) W e next show that this prop erty is preserved for almost-iid states in the limit n → ∞ . Theorem 4.1. L et σ AB ∈ S( H AB ) and ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) for r = o ( n ) . Then 1 n H ( A n 1 | B n 1 ) ρ = H ( A | B ) σ + o ( n ) n . (47) 10 The main diﬃculty in proving the assertion of Theorem 4.1 is the fact that almost-iid states are more general than just conv ex mixtures of pro duct states where eac h element in the conv ex sum has a certain num ber of defects (see Example 2.2 ). A look at Deﬁnition 2.1 reveals that almost-iid states may con tain sup erp ositions whic h store long range correlations and entanglemen t. Dealing with these sup erp ositions is the main technical challenge in the pro of. W e do this utilizing tools from one- shot information theory such as R´ en yi and smo oth entropies. W e also wan t to emphasize that the sup erp ositions in the deﬁnition of almost-iid states are crucial for making the exp onential de Finetti theorem ( Theorem 3.1 ) p ossible. As shown in Prop osition 3.1 , no exp onential de Finetti theorem can exist without sup erp ositions. Before presenting the proof of Theorem 4.1 , whic h is given in Section 4.4 , w e need to deﬁne some en tropic quantities. W e note that an alternative proof using diﬀeren t techniques that ma y therefore b e of indep enden t in terest is giv en in App endix E . 4.1 Entropic quantities F or α ∈ [1 / 2 , 1) ∪ (1 , ∞ ) the sandwiche d R ´ enyi diver genc e [ 31 , 41 ] is given by D α ( ρ ∥ σ ) := 1 α − 1 log tr  ( σ 1 − α 2 α ρ σ 1 − α 2 α ) α  . (48) F or α = 1 2 w e hav e D 1 2 ( ρ ∥ σ ) = − log F ( ρ, σ ). In the limits α → 1 and α → ∞ the sandwiched R´ enyi div ergence con v erges to the relativ e entrop y D ( ρ ∥ σ ) and the max-relative entrop y [ 33 , 16 ] D max ( ρ ∥ σ ) := inf { λ ∈ R : ρ ≤ 2 λ σ } , (49) resp ectiv ely . F or ρ AB ∈ S( H AB ), the R´ en yi div ergence can b e used to deﬁne a conditional R ´ enyi en trop y [ 36 ] H α ( A | B ) ρ := − min σ B ∈ S( H B ) D α ( ρ AB ∥ id A ⊗ σ B ) , (50) whic h conv erges to the conditional entrop y H ( A | B ) ρ for α → 1 and to the conditional min-en trop y H min ( A | B ) ρ for α → ∞ . F or α = 1 / 2 w e obtain the conditional max-en trop y H max ( A | B ) ρ . The trace distance betw een t w o states ρ, σ ∈ S( H ) is giv en b y ∆( ρ, σ ) := 1 2 ∥ ρ − σ ∥ 1 and the puriﬁed distance [ 36 ] is deﬁned as P ( ρ, σ ) := p 1 − F ( ρ, σ ). The F uchs-v an der Graaf inequality [ 22 ] implies P ( ρ, σ ) ≥ ∆( ρ, σ ). F or ρ ∈ S( H ) and ε ∈ (0 , 1) deﬁne the ε -ball around ρ b y B ε ( ρ ) := { ρ ′ ∈ S( H ) : P ( ρ, ρ ′ ) ≤ ε } . W e then deﬁne a smo oth v arian t of the min- and max-entrop y by H ε min ( A | B ) ρ := max ρ ′ ∈B ε ( ρ ) H min ( A | B ) ρ ′ and H ε max ( A | B ) ρ := min ρ ′ ∈B ε ( ρ ) H max ( A | B ) ρ ′ . (51) 4.2 Asymptotic equipartition prop erty for almost-iid states Another statement which can b e pro v en using similar techniques and ma y b e of indep endent interest is a strong asymptotic equipartition prop ert y (AEP) for almost-iid states. T o understand this, let us recall the AEP for iid states [ 37 , 36 ]. This fundamental result ensures that for any densit y operator ρ AB ∈ S( H AB ) and an y ε ∈ (0 , 1) we hav e 1 n H ε min ( A n 1 | B n 1 ) ρ ⊗ n = H ( A | B ) ρ + o ( n ) n and 1 n H ε max ( A n 1 | B n 1 ) ρ ⊗ n = H ( A | B ) ρ + o ( n ) n . (52) Note that Equation (52) is called a str ong AEP as the error term o ( n ) n v anishes in the limit n → ∞ for an y ﬁxed ε ∈ (0 , 1). F urthermore, it is understo o d how fast the error term v anishes for ﬁnite v alues of n [ 38 ]. W e sho w that Equation (52) remains v alid when replacing iid states with almost-iid states. 11 Prop osition 4.2 (Strong AEP for almost-iid states) . L et ε ∈ (0 , 1) , σ AB ∈ S( H AB ) , and ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) for r = o ( n ) . Then 1 n H ε min ( A n 1 | B n 1 ) ρ = H ( A | B ) σ + o ( n ) n and 1 n H ε max ( A n 1 | B n 1 ) ρ = H ( A | B ) σ + o ( n ) n . (53) In [ 33 , Theorem 4.4.1] it was shown that the conditional smo oth min-entrop y for almost-iid states, whic h are classical on one subsystem, asymptotically coincides with the conditional entrop y of iid states. This w as crucial to pro v e securit y of quantum key distribution via a de Finetti argumen t. Using the duality of conditional entrop y , the result can b e lifted to the smo oth max-entrop y of almost- iid states [ 44 ]. In addition, for the case of pure almost-iid states, a similar result has b een pro v en based on [ 33 ] in [ 44 , Lemma 11]. Here, the presented pro of of Prop osition 4.2 is more general as it applies for mixed almost-iid states and is also more modular allowing one to distill other results such as Theorem 4.1 . Beyond these results, to the b est of our knowledge, little is kno wn about how en tropic functions b eha v e for almost-iid states. 4.3 Pro of of Proposition 4.2 Let ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ). By deﬁnition, there exist a puriﬁcation | θ ⟩ AB E of σ AB and an extension ρ A n 1 B n 1 E n 1 of ρ A n 1 B n 1 that can b e written as ρ A n 1 B n 1 E n 1 = X t,t ′ ∈T β t,t ′ | Ψ t ⟩⟨ Ψ t ′ | A n 1 B n 1 E n 1 , (54) for a family {| Ψ t ⟩ A n 1 B n 1 E n 1 } t ∈T of orthonormal vectors from V ( H ⊗ n AB E , | θ ⟩ ⊗ n − r AB E ) with β t,t ′ ∈ C satisfying P t ∈T β t,t = 1 and log |T | ≤ n h  r n  + r log d AB E . (55) Let ˜ ρ A n 1 B n 1 E n 1 T := X t ∈T β t,t | Ψ t ⟩ ⟨ Ψ t | A n 1 B n 1 E n 1 | {z } =: ˜ ρ ( t ) ⊗| t ⟩ ⟨ t | T . (56) Lemma 4.3. F or the setting deﬁne d ab ove, we have ρ A n 1 B n 1 E n 1 ≤ |T | ˜ ρ A n 1 B n 1 E n 1 , and hence ρ A n 1 B n 1 ≤ |T | ˜ ρ A n 1 B n 1 . (57) Pr o of. The pro of idea is similar to [ 33 , Proof of Lemma 3.1.13] but is based on pinc hing maps and therefore w orks for a more general setup. Consider the pinching map P : Z 7→ X t ∈T | Ψ t ⟩ ⟨ Ψ t | Z | Ψ t ⟩ ⟨ Ψ t | . (58) Using the fact that {| Ψ t ⟩} t ∈T is orthonormal, w e ha v e P ( ρ A n 1 B n 1 E n 1 ) Eq. (54) = X t,k,ℓ ∈T β k,ℓ | Ψ t ⟩ ⟨ Ψ t || Ψ k ⟩⟨ Ψ ℓ || Ψ t ⟩ ⟨ Ψ t | = X t ∈T β t,t | Ψ t ⟩ ⟨ Ψ t | Eq. (56) = ˜ ρ A n 1 B n 1 E n 1 . (59) Hence, w e ﬁnd ˜ ρ A n 1 B n 1 E n 1 Eq. (59) = P ( ρ A n 1 B n 1 E n 1 ) pinching inequality ≥ 1 |T | ρ A n 1 B n 1 E n 1 . (60) 12 The interested reader can ﬁnd more information on pinching maps, including a proof of the pinching inequalit y in [ 35 , Lemma 3.5]. Since the partial trace is a completely positive map Equation (60) implies ˜ ρ A n 1 B n 1 ≥ 1 |T | ρ A n 1 B n 1 . (61) ■ Lemma 4.4. L et n, r ∈ N such that r ≤ n , σ AB ∈ S( H AB ) , ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) , d AB = dim H AB , and d A = dim H A . Then 1 n H α ( A n 1 | B n 1 ) ρ ≥ H α ( A | B ) σ − 2 r n log d A − α α − 1  h  r n  + 2 r n log d AB  ∀ α > 1 (62) and 1 n H α ( A n 1 | B n 1 ) ρ ≤ H α ( A | B ) σ + 2 r n log d A + 1 1 − α  h  r n  + 2 r n log d AB  ∀ α ∈ [ 1 2 , 1) . (63) Pr o of. W e start by proving Equation (62) . F or α > 1 and ρ , ˜ ρ deﬁned ab ov e, we hav e α 1 − α log |T | + H α ( A n 1 | B n 1 ) ˜ ρ Eqs. (48) and (50) = max σ ∈ S( H ⊗ n B ) 1 1 − α log tr  ( σ 1 − α 2 α ˜ ρ A n 1 B n 1 |T | σ 1 − α 2 α ) α  (64) Lem. 4.3 ≤ max σ ∈ S( H ⊗ n B ) 1 1 − α log tr  ( σ 1 − α 2 α ρ A n 1 B n 1 σ 1 − α 2 α ) α  (65) Eqs. (48) and (50) = H α ( A n 1 | B n 1 ) ρ , (66) where the inequality step used that the function X 7→ tr[ X α ] is monotone [ 11 , Theorem 2.10]. F ur- thermore, w e ha v e H α ( A n 1 | B n 1 ) ˜ ρ [ 36 , Eq. 5.41] ≥ H α ( A n 1 | B n 1 T ) ˜ ρ (67) [ 36 , Prop. 5.4] = α 1 − α log X t ∈T β t,t exp  1 − α α H α ( A n 1 | B n 1 ) ˜ ρ ( t )  ! (68) ≥ min t ∈T H α ( A n 1 | B n 1 ) ˜ ρ ( t ) , (69) where the ﬁnal step uses that the logarithm is a quasi-linear function and that β t,t ∈ [0 , 1] with P t ∈T β t,t = 1. Recalling that ˜ ρ ( t ) = | Ψ t ⟩ ⟨ Ψ t | for | Ψ t ⟩ ∈ V ( H ⊗ n AB E , | θ ⟩ ⊗ n − r AB E ) and using the additivity of the R ´ enyi entropies under tensor pro ducts allows us to write for any t ∈ T H α ( A n 1 | B n 1 ) ˜ ρ ( t ) = ( n − r ) H α ( A | B ) σ + H α ( A r 1 | B r 1 ) Ω [ 36 , Lem. 5.11] ≥ ( n − r ) H α ( A | B ) σ − r log d A . (70) Putting ev erything together yields 1 n H α ( A n 1 | B n 1 ) ρ ≥ n − r n H α ( A | B ) σ − r n log d A − 1 n α α − 1 log |T | (71) [ 36 , Lem. 5.11]& Eq. (55) ≥ H α ( A | B ) σ − 2 r n log d A − α α − 1  h  r n  + 2 r n log d AB  , (72) where in the ﬁnal step, w e used d AB E = d 2 AB . This prov es Equation (62) . 13 The statemen t from Equation (63) follo ws similarly . F or α ∈ [1 / 2 , 1) we hav e α 1 − α log |T | + H α ( A n 1 | B n 1 ) ˜ ρ Eqs. (48) and (50) = max σ ∈ S( H ⊗ n B ) 1 1 − α log tr  ( σ 1 − α 2 α ˜ ρ A n 1 B n 1 |T | σ 1 − α 2 α ) α  (73) Lem. 4.3 ≥ max σ ∈ S( H ⊗ n B ) 1 1 − α log tr  ( σ 1 − α 2 α ρ A n 1 B n 1 σ 1 − α 2 α ) α  (74) Eqs. (48) and (50) = H α ( A n 1 | B n 1 ) ρ , (75) where the inequalit y step used that the function X 7→ tr[ X α ] is monotone [ 11 , Theorem 2.10]. In addition, w e ha v e H α ( A n 1 | B n 1 ) ˜ ρ [ 36 , Eq. 5.96] ≤ H α ( A n 1 | B n 1 T ) ˜ ρ + log |T | (76) [ 36 , Prop. 5.4] = α 1 − α log X t ∈T β t,t exp  1 − α α H α ( A n 1 | B n 1 ) ˜ ρ ( t )  ! + log |T | (77) ≤ max t ∈T H α ( A n 1 | B n 1 ) ˜ ρ ( t ) + log |T | , (78) where the ﬁnal step uses that the logarithm is a quasi-linear function and that β t,t ∈ [0 , 1] with P t ∈T β t,t = 1. Since ˜ ρ ( t ) = | Ψ t ⟩ ⟨ Ψ t | for | Ψ t ⟩ ∈ V ( H ⊗ n AB E , | θ ⟩ ⊗ n − r AB E ), the additivit y of the R ´ enyi en tropies under tensor pro ducts implies for an y t ∈ T H α ( A n 1 | B n 1 ) ˜ ρ ( t ) = ( n − r ) H α ( A | B ) σ + H α ( A r 1 | B r 1 ) Ω [ 36 , Lem. 5.11] ≤ ( n − r ) H α ( A | B ) σ + r log d A . (79) Com bining Equations (75) , (78) and (79) yields 1 n H α ( A n 1 | B n 1 ) ρ ≤ n − r n H α ( A | B ) σ + r n log d A + 1 1 − α 1 n log |T | (80) [ 36 , Lem. 5.11]& Eq. (55) ≤ H α ( A | B ) σ + 2 r n log d A + 1 1 − α  h  r n  + 2 r n log d AB  , (81) where in the ﬁnal step w e used that d E = d AB . ■ W e are now equipp ed with all the tools we need to prov e the assertion of Prop osition 4.2 . This will b e done in four steps, by provin g tw o inequalities (direct and conv erse part) for b oth the smo oth min- and max-en trop y . (i) Direct part for smo oth min-entrop y: F or α > 1 consider the error term δ ε ( n, r , α ) := 2 r n log d A + α α − 1  h  r n  + 2 r n log d AB  + 1 n 1 α − 1 log 1 ε 2 + 1 n log 1 1 − ε 2 . (82) W e can write 1 n H ε min ( A n 1 | B n 1 ) ρ [ 10 , Prop. 2.2] ≥ 1 n H α ( A n 1 | B n 1 ) ρ − 1 n 1 α − 1 log 1 ε 2 − 1 n log 1 1 − ε 2 (83) Lem. 4.4 ≥ H α ( A | B ) σ − δ ε ( n, r , α ) (84) [ 37 , Lem. 8] ≥ H ( A | B ) σ − δ ε ( n, r , α ) − 4( α − 1)(log η ) 2 , (85) for a constant η = √ 2 − H min ( A | B ) σ + √ 2 H max ( A | B ) σ + 1. F or a choice α = 1 + 1 / log ( r /n ) and recalling that r = o ( n ) w e see that δ ε ( n, r , α ) = o ( n ) n , (86) 14 where we used that lim x → 0 (log( x ) + 1) h ( x ) = 0 and lim x → 0 (log( x ) + 1) x = 0 . Hence, we obtain 1 n H ε min ( A n 1 | B n 1 ) ρ ≥ H ( A | B ) σ − o ( n ) n . (87) (ii) Direct part for smo oth max-entrop y: F or α ∈ [1 / 2 , 1) consider the error term δ ′ ε ( n, r , α ) := 2 r n log d A + 1 1 − α  h  r n  + 2 r n log d AB  + 1 n α 1 − α log 1 ε . (88) W e can write 1 n H ε max ( A n 1 | B n 1 ) ρ [ 10 , Prop. 2.2] ≤ 1 n H α ( A n 1 | B n 1 ) ρ + 1 n α 1 − α log 1 ε (89) Lem. 4.4 ≤ H α ( A | B ) σ + δ ′ ε ( n, r , α ) (90) [ 37 , Lem. 8] & [ 36 , Prop. 5.7] ≤ H ( A | B ) σ + δ ′ ε ( n, r , α ) + 4(1 − α )(log η ) 2 . (91) Similarly as ab o v e, c ho osing α = 1 − 1 / log ( r /n ) yields δ ′ ε ( n, r , α ) = o ( n ) n and hence 1 n H ε max ( A n 1 | B n 1 ) ρ ≤ H ( A | B ) σ + o ( n ) n . (92) (iii) Conv erse part for smo oth min-entrop y: F or a ﬁxed ε ∈ (0 , 1) consider an arbitrary ε ′ ∈ (0 , 1 − ε ). Then 1 n H ε min ( A n 1 | B n 1 ) ρ [ 36 , Eq. 6.107] ≤ 1 n H ε ′ max ( A n 1 | B n 1 ) ρ + 1 n log 1 1 − ( ε + ε ′ ) 2 Eq. (92) ≤ H ( A | B ) σ + o ( n ) n . (93) (iv) Conv erse part for smo oth max-entrop y: F or a ﬁxed ε ∈ (0 , 1) consider an arbitrary ε ′ ∈ (0 , 1 − ε ). Then 1 n H ε ′ max ( A n 1 | B n 1 ) ρ [ 36 , Eq. 6.107] ≥ 1 n H ε min ( A n 1 | B n 1 ) ρ − 1 n log 1 1 − ( ε + ε ′ ) 2 Eq. (87) ≥ H ( A | B ) σ − o ( n ) n . (94) Com bining Equations (87) and (92) to (94) completes the pro of. ■ 4.4 Pro of of Theorem 4.1 The monotonicit y of the R ´ enyi divergence in α [ 31 ] implies that for α > 1 we hav e 1 n H ( A n 1 | B n 1 ) ρ ≥ 1 n H α ( A n 1 | B n 1 ) ρ (95) Lem. 4.4 ≥ H α ( A | B ) σ − 2 r n log d A − α α − 1  h  r n  + 2 r n log d AB  (96) [ 37 , Lem. 8] ≥ H ( A | B ) σ − 2 r n log d A − α α − 1  h  r n  + 2 r n log d AB  − 4( α − 1)(log η ) 2 , (97) for a constant η = √ 2 − H min ( A | B ) σ + √ 2 H max ( A | B ) σ + 1. Cho osing α = 1 + 1 / log ( r /n ) and recalling that r = o ( n ) yields 5 1 n H ( A n 1 | B n 1 ) ρ ≥ H ( A | B ) σ − o ( n ) n . (98) 5 Note that lim x → 0 (log( x ) + 1) h ( x ) = 0 and lim x → 0 (log( x ) + 1) x = 0. 15 T o see the other direction, note that for ε n = 2 p r n with r = o ( n ) 1 n H ( A n 1 | B n 1 ) ρ chain rule = 1 n n X i =1 H ( A i | A i − 1 1 B n 1 ) ρ (99) SSA [ 27 ] ≤ 1 n n X i =1 H ( A i | B i ) ρ (100) perm. inv. = H ( A 1 | B 1 ) ρ (101) Prop. 2.6 ≤ H ( A | B ) σ + 2 ε n log d A + (1 + ε n ) h  ε n 1 + ε n  (102) = H ( A | B ) σ + o ( n ) n , (103) where the p en ultimate step uses the con tin uit y of en trop y [ 42 ]. Combining Equations (98) and (103) completes the pro of. 5 Robustness of information measures for almost-iid states In this work, we justiﬁed the imp ortance of almost-iid states. This prompts the question if almost-iid states are as eﬀectiv e as p erfect iid states for information-pro cessing tasks. T o answer this, it is crucial to understand if certain functionals (that c haracterize speciﬁc information-pro cessing tasks) behav e equally or diﬀeren tly for almost-iid and p erfect iid states. In Section 4 we hav e seen that the conditional entrop y is robust for almost-iid states in the sense that it asymptotically coincides with the en trop y of iid states. This implies that also the mutual information is robust for almost-iid states. T o make this precise, recall that for a bipartite densit y matrix σ AB the m utual information is deﬁned as I ( A : B ) σ := H ( A ) σ − H ( A | B ) σ . (104) The m utual information is a popular correlation measure in the sense that it satisﬁes (i) I ( A : B ) σ ≥ 0, (ii) I ( A : B ) σ = 0 iﬀ σ AB = σ A ⊗ σ B and (iii) I ( A : B C ) σ ≥ I ( A : B ). 6 Let σ AB ∈ S( H AB ) and ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) for r = o ( n ). Then 1 n I ( A n 1 : B n 1 ) ρ Eq. (104) = 1 n H ( A n 1 ) ρ − 1 n H ( A n 1 | B n 1 ) ρ (105) Thm. 4.1 = H ( A ) σ − H ( A | B ) σ + o ( n ) n (106) Eq. (104) = I ( A : B ) σ + o ( n ) n . (107) It is natural to ask if p opular entanglemen t measures are also robust for almost-iid states. In abstract terms, let E ( · ) b e an arbitrary en tanglemen t measure. Let σ AB ∈ S( H AB ) and ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) for r = o ( n ). Is it true that 1 n E ( A n 1 : B n 1 ) ρ ? = 1 n E ( A n 1 : B n 1 ) σ ⊗ n + o ( n ) n (108) In the follo wing, w e discuss the robustness of (a) squashed en tanglement, (b) en tanglemen t distillation, (c) en tanglemen t cost, and (d) relativ e entrop y of entanglemen t. 6 Properties (i) and (ii) follow b y noting that I ( A : B ) ρ = D ( ρ AB ∥ ρ A ⊗ ρ B ). The third prop erty follo ws from strong subadditivity together with the chain rule as I ( A : B C ) σ = I ( A : B ) σ + I ( A : C | B ) σ ≥ I ( A : B ) σ . 16 5.1 Robustness of squashed en tanglemen t Ab o v e w e ha ve seen that the m utual information is robust under almost-iid states. The same argumen t can b e extended to see that the conditional mutual information also coincides for almost-iid and iid states. The squashe d entanglement [ 14 ] is an entanglemen t measure that is based on the conditional m utual information. Given a biparitite density matrix ρ AB ∈ S( H AB ), the squashed en tanglemen t is deﬁned as E sq ( A : B ) ρ := 1 2 inf ρ ABE ∈ S( H ABE )  I ( A : B | E ) ρ : tr E [ ρ AB E ] = ρ AB  , (109) where there is no b ound on the dimension of E . It features many desirable prop erties such as b eing ad- ditiv e on tensor pro ducts and sup eradditive in general. W e next sho w that the squashed en tanglemen t for almost-iid and iid states coincide. Corollary 5.1. L et σ AB ∈ S( H AB ) and ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) for r = o ( n ) . Then 1 n E sq ( A n 1 : B n 1 ) ρ = E sq ( A : B ) σ + o ( n ) n . (110) Pr o of. Let d A := dim( H A ) and d B := dim( H B ). W e can employ the p ermutation inv ariance of ρ and the sup eradditivit y of the squashed en tanglemen t [ 14 , Prop osition 4] to write for ε n := 4 p r n 1 n E sq ( A n 1 : B n 1 ) ρ superadditivity ≥ 1 n n X i =1 E sq ( A i : B i ) ρ (111) perm. inv. = E sq ( A : B ) ρ (112) contin uit y & Prop. 2.6 ≥ E sq ( A : B ) σ − 12 ε n log( d A d B ) − 6 h ( ε n ) (113) = E sq ( A : B ) σ + o ( n ) n , (114) where the contin uit y of squashed en tanglement follo ws from the contin uit y of the conditional entrop y [ 1 ] as explained in [ 14 , Section IV]. It thus remains to prov e the other direction. F or any ξ > 0 there exists an extension σ AB E or σ AB with d E := dim( H E ) < ∞ such that    E sq ( A : B ) σ − 1 2 I ( A : B | E ) σ    ≤ ξ . (115) T o see this, note that by the deﬁnition of the squashed en tanglemen t there exists an extension σ ′ AB E of σ AB (with p ossibly un b ounded E -system) such that    E sq ( A : B ) σ − 1 2 I ( A : B | E ) σ ′    ≤ ξ 2 . (116) Cho ose a ﬁnite-dimensional pro jector Π E on the E -system such that tr[Π E σ ′ AB E ] = 1 − ε for some ε > 0. Let σ AB E := Π E σ ′ AB E Π † E +  1 − tr[Π E σ ′ AB E ]  | e ⟩ ⟨ e | AB E , (117) where | e ⟩ is a state orthogonal to the supp ort of Π E . By the contin uit y of the conditional entrop y [ 1 ] w e can c ho ose ε > 0 such that | I ( A : B | E ) σ ′ − I ( A : B | E ) σ | ≤ ξ , (118) 17 where w e used that the con tin uit y of the conditional entrop y do es not dep end on the dimension of the conditioning system. The triangle inequality implies    E sq ( A : B ) σ − 1 2 I ( A : B | E ) σ    ≤    E sq ( A : B ) σ − 1 2 I ( A : B | E ) σ ′    +    1 2 I ( A : B | E ) σ ′ − 1 2 I ( A : B | E ) σ    Eqs. (116) and (118) ≤ ξ , (119) whic h th us justiﬁes Equation (115) . Due to Lemma B.5 , there exists an extension ρ A n 1 B n 1 E n 1 of ρ A n 1 B n 1 whic h is an  n r  -almost-iid state in σ AB E . Hence, 1 n E sq ( A n 1 : B n 1 ) ρ ≤ 1 2 n I ( A n 1 : B n 1 | E n 1 ) ρ (120) = 1 2 n H ( A n 1 | E n 1 ) ρ − 1 2 n H ( A n 1 | B n 1 E n 1 ) ρ (121) Thm. 4.1 = 1 2 H ( A | E ) σ − 1 2 H ( A | B E ) σ + o ( n ) n (122) = 1 2 I ( A : B | E ) σ + o ( n ) n (123) Eq. (115) ≤ E sq ( A : B ) σ + ξ + o ( n ) n . (124) Since this holds for an y ξ > 0 we can consider ξ → 0, which concludes the pro of. ■ 5.2 Robustness of en tanglemen t distillation and en tanglemen t cost Let | Φ ⟩ AB denote an en tangled Bell state. Giv en a bipartite density matrix ρ AB ∈ S( H AB ), recall the deﬁnitions of entanglement distil lation [ 3 , 4 , 5 ] E D ( A : B ) ρ := lim ε → 0 lim n →∞ sup n m n : inf P n ∈ LOCC 1 2   P n ( ρ ⊗ n ) − | Φ ⟩ ⟨ Φ | ⊗ m   1 ≤ ε o (125) and entanglement c ost [ 24 ] E C ( A : B ) ρ := lim ε → 0 lim n →∞ inf n m n : inf P n ∈ LOCC 1 2   P n ( | Φ ⟩ ⟨ Φ | ⊗ m ) − ρ ⊗ n   1 ≤ ε o . (126) Question 5.2. Let σ AB ∈ S( H AB ) and ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) for r = o ( n ). Is it true that 1 n E D ( A n 1 : B n 1 ) ρ ? = E D ( A : B ) σ + o ( n ) n (127) As discussed in [ 29 ], there are strong indications that one direction of Equation (127) holds, namely that 1 n E D ( A n 1 : B n 1 ) ρ ≥ E D ( A : B ) σ + o ( n ) n . (128) Whether the other direction holds also remains an op en question. The equiv alent question for the entanglemen t cost asks: Question 5.3. Let σ AB ∈ S( H AB ) and ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) for r = o ( n ). Is it true that 1 n E C ( A n 1 : B n 1 ) ρ ? = E C ( A : B ) σ + o ( n ) n (129) 18 Ho w ev er, none of the t w o directions of Equation (129) are known to hold. A t this p oint, we emphasize that, unlike squashed entanglemen t, already the deﬁnitions of both en tanglemen t cost and en tanglemen t distillation rely on a tensor pow er (iid) structure. This may suggest that, rather than asking about the robustness of Equations (125) and (126) , one should in- corp orate robustness directly in to the deﬁnition itself. One may therefore wonder ho w these notions w ould change if an almost-iid structure were built into the deﬁnition from the outset. In [ 29 ], the authors inv estigate this question b y introducing new asymptotic state transformation rates that av oid the standard iid assumption. W e refer the in terested reader to that pap er for further details. 5.3 Robustness of relativ e en tropy of entanglemen t A popular measure to quan tify the amoun t of entanglemen t is the r elative entr opy of entanglement [ 39 ] deﬁned as E R ( A : B ) ρ := min σ AB ∈ SEP( A : B ) D ( ρ AB ∥ σ AB ) , (130) where SEP( A : B ) := con v {| ϕ ⟩ ⟨ ϕ | A ⊗ | φ ⟩ ⟨ φ | B : | ϕ ⟩ A ∈ H A , | φ ⟩ B ∈ H B , ⟨ ϕ | ϕ ⟩ = ⟨ φ | φ ⟩ = 1 } . It is kno wn [ 40 ] that the relative entrop y of en tanglemen t is not additiv e under the tensor product, whic h justiﬁes the deﬁnition of a regularized version E ∞ R ( A : B ) ρ := lim k →∞ 1 k E R ( A k 1 : B k 1 ) ρ ⊗ k . The limit in the regularization exists due to F ekete’s subadditivity lemma. Question 5.4. Let σ AB ∈ S( H AB ) and ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) for r = o ( n ). Is it true that 1 n E R ( A n 1 : B n 1 ) ρ ? = 1 n E R ( A n 1 : B n 1 ) σ ⊗ n + o ( n ) n (131) If Equation (131) were true, this would sav e the original proof of the generalized quantum Stein’s lemma [ 23 , 26 ] b y Brand˜ ao and Plenio [ 9 ] (see also [ 6 , 7 ]). One direction of Equation (131) follows from the results developed in this pap er. T o see this, c ho ose s n a monotonically increasing sequence of in tegers such that s n r = o ( n ) and lim n →∞ s n = ∞ . Let k n = ⌈ n/s n ⌉ . Note that n ≤ s n k n ≤ n + s n . Hence, using the monotonicit y of the en tanglement of formation under partial trace, 1 n E R ( A n 1 : B n 1 ) ρ ≤ 1 n E R  A s n k n 1 : B s n k n 1  ρ (132) ≤ 1 s n k n E R  A s n k n 1 : B s n k n 1  ρ + s n n log d A d B , (133) where the ﬁnal step uses E R ( A m : B m ) ρ ≤ m log ( d A d B ) and s n = o ( n ). In the following steps, we omit the subscripts n for b etter readability . Let ω s ∈ arg min τ A s 1 B s 1 ∈ SEP D ( ρ A s 1 B s 1 ∥ τ A s 1 B s 1 ). Then for ρ s = tr n − s [ ρ n ] w e ha v e 1 n E R ( A n 1 : B n 1 ) ρ Eq. (133) ≤ 1 sk E R  A sk 1 : B sk 1  ρ + o ( n ) n (134) = 1 sk min τ A sk 1 B sk 1 ∈ SEP D ( ρ A sk 1 B sk 1 ∥ τ A sk 1 B sk 1 ) + o ( n ) n (135) ≤ 1 sk D  ρ sk ∥ ( ω s ) ⊗ k  + o ( n ) n (136) = − 1 sk H ( ρ sk ) − 1 s tr[ ρ s log ω s ] + o ( n ) n (137) Thm. 4.1 = − 1 s H ( σ ⊗ s ) − 1 s tr[ ρ s log ω s ] + o ( n ) n (138) 19 contin uit y & Prop. 2.6 ≤ 1 s H ( ρ s ) − 1 s tr[ ρ s log ω s ] + o ( n ) n (139) = 1 s E R ( A s 1 : B s 1 ) ρ + o ( n ) n (140) contin uit y & Prop. 2.6 ≤ 1 s E R ( A s 1 : B s 1 ) σ ⊗ s + o ( n ) n (141) = 1 n E R ( A n 1 : B n 1 ) σ ⊗ n + o ( n ) n , (142) where the contin uit y of the von Neumann entrop y and the relative entrop y of entanglemen t can b e found in [ 2 , 32 , 42 ]. In the last step, we use that lim n →∞ s n = ∞ and that the limit in the RHS of Equation (131) exists due to F ekete’s subadditivity lemma. The other direction of Equation (131) app ears more complicated and remains an op en question. Ac kno wledgemen ts W e thank F ernando Brand˜ ao for his talk on almost-iid states at the SwissMAP Researc h Station (SRS) conference in Les Diablerets 2024, which motiv ated us to write this pap er. W e further thank F r´ ed´ eric Dupuis and Ludo vico Lami for insightful discussions on this topic at the same conference. GM and RR ackno wledge supp ort from the NCCR SwissMAP , the ETH Zurich Quantum Cen ter, the SNSF pro ject No. 20QU-1 225171, and the CHIST-ERA pro ject MoDIC. App endix A Justiﬁcation of Remark 2.3 T o justify the assertion of Remark 2.3 , we need to show that for all puriﬁcations | ψ ⟩ A 2 1 E 2 1 of ρ A 2 1 and for all puriﬁcations | θ ⟩ AE of | 0 ⟩ A it follo ws that | ψ ⟩ A 2 1 E 2 1 ∈ Sym 2 ( H AE ) ∩ span V ( H ⊗ 2 AE , | θ ⟩ AE ) T o see this note that, b ecause | 0 ⟩ A is pure, | θ ⟩ AE = | 0 ⟩ A ⊗ | ϑ ⟩ E . Similarly , b ecause ρ A 2 1 = | Ψ − ⟩ ⟨ Ψ − | is pure, any puriﬁcation on E m ust b e of the form | Ψ − ⟩ A 1 A 2 ⊗ | φ ⟩ E 1 E 2 =: | ψ ⟩ A 2 1 E 2 1 . T o ensure that ρ A 2 1 is an almost-iid state according to the deﬁnition from [ 9 ], we need | ψ ⟩ A 2 1 E 2 1 ∈ Sym 2 ( H AE ) ∩ span V ( H ⊗ 2 AE , | θ ⟩ AE ). Th us, the swap op eration π must satisfy π | ψ ⟩ A 2 1 E 2 1 = ( π | Ψ − ⟩ A 1 A 2 ) ⊗ ( π | φ ⟩ E 1 E 2 ) = −| Ψ − ⟩ A 1 A 2 ⊗ ( π | φ ⟩ E 1 E 2 ) ! = | Ψ − ⟩ A 1 A 2 ⊗ | φ ⟩ E 1 E 2 . (143) This yields π | φ ⟩ E 1 E 2 = −| φ ⟩ E 1 E 2 , i.e., | φ ⟩ is anti-symmetric. Expressing this in an orthonormal basis {| i ⟩ E } i of E with | 0 ⟩ E = | ϑ ⟩ E giv es | φ ⟩ E 1 E 2 = P d E − 1 i,j =0 α i,j | i ⟩ E 1 | j ⟩ E 2 with α i,j = − α j,i for all i, j ∈ { 0 , . . . , d E − 1 } . Hence, | ψ ⟩ A 2 1 E 2 1 = 1 √ 2 X i 0 we hav e P h ∥ λ x ′ − P X ∥ 1 > r 2(ln 2)  δ + |X | log( n − r + 1) n − r i [ 33 , Cor. B.3.3] ≤ 2 − ( n − r ) δ . (169) Using r ≤ n 2 this can b e simpliﬁed to P h ∥ λ x ′ − P X ∥ 1 > r 2(ln 2)  δ + 2 |X | log( n 2 + 1) n i ≤ 2 − nδ 2 . (170) Using λ x = n − r n λ x ′ + r n λ x ′′ yields ∥ λ x − P X ∥ 1 triangle ≤ n − r n ∥ λ x ′ − P X ∥ 1 + r n ∥ λ x ′′ − P X ∥ 1 ≤ ∥ λ x ′ − P X ∥ 1 + 2 r n . (171) Hence, P h ∥ λ x − P X ∥ 1 > r 2(ln 2)  δ + 2 |X | log( n 2 + 1) n  + 2 r n i Eq. (171) ≤ P h ∥ λ x ′ − P X ∥ 1 > r 2(ln 2)  δ + 2 |X | log( n 2 + 1) n i (172) Eq. (170) ≤ 2 − nδ 2 . (173) This can b e rewritten as P x ←| Ψ t ⟩ [ x ∈ W δ ] ≤ 2 − nδ 2 , (174) for W δ = n x ∈ X n : ∥ λ x − P X ∥ 1 > r 2(ln 2)  δ + 2 |X | log( n 2 + 1) n  + 2 r n o . (175) The notation x ← | Ψ t ⟩ indicates that x is distributed according to the outcomes of the measurement applied to | Ψ t ⟩ . F or M x = M x 1 ⊗ . . . ⊗ M x n w e ﬁnd P x ← ρ A n 1 E n 1 [ x ∈ W δ ] = X x ∈W δ tr[ ρ A n 1 E n 1 M x ] (176) Eq. (57) ≤ X x ∈W δ |T | X t ∈T β t,t ⟨ Ψ t | M x | Ψ t ⟩ (177) = |T | X t ∈T β t,t P x ←| Ψ t ⟩ [ x ∈ W δ ] (178) Eq. (174) ≤ |T | 2 − nδ 2 (179) Remark 2.4 ≤ 2 − n ( δ 2 − h ( r n )) d 2 r . (180) Cho osing δ = 2 log ( 1 ε ) n + 2 h ( r n ) + 4 r n log( d ) yields P h ∥ λ x − P X ∥ 1 > s 4(ln 2)  log( 1 ε ) n + h  r n  + 2 r n log( d ) + |X | log( n 2 + 1) n  i Eq. (180) ≤ ε , (181) whic h completes the pro of. ■ 24 D Pro of of Prop osition 3.1 Consider a ( n + k )-bit string with m ones and n + k − m zeros. If we throw a w ay k bits, then the probabilit y of ha ving j ones in the remaining n -bit string is p j =  n j  k m − j   n + k m  . (182) Note that P m j =0 p j = 1 follows from a kno wn identit y due to V andermonde which states that for all r , s, t ∈ N we hav e  r + s t  = t X ℓ =0  r ℓ  s t − ℓ  . (183) T o see Equation (182) , let v ∈ { 0 , 1 , . . . , k } denote the num ber of ones that hav e b een thro wn a w a y . Hence, p j = 1 c  n j  k v  = 1 c  n j  k m − j  . (184) for some normalization constan t c = m X j =0  n j  k m − j  Eq. (183) =  n + k m  . (185) F act D.1. F or the setting ab ove, we have V ar p [ X ] = k mn ( n + k − m ) ( n + k − 1)( n + k ) 2 . (186) Pr o of. Recall that j  n j  = n  n − 1 j − 1  . (187) With this w e can write E p [ X ] = m X j =0 j p j (188) Eq. (182) = m X j =0 j  n j  k m − j   n + k m  (189) Eq. (187) = n  n + k m  m X j =1  n − 1 j − 1  k m − j  (190) = n  n + k m  m − 1 X j =0  n − 1 j  k m − 1 − j  (191) Eq. (183) = n  n + k m   n + k − 1 m − 1  (192) Eq. (187) = nm n + k . (193) 25 Similarly , we ﬁnd E p [ X 2 ] = m X j =0 j 2 p j (194) Eq. (182) = 1  n + k m  m X j =0 j 2  n j  k m − j  (195) = n  n + k m  m − 1 X j =0 ( j + 1)  n − 1 j  k m − 1 − j  (196) = n  n + k m    m − 1 X j =0 j  n − 1 j  k m − 1 − j  + m − 1 X j =0  n − 1 j  k m − 1 − j    (197) Eqs. (183) and (187) = n  n + k m    ( n − 1) m − 2 X j =0  n − 2 j  k m − 2 − j  +  n + k − 1 m − 1    (198) Eq. (183) = n  n + k m   ( n − 1)  n + k − 2 m − 2  +  n + k − 1 m − 1  (199) = n ( n − 1)  n + k − 1 m − 1  n + k − 2 m − 2   n + k m  n + k − 1 m − 1  + n  n + k − 1 m − 1   n + k m  (200) = n ( n − 1) m ( m − 1) ( n + k )( n + k − 1) + n m n + k . (201) Com bining ev erything yields V ar p [ X ] = E p [ X 2 ] − ( E p [ X ]) 2 Eqs. (193) and (201) = k mn ( n + k − m ) ( n + k − 1)( n + k ) 2 . (202) ■ Let Q ( q ) X n 1 ∈ V ( X n , q n − r ) and consider a binary random string X n 1 ∼ Q ( q ) X n 1 . Then without loss of generalit y assume that the r defects are at the end of the random string, and hence V ar h n X i =1 X i i = V ar h n − r X i =1 X i i + V ar h n X i = n − r +1 X i i | {z } ≥ 0 ≥ ( n − r )V ar q [ X ] , (203) where the ﬁrst equalit y uses that the defects are indep enden t of iid parts. Example D.2. Let α ∈ (0 , 1), m = n + k 2 , k = n α and r = o ( n ). Then Fact D.1 giv es V ar p [ X ] = n 1+ α 4( n + n α − 1) = Θ( n α ). F urthermore, Equation (203) yields V ar h P n i =1 X i i ≥ Θ( n ). F act D.3. L et t ∈ [0 , 1] and p, q b e two pr ob ability distributions. Then, V ar tp +(1 − t ) q [ X ] ≥ t V ar p [ X ] + (1 − t )V ar q [ X ] . (204) Pr o of. By deﬁnition of the v ariance, we hav e V ar tp +(1 − t ) q [ X ] = E tp +(1 − t ) q [ X 2 ] − ( E tp +(1 − t ) q [ X ]) 2 (205) 26 = t E p [ X 2 ] + (1 − t ) E q [ X 2 ] − ( t E p [ X ] + (1 − t ) E q [ X ]) 2 (206) ≥ t E p [ X 2 ] + (1 − t ) E q [ X 2 ] − ( t E p [ X ] 2 + (1 − t ) E q [ X ] 2 ) (207) = t V ar p [ X ] + (1 − t )V ar q [ X ] . (208) ■ Putting ev erything together, w e obtain for α ∈ (0 , 1) V ar p [ X ] Ex. D.2 = Θ( n α ) and V ar R Q ( q ) X ν (d q ) [ X ] Fact D.3 ≥ Z ν (d q )V ar Q ( q ) X [ X ] Ex. D.2 = Θ( n ) . (209) This prov es the assertion of Prop osition 3.1 . Note that for b etter readabilit y , the ab ov e proof has b een done for the sp eciﬁc c hoice k = n α = o ( n ) for α ∈ (0 , 1), but the same argumen t remains v alid for an arbitrary k = o ( n ). ■ E Alternativ e pro of of Theorem 4.1 In this section, w e presen t an alternative pro of for Theorem 4.1 which uses an en tirely diﬀerent pro of tec hnique which may b e of indep endent interest. How ev er, we note that the scaling of the defects r is sligh tly w orse than in Theorem 4.1 . Note that it suﬃces to prov e the following result. Theorem E.1. L et σ A ∈ S( H A ) and ρ A n 1 ∈ S n ( H A , σ ⊗ n − r A ) for r = o ( √ n ) . Then 1 n H ( A n 1 ) ρ = H ( A ) σ + o ( n ) n . (210) Theorem E.1 implies that the conditional en tropy of almost-iid states coincides asymptotically with the conditional en trop y of iid states. Corollary E.2. L et σ AB ∈ S( H AB ) and ρ A n 1 B n 1 ∈ S n ( H AB , σ ⊗ n − r AB ) for r = o ( √ n ) . Then 1 n H ( A n 1 | B n 1 ) ρ = H ( A | B ) σ + o ( n ) n . (211) Pr o of. By deﬁnition of the conditional entrop y , we hav e 1 n H ( A n 1 | B n 1 ) ρ = 1 n H ( A n 1 B n 1 ) ρ − 1 n H ( B n 1 ) ρ Thm. E.1 = H ( AB ) σ − H ( B ) σ + o ( n ) n = H ( A | B ) σ + o ( n ) n . ■ E.1 Pro of of Theorem E.1 One direction of Equation (210) is simple. T o see this, let ε n = 2 p rs n for s = ⌊ √ n ⌋ and consider 1 n H ( A n 1 ) ρ subadditivity ≤ H ( A 1 ) ρ Prop. 2.6 ≤ H ( A ) σ + ε n log d + h ( ε n ) = H ( A ) σ + o ( n ) n , (212) where the p en ultimate step uses the con tin uit y of entrop y [ 2 , 32 , 42 ]. The other direction is more complicated. Recall that strong subadditivit y of quantum en tropy (SSA) [ 27 , 28 ] ensures I ( A : C | B ) ρ ≥ 0. F urthermore, the conditional m utual information satisﬁes a c hain rule I ( A : B C ) ρ = I ( A : B ) ρ + I ( A : C | B ) ρ . Let | θ ⟩ AE b e a puriﬁcation of σ A and let ρ A n 1 E n 1 27 b e an extension of ρ A n 1 that satisﬁes the tw o conditions of Deﬁnition 2.1 . Since ρ A n 1 E n 1 is p erm utation- in v ariant, for en trop y and m utual information terms the indices of the considered subsystems can b e c hanged. Hence, w e ﬁnd for any k ≤ ⌊ n 2 ⌋ =: n 0 , k ′ ≤ n 0 + 1 and for any ℓ = k , . . . , n 0 , ℓ ′ = k ′ , . . . , n 0 + 1 I ( A n 0 1 E n 0 1 : A n n 0 +1 E n n 0 +1 ) ρ SSA ≥ I ( A n 0 1 : A n n 0 +1 ) ρ (213) chain rule = I ( A ℓ 1 : A n n 0 +1 | A n 0 ℓ +1 ) ρ + I ( A n 0 ℓ +1 : A n n 0 +1 ) ρ (214) SSA ≥ I ( A k 1 : A n n 0 +1 | A n 0 ℓ +1 ) ρ (215) chain rule = I ( A k 1 : A n 0 + ℓ ′ n 0 +1 | A n 0 ℓ +1 A n n 0 + ℓ ′ +1 ) ρ + I ( A k 1 : A n n 0 + ℓ ′ +1 | A n 0 ℓ +1 ) ρ (216) SSA ≥ I ( A k 1 : A n 0 + k ′ n 0 +1 | A n 0 ℓ +1 A n n 0 + ℓ ′ +1 ) ρ (217) perm . inv . = I ( A k 1 : A k + k ′ k +1 | A m k + k ′ +1 ) ρ , (218) where in the last step, we p ermute all the remaining systems app earing in the conditioning (there are n − ℓ − ℓ ′ man y) in to neighboring systems labeled b y indices from ( k + k ′ + 1) to m := ( k + k ′ + n − ℓ − ℓ ′ ). Th us, for an y m ≥ k + k ′ w e ﬁnd I ( A k 1 : A k + k ′ k +1 | A m k + k ′ +1 ) ρ Eq. (218) ≤ I ( A n 0 1 B n 0 1 : A n n 0 +1 E n n 0 +1 ) ρ (219) = H ( A n 0 1 E n 0 1 ) ρ + H ( A n n 0 +1 E n n 0 +1 ) ρ − H ( A n 1 E n 1 ) ρ (220) ≤ H ( A n 0 1 E n 0 1 ) ρ + H ( A n n 0 +1 E n n 0 +1 ) ρ . (221) In the pro of of Lemma B.3 it is sho wn that ρ A n 0 1 E n 0 1 ∈ span V ( H ⊗ n 0 AE , | θ ⟩ ⊗ n 0 − r AE ). This implies H ( A n 0 1 E n 0 1 ) ρ Eq. (9) ≤ log  2 n 0 h ( r/n 0 ) d r AE  = n 0 h  r n 0  + r log d AE . (222) Similarly , we obtain H ( A n n 0 +1 E n n 0 +1 ) ρ ≤ ( n − n 0 ) h  r n − n 0  + r log d AE . (223) Hence w e ﬁnd I ( A k 1 : A k + k ′ k +1 | A m k + k ′ +1 ) ρ Eq. (221) ≤ H ( A n 0 1 B n 0 1 ) ρ + H ( A n n 0 +1 B n n 0 +1 ) ρ (224) Eqs. (222) and (223) ≤ nh  2 r n  + 2 r log d AE . (225) F or any ℓ < s = ⌊ (log n ) 3 2 ⌋ , w e th us ha v e    ρ A ℓ +1 1 − σ ⊗ ( ℓ +1) A    1 ≤   ρ A s 1 − σ ⊗ s A   1 Prop. 2.6 ≤ ε n , (226) where the ﬁrst step follows since the trace distance is contractiv e under trace-preserving completely p ositiv e maps [ 43 , Theorem 8.16]. The contin uit y of conditional entrop y [ 1 ] then implies that | H ( A 1 | A ℓ +1 2 ) ρ − H ( A ) σ | = | H ( A 1 | A ℓ +1 2 ) ρ − H ( A 1 | A ℓ +1 2 ) σ ⊗ ( ℓ +1) | ≤ 4 ε n log d A + 2 h ( ε n ) =: δ n , (227) where d A = dim( A ). The chain rule together with p ermutation inv ariance allows us to write H ( A n 1 ) ρ chain rule = H ( A n − ℓ 1 | A n n − ℓ +1 ) ρ + H ( A n n − ℓ +1 ) ρ (228) ≥ H ( A n − ℓ 1 | A n n − ℓ +1 ) ρ (229) 28 chain rule = n − ℓ X i =1 H ( A i | A n n − ℓ +1 ) ρ − n − ℓ X i =1 I ( A i − 1 1 : A i | A n n − ℓ +1 ) ρ (230) perm. inv. = ( n − ℓ ) H ( A 1 | A ℓ +1 2 ) ρ − n − ℓ X i =1 I ( A i − 1 1 : A i | A n n − ℓ +1 ) ρ (231) Eq. (227) ≥ ( n − ℓ ) H ( A ) σ − n − ℓ X i =1 I ( A i − 1 1 : A i | A n n − ℓ +1 ) ρ − nδ n (232) = ( n − ℓ ) H ( A ) σ − n − s X i =1 I ( A i − 1 1 : A i | A n n − ℓ +1 ) ρ − n − ℓ X i = n − s +1 I ( A i − 1 1 : A i | A n n − ℓ +1 ) ρ − nδ n (233) ≥ ( n − ℓ ) H ( A ) σ − n − s X i =1 I ( A i − 1 1 : A i | A n n − ℓ +1 ) ρ − 2( s − ℓ ) log d A − nδ n (234) perm. inv. = ( n − ℓ ) H ( A ) σ − n − s X i =1 I ( A i − 1 1 : A i | A n − s + ℓ n − s +1 ) ρ − 2( s − ℓ ) log d A − nδ n . (235) Claim E.3. L et n ∈ N , exists ℓ ≤ s = ⌊ √ n ⌋ such that for P n − s i =1 I ( A i − 1 1 : A i | A n − s + ℓ n − s +1 ) ρ =: ξ n we have lim n →∞ ξ n n = 0 . Pr o of. The proof follows the idea from [ 8 , Equation (5)], whic h prov es a v ariant of Claim E.3 for a purely classical scenario. By the p ermutation-in v ariance w e ha v e for any 1 ≤ i < k ≤ m ≤ n I ( A i − 1 1 : A i | A m k +1 ) ρ = I ( A i − 1 1 : A m | A m − 1 k ) ρ . (236) This allo ws us to write n X m = k I ( A i − 1 1 : A i | A m k +1 ) ρ ( 236 ) = n X m = k I ( A i − 1 1 : A m | A m − 1 k ) ρ chain rule = I ( A i − 1 1 : A n k ) ρ . (237) Summing Equation (237) o v er all 1 ≤ i < k and dividing by n − k + 1 gives 1 n − k + 1 n X m = k k − 1 X i =1 I ( A i − 1 1 : A i | A m k +1 ) ρ = 1 n − k + 1 k − 1 X i =1 I ( A i − 1 1 : A n k ) ρ . (238) Hence, there exists m ∗ ∈ { k , k + 1 , . . . , n } such that k − 1 X i =1 I ( A i − 1 1 : A i | A m ∗ k +1 ) ρ ≤ 1 n − k + 1 k − 1 X i =1 I ( A i − 1 1 : A n k ) ρ . (239) Cho osing k = n − s , Equation (239) c an b e rewritten (as there exists 0 ≤ ℓ ≤ s such that m ∗ = k + ℓ ) as n − s − 1 X i =1 I ( A i − 1 1 : A i | A n − s + ℓ n − s +1 ) ρ ≤ 1 s + 1 n − s X i =1 I ( A i − 1 1 : A n n − s ) ρ (240) W e next show that Equation (225) implies I ( A i − 1 1 : A n n − s ) ρ ≤ 2 nh  2 r n  + 4 r log d AE ∀ i = 1 , . . . , n − s . (241) 29 T o see this, we can assume w.l.o.g. 8 that n is large enough suc h that k ′ := s + 1 = j (log n ) 3 2 k + 1 ≤ j n 2 k + 1 = n 0 + 1 . (242) Let us no w consider tw o cases: In case i − 1 ≤ n 0 , the assertion follo ws from Equation (225) b y choosing m = k + k ′ , since I ( A i − 1 1 : A n n − s ) ρ perm. inv. = I ( A i − 1 1 : A i + s i ) ρ ≤ nh  2 r n  + 2 r log d AE . (243) In case i − 1 > n 0 , w e can use the c hain rule to write I ( A i − 1 1 : A n n − s ) ρ = I ( A n 0 1 : A n n − s ) ρ + I ( A i − 1 n 0 +1 : A n n − s | A n 0 1 ) ρ (244) perm. inv. = I ( A n 0 1 : A n 0 +1+ s n 0 +1 ) ρ + I ( A i − 1 − n 0 1 : A i − n 0 + s i − n 0 | A i + s i − n 0 + s +1 ) ρ , (245) where b oth terms on the right-hand side are b ounded from ab o v e by nh ( 2 r n ) + 2 r log d AE via Equa- tion (225) . Putting things together yields n − s X i =1 I ( A i − 1 1 : A i | A n − s + ℓ n − s +1 ) ρ = n − s − 1 X i =1 I ( A i − 1 1 : A i | A n − s + ℓ n − s +1 ) ρ + I ( A n − s − 1 1 : A n − s | A n − s + ℓ n − s +1 ) ρ (246) Eq. (240) ≤ 1 s + 1 n − s X i =1 I ( A i − 1 1 : A n n − s ) ρ + 2 log d A (247) Eq. (241) ≤ n − s s + 1  2 nh  2 r n  + 4 r log d AE  + 2 log d A (248) = o ( n ) , (249) where the ﬁnal step uses that for s = ⌊ √ n ⌋ and r = o ( n ) w e ha v e lim n →∞ n s h  2 r n  = 0 . (250) ■ Since lim n →∞ δ n = 0, com bining Claim E.3 with Equation (235) yields for some ℓ ≤ s = ⌊ √ n ⌋ H ( A n 1 ) ρ ≥ ( n − ℓ ) H ( A ) σ − o ( n ) = nH ( A ) σ − o ( n ) . (251) ■ References [1] R. Alicki and M. F annes. Con tin uit y of quantum conditional information. Journal of Physics A: Mathematic al and Gener al , 37(5):55–57, 2004. DOI: doi:10.1088/0305-4470/37/5/L01 . [2] K. M. R. Audenaert. A sharp con tin uit y estimate for the v on Neumann en tropy . Journal of Physics A: Mathematic al and The or etic al , 40(28):8127, 2007. Av ailable online: http://stacks. iop.org/1751- 8121/40/i=28/a=S18 . 8 It can b e shown that s + 1 ≤ n 0 + 2 for all n ∈ N . Hence, if s + 1 = n 0 + 2, due to the chain rule we can write I ( A i − 1 1 : A n n − s ) ρ = I ( A i − 1 1 : A n n − s +1 ) ρ + I ( A i − 1 1 : A n − s n − s | A n n − s +1 ) ρ ≤ I ( A i − 1 1 : A n n − s +1 ) ρ + 2 log d A , and the argument above still works. 30 [3] C. H. Bennett, H. J. Bernstein, S. Popescu, and B. Sch umacher. Concen trating partial entangle- men t b y lo cal op erations. Phys. R ev. A , 53:2046–2052, 1996. DOI: 10.1103/PhysRevA.53.2046 . [4] C. H. Bennett, G. Brassard, S. P op escu, B. Sch umacher, J. A. Smolin, and W. K. W o otters. Puriﬁcation of noisy entanglemen t and faithful telep ortation via noisy channels. Phys. R ev. L ett. , 76:722–725, 1996. DOI: 10.1103/PhysRevLett.76.722 . [5] C. H. Bennett, D. P . DiVincenzo, J. A. Smolin, and W. K. W o otters. Mixed-state en tanglemen t and quantum error correction. Physic al R eview A , 54(5):3824–3851, 1996. DOI: 10.1103/PhysRevA.54.3824 . [6] M. Berta, F. G. S. L. Brand˜ ao, G. Gour, L. Lami, M. B. Plenio, B. Regula, and M. T omamichel. On a gap in the pro of of the generalised quantum Stein’s lemma and its consequences for the rev ersibilit y of quantum resources. Quantum , 7:1103, 2023. DOI: 10.22331/q-2023-09-07-1103 . [7] M. Berta, F. G. S. L. Brand˜ ao, G. Gour, L. Lami, M. B. Plenio, B. Regula, and M. T omamichel. The tangled state of quantum h ypothesis testing. Natur e Physics , 20(2):172–175, 2024. DOI: 10.1038/s41567-023-02289-9 . [8] M. Berta, L. Gav alakis, and I. Konto yiannis. A third information-theoretic approach to ﬁnite de Finetti theorems, 2023. DOI: 10.48550/arXiv.2304.05360 . [9] F. G. S. L. Brand˜ ao and M. B. Plenio. A generalization of quantum Stein’s lemma. Communic a- tions in Mathematic al Physics , 295(3):791–828, 2010. DOI: 10.1007/s00220-010-1005-z . [10] F. Buscemi, D. Sutter, and M. T omamic hel. An information-theoretic treatment of quan tum dic hotomies. Quantum , 3:209, 2019. DOI: 10.22331/q-2019-12-09-209 . [11] E. Carlen. T r ac e Ine qualities and Quantum Entr opy: A n Intr o ductory Course . Con temp orary Mathematics, 2009. DOI: 10.1090/conm/529 . [12] C. M. Ca v es, C. A. F uchs, and R. Schac k. Unknown quan tum states: The quan tum de Finetti rep- resen tation. Journal of Mathematic al Physics , 43(9):4537–4559, 2002. DOI: 10.1063/1.1494475 . [13] M. Christandl, R. K¨ onig, G. Mitc hison, and R. Renner. One-and-a-half quantum de Finetti theorems. Communic ations in Mathematic al Physics , 273(2):473–498, 2007. DOI: 10.1007/s00220-007-0189-3 . [14] M. Christandl and A. Win ter. “Squashed en tanglemen t”: An additiv e entanglemen t measure. Journal of Mathematic al Physics , 45(3):829–840, 2004. DOI: 10.1063/1.1643788 . [15] T. M. Cov er and J. A. Thomas. Elements of Information The ory . Wiley Interscience, 2006. DOI: 10.1002/047174882X . [16] N. Datta. Min- and max-relative entropies and a new en tanglement monotone. IEEE T r ansactions on Information The ory , 55(6):2816–2826, 2009. DOI: 10.1109/TIT.2009.2018325 . [17] N. Datta and R. Renner. Smo oth entropies and the quantum information spectrum. IEEE T r ansactions on Information The ory , 55(6):2807–2815, 2009. DOI: 10.1109/TIT.2009.2018340 . [18] B. De Finetti. La pr ´ evision: ses lois logiques, ses sources sub jectives. In Annales de l’institut Henri Poinc ar´ e , volume 7, pages 1–68, 1937. [19] B. de Finetti. Logical foundations and measuremen t of sub jectiv e probability . A cta Psycholo gic a , 34:129–145, 1970. DOI: https://doi.org/10.1016/0001-6918(70)90012-0 . [20] P . Diaconis and D. F reedman. Finite Exc hangeable Sequences. The A nnals of Pr ob ability , 8(4):745 – 764, 1980. DOI: 10.1214/aop/1176994663 . 31 [21] O. F awzi and R. Renner. Quan tum conditional mutual information and appro xi- mate Mark o v chains. Communic ations in Mathematic al Physics , 340(2):575–611, 2015. DOI: 10.1007/s00220-015-2466-x . [22] C. F uchs and J. v an de Graaf. Cryptographic distinguishabilit y measures for quantum- mec hanical states. IEEE T r ansactions on Information The ory , 45(4):1216 –1227, 1999. DOI: 10.1109/18.761271 . [23] M. Ha y ashi and H. Y amasaki. The generalized quantum Stein’s lemma and the second la w of quan tum resource theories. Natur e Physics , 21(12):1988–1993, 2025. DOI: 10.1038/s41567-025-03047-9 . [24] P . M. Hayden, M. Horo decki, and B. M. T erhal. The asymptotic en tanglemen t cost of prepar- ing a quantum state. Journal of Physics A: Mathematic al and Gener al , 34(35):6891, 2001. DOI: 10.1088/0305-4470/34/35/314 . [25] R. K¨ onig and R. Renner. A de Finetti representation for ﬁnite symmetric quan tum states. Journal of Mathematic al Physics , 46(12):122108, 2005. DOI: 10.1063/1.2146188 . [26] L. Lami. A solution of the generalized quan tum Stein’s lemma. IEEE T r ansactions on Information The ory , 71(6):4454–4484, 2025. DOI: 10.1109/TIT.2025.3543610 . [27] E. H. Lieb and M. B. Rusk ai. A fundamental prop erty of quantum-mec hanical entrop y . Physic al R eview L etters , 30:434–436, 1973. DOI: 10.1103/PhysRevLett.30.434 . [28] E. H. Lieb and M. B. Rusk ai. Pro of of the strong subadditivity of quantum-mec hanical entrop y . Journal of Mathematic al Physics , 14(12):1938–1941, 1973. DOI: 10.1063/1.1666274 . [29] G. Mazzola and R. Renner. Asymptotic transformation rates with almost iid resources, 2026. in preparation. [30] P . Monari and D. Cocchi. Introduction to Bruno de Finetti’s “probabili´ a e induzione”. Co op er ativa Libr aria Universitaria Editric e, Bolo gna , 1993. [31] M. M ¨ uller-Lennert, F. Dupuis, O. Szehr, S. F ehr, and M. T omamichel. On quantum R ´ enyi en tropies: A new generalization and some prop erties. Journal of Mathematic al Physics , 54(12), 2013. DOI: http://dx.doi.org/10.1063/1.4838856 . [32] D. Petz. Quantum Information The ory and Quantum Statistics . Springer, 2008. DOI: 10.1007/978-3-540-74636-2 . [33] R. Renner. Securit y of quantum k ey distribution. PhD thesis, ETH Zurich , 2005. a v ailable at arXiv: quant-ph/0512258 . [34] R. Renner. Symmetry of large physical systems implies independence of subsystems. Natur e Physics , 3(9):pp. 645–649, 2007. Av ailable online: http://www.nature.com/nphys/journal/ v3/n9/suppinfo/nphys684_S1.html . [35] D. Sutter. Appr oximate Quantum Markov Chains . Springer Intern ational Publishing, 2018. DOI: 10.1007/978-3-319-78732-9 5 . [36] M. T omamichel. Quantum Information Pr o c essing with Finite R esour c es , v olume 5 of Springer- Briefs in Mathematic al Physics . Springer, 2015. DOI: 10.1007/978-3-319-21891-5 . [37] M. T omamichel, R. Colb ec k, and R. Renner. A fully quan tum asymptotic equipar- tition prop erty . IEEE T r ansactions on Information The ory , 55(12):5840–5847, 2009. DOI: 10.1109/TIT.2009.2032797 . 32 [38] M. T omamic hel and M. Hay ashi. A hierarch y of information quan tities for ﬁnite block length analysis of quan tum tasks. IEEE T r ansactions on Information The ory , 59(11):7693–7710, 2013. DOI: 10.1109/TIT.2013.2276628 . [39] V. V edral, M. B. Plenio, M. A. Rippin, and P . L. Knight. Quantifying entanglemen t. Phys. R ev. L ett. , 78:2275–2279, 1997. DOI: 10.1103/PhysRevLett.78.2275 . [40] K. G. H. V ollbrech t and R. F. W erner. En tanglemen t measures under symmetry . Phys. R ev. A , 64:062307, 2001. DOI: 10.1103/PhysRevA.64.062307 . [41] M. M. Wilde, A. Winter, and D. Y ang. Strong con v erse for the classical capacity of en tanglemen t- breaking and Hadamard c hannels via a sandwic hed R ´ enyi relative en trop y . Communic ations in Mathematic al Physics , 331(2):593–622, 2014. DOI: 10.1007/s00220-014-2122-x . [42] A. Winter. Tigh t uniform con tin uit y b ounds for quantum en tropies: Conditional en trop y , relative en trop y distance and energy constrain ts. Communic ations in Mathematic al Physics , 347(1):291– 313, 2016. DOI: 10.1007/s00220-016-2609-8 . [43] M. M. W olf. Quan tum c hannels & op erations: Guided tour, 2012. Lecture notes av ailable at https://www- m5.ma.tum.de/foswiki/pub/M5/Allgemeines/MichaelWolf/ QChannelLecture.pdf . [44] Y.-D. W u and G. Chirib ella. Detecting quantum capacities of con tin uous-v ariable quantum c han- nels. Phys. R ev. R es. , 4:043149, 2022. DOI: 10.1103/PhysRevResearch.4.043149 . 33

Almost-iid information theory

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment