Black hole entropy: the heat kernel method

As part of my ongoing love affair with black holes, I’ve been digging more deeply into what it means for them to have entropy, which of course necessitates investigating how this is assigned in the first place. This is a notoriously confusing issue — indeed, one which lies at the very heart of the firewall paradox — which is further complicated by the fact that there are a priori three distinct physical entropies at play: thermodynamic, entanglement, and gravitational. (Incidentally, lest my previous post on entropy cause confusion, let me stress that said post dealt only with the relation between thermodynamic and information-theoretic a.k.a. Shannon entropy, at a purely classical level: neither entanglement nor gravity played any role there. I also didn’t include the Shannon entropy in the list above, because — as explained in the aforementioned post — this isn’t an objective/physical entropy in the sense of the other three; more on this below.)

My research led me to a review on black hole entanglement entropy by Solodukhin [1], which is primarily concerned with the use of the conical singularity method (read: replica trick) to isolate the divergences that arise whenever one attempts to compute entanglement entropy in quantum field theory. The structure of these divergences turns out to provide physical insight into the nature of this entropy, and sheds some light on the relation to thermodynamic/gravitational entropy as well, so these sorts of calculations are well worth understanding in detail.

While I’ve written about the replica trick at a rather abstract level before, for present purposes we must be substantially more concrete. To that end, the main technical objective of this post is to elucidate one of the central tools employed by these computations, known as the heat kernel method. This is a rather powerful method with applications scattered throughout theoretical physics, notably the calculation of 1-loop divergences and the study of anomalies. Our exposition will mostly follow the excellent pedagogical review by Vassilevich [2]. Before diving into the details however, let’s first review the replica trick à la [1], in order to see how the heat kernel arises in the present context.

Consider some quantum field {\psi(X)} in {d}-dimensional Euclidean spacetime, where {X^\mu=\{\tau,x,z^i\}} with {i=1,\ldots,d\!-\!2}. The Euclidean time {\tau} is related to Minkowski time {t=-i\tau} via the usual Wick rotation, and we have singled out one of the transverse coordinates {x} for reasons which will shortly become apparent. For simplicity, let us consider the wavefunction for the vacuum state, which we prepare by performing the path integral over the lower ({\tau\leq0}) half of Euclidean spacetime with the boundary condition {\psi(\tau=0,x,z)=\psi_0(x,z)}:

\displaystyle \Psi[\psi_0(x,z)]=\int_{\psi(0,x,z)=\psi_0(x,z)}\!\mathcal{D}\psi\,e^{-W[\psi]}~, \ \ \ \ \ (1)

where we have used {W} to denote the Euclidean action for the matter field, so as to reserve {S} for the entropy.

Now, since we’re interested in computing the entanglement entropy of a subregion, let us divide the {\tau=0} surface into two halves, {x\!<\!0} and {x\!>\!0}, by defining a codimension-2 surface {\Sigma} by the condition {x=0}, {\tau=0}. Correspondingly, let us denote the boundary data

\displaystyle \psi_0(x,z)\equiv \begin{cases} \psi_-(x,z) \;\; & x<0\\ \psi_+(x,z) \;\; & x>0~. \end{cases} \ \ \ \ \ (2)

The reduced density matrix that describes the {x\!>\!0} subregion of the vacuum state is then obtained by tracing over the complementary set of boundary fields {\psi_-}. In the Euclidean path integral, tracing over {\psi_-} glues the upper and lower half-spaces together along {x\!<\!0}, so that we integrate over fields on the entire spacetime, but with a cut from {\Sigma} to positive infinity along the {\tau\!=\!0} surface (i.e., along {x\!>\!0}). We must therefore impose boundary conditions for the remaining field {\psi_+} as this cut is approached from above ({\psi_+^1}) and below ({\psi_+^2}). Hence:

\displaystyle \rho(\psi_+^1,\psi_+^2)=\int\!\mathcal{D}\psi_-\Psi[\psi_+^1,\psi_-]\Psi[\psi_+^2,\psi_-]~. \ \ \ \ \ (3)

Formally, this object simply says that the transition elements {\langle\psi_+^2|\rho|\psi_+^1\rangle} are computed by performing the path integral with the specified boundary conditions along the cut.

Unfortunately, explicitly computing the von Neumann entropy {S=-\mathrm{tr}\rho\ln\rho} is an impossible task for all but the very simplest systems. Enter the replica trick. The basic idea is to consider the {n}-fold cover of the above geometry, introduce a conical deficit at the boundary of the cut {\Sigma}, and then differentiate with respect to the deficit angle, whereupon the von Neumann entropy is recovered in the appropriate limit. To see this in detail, it is convenient to represent the {(\tau,x)} subspace in polar coordinates {(r,\phi)}, where {\tau=r\sin\phi} and {x=r\cos\phi}, such that the cut corresponds to {\phi=2\pi k} for {k=1,\ldots,n}. In constructing the {n}-sheeted cover, we glue sheets along the cut such that the fields are smoothly continued from {\psi_+^{1,2}\big|_k} to {\psi_+^{1,2}\big|_{k+1}}. The resulting space is technically a cone, denoted {C_n}, with angular deficit {2\pi(1-n)} at {\Sigma}, on which the partition function for the fields is given by

\displaystyle Z[C_n]=\mathrm{tr}\rho^n~, \ \ \ \ \ (4)

where {\rho^n} is the {n^\mathrm{th}} power of the density matrix (3). At this point in our previous treatment of the replica trick, we introduced the {n^\mathrm{th}} Rényi entropy

\displaystyle S_n=\frac{1}{1-n}\ln\mathrm{tr}\rho^n~, \ \ \ \ \ (5)

which one can think of as the entropy carried by the {n} copies of the chosen subregion, and showed that the von Neumann entropy is recovered in the limit {n\rightarrow1}. Equivalently, we may express the von Neumann entropy directly as

\displaystyle S(\rho)=-(n\partial_n-1)\ln\mathrm{tr}\rho^n\big|_{n=1}~. \ \ \ \ \ (6)


Indeed, carrying out the derivative explicitly, we find

\displaystyle -(n\partial_n-1)\ln\mathrm{tr}\,\rho^n\big|_{n=1} =-\left[\frac{n}{\mathrm{tr}\rho^n}\,\mathrm{tr}\!\left(\rho^n\ln\rho\right)-\ln\mathrm{tr}\rho^n\right]\bigg|_{n=1} =-\mathrm{tr}\left(\rho\ln\rho\right)~, \ \ \ \ \ (7)

where in the last step we took the path integral to be appropriately normalized such that {\mathrm{tr}\rho=1}, whereupon the second term vanishes. Et voilà! As I understand it, the aforementioned “conical singularity method” is essentially an abstraction of the replica trick to spacetimes with conical singularities. Hence, re-purposing our notation, consider the effective action {W[\alpha]=-\ln Z[C_\alpha]} for fields on a Euclidean spacetime with a conical singularity at {\Sigma}. The cone {C_\alpha} is defined, in polar coordinates, by making {\phi} periodic with period {2\pi\alpha}, and taking the limit in which the deficit {(1-\alpha)\ll1}. The entanglement entropy for fields on this background is then

\displaystyle S=(\alpha\partial_\alpha-1)W[\alpha]\big|_{\alpha=1}~. \ \ \ \ \ (8)

As a technical aside: in both these cases, there is of course the subtlety of analytically continuing the parameter {n} resp. {\alpha} to non-integer values. We’ve discussed this issue before in the context of holography, where one surmounts this by instead performing the continuation in the bulk. We shall not digress upon this here, except to note that the construction relies on an abelian (rotational) symmetry {\phi\rightarrow\phi+w}, where {w} is an arbitrary constant. This is actually an important constraint to bear in mind when attempting to infer general physical lessons from our results, but we’ll address this caveat later. Suffice to say that given this assumption, the analytical continuation can be uniquely performed without obstruction; see in particular section 2.7 of [1] for details.
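To make (5) and (6) a little less abstract, here is a minimal numerical sketch for a finite-dimensional density matrix, where the continuation in {n} is trivial because we can work directly with the eigenvalues of {\rho}. The random 4x4 matrix, the seed, and the step size `h` are arbitrary choices of mine for illustration; nothing here touches the field-theoretic subtleties discussed above.

```python
import numpy as np

def _spectrum(rho):
    """Eigenvalues of rho, dropping numerically-zero ones (0 ln 0 = 0)."""
    p = np.linalg.eigvalsh(rho)
    return p[p > 1e-15]

def von_neumann_entropy(rho):
    """S = -tr(rho ln rho), computed directly from the spectrum."""
    p = _spectrum(rho)
    return float(-np.sum(p * np.log(p)))

def log_tr_rho_n(rho, n):
    """ln tr(rho^n), continued to real n via the spectrum of rho."""
    return float(np.log(np.sum(_spectrum(rho) ** n)))

def replica_entropy(rho, h=1e-5):
    """Eq. (6): S = -(n d/dn - 1) ln tr(rho^n) at n = 1 (central difference)."""
    d_log = (log_tr_rho_n(rho, 1 + h) - log_tr_rho_n(rho, 1 - h)) / (2 * h)
    return -(d_log - log_tr_rho_n(rho, 1.0))

def renyi_entropy(rho, n):
    """Eq. (5): the n-th Renyi entropy."""
    return log_tr_rho_n(rho, n) / (1 - n)

# Hypothetical example: a random, normalized 4x4 density matrix.
rng = np.random.default_rng(0)
a = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
rho = a @ a.conj().T
rho /= np.trace(rho).real

s_exact = von_neumann_entropy(rho)
s_replica = replica_entropy(rho)
s_renyi = renyi_entropy(rho, 1 + 1e-6)   # Renyi entropy just above n = 1
print(s_exact, s_replica, s_renyi)
```

All three numbers should agree to within the finite-difference error, which is the content of (6) and of the {n\rightarrow1} limit of (5).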

We have thus obtained, in (8), an expression for the entanglement entropy in terms of the effective action in the presence of a conical singularity. And while this is all well and pretty, in order for this expression to be of any practical use, we require a means of explicitly computing {W}. It is at this point that the heat kernel enters the game. The idea is to represent the Green function — or rather, the (connected) two-point correlator — as an integral over an auxiliary “proper time” {s} of a kernel satisfying the heat equation. This enables one to express the effective action as

\displaystyle W=-\frac{1}{2}\int_0^\infty\!\frac{\mathrm{d} s}{s}K(s,D)~, \ \ \ \ \ (9)

where {K(s,D)} is the (trace of the) heat kernel for the Laplacian operator {D}. We’ll now spend some time unpacking this statement, which first requires that we review some basic facts about Green functions, propagators, and all that.

Consider a linear differential operator {L_x=L(x)} acting on distributions with support on {\mathbb{R}^n}. If {L_x} admits a right inverse, then the latter defines the Green function {G(x,x')} as the solution to the inhomogeneous differential equation

\displaystyle L_xG(x,x')=\delta^n(x-x')~, \ \ \ \ \ (10)

where {x,\,x'} represent vectors in {\mathbb{R}^n}. We may also define the kernel of {L_x} as the solution to the homogeneous differential equation

\displaystyle L_xK(x,x')=0~. \ \ \ \ \ (11)

The Green function is especially useful for solving linear differential equations of the form

\displaystyle L_xu(x)=f(x)~. \ \ \ \ \ (12)

To see this, simply multiply both sides of (10) by {f(x')} and integrate w.r.t {x'}; by virtue of the delta function (and the fact that {L_x} is independent of {x'}), one identifies

\displaystyle u(x)=\int\!\mathrm{d} x'G(x,x')f(x')~. \ \ \ \ \ (13)

The particular boundary conditions we impose on {u(x)} then determine the precise form of {G} (e.g., retarded vs. advanced Green functions in QFT).
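As a quick sanity check of (10) and (13), here is a sketch on a lattice, assuming the simplest operator I could think of: {L_x=-\mathrm{d}^2/\mathrm{d}x^2} on the unit interval with Dirichlet boundary conditions. The grid size and the source {f(x)=\sin\pi x} are arbitrary choices for illustration.

```python
import numpy as np

n = 200                          # number of interior grid points (arbitrary)
h = 1.0 / (n + 1)                # lattice spacing
x = np.linspace(h, 1.0 - h, n)

# L = -d^2/dx^2 with u(0) = u(1) = 0, as a tridiagonal matrix
L = (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / h**2

# Discrete version of (10): L G = identity / h (the lattice delta function)
G = np.linalg.inv(L) / h

# Continuum Green function for these boundary conditions: x_< (1 - x_>)
G_continuum = np.minimum.outer(x, x) * (1.0 - np.maximum.outer(x, x))

# Discrete version of (13): u(x) = \int dx' G(x, x') f(x')
f = np.sin(np.pi * x)
u = h * (G @ f)
u_exact = np.sin(np.pi * x) / np.pi**2   # solves -u'' = sin(pi x)

print(np.max(np.abs(G - G_continuum)), np.max(np.abs(u - u_exact)))
```

Changing the boundary conditions changes the matrix `L` and hence `G`, which is the lattice version of the statement that the boundary conditions on {u} fix the form of the Green function.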

As nicely explained in this Stack Exchange answer, the precise relation to propagators is ambiguous, because physicists use this term to mean either the Green function or the kernel, depending on context. For example, the Feynman propagator {\Delta_F(x-x')} for a scalar field {\phi(x)} is a Green function for the Klein-Gordon operator, since it satisfies the equation

\displaystyle (\square_x+m^2)\Delta_F(x-x')=\delta^n(x-x')~. \ \ \ \ \ (14)

In contrast, the corresponding Wightman functions

\displaystyle G^+(x,x')=\langle0|\phi(x)\phi(x')|0\rangle~,\quad\quad G^-(x,x')=\langle0|\phi(x')\phi(x)|0\rangle~, \ \ \ \ \ (15)

are kernels for this operator, since they satisfy

\displaystyle (\square_x+m^2)G^{\pm}(x,x')=0~. \ \ \ \ \ (16)

The reader is warmly referred to section 2.7 of Birrell & Davies’ classic textbook [3] for a more thorough explanation of Green functions in this context (see also part 1 of my QFT in curved space sequence).

In the present case, we shall be concerned with the heat kernel

\displaystyle K(s;x,y;D)=\langle x|e^{-sD}|y\rangle~, \ \ \ \ \ (17)

where {D} is some Laplacian operator, by which we mean that it admits a local expression of the form

\displaystyle D=-\left(g^{\mu\nu}\partial_\mu\partial_\nu+a^\mu\partial_\mu+b\right) =-\left(g^{\mu\nu}\nabla_\mu\nabla_\nu+E\right)~, \ \ \ \ \ (18)

for some matrix-valued functions {a^\mu~,b}. In the second equality, we’ve written the operator in the so-called canonical form for a Laplacian operator on a vector bundle, where {E} is an endomorphism on the bundle over the manifold, and the covariant derivative {\nabla} includes both the familiar Riemann part as well as the contribution from the gauge (bundle) part; the field strength for the latter will be denoted {\Omega_{\mu\nu}}. Fibre bundles won’t be terribly important for our purposes, but we’ll need some of this notation later; see section 2.1 of [2] for details.

The parameter {s} in (17) is some auxiliary Euclidean time variable (note that if we were to take {s=it}, the right-hand side of (17) would correspond to the transition amplitude {\langle x|U|y\rangle} for some unitary operator {U}). {K} is so-named because it satisfies the heat equation,

\displaystyle (\partial_s+D_x)K(s;x,y;D)=0~, \ \ \ \ \ (19)

with the initial condition

\displaystyle K(0;x,y;D)=\delta(x,y)~, \ \ \ \ \ (20)

where the subscript on {D_x} is meant to emphasize the fact that the operator {D} acts only on the transverse variables, not on the auxiliary time {s}. Our earlier claim that the Green function can be expressed in terms of an integral over the latter is then based on the observation that one can invert (17) to obtain the propagator

\displaystyle D^{-1}=\int_0^\infty\!\mathrm{d} s\,K(s;x,y;D)~. \ \ \ \ \ (21)

Note that here, “propagator” indeed refers to the Green function of the field in the path integral representation, insofar as the latter serves as the generating functional for the former. Bear with me a bit longer as we elucidate this last claim, as this will finally bring us full-circle to (9).
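The proper-time representation (21) is easy to test when {D} is just a positive-definite matrix, since then {e^{-sD}} can be built from the spectral decomposition. The sketch below assumes a random 5x5 symmetric matrix with eigenvalues bounded below by 1 (my choice, so that the integrand decays quickly) and a plain trapezoidal rule on a truncated {s}-interval.

```python
import numpy as np

rng = np.random.default_rng(1)
m = rng.normal(size=(5, 5))
D = m @ m.T + np.eye(5)          # symmetric, all eigenvalues >= 1

lam, v = np.linalg.eigh(D)

def heat_kernel(s):
    """Matrix elements of e^{-sD}, cf. eq. (17), via the spectrum of D."""
    return (v * np.exp(-s * lam)) @ v.T

# Trapezoidal rule on [0, s_max]; the tail beyond s_max is ~ e^{-40}
s_grid = np.linspace(0.0, 40.0, 40001)
ds = s_grid[1] - s_grid[0]
K = np.array([heat_kernel(s) for s in s_grid])
integral = ds * (K.sum(axis=0) - 0.5 * (K[0] + K[-1]))

D_inv = np.linalg.inv(D)
print(np.max(np.abs(integral - D_inv)))
```

Note that `heat_kernel(0.0)` is the identity matrix, which is the finite-dimensional analogue of the initial condition (20).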

Denote the Euclidean path integral for the fields on some fixed background by

\displaystyle Z[J]=\int\!\mathcal{D}\psi\,e^{-W[\psi,J]}~, \ \ \ \ \ (22)

where {J} is the source for the matter field {\psi}. The heat kernel method applies to one-loop calculations, in which case it suffices to expand the action to quadratic order in fluctuations around the classical saddle-point {S_\mathrm{cl}}, whence the Gaussian integral may be written

\displaystyle Z[J]=e^{-S_\mathrm{cl}}\,\mathrm{det}^{-1/2}(D)\,\mathrm{exp}\left(\frac{1}{4}JD^{-1}J\right)~. \ \ \ \ \ (23)

(We’re glossing over some mathematical caveats/assumptions here, notably that {D} be self-adjoint w.r.t. the scalar product of the fields; see [2] for details). Thus we see that taking two functional derivatives w.r.t. the source {J} brings down the operator {D^{-1}}, thereby identifying it with the two-point correlator for {\psi},

\displaystyle G(x,y)=\langle\psi(x)\psi(y)\rangle=\frac{1}{Z[0]}\left.\frac{\delta^2 Z[J]}{\delta J(x)\delta J(y)}\right|_{J=0}=D^{-1}~, \ \ \ \ \ (24)

which is trivially (by virtue of the far r.h.s.) a Green function of {D} in the sense of (10).

We now understand how to express the Green function for the operator {D} in terms of the heat kernel. But we’re after the connected two-point correlator (sometimes called the connected Green function), which encapsulates the one-loop contributions. Recall that the connected Feynman diagrams are generated by the effective action {W} introduced above. After properly normalizing, we have only the piece which depends purely on {D}:

\displaystyle W=\frac{1}{2}\ln\mathrm{det}(D)~. \ \ \ \ \ (25)

Vassilevich [2] then provides a nice heuristic argument that relates this to the heat kernel (as well as a more rigorous treatment from spectral theory, for the less cavalier among you), which relies on the identity

\displaystyle \ln\lambda=-\int_0^\infty\!\frac{\mathrm{d} s}{s}e^{-s\lambda}~, \ \ \ \ \ (26)

for {\lambda>0} (strictly speaking, this identity is only correct up to a constant, but we may normalize away this inconvenience anyway; did I mention I’d be somewhat cavalier?). We then apply this identity to every (positive) eigenvalue {\lambda} of {D}, whence

\displaystyle W=\frac{1}{2}\ln\mathrm{det}(D)=\frac{1}{2}\mathrm{tr}\ln D =-\frac{1}{2}\int_0^\infty\!\frac{\mathrm{d} s}{s}\mathrm{tr}\left(e^{-sD}\right) =-\frac{1}{2}\int_0^\infty\!\frac{\mathrm{d} s}{s}K(s,D)~, \ \ \ \ \ (27)


where we have defined the trace of the heat kernel as

\displaystyle K(s,D)\equiv \mathrm{tr}\left(e^{-sD}\right) =\int\!\mathrm{d}^dx\sqrt{g}\,\langle x|e^{-sD}|x\rangle =\int\!\mathrm{d}^dx\sqrt{g}\,K(s;x,x;D)~. \ \ \ \ \ (28)
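One way to see the cavalier identity (26) at work without worrying about the divergent constant is to compare two operators of the same dimension, so that the {s\rightarrow0} divergence cancels in the difference of their heat traces. The sketch below assumes a made-up four-element spectrum for {D} and takes the identity as the reference operator {D_0}; both are purely illustrative.

```python
import numpy as np

lam = np.array([0.5, 1.3, 2.0, 4.7])   # hypothetical spectrum of D
lam0 = np.ones_like(lam)               # reference operator D_0 = identity

def heat_trace(s, spectrum):
    """K(s, D) = tr e^{-sD} for an array of proper times s."""
    return np.exp(-np.outer(s, spectrum)).sum(axis=1)

# Substitute u = ln s, so that ds/s = du, and use the trapezoidal rule
u = np.linspace(-35.0, 8.0, 43001)
s = np.exp(u)
integrand = heat_trace(s, lam) - heat_trace(s, lam0)
du = u[1] - u[0]
integral = du * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1]))

W_heat = -0.5 * integral               # eq. (27), with D_0 subtracted
W_direct = 0.5 * np.sum(np.log(lam))   # (1/2) ln det D, since ln det D_0 = 0
print(W_heat, W_direct)
```

The subtraction of `lam0` is doing the job of the "normalize away this inconvenience" remark: each eigenvalue on its own would give a logarithmically divergent integral at small {s}.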

Let us pause to take stock: so far, we’ve merely elucidated eq. (9). And while this expression itself is valid in general (not just for manifolds with conical singularities), the physical motivation for this post was the investigation of divergences in the entanglement entropy (8). And indeed, the expression for the effective action (27) is divergent at both limits! In the course of regulating this behaviour, we shall see that the UV divergences in the entropy (8) are captured by the so-called heat kernel coefficients.

To proceed, we shall need the fact that on manifolds without boundaries (or else with suitable local boundary conditions on the fields), {K(s,D)} — really, the self-adjoint operator {D} — admits an asymptotic expansion of the form

\displaystyle K(s,D)=\mathrm{tr}\left(e^{-sD}\right)\simeq (4\pi s)^{-d/2}\sum_{k\geq0}a_k(D)s^{k}~, \ \ \ \ \ (29)

cf. eq. (67) of [1]. A couple of technical remarks are in order. First, recall that in contrast to a convergent series, which gives finite results for arbitrary fixed {s} in the limit {k\rightarrow\infty}, an asymptotic series gives finite results for fixed {k} in the limit {s^{-1}\rightarrow\infty}. Second, we are ignoring various subtleties regarding the rigorous definition of the trace, wherein both (29) and (28) are properly defined via the use of an auxiliary function; cf. eq. (2.21) of [2] (n.b., Vassilevich’s expansion does not pull out the normalization {(4\pi)^{-d/2}} as an overall prefactor; see below).
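The simplest place I know to see (29) in action is the circle: take {D=-\mathrm{d}^2/\mathrm{d}x^2} on a circle of circumference {\ell}, so that {d=1} and the eigenvalues are {(2\pi n/\ell)^2} for integer {n}. There is no curvature, so one expects only the volume term, {K(s,D)\simeq\ell/\sqrt{4\pi s}}, with corrections that are exponentially small in {\ell^2/s} and hence invisible to the power series. The sketch below checks this numerically; the truncation `n_max` and the sample values of {s} are arbitrary choices.

```python
import numpy as np

def heat_trace_circle(s, length, n_max=2000):
    """tr e^{-sD} for D = -d^2/dx^2 on a circle of the given circumference."""
    n = np.arange(-n_max, n_max + 1)
    return float(np.sum(np.exp(-s * (2 * np.pi * n / length) ** 2)))

def leading_term(s, length):
    """(4 pi s)^{-d/2} a_0 with d = 1 and a_0 = length."""
    return length / np.sqrt(4 * np.pi * s)

length = 2 * np.pi
for s in (0.5, 0.1, 0.01):
    exact = heat_trace_circle(s, length)
    print(s, exact, exact - leading_term(s, length))
```

The difference column shrinks faster than any power of {s}, which is a nice illustration of what "asymptotic" means here: the series captures the small-{s} behaviour to all orders in {s}, but not the non-perturbative remainder.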

The most important property of the heat kernel coefficients {a_k} is that they can be expressed as integrals of local invariants—tensor quantities which remain invariant under local diffeomorphisms; e.g., the Riemann curvature tensor and covariant derivatives thereof. Thus the first step in the procedure for calculating the heat kernel coefficients is to write down the integral over all such local invariants; for example, the first three coefficients are

\displaystyle \begin{aligned} a_0(D)=\int_M\!\mathrm{d}^dx\sqrt{g}&\,\alpha_0~,\\ a_1(D)=\frac{1}{6}\int_M\!\mathrm{d}^dx\sqrt{g}\,&\left(\alpha_1E+\alpha_2R\right)~,\\ a_2(D)=\frac{1}{360}\int_M\!\mathrm{d}^dx\sqrt{g}\,&\left(\alpha_3\nabla^2E+\alpha_4RE+\alpha_5E^2+\alpha_6\nabla^2R\right.\\ &\left.+\alpha_7R^2+\alpha_8R_{ij}^2+\alpha_9R_{ijkl}^2+\alpha_{10}\Omega_{ij}^2\right)~, \end{aligned} \ \ \ \ \ (30)

where {E} and {\Omega_{ij}} were introduced in (18). A word of warning, for those of you cross-referencing with Solodukhin [1] and Vassilevich [2]: these coefficients correspond to eqs. (4.13) – (4.15) in [2], except that we have factored the normalization {(4\pi)^{-d/2}} out of the coefficients and into the prefactor of the expansion (29), consistent with [1]. Additionally, note that Vassilevich’s coefficients are labeled with even integers, while ours/Solodukhin’s include both even and odd; cf. (2.21) in [2]. The reason for this discrepancy is that all odd coefficients in Vassilevich’s original expansion vanish, as a consequence of the fact that there are no odd-dimensional invariants on manifolds without boundary; Solodukhin has simply relabeled the summation index for cleanliness, and we have followed his convention in (30).

It now remains to calculate the constants {\alpha_i}. This is a rather involved technical procedure, but is explained in detail in section 4.1 of [2]. One finds

\displaystyle \begin{aligned} \alpha_0=1~,\;\; \alpha_1=6~,\;\; \alpha_2=1~,\;\; \alpha_3=60~,\;\; \alpha_4=60~,\;\; \alpha_5=180~,\\ \alpha_6=12~,\;\; \alpha_7=5~,\;\; \alpha_8=-2~,\;\; \alpha_9=2~,\;\; \alpha_{10}=30~. \end{aligned} \ \ \ \ \ (31)

Substituting these into (30), and doing a bit of rearranging, we have

\displaystyle \begin{aligned} a_0(D)&=\int_M1~,\\ a_1(D)&=\int_M\left(E+\tfrac{1}{6}R\right)~,\\ a_2(D)&=\int_M\left[\tfrac{1}{180}R_{ijkl}^2-\tfrac{1}{180}R_{ij}^2+\tfrac{1}{6}\nabla^2\left(E+\tfrac{1}{5}R\right)+\tfrac{1}{2}\left(E+\tfrac{1}{6}R\right)^2\right]~, \end{aligned} \ \ \ \ \ (32)

where we have suppressed the integration measure for compactness, i.e., {\int_M=\int_M\mathrm{d}^dx\sqrt{g}\,}; we have also set the gauge field strength {\Omega_{ij}=0} for simplicity, since we will consider only free scalar fields below. These correspond to what Solodukhin refers to as regular coefficients, cf. his eq. (69). If one is working on a background with conical singularities, then there are additional contributions from the singular surface {\Sigma} [1]:

\displaystyle \begin{aligned} a_0^\Sigma(D)=&0~,\\ a_1^\Sigma(D)=&\frac{\pi}{3}\frac{(1-\alpha)(1+\alpha)}{\alpha}\int_\Sigma1~,\\ a_2^\Sigma(D)=&\frac{\pi}{3}\frac{(1-\alpha)(1+\alpha)}{\alpha}\int_\Sigma\left(E+\tfrac{1}{6}R\right)\\ &-\frac{\pi}{180}\frac{(1-\alpha)(1+\alpha)(1+\alpha^2)}{\alpha^3}\int_\Sigma\left(R_{ii}-2R_{ijij}\right)~, \end{aligned} \ \ \ \ \ (33)

where in the last expression {R_{ii}=R_{\mu\nu}n^\mu_in^\nu_i} and {R_{ijij}=R_{\mu\nu\rho\sigma}n^\mu_in^\nu_jn^\rho_in^\sigma_j}, where {n_k=n^\mu_k\partial_\mu} are orthonormal vectors orthogonal to {\Sigma}. In this case, if the manifold {M} in (32) is the {n}-fold cover {C_\alpha} constructed above, then the Riemannian curvature invariants are actually computed on the regular points {C_\alpha\setminus\Sigma}, and are related to their flat counterparts by eq. (55) of [1]. Of course, here and in (33), {\alpha} refers to the conical deficit {2\pi(1-\alpha)}, not to be confused with the constants {\alpha_i} in (31).
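Before moving on, it’s reassuring to test the regular coefficients (32) on a smooth example with no conical singularity at all, so that the surface terms (33) play no role. A convenient choice (mine, not one used in [1] or [2]) is the scalar Laplacian on the unit two-sphere: {d=2}, {E=0}, {R=2}, {R_{ij}^2=2}, {R_{ijkl}^2=4}, with eigenvalues {l(l+1)} of degeneracy {2l+1}. The sketch below compares the exact heat trace with the first three terms of (29); the cutoff `l_max` is an arbitrary truncation.

```python
import numpy as np

def heat_trace_sphere(s, l_max=400):
    """tr e^{-sD} for D = -nabla^2 on the unit S^2."""
    l = np.arange(l_max + 1)
    return float(np.sum((2 * l + 1) * np.exp(-s * l * (l + 1))))

def heat_trace_expansion(s, order=2):
    """(4 pi s)^{-d/2} sum_k a_k s^k with the coefficients (32) on unit S^2."""
    area = 4 * np.pi
    R, Rij_sq, Rijkl_sq = 2.0, 2.0, 4.0      # curvature invariants, E = 0
    a = [
        area,                                             # a_0
        area * (R / 6.0),                                 # a_1
        area * (Rijkl_sq / 180 - Rij_sq / 180 + 0.5 * (R / 6.0) ** 2),  # a_2
    ]
    return sum(a[k] * s**k for k in range(order + 1)) / (4 * np.pi * s)

for s in (0.1, 0.05, 0.01):
    exact = heat_trace_sphere(s)
    print(s, exact - heat_trace_expansion(s, 1), exact - heat_trace_expansion(s, 2))
```

Including {a_2} visibly improves the agreement at small {s}, and the total-derivative term {\nabla^2(E+\tfrac{1}{5}R)} drops out here because the curvature is constant.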

Finally, we are in position to consider some of the physical applications of this method discussed in the introduction to this post. As a warm-up to the entanglement entropy of black holes, let’s first take the simpler case of flat space. Despite the lack of conical singularities in this completely regular, seemingly boring spacetime, the heat kernel method above can still be used to calculate the leading UV divergences in the entanglement entropy. While in this very simple case, there are integral identities that make the expansion into heat kernel coefficients unnecessary (specifically, the Sommerfeld formula employed in section 2.9 of [1]), the conical deficit method is more universal, and will greatly facilitate our treatment of the black hole below.

Consider a free scalar field with {D=-(\nabla^2+X)}, where {X} is some scalar function that plays the role of {E} in (18) (e.g., for a massive non-interacting scalar, {X=-m^2}), on some background spacetime {E_\alpha} in {d>2} dimensions, with a conical deficit at the codimension-2 surface {\Sigma}. The leading UV divergence (we don’t care about the regular piece) in the entanglement entropy across this {(d\!-\!2)}-dimensional surface may be calculated directly from the coefficient {a_1^\Sigma} above. To this order in the expansion, the relevant part of {W} is

\displaystyle \begin{aligned} W[\alpha]&\simeq-\frac{1}{2}\int_{\epsilon^2}^\infty\!\frac{\mathrm{d} s}{s}(4\pi s)^{-d/2}\left(a_0^\Sigma+a_1^\Sigma s\right) =-\frac{\pi}{6}\frac{(1-\alpha)(1+\alpha)}{\alpha}\int_{\epsilon^2}^{\infty}\!\frac{\mathrm{d} s}{(4\pi s)^{d/2}}\int_\Sigma1\\ &=\frac{-1}{12(d\!-\!2)(4\pi)^{(d-2)/2}}\frac{A(\Sigma)}{\epsilon^{d-2}}\frac{(1-\alpha)(1+\alpha)}{\alpha}~, \end{aligned} \ \ \ \ \ (34)

where we have introduced the UV-cutoff {\epsilon} (which appears as {\epsilon^2} in the lower limit of integration, to make the dimensions of the auxiliary time variable work out; this is perhaps clearest by examining eq. (1.12) of [2]) and the area of the surface {A(\Sigma)=\int_\Sigma\,}. Substituting this expression for the effective action into eq. (8) for the entropy, we obtain

\displaystyle S_\mathrm{flat}=\frac{1}{6(d\!-\!2)(4\pi)^{(d-2)/2}}\frac{A(\Sigma)}{\epsilon^{d-2}}~, \ \ \ \ \ (35)

which is eq. (81) in [1]. As explained therein, the reason this matches the flat space result — that is, the case in which {\Sigma} is a flat plane — is because even in curved spacetime, any surface can be locally approximated by flat Minkowski space. In particular, we’ll see that this result remains the leading-order correction to the black hole entropy, because the near-horizon region is approximately Rindler (flat). In other words, this result is exact for flat space (hence the equality), but provides only the leading-order divergence for more general, curved geometries.
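The step from (34) to (35) is just the operator {(\alpha\partial_\alpha-1)} acting on {(1-\alpha)(1+\alpha)/\alpha} at {\alpha=1}, which gives a factor of {-2}. For anyone who prefers to see this done by brute force, here is a small sketch using a central difference; the function names, the step size `h`, and the sample values of {d}, {A(\Sigma)}, and {\epsilon} are all my own placeholders.

```python
import math

def W_flat(alpha, d, area, eps):
    """Leading UV-divergent part of the effective action, eq. (34)."""
    prefactor = -1.0 / (12 * (d - 2) * (4 * math.pi) ** ((d - 2) / 2))
    return prefactor * area / eps ** (d - 2) * (1 - alpha) * (1 + alpha) / alpha

def entropy_from_W(W, h=1e-6):
    """Eq. (8): S = (alpha d/dalpha - 1) W at alpha = 1 (central difference)."""
    dW = (W(1 + h) - W(1 - h)) / (2 * h)
    return dW - W(1.0)

def S_flat(d, area, eps):
    """Closed-form result, eq. (35)."""
    return area / (6 * (d - 2) * (4 * math.pi) ** ((d - 2) / 2) * eps ** (d - 2))

for d in (3, 4, 6):
    area, eps = 1.0, 0.1                     # hypothetical sample values
    S_num = entropy_from_W(lambda a: W_flat(a, d, area, eps))
    print(d, S_num, S_flat(d, area, eps))
```

Note that {W[\alpha]} itself vanishes at {\alpha=1}, so the entire entropy comes from the derivative term.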

For concreteness, we’ll limit ourselves to four dimensions henceforth, in which the flat space result above is

\displaystyle S_\mathrm{flat}=\frac{A(\Sigma)}{48\pi\epsilon^2}~. \ \ \ \ \ (36)

In the presence of a black hole, there will be higher-order corrections to this expression. In particular, in {d\!=\!4} we also have a log divergence from the {a_2^\Sigma s^2} term:

\displaystyle \begin{aligned} W[\alpha]&=-\frac{1}{2}\int_{\epsilon^2}^\infty\!\frac{\mathrm{d} s}{s}(4\pi s)^{-d/2}\left(a_0^\Sigma+a_1^\Sigma s+a_2^\Sigma s^2+\ldots\right)\\ &=-\frac{1}{32\pi^2}\int_{\epsilon^2}^\infty\!\mathrm{d} s\left(\frac{a_1^\Sigma}{s^2}+\frac{a_2^\Sigma}{s}+O(s^0)\right) \simeq-\frac{1}{32\pi^2}\left(\frac{a_1^\Sigma}{\epsilon^2}-2a_2^\Sigma\ln\epsilon\right)~, \end{aligned} \ \ \ \ \ (37)

where in the last step, we’ve dropped higher-order terms as well as the log IR divergence, since here we’re only interested in the UV part. From (33), we then see that the {a_2^\Sigma} term results in an expression for the UV divergent part of the black hole entanglement entropy of the form

\displaystyle S_\mathrm{ent}=\frac{A(\Sigma)}{48\pi\epsilon^2} -\frac{1}{144\pi}\int_\Sigma\left[6E+R-\frac{1}{5}\left(R_{ii}-2R_{ijij}\right)\right]\ln\epsilon~, \ \ \ \ \ (38)

cf. eq. (82) of [1]. Specializing to a particular black hole solution then requires working out the projections of the Ricci and Riemann tensors on the subspace orthogonal to {\Sigma} ({R_{ii}} and {R_{ijij}}, respectively). For the simplest case of a massless, minimally coupled scalar field ({E=X=0}) on a Schwarzschild black hole background, the above yields

\displaystyle S_\mathrm{ent}=\frac{A(\Sigma)}{48\pi\epsilon^2}+\frac{1}{45}\ln\frac{r_+}{\epsilon}~, \ \ \ \ \ (39)

where {r_+} is the horizon radius; see section 3.9.1 of [1] for more details. Note that since the Ricci scalar {R=0}, the logarithmic term represents a purely topological correction to the flat space entropy (36) (in contrast to flat space, the Euler number for a black hole geometry is non-zero). Curvature corrections can still show up in UV-finite terms, of course, but that’s not what we’re seeing here: in this sense the log term is universal.
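For completeness, here is a sketch of the arithmetic behind the coefficient {1/45}. I am assuming (without deriving it here) that on the Schwarzschild horizon {R_{ii}=0} and {R_{ijij}=2/r_+^2} in the curvature conventions of [1], together with {E=0}, {R=0}, and {A(\Sigma)=4\pi r_+^2}; the scale {r_+} inside the logarithm is then supplied on dimensional grounds.

```latex
\begin{aligned}
S_\mathrm{log}
&=-\frac{1}{144\pi}\int_\Sigma\left[-\frac{1}{5}\left(0-2\cdot\frac{2}{r_+^2}\right)\right]\ln\epsilon
 =-\frac{1}{144\pi}\cdot\frac{4}{5r_+^2}\cdot4\pi r_+^2\,\ln\epsilon\\
&=-\frac{16\pi}{720\pi}\ln\epsilon
 =-\frac{1}{45}\ln\epsilon
 \;\longrightarrow\;\frac{1}{45}\ln\frac{r_+}{\epsilon}~.
\end{aligned}
```

The factors of {r_+} cancel between the curvature and the area, which is why the coefficient of the logarithm is a pure number.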

Note that we’ve specifically labeled the entropy in (39) with the subscript “ent” to denote that this is the entanglement entropy due to quantum fields on the classical background. We now come to the confusion alluded to in the opening paragraph of this post, namely, what is the relation between the entanglement entropy of the black hole and either the thermodynamic or gravitational entropies, if any?

Recall that in classical systems, the thermodynamic entropy and the information-theoretic entropy coincide: not merely formally, but ontologically as well. The reason is that the correct probability mass function will be as broadly distributed as possible subject to the physical constraints on the system (equivalently, in the case of statistical inference, whatever partial information we have available). If only the average energy is fixed, then this corresponds to the Boltzmann distribution. Note that this same logic extends to the entanglement entropy as well (modulo certain skeptical reservations to which I alluded before), which is the underlying physical reason why the Shannon and quantum mechanical von Neumann entropies take the same form as well. In simple quantum systems therefore, the entanglement entropy of the fields coincides with this thermodynamic (equivalently, information-theoretic/Shannon) entropy.

More generally however, “thermodynamic entropy” is a statement about the internal microstates of the system, and is quantified by the total change in the free energy {F=-\beta^{-1}\ln Z[\beta,g]} w.r.t. temperature {T=\beta^{-1}}:

\displaystyle S_\mathrm{thermo}=-\frac{\mathrm{d} F}{\mathrm{d} T}=\beta^2\frac{\mathrm{d} F}{\mathrm{d} \beta} =\left(\beta\frac{\mathrm{d}}{\mathrm{d}\beta}-1\right)W_\mathrm{tot}[\beta,g]~, \ \ \ \ \ (40)

where the total effective action {W_\mathrm{tot}[\beta,g]=-\ln Z[\beta,g]}. Crucially, observe that here, in contrast to our partition function (22) for fields on a fixed background, the total Euclidean path integral that prepares the state also includes an integration over the metrics:

\displaystyle Z[\beta,g]=\int\!\mathcal{D} g_{\mu\nu}\,\mathcal{D}\psi\,e^{-W_\mathrm{gr}[g]-W_\mathrm{mat}[\psi,g]}~, \ \ \ \ \ (41)

where {W_\mathrm{gr}} represents the gravitational part of the action (e.g., the Einstein-Hilbert term), and {W_\mathrm{mat}} the contribution from the matter fields. (For reference, we’re following the reasoning and notation in section 4.1 of [1] here).

In the information-theoretic context above, introducing a black hole amounts to imposing additional conditions on (i.e., more information about) the system. Specifically, it imposes constraints on the class of metrics in the Euclidean path integral: the existence of a fixed point {\Sigma} of the isometry generated by the Killing vector {\partial_\tau} in the highly symmetric case of Schwarzschild, and suitable asymptotic behaviour at large radius. Hence in computing this path integral, one first performs the integration over matter fields {\psi} on backgrounds with a conical singularity at {\Sigma}:

\displaystyle \int\!\mathcal{D}\psi\,e^{-W_\mathrm{mat}[\psi,g]}=e^{-W[\beta,g]}~, \ \ \ \ \ (42)

where on the r.h.s., {W[\beta,g]} represents the effective action for the fields described above; note that the contribution from entanglement entropy is entirely encoded in this portion. The path integral (41) now looks like this:

\displaystyle Z[\beta,g]=\int\!\mathcal{D} g\,e^{-W_\mathrm{gr}[g]-W[\beta,g]} \simeq e^{-W_\mathrm{tot}[\beta,\,g(\beta)]}~, \ \ \ \ \ (43)

where {W_\mathrm{tot}} on the far right is the semiclassical effective action obtained from the saddle-point approximation. That is, the metric {g_{\mu\nu}(\beta)} is the solution to

\displaystyle \frac{\delta W_\mathrm{tot}[\beta,g]}{\delta g}=0~, \qquad\mathrm{with}\qquad W_\mathrm{tot}=W_\mathrm{gr}+W \ \ \ \ \ (44)

at fixed {\beta}. Since the saddle-point returns an on-shell action, {g_{\mu\nu}(\beta)} is a regular metric (i.e., without conical singularities). One can think of this as the equilibrium geometry at fixed {\beta} around which the metric {g} fluctuates due to the quantum corrections represented by the second term {W}. Note that the latter represents an off-shell contribution, since it is computed on singular backgrounds which do not satisfy the equations of motion for the metric (44).

To compute the thermodynamic entropy of the black hole, we now plug this total effective action into (40). A priori, this expression involves a total derivative w.r.t. {\beta}, so that we write

\displaystyle S_\mathrm{thermo}= \beta\left(\partial_\beta W_\mathrm{tot}[\beta,g]+\frac{\delta g_{\mu\nu}(\beta)}{\delta\beta}\frac{\delta W_\mathrm{tot}[\beta,g]}{\delta g_{\mu\nu}(\beta)}\right) -W_\mathrm{tot}[\beta,g]~, \ \ \ \ \ (45)

except that due to the equilibrium condition (44), the second term in parentheses vanishes anyway, and the thermodynamic entropy is given by

\displaystyle S_\mathrm{thermo}=\left(\beta\partial_\beta-1\right)W_\mathrm{tot}[\beta,g] =\left(\beta\partial_\beta-1\right)\left(W_\mathrm{gr}+W\right) =S_\mathrm{gr}+S_\mathrm{ent}~. \ \ \ \ \ (46)

This, then, is the precise relationship between the thermodynamic, gravitational, and entanglement entropies (at least for the Schwarzschild black hole). The thermodynamic entropy {S_\mathrm{thermo}} is a statement about the possible internal microstates at equilibrium — meaning, states which satisfy the quantum-corrected Einstein equations (44) — which therefore includes the entropy from possible (regular) metric configurations {S_\mathrm{gr}} as well as the quantum corrections {S_\mathrm{ent}} given by (39).
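Stripped of all the gravitational structure, the operator {(\beta\partial_\beta-1)} acting on {W=-\ln Z} is just the textbook route from a partition function to an entropy. As a toy check with no metric integral and no black hole, the sketch below takes a canonical ensemble over a made-up five-level spectrum and compares the result with the Gibbs/Shannon form {-\sum_ip_i\ln p_i}; the energies, {\beta}, and step size are arbitrary illustrative choices.

```python
import numpy as np

energies = np.array([0.0, 0.7, 1.5, 1.5, 3.2])   # hypothetical spectrum

def W(beta):
    """Effective action W = -ln Z for the canonical ensemble."""
    return -np.log(np.sum(np.exp(-beta * energies)))

def entropy_from_W(beta, h=1e-6):
    """S = (beta d/dbeta - 1) W, with a central-difference derivative."""
    dW = (W(beta + h) - W(beta - h)) / (2 * h)
    return beta * dW - W(beta)

def gibbs_entropy(beta):
    """S = -sum_i p_i ln p_i with Boltzmann weights p_i."""
    p = np.exp(-beta * energies)
    p /= p.sum()
    return -np.sum(p * np.log(p))

beta = 1.3
print(entropy_from_W(beta), gibbs_entropy(beta))
```

The structural similarity with (8), where {\alpha} plays the role of {\beta}, is of course no accident: the conical parameter rescales the period of the Euclidean angle.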

Note that the famous Bekenstein-Hawking entropy {S_\mathrm{BH}} refers only to the gravitational entropy, i.e., {S_\mathrm{BH}=S_\mathrm{gr}}. This is sometimes referred to as the “classical” part, because it represents the tree-level contribution to the path integral result. That is, if we were to restore Planck’s constant, we’d find that {S_\mathrm{gr}} comes with a {1/\hbar} prefactor, while {S_\mathrm{ent}} is order {\hbar^0}. Accordingly, {S_\mathrm{ent}} is often called the first/one-loop quantum correction to the Bekenstein-Hawking entropy. (Confusion warning: the heat kernel method allowed us to compute {S_\mathrm{ent}} itself to one-loop in the expansion of the matter action, i.e., quadratic order in the source {J}, but the entire contribution {S_\mathrm{ent}} appears at one-loop order in the {\hbar}-expansion.)

However, despite the long-standing effort to elucidate this entropy, it’s still woefully unclear what microscopic (read: quantum-gravitational) degrees of freedom are truly being counted. The path integral over metrics makes it tempting to interpret the gravitational entropy as accounting for all possible geometrical configurations that satisfy the prescribed boundary conditions. And indeed, as we have remarked before, this is how one would interpret the Bekenstein entropy in the absence of Hawking’s famous calculation. But insofar as entropy is a physical quantity, the interpretation of {S_\mathrm{gr}} as owing to the equivalence class of geometries is rather unsatisfactory, since in this case the entropy must be associated to a single black hole, whose geometry certainly does not appear to be in any sort of quantum superposition. The situation is even less clear once one takes renormalization into account, whereby (in most situations at least) the gravitational and matter couplings are harmoniously renormalized in such a way that one cannot help but question the fundamental distinction between the two; see [1] for an overview of renormalization in this context.

Furthermore, this entire calculational method relies crucially on the presence of an abelian isometry w.r.t. the Killing vector {\partial_\tau} in Euclidean time, i.e., the rotational symmetry {\phi\rightarrow\phi+w} mentioned above. But the presence of such a Killing isometry is by no means a physical requisite for a given system/region to have a meaningful entropy; things are simply (fiendishly!) more difficult to calculate in systems without such high degrees of symmetry. However, this means that cases in which all three of these entropies can be so cleanly assigned to the black hole horizon may be quite rare. Evaporating black holes are perhaps the most topical example of a non-static geometry that causes headaches. As a specific example in this vein, Netta Engelhardt and collaborators have compellingly argued that it is the apparent horizon, rather than the event horizon, to which one can meaningfully associate a coarse-grained entropy à la Shannon [4,5]. Thus, while we’re making exciting progress, even the partial interpretation for which we’ve labored above should be taken with caution. There is more work to be done!


  1. S. N. Solodukhin, “Entanglement entropy of black holes,” arXiv:1104.3712.
  2. D. V. Vassilevich, “Heat kernel expansion: User’s manual,” arXiv:hep-th/0306138.
  3. N. D. Birrell and P. C. W. Davies, “Quantum Fields in Curved Space,” Cambridge Monographs on Mathematical Physics.
  4. N. Engelhardt, “Entropy of pure state black holes,” Talk given at the Quantum Gravity and Quantum Information workshop at CERN, March 2019.
  5. N. Engelhardt and A. C. Wall, “Decoding the Apparent Horizon: Coarse-Grained Holographic Entropy,” arXiv:1706.02038.

4 Responses to Black hole entropy: the heat kernel method

  1. nueww says:

    First, congrats on your awesome blog (so many themes for which I profoundly share your enthusiasm but not your expertise);
    second, congrats squared on your awesome research papers (esp. your comments on BH interiors & modular inclusions);
    third, and I’m now coming to the real point of my comment: I would really, really like to read your thoughts on the latest paper by Geoffrey Penington (@GQFI_MPI told me you were in the process of reading it), which looks so promising to me.
    Some insights on the closely related paper by Ahmed Almheiri, Netta Engelhardt, Donald Marolf & Henry Maxfield would be very much appreciated, esp. regarding their more careful conclusions about the resolution of the paradox(es). Are the problems they noticed there relevant for Penington’s construction, or has he circumvented them?
    Lastly, can you already see a clear link with your work on modular inclusions?

    Thanks again for what you have already accomplished.


    • Thanks for your interest!

      Indeed, as @GQFI_MPI alluded, I was studying these papers in preparation for a talk I recently gave on black hole interiors in Kyoto. Since this was mostly concerned with the question of state dependence, I was primarily focused on understanding precisely what various authors mean by this phrase, and have finally finished collecting my thoughts into a blog post on the topic: Black hole interiors, state dependence, and all that.

      The papers by Penington and Almheiri et al. are closely related, though personally I find the latter to be both clearer and more precise. I don’t think either solves the firewall paradox, but I’m still struggling to understand various aspects of their constructions, so I’m afraid I don’t have much more to say here beyond what I wrote in the aforementioned post. However, Henry Maxfield will give a talk about the latter paper in August as part of our group’s virtual seminar series, so be sure to check our YouTube channel afterwards if you’re interested!

      Regarding the link to my work on modular inclusions: Almheiri et al. did in fact make an interesting remark in their Discussion as to whether the spacetime in the gap between the left and right wedges may be understood to emerge from the entanglement with the bath. This is very similar to the emergent spacetime picture I presented in my paper, where what they call the “gap” corresponds to the non-trivial centre between the enlarged exterior algebras. I believe the movement of horizons in their model should correspond to the same inclusion structure, and I’d love to understand whether their construction can be phrased precisely in this language.


  2. nueww says:

    I used some elements of your conclusion in my answer to a question about log corrections to BH entropy at PSE, with a link to your blog post. I hope it’s OK with you (if not, I can of course edit my answer).

