This article is about the family of orthogonal polynomials on the real line. For polynomial interpolation on a segment using derivatives, see Hermite interpolation. For the integral transform based on Hermite polynomials, see Hermite transform.
Hermite polynomials were defined by Pierre-Simon Laplace in 1810,[1][2] though in scarcely recognizable form, and studied in detail by Pafnuty Chebyshev in 1859.[3] Chebyshev's work was overlooked, and they were named later after Charles Hermite, who wrote on the polynomials in 1864, describing them as new.[4] They were consequently not new, although Hermite was the first to define the multidimensional polynomials.
Like the other classical orthogonal polynomials, the Hermite polynomials can be defined from several different starting points. Noting from the outset that there are two different standardizations in common use, one convenient method is as follows:
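The "probabilist's Hermite polynomials" are given by
$$\operatorname{He}_n(x) = (-1)^n e^{x^2/2} \frac{d^n}{dx^n} e^{-x^2/2},$$
while the "physicist's Hermite polynomials" are given by
$$H_n(x) = (-1)^n e^{x^2} \frac{d^n}{dx^n} e^{-x^2}.$$
These equations have the form of a Rodrigues formula and can also be written as
$$\operatorname{He}_n(x) = \left(x - \frac{d}{dx}\right)^n \cdot 1, \qquad H_n(x) = \left(2x - \frac{d}{dx}\right)^n \cdot 1.$$
The two definitions are not identical; each is a rescaling of the other:
$$H_n(x) = 2^{n/2} \operatorname{He}_n\left(\sqrt{2}\,x\right), \qquad \operatorname{He}_n(x) = 2^{-n/2} H_n\left(\frac{x}{\sqrt{2}}\right).$$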
These are Hermite polynomial sequences of different variances; see the material on variances below.
The nth-order Hermite polynomial is a polynomial of degree n. The probabilist's version $\operatorname{He}_n$ has leading coefficient 1, while the physicist's version $H_n$ has leading coefficient $2^n$.
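For orientation, the first few polynomials in each convention are listed below. They follow from the standard definitions of $\operatorname{He}_n$ and $H_n$ and illustrate the leading coefficients 1 and $2^n$ mentioned above.

```latex
\begin{aligned}
\operatorname{He}_0(x) &= 1,               & H_0(x) &= 1, \\
\operatorname{He}_1(x) &= x,               & H_1(x) &= 2x, \\
\operatorname{He}_2(x) &= x^2 - 1,         & H_2(x) &= 4x^2 - 2, \\
\operatorname{He}_3(x) &= x^3 - 3x,        & H_3(x) &= 8x^3 - 12x, \\
\operatorname{He}_4(x) &= x^4 - 6x^2 + 3,  & H_4(x) &= 16x^4 - 48x^2 + 12.
\end{aligned}
```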
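$H_n(x)$ and $\operatorname{He}_n(x)$ are nth-degree polynomials for n = 0, 1, 2, 3, .... These polynomials are orthogonal with respect to the weight function (measure)
$$w(x) = e^{-x^2/2} \quad (\text{for } \operatorname{He}_n)$$
or
$$w(x) = e^{-x^2} \quad (\text{for } H_n),$$
i.e., we have
$$\int_{-\infty}^{\infty} H_m(x) H_n(x)\, w(x)\, dx = 0 \quad \text{for all } m \neq n.$$
The corresponding normalizations are
$$\int_{-\infty}^{\infty} H_m(x) H_n(x)\, e^{-x^2}\, dx = \sqrt{\pi}\, 2^n n!\, \delta_{nm}, \qquad \int_{-\infty}^{\infty} \operatorname{He}_m(x) \operatorname{He}_n(x)\, e^{-x^2/2}\, dx = \sqrt{2\pi}\, n!\, \delta_{nm}.$$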
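The Hermite polynomials (probabilist's or physicist's) form an orthogonal basis of the Hilbert space of functions satisfying
$$\int_{-\infty}^{\infty} |f(x)|^2\, w(x)\, dx < \infty,$$
in which the inner product is given by the integral
$$\langle f, g \rangle = \int_{-\infty}^{\infty} f(x)\, \overline{g(x)}\, w(x)\, dx,$$
which includes the Gaussian weight function w(x) defined in the preceding section.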
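The orthogonality of the probabilist's polynomials under the Gaussian weight can be checked exactly in integer arithmetic. The sketch below is illustrative only: the helper names are hypothetical, the polynomials are built from the recurrence $\operatorname{He}_{n+1}(x) = x\operatorname{He}_n(x) - \operatorname{He}_n'(x)$, and the integral is normalized by $1/\sqrt{2\pi}$ so that it reduces to standard normal moments, for which the known value $(k-1)!!$ for even k is assumed.

```python
def he_coeffs(n):
    """Coefficients [c0, c1, ..., cn] of He_n, lowest degree first."""
    c = [1]  # He_0 = 1
    for _ in range(n):
        shifted = [0] + c                               # x * He_k
        deriv = [i * c[i] for i in range(1, len(c))]    # He_k'
        c = [s - (deriv[i] if i < len(deriv) else 0)
             for i, s in enumerate(shifted)]
    return c

def gauss_moment(k):
    """E[x^k] for a standard normal variable: 0 for odd k, (k-1)!! for even k."""
    if k % 2:
        return 0
    m = 1
    for j in range(1, k, 2):
        m *= j
    return m

def inner(m, n):
    """(1/sqrt(2*pi)) * integral of He_m(x) He_n(x) exp(-x^2/2) dx, computed exactly."""
    a, b = he_coeffs(m), he_coeffs(n)
    return sum(ai * bj * gauss_moment(i + j)
               for i, ai in enumerate(a)
               for j, bj in enumerate(b))
```

With this normalization, `inner(m, n)` should vanish for m ≠ n and equal n! for m = n.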
An orthogonal basis for $L^2(\mathbb{R}, w(x)\,dx)$ is a complete orthogonal system. For an orthogonal system, completeness is equivalent to the fact that the zero function is the only function $f \in L^2(\mathbb{R}, w(x)\,dx)$ orthogonal to all functions in the system.
Since the linear span of the Hermite polynomials is the space of all polynomials, one has to show (in the physicist's case) that if f satisfies
$$\int_{-\infty}^{\infty} f(x)\, x^n e^{-x^2}\, dx = 0$$
for every n ≥ 0, then f = 0.
One possible way to do this is to observe that the entire function
$$F(z) = \int_{-\infty}^{\infty} f(x)\, e^{zx - x^2}\, dx = \sum_{n=0}^{\infty} \frac{z^n}{n!} \int_{-\infty}^{\infty} f(x)\, x^n e^{-x^2}\, dx = 0$$
vanishes identically. The fact that F(it) = 0 for every real t then means that the Fourier transform of $f(x)e^{-x^2}$ is 0, hence f is 0 almost everywhere. Variants of the above completeness proof apply to other weights with exponential decay.
In the Hermite case, it is also possible to prove an explicit identity that implies completeness (see section on the Completeness relation below).
An equivalent formulation of the fact that Hermite polynomials are an orthogonal basis for $L^2(\mathbb{R}, w(x)\,dx)$ consists in introducing Hermite functions (see below) and in saying that the Hermite functions are an orthonormal basis for $L^2(\mathbb{R})$.
The probabilist's Hermite polynomials are solutions of the differential equation
$$\left(e^{-x^2/2} u'\right)' + \lambda e^{-x^2/2} u = 0,$$
where λ is a constant. Imposing the boundary condition that u should be polynomially bounded at infinity, the equation has solutions only if λ is a non-negative integer, and the solution is uniquely given by $u(x) = C_1 \operatorname{He}_\lambda(x)$, where $C_1$ denotes a constant.
Rewriting the differential equation as an eigenvalue problem
$$L[u] = u'' - x u' = -\lambda u,$$
the Hermite polynomials $\operatorname{He}_\lambda(x)$ may be understood as eigenfunctions of the differential operator $L[u]$. This eigenvalue problem is called the Hermite equation, although the term is also used for the closely related equation
$$u'' - 2x u' = -2\lambda u,$$
whose solution is uniquely given in terms of physicist's Hermite polynomials in the form $u(x) = C_1 H_\lambda(x)$, where $C_1$ denotes a constant, after imposing the boundary condition that u should be polynomially bounded at infinity.
The general solutions to the above second-order differential equations are in fact linear combinations of both Hermite polynomials and confluent hypergeometric functions of the first kind. For example, for the physicist's Hermite equation
$$u'' - 2x u' + 2\lambda u = 0,$$
the general solution takes the form
$$u(x) = C_1 H_\lambda(x) + C_2 h_\lambda(x),$$
where $C_1$ and $C_2$ are constants, $H_\lambda(x)$ are physicist's Hermite polynomials (of the first kind), and $h_\lambda(x)$ are physicist's Hermite functions (of the second kind). The latter functions are compactly represented as $h_\lambda(x) = {}_1F_1\left(-\tfrac{\lambda}{2}; \tfrac{1}{2}; x^2\right)$, where ${}_1F_1(a; b; z)$ are confluent hypergeometric functions of the first kind. The conventional Hermite polynomials may also be expressed in terms of confluent hypergeometric functions; see below.
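The sequence of probabilist's Hermite polynomials also satisfies the recurrence relation
$$\operatorname{He}_{n+1}(x) = x \operatorname{He}_n(x) - \operatorname{He}_n'(x).$$
Writing $\operatorname{He}_n(x) = \sum_{k=0}^{n} a_{n,k} x^k$, the individual coefficients are related by the following recursion formula:
$$a_{n+1,k} = \begin{cases} -(k+1)\, a_{n,k+1} & k = 0, \\ a_{n,k-1} - (k+1)\, a_{n,k+1} & k > 0, \end{cases}$$
and $a_{0,0} = 1$, $a_{1,0} = 0$, $a_{1,1} = 1$.
For the physicist's polynomials, assuming
$$H_n(x) = \sum_{k=0}^{n} a_{n,k} x^k,$$
we have
$$H_{n+1}(x) = 2x H_n(x) - H_n'(x).$$
Individual coefficients are related by the following recursion formula:
$$a_{n+1,k} = \begin{cases} -a_{n,k+1} & k = 0, \\ 2 a_{n,k-1} - (k+1)\, a_{n,k+1} & k > 0, \end{cases}$$
and $a_{0,0} = 1$, $a_{1,0} = 0$, $a_{1,1} = 2$.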
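The coefficient recursions for both conventions can be sketched in a few lines. The function name and the `physicist` flag are hypothetical; the code assumes the standard recurrences $\operatorname{He}_{n+1} = x\operatorname{He}_n - \operatorname{He}_n'$ and $H_{n+1} = 2xH_n - H_n'$, applied to coefficient lists stored lowest degree first.

```python
def hermite_coeffs(n, physicist=False):
    """Integer coefficients [a_{n,0}, ..., a_{n,n}] of He_n (default) or H_n."""
    scale = 2 if physicist else 1
    c = [1]  # degree-0 polynomial in either convention
    for _ in range(n):
        # Multiply by x (probabilist) or 2x (physicist): shift up one degree.
        nxt = [0] + [scale * a for a in c]
        # Subtract the derivative: the term a_k x^k contributes k*a_k to degree k-1.
        for k in range(1, len(c)):
            nxt[k - 1] -= k * c[k]
        c = nxt
    return c
```

For example, `hermite_coeffs(4)` gives the coefficients of $x^4 - 6x^2 + 3$, and `hermite_coeffs(4, physicist=True)` those of $16x^4 - 48x^2 + 12$.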
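The Hermite polynomials constitute an Appell sequence, i.e., they are a polynomial sequence satisfying the identity
$$\operatorname{He}_n'(x) = n \operatorname{He}_{n-1}(x), \qquad H_n'(x) = 2n H_{n-1}(x).$$
An integral recurrence that is derived and proved in [6] is as follows:
$$\operatorname{He}_{n+1}(x) = (n+1) \int_0^x \operatorname{He}_n(t)\, dt - \operatorname{He}_n'(0), \qquad H_{n+1}(x) = 2(n+1) \int_0^x H_n(t)\, dt - H_n'(0).$$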
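The physicist's Hermite polynomials can be written explicitly as
$$H_n(x) = \begin{cases} \displaystyle n! \sum_{l=0}^{n/2} \frac{(-1)^{n/2 - l}}{(2l)!\, \left(\tfrac{n}{2} - l\right)!} (2x)^{2l} & \text{for even } n, \\ \displaystyle n! \sum_{l=0}^{(n-1)/2} \frac{(-1)^{(n-1)/2 - l}}{(2l+1)!\, \left(\tfrac{n-1}{2} - l\right)!} (2x)^{2l+1} & \text{for odd } n. \end{cases}$$
These two equations may be combined into one using the floor function:
$$H_n(x) = n! \sum_{m=0}^{\lfloor n/2 \rfloor} \frac{(-1)^m}{m!\, (n - 2m)!} (2x)^{n-2m}.$$
The probabilist's Hermite polynomials $\operatorname{He}_n$ have similar formulas, which may be obtained from these by replacing the power of $2x$ with the corresponding power of $\sqrt{2}\,x$ and multiplying the entire sum by $2^{-n/2}$:
$$\operatorname{He}_n(x) = n! \sum_{m=0}^{\lfloor n/2 \rfloor} \frac{(-1)^m}{m!\, (n - 2m)!} \frac{x^{n-2m}}{2^m}.$$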
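The combined floor-function form translates directly into code. The sketch below assumes the standard closed forms $H_n(x) = n! \sum_m \frac{(-1)^m}{m!(n-2m)!} (2x)^{n-2m}$ and $\operatorname{He}_n(x) = n! \sum_m \frac{(-1)^m}{m!(n-2m)!} \frac{x^{n-2m}}{2^m}$; the function name is hypothetical. All divisions are exact, so integer arithmetic suffices.

```python
from math import factorial

def hermite_explicit(n, physicist=True):
    """Coefficients (lowest degree first) of H_n or He_n from the closed-form sum."""
    coeffs = [0] * (n + 1)
    for m in range(n // 2 + 1):
        k = n - 2 * m  # power of x carried by this term
        # n! / (m! (n-2m)!) is always an integer, so // is exact here.
        term = (-1) ** m * factorial(n) // (factorial(m) * factorial(k))
        if physicist:
            coeffs[k] = term * 2 ** k    # from (2x)^(n-2m)
        else:
            coeffs[k] = term // 2 ** m   # from x^(n-2m) / 2^m, also exact
    return coeffs
```

Comparing the output against polynomials built from a recurrence is a quick consistency check between the two descriptions.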
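The Hermite polynomials are given by the exponential generating functions
$$e^{xt - t^2/2} = \sum_{n=0}^{\infty} \operatorname{He}_n(x) \frac{t^n}{n!}, \qquad e^{2xt - t^2} = \sum_{n=0}^{\infty} H_n(x) \frac{t^n}{n!}.$$
This equality is valid for all complex values of x and t, and can be obtained by writing the Taylor expansion at x of the entire function $z \mapsto e^{-z^2}$ (in the physicist's case). One can also derive the (physicist's) generating function by using Cauchy's integral formula to write the Hermite polynomials as
$$H_n(x) = (-1)^n e^{x^2} \frac{n!}{2\pi i} \oint_\gamma \frac{e^{-z^2}}{(z - x)^{n+1}}\, dz.$$
Using this in the sum
$$\sum_{n=0}^{\infty} H_n(x) \frac{t^n}{n!},$$
one can evaluate the remaining integral using the calculus of residues and arrive at the desired generating function.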
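The physicist's generating function, assumed here in its standard form $e^{2xt - t^2} = \sum_n H_n(x)\, t^n / n!$, can be checked numerically by truncating the series. This is a minimal sketch with hypothetical helper names; it evaluates $H_n(x)$ with the three-term recurrence $H_{n+1}(x) = 2xH_n(x) - 2nH_{n-1}(x)$, which follows from combining $H_{n+1} = 2xH_n - H_n'$ with $H_n' = 2nH_{n-1}$.

```python
from math import exp, factorial

def h_values(x, nmax):
    """[H_0(x), ..., H_nmax(x)] via H_{n+1} = 2x H_n - 2n H_{n-1}."""
    vals = [1.0, 2.0 * x]
    for n in range(1, nmax):
        vals.append(2.0 * x * vals[n] - 2.0 * n * vals[n - 1])
    return vals[:nmax + 1]

def generating_sum(x, t, nmax=60):
    """Truncated series sum_{n<=nmax} H_n(x) t^n / n!."""
    return sum(h * t ** n / factorial(n)
               for n, h in enumerate(h_values(x, nmax)))

def generating_closed_form(x, t):
    """Closed form exp(2xt - t^2) of the physicist's generating function."""
    return exp(2.0 * x * t - t * t)
```

For moderate |x| and |t| the two functions should agree to near machine precision; the truncation order `nmax` is an arbitrary choice for illustration.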
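The moments of the standard normal (with expected value zero) may be read off directly from the relation for even indices:
$$\operatorname{E}\left[\xi^{2n}\right] = (-1)^n \operatorname{He}_{2n}(0) = (2n - 1)!!,$$
where (2n − 1)!! is the double factorial. Note that the above expression is a special case of the representation of the probabilist's Hermite polynomials as moments:
$$\operatorname{He}_n(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} (x + iy)^n e^{-y^2/2}\, dy.$$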
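The even-moment relation, assumed here in its standard form $(-1)^n \operatorname{He}_{2n}(0) = (2n-1)!!$, is easy to confirm in integer arithmetic. The sketch uses hypothetical helper names and the recurrence $\operatorname{He}_{k+1}(0) = -k\operatorname{He}_{k-1}(0)$, which is the three-term recurrence $\operatorname{He}_{k+1}(x) = x\operatorname{He}_k(x) - k\operatorname{He}_{k-1}(x)$ evaluated at x = 0.

```python
def he_at_zero(n):
    """He_n(0), computed from He_{k+1}(0) = -k * He_{k-1}(0)."""
    prev, cur = 0, 1  # He_{-1}(0) := 0 by convention, He_0(0) = 1
    for k in range(n):
        prev, cur = cur, -k * prev
    return cur

def odd_double_factorial(n):
    """(2n - 1)!! = 1 * 3 * 5 * ... * (2n - 1), with the empty product equal to 1."""
    result = 1
    for j in range(1, 2 * n, 2):
        result *= j
    return result
```

The quantity `(-1) ** n * he_at_zero(2 * n)` should then match `odd_double_factorial(n)`, the 2n-th moment of a standard normal variable.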
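Asymptotically, as n → ∞, the expansion[8]
$$e^{-x^2/2} H_n(x) \sim \frac{2^n}{\sqrt{\pi}}\, \Gamma\left(\frac{n+1}{2}\right) \cos\left(x\sqrt{2n} - \frac{n\pi}{2}\right)$$
holds true. For certain cases concerning a wider range of evaluation, it is necessary to include a factor for changing amplitude:
$$e^{-x^2/2} H_n(x) \sim \frac{2^n}{\sqrt{\pi}}\, \Gamma\left(\frac{n+1}{2}\right) \cos\left(x\sqrt{2n} - \frac{n\pi}{2}\right) \left(1 - \frac{x^2}{2n+1}\right)^{-1/4},$$
which, using Stirling's approximation, can be further simplified, in the limit, to
$$e^{-x^2/2} H_n(x) \sim \left(\frac{2n}{e}\right)^{n/2} \sqrt{2}\, \cos\left(x\sqrt{2n} - \frac{n\pi}{2}\right) \left(1 - \frac{x^2}{2n+1}\right)^{-1/4}.$$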
A better approximation, which accounts for the variation in frequency, is given by
A finer approximation,[9] which takes into account the uneven spacing of the zeros near the edges, makes use of the substitution
with which one has the uniform approximation
Similar approximations hold for the monotonic and transition regions. Specifically, if
then
while for with t complex and bounded, the approximation is
where Ai is the Airy function of the first kind.
Similar to Taylor expansion, some functions are expressible as an infinite sum of Hermite polynomials. Specifically, if $\int e^{-x^2} f(x)^2\, dx < \infty$, then f has an expansion in the physicist's Hermite polynomials.[10]
Given such f, the partial sums of the Hermite expansion of f converge to f in the $L^p$ norm if and only if $4/3 < p < 4$.[11]
The probabilist's Hermite polynomials satisfy the identity
$$\operatorname{He}_n(x) = e^{-D^2/2} x^n,$$
where D represents differentiation with respect to x, and the exponential is interpreted by expanding it as a power series. There are no delicate questions of convergence of this series when it operates on polynomials, since all but finitely many terms vanish.
Since the power-series coefficients of the exponential are well known, and higher-order derivatives of the monomial $x^n$ can be written down explicitly, this differential-operator representation gives rise to a concrete formula for the coefficients of $\operatorname{He}_n$ (and, by rescaling, of $H_n$) that can be used to quickly compute these polynomials.
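A minimal sketch of the differential-operator representation, assuming the standard identity $\operatorname{He}_n(x) = e^{-D^2/2} x^n = \sum_m \frac{(-1/2)^m}{m!} D^{2m} x^n$. The helper names are hypothetical; polynomials are coefficient lists stored lowest degree first, and exact rational arithmetic is used because the intermediate factors $(-1/2)^m / m!$ are not integers.

```python
from fractions import Fraction
from math import factorial

def differentiate(c):
    """Derivative of a polynomial given as a coefficient list (lowest degree first)."""
    return [i * c[i] for i in range(1, len(c))] or [Fraction(0)]

def he_from_operator(n):
    """Coefficients of He_n obtained by applying exp(-D^2/2) to x^n term by term."""
    term = [Fraction(0)] * n + [Fraction(1)]   # the monomial x^n
    result = [Fraction(0)] * (n + 1)
    m = 0
    while any(term):                           # the series terminates: D^(2m) x^n = 0 for 2m > n
        weight = Fraction(-1, 2) ** m / factorial(m)
        for i, a in enumerate(term):
            result[i] += weight * a
        term = differentiate(differentiate(term))  # apply D^2 once more
        m += 1
    return [int(r) for r in result]            # the final coefficients are integers
```

The loop stops after at most ⌊n/2⌋ + 1 passes, which reflects the remark above that all but finitely many terms of the series vanish on polynomials.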
Since the formal expression for the Weierstrass transform W is $e^{D^2}$, we see that the Weierstrass transform of $(\sqrt{2})^n \operatorname{He}_n(x/\sqrt{2})$ is $x^n$. Essentially the Weierstrass transform thus turns a series of Hermite polynomials into a corresponding Maclaurin series.
The existence of some formal power series g(D) with nonzero constant coefficient, such that $\operatorname{He}_n(x) = g(D)\,x^n$, is another equivalent to the statement that these polynomials form an Appell sequence. Since they are an Appell sequence, they are a fortiori a Sheffer sequence.
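From the generating-function representation above, we see that the Hermite polynomials have a representation in terms of a contour integral, as
$$\operatorname{He}_n(x) = \frac{n!}{2\pi i} \oint_C \frac{e^{tx - t^2/2}}{t^{n+1}}\, dt, \qquad H_n(x) = \frac{n!}{2\pi i} \oint_C \frac{e^{2tx - t^2}}{t^{n+1}}\, dt,$$
with the contour encircling the origin.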
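The probabilist's Hermite polynomials defined above are orthogonal with respect to the standard normal probability distribution, whose density function is
$$\frac{1}{\sqrt{2\pi}}\, e^{-x^2/2},$$
which has expected value 0 and variance 1.
Scaling, one may analogously speak of generalized Hermite polynomials[12]
$$\operatorname{He}_n^{[\alpha]}(x)$$
of variance α, where α is any positive number. These are then orthogonal with respect to the normal probability distribution whose density function is
$$\frac{1}{\sqrt{2\pi\alpha}}\, e^{-x^2/(2\alpha)}.$$
They are given by
$$\operatorname{He}_n^{[\alpha]}(x) = \alpha^{n/2} \operatorname{He}_n\left(\frac{x}{\sqrt{\alpha}}\right) = \left(\frac{\alpha}{2}\right)^{n/2} H_n\left(\frac{x}{\sqrt{2\alpha}}\right) = e^{-\alpha D^2/2}\left(x^n\right).$$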
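Now, if
$$\operatorname{He}_n^{[\alpha]}(x) = \sum_{k=0}^{n} h_{n,k}^{[\alpha]} x^k,$$
then the polynomial sequence whose nth term is
$$\left(\operatorname{He}_n^{[\alpha]} \circ \operatorname{He}^{[\beta]}\right)(x) \equiv \sum_{k=0}^{n} h_{n,k}^{[\alpha]} \operatorname{He}_k^{[\beta]}(x)$$
is called the umbral composition of the two polynomial sequences. It can be shown to satisfy the identities
$$\left(\operatorname{He}_n^{[\alpha]} \circ \operatorname{He}^{[\beta]}\right)(x) = \operatorname{He}_n^{[\alpha+\beta]}(x)$$
and
$$\operatorname{He}_n^{[\alpha+\beta]}(x + y) = \sum_{k=0}^{n} \binom{n}{k} \operatorname{He}_k^{[\alpha]}(x) \operatorname{He}_{n-k}^{[\beta]}(y).$$
The last identity is expressed by saying that this parameterized family of polynomial sequences is a cross-sequence. (See the above sections on Appell sequences and on the differential-operator representation, which lead to a ready derivation of it. This binomial-type identity, for α = β = 1/2, has already been encountered in the above section on recursion relations.)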
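Since polynomial sequences form a group under the operation of umbral composition, one may denote by
$$\operatorname{He}_n^{[-\alpha]}(x)$$
the sequence that is inverse to the one similarly denoted, but without the minus sign, and thus speak of Hermite polynomials of negative variance. For α > 0, the coefficients of $\operatorname{He}_n^{[-\alpha]}(x)$ are just the absolute values of the corresponding coefficients of $\operatorname{He}_n^{[\alpha]}(x)$.
These arise as moments of normal probability distributions: the nth moment of the normal distribution with expected value μ and variance $\sigma^2$ is
$$\operatorname{E}\left[X^n\right] = \operatorname{He}_n^{[-\sigma^2]}(\mu),$$
where X is a random variable with the specified normal distribution. A special case of the cross-sequence identity then says that
$$\sum_{k=0}^{n} \binom{n}{k} \operatorname{He}_k^{[\alpha]}(x) \operatorname{He}_{n-k}^{[-\alpha]}(y) = \operatorname{He}_n^{[0]}(x + y) = (x + y)^n.$$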
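The moment interpretation and the cross-sequence special case can both be checked exactly with rational arithmetic. The sketch below assumes the standard generalized polynomials $\operatorname{He}_n^{[\alpha]}(x) = e^{-\alpha D^2/2} x^n = \sum_m \frac{n!}{m!(n-2m)!} \left(-\frac{\alpha}{2}\right)^m x^{n-2m}$, with negative α allowed; the function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def he_var(n, alpha, x):
    """Generalized Hermite polynomial He_n^{[alpha]}(x), evaluated exactly."""
    alpha, x = Fraction(alpha), Fraction(x)
    return sum(
        Fraction(factorial(n), factorial(m) * factorial(n - 2 * m))
        * (-alpha / 2) ** m * x ** (n - 2 * m)
        for m in range(n // 2 + 1)
    )

def cross_sequence_sum(n, alpha, x, y):
    """sum_k C(n,k) He_k^{[alpha]}(x) He_{n-k}^{[-alpha]}(y); expected to equal (x+y)^n."""
    return sum(comb(n, k) * he_var(k, alpha, x) * he_var(n - k, -Fraction(alpha), y)
               for k in range(n + 1))
```

As a usage example, `he_var(4, -s2, mu)` should reproduce the familiar fourth moment $\mu^4 + 6\mu^2\sigma^2 + 3\sigma^4$ of a normal variable with mean `mu` and variance `s2`.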
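One can define the Hermite functions (often called Hermite-Gaussian functions) from the physicist's polynomials:
$$\psi_n(x) = \left(2^n n! \sqrt{\pi}\right)^{-1/2} e^{-x^2/2} H_n(x) = (-1)^n \left(2^n n! \sqrt{\pi}\right)^{-1/2} e^{x^2/2} \frac{d^n}{dx^n} e^{-x^2}.$$
Thus,
$$\sqrt{2(n+1)}\, \psi_{n+1}(x) = \left(x - \frac{d}{dx}\right) \psi_n(x).$$
Since these functions contain the square root of the weight function and have been scaled appropriately, they are orthonormal:
$$\int_{-\infty}^{\infty} \psi_n(x)\, \psi_m(x)\, dx = \delta_{nm},$$
and they form an orthonormal basis of $L^2(\mathbb{R})$. This fact is equivalent to the corresponding statement for Hermite polynomials (see above).
The Hermite functions satisfy the differential equation
$$\psi_n''(x) + \left(2n + 1 - x^2\right) \psi_n(x) = 0.$$
This equation is equivalent to the Schrödinger equation for a harmonic oscillator in quantum mechanics, so these functions are its eigenfunctions.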
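Orthonormality of the Hermite functions can be illustrated numerically. The sketch below is an assumption-laden illustration rather than a reference implementation: it takes the standard normalization $\psi_n(x) = (2^n n! \sqrt{\pi})^{-1/2} e^{-x^2/2} H_n(x)$, evaluates the functions with the normalized recurrence $\psi_{n+1} = \sqrt{2/(n+1)}\, x\, \psi_n - \sqrt{n/(n+1)}\, \psi_{n-1}$ (derived from $H_{n+1} = 2xH_n - 2nH_{n-1}$), and integrates with a plain equally spaced sum on a truncated interval. The helper names, interval width, and step count are arbitrary choices.

```python
from math import exp, pi, sqrt

def hermite_functions(x, nmax):
    """[psi_0(x), ..., psi_nmax(x)] via the normalized three-term recurrence."""
    psi = [pi ** -0.25 * exp(-x * x / 2.0)]
    if nmax >= 1:
        psi.append(sqrt(2.0) * x * psi[0])
    for n in range(1, nmax):
        psi.append(sqrt(2.0 / (n + 1)) * x * psi[n]
                   - sqrt(n / (n + 1)) * psi[n - 1])
    return psi

def inner(m, n, half_width=12.0, steps=4800):
    """Approximate integral of psi_m * psi_n over [-half_width, half_width]."""
    h = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        x = -half_width + i * h
        psi = hermite_functions(x, max(m, n))
        total += psi[m] * psi[n]
    # The integrand is negligible at the endpoints, so this sum matches the trapezoidal rule.
    return total * h
```

For small indices the result should be close to 1 when m = n and close to 0 otherwise. Working with the normalized recurrence avoids the large intermediate values that arise from evaluating $H_n$ and the factorial normalization separately.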
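The Hermite functions $\psi_n(x)$ are a set of eigenfunctions of the continuous Fourier transform $\mathcal{F}$. To see this, take the physicist's version of the generating function and multiply by $e^{-x^2/2}$. This gives
$$e^{-x^2/2 + 2xt - t^2} = \sum_{n=0}^{\infty} e^{-x^2/2} H_n(x) \frac{t^n}{n!}.$$
The Fourier transform of the left side is given by
$$\mathcal{F}\left\{e^{-x^2/2 + 2xt - t^2}\right\}(k) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{-ixk} e^{-x^2/2 + 2xt - t^2}\, dx = e^{-k^2/2 - 2kit + t^2} = \sum_{n=0}^{\infty} e^{-k^2/2} H_n(k) \frac{(-it)^n}{n!}.$$
The Fourier transform of the right side is given by
$$\mathcal{F}\left\{\sum_{n=0}^{\infty} e^{-x^2/2} H_n(x) \frac{t^n}{n!}\right\} = \sum_{n=0}^{\infty} \mathcal{F}\left\{e^{-x^2/2} H_n(x)\right\} \frac{t^n}{n!}.$$
Equating like powers of t in the transformed versions of the left and right sides finally yields
$$\mathcal{F}\left\{e^{-x^2/2} H_n(x)\right\} = (-i)^n e^{-k^2/2} H_n(k).$$
The Hermite functions $\psi_n(x)$ are thus an orthonormal basis of $L^2(\mathbb{R})$, which diagonalizes the Fourier transform operator.[16]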
The Wigner distribution function of the nth-order Hermite function is related to the nth-order Laguerre polynomial. The Laguerre polynomials are
$$L_n(x) := \sum_{k=0}^{n} \binom{n}{k} \frac{(-1)^k}{k!} x^k,$$
leading to the oscillator Laguerre functions
$$l_n(x) := e^{-x/2} L_n(x).$$
For all natural integers n, it is straightforward to see[17] that
where the Wigner distribution of a function x ∈ L2(R, C) is defined as
This is a fundamental result for the quantum harmonic oscillator discovered by Hip Groenewold in 1946 in his PhD thesis.[18] It is the standard paradigm of quantum mechanics in phase space.
There are further relations between the two families of polynomials.
In the Hermite polynomial $\operatorname{He}_n(x)$ of variance 1, the absolute value of the coefficient of $x^k$ is the number of (unordered) partitions of an n-element set into k singletons and (n − k)/2 (unordered) pairs. Equivalently, it is the number of involutions of an n-element set with precisely k fixed points, or in other words, the number of matchings in the complete graph on n vertices that leave k vertices uncovered (indeed, the Hermite polynomials are the matching polynomials of these graphs). The sum of the absolute values of the coefficients gives the total number of partitions into singletons and pairs, the so-called telephone numbers
1, 1, 2, 4, 10, 26, 76, 232, 764, 2620, 9496,... (sequence A000085 in the OEIS).
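The telephone numbers listed above can be reproduced by summing the absolute values of the coefficients of the probabilist's polynomials. This is a small sketch with hypothetical function names; the coefficients are generated with the recurrence $\operatorname{He}_{n+1}(x) = x\operatorname{He}_n(x) - \operatorname{He}_n'(x)$.

```python
def he_coeffs(n):
    """Integer coefficients of He_n, lowest degree first."""
    c = [1]
    for _ in range(n):
        nxt = [0] + c                     # x * He_k
        for k in range(1, len(c)):
            nxt[k - 1] -= k * c[k]        # subtract He_k'
        c = nxt
    return c

def telephone_number(n):
    """Number of partitions of an n-element set into singletons and pairs."""
    return sum(abs(a) for a in he_coeffs(n))

def involutions_with_fixed_points(n, k):
    """Number of involutions of an n-element set with exactly k fixed points."""
    return abs(he_coeffs(n)[k]) if 0 <= k <= n else 0
```

For instance, `involutions_with_fixed_points(4, 2)` is 6, matching the six ways to choose the single swapped pair among four elements.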
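This combinatorial interpretation can be related to complete exponential Bell polynomials as
$$\operatorname{He}_n(x) = B_n(x, -1, 0, \ldots, 0),$$
where $x_i = 0$ for all i > 2.
These numbers may also be expressed as a special value of the Hermite polynomials:[19]
$$T(n) = \frac{\operatorname{He}_n(i)}{i^n}.$$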
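Moreover, the following completeness identity for the above Hermite functions holds in the sense of distributions:
$$\sum_{n=0}^{\infty} \psi_n(x)\, \psi_n(y) = \delta(x - y),$$
where δ is the Dirac delta function, $\psi_n$ the Hermite functions, and δ(x − y) represents the Lebesgue measure on the line y = x in $\mathbb{R}^2$, normalized so that its projection on the horizontal axis is the usual Lebesgue measure.
This distributional identity follows Wiener (1958) by taking u → 1 in Mehler's formula, valid when −1 < u < 1:
$$E(x, y; u) := \sum_{n=0}^{\infty} u^n \psi_n(x)\, \psi_n(y) = \frac{1}{\sqrt{\pi (1 - u^2)}} \exp\left(-\frac{1 - u}{1 + u} \frac{(x + y)^2}{4} - \frac{1 + u}{1 - u} \frac{(x - y)^2}{4}\right),$$
which is often stated equivalently as a separable kernel,[20][21]
$$\sum_{n=0}^{\infty} \frac{H_n(x) H_n(y)}{n!} \left(\frac{u}{2}\right)^n = \frac{1}{\sqrt{1 - u^2}} \exp\left(\frac{2u}{1 + u}\, xy - \frac{u^2}{1 - u^2} (x - y)^2\right).$$
The function (x, y) → E(x, y; u) is the bivariate Gaussian probability density on $\mathbb{R}^2$, which is, when u is close to 1, very concentrated around the line y = x, and very spread out on that line. It follows that
$$\sum_{n=0}^{\infty} u^n \langle f, \psi_n \rangle \langle \psi_n, g \rangle = \iint E(x, y; u)\, f(x)\, \overline{g(y)}\, dx\, dy \to \int f(x)\, \overline{g(x)}\, dx = \langle f, g \rangle$$
when f and g are continuous and compactly supported.
This yields that f can be expressed in Hermite functions as the sum of a series of vectors in $L^2(\mathbb{R})$, namely,
$$f = \sum_{n=0}^{\infty} \langle f, \psi_n \rangle\, \psi_n.$$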
With this representation for Hn(x) and Hn(y), it is evident that
and this yields the desired resolution of the identity result, using again the Fourier transform of Gaussian kernels under the substitution
Tom H. Koornwinder, Roderick S. C. Wong, and Roelof Koekoek et al. (2010) and Abramowitz & Stegun.
Hurtado Benavides, Miguel Ángel (2020). De las sumas de potencias a las sucesiones de Appell y su caracterización a través de funcionales [master's thesis]. Universidad Sergio Arboleda.
In this case, we used the unitary version of the Fourier transform, so the eigenvalues are $(-i)^n$. The ensuing resolution of the identity then serves to define powers, including fractional ones, of the Fourier transform, to wit a fractional Fourier transform generalization, in effect a Mehler kernel.
Folland, G. B. (1989), Harmonic Analysis in Phase Space, Annals of Mathematics Studies, vol. 122, Princeton University Press, ISBN 978-0-691-08528-9.
Laplace, P. S. (1810), "Mémoire sur les intégrales définies et leur application aux probabilités, et spécialement a la recherche du milieu qu'il faut choisir entre les résultats des observations", Mémoires de l'Académie des Sciences: 279–347; Oeuvres complètes 12, pp. 357–412. English translation.
Shohat, J. A.; Hille, Einar; Walsh, Joseph L. (1940), A bibliography on orthogonal polynomials, Bulletin of the National Research Council, Washington D.C.: National Academy of Sciences. Contains 2000 references on Hermite polynomials.