Optical diffraction in close proximity to plane apertures, part 2, comparison of half-plane diffraction theories

Klaus D. Mielenz

The accuracy and physical significance of the classical Rayleigh-Sommerfeld and Kirchhoff diffraction integrals are assessed in the context of Sommerfeld’s rigorous theory of half-plane diffraction and Maxwell’s equations. It is shown that the Rayleigh-Sommerfeld integrals are in satisfactory agreement with Sommerfeld’s theory in most of the positive near zone, except at sub-wavelength distances from the screen. On account of the bidirectional nature of diffraction by metallic screens the Rayleigh-Sommerfeld integrals themselves cannot be used for irradiance calculations, but must first be resolved into their forward and reverse components and it is found that Kirchhoff’s integral is the appropriate measure of the forward irradiance. Because of the inadequate boundary conditions assumed in their derivation the Rayleigh-Sommerfeld and Kirchhoff integrals do not correctly describe the flow of energy through the aperture.

Keywords: bidirectional fields; diffraction; half plane; irradiance; Kirchhoff; Maxwell; metallic screen; near zone; optics; Poynting; Rayleigh; scalar wave functions; Sommerfeld; wave equation.

1. Introduction

In a previous paper (1) this author derived mathematically rigorous expressions for the classical Rayleigh-Sommerfeld and Kirchhoff boundary-value diffraction integrals pertaining to circular apertures and slits illuminated by normally incident plane waves. In spite of their functional differences, these diffraction integrals were found to be surprisingly similar and nearly indistinguishable in most of the near zone. They exhibited significant differences only in the immediate proximity of the aperture, but in this region their physical properties were obscured by the fact that they or their normal derivatives, or both, do not reproduce the assumed incident field. In these circumstances it was not possible to assess their physical significance by merely comparing them to one another. In the present paper, they will be re-examined by applying them to the specific case of diffraction by a reflecting half plane, and their physical properties will be interpreted in the context of Sommerfeld’s (2) rigorous theory of half-plane diffraction and Maxwell’s equations.

2. Comparison of Scalar Wave Functions

The scalar wave functions U discussed in this paper all denote the complex disturbance at a point of observation P(x, y, z) in the diffraction pattern of a perfectly conducting, infinitesimally thin, semi-infinite screen that occupies the half plane x > 0, z = 0 of a Cartesian coordinate system, as depicted in Fig. A1 of Appendix A. The primary field is assumed to be a monochromatic plane wave with irradiance [E.sub.0], wavelength [lambda], and circular wave number k = 2[pi]/[lambda] that is normally incident from the half space z < 0 and is plane polarized so that, in accordance with Maxwell's equations, [partial]U/[partial]z or U are continuous and equal to zero on crossing the screen. The resulting diffraction pattern is independent of y and will be denoted by U = [square root of ([E.sub.0])] u(x, z), so that u is a dimensionless wave function normalized to the amplitude of the incident field.

Sommerfeld’s half-plane theory dates back to the late 1800s and used to be discussed at length in textbooks (3-5). However, it appears to be no longer included in modern curricula of theoretical optics, and therefore its main features are summarized and supplemented by new expressions for the diffracted irradiance in Appendix A, below. On combining Eqs. (A3a-c) and (A8a,b) of Appendix A it follows that, for normally incident light, Sommerfeld’s solution is reduced to

$$u_S^{(p,s)}(x,z) = \overrightarrow{u}_S \mp \overleftarrow{u}_S, \qquad \overrightarrow{u}_S = e^{ikz}\,V(\overrightarrow{\rho}), \qquad \overleftarrow{u}_S = e^{-ikz}\,V(\overleftarrow{\rho}), \tag{1a}$$

$$\overrightarrow{\rho} = \sqrt{2/\lambda}\,\left[-\operatorname{sign}(z)\sqrt{r+x} + \sqrt{r-x}\right], \qquad z \neq 0, \tag{1b}$$

$$\overleftarrow{\rho} = \sqrt{2/\lambda}\,\left[-\operatorname{sign}(z)\sqrt{r+x} - \sqrt{r-x}\right], \qquad z \neq 0, \tag{1c}$$

where $r = \sqrt{x^2 + z^2}$ and $V(\rho)$ is the complex Fresnel-type integral defined by Eq. (A3d). These expressions are rigorously valid everywhere in the xz-plane of Fig. A1, except that along the x-axis $\overrightarrow{\rho}$ and $\overleftarrow{\rho}$ must be evaluated as

$$\overrightarrow{\rho} = 2\sqrt{|x|/\lambda}, \qquad \overleftarrow{\rho} = 2\operatorname{sign}(x)\sqrt{|x|/\lambda}, \qquad z = -0, \tag{1d}$$

$$\overrightarrow{\rho} = -2\operatorname{sign}(x)\sqrt{|x|/\lambda}, \qquad \overleftarrow{\rho} = -2\sqrt{|x|/\lambda}, \qquad z = +0, \tag{1e}$$

where z = [+ or -]0 refers to the positive and negative sides of the screen, respectively. This distinction is necessary because [u.sup.(p).sub.s] and [[partial]u.sup.(s).sub.s]/[partial]z are discontinuous on crossing the screen, and is taken into account in Sommerfeld’s theory by “wrapping” the diffracting half plane in a semi-infinite, two-sided Riemann surface so that its positive and negative sides are distinguished by the values 2[pi] and 0 of the polar angle [PHI] in Fig. A1.

The corresponding results given by the Rayleigh-Sommerfeld theory are obtained from Eqs. (10a,b) of Ref. (1) by suitably modifying the limits of integration, leading to

$$u_{RS}^{(p)}(x,z) = \frac{k}{2}\int_{-\infty}^{-x} d(\xi - x)\,H_0^{(1)}(\beta), \qquad \beta = k\sqrt{(\xi - x)^2 + z^2}, \qquad z > 0, \tag{2a}$$

$$u_{RS}^{(s)}(x,z) = \frac{ik^2 z}{2}\int_{-\infty}^{-x} d(\xi - x)\,\frac{H_1^{(1)}(\beta)}{\beta}, \qquad z > 0, \tag{2b}$$

where [H.sup.(1).sub.n] = [J.sub.n] + [iY.sub.n] are Hankel functions of the first kind and nth order. These expressions are valid for z > 0, only, and will be supplemented in this paper by the assumptions made in their derivation for z [less than or equal to] 0; namely,

$$\frac{\partial u_{RS}^{(p)}(x,+0)}{\partial z} = ik \ \text{or}\ 0, \qquad u_{RS}^{(s)}(x,+0) = 1 \ \text{or}\ 0, \qquad x < 0 \ \text{or}\ x \geq 0, \quad \text{for } z = +0, \tag{2c}$$

and

$$u_{RS}^{(p,s)}(x,z) = e^{ikz} \ \text{or}\ \left(e^{ikz} \mp e^{-ikz}\right), \qquad x < 0 \ \text{or}\ x \geq 0, \quad \text{for } z = -0, \tag{2d}$$

respectively.

Kirchhoff’s diffraction integral, which will be required for the discussion in Sec. 3, is equal to the arithmetic mean of the Rayleigh-Sommerfeld integrals (2a,b),

[u.sub.k] (x,z) = 1/2[[u.sup.(p).sub.RS](x,z) + [u.sup.(s).sub.RS](x,z)], (2e)

and can therefore be easily deduced from the above expressions.

In the paraxial Fresnel approximation where z is positive and large compared to [lambda] and x/z is small all of the above-mentioned solutions converge to the familiar Fresnel limit [u.sub.F]. That is,

$$u_S^{(p,s)}(x,z) \sim u_{RS}^{(p,s)}(x,z) \sim u_K(x,z) \sim u_F(x,z) = e^{ikz}\,V\!\left(-x\sqrt{2/(\lambda z)}\right), \qquad z \gg |x|, \tag{3a}$$

where the right-hand expression follows from Eqs. (1a-c) by letting $\sqrt{r \pm x} \sim \sqrt{z}\,[1 \pm x/(2z)]$, so that $\overrightarrow{\rho} = -x\sqrt{2/(\lambda z)}$, $\overleftarrow{\rho} = -\sqrt{8z/\lambda}$, and $\overleftarrow{u}_S \ll \overrightarrow{u}_S$. (1) The same result is obtained from Eqs. (2a,b) on replacing $H_0^{(1)}$, $H_1^{(1)}$, and $\beta$ by the leading terms of their asymptotic and Taylor expansions. The Fresnel approximation, Eq. (3a), is estimated to be accurate within 1 % for $z \gg 100\lambda$.
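The arguments of V in Eqs. (1b-e) and their paraxial limits can be cross-checked numerically. The following is a minimal sketch, not the author's code: the function names `rho_pair` and `rho_pair_fresnel` are hypothetical, the wavelength defaults to 1 so that lengths are in units of [lambda], and the two sides of the screen plane (z = +0 and z = -0) are represented by IEEE signed zeros as an implementation convenience.

```python
import math

def rho_pair(x, z, lam=1.0):
    """Return (rho_forward, rho_reverse) from Eqs. (1b,c).

    Passing z = +0.0 or z = -0.0 selects the positive or negative side of
    the screen plane, which reproduces the limits of Eqs. (1e) and (1d).
    """
    r = math.hypot(x, z)
    sgn = math.copysign(1.0, z)         # sign(z), also defined for signed zeros
    a = math.sqrt(max(r + x, 0.0))      # clamp guards against rounding below zero
    b = math.sqrt(max(r - x, 0.0))
    scale = math.sqrt(2.0 / lam)
    return scale * (-sgn * a + b), scale * (-sgn * a - b)

def rho_pair_fresnel(x, z, lam=1.0):
    """Paraxial limits quoted after Eq. (3a); meaningful only for z >> |x|."""
    return -x * math.sqrt(2.0 / (lam * z)), -math.sqrt(8.0 * z / lam)

if __name__ == "__main__":
    # Points on both sides of the screen plane, then one deep in the Fresnel region.
    for x, z in [(0.25, -0.0), (-0.25, -0.0), (0.25, 0.0), (1.0, 1000.0)]:
        print(x, z, rho_pair(x, z))
    print(rho_pair_fresnel(1.0, 1000.0))
```

Using signed zeros avoids a separate code path for the x-axis, at the cost of relying on the caller to pass `-0.0` explicitly for the illuminated side.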

For numerical applications it is also useful to know that the above solutions all predict the same value,

[u.sup.(p,s).sub.S] (0,z) = [u.sup.(p,s).sub.RS] (0,z) = [u.sub.k] (0,z)

= [u.sub.F] (0,z) = 1/2 [e.sup.ikz], (3b)

in the positive shadow boundary (x = 0, z > 0). In the case of the Rayleigh-Sommerfeld integrals, Eqs. (2a,b), this result follows from the identity

$$\int_0^{\infty} dt\,H_0^{(1)}\!\left(\sqrt{a^2 + t^2}\right) = e^{ia}, \tag{3c}$$

and was used in this work as the starting value for recursive numerical integrations as described in Ref. (6).

The above expressions for $u_S^{(p,s)}(x,z)$ and $u_{RS}^{(p,s)}(x,z)$ were used to compute the squared magnitudes of these functions in the immediate proximity of the positive and negative sides of the aperture plane, as shown in Figs. 1 and 2. For these computations, Eqs. (2a,b) were evaluated as noted above and the Fresnel sine and cosine integrals required for the computation of $V(\overrightarrow{\rho})$ and $V(\overleftarrow{\rho})$ were evaluated using the algorithms of Ref. (7). The main conclusions drawn from these results are as follows.

(1) On the positive side of the aperture plane the Sommerfeld and Rayleigh-Sommerfeld solutions are surprisingly similar, even at very small distances z. The real and imaginary parts of [u.sup.(p,s).sub.S] and [u.sup.(p,s).sub.RS] contributing to the results plotted in Fig. 1 agree within [+ or -] 1 % or better for z = 0.1[lambda], and additional computations showed that this agreement improves rapidly for larger values of z. It follows that for all practical purposes the Rayleigh-Sommerfeld integrals are adequate for computations throughout the positive near zone, and hence it may be inferred that this will also be the case for the corresponding solutions for circular apertures and slits derived in Ref. (1).

(2) The agreement for negative values of z is unsatisfactory. In Sommerfeld’s theory diffraction manifests itself as a field phenomenon that occurs on both sides of the aperture plane, so that the incident geometrical field is modified before it reaches the screen. On the other hand, in the Rayleigh-Sommerfeld theory diffraction on the source side is explicitly ruled out, and here the results obtained from Sommerfeld’s theory show that the assumed geometrical field (2d) is only a crude approximation of the true field. Thus, the main problem with the Rayleigh-Sommerfeld and Kirchhoff integrals appears to be not so much that they fail to reproduce the assumed geometrical field values, but that the latter are themselves objectionable.

The residual differences between [u.sup.(p,s).sub.RS] and [u.sup.(p,s).sub.S] for z > 0 can be attributed to the imperfect boundary conditions assumed in the Rayleigh-Sommerfeld theory. These boundary values are step functions that violate the wave equation and are the probable cause of the fact, shown in Appendix B, that the Rayleigh-Sommerfeld integrals also do not obey the wave equation in the immediate proximity of the aperture plane. Although this wave-equation failure is small in most of the near zone, and thus unimportant for practical purposes, it is worthwhile to mention that it might be remedied by replacing the boundary values Eq. (2c) with the corresponding values given by Sommerfeld’s theory for z = +0; namely,

$$\frac{1}{ik}\frac{\partial u_S^{(p)}(x,+0)}{\partial z} = 1 \ \text{or}\ 2V\!\left(-2\sqrt{x/\lambda}\right) - \frac{i}{\pi}\sqrt{\frac{\lambda}{2x}}\,e^{i(kx - \pi/4)}, \qquad x < 0 \ \text{or}\ x \geq 0, \tag{4a}$$

$$u_S^{(s)}(x,+0) = 1 \ \text{or}\ 2V\!\left(-2\sqrt{x/\lambda}\right), \qquad x < 0 \ \text{or}\ x \geq 0. \tag{4b}$$

The real and imaginary parts of these functions are plotted in Fig. 3, where it should be noted that [[partial]u.sub.s.sup.(p)]/[partial]z is discontinuous and singular, and [u.sub.s.sup.(s)] is not continuously differentiable, for x = 0. Nonetheless, they constitute improved boundary values because Sommerfeld’s theory obeys the wave equation even at the diffracting edge itself (see Appendix B).

When Eqs. (4a,b) are substituted into the derivation of the Rayleigh-Sommerfeld integrals for the half plane one finds

$$u_S^{(p)}(x,z) \equiv \frac{k}{2}\int_{-\infty}^{\infty} d(\xi - x)\,\frac{1}{ik}\frac{\partial u_S^{(p)}(\xi,+0)}{\partial z}\,H_0^{(1)}(\beta), \qquad z \geq 0, \tag{4c}$$

$$u_S^{(s)}(x,z) \equiv \frac{ik^2 z}{2}\int_{-\infty}^{\infty} d(\xi - x)\,u_S^{(s)}(\xi,+0)\,\frac{H_1^{(1)}(\beta)}{\beta}, \qquad z \geq 0, \tag{4d}$$

where the integration now extends from $-\infty$ to $+\infty$. Because the boundary values Eq. (2c) and Eqs. (4a,b) are the same for x < 0 and the former are zero for x $\geq$ 0, this can be rewritten as

$$u_S^{(p,s)}(x,z) \equiv u_{RS}^{(p,s)}(x,z) + \Delta u_{RS}^{(p,s)}, \tag{4e}$$

where

$$\Delta u_{RS}^{(p)} = u_S^{(p)} - u_{RS}^{(p)} = k\int_{-x}^{\infty} d(\xi - x)\left[V\!\left(-2\sqrt{\xi/\lambda}\right) - \frac{i}{2\pi}\sqrt{\frac{\lambda}{2\xi}}\,e^{i(k\xi - \pi/4)}\right]H_0^{(1)}(\beta), \tag{4f}$$

$$\Delta u_{RS}^{(s)} = u_S^{(s)} - u_{RS}^{(s)} = ik^2 z\int_{-x}^{\infty} d(\xi - x)\,V\!\left(-2\sqrt{\xi/\lambda}\right)\frac{H_1^{(1)}(\beta)}{\beta} \tag{4g}$$

are correction terms that can be added to the Rayleigh-Sommerfeld integrals to convert them to the exact values given by Sommerfeld’s theory. These expressions should be free of errors because Eqs. (4c) and (4d) are rigorous expressions of Helmholtz’ theorem in which [u.sup.(p).sub.s] and [u.sup.(s).sub.s] are the same on both sides of the equal sign.

This method was originally proposed by Braunbek (8-10), who envisioned its use for constructing improved solutions for large apertures that are of finite width and are bounded by straight or even curved edges. Braunbek’s work involved the assumption that [partial][u.sup.(p).sub.s]/[partial]z and [u.sup.(s).sub.s] rapidly become negligibly small on the dark side of the screen, so that the effective ranges of integration in Eqs. (4f,g) are only a few wavelengths wide and approximative methods can be used. According to Fig. 3 this is a valid assumption for [partial][u.sup.(p).sub.s]/[partial]z but not for [u.sup.(s).sub.s], so that computational difficulties could be encountered in the case of [DELTA][u.sup.(s).sub.RS].

3. Irradiance and Energy Flow

Although the squared magnitudes of scalar wave functions are commonly identified with the irradiance of the field, the data plotted in Figs. 1 and 2 must not be interpreted in this manner. The diffracted field specified by Sommerfeld’s solution is a bidirectional field composed of two plane waves, $\overrightarrow{u}_S$ and $\mp\overleftarrow{u}_S$, which propagate in the opposite directions of the incident primary field and its reflection from the screen. When Maxwell’s equations are invoked, as in Eqs. (A5) through (A7) of Appendix A, it is found that in accordance with the principle of interference these waves cannot interfere with one another (2) so that the effective energy flow is composed of mutually incoherent components in the forward and reverse directions. For normally incident light, these respective directions are parallel and anti-parallel to the unit vector n = (0,0,1) in the direction of the positive z-axis, and the final expression for the time-averaged Poynting vector (A7c) is

$$\langle \mathbf{S}_S \rangle = \left[\overrightarrow{E}_S(x,z) - \overleftarrow{E}_S(x,z)\right]\mathbf{n} = E_0\left(|\overrightarrow{u}_S|^2 - |\overleftarrow{u}_S|^2\right)\mathbf{n}, \tag{5}$$

where $\overrightarrow{E}_S$ and $\overleftarrow{E}_S$ are the forward and reverse irradiances incident on the opposite sides of any given area element dx dy containing the point of observation P. (3) These irradiances are given by the squared magnitudes of the basic Sommerfeld functions $\overrightarrow{u}_S$ and $\overleftarrow{u}_S$ themselves, and thus the quantities $|\overrightarrow{u}_S - \overleftarrow{u}_S|^2$ or $|\overrightarrow{u}_S + \overleftarrow{u}_S|^2$ do not represent the irradiances of the field for p- and s-polarized light. Accordingly, the forward and reverse irradiances of the field are independent of the state of polarization of the incident light, and in this connection it should also be noted that in practice the reverse irradiance $\overleftarrow{E}_S$ is not easily observable as it may be obscured by a detector placed in the path of the forward field.
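The difference between the incoherent irradiances of Eq. (5) and the squared magnitude of the composite wave function can be made concrete with a few lines of arithmetic. This is only an illustrative sketch: the function names are hypothetical and the two complex amplitudes in the demo are arbitrary sample values, not data from Figs. 1 or 2.

```python
def irradiances(u_fwd, u_rev, E0=1.0):
    """Forward irradiance, reverse irradiance, and net z-flux as in Eq. (5)."""
    E_fwd = E0 * abs(u_fwd) ** 2
    E_rev = E0 * abs(u_rev) ** 2
    return E_fwd, E_rev, E_fwd - E_rev

def coherent_square(u_fwd, u_rev, polarization):
    """|u_fwd -/+ u_rev|^2 for 'p' or 's' polarization, as in Eq. (1a)."""
    sign = -1.0 if polarization == "p" else 1.0
    return abs(u_fwd + sign * u_rev) ** 2

if __name__ == "__main__":
    u_fwd, u_rev = 0.8 + 0.3j, 0.2 - 0.1j   # arbitrary sample amplitudes
    E_fwd, E_rev, net = irradiances(u_fwd, u_rev)
    print("forward, reverse, net:", E_fwd, E_rev, net)
    for pol in ("p", "s"):
        # The squared magnitude differs from E_fwd + E_rev by a cross term
        # -/+ 2 Re(u_fwd * conj(u_rev)) that depends on the polarization.
        print(pol, coherent_square(u_fwd, u_rev, pol))
```

The forward and reverse irradiances returned by `irradiances` do not depend on the polarization label, whereas `coherent_square` does, which is the distinction drawn in the paragraph above.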

It now seems reasonable to interpret the Rayleigh-Sommerfeld theory in a like manner, so that the quantities $u_{RS}^{(p)}$ and $u_{RS}^{(s)}$ defined by Eqs. (2a,b) are also regarded as bidirectional wave functions that can be resolved into mutually incoherent forward and reverse components, $\overrightarrow{u}_K$ and $\overleftarrow{u}_K$. Thus we define, in analogy to Eq. (1a),

$$\overrightarrow{u}_K = \tfrac{1}{2}\left[u_{RS}^{(p)}(x,z) + u_{RS}^{(s)}(x,z)\right], \qquad \overleftarrow{u}_K = \tfrac{1}{2}\left[u_{RS}^{(p)}(x,z) - u_{RS}^{(s)}(x,z)\right], \tag{6a}$$

and hence it follows that the corresponding forward and reverse irradiances, $\overrightarrow{E}_K$ and $\overleftarrow{E}_K$, will be given by an expression analogous to Eq. (5),

$$\langle \mathbf{S}_K \rangle = \left[\overrightarrow{E}_K(x,z) - \overleftarrow{E}_K(x,z)\right]\mathbf{n} = E_0\left(|\overrightarrow{u}_K|^2 - |\overleftarrow{u}_K|^2\right)\mathbf{n}. \tag{6b}$$
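The resolution of the Rayleigh-Sommerfeld pair in Eqs. (6a,b) amounts to a sum and a difference, as the following sketch shows. The names `resolve_rs` and `net_flux_z` are hypothetical, and the demo uses only the shadow-boundary value of Eq. (3b), where both Rayleigh-Sommerfeld integrals equal one half of exp(ikz), rather than a numerical evaluation of Eqs. (2a,b).

```python
import cmath

def resolve_rs(u_p, u_s):
    """Split the Rayleigh-Sommerfeld pair into the components of Eq. (6a)."""
    u_fwd = 0.5 * (u_p + u_s)   # identical to Kirchhoff's integral, Eq. (2e)
    u_rev = 0.5 * (u_p - u_s)   # sign as printed in Eq. (6a); only |u_rev|^2 enters Eq. (6b)
    return u_fwd, u_rev

def net_flux_z(u_p, u_s, E0=1.0):
    """z-component of <S_K> from Eq. (6b): E0 * (|u_fwd|^2 - |u_rev|^2)."""
    u_fwd, u_rev = resolve_rs(u_p, u_s)
    return E0 * (abs(u_fwd) ** 2 - abs(u_rev) ** 2)

if __name__ == "__main__":
    k, z = 2.0 * cmath.pi, 0.1              # wavelength taken as the unit of length
    on_boundary = 0.5 * cmath.exp(1j * k * z)
    # On the shadow boundary the reverse component vanishes and the
    # forward irradiance is one quarter of the incident irradiance.
    print(resolve_rs(on_boundary, on_boundary))
    print(net_flux_z(on_boundary, on_boundary))
```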

It will be noted that the forward wave function $\overrightarrow{u}_K$ defined by Eq. (6a) and Kirchhoff’s integral (2e) are identically the same, and therefore the subscript “K” was retained in the above equations. The Kirchhoff and Rayleigh-Sommerfeld solutions were originally derived on the mutually exclusive assumptions of black and metallic screens, and it is generally agreed that Eq. (2e) has no definable physical meaning as it would somehow imply the coherent superposition of two orthogonal states of polarization. However, in the present context, the Rayleigh-Sommerfeld integrals are interpreted as composite quantities and taking their sum and difference is tantamount to resolving them into their basic components. Accordingly, Kirchhoff’s integral $\overrightarrow{u}_K$ now appears as an integral part of the Rayleigh-Sommerfeld theory for metallic screens so that $u_{RS}^{(p)}$ and $u_{RS}^{(s)}$ provide the framework for the evaluation of all field parameters while $\overrightarrow{u}_K$ and its counterpart $\overleftarrow{u}_K$ define the flow of field energy. This new interpretation of Kirchhoff’s integral has a precise, physically realizable meaning.

A numerical comparison of the forward irradiances [E.sub.S] and [E.sub.K] defined by Eqs. (5) and (6b) is presented in Figs. 4 and 5. As expected, these quantities are essentially the same on the positive side of the aperture plane, the agreement being on the order of a few percent for z = +0.1[lambda] and increasingly better for larger values of z. This confirms that the identification of [[absolute value of [u.sub.K]].sup.2] with the forward irradiance [E.sub.K] is a valid assumption. As also expected, the agreement is poor on the negative side because in this region [E.sub.K] represents only the undiffracted geometrical field. The even symmetry of the irradiance [E.sub.s] shown in Fig. 5 suggests that the modification of the geometrical field due to diffraction is isotropic in the immediate vicinity of the edge.

4. Conclusions

The above comparison of the classical Rayleigh-Sommerfeld boundary-value theories with Sommerfeld’s rigorous theory for diffraction by a perfectly reflecting half plane has added substantially to the understanding of the physical significance of these theories.

It was found that the mathematical expressions and algorithms presented in Ref. (1) for the Rayleigh-Sommerfeld integrals are in very satisfactory agreement with Sommerfeld’s half-plane theory. Thus, they are well suited for computations in most of the positive near zone, and it is inferred that this will also be the case for the corresponding Rayleigh-Sommerfeld integrals for circular apertures and slits derived in Ref. (1). Sommerfeld’s theory also confirms that, on the whole, the differences between these respective solutions for p- and s-polarized incident light are small so that polarization effects are small, as might be expected for normally incident light. All in all, it appears that the use of Helmholtz’ theorem has proved remarkably effective in compensating for the inadequate boundary conditions assumed in deriving the classical boundary-value integrals. The residual differences between the Rayleigh-Sommerfeld and Sommerfeld solutions are confined to sub-wavelength distances from the screen, and it is shown in Appendix B that in this region the former do not obey the wave equation.

The comparison with Sommerfeld’s theory and its interpretation in terms of Maxwell’s equations has also revealed a previously overlooked aspect of diffraction by a reflecting screen; namely, that the optical field is bidirectional and comprises light traveling in opposite directions even on the positive side of the screen. According to the principle of interference, the observable Poynting vector is given by the incoherent vector sum of its components in the forward and reverse directions, and thus it is impermissible to express the near-zone irradiance of the field as the squared magnitudes of scalar wave functions. Rather, the latter must be resolved into their forward and reverse components, and it turns out that Kirchhoff’s integral is the appropriate expression for the forward irradiance of the field even in the Rayleigh-Sommerfeld theory. The forward and reverse irradiances were found to be independent of the state of polarization of the incident field.

It was noted that the residual deficiencies of the Rayleigh-Sommerfeld and Kirchhoff solutions in the proximity of the positive aperture plane can be removed by replacing the originally assumed boundary values with those predicted by Sommerfeld’s theory. This was not pursued further as it would produce only marginal improvements on the positive side of the screen, without removing the problem that the classical boundary-value integrals all exhibit discontinuities with respect to the incident geometrical field. A more effective approach would be the derivation of improved approximations for the entire field by constructing analytical continuations of the existing boundary-value solutions into the half space z [less than or equal to] 0. This will be attempted in a subsequent publication.

5. Appendix A. Sommerfeld’s Half-Plane Theory

In Sommerfeld’s rigorous treatment of diffraction by a straight edge the screen is assumed to be a perfectly conducting, infinitesimally thin, semi-infinite sheet that covers the half-plane x > 0, z = 0 of the Cartesian coordinate system in Fig. A1. It is assumed, further, that the primary field is a monochromatic plane wave, p- or s-polarized with respect to the xz-plane and incident upon the screen in a given angular direction [alpha]. As the optical field so defined must be independent of y it will be convenient to use cylindrical coordinates (r, [phi], y) given by

r = [square root of ([x.sup.2] + [z.sup.2])], [phi] = arccos (x/r), x = r cos [phi], z = -r sin [phi], (A1a)

so that [phi] and [alpha] are measured clockwise from the positive x-axis, and the illuminated and shaded sides of the screen are distinguished by [phi] = 0 and 2[pi], respectively. (4) In this notation, the primary field, its reflection by the screen, and the unit vectors in their respective directions of propagation are

$$\overrightarrow{U}_{geom} = \sqrt{E_0}\,e^{-ikr\cos(\phi - \alpha)}, \qquad \overleftarrow{U}_{geom} = \mp\sqrt{E_0}\,e^{-ikr\cos(\phi + \alpha)}, \tag{A1b}$$

where the time factor of the field is assumed as $e^{-i\omega t}$, $k = 2\pi/\lambda$ is the circular wavenumber of the light, $E_0$ is the incident irradiance, and the dual sign of $\overleftarrow{U}_{geom}$ accounts for polarization-dependent phase changes on reflection.

According to these definitions, the diffracted field at a given point P(r,[phi]) must obey the scalar wave equation,

$$\Delta U^{(p,s)} = \frac{\partial^2 U^{(p,s)}}{\partial r^2} + \frac{1}{r}\frac{\partial U^{(p,s)}}{\partial r} + \frac{1}{r^2}\frac{\partial^2 U^{(p,s)}}{\partial \phi^2} = -k^2 U^{(p,s)}, \tag{A2a}$$

as well as the boundary conditions for [phi] = 0 and [phi] = 2[pi],

[U.sup.(p)] = 0 or [partial][U.sup.(s)]/[partial]z = 0, for [phi] = 0 and 2[pi], (A2b)

according as the light is p- or s-polarized. Furthermore, in the limit r [right arrow] [infinity] these solutions must correspond to the optical field according to geometrical optics, and in this respect it is necessary to distinguish three regions of space as indicated in Fig. A1:

I. The reflection space (0 < [phi] < [alpha]), where the incident and reflected waves are both present and the geometrical field is [U.sup.(p,s).sub.geom] = [U.sub.geom] [+ or -] [U.sub.geom].

II. The transmission space ([alpha] < [phi] < [alpha] + [pi]), where only the incident wave is present and the geometrical field is [U.sup.(p,s).sub.geom] = [U.sub.geom].

III. The shadow space ([alpha] + [pi] < [phi] < 2[pi]), where the geometrical field [U.sup.(p,s).sub.geom] is zero.

Sommerfeld’s solution of the diffraction problem so defined is:

$$U_S^{(p,s)}(r,\phi) = \sqrt{E_0}\,\left(\overrightarrow{u}_S \mp \overleftarrow{u}_S\right), \tag{A3a}$$

$$\overrightarrow{u}_S = e^{-ikr\cos(\phi - \alpha)}\,V(\overrightarrow{\rho}), \qquad \overleftarrow{u}_S = e^{-ikr\cos(\phi + \alpha)}\,V(\overleftarrow{\rho}), \tag{A3b}$$

$$\overrightarrow{\rho} = \sqrt{8r/\lambda}\,\cos\!\left[\tfrac{1}{2}(\phi - \alpha)\right], \qquad \overleftarrow{\rho} = \sqrt{8r/\lambda}\,\cos\!\left[\tfrac{1}{2}(\phi + \alpha)\right], \tag{A3c}$$

$$V(\rho) = \frac{e^{-i\pi/4}}{\sqrt{2}}\int_{-\infty}^{\rho} d\tau\,e^{i\pi\tau^2/2} = \tfrac{1}{2}\left[1 + C(\rho) + S(\rho)\right] - \tfrac{i}{2}\left[C(\rho) - S(\rho)\right], \tag{A3d}$$

where C([rho]) and S([rho]) denote the usual Fresnel cosine and sine integrals.
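For readers who wish to reproduce V([rho]) without a special-function library, Eq. (A3d) can be evaluated directly from its definition. The sketch below is an assumption-laden illustration and not the algorithm of Ref. (7): it uses composite Simpson quadrature for the Fresnel integrals, with hypothetical function names and a step density (`steps_per_unit`) chosen for moderate arguments only.

```python
import math

def fresnel_cs(rho, steps_per_unit=2000):
    """Fresnel integrals C(rho), S(rho) by composite Simpson quadrature.

    C = integral of cos(pi t^2 / 2) and S = integral of sin(pi t^2 / 2),
    both taken from 0 to rho. Adequate for moderate |rho| only.
    """
    if rho == 0.0:
        return 0.0, 0.0
    n = max(2, 2 * math.ceil(abs(rho) * steps_per_unit / 2))  # even panel count
    h = rho / n                      # negative for rho < 0, so C and S come out odd
    c = s = 0.0
    for j in range(n + 1):
        t = j * h
        w = 1.0 if j in (0, n) else (4.0 if j % 2 else 2.0)   # Simpson weights
        phase = 0.5 * math.pi * t * t
        c += w * math.cos(phase)
        s += w * math.sin(phase)
    return c * h / 3.0, s * h / 3.0

def V(rho):
    """Complex Fresnel-type integral of Eq. (A3d)."""
    c, s = fresnel_cs(rho)
    return 0.5 * (1.0 + c + s) - 0.5j * (c - s)

if __name__ == "__main__":
    for rho in (-2.0, -1.0, 0.0, 1.0, 2.0):
        print(rho, V(rho), abs(V(rho)) ** 2)
```

Because C and S are odd functions, this V satisfies V([rho]) + V(-[rho]) = 1 and V(0) = 1/2, which provides a quick consistency check on any implementation.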

Although the derivation of these expressions is too complicated to be included in this paper, it is not difficult to verify that they have the following properties:

(1) As shown in Appendix B, $\overrightarrow{u}_S$ and $\overleftarrow{u}_S$ obey the wave equation, Eq. (A2a), everywhere in space, inclusive of the diffracting edge itself. They represent plane waves which propagate with space-dependent amplitudes, $V(\overrightarrow{\rho})$ and $V(\overleftarrow{\rho})$, in the respective directions of the unit vectors, Eq. (A1c). Except in the reflection space and at small distances r from the diffracting edge, $\overleftarrow{u}_S$ is significantly smaller than $\overrightarrow{u}_S$, and in the limit $r \to \infty$ Eqs. (A3a,b) are reduced to the above-mentioned geometrical solutions.

(2) In addition to the usual plane-wave components which are proportional to $\overrightarrow{u}_S$ and $\overleftarrow{u}_S$ themselves, the derivatives,

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], (A4a)

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], (A4b)

contain terms which involve the derivatives of $V(\overrightarrow{\rho})$ and $V(\overleftarrow{\rho})$ with respect to r and, thus, are singular as $1/\sqrt{r}$ at the diffracting edge. This suggests the existence of cylindrical waves which originate at the edge, but nonetheless the edge does not radiate energy. Although the radiant intensity I of the edge is infinite as $1/\sqrt{r}$, the radiant flux $I\,d\Omega$ emitted into any given solid angle element $d\Omega = r\,d\phi\,dy$ is zero as $r \to 0$. These cylindrical waves are evanescent and vanish in the limit $r \gg \lambda$.

(3) Equations (A3b) and (A4a,b) show that $(\overrightarrow{u}_S - \overleftarrow{u}_S)$ and $\partial(\overrightarrow{u}_S - \overleftarrow{u}_S)/\partial x$ are continuous on crossing the screen ($\phi = 0 \to \phi = 2\pi$), whereas $\partial(\overrightarrow{u}_S - \overleftarrow{u}_S)/\partial z$ is not. Conversely, $\partial(\overrightarrow{u}_S + \overleftarrow{u}_S)/\partial x$ are discontinuous. These are the expected properties of the tangential (magnetic or electric) field vector components when the light field is polarized parallel or perpendicular to the diffracting edge, and thus it is permissible to apply Maxwell’s equations in the form

$$\mathbf{H}^{(p)} = \sqrt{2E_0}\,\sqrt[4]{\epsilon/\mu}\,\left(\overrightarrow{u}_S - \overleftarrow{u}_S\right)\mathbf{m}, \qquad \mathbf{E}^{(p)} = \frac{i}{k}\sqrt{\mu/\epsilon}\ \mathrm{curl}\,\mathbf{H}^{(p)}, \tag{A5a}$$

$$\mathbf{E}^{(s)} = \sqrt{2E_0}\,\sqrt[4]{\mu/\epsilon}\,\left(\overrightarrow{u}_S + \overleftarrow{u}_S\right)\mathbf{m}, \qquad \mathbf{H}^{(s)} = -\frac{i}{k}\sqrt{\epsilon/\mu}\ \mathrm{curl}\,\mathbf{E}^{(s)}, \tag{A5b}$$

where m = (0,1,0) is the unit vector in the direction of the positive y-axis, [epsilon] and [micro] are the dielectric constant and magnetic permeability of the medium of propagation, and normalization factors were used so that the squared field vectors have the dimension of irradiance [W/[m.sup.2]]. On substitution of the derivatives Eqs. (A4a,b) and re-arranging terms, this leads to

$$\mathbf{E}^{(p)} = \sqrt{2E_0}\,\sqrt[4]{\mu/\epsilon}\,\Big\{\big[-\sin\alpha\,(\overrightarrow{u}_S + \overleftarrow{u}_S),\ 0,\ -\cos\alpha\,(\overrightarrow{u}_S - \overleftarrow{u}_S)\big] + \frac{e^{i(kr - \pi/4)}}{ik\sqrt{\lambda r}}\big[\sin(\tfrac{\phi}{2} + \tfrac{\alpha}{2}) - \sin(\tfrac{\phi}{2} - \tfrac{\alpha}{2}),\ 0,\ \cos(\tfrac{\phi}{2} + \tfrac{\alpha}{2}) - \cos(\tfrac{\phi}{2} - \tfrac{\alpha}{2})\big]\Big\}$$

$$= \sqrt{2E_0}\,\sqrt[4]{\mu/\epsilon}\,\Big\{\Big[\overrightarrow{u}_S\,\overrightarrow{\mathbf{t}} + \frac{e^{i(kr - \pi/4)}}{ik\sqrt{\lambda r}}\,\overrightarrow{\mathbf{v}}\Big] - \Big[\overleftarrow{u}_S\,\overleftarrow{\mathbf{t}} + \frac{e^{i(kr - \pi/4)}}{ik\sqrt{\lambda r}}\,\overleftarrow{\mathbf{v}}\Big]\Big\}, \tag{A6a}$$

$$\mathbf{H}^{(s)} = \sqrt{2E_0}\,\sqrt[4]{\epsilon/\mu}\,\Big\{\big[-\sin\alpha\,(\overrightarrow{u}_S - \overleftarrow{u}_S),\ 0,\ -\cos\alpha\,(\overrightarrow{u}_S + \overleftarrow{u}_S)\big] + \frac{e^{i(kr - \pi/4)}}{ik\sqrt{\lambda r}}\big[\sin(\tfrac{\phi}{2} + \tfrac{\alpha}{2}) + \sin(\tfrac{\phi}{2} - \tfrac{\alpha}{2}),\ 0,\ \cos(\tfrac{\phi}{2} + \tfrac{\alpha}{2}) + \cos(\tfrac{\phi}{2} - \tfrac{\alpha}{2})\big]\Big\}$$

$$= \sqrt{2E_0}\,\sqrt[4]{\epsilon/\mu}\,\Big\{\Big[\overrightarrow{u}_S\,\overrightarrow{\mathbf{t}} + \frac{e^{i(kr - \pi/4)}}{ik\sqrt{\lambda r}}\,\overrightarrow{\mathbf{v}}\Big] + \Big[\overleftarrow{u}_S\,\overleftarrow{\mathbf{t}} + \frac{e^{i(kr - \pi/4)}}{ik\sqrt{\lambda r}}\,\overleftarrow{\mathbf{v}}\Big]\Big\}, \tag{A6b}$$

where

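$$\overrightarrow{\mathbf{t}} = [-\sin\alpha,\ 0,\ -\cos\alpha], \qquad \overleftarrow{\mathbf{t}} = [\sin\alpha,\ 0,\ -\cos\alpha] \tag{A6c}$$

are the unit tangent vectors indicated in Fig. A1, and

$$\overrightarrow{\mathbf{v}} = \big[\sin(\tfrac{\phi}{2} + \tfrac{\alpha}{2}),\ 0,\ \cos(\tfrac{\phi}{2} + \tfrac{\alpha}{2})\big], \qquad \overleftarrow{\mathbf{v}} = \big[\sin(\tfrac{\phi}{2} - \tfrac{\alpha}{2}),\ 0,\ \cos(\tfrac{\phi}{2} - \tfrac{\alpha}{2})\big], \tag{A6d}$$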

are unit vectors in the directions of the above-mentioned evanescent cylindrical waves. The middle portions of Eqs. (A6a) and (A6b) are equivalent to the expressions cited by Bouwkamp (11) for the electromagnetic field components given by Sommerfeld’s solution. However, it appears that their representation in terms of the unit vectors defined by Eqs. (A6c,d) has not previously appeared in the literature.

(4) Equations (A5a,b) and (A6a,b) express each of the electromagnetic field vectors as the sum or difference of two components which, like Sommerfeld’s wave functions $\overrightarrow{u}_S$ and $\overleftarrow{u}_S$ themselves, are easily recognized as representative of a forward or reverse wave motion. It is obvious that these components must be mutually incoherent because, in all cases of practical interest, they propagate in opposite or nearly opposite directions, thus precluding any interference between them. Therefore, the corresponding Poynting vectors are given by the expressions

[E.sup.(p)]x[H.sup.(p)] = 2[E.sub.0]{[-[u.sub.S]t+[e.sup.i(kr-[pi]/4)]/ik[square root of ([lambda]r)]v]

x[u.sub.S.sup.*]m]-[[u.sub.S]t+[e.sup.i(kr-[pi]/r)]/ik[square root of ([lambda]r)]v]x[u.sub.S]m]}, (A7a)

[E.sup.(s)]x[H.sup.[(s).sup.*]] = 2[E.sub.0]{[u.sub.S]mx[[u.sub.S.sup.*]t-[e.sup.-i(kr-[pi]/4)]/ik[squa re root of ([lambda]r)]/v]

+[u.sub.S]mx[[u.sub.S]t-[e.sup.-i(kr-[pi]/4)]/ik[square root of ([lambda]r)v]}, (A7b)

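where it must also be taken into account that, as mentioned above, the terms in $1/\sqrt{r}$ have no energetic significance. Thus, these terms must be ignored and the following simple result is obtained for the observable Poynting vectors (5) of the field,

$$\langle \mathbf{S}_S \rangle = \tfrac{1}{2}\mathrm{Re}\big[\mathbf{E}^{(p)} \times \mathbf{H}^{(p)*}\big] \equiv \tfrac{1}{2}\mathrm{Re}\big[\mathbf{E}^{(s)} \times \mathbf{H}^{(s)*}\big] = E_0\left(|\overrightarrow{u}_S|^2\,\overrightarrow{\mathbf{s}} + |\overleftarrow{u}_S|^2\,\overleftarrow{\mathbf{s}}\right) = \overrightarrow{E}_S\,\overrightarrow{\mathbf{s}} + \overleftarrow{E}_S\,\overleftarrow{\mathbf{s}}, \tag{A7c}$$

where $\overrightarrow{E}_S$ and $\overleftarrow{E}_S$ are the forward and reverse irradiances (5) incident on area elements normal to the unit vectors in the directions of propagation of the incident and reflected field,

$$\overrightarrow{\mathbf{s}} = \mathbf{m} \times \overrightarrow{\mathbf{t}} = [-\cos\alpha,\ 0,\ \sin\alpha], \qquad \overleftarrow{\mathbf{s}} = \mathbf{m} \times \overleftarrow{\mathbf{t}} = [-\cos\alpha,\ 0,\ -\sin\alpha], \tag{A7d}$$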

as indicated in Fig. A1. These equations are not a part of Sommerfeld’s original theory and may be interpreted as follows:

(a) In the vicinity of the diffracting edge the wave functions [u.sub.S] and [u.sub.S], and thus the forward and reverse irradiances [E.sub.S] and [E.sub.S] as well, are similar in magnitude. Therefore, light traveling in both directions is present on both sides of the screen except, in the Fresnel approximation where [u.sub.S] is negligibly small. This is a significant departure from the classical formulation of Huygens’ principle, where a reverse flow of energy on the positive side of the aperture is precluded by the explicit assumption that light does not travel backwards. Instead, Sommerfeld’s theory asserts the presence of a bidirectional flow of energy on both sides of the aperture plane, and in this connection it is relevant to cite two earlier papers on Sommerfeld’s theory by Braunbek (12) and Braunbek and Laukien (13). The latter includes an interesting diagram depicting a swirling, bidirectional flow of energy at sub-wavelength distances from the diffracting edge. Although it appears that Braunbek a nd Laukien assumed a coherent superposition of the forward and reverse fields, the eddy currents shown in their diagram can be regarded as Maxwellian analogues of Huygens’ wavelets.

(b) In spite of the explicit assumption of separate boundary conditions for p- and s-polarized incident light, the forward and reverse irradiances $\overrightarrow{E}_S$ and $\overleftarrow{E}_S$ defined by Eq. (A7c) are the same in both cases. The composite wave functions $\overrightarrow{u}_S \pm \overleftarrow{u}_S$ pertain to different states of polarization only insofar as their phases are concerned, but their squared magnitudes cannot be used to describe the energy flow in the field as they contain non-observable cross terms in $\overrightarrow{u}_S\overleftarrow{u}_S$. This distinction disappears in the Fresnel region, where the field on the positive side of the aperture plane is unidirectional ($\overleftarrow{u}_S \ll \overrightarrow{u}_S$) and the usual definition of irradiance as the squared magnitude of the total wave function is justified.

(c) For a normally incident field ($\alpha = \pi/2$) one finds

$$
\cos(\phi \pm \alpha) = \mp\sin\phi, \tag{A8a}
$$

$$
\cos\left[\tfrac{1}{2}(\phi \pm \alpha)\right] = \frac{1}{\sqrt{2}}\left(\cos\frac{\phi}{2} \mp \sin\frac{\phi}{2}\right) = \frac{1}{2}\left(\sqrt{1+\cos\phi} \mp \sqrt{1-\cos\phi}\right). \tag{A8b}
$$

Hence, the starting equations in Secs. 2 and 3 of the main text are obtained by using Eqs. (A1a) to reintroduce Cartesian coordinates and noting that the sine term in the central portion of Eq. (A8b) is always positive, while the cosine term and z are opposite in sign.
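As a numerical spot-check of the normal-incidence identities (A8a,b), the following minimal Python sketch compares both sides for $\alpha = \pi/2$ over the range $0 \le \phi \le 2\pi$. The function names and the explicit sign factor `cos_sign` are illustrative assumptions, not part of the original text. The sign factor encodes the remark above that the cosine term changes sign with z, whereas the sine term does not.

```python
import math

ALPHA = math.pi / 2  # normal incidence, as assumed in Eqs. (A8a,b)


def full_angle_forms(phi, sign):
    """Both sides of Eq. (A8a) for beta = phi + sign*ALPHA, with sign = +1 or -1."""
    return math.cos(phi + sign * ALPHA), -sign * math.sin(phi)


def half_angle_forms(phi, sign):
    """The three expressions of Eq. (A8b) for beta = phi + sign*ALPHA."""
    lhs = math.cos(0.5 * (phi + sign * ALPHA))
    mid = (math.cos(phi / 2) - sign * math.sin(phi / 2)) / math.sqrt(2)
    # sqrt(1 + cos(phi)) only yields |cos(phi/2)|; restore the sign, which is
    # positive for phi < pi and negative for phi > pi (assumed range 0..2*pi).
    cos_sign = 1.0 if phi <= math.pi else -1.0
    root_plus = math.sqrt(max(0.0, 1.0 + math.cos(phi)))
    root_minus = math.sqrt(max(0.0, 1.0 - math.cos(phi)))
    rhs = 0.5 * (cos_sign * root_plus - sign * root_minus)
    return lhs, mid, rhs


if __name__ == "__main__":
    for phi in (0.3, 1.5, 3.0, 4.0, 6.0):
        for sign in (+1, -1):
            print(phi, sign, full_angle_forms(phi, sign), half_angle_forms(phi, sign))
```

Without the sign factor, the square-root form would agree with the other two expressions only for $\phi < \pi$, which is why the sign of the cosine term has to be tracked when Cartesian coordinates are reintroduced.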

6. Appendix B. Wave-Equation Conformance

It is commonly agreed that one of the most important measures of the physical significance and mathematical rigor of scalar diffraction theories is whether, or how well, they satisfy the wave equation [Eq. (A2a)]. In this appendix, this aspect of the Sommerfeld and Rayleigh-Sommerfeld theories is analyzed. It is shown that Sommerfeld’s solution obeys the wave equation rigorously everywhere in space, whereas the Rayleigh-Sommerfeld integrals (and, thus, Kirchhoff’s integral as well) exhibit deviations from the wave equation in the immediate proximity of the aperture plane.

6.1 Sommerfeld’s Solution

In this subsection, Sommerfeld’s solution (A3a-d) will be written as

$$
\overrightarrow{u}_S \ \text{or}\ \overleftarrow{u}_S = u = e^{-ikr\cos\beta}\,V(\rho),
$$

$$
\overrightarrow{\rho} \ \text{or}\ \overleftarrow{\rho} = \sqrt{8r/\lambda}\,\cos\left(\tfrac{1}{2}\beta\right), \quad \beta = \phi \pm \alpha. \tag{A9a}
$$

Thus, $\partial/\partial\phi = \partial/\partial\beta$ and

$$
\frac{\partial V}{\partial r} = \frac{1}{\sqrt{\lambda r}}\cos\left(\tfrac{1}{2}\beta\right)e^{i[kr(1+\cos\beta)-\pi/4]},
$$

$$
\frac{\partial V}{\partial \phi} = -\sqrt{\frac{r}{\lambda}}\,\sin\left(\tfrac{1}{2}\beta\right)e^{i[kr(1+\cos\beta)-\pi/4]}, \tag{A9b}
$$

$$
\frac{\partial u}{\partial r} = -ik\cos\beta\,u + \frac{1}{\sqrt{\lambda r}}\cos\left(\tfrac{1}{2}\beta\right)e^{i(kr-\pi/4)}, \tag{A9c}
$$

$$
\frac{\partial^2 u}{\partial r^2} = -k^2\cos^2\beta\,u - \frac{1}{\sqrt{\lambda r}}\left[\frac{1}{2r} - 2ik\sin^2\left(\tfrac{1}{2}\beta\right)\right]\cos\left(\tfrac{1}{2}\beta\right)e^{i(kr-\pi/4)}, \tag{A9d}
$$

$$
\frac{\partial u}{\partial \phi} = ikr\sin\beta\,u - \sqrt{\frac{r}{\lambda}}\,\sin\left(\tfrac{1}{2}\beta\right)e^{i(kr-\pi/4)}, \tag{A9e}
$$

$$
\frac{\partial^2 u}{\partial \phi^2} = -\left(k^2r^2\sin^2\beta - ikr\cos\beta\right)u - \frac{1}{\sqrt{\lambda r}}\left[\frac{r}{2} + 2ikr^2\sin^2\left(\tfrac{1}{2}\beta\right)\right]\cos\left(\tfrac{1}{2}\beta\right)e^{i(kr-\pi/4)}. \tag{A9f}
$$

Hence it follows immediately that the wave equation, Eq. (A2a), is rigorously satisfied. It should be noted that this is true everywhere in space, even at the diffracting edge itself where each component of the Laplace operator is singular.
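The cancellation can be spot-checked numerically. The minimal sketch below assumes that Eq. (A2a) is the Helmholtz equation in polar coordinates, $\partial^2 u/\partial r^2 + (1/r)\,\partial u/\partial r + (1/r^2)\,\partial^2 u/\partial\phi^2 + k^2 u = 0$ with $k = 2\pi/\lambda$, and substitutes the derivatives of Eqs. (A9c), (A9d), and (A9f), taking the factor $e^{i(kr-\pi/4)}$ to multiply the edge term in Eq. (A9d) as it does in Eqs. (A9c) and (A9f). Because the terms proportional to u and the edge-wave terms cancel separately, u is treated here as an arbitrary complex number; the function name and the sample values are illustrative.

```python
import cmath
import math


def helmholtz_residual(r, beta, wavelength, u):
    """Insert Eqs. (A9c), (A9d), (A9f) into the polar Helmholtz operator.

    u may be any complex number: the identity does not depend on its value.
    """
    k = 2 * math.pi / wavelength
    # Common edge-wave factor cos(beta/2) * exp(i(kr - pi/4)) / sqrt(lambda * r)
    edge = (math.cos(beta / 2) * cmath.exp(1j * (k * r - math.pi / 4))
            / math.sqrt(wavelength * r))
    s2 = math.sin(beta / 2) ** 2

    du_dr = -1j * k * math.cos(beta) * u + edge                          # (A9c)
    d2u_dr2 = (-(k * math.cos(beta)) ** 2 * u
               - (1 / (2 * r) - 2j * k * s2) * edge)                     # (A9d)
    d2u_dphi2 = (-((k * r * math.sin(beta)) ** 2
                   - 1j * k * r * math.cos(beta)) * u
                 - (r / 2 + 2j * k * r ** 2 * s2) * edge)                # (A9f)

    return d2u_dr2 + du_dr / r + d2u_dphi2 / r ** 2 + k ** 2 * u


if __name__ == "__main__":
    for r, beta in ((0.05, 0.4), (1.0, 2.0), (7.5, 5.1)):
        print(r, beta, abs(helmholtz_residual(r, beta, 1.0, 0.3 - 0.8j)))
```

The printed magnitudes should be at the level of floating-point rounding for any r > 0, which mirrors the statement that the wave equation is satisfied at arbitrarily small distances from the edge.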

6.2 Rayleigh-Sommerfeld Integrals

In this subsection, the wave-equation conformance of the Rayleigh-Sommerfeld integrals $u^{(p,s)}_{RS}(x, z)$ is analyzed by computing numerical values of the quantities

$$
w^{(p,s)}_{RS}(x,z) = \left[\frac{1}{k^2}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial z^2}\right) + 1\right]u^{(p,s)}_{RS}(x,z), \tag{A10a}
$$

which would be identically equal to zero everywhere if the wave equation were satisfied. For this purpose, the first and second derivatives of the Hankel functions appearing in Eqs. (2a,b) are evaluated by means of the identities

$$
\frac{dH^{(1)}_0(\beta)}{d\beta} = -H^{(1)}_1(\beta), \quad \frac{d}{d\beta}\left[\frac{H^{(1)}_1(\beta)}{\beta}\right] = -\frac{H^{(1)}_2(\beta)}{\beta}, \tag{A10b}
$$

$$
\frac{d}{d\beta}\left[\frac{H^{(1)}_2(\beta)}{\beta^2}\right] = -\frac{H^{(1)}_3(\beta)}{\beta^2}, \quad \frac{\partial\beta}{\partial x} = \frac{k^2 x}{\beta}, \quad \frac{\partial\beta}{\partial z} = \frac{k^2 z}{\beta}. \tag{A10c}
$$

Hence, the required x-derivatives of $u^{(p,s)}_{RS}$ are found by differentiating Eqs. (2a,b) with respect to the upper limit of integration, yielding

$$
\frac{\partial u^{(p)}_{RS}}{\partial x} = \frac{k\,H^{(1)}_0(\beta_x)}{2}, \quad \frac{\partial u^{(s)}_{RS}}{\partial x} = \frac{ik^2 z\,H^{(1)}_1(\beta_x)}{2\beta_x}, \quad \beta_x = k\sqrt{x^2 + z^2}, \tag{A10d}
$$

$$
\frac{\partial^2 u^{(p)}_{RS}}{\partial x^2} = \frac{k^3 x\,H^{(1)}_1(\beta_x)}{\beta_x}, \quad \frac{\partial^2 u^{(s)}_{RS}}{\partial x^2} = -\frac{ik^4 xz\,H^{(1)}_2(\beta_x)}{2\beta_x^2}, \tag{A10e}
$$

and the z-derivatives are obtained by substitution of

$$
\frac{\partial^2 H^{(1)}_0(\beta)}{\partial z^2} = -k^2\left[\frac{H^{(1)}_1(\beta)}{\beta} - (kz)^2\,\frac{H^{(1)}_2(\beta)}{\beta^2}\right], \tag{A10f}
$$

$$
\frac{\partial^2}{\partial z^2}\left[\frac{ik^2 z\,H^{(1)}_1(\beta)}{2\beta}\right] = -\frac{ik^4 z}{2}\left[\frac{3H^{(1)}_2(\beta)}{\beta^2} - (kz)^2\,\frac{H^{(1)}_3(\beta)}{\beta^3}\right], \tag{A10g}
$$

into Eqs. (2a,b). The final expressions are

$$
w^{(p)}_{RS}(x,z) = -\frac{kx\,H^{(1)}_1(\beta_x)}{\beta_x} + \frac{k}{2}\int_{-\infty}^{-x} d(\xi - x)\left[H^{(1)}_0(\beta) - \frac{H^{(1)}_1(\beta)}{\beta} + (kz)^2\,\frac{H^{(1)}_2(\beta)}{\beta^2}\right], \tag{A10h}
$$

$$
w^{(s)}_{RS}(x,z) = \frac{ik^2 xz\,H^{(1)}_2(\beta_x)}{\beta_x^2} + \frac{ik^2 z}{2}\int_{-\infty}^{-x} d(\xi - x)\left[\frac{H^{(1)}_1(\beta)}{\beta} - \frac{3H^{(1)}_2(\beta)}{\beta^2} + (kz)^2\,\frac{H^{(1)}_3(\beta)}{\beta^3}\right]. \tag{A10i}
$$
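As a sketch of how Eq. (A10f) follows from the identities (A10b,c), the chain rule can be applied twice, using only $\partial\beta/\partial z = k^2 z/\beta$ and the first two Hankel-function identities:

```latex
\frac{\partial H^{(1)}_0(\beta)}{\partial z}
  = -H^{(1)}_1(\beta)\,\frac{k^2 z}{\beta}
  = -k^2 z\,\frac{H^{(1)}_1(\beta)}{\beta},
\qquad
\frac{\partial^2 H^{(1)}_0(\beta)}{\partial z^2}
  = -k^2\,\frac{H^{(1)}_1(\beta)}{\beta}
    - k^2 z\,\frac{d}{d\beta}\!\left[\frac{H^{(1)}_1(\beta)}{\beta}\right]\frac{k^2 z}{\beta}
  = -k^2\left[\frac{H^{(1)}_1(\beta)}{\beta}
    - (kz)^2\,\frac{H^{(1)}_2(\beta)}{\beta^2}\right].
```

Equation (A10g) is obtained in the same way, with one further application of the identity for $H^{(1)}_2(\beta)/\beta^2$.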

The integrals on the right-hand side of these expressions were evaluated by recursive numerical integration as described in Ref. (7). In accordance with Eq. (3b), the starting values used for these computations were $w^{(p,s)}_{RS}(0, z) = 0$.

The real and imaginary parts of Eqs. (A10h,i) are plotted in Fig. A2 for z = 0.1, showing that at this distance the wave-equation failure is substantial. Additional computations indicated that the corresponding values of $w^{(p,s)}_{RS}(x, z)$ decrease at larger distances z, but fall below 0.01 only when $z > 30\lambda$, so that calculable departures from the wave equation are present throughout the near zone. This is not surprising because, otherwise, the Rayleigh-Sommerfeld and Kirchhoff boundary-value integrals would be rigorously correct. On the other hand, the above results are at odds with the fallacious belief that these integrals must obey the wave equation because their integrands do. The fact of the matter is that the assumed boundary conditions, Eqs. (2c,d), abruptly truncate the incident field at the edge and thus transform these integrands into discontinuous functions that violate the wave equation.

[FIGURE A2 OMITTED]

[FIGURE 1 OMITTED]

[FIGURE 2 OMITTED]

[FIGURE 3 OMITTED]

[FIGURE 4 OMITTED]

[FIGURE 5 OMITTED]

Accepted: December 11, 2002

(1.) For large negative values of z, Sommerfeld’s theory also yields a complementary term for the Fresnel approximation of the reflected diffraction pattern on the lit side of the half plane.

(2.) Except for grazing incidence, where interference can occur as in Fresnel’s mirror experiment.

(3.) For oblique incidence of the primary field, as assumed in Eq. (A8a), these irradiances are defined with respect to area elements which are perpendicular to the directions of propagation of the incident and reflected geometrical field.

(4.) For numerical computations the expressions $\phi = \mathrm{ACOS}(x/r)$ if $z \le 0$ and $\phi = 2\pi - \mathrm{ACOS}(x/r)$ if $z \ge 0$ must be used, where ACOS(x/r) is the principal value of arccos(x/r).

(5.) In order to conform to established nomenclature, the letter “E” is used in this paper to denote irradiance as well as electric-field vectors. There should be no confusion, as these different quantities are consistently indicated in lightface italic and boldface roman type, respectively.
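The branch selection of footnote (4.) can be sketched in Python as follows. The function name `polar_angle`, the clamping of x/r against rounding, and the assignment of z = 0 to the first branch are assumptions made for illustration. The second branch is taken to apply for z > 0, consistent with the remark following Eq. (A8b) that the cosine term and z are opposite in sign.

```python
import math


def polar_angle(x, z):
    """Angle phi in [0, 2*pi], with phi <= pi on the z <= 0 side of the screen."""
    r = math.hypot(x, z)
    if r == 0.0:
        raise ValueError("phi is undefined at the diffracting edge (r = 0)")
    # Principal value ACOS(x/r); the clamp guards against |x/r| exceeding 1
    # by a rounding error.
    principal = math.acos(max(-1.0, min(1.0, x / r)))
    return principal if z <= 0 else 2 * math.pi - principal


if __name__ == "__main__":
    for x, z in ((1.0, -0.5), (0.0, -1.0), (-1.0, -0.5), (-1.0, 0.5), (0.0, 1.0), (1.0, 0.5)):
        phi = polar_angle(x, z)
        print(x, z, phi, math.cos(phi / 2))
```

For points off the plane z = 0 the result coincides with `math.atan2(-z, x)` reduced to the interval from 0 to 2π, which may be a convenient alternative when a two-argument arctangent is available.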

7. References

(1.) K. D. Mielenz, J. Res. Natl. Inst. Stand. Technol. 107, 355-362 (2002).

(2.) A. Sommerfeld, Math. Ann. 47, 317 (1896).

(3.) P. Drude, Lehrbuch der Optik, S. Hirzel, Leipzig (1906). Transl.: Theory of Optics, Longmans, Green & Co., London etc (1933).

(4.) A. Sommerfeld, Optik, Dieterich’sche Verlagsb., Wiesbaden (1950). Transl.: Optics, Acad. Press, New York (1964).

(5.) M. Born and E. Wolf, Principles of Optics, Pergamon Press, Oxford (1970).

(6.) K. D. Mielenz, J. Res. Natl. Inst. Stand. Technol. 105, 81 (2000).

(7.) K. D. Mielenz, J. Res. Natl. Inst. Stand. Technol. 105, 589 (2000).

(8.) W. Braunbek, Z. Physik 127, 381 (1950).

(9.) W. Braunbek, Z. Physik 127, 405 (1950).

(10.) W. Braunbek, Z. Physik 138, 80 (1954).

(11.) C. J. Bouwkamp, Rep. Progr. Phys. (London) 17, 35 (1953).

(12.) W. Braunbek, Ann. Physik VI 6, 53 (1949).

(13.) W. Braunbek and G. Laukien, Optik 9, 174 (1952).

About the author: Klaus D. Mielenz is a physicist and retired Chief of the Radiometric Physics Division of the NIST Physics Laboratory. The National Institute of Standards and Technology is an agency of the Technology Administration, U.S. Department of Commerce.


COPYRIGHT 2003 National Institute of Standards and Technology
