The resolving power of telescopes was investigated also by J. B. L. Foucault, who employed a scale of equal bright and dark alternate parts; it was found to be proportional to the aperture and independent of the focal length. In telescopes of the best construction and of moderate aperture the performance is not sensibly prejudiced by outstanding aberration, and the limit imposed by the finiteness of the waves of light is practically reached. M. E. Verdet has compared Foucault’s results with theory, and has drawn the conclusion that the radius of the visible part of the image of a luminous point was equal to half the radius of the first dark ring.
The application, unaccountably long delayed, of this principle to the microscope by H. L. F. Helmholtz in 1871 is the foundation of the important doctrine of themicroscopic limit. It is true that in 1823 Fraunhofer, inspired by his observations upon gratings, had very nearly hit the mark.3And a little before Helmholtz, E. Abbe published a somewhat more complete investigation, also founded upon the phenomena presented by gratings. But although the argument from gratings is instructive and convenient in some respects, its use has tended to obscure the essential unity of the principle of the limit of resolution whether applied to telescopes or microscopes.
In fig. 4, AB represents the axis of an optical instrument (telescope or microscope), A being a point of the object and B a point of the image. By the operation of the object-glass LL′ all the rays issuing from A arrive in the same phase at B. Thus if A be self-luminous, the illumination is a maximum at B, where all the secondary waves agree in phase. B is in fact the centre of the diffraction disk which constitutes the image of A. At neighbouring points the illumination is less, in consequence of the discrepancies of phase which there enter. In like manner if we take a neighbouring point P, also self-luminous, in the plane of the object, the waves which issue from it will arrive at B with phases no longer absolutely concordant, and the discrepancy of phase will increase as the interval APincreases. When the interval is very small the discrepancy, though mathematically existent, produces no practical effect; and the illumination at B due to P is as important as that due to A, the intensities of the two luminous sources being supposed equal. Under these conditions it is clear that A and P are not separated in the image. The question is to what amount must the distance AP be increased in order that the difference of situation may make itself felt in the image. This is necessarily a question of degree; but it does not require detailed calculations in order to show that the discrepancy first becomes conspicuous when the phases corresponding to the various secondary waves which travel from P to B range over a complete period. The illumination at B due to P then becomes comparatively small, indeed for some forms of aperture evanescent. The extreme discrepancy is that between the waves which travel through the outermost parts of the object-glass at L and L′; so that if we adopt the above standard of resolution, the question is where must P be situated in order that the relative retardation of the rays PL and PL’ may on their arrival at B amount to a wave-length (λ). In virtue of the general law that the reduced optical path is stationary in value, this retardation may be calculated without allowance for the different paths pursued on the farther side of L, L′, so that the value is simply PL − PL′. Now since AP is very small, AL′ − PL′ = AP sin α, where α is the angular semi-aperture L′AB. In like manner PL − AL has the same value, so that
PL − PL′ = 2AP sin α.
According to the standard adopted, the condition of resolution is therefore that AP, or ε, should exceed ½λ/sin α. If ε be less than this, the images overlap too much; while if ε greatly exceed the above value the images become unnecessarily separated.
In the above argument the whole space between the object and the lens is supposed to be occupied by matter of one refractive index, and λ represents the wave-lengthin this mediumof the kind of light employed. If the restriction as to uniformity be violated, what we have ultimately to deal with is the wave-length in the medium immediately surrounding the object.
Calling the refractive index μ, we have as the critical value of ε,
ε = ½λ0/μ sin α, (1),
λ0being the wave-lengthin vacuo. The denominator μ sin α is the quantity well known (after Abbe) as the “numerical aperture.”
The extreme value possible for α is a right angle, so that for the microscopic limit we have
ε = ½λ0/μ (2).
The limit can be depressed only by a diminution in λ0, such as photography makes possible, or by an increase in μ, the refractive index of the medium in which the object is situated.
The statement of the law of resolving power has been made in a form appropriate to the microscope, but it admits also of immediate application to the telescope. If 2R be the diameter of the object-glass and D the distance of the object, the angle subtended by AP is ε/D, and the angular resolving power is given by
λ/2D sin α = λ/2R (3).
This method of derivation (substantially due to Helmholtz) makes it obvious that there is no essential difference of principle between the two cases, although the results are conveniently stated in different forms. In the case of the telescope we have to deal with a linear measure of aperture and an angular limit of resolution, whereas in the case of the microscope the limit of resolution is linear, and it is expressed in terms of angular aperture.
It must be understood that the above argument distinctly assumes that the different parts of the object are self-luminous, or at least that the light proceeding from the various points is without phase relations. As has been emphasized by G. J. Stoney, the restriction is often, perhaps usually, violated in the microscope. A different treatment is then necessary, and for some of the problems which arise under this head the method of Abbe is convenient.
The importance of the general conclusions above formulated, as imposing a limit upon our powers of direct observation, can hardly be overestimated; but there has been in some quarters a tendency to ascribe to it a more precise character than it can bear, or even to mistake its meaning altogether. A few words of further explanation may therefore be desirable. The first point to be emphasized is that nothing whatever is said as to the smallness of a single object that may be made visible. The eye, unaided or armed with a telescope, is able to see, as points of light, stars subtending no sensible angle. The visibility of a star is a question of brightness simply, and has nothing to do with resolving power. The latter element enters only when it is a question of recognizing the duplicity of a double star, or of distinguishing detail upon the surface of a planet. So in the microscope there is nothing except lack of light to hinder the visibility of an object however small. But if its dimensions be much less than the half wave-length, it can only be seen as a whole, and its parts cannot be distinctly separated, although in cases near the border line some inference may be possible, founded upon experience of what appearances are presented in various cases. Interesting observations upon particles,ultra-microscopicin the above sense, have been recorded by H. F. W. Siedentopf and R. A. Zsigmondy (Drude’s Ann., 1903, 10, p. 1).
In a somewhat similar way a dark linear interruption in a bright ground may be visible, although its actual width is much inferior to the half wave-length. In illustration of this fact a simple experiment may be mentioned. In front of the naked eye was held a piece of copper foil perforated by a fine needle hole. Observed through this the structure of some wire gauze just disappeared at a distance from the eye equal to 17 in., the gauze containing 46 meshes to the inch. On the other hand, a single wire 0.034 in. in diameter remained fairly visible up to a distance of 20 ft. The ratio between the limiting angles subtended by the periodic structure of the gauze and the diameter of the wire was (.022/.034) × (240/17) = 9.1. For further information upon this subject reference may be made toPhil. Mag., 1896, 42, p. 167;Journ. R. Micr. Soc., 1903, p. 447.
6.Coronas or Glories.—The results of the theory of the diffraction patterns due to circular apertures admit of an interesting application tocoronas, such as are often seen encircling the sun and moon. They are due to the interposition of small spherules of water, which act the part of diffracting obstacles. In order to the formation of a well-defined corona it is essential that the particles be exclusively, or preponderatingly, of one size.
If the origin of light be treated as infinitely small, and be seen in focus, whether with the naked eye or with the aid of a telescope, the whole of the light in the absence of obstacles would be concentrated in the immediate neighbourhood of the focus. At other parts of the field the effect is the same, in accordance with the principle known as Babinet’s, whether the imaginary screen in front of the object-glass is generally transparent but studded with a number of opaque circular disks, or is generally opaque but perforated with corresponding apertures. Since at these points the resultant due to the whole aperture is zero, any two portions into which the whole may be divided must give equal and opposite resultants. Consider now the light diffracted in a direction many times more oblique than any with which we should be concerned, were the whole aperture uninterrupted, and take first the effect of a single small aperture. The light in the proposed direction is that determined by the size of the small aperture in accordance with the laws already investigated, and its phase depends upon the position of the aperture. If we take a direction such that the light (of given wave-length) from a single aperture vanishes, the evanescence continues even when the whole series of apertures is brought into contemplation. Hence, whatever else may happen, there must be a system of dark rings formed, the same as from a single small aperture. In directions other than these it is a more delicate question how the partial effects should be compounded. If we make the extreme suppositions of an infinitely small source and absolutely homogeneous light, there is no escape from the conclusion that the light in a definite direction is arbitrary, that is, dependent upon the chance distribution of apertures. If, however, as in practice, the light be heterogeneous, the source of finite area, the obstacles in motion, and the discrimination of different directions imperfect, we are concerned merely with the mean brightness found by varying the arbitrary phase-relations, and this is obtained by simply multiplying the brightness due to a single aperture by the number of apertures (n) (seeInterference of Light, § 4). The diffraction pattern is therefore that due to a single aperture, merely brightened n times.
In his experiments upon this subject Fraunhofer employed plates of glass dusted over with lycopodium, or studded with small metallic disks of uniform size; and he found that the diameters of the rings were proportional to the length of the waves and inversely as the diameter of the disks.
In another respect the observations of Fraunhofer appear at first sight to be in disaccord with theory; for his measures of the diameters of the red rings, visible when white light was employed, correspond with the law applicable to dark rings, and not to the different law applicable to the luminous maxima. Verdet has, however, pointed out that the observation in this form is essentially different from that in which homogeneous red light is employed, and that the position of the red rings would correspond to theabsenceof blue-green light rather than to the greatest abundance of red light. Verdet’s own observations, conducted with great care, fully confirm this view, and exhibit a complete agreement with theory.
By measurements of coronas it is possible to infer the size of the particles to which they are due, an application of considerable interest in the case of natural coronas—the general rule being the larger the corona the smaller the water spherules. Young employed this method not only to determine the diameters of cloud particles (e.g.1⁄1000in.), but also those of fibrous material, for which the theory is analogous. His instrument was called theeriometer(see “Chromatics,” vol. iii. of supp. toEncy. Brit., 1817).
7.Influence of Aberration. Optical Power of Instruments.—Our investigations and estimates of resolving power have thus far proceeded upon the supposition that there are no optical imperfections, whether of the nature of a regular aberration or dependent upon irregularities of material and workmanship. Inpractice there will always be a certain aberration or error of phase, which we may also regard as the deviation of the actual wave-surface from its intended position. In general, we may say that aberration is unimportant when it nowhere (or at any rate over a relatively small area only) exceeds a small fraction of the wave-length (λ). Thus in estimating the intensity at a focal point, where, in the absence of aberration, all the secondary waves would have exactly the same phase, we see that an aberration nowhere exceeding ¼λ can have but little effect.
The only case in which the influence of small aberration upon the entire image has been calculated (Phil. Mag., 1879) is that of a rectangular aperture, traversed by a cylindrical wave with aberration equal to cx³. The aberration is here unsymmetrical, the wave being in advance of its proper place in one half of the aperture, but behind in the other half. No terms in x or x² need be considered. The first would correspond to a general turning of the beam; and the second would imply imperfect focusing of the central parts. The effect of aberration may be considered in two ways. We may suppose the aperture (a) constant, and inquire into the operation of an increasing aberration; or we may take a given value of c (i.e.a given wave-surface) and examine the effect of a varying aperture. The results in the second case show that an increase of aperture up to that corresponding to an extreme aberration of half a period has no ill effect upon the central band (§ 3), but it increases unduly the intensity of one of the neighbouring lateral bands; and the practical conclusion is that the best results will be obtained from an aperture giving an extreme aberration of from a quarter to half a period, and that with an increased aperture aberration is not so much a direct cause of deterioration as an obstacle to the attainment of that improved definition which should accompany the increase of aperture.
If, on the other hand, we suppose the aperture given, we find that aberration begins to be distinctly mischievous when it amounts to about a quarter period,i.e.when the wave-surface deviates at each end by a quarter wave-length from the true plane.
As an application of this result, let us investigate what amount of temperature disturbance in the tube of a telescope may be expected to impair definition. According to J. B. Biot and F. J. D. Arago, the index μ for air at t° C. and at atmospheric pressure is given by
If we take 0° C. as standard temperature,
δμ = -1.1 × 10-6.
Thus, on the supposition that the irregularity of temperature t extends through a length l, and produces an acceleration of a quarter of a wave-length,
¼λ = 1.1 lt × 10-6;
or, if we take λ = 5.3 × 10-5,
lt = 12,
the unit of length being the centimetre.
We may infer that, in the case of a telescope tube 12 cm. long, a stratum of air heated 1° C. lying along the top of the tube, and occupying a moderate fraction of the whole volume, would produce a not insensible effect. If the change of temperature progressed uniformly from one side to the other, the result would be a lateral displacement of the image without loss of definition; but in general both effects would be observable. In longer tubes a similar disturbance would be caused by a proportionally less difference of temperature. S. P. Langley has proposed to obviate such ill-effects by stirring the air included within a telescope tube. It has long been known that the definition of a carbon bisulphide prism may be much improved by a vigorous shaking.
We will now consider the application of the principle to the formation of images, unassisted by reflection or refraction (Phil. Mag., 1881). The function of a lens in forming an image is to compensate by its variable thickness the differences of phase which would otherwise exist between secondary waves arriving at the focal point from various parts of the aperture. If we suppose the diameter of the lens to be given (2R), and its focal length ƒ gradually to increase, the original differences of phase at the image of an infinitely distant luminous point diminish without limit. When ƒ attains a certain value, say ƒ1, the extreme error of phase to be compensated falls to ¼λ. But, as we have seen, such an error of phase causes no sensible deterioration in the definition; so that from this point onwards the lens is useless, as only improving an image already sensibly as perfect as the aperture admits of. Throughout the operation of increasing the focal length, the resolving power of the instrument, which depends only upon the aperture, remains unchanged; and we thus arrive at the rather startling conclusion that a telescope of any degree of resolving power might be constructed without an object-glass, if only there were no limit to the admissible focal length. This last proviso, however, as we shall see, takes away almost all practical importance from the proposition.
To get an idea of the magnitudes of the quantities involved, let us take the case of an aperture of1⁄5in., about that of the pupil of the eye. The distance ƒ1, which the actual focal length must exceed, is given by
√ (ƒ1² + R²) − ƒ1= ¼λ;
so that
ƒ1= 2R²/λ (1).
Thus, if λ =1⁄40000, R =1⁄10, we find
ƒ1= 800 inches.
The image of the sun thrown upon a screen at a distance exceeding 66 ft., through a hole1⁄5in. in diameter, is therefore at least as well defined as that seen direct.
As the minimum focal length increases with the square of the aperture, a quite impracticable distance would be required to rival the resolving power of a modern telescope. Even for an aperture of 4 in., ƒ1would have to be 5 miles.
A similar argument may be applied to find at what point an achromatic lens becomes sensibly superior to a single one. The question is whether, when the adjustment of focus is correct for the central rays of the spectrum, the error of phase for the most extreme rays (which it is necessary to consider) amounts to a quarter of a wave-length. If not, the substitution of an achromatic lens will be of no advantage. Calculation shows that, if the aperture be1⁄5in., an achromatic lens has no sensible advantage if the focal length be greater than about 11 in. If we suppose the focal length to be 66 ft., a single lens is practically perfect up to an aperture of 1.7 in.
Another obvious inference from the necessary imperfection of optical images is the uselessness of attempting anything like an absolute destruction of spherical aberration. An admissible error of phase of ¼λ will correspond to an error of1⁄8λ in a reflecting and ½λ in a (glass) refracting surface, the incidence in both cases being perpendicular. If we inquire what is the greatest admissible longitudinal aberration (δƒ) in an object-glass according to the above rule, we find
δƒ = λα-2(2),
α being the angular semi-aperture.
In the case of a single lens of glass with the most favourable curvatures, δƒ is about equal to ᲃ, so that α4must not exceed λ/ƒ. For a lens of 3 ft. focus this condition is satisfied if the aperture does not exceed 2 in.
When parallel rays fall directly upon a spherical mirror the longitudinal aberration is only about one-eighth as great as for the most favourably shaped single lens of equal focal length and aperture. Hence a spherical mirror of 3 ft. focus might have an aperture of 2½ in., and the image would not suffer materially from aberration.
On the same principle we may estimate the least visible displacement of the eye-piece of a telescope focused upon a distant object, a question of interest in connexion with range-finders. It appears (Phil. Mag., 1885, 20, p. 354) that a displacement δf from the true focus will not sensibly impair definition, provided
δƒ < ƒ²λ/R² (3),
2R being the diameter of aperture. The linear accuracy required is thus a function of theratioof aperture to focal length. The formula agrees well with experiment.
The principle gives an instantaneous solution of the question of the ultimate optical efficiency in the method of “mirror-reading,” as commonly practised in various physical observations. A rotation by which one edge of the mirror advances ¼λ (while the other edge retreats to a like amount) introduces a phase-discrepancy of a whole period where before the rotation there was complete agreement. A rotation of this amount should therefore be easily visible, but the limits of resolving power are being approached; and the conclusion is independent of the focal length of the mirror, and of the employment of a telescope, provided of course that the reflected image is seen in focus, and that the full width of the mirror is utilized.
A comparison with the method of a material pointer, attached to the parts whose rotation is under observation, and viewed through a microscope, is of interest. The limiting efficiency of the microscope is attained when the angular aperture amounts to 180°; and it is evident that a lateral displacement of the point under observation through ½λ entails (at the old image) a phase-discrepancy of a whole period, one extreme ray being accelerated and the other retarded by half that amount. We may infer that the limits of efficiency in the two methods are the same when the length of the pointer is equal to the width of the mirror.
We have seen that in perpendicular reflection a surface error not exceeding1⁄8λ may be admissible. In the case of oblique reflection at an angle φ, the error of retardation due to an elevation BD (fig. 5) is
QQ′ − QS = BD sec φ(1 − cos SQQ′) = BD sec φ (1 + cos 2φ) = 2BD cos φ;
from which it follows that an error of given magnitude in the figure of a surface is less important in oblique than in perpendicular reflection. It must, however, be borne in mind that errors can sometimes be compensated by altering adjustments. If a surface intended to be flat is affected with a slight general curvature, a remedy may be found in an alteration of focus, and the remedy is the less complete as the reflection is more oblique.
The formula expressing the optical power of prismatic spectroscopes may readily be investigated upon the principles of the wave theory. Let A0B0be a plane wave-surface of the light before it falls upon the prisms, AB the corresponding wave-surface for a particular part of the spectrum after the light has passed the prisms, or after it has passed the eye-piece of the observing telescope. The path of a ray from the wave-surface A0B0to A or B is determined by the condition that the optical distance, ∫ μ ds, is a minimum; and, as AB is by supposition a wave-surface, this optical distance is the same for both points. Thus
∫μ ds (for A) =∫μ ds (for B) (4).
We have now to consider the behaviour of light belonging to a neighbouring part of the spectrum. The path of a ray from the wave-surface A0B0to the point A is changed; but in virtue of the minimum property the change may be neglected in calculating the optical distance, as it influences the result by quantities of the second order only in the changes of refrangibility. Accordingly, the optical distance from A0B0to A is represented by ∫(μ + δμ)ds, the integration being along the original path A0... A; and similarly the optical distance between A0B0and B is represented by ∫ (μ + δμ)ds, the integration being along B0... B. In virtue of (4) the difference of the optical distances to A and B is
∫δμ ds (along B0... B) −∫δμ ds (along A0... A) (5).
The new wave-surface is formed in such a position that the optical distance is constant; and therefore thedispersion, or the angle through which the wave-surface is turned by the change of refrangibility, is found simply by dividing (5) by the distance AB. If, as in common flint-glass spectroscopes, there is only one dispersing substance, ∫ δμ ds = δμ·s, where s is simply the thickness traversed by the ray. If t2and t1be the thicknesses traversed by the extreme rays, and a denote the width of the emergent beam, the dispersion θ is given by
θ = δμ(t2− t1)/a,
or, if t1be negligible,
θ = δμt/a (6).
The condition of resolution of a double line whose components subtend an angle θ is that θ must exceed λ/a. Hence, in order that a double line may be resolved whose components have indices μ and μ + δμ, it is necessary that t should exceed the value given by the following equation:—
t = λ/δμ (7).
8.Diffraction Gratings.—Under the heading “Colours of Striated Surfaces,” Thomas Young (Phil. Trans., 1802) in his usual summary fashion gave a general explanation of these colours, including the law of sines, the striations being supposed to be straight, parallel and equidistant. Later, in his article “Chromatics” in the supplement to the 5th edition of this encyclopaedia, he shows that the colours “lose the mixed character of periodical colours, and resemble much more the ordinary prismatic spectrum, with intervals completely dark interposed,” and explains it by the consideration that any phase-difference which may arise at neighbouring striae is multiplied in proportion to the total number of striae.
The theory was further developed by A. J. Fresnel (1815), who gave a formula equivalent to (5) below. But it is to J. von Fraunhofer that we owe most of our knowledge upon this subject. His recent discovery of the “fixed lines” allowed a precision of observation previously impossible. He constructed gratings up to 340 periods to the inch by straining fine wire over screws. Subsequently he ruled gratings on a layer of gold-leaf attached to glass, or on a layer of grease similarly supported, and again by attacking the glass itself with a diamond point. The best gratings were obtained by the last method, but a suitable diamond point was hard to find, and to preserve. Observing through a telescope with light perpendicularly incident, he showed that the position of any ray was dependent only upon the grating interval, viz. the distance from the centre of one wire or line to the centre of the next, and not otherwise upon the thickness of the wire and the magnitude of the interspace. In different gratings the lengths of the spectra and their distances from the axis were inversely proportional to the grating interval, while with a given grating the distances of the various spectra from the axis were as 1, 2, 3, &c. To Fraunhofer we owe the first accurate measurements of wave-lengths, and the method of separating the overlapping spectra by a prism dispersing in the perpendicular direction. He described also the complicated patterns seen when a point of light is viewed through two superposed gratings, whose lines cross one another perpendicularly or obliquely. The above observations relate to transmitted light, but Fraunhofer extended his inquiry to the lightreflected. To eliminate the light returned from the hinder surface of an engraved grating, he covered it with a black varnish. It then appeared that under certain angles of incidence parts of the resulting spectra werecompletely polarized. These remarkable researches of Fraunhofer, carried out in the years 1817-1823, are republished in hisCollected Writings(Munich, 1888).
The principle underlying the action of gratings is identical with that discussed in § 2, and exemplified in J. L. Soret’s “zone plates.” The alternate Fresnel’s zones are blocked out or otherwise modified; in this way the original compensation is upset and a revival of light occurs in unusual directions. If the source be a point or a line, and a collimating lens be used, the incident waves may be regarded as plane. If, further, on leaving the grating the light be received by a focusing lens,e.g.the object-glass of a telescope, the Fresnel’s zones are reduced to parallel and equidistant straight strips, which at certain angles coincide with the ruling. The directions of the lateral spectra are such that the passage from one element of the grating to the corresponding point of the next implies a retardation of an integral number of wave-lengths. If the grating be composed of alternate transparent and opaque parts, the question may be treated by means of the general integrals (§ 3) by merely limiting the integration to the transparent parts of the aperture. For an investigation upon these lines the reader is referred to Airy’sTracts, to Verdet’sLeçons, or to R. W. Wood’sPhysical Optics. If, however, we assume the theory of a simple rectangular aperture (§ 3); the results of the ruling can be inferred by elementary methods, which are perhaps more instructive.
Apart from the ruling, we know that the image of a mathematical line will be a series of narrow bands, of which the central one is by far the brightest. At the middle of this band there is complete agreement of phase among the secondary waves. The dark lines which separate the bands are the places at which the phases of the secondary wave range over an integral number of periods. If now we suppose the aperture AB to be covered by a great number of opaque strips or bars of width d, separated by transparent intervals of width a, the condition of things in the directions just spoken of is not materially changed. At the central point there is still complete agreement of phase; but the amplitude is diminished in the ratio of a : a + d. In another direction, making a small angle with the last, such that the projection of AB upon it amounts to a few wave-lengths, it is easy to see that the mode of interference is the same as if there were no ruling. For example, when the direction is such that the projection of AB upon it amounts to one wave-length, the elementary components neutralize one another, because their phases are distributed symmetrically, though discontinuously, round the entire period. The only effect of the ruling is to diminish the amplitude in the ratio a : a + d; and, except for the difference in illumination, the appearance of a line of light is the same as if the aperture were perfectly free.
The lateral (spectral) images occur in such directions that the projection of the element (a + d) of the grating upon them is an exact multiple of λ. The effect of each of the n elements of the grating is then the same; and, unless this vanishes on account of a particular adjustment of the ratio a : d, the resultant amplitude becomes comparatively very great. These directions, in which the retardation between A and B is exactly mnλ, may be called the principal directions. On either side of any one of them the illumination is distributed according to the same law as for the central image (m = 0), vanishing, for example, when the retardation amounts to (mn ± 1)λ. In considering the relative brightnesses of the different spectra, it is therefore sufficient to attend merely to the principal directions, provided that the whole deviation be not so great that its cosine differs considerably from unity.
We have now to consider the amplitude due to a single element, which we may conveniently regard as composed of a transparent part a bounded by two opaque parts of width ½d. The phase of the resultant effect is by symmetry that of the component which comes from the middle of a. The fact that the other components have phases differing from this by amounts ranging between ± amπ/(a + d) causes the resultant amplitude to be less than for the central image (where there is complete phase agreement).If Bmdenote the brightness of the mthlateral image, and B0that of the central image, we have
If B denotes the brightness of the central image when the whole of the space occupied by the grating is transparent, we have
B0: B = a² : (a + d)²,
and thus
The sine of an angle can never be greater than unity; and consequently under the most favourable circumstances only 1/m²π² of the original light can be obtained in the mthspectrum. We conclude that, with a grating composed of transparent and opaque parts, the utmost light obtainable in any one spectrum is in the first, and there amounts to 1/π², or about1⁄10, and that for this purpose a and d must be equal. When d = a the general formula becomes
showing that, when m is even, Bmvanishes, and that, when m is odd,
Bm: B = 1/m²π².
The third spectrum has thus only1⁄9of the brilliancy of the first.
Another particular case of interest is obtained by supposing a small relatively to (a + d). Unless the spectrum be of very high order, we have simply
Bm: B = {a/(a + d)}² (4);
so that the brightnesses of all the spectra are the same.
The light stopped by the opaque parts of the grating, together with that distributed in the central image and lateral spectra, ought to make up the brightness that would be found in the central image, were all the apertures transparent. Thus, if a = d, we should have
1 = ½ + ¼ + 2/π² (1 +1⁄9+1⁄25+ ...),
which is true by a known theorem. In the general case
a formula which may be verified by Fourier’s theorem.
According to a general principle formulated by J. Babinet, the brightness of a lateral spectrum is not affected by an interchange of the transparent and opaque parts of the grating. The vibrations corresponding to the two parts are precisely antagonistic, since if both were operative the resultant would be zero. So far as the application to gratings is concerned, the same conclusion may be derived from (2).
From the value of Bm: B0we see that no lateral spectrum can surpass the central image in brightness; but this result depends upon the hypothesis that the ruling acts by opacity, which is generally very far from being the case in practice. In an engraved glass grating there is no opaque material present by which light could be absorbed, and the effect depends upon a difference of retardation in passing the alternate parts. It is possible to prepare gratings which give a lateral spectrum brighter than the central image, and the explanation is easy. For if the alternate parts were equal and alike transparent, but so constituted as to give a relative retardation of ½λ, it is evident that the central image would be entirely extinguished, while the first spectrum would be four times as bright as if the alternate parts were opaque. If it were possible to introduce at every part of the aperture of the grating an arbitrary retardation, all the light might be concentrated in any desired spectrum. By supposing the retardation to vary uniformly and continuously we fall upon the case of an ordinary prism: but there is then no diffraction spectrum in the usual sense. To obtain such it would be necessary that the retardation should gradually alter by a wave-length in passing over any element of the grating, and then fall back to its previous value, thus springing suddenly over a wave-length (Phil. Mag., 1874, 47, p. 193). It is not likely that such a result will ever be fully attained in practice; but the case is worth stating, in order to show that there is no theoretical limit to the concentration of light of assigned wave-length in one spectrum, and as illustrating the frequently observed unsymmetrical character of the spectra on the two sides of the central image.4
We have hitherto supposed that the light is incident perpendicularly upon the grating; but the theory is easily extended. If the incident rays make an angle θ with the normal (fig. 6), and the diffracted rays make an angle φ (upon the same side), the relative retardation from each element of width (a + d) to the next is (a + d) (sinθ + sinφ); and this is the quantity which is to be equated to mλ. Thus
sinθ + sinφ = 2sin ½(θ + φ) cos ½ (θ − φ) = mλ/(a + d) (5).
The “deviation” is (θ + φ), and is therefore a minimum when θ = φ,i.e.when the grating is so situated that the angles of incidence and diffraction are equal.
In the case of a reflection grating the same method applies. If θ and φ denote the angles with the normal made by the incident and diffracted rays, the formula (5) still holds, and, if the deviation be reckoned from the direction of the regularly reflected rays, it is expressed as before by (θ + φ), and is a minimum when θ = φ, that is, when the diffracted rays return upon the course of the incident rays.
In either case (as also with a prism) the position of minimum deviation leaves the width of the beam unaltered,i.e.neither magnifies nor diminishes the angular width of the object under view.
From (5) we see that, when the light falls perpendicularly upon a grating (θ = 0), there is no spectrum formed (the image corresponding to m = 0 not being counted as a spectrum), if the grating interval σ or (a + d) is less than λ. Under these circumstances, if the material of the grating be completely transparent, the whole of the light must appear in the direct image, and the ruling is not perceptible. From the absence of spectra Fraunhofer argued that there must be a microscopic limit represented by λ; and the inference is plausible, to say the least (Phil. Mag., 1886). Fraunhofer should, however, have fixed the microscopic limit at ½λ, as appears from (5), when we suppose θ = ½π, φ = ½π.
We will now consider the important subject of the resolving power of gratings, as dependent upon the number of lines (n) and the order of the spectrum observed (m). Let BP (fig. 8) be the direction of the principal maximum (middle of central band) for the wave-length λ in the mthspectrum. Then the relative retardation of the extreme rays (corresponding to the edges A, B of the grating) is mnλ. If BQ be the direction for the first minimum (the darkness between the central and first lateral band), the relative retardation of the extreme rays is (mn + 1)λ. Suppose now that λ + δλ is the wave-length for which BQ gives the principal maximum, then
(mn + 1)λ = mn(λ + δλ);
whence
δλ/λ = 1/mn (6).
According to our former standard, this gives the smallest difference of wave-lengths in a double line which can be just resolved; and we conclude that the resolving power of a grating depends only upon the total number of lines, and upon the order of the spectrum, without regard to any other considerations. It is here of course assumed that the n lines are really utilized.
In the case of the D lines the value of δλ/λ is about 1/1000; so that to resolve this double line in the first spectrum requires 1000 lines, in the second spectrum 500, and so on.
It is especially to be noticed that the resolving power does not depend directly upon the closeness of the ruling. Let us take the case of a grating 1 in. broad, and containing 1000 lines, and consider the effect of interpolating an additional 1000 lines, so as to bisect the former intervals. There will be destruction by interference of the first, third and odd spectra generally; while the advantage gained in the spectra of even order is not in dispersion, nor in resolving power, but simply in brilliancy, which is increased four times. If we now suppose half the grating cut away, so as to leave 1000 lines in half an inch, the dispersion will not be altered, while the brightness and resolving power are halved.
There is clearly no theoretical limit to the resolving power of gratings, even in spectra of given order. But it is possible that, as suggested by Rowland,5the structure of natural spectra may be too coarse to give opportunity for resolving powers much higher than those now in use. However this may be, it would always be possible, with the aid of a grating of given resolving power, to construct artificially from white light mixtures of slightly different wave-length whose resolution or otherwise would discriminate between powers inferior and superior to the given one.6
If we define as the “dispersion” in a particular part of the spectrum the ratio of the angular interval dθ to the corresponding increment of wave-length dλ, we may express it by a very simple formula. For the alteration of wave-length entails, at the two limits of a diffracted wave-front, a relative retardation equal to mndλ. Hence, if a be the width of the diffracted beam, and dθ the angle through which the wave-front is turned,
adθ = mn dλ,
or
dispersion = mn/a (7).
The resolving power and the width of the emergent beam fix the optical character of the instrument. The latter element must eventually be decreased until less than the diameter of the pupil of the eye. Hence a wide beam demands treatment with further apparatus (usually a telescope) of high magnifying power.
In the above discussion it has been supposed that the ruling is accurate, and we have seen that by increase of m a high resolving power is attainable with a moderate number of lines. But this procedure (apart from the question of illumination) is open to the objection that it makes excessive demands upon accuracy. According to the principle already laid down it can make but little difference in the principal direction corresponding to the first spectrum, provided each line lie within a quarter of an interval (a + d) from its theoretical position. But, to obtain an equally good result in the mthspectrum, the error must be less than 1/m of the above amount.7
There are certain errors of a systematic character which demand special consideration. The spacing is usually effected by means of a screw, to each revolution of which corresponds a large number (e.g.one hundred) of lines. In this way it may happen that although there is almost perfect periodicity with each revolution of the screw after (say) 100 lines, yet the 100 lines themselves are not equally spaced. The “ghosts” thus arising were first described by G. H. Quincke (Pogg. Ann., 1872, 146, p. 1), and have been elaborately investigated by C. S. Peirce (Ann. Journ. Math., 1879, 2, p. 330), both theoretically and experimentally. The general nature of the effects to be expected in such a case may be made clear by means of an illustration already employed for another purpose. Suppose two similar and accurately ruled transparent gratings to be superposed in such a manner that the lines are parallel. If the one set of lines exactly bisect the intervals between the others, the grating interval is practically halved, and the previously existing spectra of odd order vanish. But a very slight relative displacement will cause the apparition of the odd spectra. In this case there is approximate periodicity in the half interval, but complete periodicity only after the whole interval. The advantage of approximate bisection lies in the superior brilliancy of the surviving spectra; but in any case the compound grating may be considered to be perfect in the longer interval, and the definition is as good as if the bisection were accurate.
The effect of a gradual increase in the interval (fig. 9) as we pass across the grating has been investigated by M. A. Cornu (C.R., 1875, 80, p. 655), who thus explains an anomaly observed by E. E. N. Mascart. The latter found that certain gratings exercised a converging power upon the spectra formed upon one side, and a corresponding diverging power upon the spectra on the other side. Let us suppose that the light is incident perpendicularly, and that the grating interval increases from the centre towards that edge which lies nearest to the spectrum under observation, and decreases towards the hinder edge. It is evident that the waves frombothhalves of the grating are accelerated in an increasing degree, as we pass from the centre outwards, as compared with the phase they would possess were the central value of the grating interval maintained throughout. The irregularity of spacing has thus the effect of a convex lens, which accelerates the marginal relatively to the central rays. On the other side the effect is reversed. This kind of irregularity may clearly be present in a degree surpassing the usual limits, without loss of definition, when the telescope is focused so as to secure the best effect.
It may be worth while to examine further the other variations from correct ruling which correspond to the various terms expressing the deviation of the wave-surface from a perfect plane. If x and y be co-ordinates in the plane of the wave-surface, the axis of y being parallel to the lines of the grating, and the origin corresponding to the centre of the beam, we may take as an approximate equation to the wave-surface
and, as we have just seen, the term in x² corresponds to a linear error in the spacing. In like manner, the term in y² corresponds to a generalcurvatureof the lines (fig. 10), and does not influence the definition at the (primary) focus, although it may introduce astigmatism.8If we suppose that everything is symmetrical on the two sides of the primary plane y = 0, the coefficients B, β, δ vanish. In spite of any inequality between ρ and ρ’, the definition will be good to this order of approximation, provided α and γ vanish. The former measures thethicknessof the primary focal line, and the latter measures itscurvature. The error of ruling giving rise to α is one in which the intervals increase or decrease inbothdirections from the centre outwards (fig. 11), and it may often be compensated by a slight rotation in azimuth of the object-glass of the observing telescope. The term in γ corresponds to avariationof curvature in crossing the grating (fig. 12).
When the plane zx is not a plane of symmetry, we have to consider the terms in xy, x²y, and y³. The first of these corresponds to a deviation from parallelism, causing the interval to alter gradually as we passalongthe lines (fig. 13). The error thus arising may be compensated by a rotation of the object-glass about one of the diameters y = ± x. The term in x²y corresponds to a deviation from parallelism in the same direction on both sides of the central line (fig. 14); and that in y³ would be caused by a curvature such that there is a point of inflection at the middle of each line (fig. 15).
All the errors, except that depending on α, and especially those depending on γ and δ, can be diminished, without loss of resolving power, by contracting theverticalaperture. A linear error in the spacing, and a general curvature of the lines, are eliminated in the ordinary use of a grating.
The explanation of the difference of focus upon the two sides as due to unequal spacing was verified by Cornu upon gratings purposely constructed with an increasing interval. He has also shown how to rule a plane surface with lines so disposed that the grating shall of itself give well-focused spectra.
A similar idea appears to have guided H. A. Rowland to his brilliant invention of concave gratings, by which spectra can be photographed without any further optical appliance. In these instruments the lines are ruled upon a spherical surface of speculum metal, and mark the intersections of the surface by a system of parallel and equidistant planes, of which the middle member passes through the centre of the sphere. If we consider for the present only the primary plane of symmetry, the figure is reduced to two dimensions. Let AP (fig. 16) represent the surface of the grating, O being the centre of the circle. Then, if Q be any radiant point and Q’ its image (primary focus) in the spherical mirror AP, we have
where v1= AQ′, u = AQ, a = OA, φ = angle of incidence QAO, equal to the angle of reflection Q′AO. If Q be on the circle described upon OA as diameter, so that u = a cos φ, then Q′ lies also upon the same circle; and in this case it follows from the symmetry that the unsymmetrical aberration (depending upon a) vanishes.
This disposition is adopted in Rowland′s instrument; only, in addition to the central image formed at the angle φ′ = φ, there are a series of spectra with various values of φ’, but all disposed upon the same circle. Rowland’s investigation is contained in the paper already referred to; but the following account of the theory is in the form adopted by R. T. Glazebrook (Phil. Mag., 1883).
In order to find the difference of optical distances between the courses QAQ′, QPQ′, we have to express QP − QA, PQ′ − AQ′. To find the former, we have, if OAQ = φ, AOP = ω,
QP² = u² + 4a²sin²½ω − 4au sin ½ω sin (½ω − φ)= (u + a sin φ sin ω)² − a² sin²φ sin²ω + 4a sin² ½ω(a − u cosφ).
Now as far as ω4
4 sin² ½ω = sin²ω + ¼sin4ω,
and thus to the same order
QP² = (u + a sin φ sin ω)²− a cos φ(u − a cos φ) sin²ω + ¼ a(a − u cos φ) sin4ω.
pose that Q lies on the circle u = a cos φ, the middle term vanishes, and we get, correct as far as ω4,
so that
QP − u = a sin φ sin ω +1⁄8a sin φ tan φ sin4ω (9),
in which it is to be noticed that the adjustment necessary to secure the disappearance of sin²ω is sufficient also to destroy the term in sin³ω.
A similar expression can be found for Q’P − Q′A; and thus, if Q′A = v, Q′AO = φ′, where v = a cos φ′, we get
QP + PQ′ − QA -AQ′ = a sin ω (sin φ − sin φ′)+1⁄8a sin4ω (sin φ tan φ + sin φ′ tan φ′) (10).
If φ′ = φ, the term of the first order vanishes, and the reduction of the difference of pathviaP andviaA to a term of the fourth order proves not only that Q and Q′ are conjugate foci, but also that the foci are exempt from the most important term in the aberration. In the present application φ′ is not necessarily equal to φ; but if P correspond to a line upon the grating, the difference of retardations for consecutive positions of P, so far as expressed by the term of the first order, will be equal to ± mλ (m integral), and therefore without influence, provided
σ (sin φ − sinφ′) = ± mλ (11),
where σ denotes the constant interval between the planes containing the lines. This is the ordinary formula for a reflecting plane grating, and it shows that the spectra are formed in the usual directions. They are here focused (so far as the rays in the primary plane are concerned) upon the circle OQ′A, and the outstanding aberration is of the fourth order.
In order that a large part of the field of view may be in focus at once, it is desirable that the locus of the focused spectrum should be nearly perpendicular to the line of vision. For this purpose Rowland places the eye-piece at O, so that φ = 0, and then by (11) the value of φ′ in the mthspectrum is
σ sin φ’ = ± mλ (12).
If ω now relate to the edge of the grating, on which there are altogether n lines,
nσ = 2a sin ω,
and the value of the last term in (10) becomes
1⁄16nσsin³ ω sin φ′ tan φ′,
or
1⁄16mnλ sin³ω tan φ′ (13).
This expresses the retardation of the extreme relatively to the central ray, and is to be reckoned positive, whatever may be the signs of ω, and φ′. If the semi-angular aperture (ω) be1⁄100, and tan φ′ = 1, mn might be as great as four millions before the error of phase would reach ¼λ. If it were desired to use an angular aperture so large that the aberration according to (13) would be injurious, Rowland points out that on his machine there would be no difficulty in applying a remedy by making σ slightly variable towards the edges. Or, retaining σ constant, we might attain compensation by so polishing the surface as to bring the circumference slightly forward in comparison with the position it would occupy upon a true sphere.
It may be remarked that these calculations apply to the rays in the primary plane only. The image is greatly affected with astigmatism; but this is of little consequence, if γ in (8) be small enough. Curvature of the primary focal line having a very injurious effect upon definition, it may be inferred from the excellent performance of these gratings that γ is in fact small. Its value does not appear to have been calculated. The other coefficients in (8) vanish in virtue of the symmetry.
The mechanical arrangements for maintaining the focus are of great simplicity. The grating at A and the eye-piece at O are rigidly attached to a bar AO, whose ends rest on carriages, moving on rails OQ, AQ at right angles to each other. A tie between the middle point of the rod OA and Q can be used if thought desirable.
The absence of chromatic aberration gives a great advantage in the comparison of overlapping spectra, which Rowland has turned to excellent account in his determinations of the relative wave-lengths of lines in the solar spectrum (Phil. Mag., 1887).
For absolute determinations of wave-lengths plane gratings are used. It is found (Bell,Phil. Mag., 1887) that the angular measurements present less difficulty than the comparison of the grating interval with the standard metre. There is also some uncertainty as to the actual temperature of the grating when in use. In order to minimize the heating action of the light, it might be submitted to a preliminary prismatic analysis before it reaches the slit of the spectrometer, after the manner of Helmholtz.
In spite of the many improvements introduced by Rowland and of the care with which his observations were made, recent workers have come to the conclusion that errors of unexpected amount have crept into his measurements of wave-lengths, and there is even a disposition to discard the grating altogether for fundamental work in favour of the so-called “interference methods,” as developed by A. A. Michelson, and by C. Fabry and J. B. Pérot. The grating would in any case retain its utility for the reference of new lines to standards otherwise fixed. For such standards a relative accuracy of at least one part in a million seems now to be attainable.
Since the time of Fraunhofer many skilled mechanicians have given their attention to the ruling of gratings. Those of Nobert were employed by A. J. Ångström in his celebrated researches upon wave-lengths. L. M. Rutherfurd introduced into common use the reflection grating, finding that speculum metal was less trying than glass to the diamond point, upon the permanence of which so much depends. In Rowland’s dividing engine the screws were prepared by a special process devised by him, and the resulting gratings, plane and concave, have supplied the means for much of the best modern optical work. It would seem, however, that further improvements are not excluded.
There are various copying processes by which it is possible to reproduce an original ruling in more or less perfection. The earliest is that of Quincke, who coated a glass grating with a chemical silver deposit, subsequently thickened with copper in an electrolytic bath. The metallic plate thus produced formed, when stripped from its support, a reflection grating reproducing many of the characteristics of the original. It is best to commence the electrolytic thickening in a silver acetate bath. At the present time excellent reproductions of Rowland’s speculum gratings are on the market (Thorp, Ives, Wallace), prepared, after a suggestion of Sir David Brewster, by coating the original with a varnish,e.g.of celluloid. Much skill is required to secure that the film when stripped shall remain undeformed.
A much easier method, applicable to glass originals, is that of photographic reproduction by contact printing. In several papers dating from 1872, Lord Rayleigh (seeCollected Papers, i. 157, 160, 199, 504; iv. 226) has shown that success may be attained by a variety of processes, including bichromated gelatin and the old bitumen process, and has investigated the effect of imperfect approximation during the exposure between the prepared plate and the original. For many purposes the copies, containing lines up to 10,000 to the inch, are not inferior. It is to be desired that transparent gratings should be obtained from first-class ruling machines. To save the diamond point it might be possible to use something softer than ordinary glass as the material of the plate.
9.Talbot’s Bands.—These very remarkable bands are seen under certain conditions when a tolerably pure spectrum is regarded with the naked eye, or with a telescope,half the aperture being covered by a thin plate,e.g.of glass or mica. The view of the matter taken by the discoverer (Phil. Mag., 1837, 10, p. 364) was that any ray which suffered in traversing the plate a retardation of an odd number of half wave-lengths would be extinguished, and that thus the spectrum would be seen interrupted by a number of dark bars. But this explanation cannot be accepted as it stands, being open to the same objection as Arago’s theory of stellar scintillation.9It is as far as possible from being true that a body emitting homogeneous light would disappear on merely covering half the aperture of vision with a half-wave plate. Such a conclusion would be in the face of the principle of energy, which teaches plainly that the retardation in question leaves the aggregate brightness unaltered. The actual formation ofthe bands comes about in a very curious way, as is shown by a circumstance first observed by Brewster. When the retarding plate is held on the side towards the red of the spectrum,the bands are not seen. Even in the contrary case, the thickness of the plate must not exceed a certain limit, dependent upon the purity of the spectrum. A satisfactory explanation of these bands was first given by Airy (Phil. Trans., 1840, 225; 1841, 1), but we shall here follow the investigation of Sir G. G. Stokes (Phil. Trans., 1848, 227), limiting ourselves, however, to the case where the retarded and unretarded beams are contiguous and of equal width.
The aperture of the unretarded beam may thus be taken to be limited by x = -h, x = 0, y = -l, y= +l; and that of the beam retarded by R to be given by x = 0, x = h, y= -l, y = +l. For the former (1) § 3 gives
on integration and reduction.
For the retarded stream the only difference is that we must subtract R from at, and that the limits of x are 0 and +h. We thus get for the disturbance at ξ, η, due to this stream
If we put for shortness π for the quantity under the last circular function in (1), the expressions (1), (2) may be put under the forms u sin τ, v sin (τ − α) respectively; and, if I be the intensity, I will be measured by the sum of the squares of the coefficients of sin τ and cos τ in the expression
u sin τ + v sin (τ − α),
so that
I = u² + v² + 2uv cos α,
which becomes on putting for u, v, and α their values, and putting
If the subject of examination be a luminous line parallel to η, we shall obtain what we require by integrating (4) with respect to η from −∞ to +∞. The constant multiplier is of no especial interest so that we may take as applicable to the image of a line