Principle of Relativity
At the present time, different opinions are being held about the fundamental equations of Electro-dynamics for moving bodies. The Hertzian[9]forms must be given up, for it has appeared that they are contrary to many experimental results.
In 1895 H. A. Lorentz[10]published his theory of optical and electrical phenomena in moving bodies; this theory was based upon the atomistic conception (vorstellung) of electricity, and on account of its great success appears to have justified the bold hypotheses, by which it has been ushered into existence. In his theory, Lorentz proceeds from certain equations, which must hold at every point of “Äther”; then by forming the average values over “Physically infinitely small” regions, which however contain large numbers of electrons, the equations for electro-magnetic processes in moving bodies can be successfully built up.
In particular, Lorentz’s theory gives a good account of the non-existence of relative motion of the earth and the luminiferous “Äther”; it shows that this fact is intimately connected with the covariance of the original equation, when certain simultaneous transformations of the space and time co-ordinates are effected; these transformations have therefore obtained from H. Poincare[11]the name of Lorentz-transformations. The covariance of these fundamental equations, when subjected to the Lorentz-transformation is a purely mathematical facti.e.not based on any physical considerations; I will call this the Theorem of Relativity; this theorem rests essentially on the form of the differential equations for the propagation of waves with the velocity of light.
Now withoutrecognizingany hypothesis about the connection between “Äther” and matter, we can expect these mathematically evident theorems to have their consequences so far extended—that thereby even those laws of ponderable media which are yet unknown may anyhow possess this covariance when subjected to a Lorentz-transformation; by saying this, we do not indeed express an opinion, but rather a conviction,—and this conviction I may be permitted to call the Postulate of Relativity. The position of affairs here is almost the same as when the Principle of Conservation of Energy was postulated in cases, where the corresponding forms of energy were unknown.
Now if hereafter, we succeed in maintaining this covariance as a definite connection between pure and simple observable phenomena in moving bodies, the definite connection may be styled ‘the Principle of Relativity.’
These differentiations seem to me to be necessary for enabling us to characterise the present day position of the electro-dynamics for moving bodies.
H. A. Lorentz[12]has found out the “Relativity theorem” and has created the Relativity-postulate as a hypothesis that electrons and matter suffer contractions in consequence of their motion according to a certain law.
A. Einstein[13]has brought out the point very clearly, that this postulate is not an artificial hypothesis but is rather a new way of comprehending the time-concept which is forced upon us by observation of natural phenomena.
The Principle of Relativity has not yet been formulated for electro-dynamics of moving bodies in the sense characterized by me. In the present essay, while formulating this principle, I shall obtain the fundamental equations for moving bodies in a sense which is uniquely determined by this principle.
But it will be shown that none of the forms hitherto assumed for these equations can exactly fit in with this principle.[14]
We would at first expect that the fundamental equations which are assumed by Lorentz for moving bodies would correspond to the Relativity Principle. But it will be shown that this is not the case for the general equations which Lorentz has for any possible, and also for magnetic bodies; but this is approximately the case (if neglect the square of the velocity of matter in comparison to the velocity of light) for those equations which Lorentz hereafter infers for non-magnetic bodies. But this latter accordance with the Relativity Principle is due to the fact that the condition of non-magnetisation has been formulated in a way not corresponding to the Relativity Principle; therefore the accordance is due to the fortuitous compensation of two contradictions to the Relativity-Postulate. But meanwhile enunciation of the Principle in a rigid manner does not signify any contradiction to the hypotheses of Lorentz’s molecular theory, but it shall become clear that the assumption of the contraction of the electron in Lorentz’s theory must be introduced at an earlier stage than Lorentz has actually done.
In an appendix, I have gone into discussion of the position of Classical Mechanics with respect to the Relativity Postulate. Any easily perceivable modification of mechanics for satisfying the requirements of the Relativity theory would hardly afford any noticeable difference in observable processes; but would lead to very surprising consequences. By laying down the Relativity-Postulate from the outset, sufficient means have been created for deducing henceforth the complete series of Laws of Mechanics from the principle of conservation of Energy alone (the form of the Energy being given in explicit forms).
Let a rectangular system (x,y,z,t,) of reference be given in space and time. The unit of time shall be chosen in such a manner with reference to the unit of length that the velocity of light in space becomes unity.
Although I would prefer not to change the notations used by Lorentz, it appears important to me to use a different selection of symbols, for thereby certain homogeneity will appear from the very beginning. I shall denote the vector electric force by E, the magnetic induction by M, the electric induction byeand the magnetic force bym, so that (E, M,e,m) are used instead of Lorentz’s (E, B, D, H) respectively.
I shall further make use of complex magnitudes in a way which is not yet current in physical investigations,i.e., instead of operating with (t), I shall operate with (i t), whereidenotes √(-1). If now instead of (x,y,z,i t), I use the method of writing with indices, certain essential circumstances will come into evidence; on this will be based a general use of the suffixes (1, 2, 3, 4). The advantage of this method will be, as I expressly emphasize here, that we shall have to handle symbols which have apparently a purely real appearance; we can however at any moment pass to real equations if it is understood that of the symbols with indices, such ones as have the suffix 4 only once, denote imaginary quantities, while those which have not at all the suffix 4, or have it twice denote real quantities.
An individual system of values of (x,y,z,t)i. e., of (x₁x₂x₃x₄) shall be called a space-time point.
Further letudenote the velocity vector of matter, ε the dielectric constant, μ the magnetic permeability, σ the conductivity of matter, while ρ denotes the density of electricity in space, andxthe vector of “Electric Current” which we shall some across in§7and§8.
By using the electron theory, Lorentz in his above mentioned essay traces the Laws of Electro-dynamics of Ponderable Bodies to still simpler laws. Let us now adhere to these simpler laws, whereby we require that for the limiting case ε = 1, μ = 1, σ = 0, they should constitute the laws for ponderable bodies. In this ideal limiting case ε = 1, μ = 1, σ = 0, E will be equal toe, and M tom. At every space time point (x,y,z,t) we shall have the equations[15]
(i) Curlm- (δe/δt) = ρu(ii) dive= ρ(iii) Curle+ δm/δt= 0(iv) div m = 0
(i) Curlm- (δe/δt) = ρu(ii) dive= ρ(iii) Curle+ δm/δt= 0(iv) div m = 0
(i) Curlm- (δe/δt) = ρu
(i) Curlm- (δe/δt) = ρu
(ii) dive= ρ
(ii) dive= ρ
(iii) Curle+ δm/δt= 0
(iii) Curle+ δm/δt= 0
(iv) div m = 0
(iv) div m = 0
I shall now write (x₁x₂x₃x₄) for (x,y,z,t) and (ρ₁, ρ₂, ρ₃, ρ₄) for
Formula.
i.e.the components of the convection current ρu, and the electric density multiplied by √ -1
Further I shall write
f2 3,f3 1,f1 2,f1 4,f2 4,f3 4.
f2 3,f3 1,f1 2,f1 4,f2 4,f3 4.
f2 3,f3 1,f1 2,f1 4,f2 4,f3 4.
f2 3,f3 1,f1 2,f1 4,f2 4,f3 4.
for
mx, my, mz, -iex, -iey, -iez.
mx, my, mz, -iex, -iey, -iez.
mx, my, mz, -iex, -iey, -iez.
mx, my, mz, -iex, -iey, -iez.
i.e., the components of m and (-i.e.) along the three axes; now if we take any two indices (h. k) out of the series
3, 4),fk h= -fk h,
3, 4),fk h= -fk h,
3, 4),fk h= -fk h,
3, 4),fk h= -fk h,
Therefore
f₃₂= -f₂₃,f₁₃= -f₃₁,f₂₁= -f₁₂f₄₁= -f₁₄,f₄₄= -f₂₄,f₄₃= -f₃₄
f₃₂= -f₂₃,f₁₃= -f₃₁,f₂₁= -f₁₂f₄₁= -f₁₄,f₄₄= -f₂₄,f₄₃= -f₃₄
f₃₂= -f₂₃,f₁₃= -f₃₁,f₂₁= -f₁₂f₄₁= -f₁₄,f₄₄= -f₂₄,f₄₃= -f₃₄
f₃₂= -f₂₃,f₁₃= -f₃₁,f₂₁= -f₁₂
f₄₁= -f₁₄,f₄₄= -f₂₄,f₄₃= -f₃₄
Then the three equations comprised in (i), and the equation (ii) multiplied by i becomes
Formula."Formula A."
"Formula A."
"Formula A."
On the other hand, the three equations comprised in (iii) and the (iv) equation multiplied by (i) becomes
Formula."Formula B."
"Formula B."
"Formula B."
By means of this method of writing we at once notice the perfect symmetry of the 1st as well as the 2nd system of equations as regards permutation with the indices, (1, 2, 3, 4).
It is well-known that by writing the equations i) to iv) in the symbol of vector calculus, we at once set in evidence an invariance (or rather a (covariance) of the system of equationsA)as well as ofB), when the co-ordinate system is rotated through a certain amount round the null-point. For example, if we take a rotation of the axes round the z-axis, through an amount φ, keeping e, m fixed in space, and introduce new variablesx₁′x₂′x₃′x₄′instead ofx₁x₂x₃x₄wherex′₁=x₁cos φ +x₂sin φ,x′₂= -x₁sin φ +x₂cos φ,x′₃=x₃,x′₄=x₄, and introduce magnitudes ρ′₁, ρ′₂, ρ′₃, ρ′₄, where ρ₁′ = ρ₁ cos φ + ρ₂ sin φ, ρ₂′ = - ρ₁ sin φ + ρ₂ cos φ andf′1 2, ... ...f′3 4, where
f′₂₃=f₂₃cos φ +f₃₁sin φ,f′₃₁= -f₂₃sin φ +f₃₁cos φ,f′₁₂=f₁₂,f′₁₄=f₁₄cos φ +f₂₄sin φ,f′₂₄= -f₁₄sin φ +f₂₄cos φ,f′₃₄=f₃₄3 4,f′k h= -fk h(h l k = 1, 2, 3, 4).
f′₂₃=f₂₃cos φ +f₃₁sin φ,f′₃₁= -f₂₃sin φ +f₃₁cos φ,f′₁₂=f₁₂,f′₁₄=f₁₄cos φ +f₂₄sin φ,f′₂₄= -f₁₄sin φ +f₂₄cos φ,f′₃₄=f₃₄3 4,f′k h= -fk h(h l k = 1, 2, 3, 4).
f′₂₃=f₂₃cos φ +f₃₁sin φ,f′₃₁= -f₂₃sin φ +f₃₁cos φ,f′₁₂=f₁₂,f′₁₄=f₁₄cos φ +f₂₄sin φ,f′₂₄= -f₁₄sin φ +f₂₄cos φ,f′₃₄=f₃₄3 4,f′k h= -fk h(h l k = 1, 2, 3, 4).
f′₂₃=f₂₃cos φ +f₃₁sin φ,
f′₃₁= -f₂₃sin φ +f₃₁cos φ,
f′₁₂=f₁₂,
f′₁₄=f₁₄cos φ +f₂₄sin φ,
f′₂₄= -f₁₄sin φ +f₂₄cos φ,
f′₃₄=f₃₄3 4,
f′k h= -fk h(h l k = 1, 2, 3, 4).
then out of the equations (A) would follow a corresponding system of dashed equations (A´) composed of the newly introduced dashed magnitudes.
So upon the ground of symmetry alone of the equations (A) and (B) concerning thesuffixes(1, 2, 3, 4), the theorem of Relativity, which was found out by Lorentz, follows without any calculation at all.
I will denote byiψ a purely imaginary magnitude, and consider the substitution
x₁′=x₁,x₂′=x₂,x₃′=x₃cosiψ +x₄siniψ, (1)x₄′´ = -x₃siniψ +x₄cosiψ,
x₁′=x₁,x₂′=x₂,x₃′=x₃cosiψ +x₄siniψ, (1)x₄′´ = -x₃siniψ +x₄cosiψ,
x₁′=x₁,x₂′=x₂,x₃′=x₃cosiψ +x₄siniψ, (1)x₄′´ = -x₃siniψ +x₄cosiψ,
x₁′=x₁,
x₂′=x₂,
x₃′=x₃cosiψ +x₄siniψ, (1)
x₄′´ = -x₃siniψ +x₄cosiψ,
Putting
Formula."(2)."
"(2)."
"(2)."
We shall have cosiψ = 1/√(1 -q²), siniψ =iq/√(1 -q²)
where -1 Let us now writex′₁=x′,x′₂=y′,x′₃=z′,x′₄=it′(3) then the substitution 1) takes the form x′=x,y′=y,z′= (z-qt)/√(1 -q²),t′= (-qz+t)/√(1 -q²), (4) x′=x,y′=y,z′= (z-qt)/√(1 -q²),t′= (-qz+t)/√(1 -q²), (4) x′=x,y′=y,z′= (z-qt)/√(1 -q²),t′= (-qz+t)/√(1 -q²), (4) x′=x,y′=y,z′= (z-qt)/√(1 -q²),t′= (-qz+t)/√(1 -q²), (4) the coefficients being essentially real. If now in the above-mentioned rotation round the
Z-axis, we replace 1, 2, 3, 4 throughout by 3, 4, 1, 2, and
φ byiψ, we at once perceive that simultaneously, new
magnitudes ρ′₁, ρ′₂, ρ′₃, ρ′₄, where ρ′₁ = ρ₁, ρ′₂ = ρ₂, ρ′₃ = ρ₃ cosiψ + ρ₄ siniψ,ρ′₄ = - ρ₃ siniψ + ρ₄ cosiψ), ρ′₁ = ρ₁, ρ′₂ = ρ₂, ρ′₃ = ρ₃ cosiψ + ρ₄ siniψ,ρ′₄ = - ρ₃ siniψ + ρ₄ cosiψ), ρ′₁ = ρ₁, ρ′₂ = ρ₂, ρ′₃ = ρ₃ cosiψ + ρ₄ siniψ,ρ′₄ = - ρ₃ siniψ + ρ₄ cosiψ), ρ′₁ = ρ₁, ρ′₂ = ρ₂, ρ′₃ = ρ₃ cosiψ + ρ₄ siniψ, ρ′₄ = - ρ₃ siniψ + ρ₄ cosiψ), andf′1 2...f′3 4, where f′4 1=f4 1cosiψ +f1 3siniψ,f′1 3= -f4 1siniψ +f1 3cosiψ,f′3 4=f3 4,f′3 2=f3 2cosiψ +f4 2siniψ,f′4 2= -f3 2siniψ +f4 2cosiψ,f′1 2=f1 2,fk h= -f′k h, f′4 1=f4 1cosiψ +f1 3siniψ,f′1 3= -f4 1siniψ +f1 3cosiψ,f′3 4=f3 4,f′3 2=f3 2cosiψ +f4 2siniψ,f′4 2= -f3 2siniψ +f4 2cosiψ,f′1 2=f1 2,fk h= -f′k h, f′4 1=f4 1cosiψ +f1 3siniψ,f′1 3= -f4 1siniψ +f1 3cosiψ,f′3 4=f3 4,f′3 2=f3 2cosiψ +f4 2siniψ,f′4 2= -f3 2siniψ +f4 2cosiψ,f′1 2=f1 2,fk h= -f′k h, f′4 1=f4 1cosiψ +f1 3siniψ, f′1 3= -f4 1siniψ +f1 3cosiψ, f′3 4=f3 4, f′3 2=f3 2cosiψ +f4 2siniψ, f′4 2= -f3 2siniψ +f4 2cosiψ, f′1 2=f1 2,fk h= -f′k h, must be introduced. Then the systems of equations in
(A) and (B) are transformed into equations (A´), and (B´),
the new equations being obtained by simply dashing the
old set. All these equations can be written in purely real figures,
and we can then formulate the last result as follows. If the real transformations 4) are taken, andx´y´z´t´be taken as a new frame of reference, then we shall have (5) ρ´ = ρ [(-quz+ 1)/√(1 -q²)],ρ´uz´ = ρ[(uz-q)/√(1 -q²)],ρ´ux´ = ρux,ρ´uy´ = ρuy.(6)e´x´= (ex-qmy)/(√(1 -q²)),m´r´= (qex+my)/(√(1 -q²)),e´z´=ez.(7)m´x´= (mx-qey)/(√(1 -q²)),e´y´= (qmx+ey)/(√(1 -q²)),m´z´=mz. (5) ρ´ = ρ [(-quz+ 1)/√(1 -q²)],ρ´uz´ = ρ[(uz-q)/√(1 -q²)],ρ´ux´ = ρux,ρ´uy´ = ρuy.(6)e´x´= (ex-qmy)/(√(1 -q²)),m´r´= (qex+my)/(√(1 -q²)),e´z´=ez.(7)m´x´= (mx-qey)/(√(1 -q²)),e´y´= (qmx+ey)/(√(1 -q²)),m´z´=mz. (5) ρ´ = ρ [(-quz+ 1)/√(1 -q²)],ρ´uz´ = ρ[(uz-q)/√(1 -q²)],ρ´ux´ = ρux,ρ´uy´ = ρuy. (5) ρ´ = ρ [(-quz+ 1)/√(1 -q²)], ρ´uz´ = ρ[(uz-q)/√(1 -q²)], ρ´ux´ = ρux, ρ´uy´ = ρuy. (6)e´x´= (ex-qmy)/(√(1 -q²)),m´r´= (qex+my)/(√(1 -q²)),e´z´=ez. (6)e´x´= (ex-qmy)/(√(1 -q²)), m´r´= (qex+my)/(√(1 -q²)), e´z´=ez. (7)m´x´= (mx-qey)/(√(1 -q²)),e´y´= (qmx+ey)/(√(1 -q²)),m´z´=mz. (7)m´x´= (mx-qey)/(√(1 -q²)), e´y´= (qmx+ey)/(√(1 -q²)), m´z´=mz. Then we have for these newly introduced vectorsu´,e´,m´(with componentsux´,uy´,uz´;ex´,ey´,ez´;mx´,my´,mz´), and the quantity ρ´ a series of equations I´), II´),
III´), IV´) which are obtained from I), II), III), IV) by
simply dashing the symbols. We remark here thatex-qmy,ey+qmxare components
of the vectore+ [vm], wherevis a vector in the direction
of the positive Z-axis, and |v| =q, and [vm] is the vector
product ofvandm; similarly -qex+my,mx+qeyare the
components of the vectorm- [ve]. The equations 6) and 7), as they stand in pairs, can be
expressed as. e′x′+im′x′= (ex+imx) cosiψ + (ey+imy) siniψ,e′y′+im′y′= - (ex+imx) siniψ + (ey+imy) cosiψ,e′z′+im′z′=e′z+imz. e′x′+im′x′= (ex+imx) cosiψ + (ey+imy) siniψ,e′y′+im′y′= - (ex+imx) siniψ + (ey+imy) cosiψ,e′z′+im′z′=e′z+imz. e′x′+im′x′= (ex+imx) cosiψ + (ey+imy) siniψ, e′x′+im′x′= (ex+imx) cosiψ + (ey+imy) siniψ, e′y′+im′y′= - (ex+imx) siniψ + (ey+imy) cosiψ, e′y′+im′y′= - (ex+imx) siniψ + (ey+imy) cosiψ, e′z′+im′z′=e′z+imz. e′z′+im′z′=e′z+imz. If φ denotes any other real angle, we can form the
following combinations:— (e′x′+im′x′) cos. φ + (e′y″+im′y′) sin φ= (ex+imx) cos. (φ +iψ) + (ey+imy) sin (φ +iψ),= (e′x′+im′x′) sin φ + (e′y′+im′y′) cos. φ= - (ex+imx) sin (φ +iψ) + (ey+imy) cos. (φ +iψ). (e′x′+im′x′) cos. φ + (e′y″+im′y′) sin φ= (ex+imx) cos. (φ +iψ) + (ey+imy) sin (φ +iψ),= (e′x′+im′x′) sin φ + (e′y′+im′y′) cos. φ= - (ex+imx) sin (φ +iψ) + (ey+imy) cos. (φ +iψ). (e′x′+im′x′) cos. φ + (e′y″+im′y′) sin φ (e′x′+im′x′) cos. φ + (e′y″+im′y′) sin φ = (ex+imx) cos. (φ +iψ) + (ey+imy) sin (φ +iψ), = (ex+imx) cos. (φ +iψ) + (ey+imy) sin (φ +iψ), = (e′x′+im′x′) sin φ + (e′y′+im′y′) cos. φ = (e′x′+im′x′) sin φ + (e′y′+im′y′) cos. φ = - (ex+imx) sin (φ +iψ) + (ey+imy) cos. (φ +iψ). = - (ex+imx) sin (φ +iψ) + (ey+imy) cos. (φ +iψ). The rôle which is played by the Z-axis in the transformation
(4) can easily be transferred to any other axis
when the system of axes are subjected to a transformation
about this last axis. So we came to a more general
law:— Letvbe a vector with the componentsvx,vy,vz,
and let |v| =q< 1. Byṽwe shall denote any vector
which is perpendicular tov, and byrv,rṽwe shall denote
components ofrin direction ofṽandv. Instead of (x,y,z,t), new magnetudes (x′y′z′t′) will
be introduced in the following way. If for the sake of
shortness,ris written for the vector with the components
(x,y,z) in the first system of reference,r′for the same
vector with the components (x′y′z′) in the second system
of reference, then for the direction ofv, we have (10)r′v= (rv-qt)/√(1 -q²) (10)r′v= (rv-qt)/√(1 -q²) (10)r′v= (rv-qt)/√(1 -q²) (10)r′v= (rv-qt)/√(1 -q²) and for the perpendicular directionṽ, (11)r′ṽ=rṽ (11)r′ṽ=rṽ (11)r′ṽ=rṽ (11)r′ṽ=rṽ and further (12)t′= (-qrv+t)/√(1 -q²). The notations (r′ṽ,r′v) are to be understood in the sense
that with the directionsv, and every directionṽperpendicular
tovin the system (x,y,z) are always associated
the directions with the same direction cosines in the system
(x′y′z′). A transformation which is accomplished by means of
(10), (11), (12) with the condition 0 If further ρ′ and the vectorsu′,e′,m′, in the system
(x′y′z′) are so defined that, (13) ρ′ = ρ[(-quv+ 1)/√(1 -q²)],ρ′u′v= ρ(uv-q)/√(1 -q²),ρ′uṽ= ρ′uv, (13) ρ′ = ρ[(-quv+ 1)/√(1 -q²)],ρ′u′v= ρ(uv-q)/√(1 -q²),ρ′uṽ= ρ′uv, (13) ρ′ = ρ[(-quv+ 1)/√(1 -q²)],ρ′u′v= ρ(uv-q)/√(1 -q²),ρ′uṽ= ρ′uv, (13) ρ′ = ρ[(-quv+ 1)/√(1 -q²)], ρ′u′v= ρ(uv-q)/√(1 -q²), ρ′uṽ= ρ′uv, further (14) (e′+im′)ṽ= ((e+im) -i[v, (e+im])']ṽ)/√(1 -q²).(15) (e′+im′)v= (e+im) -i[u, (e+im)]v. (14) (e′+im′)ṽ= ((e+im) -i[v, (e+im])']ṽ)/√(1 -q²).(15) (e′+im′)v= (e+im) -i[u, (e+im)]v. (14) (e′+im′)ṽ= ((e+im) -i[v, (e+im])']ṽ)/√(1 -q²). (14) (e′+im′)ṽ= ((e+im) -i[v, (e+im])']ṽ)/√(1 -q²). (15) (e′+im′)v= (e+im) -i[u, (e+im)]v. (15) (e′+im′)v= (e+im) -i[u, (e+im)]v. Then it follows that the equations I), II), III), IV) are
transformed into the corresponding system with dashes. The solution of the equations (10), (11), (12) leads to (16)rv= (r′v+qt′)/√(1 -q²),rṽ=r′ṽ,t= (qr′v+t′)/√(1 -q²), (16)rv= (r′v+qt′)/√(1 -q²),rṽ=r′ṽ,t= (qr′v+t′)/√(1 -q²), (16)rv= (r′v+qt′)/√(1 -q²),rṽ=r′ṽ,t= (qr′v+t′)/√(1 -q²), (16)rv= (r′v+qt′)/√(1 -q²), rṽ=r′ṽ, t= (qr′v+t′)/√(1 -q²), Now we shall make a very important observation
about the vectorsuandu′. We can again introduce
the indices 1, 2, 3, 4, so that we write (x₁′,x₂′,x₃′,x₄′)
instead of (x′,y′,z′,it′) and ρ₁′, ρ₂′, ρ₃′, ρ₄′ instead of
(ρ′u′{x′}, ρ′u′{y′}, ρ′u′{z′},iρ′). Like the rotation round the Z-axis, the transformation
(4), and more generally the transformations (10), (11),
(12), are also linear transformations with the determinant
+ 1, so that (17)x₁²+x₂²+x₃²+x₄²i. e.x²+y²+z²-t², (17)x₁²+x₂²+x₃²+x₄²i. e.x²+y²+z²-t², (17)x₁²+x₂²+x₃²+x₄²i. e.x²+y²+z²-t², (17)x₁²+x₂²+x₃²+x₄²i. e.x²+y²+z²-t², is transformed into x₁′²+x₂′²+x₃′²+x₄′²i. e.x′²+y′²+z′²-t′². x₁′²+x₂′²+x₃′²+x₄′²i. e.x′²+y′²+z′²-t′². x₁′²+x₂′²+x₃′²+x₄′²i. e.x′²+y′²+z′²-t′². x₁′²+x₂′²+x₃′²+x₄′²i. e.x′²+y′²+z′²-t′². On the basis of the equations (13), (14), we shall have
(ρ₁² + ρ₂² + ρ₃² + ρ₄²) = ρ²(1 -ux², -uy², -uz²) = ρ²(1 -u²)
transformed into ρ²(1 -u²) or in other words, (18) ρ√(1 -u²) (18) ρ√(1 -u²) (18) ρ√(1 -u²) (18) ρ√(1 -u²) is an invariant in a Lorentz-transformation. If we divide (ρ₁, ρ₂, ρ₃, ρ₄) by this magnitude, we obtain
the four values (ω₁, ω₂, ω₃, ω₄) = (1/√(1 -u²))(ux,uy,uz,i)
so that ω₁² + ω₂² + ω₃² + ω₄² = -1. It is apparent that these four values are determined
by the vectoruand inversely the vectoruof magnitude
< 1 follows from the 4 values ω₁, ω₂, ω₃, ω₄; where
(ω₁, ω₂, ω₃) are real, -iω₄ real and positive and condition
(19) is fulfilled. The meaning of (ω₁, ω₂, ω₃, ω₄) here is, that they are
the ratios ofdx₁,dx₂,dx₃,dx₄to (20) √(-(dx₁²+dx₂²+dx₃²+dx₄²)) =dt√(1 -u²). (20) √(-(dx₁²+dx₂²+dx₃²+dx₄²)) =dt√(1 -u²). (20) √(-(dx₁²+dx₂²+dx₃²+dx₄²)) =dt√(1 -u²). (20) √(-(dx₁²+dx₂²+dx₃²+dx₄²)) =dt√(1 -u²). The differentials denoting the displacements of matter
occupying the spacetime point (x₁,x₂,x₃,x₄) to the
adjacent space-time point. After the Lorentz-transformation is accomplished the
velocity of matter in the new system of reference for the
same space-time point (x′y′z′t′) is the vectoru′with the
ratiosdx′/dt′,dy′/dt′,dz′/dt′,dl′/dt′, as components. Now it is quite apparent that the system of values x₁= ω₁,x₂= ω₂,x₃= ω₃,x₄= ω₄ x₁= ω₁,x₂= ω₂,x₃= ω₃,x₄= ω₄ x₁= ω₁,x₂= ω₂,x₃= ω₃,x₄= ω₄ x₁= ω₁,x₂= ω₂,x₃= ω₃,x₄= ω₄ is transformed into the values x₁′= ω₁′,x₂′= ω₂′,x₃′= ω₃′,x₄′= ω₄′ x₁′= ω₁′,x₂′= ω₂′,x₃′= ω₃′,x₄′= ω₄′ x₁′= ω₁′,x₂′= ω₂′,x₃′= ω₃′,x₄′= ω₄′ x₁′= ω₁′,x₂′= ω₂′,x₃′= ω₃′,x₄′= ω₄′ in virtue of the Lorentz-transformation (10), (11), (12). The dashed system has got the same meaning for the
velocityu′after the transformation as the first system
of values has got forubefore transformation. If in particular the vectorvof the special Lorentz-transformation
be equal to the velocity vectoruof matter at
the space-time point (x₁,x₂,x₃,x₄) then it follows out of
(10), (11), (12) that ω₁′ = 0, ω₂′ = 0, ω₃′ = 0, ω₄′ =i ω₁′ = 0, ω₂′ = 0, ω₃′ = 0, ω₄′ =i ω₁′ = 0, ω₂′ = 0, ω₃′ = 0, ω₄′ =i ω₁′ = 0, ω₂′ = 0, ω₃′ = 0, ω₄′ =i Under these circumstances therefore, the corresponding
space-time point has the velocityv′= 0 after the transformation,
it is as if we transform to rest. We may call
the invariant ρ√(1 -u²) the rest-density of Electricity.[16] If we take the principal result of the Lorentz transformation
together with the fact that the system (A) as well
as the system (B) is covariant with respect to a rotation
of the coordinate-system round the null point, we obtain
the generalrelativity theorem. In order to make the
facts easily comprehensible, it may be more convenient to
define a series of expressions, for the purpose of expressing
the ideas in a concise form, while on the other hand
I shall adhere to the practice of using complex magnitudes,
in order to render certain symmetries quite evident. Let us take a linear homogeneous transformation, Formula. the Determinant of the matrix is +1, all co-efficients without
the index 4 occurring once are real, whilea₄₁,a₄₂,a₄₃, are purely imaginary, buta₄₄is real and > 0, andx₁²+x₂²+x₃²+x₄²transforms intox₁′²+x₂′²+x₃′²+x₄′². The operation shall be called a general Lorentz
transformation. (This notation, which is due to Dr. C. E. Cullis of the Calcutta
University, has been used throughout instead of Minkowski’s notation,x₁=a₁₁x₁′+a₁₂x₂′+a₁₃x₃′+a₁₄x₄′.) If we putx₁′=x′,x₂′=y′,x₃′=z′,x₄′=it′, then
immediately there occurs a homogeneous linear transformation
of (x,y,z,t) to (x′,y′,z′,t′) with essentially real
co-efficients, whereby the aggregate -x²-y²-z²+t²transforms into -x′²-y′²-z′²+t′², and to every such
system of valuesx,y,z,twith a positivet, for which
this aggregate > 0, there always corresponds a positivet’;
this last is quite evident from the continuity of the
aggregatex,y,z,t. The last vertical column of co-efficients has to fulfil
the condition 22)a₁₄²+a₂₄²+a₃₄²+a₄₄²= 1. Ifa₁₄=a₂₄=a₃₄= 0, thena₄₄= 1, and the Lorentz
transformation reduces to a simple rotation of the spatial
co-ordinate system round the world-point. Ifa₁₄,a₂₄,a₃₄are not all zero, and if we puta₁₄:a₂₄:a₃₄:a₄₄=vx:vy:vz:i q= √(vx² +vy² +vz²) < 1. q= √(vx² +vy² +vz²) < 1. q= √(vx² +vy² +vz²) < 1. q= √(vx² +vy² +vz²) < 1. On the other hand, with every set of values ofa₁₄,a₂₄,a₃₄,a₄₄which in this way fulfil the condition
22) with real values ofvx,vy,vz, we can construct the
special Lorentz transformation (16) with (a₁₄,a₂₄,a₃₄,a₄₄)
as the last vertical column,—and then every Lorentz-transformation
with the same last vertical column
(a₁₄,a₂₄,a₃₄,a₄₄) can be supposed to be composed of
the special Lorentz-transformation, and a rotation of the
spatial co-ordinate system round the null-point. The totality of all Lorentz-Transformations forms a
group. Under a space-time vector of the 1st kind shall
be understood a system of four magnitudes (ρ₁, ρ₂, ρ₃, ρ₄)
with the condition that in case of a Lorentz-transformation
it is to be replaced by the set (ρ₁′, ρ₂′, ρ₃′, ρ₄′), where
these are the values of (x₁′,x₂′,x₃′,x₄′), obtained by
substituting (ρ₁, ρ₂, ρ₃, ρ₄) for (x₁,x₂,x₃,x₄) in the
expression (21). Besides the time-space vector of the 1st kind (x₁,x₂,x₃,x₄) we shall also make use of another space-time vector
of the first kind (y₁,y₂,y₃,y₄), and let us form the linear
combination (23)f₂₃(x₂y₃-x₃y₂) +f₃₁(x₃y₁-x₁y₃) +f₁₂(x₁y₂-x₂y₁) +f₁₄(x₁y₄-x₄y₁) +f₂₄(x₂y₄-x₄y₂) +f₃₄(x₃y₄-x₄y₃) (23)f₂₃(x₂y₃-x₃y₂) +f₃₁(x₃y₁-x₁y₃) +f₁₂(x₁y₂-x₂y₁) +f₁₄(x₁y₄-x₄y₁) +f₂₄(x₂y₄-x₄y₂) +f₃₄(x₃y₄-x₄y₃) (23)f₂₃(x₂y₃-x₃y₂) +f₃₁(x₃y₁-x₁y₃) +f₁₂(x₁y₂-x₂y₁) +f₁₄(x₁y₄-x₄y₁) +f₂₄(x₂y₄-x₄y₂) +f₃₄(x₃y₄-x₄y₃) (23)f₂₃(x₂y₃-x₃y₂) +f₃₁(x₃y₁-x₁y₃) +f₁₂(x₁y₂ -x₂y₁) +f₁₄(x₁y₄-x₄y₁) +f₂₄(x₂y₄-x₄y₂) + f₃₄(x₃y₄-x₄y₃) with six coefficientsf₂₃--f₃₄. Let us remark that in the
vectorial method of writing, this can be constructed out of
the four vectors. x₁,x₂,x₃;y₁,y₂,y₃;f₂₃,f₃₁,f₁₂;f₁₄,f₂₄,f₃₄and
the constantsx₄andy₄, at the same time it is symmetrical
with regard the indices (1, 2, 3, 4). If we subject (x₁,x₂,x₃,x₄) and (y₁,y₂,y₃,y₄) simultaneously
to the Lorentz transformation (21), the combination
(23) is changed to: (24)f₂₃′(x₂′y₃′-x₃′y₂′) +f₃₁(x₃′y₁′-x₁′y₃′) +f₁₂(x₁′y₂′-x₂′y₁′) +f₁₄′(x₁′y₄′) -x₄′y₁′) +f₂₄′(x₂′y₄′-x₄′y₂′) +f₃₄′(x₃′y₄′-x₄′y₃′), (24)f₂₃′(x₂′y₃′-x₃′y₂′) +f₃₁(x₃′y₁′-x₁′y₃′) +f₁₂(x₁′y₂′-x₂′y₁′) +f₁₄′(x₁′y₄′) -x₄′y₁′) +f₂₄′(x₂′y₄′-x₄′y₂′) +f₃₄′(x₃′y₄′-x₄′y₃′), (24)f₂₃′(x₂′y₃′-x₃′y₂′) +f₃₁(x₃′y₁′-x₁′y₃′) +f₁₂(x₁′y₂′-x₂′y₁′) +f₁₄′(x₁′y₄′) -x₄′y₁′) +f₂₄′(x₂′y₄′-x₄′y₂′) +f₃₄′(x₃′y₄′-x₄′y₃′), (24)f₂₃′(x₂′y₃′-x₃′y₂′) +f₃₁(x₃′y₁′-x₁′y₃′) +f₁₂ (x₁′y₂′-x₂′y₁′) +f₁₄′(x₁′y₄′) -x₄′y₁′) +f₂₄′(x₂′y₄′ -x₄′y₂′) +f₃₄′(x₃′y₄′-x₄′y₃′), where the coefficientsf₂₃′,f₃₁′,f₁₂′,f₁₄′,f₂₄′,f₃₄′, depend
solely on (f₂₃f₂₄) and the coefficientsa₁₁...a₄₄. We shall define a space-time Vector of the 2nd kind
as a system of six-magnitudesf₂₃,f₃₁...f₃₄, with the
condition that when subjected to a Lorentz transformation,
it is changed to a new systemf₂₃′... f₃₄, ... which satisfies
the connection between (23) and (24). I enunciate in the following manner the general
theorem of relativity corresponding to the equations (I)-(iv),—which
are the fundamental equations for Äther. Ifx,y,z,it(space co-ordinates, and timeit) is subjected
to a Lorentz transformation, and at the same time
(pux,puy,puz,iρ) (convection-current, and charge density
ρi) is transformed as a space time vector of the 1st kind,
further (mx,my,mz, -iex, -iey, -iez) (magnetic force,
and electric induction × (-i) is transformed as a space
time vector of the 2nd kind, then the system of equations
(I), (II), and the system of equations (III), (IV) transforms
into essentially corresponding relations between the
corresponding magnitudes newly introduced into the
system. These facts can be more concisely expressed in these
words: the system of equations (I and II) as well as the
system of equations (III) (IV) are covariant in all cases
of Lorentz-transformation, where (ρu,iρ) is to be transformed
as a space time vector of the 1st kind, (m-ie) is
to be treated as a vector of the 2nd kind, or more
significantly,— (ρu,iρ) is a space time vector of the 1st kind, (m-ie)[17]is a space-time vector of the 2nd kind. I shall add a few more remarks here in order to elucidate
the conception of space-time vector of the 2nd kind.
Clearly, the following are invariants for such a vector when
subjected to a group of Lorentz transformation. (i)m²-e²=f₂₃²+f₃₁²+f₁₂²+f₁₄²+f₂₄²+f₂₄²me=i(f₂₃f₁₄+f₃₁f₂₄+f₁₂f₃₄). (i)m²-e²=f₂₃²+f₃₁²+f₁₂²+f₁₄²+f₂₄²+f₂₄²me=i(f₂₃f₁₄+f₃₁f₂₄+f₁₂f₃₄). (i)m²-e²=f₂₃²+f₃₁²+f₁₂²+f₁₄²+f₂₄²+f₂₄² (i)m²-e²=f₂₃²+f₃₁²+f₁₂²+f₁₄²+f₂₄²+f₂₄² me=i(f₂₃f₁₄+f₃₁f₂₄+f₁₂f₃₄). me=i(f₂₃f₁₄+f₃₁f₂₄+f₁₂f₃₄). A space-time vector of the second kind (m-ie), where
(mande) are real magnitudes, may be called singular,
when the scalar square (m-ie)² = 0,iem²-e²= 0, and at
the same time (m e) = 0,iethe vectormandeare equal and
perpendicular to each other; when such is the case, these
two properties remain conserved for the space-time vector
of the 2nd kind in every Lorentz-transformation. If the space-time vector of the 2nd kind is not
singular, we rotate the spacial co-ordinate system in such
a manner that the vector-product [me] coincides with
the Z-axis,i.e.mx= 0,ex= 0. Then (mx, -i ex)² + (my, -i ey)² ≠ 0. (mx, -i ex)² + (my, -i ey)² ≠ 0. (mx, -i ex)² + (my, -i ey)² ≠ 0. (mx, -i ex)² + (my, -i ey)² ≠ 0. Therefore (ey+i my)/(ex+i ex) is different from +i,
and we can therefore define a complex argument (φ +iψ)
in such a manner that tan (φ +iψ)ey+i my= ---------------ex+i mx tan (φ +iψ)ey+i my= ---------------ex+i mx tan (φ +iψ) tan (φ +iψ) ey+i my= ---------------ex+i mx ey+i my = --------------- ex+i mx If then, by referring back to equations (9), we carry out
the transformation (1) through the angle ψ and a subsequent
rotation round the Z-axis through the angle φ, we perform a
Lorentz-transformation at the end of whichmy= 0,ey= 0,
and thereforemandeshall both coincide with the new
Z-axis. Then by means of the invariantsm²-e², (me)
the final values of these vectors, whether they are of the
same or of opposite directions, or whether one of them is
equal to zero, would be at once settled. By the Lorentz transformation, we are allowed to effect
certainchangesof the time parameter. In consequence
of this fact, it is no longer permissible to speak of the
absolute simultaneity of two events. The ordinary idea
of simultaneity rather presupposes that six independent
parameters, which are evidently required for defining a
system of space and time axes, are somehow reduced to
three. Since we are accustomed to consider that these
limitations represent in a unique way the actual facts
very approximately, we maintain that the simultaneity of
two events exists of themselves.[18]In fact, the following
considerations will prove conclusive. Let a reference system (x,y,z,t) for space time points
(events) be somehow known. Now if a space point A
(x₀,y₀,z₀) the timet₀be compared with a space
point P (x,y,z) at the timet, and if the difference of
timet-t₀, (lett>t₀) be less than the length A Pi.e.less
than the time required for the propagation of light from
A to P, and ifq= (t-t₀)/(A P) < 1, then by a special Lorentz
transformation, in which A P is taken as the axis, and which
has the momentq, we can introduce a time parametert′, which
(see equation 11, 12, § 4) has got the same valuet′=0for
both space-time points (A,t₀), and (P, t). So the two
events can now be comprehended to be simultaneous. Further, let us take at the same timet₀= 0, two
different space-points A, B, or three space-points (A, B, C)
which are not in the same space-line, and compare
therewith a space point P, which is outside the line A B,
or the plane A B C, at another timet, and let the time
differencet-t₀(t >t₀) be less than the time which light
requires for propagation from the line A B, or the plane
(A B C) to P. Let q be the quotient of (t-t₀) by the
second time. Then if a Lorentz transformation is taken
in which the perpendicular from P on A B, or from P on
the plane A B C is the axis, and q is the moment, then
all the three (or four) events (A,t₀), (B,t₀), (C,t₀) and
(P, t) are simultaneous. If four space-points, which do not lie in one plane, are
conceived to be at the same timet₀, then it is no longer permissible
to make a change of the time parameter by a Lorentz-transformation,
without at the same time destroying the
character of the simultaneity of these four space points. To the mathematician, accustomed on the one hand to
the methods of treatment of the poly-dimensional
manifold, and on the other hand to the conceptual figures
of the so-called non-Euclidean Geometry, there can be no
difficulty in adopting this concept of time to the application
of the Lorentz-transformation. The paper of Einstein which
has been cited in the Introduction, has succeeded to some
extent in presenting the nature of the transformation
from the physical standpoint. After these preparatory works, which have been first
developed on account of the small amount of mathematics
involved in the limiting case ε = 1, μ = 1, σ = 0, let
us turn to the electro-magnetic phenomena in matter.
We look for those relations which make it possible for
us—when proper fundamental data are given—to
obtain the following quantities at every place and time,
and therefore at every space-time point as functions of
(x,y,z,t):—the vector of the electric force E, the
magnetic induction M, the electrical inductione, the
magnetic forcem, the electrical space-density ρ, the
electric current s (whose relation hereafter to the conduction
current is known by the manner in which conductivity
occurs in the process), and lastly the vectorv, the
velocity of matter. The relations in question can be divided into two
classes. Firstly—those equations, which,—whenv, the velocity
of matter is given as a function of (x,y,z,t),—lead us to
a knowledge of other magnitude as functions ofx,y,z,t—I
shall call this first class of equations the fundamental
equations— Secondly, the expressions for the ponderomotive force,
which, by the application of the Laws of Mechanics, gives
us further information about the vectoruas functions of
(x,y,z,t). For the case of bodies at rest,i.e.whenu(x,y,z,t)
= 0 the theories of Maxwell (Heaviside, Hertz) and
Lorentz lead to the same fundamental equations. They
are;— (1) The Differential Equations:—which contain no
constant referring to matter:—
Back to IndexNext