Supplement 2MATHEMATICS
In the course of our discussion of machines that think, we have had to refer without much explanation to a number of mathematical ideas. The purpose of this supplement is to explain a few of these ideas a little more carefully than seemed easy to do in the text and, at the end of the supplement, to put down briefly some additional notes for reference.
Suppose that we have to multiply 372 by 465. With the ordinary school method, we write 465 under the 372 and proceed about as follows: 5 times 2 is 10, put down the 0 and carry the 1; 5 times 7 is 35, 35 and 1 is 36, put down the 6 and carry the 3; 5 times 3 is 15, 15 and 3 is 18, put down the 8 and carry the 1; ... The method is based mainly on a well-learned subroutine of continually changing steps:
We can, however, simplify this subroutine for a machine by delaying the carrying. We collect in one place all the right-hand digits of partial products, collect in another place all the left-hand digits, and delay all addition until the end.
For example, let us multiply 372 by 465 with this method:
37570 is called theright-hand componentof the product. It is convenient to fill in with 0 the space at the end of 13541 and to call 135410 theleft-hand componentof the product.
This process is calledmultiplying by right- and left-hand components. It has the great advantage that no carrying is necessary to complete any line of the original multiplications. Some computing machines use this process. Built into the hardware of the machine is a multiplication table up to 9 × 9. The machine, therefore, can find automatically the right-hand digit and the left-hand digit of any partial product. In a computing machine that uses this process, all the left-hand digits are automatically added in one register, and all the right-hand digits are added in another register. The only carrying that is needed is the carrying as the right-hand digits are accumulated and as the left-hand digits are accumulated. At the end of the multiplication, one of the registers is automatically added into the other, giving the product.
Another device used in computing machines for multiplying is to change the multiplier into a set of digits 0 to 5 that are either positive or negative. For example, suppose that we want to multiply 897 by 182. We note that 182 equals 200 minus 20 plus 2, and so we can write it as
The minus over the 2 marks it as anegative digit2. Then to multiply we have:
The middle 1794 is subtracted. This process is usually calledshort-cut multiplication. Everybody discovers this trick when he decides that multiplying by 99 is too much work, that it is easier to multiply by 100 and subtract once.
We are well accustomed to decimal notation in which we use 10 decimal digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 and write them in combinations to designate decimal numbers. Inbinary notationwe use two binary digits 0, 1 and write them in combinations to designatebinary numbers. For example, the first 17 numbers, from 0 to 16 in the decimal notation, correspond with the following numbers in binary notation:
In decimal notation, 101 means one times a hundred, no tens, and one. In binary notation, 101 means one times four, no twos, and one. The successive digits in a decimal number from right to left count 1, 10, 100, 1000, 10000, ...—successivepowersof 10 (for this term, see the end of this supplement). The successive digits in a binary number from right to left count 1, 2, 4, 8, 16, ...—powers of 2.
The decimal notation is convenient when equipment for computing has ten positions, like the fingers of a man, or the positions of a counter wheel. The binary notation is convenient when equipment for computing has just two positions, like “yes” or “no,” or current flowing or no current flowing.
Addition, subtraction, multiplication, and division can all be carried out unusually simply in binary notation. The addition table is simple and consists only of four entries.
The multiplication table is also simple and contains only four entries.
Suppose that we add in binary notation 101 and 1001:
We proceed: 1 and 1 is 10; write down 0 and carry 1; 0 and 0 is 0, and 1 to carry is 1; and 1 and 0 is 1; and then we just copy the last 1. To check this we can convert to decimal and see that 101 is 5, 1001 is 9, and 1110 is 14, and we can verify that 5 and 9 is 14.
One of the easiest ways to subtract in binary notation is to add aones complement(that is, the analogue of the nines complement) and use end-around-carry (for these two terms, see the end of this supplement). A ones complement can be written down at sight by just putting 1 for 0 and 0 for 1. For example, suppose that we subtract 101 from 1110:
Multiplication in the binary notation is simple. It amounts to (1) adding if the multiplier digit is 1 and not adding if the multiplier digit is 0, and (2) moving over or shifting. For example, let us multiply 111 by 101:
The digit 1 in the 6th (ornth)binaryplace from the right in 100011 stands for 1 times 2 to the 5th (orn-1 th) power, 2 × 2 × 2 × 2 × 2 = 32. The result 100011 is translated into 32 plus 2 plus 1, which equals 35 and verifies.
Division in the binary notation is also simple. It amounts to (1) subtracting (yielding a quotient digit 1) or not subtracting (yielding a quotient digit 0), and (2) shifting. We never need to try multiples of the divisor to find the largest that can be subtracted yet leave a positive remainder. For example, let us divide 1010 (10 in decimal) into 10001110 (142 in decimal):
In decimal notation, digits to the right of the decimal point count powers of ⅒. In binary notation, digits to the right of thebinary pointcount powers of½: ½, ¼, ⅛, ¹/₁₆.... For example, 0.1011 equals½ + ⅛ + ¹/₁₆, or ¹¹/₁₆.
If we were accustomed to using binary numbers, all our arithmetic would be very simple. Furthermore, binary numbers are in many ways much better for calculating machinery than any other numbers. The main problem is converting numbers from decimal notation to binary. One method depends on storing the powers of 2 in decimal notation. The rule is: subtract successively smaller powers of 2; start with the largest that can be subtracted, and count 1 for each power that goes and 0 for each power that does not. For example, 86 in decimal becomes 1010110 in binary:
It is a little troublesome to remember long series of 1’s and 0’s; in fact, to write any number in binary notation takes about 3⅓ times as much space as decimal notation. For this reason we can separate binary numbers into triples beginning at the right and label each triple as follows:
For example, 1010110 would become 1 010 110 or 126. This notation is often calledoctal notation, because it is notation in the scale of eight.
Another kind of notation for numbers isbiquinary notation, so called because it uses both 2’s and 5’s. Essentially this notation is very like Roman numerals, ancient style. By ancient style we mean, for example, VIIII instead of IX. In the following table we show the first two dozen numbers in decimal, biquinary, and ancient Roman notation:
The biquinary columns alternate in going from 0 to 4 and from 0 to 1. The digits from 0 to 4 are not changed. The digits from 5 to 9 are changed into 10 to 14. We see that thebiquinary digitsare 0 to 4 in odd columns and 0, 1 in even columns, counting from the right.
This is the notation actually expressed by theabacus. The beads of the abacus show by their positions groups of 2 and 5 (see Fig. 1).
Fig. 1.Abacus and notations.
Fig. 1.Abacus and notations.
One of the operations of algebra that is important for a mechanical brain isapproximation, the problem of getting close to the right value of a number. Take, for example, findingsquare root(see the end of this supplement). The ordinary process taught in school is rather troublesome. We can set down another process, however, using a desk calculator to do division, which gives us square root with great speed.
Suppose that we want to find the square root of a numberN, and suppose that we havex₀ as a guessed square root correct to one figure. For example,Nmight be 67.2 andx₀ might be 8, chosen because 8 × 8 is 64, and 9 × 9 is 81, and it seems as if 8 should be near the square root of 67.2. Here is the process:
Now repeat:
Every time this process is repeated, the newxcomes a great deal closer to the correct square root. In fact it can be shown that, ifx₀ is correct to one figure, then:
Let us see how this actually works out with 67.2 and a 10-column desk calculator.
Round 1: 8 divided into 67.2 gives 8.4. One half of 8 plus 8.4 is 8.2. This isx₁.Round 2: 8.2 divided into 67.2 gives 8.195122. One half of 8.2 plus 8.195122 is 8.197561. This isx₂.Round 3: 8.197561 divided into 67.2 gives 8.197560225. One half of 8.197561 and 8.197560225 is 8.1975606125. This isx₃.Checkingx₃, we find that 8.1975606125 divided into 67.2 gives 8.1975606126 approximately.
Round 1: 8 divided into 67.2 gives 8.4. One half of 8 plus 8.4 is 8.2. This isx₁.
Round 2: 8.2 divided into 67.2 gives 8.195122. One half of 8.2 plus 8.195122 is 8.197561. This isx₂.
Round 3: 8.197561 divided into 67.2 gives 8.197560225. One half of 8.197561 and 8.197560225 is 8.1975606125. This isx₃.
Checkingx₃, we find that 8.1975606125 divided into 67.2 gives 8.1975606126 approximately.
In this case, then,x₃ is correct to more than 10 figures. In other words, with a reasonable guess and two or three divisions we can obtain all the accuracy we can ordinarily use. This process is calledrapid approximation, orrapidly convergent approximation, since itconverges(points or comes together) very quickly to the number we are seeking.
Another important operation of algebra isinterpolation, the problem of putting values smoothly in between other values. For example, suppose that we have the table:
Suppose that we want to find the value thaty(oryₓ) ought to have whenxhas the value of 7.2. This is the problem ofinterpolating yso as to findyat the value of 7.2,y₇ˌ₂.
One way of doing this is to discover the formula that expressesyand then to putxinto that formula. This is not always easy. Another way is to take the difference betweeny₇ andy₈, 15, and share the difference appropriately over the distance 7 to 7.2 and 7.2 to 8. We can, for example, take ²/₁₀ of 15 = 3, add that toy₇ = 50, and obtain an estimatedy₇ˌ₂ = 53. This is calledlinear interpolation, since the difference 0.2 in the value ofxis used only to the first power. It is a good practical way to carry out most interpolation quickly and approximately.
Actually herey=x² + 1, and so the true value ofy₇ˌ₂ is (7.2 × 7.2) + 1, or 52.84, which is rather close to 53. Types of interpolation procedures more accurate than linear interpolation will come much nearer still to the true value.
We turn now to thealgebra of logic. The first half ofChapter 9, “Reasoning” (through the section “Logical-Truth Calculation byAlgebra”), introduces this subject. There the termstruth values,truth tables,logical connectives, andalgebra of logicare explained. The part ofChapter 3, “A Machine That Will Think,” that discusses the operationsgreater-thanandselection, also explains some of the algebra of logic. It introduces, for example, the formula
p=T(a>b) = 1, 0
This is a way of saying briefly that the truth value of the statement “ais greater thanb” equalsp;pis 1 if the statement is true and 0 if the statement is false. The truth value 1 corresponds with “yes.” The truth value 0 corresponds with “no.”
With mechanical brains we are especially interested in handling mathematics and logic without any sharp dividing line between them. For example, suppose that we have a register in which a ten-digit number like 1,765,438,890 may be stored. We should be able to use that register to store a number consisting of only 1’s and 0’s, like 1,100,100,010. Such a number may designate the answers to 10 successive questions: yes, yes, no, no, yes, no, no, no, yes, no. Or it may tell 10 successive binary digits. The register then is three times as useful: it can store either decimal numbers or truth values or binary digits. We need, of course, a way to obtain from the register any desired digit. For this purpose we may have two instructions to the machine: (1) read the left-hand end digit; (2) shift the number around in a circle. The second instruction is the same as multiplying by 10 and then putting the left-most digit at the right-hand end. For example, suppose that we want the 3rd digit from the left in 1,100,100,010. The result of the first circular shift is 1,001,000,101; the result of the second circular shift is 0,010,001,011; and reading the left-most digit gives 0. A process like this has been calledextractionand is being built into the newest mechanical brains.
Using truth values, we can put down very neatly some truths of ordinary algebra. For example:
|a| =a·T(a≥ 0) -a·T(a< 0)
For another example:
T(a>b) +T(a=b) +T(a
Many common logical operations, like selecting and comparing, and the behavior of many simple mechanisms, like a light or a lock, can be expressed by truth values.Chapter 4, on punch-card mechanisms, contains a number of examples.
In ordinary language, apronoun, like “he,” “she,” “it,” “the former,” “the latter,” is a word that usually stands for a noun previously referred to. A pronoun usually stands for the last preceding noun that the grammar allows. In mathematics, avariable, like “a,” “b,” “x,” “m₁,” “m₂” closely resembles a pronoun in ordinary language. A variable is a symbol that usually stands for a number previously referred to, and usually it stands for the same number throughout a particular discussion.
Augendandaddendare names of registers in the Harvard Mark II calculator (see Chapter 10).
Two digits that add to 9 (0 and 9, 1 and 8, 2 and 7, 3 and 6, 4 and 5) are callednines complementsof each other. Thenines complementof a numberais the numberbin which each digit ofbis the nines complement of the corresponding digit ofa; for example, the nines complement of 173 is 826. Ordinary subtraction is the same as addition as of the nines complement, with a simple correction; for example, 562 less 173 (equal to 389) is the same as 562 plus 826 (equal to 1388) less 1000 plus 1.
The correction “less 1000 plus 1” of the foregoing example may be thought of as carrying the 1 (in the result 1388) around from the left-hand end to the right-hand end, where it is there added. So the 1 is calledend-around-carry.
Two digits that add to 10 are calledtens complementsof each other. Thetens complementof a numbera, however, is equal to the nines complement of the number plus 1. For example, the tens complement of 173 is 827. When subtracting by adding a tens complement, the left-most digit 1 in the result is dropped. For example, 562 less 173 (equal to 389) is the same as 562 plus 827 (equal to 1389) less 1000.
Apowerof any numberaisamultiplied by itself some number of times.a×a×a... ×awhereaappearsbtimes is writtenaᵇ and is readato thebth power.a², a to the 2nd power, isa×aand is calleda squaredor thesquareofa.a³,ato the 3rd power,a×a×a, is calleda cubed, or thecubeofa.a⁰,ato the zero power, is equal to 1 for everya.a¹,ato the power 1, isaitself. The first power is often calledlinear.ato some negative power is the same as 1 divided by that power; that is,a⁻ᵇ = 1/aᵇ.a⁻¹,ato the power minus 1, is 1/a, and is called thereciprocalofa.a¹ᐟ²,ato the one-half power, is a numbercsuch thatc×c=a, and is called thesquare rootofaand often denoted by √a.
An example of atableis:
The numbers in the body of the table, calledtabular values, depend on or are determined by the numbers along the edge of the table, calledarguments. In this example, if 1, 2, 3 are choices of a numbern, and if 0.025, 0.03 are choices of a numberi, then each tabular valueyis equal to 1 plusiraised to thenth power.nandiare also calledindependent variables, andyis called thedependent variable. The table expresses afunctionorformulaorrule. The rule could be stated as: addito 1; raise the result to thenth power.
A number is said to be aconstantif it has the same value under all conditions. For example, in the formula:
(area of a circle) = π × (radius)²,π is a constant, equal to 3.14159 ...,
applying equally well to all circles.
Mathematics recognizes several kinds of infinity. One of them occurs when numbers become very large. For example, the quotient of 12 divided by a numberx, asxbecomes closer and closer to 0, becomes indefinitely large, and the limit is calledinfinityand is denoted∞.
An example of two linear simultaneousequationsis:
7x+ 8t= 22
3x+ 5t= 11
xandtare calledunknowns—that is, unknown variables—because the objective of solving the equations is to find them. These equations are calledsimultaneousbecause they are to be solved together, at the same time, for values ofxandtwhich will fit in both equations. The equations are calledlinearbecause the only powers of the unknowns that appear are the first power. Values that solve equations are said tosatisfythem. It is easy to solve these two equations and find thatx= 2 andt= 1 is their solution. But it is a long process to solve 10 linear simultaneous equations in 10 unknowns, and it is almost impossible (without using a mechanical brain) to solve 100 linear simultaneous equations in 100 unknowns.
See the sections inChapter 5entitled “Differential Equations,” “Physical Problems,” and “Solving Physical Problems.” There these ideas and, to some extent, also the following ideas were explained: formula, equation, function, differential function, instantaneous rate of change, interval, inverse, integrating. See also a textbook on calculus. Ifyis a function ofx, then a mathematical symbol for the derivative ofywith respect toxisDₓ y, and a symbol for the integral ofywith respect tox, is ∫y dx. An integral with given initial conditions (see p. 83) is adefinite integral.
A famous mathematical function is theexponential. It equals the constanteraised to thexpower,eˣ, whereeequals 2.71828.... The exponential lies between the powers of 2 and the powers of 3. It can be computed from:
It is a solution of the differential equationDₓy=y. See also a textbook on calculus. Theexponential to the base 10is10ˣ.
Another important mathematical function is thelogarithm. It is writtenlogxorlogₑxand can be computed from the two equations:
loguv= logu+ logv
It is a solution of the differential equationDₓy= 1/y. Ifyis the logarithm ofx, thenxis theantilogarithmofy. Thelogarithm to the base 10ofx, log₁₀x, equals thelogarithm to the base eofx, logₑx, divided by logₑ 10. See also textbooks on algebra and calculus.
These also are important mathematical functions. Thesineandcosineare solutions of the differential equationDₓ(Dₓy) =-yand are written as sinxand cosx. They can be computed from
Thetangentofxis simply sine ofxdivided by cosine ofx. Ifyis the tangent ofx, thenxis theantitangentofy. See also references on trigonometry and on calculus.Trigonometrictables include sine, cosine, tangent, and related functions.
These are mathematical functions that were named after Friedrich W. Bessel, a Prussian astronomer who lived from 1784 to 1846. Bessel functions are found as some of the solutions of the differential equation
x²Dₓ(Dₓy) + xDₓy+ (x² -n²)y= O
This equation arises in a number of physical problems in the fields of electricity, sound, heat flow, air flow, etc.
Amatrixis a table (orarray) of numbers in rows and columns, for which addition, multiplication, etc., with similar tables is specially defined. For example, the matrix
plus the matrix
equals the matrix
(Can you guess the rule defining addition?)
Calculations using matrices are useful in physics, engineering, psychology, statistics, etc. To add asquare matrixof 100 terms in an array of 10 columns and 10 rows to another such matrix, 100 ordinary additions of numbers are needed. To multiply one such matrix by another, 1000 ordinary multiplications and 900 ordinary additions are needed. See references on matrix algebra and matrix calculus.
Onp. 221, a sequence of values ofyis shown: 26, 37, 50, 65, 82. Suppose, however, the second value ofywas reported as 47 instead of 37. Then thedifferencesofyas we pass down the sequence would not be 11, 13, 15, 17 (which is certainly regular orsmooth) but 21, 3, 15, 17 (which is certainly not smooth). The second set of differences would strongly suggest a mistake in the reporting ofy. Thesmoothnessof differences is often a useful check on a sequence of reported values.