In connection to Sylvester's resultant, both Sylvester and Cayley quote
His method for computing resultants is preliminarly described by Bézout
a11X2X3X4 - a12X1X3X4 + a13X1X2X4 - a14X1X2X3 | i = 1 | |
(a11a22 - a21a12)X3X4 - (a11a23 - a21a13)X2X4 | ||
+ | (a11a24 - a21a14)X2X3 + (a12a23 - a22a13)X1X4 | i = 2 |
- | (a12a24 - a22a14)X1X3 + (a13a24 - a23a14)X1X2 | |
- | ||
+ | i = 3 | |
- |
the unreal product#math504# #tex2html_wrap_inline12181#Xj at the very outset must have been a sore puzzle to students. [...]
To throw light upon the process, let us compare the above solution of a set of three linear equations with the following solution, which from one point of view may be looked upon as an improvement on the ordinary determinantal modes of solution as presented to modern readers.
[...] The numerators of the values of#math505#X1, X2, X3 and the common denominator are [...] the coefficients of #math506#X1, X2, X3, X4 in the determinant
More precisely, Muir explains, if we denote
(a11a22 - a21a12) |
|||
+ | (a11a24 - a21a14)) |
||
- | (a12a24 - a22a14) |
+ |
The same method, <#2651#>id est<#2651#> an expansion of proper determinants expressed with a similar notation and process, is then applied by Bézout to <#2652#>resolve<#2652#> different systems of polynomial equations, including
The Sylvester resultant was, at least implicitly, introduced by Euler
The equation system can be expressed as
It is clear that the procedure proposed by Euler is equivalent to the computation of the Sylvester resultant, so that
In order to present the English School view of Bézout's abridged method, I refer to Salmon's <#2780#>Higher Algebra<#2780#>
Given two polynomials of the same degree
is, therefore, as it ought to be, of the nth degree in the coefficients of [U], and of the mth in those of [V].
The first which studied the <#2822#>bezoutic matrix<#2822#>
Sylvester
[Conceive] a number of cubic blocks each of which has two numbers, termed its <#2895#>characteristics<#2895#>, inscribed upon one of its faces, upon which the values of such a block (itself called an <#2896#>element<#2896#>) depends.
For instance, the value of the <#2897#>element<#2897#>, whose <#2898#>characteristics<#2898#> are r, s, is the difference between two products: the one of the coefficient rth in order occurring in the polynomial U, by that which comes sth in order of the polynomial V; the other product is that of the coefficient sth in order of the polynomials U, by that rth in order of V; so that if the degree of each equation be n, there will be altogether#math549# #tex2html_wrap_inline12436#n(n + 1) such elements.
The blocks are formed into squares or flats (<#2900#>plafonds<#2900#>) of which the number is#math550# #tex2html_wrap_inline12438# or #math551# #tex2html_wrap_inline12440#, according as n is even or odd. The first of these contains n blanks 103 in a side, the next (n - 2), the next (n - 4), till finally we reach a square of four blocks or of one, according as n is even or odd. These flats are laid upon one another so as to form a regularly ascending pyramid, of which the two diagonal planes are termed the planes of separation and symmetry respectively. The former divides the pyramid into two halves, such that no element on the one side of it is the same as that of any block in the other. The plan of symmetry, as the name denotes, divides the pyramid into two exactly <#2903#>similar<#2903#> parts; it being a rule, that <#2904#>all elements lying in any given line of a square (platfond) parallel to the plane of separation are identical<#2904#>; moreover the sum of the characteristics is the same for <#2905#>all<#2905#> elements lying <#2906#>anywhere<#2906#> in a <#2907#>plane<#2907#> parallel to that of separation.
The formula behind this rule is
If m ;SPMgt; n the same formula is applied simply by expressing V as
In 1857 Cayley
<#4882#>la forme la plus simple sous laquelle on peut présenter cette méthode.
Pour éliminer [x]entre deux équations du #math555#n<#1#>ième<#1#> degré
on n'a qu'a former l'équation identique <#4881#>
= | (yn-1, yn-2,…, 1) |
<#4881#> oú l'expression qui forme le second membre représente la fonction suivant
+ | |||
... | |||
+ |
le résultat de l'élimination sera
<#4882#>
Sylvester
The determinant formed by arranging in a square the n sets of coefficients of the n Bezoutians, and which I shall term the Bezoutian matrix, gives, as is well known, the Resultant (meaning thereby the Result in its simplest form of eliminating the variables out) of U and V.
Eliminating dialytically, first Xn-1 between the first and the second, then Xn-1 and Xn-2 between the first, second and the third, and so on, and finally, all the powers of X between the first, second, third,...,nth of these Bezoutians, and repeating the first of them, we obtain a derived set of n equations, the right-hand members of which I shall term the secondary Bezoutains to U and V.
The 'dialytical elimination' performed by Sylvester on the expressions
V0U - U0V | = | F1 | = : | B1 |
(α21V0 - α11V1)U - (α21U0 - α11U1)V | = | α21F1 - α11F2 | = : | B2 |
… | ||||
SρU - TρV | = : | Bρ | ||
… | ||||
Sm-1U - Tm-1V | = : | Bm-1. |
perfectly unrelated, and each the most general function that can be formed of the same degreeand in case m = n
n successive <#3152#>Secondary Bezoutians<#3152#><#3156#>id est<#3156#>to the system U, V [...] will (saving at least a numerical factor of a magnitude and algebraic sign to be determined, but which, when proper conventions are made, will be subsequently proved to be +1) represent the simplified [...] residue[s] to #math574# #tex2html_wrap_inline12587#.
Once obtained the Bezoutic square of two polynomials f, φ of the same degree
m, Sylvester remarks that
this square [...] is symmetrical about one of its diagonals, and corresponds therefore (as every symmetrical matrix must do) to a homogeneous quadratic function of m variables of which it expresses the determinant. This quadratic function, which plays a great part in [...] the theory of real roots, I term the Bezoutiant.
[...]
In Section V. Arts. 56.57, I show that the <#3161#>total<#3161#> number of effective intercalations between the roots of two functions of the same degree is given by the <#3162#>inertia<#3162#> of that quadratic formwhich we agreed to term the Bezoutiant to f and φ; and in the following article (58) the result is extended to embrace the case contemplated in M.Sturm's theorem; that is to say, I show, that on replacing the function of x by a homogeneous function of x and y, the Bezoutiant of the two functions, which are respectively the differential derivates of f with respect to x and with respect to y, will serve to determine by its form or <#3177#>inertia<#3177#> the total number of real roots and of <#3178#>equal<#3178#> roots in f (x) . The subject is pursued in the following Arts. 59,60. 107 [...] In Arts. 61, 62, 63, it is proved that the Bezoutiant is an invariative function of the functions from which it is derived; and in Art. 64 the important remark is added, that it is an invariant of that particular class to which I have given the name of Combinants, which have the property of remaining unaltereted, not only for linear transformations of the variables, but also for linear combinations of the functions containing the variables , possessing thus a character of double invariability. In Arts. 65, 66 I consider the relation of the Bezoutiant to the differential determinant, so called by Jacoby, but which for greater brevity I call the Jacobian. On proper substitutions being made in the Bezoutiant for the m variables which it contains [...], the Bezoutiant becomes identical with the Jacobian of f and Φ.
To illustrate the `proper substitution to be done' I give again the word to Sylvester
[The Bezoutiant]#math594#B(u1,…, um) being a covariant of the system f and φ [...] on making #math595#u1,…, um equal to [ #math596#xm-1, xm-2y,…, ym-1], B will become [...] what I am in the habit of calling the Jacobian (after the name of the late but ever-illustrious Jacobi), a term capable of application to any number of homogeneous functions of as many variables. In the case before us, where we have two functions of two variables, the Jacobian 109
[...] So in the case of a single function F of the degree m, the Bezoutoid, that is the Bezoutiant to#math598# #tex2html_wrap_inline12677#, #tex2html_wrap_inline12678#, on making the (m - 1) variables which it contains identical with #math599#xm-2, xm-3y,…, ym-2 respectively, becomes identical with the Jacobian to #math600# #tex2html_wrap_inline12682#, #tex2html_wrap_inline12683#, that is the Hessian of F, namely
As an example of this property of the Bezoutiant, suppose
f | = | ax3 + bx2y + cxy2 + dy3, | |
φ | = | αx3 + βx2y + γxy2 + δy3. |
The Bezoutiant matrix becomes
aβ - bα, | aγ - cα, | aδ - dα, |
aγ - cα, | bγ - cβ, | |
aδ - dα, | bγ - cβ, | cδ - dγ. |
The Bezoutiant accordingly will be the quadratic function
(aβ - bα)u12 + |
|||
+ | 2(aγ - cα)u1u2 +2(aδ - dα)u3u1 +2(bγ - cβ)u2u3, |
which on making
becomes
where L, M, N, P, Q respectively will be the sum of the terms lying in the successive bands drawn parallel to the sinister diagonal of the Bezoutiant matrix, that is
L | = | (aβ - bα), | |
M | = | 2(aγ - cα), | |
N | = | 3(aδ - dα) + (bγ - cβ), | |
P | = | 2(bγ - cβ), | |
Q | = | (cδ - dγ). |
The biquadratic function in x and y[...] will be found on computation to be identical in point of form with the Jacobian to f,φ, namely <#3241#>
<#3241#> this latter being in fact
and concludes commenting:
The remark is not without some interest, that in fact the Bezoutiant, which is capable (as has been shown already) of being mechanically constructed, gives the best and readiest means of calculating the Jacobian; for in summing the sinister bands transverse to the axis of symmmetry the only numerical operation to be performed is that of addition of positive integers, whereas the direct method involves the necessity of numerical subtractions as well as additions, inasmuch as the same terms will be repeated with different signs.and remarks, in a different example, that, unlike the computation via Bezoutiant, the direct evaluation requires to effectively employ also <#3245#>division<#3245#> in order to reduce the Jacobian to its simplest form, being divisible by