Hurwitz's theorem (composition algebras)

In mathematics, Hurwitz's theorem is a theorem of Adolf Hurwitz (1859–1919), published posthumously in 1923, solving the Hurwitz problem for finite-dimensional unital real non-associative algebras endowed with a nondegenerate positive-definite quadratic form. The theorem states that if the quadratic form defines a homomorphism into the positive real numbers on the non-zero part of the algebra, then the algebra must be isomorphic to the real numbers, the complex numbers, the quaternions, or the octonions, and that there are no other possibilities. Such algebras, sometimes called Hurwitz algebras, are examples of composition algebras.

The theory of composition algebras has subsequently been generalized to arbitrary quadratic forms and arbitrary fields.^[1] Hurwitz's theorem implies that multiplicative formulas for sums of squares can only occur in 1, 2, 4 and 8 dimensions, a result originally proved by Hurwitz in 1898. It is a special case of the Hurwitz problem, solved also in Radon (1922). Subsequent proofs of the restrictions on the dimension have been given by Eckmann (1943) using the representation theory of finite groups and by Lee (1948) and Chevalley (1954) using Clifford algebras. Hurwitz's theorem has been applied in algebraic topology to problems on vector fields on spheres and the homotopy groups of the classical groups^[2] and in quantum mechanics to the classification of simple Jordan algebras.^[3]

Euclidean Hurwitz algebras

Definition

A Hurwitz algebra or composition algebra is a finite-dimensional not necessarily associative algebra $A$ with identity endowed with a nondegenerate quadratic form $q$ such that $q (a b) = q (a) q (b)$ . If the underlying coefficient field is the reals and $q$ is positive-definite, so that $(a, b) = .mw-parser-output .sfrac{white-space:nowrap}.mw-parser-output .sfrac.tion,.mw-parser-output .sfrac .tion{display:inline-block;vertical-align:-0.5em;font-size:85%;text-align:center}.mw-parser-output .sfrac .num{display:block;line-height:1em;margin:0.0em 0.1em;border-bottom:1px solid}.mw-parser-output .sfrac .den{display:block;line-height:1em;margin:0.1em 0.1em}.mw-parser-output .sr-only{border:0;clip:rect(0,0,0,0);clip-path:polygon(0px 0px,0px 0px,0px 0px);height:1px;margin:-1px;overflow:hidden;padding:0;position:absolute;width:1px}⁠1/2⁠[q(a + b) − q(a) − q(b)]$ is an inner product, then $A$ is called a Euclidean Hurwitz algebra or (finite-dimensional) normed division algebra.^[4]

If $A$ is a Euclidean Hurwitz algebra and $a$ is in $A$ , define the involution and right and left multiplication operators by

a^{*}=-a+2(a,1)1,\quad L(a)b=ab,\quad R(a)b=ba.

Evidently the involution has period two and preserves the inner product and norm. These operators have the following properties:

the involution is an antiautomorphism, i.e. $(ab)* = b * a *$
$aa * = ‖ a ‖ 2 1 = a * a$
$L (a *) = L (a)*$ , $R (a *) = R (a)*$ , so that the involution on the algebra corresponds to taking adjoints
$Re (ab) = Re (ba)$ if $Re x = (x + x *)/2 = (x, 1)1$
$Re (ab) c = Re a (bc)$
$L (a 2) = L (a) 2$ , $R (a 2) = R (a) 2$ , so that $A$ is an alternative algebra.

These properties are proved starting from the polarized version of the identity $(ab, ab) = (a, a)(b, b)$ :

\displaystyle {2(a,b)(c,d)=(ac,bd)+(ad,bc).}

Setting $b = 1$ or $d = 1$ yields $L (a *) = L (a)*$ and $R (c *) = R (c)*$ .

Hence $Re(ab) = (ab, 1)1 = (a, b *)1 = (ba, 1)1 = Re(ba)$ .

Similarly $Re (ab) c = ((ab) c,1)1 = (ab, c *)1 = (b, a * c *)1 = (bc, a *)1 = (a (bc),1)1 = Re a (bc)$ .

Hence $((ab)*, c) = (ab, c *) = (b, a * c *) = (1, b *(a * c *)) = (1, (b * a *) c *) = (b * a *, c)$ , so that $(ab)* = b * a *$ .

By the polarized identity $‖ a ‖ 2 (c, d) = (ac, ad) = (a * (ac), d)$ so $L (a *) L (a) = L (‖ a ‖ 2)$ . Applied to 1 this gives $a * a = ‖ a ‖ 21$ . Replacing $a$ by $a *$ gives the other identity.

Substituting the formula for $a *$ in $L (a *) L (a) = L (a * a)$ gives $L (a) 2 = L (a 2)$ . The formula $R (a 2) = R (a) 2$ is proved analogously.

Classification

It is routine to check that the real numbers $R$ , the complex numbers $C$ and the quaternions $H$ are examples of associative Euclidean Hurwitz algebras with their standard norms and involutions. There are moreover natural inclusions $R \subset C \subset H$ .

Analysing such an inclusion leads to the Cayley–Dickson construction, formalized by A.A. Albert. Let $A$ be a Euclidean Hurwitz algebra and $B$ a proper unital subalgebra, so a Euclidean Hurwitz algebra in its own right. Pick a unit vector $j$ in $A$ orthogonal to $B$ . Since $(j, 1) = 0$ , it follows that $j * = - j$ and hence $j 2 = -1$ . Let $C$ be subalgebra generated by $B$ and $j$ . It is unital and is again a Euclidean Hurwitz algebra. It satisfies the following Cayley–Dickson multiplication laws:

\displaystyle {C=B\oplus Bj,\,\,\,(a+bj)^{*}=a^{*}-bj,\,\,\,(a+bj)(c+dj)=(ac-d^{*}b)+(bc^{*}+da)j.}

$B$ and $Bj$ are orthogonal, since $j$ is orthogonal to $B$ . If $a$ is in $B$ , then $j a = a * j$ , since by orthogonal $0 = 2(j, a *) = ja - a * j$ . The formula for the involution follows. To show that $B \oplus B j$ is closed under multiplication $Bj = jB$ . Since $Bj$ is orthogonal to 1, $(bj)* = - bj$ .

$b (cj) = (cb) j$ since $(b, j) = 0$ so that, for $x$ in $A$ , $(b (cj), x) = (b (jx), j (cj)) = -(b (jx), c *) = -(cb, (jx)*) = -((cb) j, x *) = ((cb) j, x)$ .
$(jc) b = j (bc)$ taking adjoints above.
$(bj)(cj) = - c * b$ since $(b, cj)$ = 0, so that, for $x$ in $A$ , $((bj)(cj), x) = -((cj) x *, bj) = (bx *, (cj) j) = -(c * b, x)$ .

Imposing the multiplicativity of the norm on $C$ for $a + bj$ and $c + dj$ gives:

\displaystyle {(\|a\|^{2}+\|b\|^{2})(\|c\|^{2}+\|d\|^{2})=\|ac-d^{*}b\|^{2}+\|bc^{*}+da\|^{2},}

which leads to

\displaystyle {(ac,d^{*}b)=(bc^{*},da).}

Hence $d (ac) = (da) c$ , so that $B$ must be associative.

This analysis applies to the inclusion of $R$ in $C$ and $C$ in $H$ . Taking $O = H \oplus H$ with the product and inner product above gives a noncommutative nonassociative algebra generated by $J = (0, 1)$ . This recovers the usual definition of the octonions or Cayley numbers. If $A$ is a Euclidean algebra, it must contain $R$ . If it is strictly larger than $R$ , the argument above shows that it contains $C$ . If it is larger than $C$ , it contains $H$ . If it is larger still, it must contain $O$ . But there the process must stop, because $O$ is not associative. In fact $H$ is not commutative and $a (bj) = (ba) j \neq (ab) j$ in $O$ .^[5]

Theorem. The only Euclidean Hurwitz algebras are the real numbers, the complex numbers, the quaternions and the octonions.

Other proofs

The proofs of Lee (1948) and Chevalley (1954) use Clifford algebras to show that the dimension $N$ of $A$ must be 1, 2, 4 or 8. In fact the operators $L (a)$ with $(a, 1) = 0$ satisfy $L (a) 2 = -‖ a ‖ 2$ and so form a real Clifford algebra. If $a$ is a unit vector, then $L (a)$ is skew-adjoint with square $- I$ . So $N$ must be either even or 1 (in which case $A$ contains no unit vectors orthogonal to 1). The real Clifford algebra and its complexification act on the complexification of $A$ , an $N$ -dimensional complex space. If $N$ is even, $N - 1$ is odd, so the Clifford algebra has exactly two complex irreducible representations of dimension $2 N /2 - 1$ . So this power of 2 must divide $N$ . It is easy to see that this implies $N$ can only be 1, 2, 4 or 8.

The proof of Eckmann (1943) uses the representation theory of finite groups, or the projective representation theory of elementary abelian 2-groups, known to be equivalent to the representation theory of real Clifford algebras. Indeed, taking an orthonormal basis $e i$ of the orthogonal complement of 1 gives rise to operators $U i = L (e i)$ satisfying

U_{i}^{2}=-I,\quad U_{i}U_{j}=-U_{j}U_{i}\,\,(i\neq j).

This is a projective representation of a direct product of $N - 1$ groups of order 2. ( $N$ is assumed to be greater than 1.) The operators $U i$ by construction are skew-symmetric and orthogonal. In fact Eckmann constructed operators of this type in a slightly different but equivalent way. It is in fact the method originally followed in Hurwitz (1923).^[6] Assume that there is a composition law for two forms

\displaystyle {(x_{1}^{2}+\cdots +x_{N}^{2})(y_{1}^{2}+\cdots +y_{N}^{2})=z_{1}^{2}+\cdots +z_{N}^{2},}

where $z i$ is bilinear in $x$ and $y$ . Thus

\displaystyle {z_{i}=\sum _{j=1}^{N}a_{ij}(x)y_{j}}

where the matrix $T (x) = (a ij)$ is linear in $x$ . The relations above are equivalent to

\displaystyle {T(x)T(x)^{t}=x_{1}^{2}+\cdots +x_{N}^{2}.}

Writing

\displaystyle {T(x)=T_{1}x_{1}+\cdots +T_{N}x_{N},}

the relations become

\displaystyle {T_{i}T_{j}^{t}+T_{j}T_{i}^{t}=2\delta _{ij}I.}

Now set $V i = (T N) t T i$ . Thus $V N = I$ and the $V 1, ... , V N - 1$ are skew-adjoint, orthogonal satisfying exactly the same relations as the $U i$ 's:

\displaystyle {V_{i}^{2}=-I,\quad V_{i}V_{j}=-V_{j}V_{i}\,\,(i\neq j).}

Since $V i$ is an orthogonal matrix with square $- I$ on a real vector space, $N$ is even.

Let $G$ be the finite group generated by elements $v i$ such that

\displaystyle {v_{i}^{2}=\varepsilon ,\quad v_{i}v_{j}=\varepsilon v_{j}v_{i}\,\,(i\neq j),}

where $ε$ is central of order 2. The commutator subgroup $[G, G]$ is just formed of 1 and $ε$ . If $N$ is odd this coincides with the center while if $N$ is even the center has order 4 with extra elements $γ = v 1 ... v N - 1$ and $ε γ$ . If $g$ in $G$ is not in the center its conjugacy class is exactly $g$ and $εg$ . Thus there are $2 N - 1 + 1$ conjugacy classes for $N$ odd and $2 N - 1 + 2$ for $N$ even. $G$ has $| G / [G, G] | = 2 N - 1$ 1-dimensional complex representations. The total number of irreducible complex representations is the number of conjugacy classes. So since $N$ is even, there are two further irreducible complex representations. Since the sum of the squares of the dimensions equals $| G |$ and the dimensions divide $| G |$ , the two irreducibles must have dimension $2 (N - 2)/2$ . When $N$ is even, there are two and their dimension must divide the order of the group, so is a power of two, so they must both have dimension $2 (N - 2)/2$ . The space on which the $V i$ 's act can be complexified. It will have complex dimension $N$ . It breaks up into some of complex irreducible representations of $G$ , all having dimension $2 (N - 2)/2$ . In particular this dimension is $\leq N$ , so $N$ is less than or equal to 8. If $N = 6$ , the dimension is 4, which does not divide 6. So N can only be 1, 2, 4 or 8.

Applications to Jordan algebras

Let $A$ be a Euclidean Hurwitz algebra and let $M n (A)$ be the algebra of $n$ -by- $n$ matrices over $A$ . It is a unital nonassociative algebra with an involution given by

\displaystyle {(x_{ij})^{*}=(x_{ji}^{*}).}

The trace $Tr(X)$ is defined as the sum of the diagonal elements of $X$ and the real-valued trace by $Tr R (X) = Re Tr(X)$ . The real-valued trace satisfies:

\operatorname {Tr} _{\mathbf {R} }XY=\operatorname {Tr} _{\mathbf {R} }YX,\qquad \operatorname {Tr} _{\mathbf {R} }(XY)Z=\operatorname {Tr} _{\mathbf {R} }X(YZ).

These are immediate consequences of the known identities for $n = 1$ .

In $A$ define the associator by

\displaystyle {[a,b,c]=a(bc)-(ab)c.}

It is trilinear and vanishes identically if $A$ is associative. Since $A$ is an alternative algebra $[a, a, b] = 0$ and $[b, a, a] = 0$ . Polarizing it follows that the associator is antisymmetric in its three entries. Furthermore, if $a$ , $b$ or $c$ lie in $R$ then $[a, b, c] = 0$ . These facts imply that $M 3 (A)$ has certain commutation properties. In fact if $X$ is a matrix in $M 3 (A)$ with real entries on the diagonal then

\displaystyle {[X,X^{2}]=aI,}

with $a$ in $A$ . In fact if $Y = [X, X 2]$ , then

\displaystyle {y_{ij}=\sum _{k,\ell }[x_{ik},x_{k\ell },x_{\ell j}].}

Since the diagonal entries of $X$ are real, the off-diagonal entries of $Y$ vanish. Each diagonal entry of $Y$ is a sum of two associators involving only off diagonal terms of $X$ . Since the associators are invariant under cyclic permutations, the diagonal entries of $Y$ are all equal.

Let $H n (A)$ be the space of self-adjoint elements in $M n (A)$ with product $X \circ Y = ⁠ 1 / 2 ⁠ (X Y + Y X)$ and inner product $(X, Y) = Tr R (X Y)$ .

Theorem. $H n (A)$ is a Euclidean Jordan algebra if $A$ is associative (the real numbers, complex numbers or quaternions) and $n \geq 3$ or if $A$ is nonassociative (the octonions) and $n = 3$ .

The exceptional Jordan algebra $H 3 (O)$ is called the Albert algebra after A.A. Albert.

To check that $H n (A)$ satisfies the axioms for a Euclidean Jordan algebra, the real trace defines a symmetric bilinear form with $(X, X) = Σ ‖ x ij ‖ 2$ . So it is an inner product. It satisfies the associativity property $(Z \circ X, Y) = (X, Z \circ Y)$ because of the properties of the real trace. The main axiom to check is the Jordan condition for the operators $L (X)$ defined by $L (X) Y = X \circ Y$ :

\displaystyle {[L(X),L(X^{2})]=0.}

This is easy to check when $A$ is associative, since $M n (A)$ is an associative algebra so a Jordan algebra with $X \circ Y = ⁠ 1 / 2 ⁠ (X Y + Y X)$ . When $A = O$ and $n = 3$ a special argument is required, one of the shortest being due to Freudenthal (1951).^[7]

In fact if $T$ is in $H 3 (O)$ with $Tr T = 0$ , then

\displaystyle {D(X)=TX-XT}

defines a skew-adjoint derivation of $H 3 (O)$ . Indeed,

\operatorname {Tr} (T(X(X^{2}))-T(X^{2}(X)))=\operatorname {Tr} T(aI)=\operatorname {Tr} (T)a=0,

so that

(D(X),X^{2})=0.

Polarizing yields:

(D(X),Y\circ Z)+(D(Y),Z\circ X)+(D(Z),X\circ Y)=0.

Setting $Z = 1$ shows that $D$ is skew-adjoint. The derivation property $D (X \circ Y) = D (X)\circ Y + X \circ D (Y)$ follows by this and the associativity property of the inner product in the identity above.

With $A$ and $n$ as in the statement of the theorem, let $K$ be the group of automorphisms of $E = H n (A)$ leaving invariant the inner product. It is a closed subgroup of $O (E)$ so a compact Lie group. Its Lie algebra consists of skew-adjoint derivations. Freudenthal (1951) showed that given $X$ in $E$ there is an automorphism $k$ in $K$ such that $k (X)$ is a diagonal matrix. (By self-adjointness the diagonal entries will be real.) Freudenthal's diagonalization theorem immediately implies the Jordan condition, since Jordan products by real diagonal matrices commute on $M n (A)$ for any non-associative algebra $A$ .

To prove the diagonalization theorem, take $X$ in $E$ . By compactness $k$ can be chosen in $K$ minimizing the sums of the squares of the norms of the off-diagonal terms of $k (X)$ . Since $K$ preserves the sums of all the squares, this is equivalent to maximizing the sums of the squares of the norms of the diagonal terms of $k (X)$ . Replacing $X$ by $k X$ , it can be assumed that the maximum is attained at $X$ . Since the symmetric group $S n$ , acting by permuting the coordinates, lies in $K$ , if $X$ is not diagonal, it can be supposed that $x 12$ and its adjoint $x 21$ are non-zero. Let $T$ be the skew-adjoint matrix with $(2, 1)$ entry $a$ , $(1, 2)$ entry $- a *$ and 0 elsewhere and let $D$ be the derivation ad $T$ of $E$ . Let $k t = exp tD$ in $K$ . Then only the first two diagonal entries in $X (t) = k t X$ differ from those of $X$ . The diagonal entries are real. The derivative of $x 11 (t)$ at $t = 0$ is the $(1, 1)$ coordinate of $[T, X]$ , i.e. $a * x 21 + x 12 a = 2(x 21, a)$ . This derivative is non-zero if $a = x 21$ . On the other hand, the group $k t$ preserves the real-valued trace. Since it can only change $x 11$ and $x 22$ , it preserves their sum. However, on the line $x + y =$ constant, $x 2 + y 2$ has no local maximum (only a global minimum), a contradiction. Hence $X$ must be diagonal.

Notes

^
See:
^
See:
- Eckmann 1989
- Eckmann 1999
^ Jordan, von Neumann & Wigner 1934
^ Faraut & Koranyi 1994, p. 82
^ Faraut & Koranyi 1994, pp. 81–86
^
See:
- Hurwitz 1923, p. 11
- Herstein 1968, pp. 141–144
^
See:
- Faraut & Koranyi 1994, pp. 88–91
- Postnikov 1986

References

Albert, A. A. (1934), "On a certain algebra of quantum mechanics", Ann. of Math., 35 (1): 65–73, doi:10.2307/1968118, JSTOR 1968118
Chevalley, C. (1954), The algebraic theory of spinors and Clifford algebras, Columbia University Press
Eckmann, Beno (1943), "Gruppentheoretischer Beweis des Satzes von Hurwitz–Radon über die Komposition quadratischer Formen", Comment. Math. Helv., 15: 358–366, doi:10.1007/bf02565652, S2CID 123322808
Eckmann, Beno (1989), "Hurwitz–Radon matrices and periodicity modulo 8", Enseign. Math., 35: 77–91, archived from the original on 2013-06-16
Eckmann, Beno (1999), "Topology, algebra, analysis—relations and missing links", Notices Amer. Math. Soc., 46: 520–527
Faraut, J.; Koranyi, A. (1994), Analysis on symmetric cones, Oxford Mathematical Monographs, Oxford University Press, ISBN 978-0198534778
Freudenthal, Hans (1951), Oktaven, Ausnahmegruppen und Oktavengeometrie, Mathematisch Instituut der Rijksuniversiteit te Utrecht
Freudenthal, Hans (1985), "Oktaven, Ausnahmegruppen und Oktavengeometrie", Geom. Dedicata, 19: 7–63, doi:10.1007/bf00233101, S2CID 121496094 (reprint of 1951 article)
Herstein, I. N. (1968), Noncommutative rings, Carus Mathematical Monographs, vol. 15, Mathematical Association of America, ISBN 978-0883850152
Hurwitz, A. (1898), "Über die Composition der quadratischen Formen von beliebig vielen Variabeln", Goett. Nachr.: 309–316
Hurwitz, A. (1923), "Über die Komposition der quadratischen Formen", Math. Ann., 88 (1–2): 1–25, doi:10.1007/bf01448439, S2CID 122147399
Jacobson, N. (1968), Structure and representations of Jordan algebras, American Mathematical Society Colloquium Publications, vol. 39, American Mathematical Society
Jordan, P.; von Neumann, J.; Wigner, E. (1934), "On an algebraic generalization of the quantum mechanical formalism", Ann. of Math., 35 (1): 29–64, doi:10.2307/1968117, JSTOR 1968117
Lam, Tsit-Yuen (2005), Introduction to Quadratic Forms over Fields, Graduate Studies in Mathematics, vol. 67, American Mathematical Society, ISBN 978-0-8218-1095-8, MR 2104929, Zbl 1068.11023
Lee, H. C. (1948), "Sur le théorème de Hurwitz-Radon pour la composition des formes quadratiques", Comment. Math. Helv., 21: 261–269, doi:10.1007/bf02568038, S2CID 121079375, archived from the original on 2014-05-03
Porteous, I.R. (1969), Topological Geometry, Van Nostrand Reinhold, ISBN 978-0-442-06606-2, Zbl 0186.06304
Postnikov, M. (1986), Lie groups and Lie algebras. Lectures in geometry. Semester V, Mir
Radon, J. (1922), "Lineare scharen orthogonaler matrizen", Abhandlungen aus dem Mathematischen Seminar der Universität Hamburg, 1: 1–14, doi:10.1007/bf02940576, S2CID 120583389
Rajwade, A. R. (1993), Squares, London Mathematical Society Lecture Note Series, vol. 171, Cambridge University Press, ISBN 978-0-521-42668-8, Zbl 0785.11022
Schafer, Richard D. (1995) [1966], An introduction to non-associative algebras, Dover Publications, ISBN 978-0-486-68813-8, Zbl 0145.25601
Shapiro, Daniel B. (2000), Compositions of quadratic forms, De Gruyter Expositions in Mathematics, vol. 33, Walter de Gruyter, ISBN 978-3-11-012629-7, Zbl 0954.11011