On m-tuples of nilpotent $2 \times 2$ matrices over an arbitrary field

Department of Mathematics, Universidade Estadual de Campinas (UNICAMP), 651 Sergio Buarque de Holanda, 13083-859 Campinas, SP, Brazil

E-mail Address: dr.artem.lopatin@gmail.com

https://doi.org/10.1142/S0218196724500504

Abstract

The algebra of ${GL}_{n}$ -invariants of m-tuples of $n \times n$ matrices with respect to the action by simultaneous conjugation is a classical topic in case of infinite base field. On the other hand, in case of a finite field generators of polynomial invariants are not known even for a pair of $2 \times 2$ matrices. Working over an arbitrary field we classified all ${GL}_{2}$ -orbits on m-tuples of $2 \times 2$ nilpotent matrices for all $m > 0$ . As a consequence, we obtained a minimal separating set for the algebra of ${GL}_{2}$ -invariant polynomial functions of m-tuples of $2 \times 2$ nilpotent matrices. We also described the least possible number of elements of a separating set for an algebra of invariant polynomial functions over a finite field.

Communicated: Ualbai Umirbaev

Keywords:

AMSC: 16R30, 13A50

1. Introduction

1.1. Algebras of invariants

All vector spaces, algebras, and modules are over an arbitrary (possibly finite) field $𝔽$ of arbitrary characteristic $p \geq 0$ unless otherwise stated. By an algebra we always mean an associative algebra with unity.

Consider an n-dimensional vector space V over the field $𝔽$ with a fixed basis $v_{1}, \dots, v_{n}$ . The coordinate ring $𝔽 [V] = 𝔽 [x_{1}, \dots, x_{n}]$ of V is isomorphic to the symmetric algebra $S (V^{*})$ over the dual space $V^{*}$ , where $x_{1}, \dots, x_{n}$ is the basis for $V^{*}$ . Let G be a subgroup of $GL (V) ≅ {GL}_{n} (𝔽)$ . The algebra $𝔽 [V]$ becomes a G-module with

$$(g \cdot f) (v) = f (g^{- 1} \cdot v) \tag{1}$$

for all $f \in V^{*}$ and $v \in V$. The algebra of polynomial invariants is defined as follows:

$$𝔽 {[V]}^{G} = \{f \in 𝔽 [V] \mid g \cdot f = f \text{ for all } g \in G\}.$$

The algebra of polynomial functions on V is $𝒪 (V) = 𝔽 [{\bar{x}}_{1}, \dots, {\bar{x}}_{n}]$, where ${\bar{x}}_{i} : v \mapsto v_{i}$ for $v = (v_{1}, \dots, v_{n}) \in V$ written in coordinates with respect to the fixed basis of V. The algebra $𝒪 (V)$ becomes a G-module by formula (1) for all $f \in 𝒪 (V)$ and $v \in V$. The algebra of invariant polynomial functions is defined as follows:

$$\begin{aligned} 𝒪 {(V)}^{G} &= \{f \in 𝒪 (V) \mid g \cdot f = f \text{ for all } g \in G\} \\ &= \{f \in 𝒪 (V) \mid f (g \cdot v) = f (v) \text{ for all } g \in G, v \in V\}. \end{aligned}$$

Obviously, $𝒪 (V) ≃ 𝔽 [V] ∕ Ker (Ψ)$ for the homomorphism of algebras $Ψ : 𝔽 [V] \to 𝒪 (V)$ defined by $x_{i} \to {\bar{x}}_{i}$. In case $𝔽$ is infinite we have that $Ker (Ψ) = 0$ and $𝔽 {[V]}^{G} = 𝒪 {(V)}^{G}$. If $𝔽 = 𝔽_{q}$ is a finite field, then the ideal $Ker (Ψ)$ is generated by $x_{i}^{q} - x_{i}$ for all $1 \leq i \leq n$. Using $Ψ$ we can consider any element $f \in 𝔽 [V]$ as the map $V \to 𝔽$. Namely, we write $f (v)$ for $Ψ (f) (v)$, where $v \in V$. For any infinite extension $𝔽 \subset 𝕂$ we denote $V_{𝕂} = V \otimes_{𝔽} 𝕂$. Therefore,

$$\begin{aligned} 𝔽 {[V]}^{G} &= \{f \in 𝔽 [V] \mid f (g \cdot v) = f (v) \text{ for all } g \in G, v \in V_{𝕂}\} \\ &\subset \{f \in 𝔽 [V] \mid f (g \cdot v) = f (v) \text{ for all } g \in G, v \in V\}. \end{aligned}$$
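The difference between polynomials and polynomial functions over a finite field can be checked directly. The following is a minimal sketch for a prime field $𝔽_{p}$ only (the helper names `is_zero_function` and `x_pow_q_minus_x` are ours, chosen for illustration): it confirms that the non-zero polynomial $x^{p} - x$ lies in $Ker (Ψ)$, i.e., vanishes at every point of $𝔽_{p}$.

```python
def is_zero_function(coeffs, p):
    """coeffs[k] is the coefficient of x^k.

    Returns True if the polynomial vanishes at every point of F_p,
    i.e. if it lies in the kernel of the map Psi in one variable.
    """
    return all(
        sum(c * pow(a, k, p) for k, c in enumerate(coeffs)) % p == 0
        for a in range(p)
    )


def x_pow_q_minus_x(p):
    """Coefficient list of x^p - x (assumption: q = p is prime)."""
    coeffs = [0] * (p + 1)
    coeffs[1] = -1
    coeffs[p] = 1
    return coeffs


if __name__ == "__main__":
    for p in (2, 3, 5, 7):
        print(p, is_zero_function(x_pow_q_minus_x(p), p))
```

For a general finite field $𝔽_{q}$ with $q$ a prime power the same statement holds, but the arithmetic would need a proper field implementation, which this sketch does not provide.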

Assume that $W \subset V$ is a G-invariant subset of V. The algebra of polynomial functions $𝒪 (W)$ on W is generated by functions $y_{1}, \dots, y_{n} : W \to 𝔽$, where $y_{i}$ is the restriction of ${\bar{x}}_{i} : V \to 𝔽$ to W. The algebra $𝒪 (W)$ becomes a G-module by formula (1) for all $f \in 𝒪 (W)$ and $v \in W$. Denote by $I (W)$ the ideal of all $f \in 𝒪 (V)$ that vanish on W. Then we have $𝒪 (W) ≃ 𝒪 (V) ∕ I (W)$ and $y_{i} = {\bar{x}}_{i} + I (W)$. The algebra $𝒪 {(W)}^{G}$ of invariant polynomial functions on W is defined in the same way as $𝒪 {(V)}^{G}$. The degree of $f \in 𝒪 (W)$ is defined as the minimal degree of a polynomial $h \in 𝔽 [V]$ with $f = Ψ (h) + I (W)$.

1.2. Separating invariants

In 2002, Derksen and Kemper [6] (see [7] for the second edition) introduced the notion of separating invariants as a weaker concept than generating invariants. Given a subset S of $𝒪 {(W)}^{G}$, we say that elements $u, v$ of W are separated by S if there exists an invariant $f \in S$ with $f (u) \neq f (v)$. If $u, v \in W$ are separated by $𝒪 {(W)}^{G}$, then we simply say that they are separated. A subset $S \subset 𝒪 {(W)}^{G}$ of the invariant ring is called separating if any $v, w$ from W that are separated are also separated by S. A separating subset for $𝔽 {[V]}^{G}$ is defined in the same manner. We say that a separating set is minimal if it is minimal with respect to inclusion. Obviously, any generating set is also separating. Denote by $β_{sep} (𝒪 {(W)}^{G})$ the minimal integer $β_{sep}$ such that the set of all invariant polynomial functions of degree less than or equal to $β_{sep}$ is separating for $𝒪 {(W)}^{G}$. Minimal separating sets for different actions were constructed in [5, 16, 17, 27, 28, 33, 34, 37].

Separating invariants for $𝔽 {[V]}^{G}$ in case of a finite field $𝔽 = 𝔽_{q}$ with q elements were studied by Kemper, Lopatin and Reimers in [28]. Namely, it was shown that the minimal number of separating invariants for $𝔽 {[V]}^{G}$ is $γ (q, κ) = ⌈ {log}_{q} (κ) ⌉$ , where $κ$ stands for the number of G-orbits in V, and an explicit construction of a minimal separating set of $γ (q, κ)$ invariants of degree at most $| G | n (q - 1)$ was given. Moreover, a minimal separating set of $γ (q, κ)$ invariants of degree $\leq n (q - 1)$ in the non-modular case was constructed as well as in the case when G consists entirely of monomial matrices.
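The quantity $γ (q, κ) = ⌈ {log}_{q} (κ) ⌉$ is the smallest non-negative integer $γ$ with $q^{γ} \geq κ$. A minimal sketch of its computation follows; the function name `gamma` is ours, and integer arithmetic is used on purpose to avoid floating-point rounding in the logarithm.

```python
def gamma(q, kappa):
    """Smallest integer g >= 0 with q**g >= kappa, i.e. ceil(log_q(kappa)).

    q is the size of the finite field, kappa the number of G-orbits.
    """
    if q < 2 or kappa < 1:
        raise ValueError("expected q >= 2 and kappa >= 1")
    g, power = 0, 1
    while power < kappa:
        power *= q
        g += 1
    return g


if __name__ == "__main__":
    for q, kappa in [(2, 1), (2, 5), (3, 7), (3, 9), (3, 10)]:
        print(q, kappa, gamma(q, kappa))
```

The loop form makes the counting argument visible: $γ$ invariants with values in $𝔽_{q}$ can distinguish at most $q^{γ}$ orbits.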

1.3. Matrix invariants

For $n > 1$ and $m \geq 1$, the direct sum $M_{n}^{m}$ of m copies of the space of $n \times n$ matrices over $𝔽$ is a ${GL}_{n}$-module with respect to the diagonal action by conjugation: $g \cdot \underset{̲}{A} = (g A_{1} g^{- 1}, \dots, g A_{m} g^{- 1})$ for $g \in {GL}_{n}$ and $\underset{̲}{A} = (A_{1}, \dots, A_{m})$ from $M_{n}^{m}$. Given a matrix A, denote by $A_{i j}$ the $(i, j)$th entry of A. Each generator $x_{i j} (k)$ of the coordinate ring

$$𝔽 [M_{n}^{m}] = 𝔽 [x_{i j} (k) \mid 1 \leq i, j \leq n, 1 \leq k \leq m]$$

of $M_{n}^{m}$ can be considered as the polynomial function $x_{i j} (k) : M_{n}^{m} \to 𝔽$, which sends $(A_{1}, \dots, A_{m})$ to ${(A_{k})}_{i j}$. The algebra of invariant polynomial functions $𝒪 {(M_{n}^{m})}^{{GL}_{n}}$ is defined as in Sec. 1.1.

Sibirskii [38], Procesi [35] and Donkin [18] established that the algebra of invariant polynomial functions $𝒪 {(M_{n}^{m})}^{{GL}_{n}}$ is generated by $σ_{t} (X_{k_{1}} \dots X_{k_{r}})$ in case $𝔽$ is infinite, where $X_{k}$ stands for the $n \times n$ generic matrix ${(x_{i j} (k))}_{1 \leq i, j \leq n}$ and $σ_{t}$ stands for the $t th$ coefficient of the characteristic polynomial, i.e., $det (λ E - A) = \sum_{t = 0}^{n} {(- 1)}^{t} λ^{n - t} σ_{t} (A)$ for any $n \times n$ matrix A over a commutative ring. In particular, $σ_{0} (A) = 1$ , $σ_{1} (A) = tr (A)$ and $σ_{n} (A) = det (A)$ .

For a monomial $c \in 𝔽 [M_{n}^{m}]$ denote by $deg c \in ℕ$ its degree and by $mdeg c \in ℕ^{m}$ its multidegree, where $ℕ$ stands for the set of non-negative integers. Namely, $mdeg c = (t_{1}, \dots, t_{m})$, where $t_{k}$ is the total degree of the monomial c in $x_{i j} (k)$, $1 \leq i, j \leq n$, and $deg c = t_{1} + \dots + t_{m}$.

In case of an infinite field of arbitrary characteristic the following minimal separating set for $𝒪 {(M_{2}^{m})}^{{GL}_{2}}$ was given by Kaygorodov, Lopatin and Popov [27]:

$$tr (X_{k}^{2}), \quad 1 \leq k \leq m; \qquad tr (X_{k_{1}} \dots X_{k_{r}}), \quad r \in \{1, 2, 3\}, \quad 1 \leq k_{1} < \dots < k_{r} \leq m. \tag{2}$$

Note that set (2) generates the algebra $𝒪 {(M_{2}^{m})}^{{GL}_{2}}$ if and only if the characteristic of $𝔽$ is different from two or $m \leq 3$ (see [11, 36]). A minimal generating set for $𝒪 {(M_{3}^{m})}^{{GL}_{3}}$ was given by Lopatin in [29, 30, 31] in the case of an arbitrary infinite field. Over a field of characteristic zero a minimal generating set for $𝒪 {(M_{n}^{2})}^{{GL}_{n}}$ was established by Drensky and Sadikova [19] in case $n = 4$ (see also [40]) and by Đoković [8] in case $n = 5$ (see also [9]). Some upper bounds on degrees of generating and separating invariants for $𝒪 {(M_{n}^{m})}^{{GL}_{n}}$ were given by Derksen and Makam in [13, 14]. A minimal separating set for the algebra of matrix semi-invariants $𝒪 {(M_{2}^{m})}^{{SL}_{2} \times {SL}_{2}}$ was explicitly described by Domokos [16] over an arbitrary algebraically closed field. Note that a minimal generating set for $𝒪 {(M_{2}^{m})}^{{SL}_{2} \times {SL}_{2}}$ was given by Lopatin [32] over an arbitrary infinite field (see also [17]). Elmer [20, 21] obtained lower bounds on the least possible number of elements for separating sets for $𝒪 {(M_{2}^{m})}^{{GL}_{2}}$ and $𝒪 {(M_{2}^{m})}^{{SL}_{2} \times {SL}_{2}}$ in case $𝔽 = ℂ$. More results on separating invariants of matrices can be found in [39].

1.4. Nilpotent matrices

Denote by $𝒩_{n}$ the set of all $n \times n$ nilpotent matrices, i.e., $A \in M_{n}$ belongs to $𝒩_{n}$ if and only if $σ_{1} (A) = σ_{2} (A) = \dots = σ_{n} (A) = 0$ , or equivalently, $A^{n} = 0$ .
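For $n = 2$ and a small prime field the two descriptions of $𝒩_{2}$ can be compared by brute force. The sketch below is an illustration only (the helper names `mat_mul` and `nilpotent_2x2` are ours, and the field is assumed to be $𝔽_{p}$ with $p$ prime): it lists all $A$ with $A^{2} = 0$ and lets one check that exactly these matrices have zero trace and zero determinant.

```python
from itertools import product


def mat_mul(A, B, p):
    """Product of two 2x2 matrices over F_p, stored as nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) % p for j in range(2))
        for i in range(2)
    )


def nilpotent_2x2(p):
    """All 2x2 matrices A over F_p with A^2 = 0."""
    zero = ((0, 0), (0, 0))
    result = []
    for a, b, c, d in product(range(p), repeat=4):
        A = ((a, b), (c, d))
        if mat_mul(A, A, p) == zero:
            result.append(A)
    return result


def has_zero_trace_and_det(A, p):
    """sigma_1(A) = sigma_2(A) = 0 in F_p."""
    (a, b), (c, d) = A
    return (a + d) % p == 0 and (a * d - b * c) % p == 0


if __name__ == "__main__":
    for p in (2, 3, 5):
        nil = nilpotent_2x2(p)
        print(p, len(nil), all(has_zero_trace_and_det(A, p) for A in nil))
```

An elementary count of the solutions of $a^{2} + b c = 0$ shows that there are $p^{2}$ such matrices, which the sketch can be used to confirm for small $p$.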

Over a finite field, the number of ${GL}_{n}$-orbits on $𝒩_{n}^{m}$ was studied by Hua [24]. The variety of pairs of commuting nilpotent matrices has been extensively studied over the past 40 years (see [1, 2, 3, 4, 25, 26] for more details).

The algebra of polynomial functions $𝒪 (𝒩_{n}^{m})$ of $𝒩_{n}^{m} \subset M_{n}^{m}$ is generated by $y_{i j} (k)$ for $1 \leq i, j \leq n$ and $1 \leq k \leq m$, where $y_{i j} (k) : 𝒩_{n}^{m} \to 𝔽$ sends $(A_{1}, \dots, A_{m})$ to ${(A_{k})}_{i j}$. As in Sec. 1.1 we have $𝒪 (𝒩_{n}^{m}) = 𝒪 (M_{n}^{m}) ∕ I (𝒩_{n}^{m})$. The algebra $𝒪 {(𝒩_{n}^{m})}^{{GL}_{n}}$ of invariant polynomial functions on nilpotent matrices is defined in the same way as $𝒪 {(M_{n}^{m})}^{{GL}_{n}}$. Note that $σ_{t} (Y_{k_{1}} \dots Y_{k_{r}})$ lies in $𝒪 {(𝒩_{n}^{m})}^{{GL}_{n}}$, where $Y_{k} = {(y_{i j} (k))}_{1 \leq i, j \leq n}$ stands for the generic nilpotent $n \times n$ matrix. Obviously, $σ_{t} (Y_{k}^{s}) = 0$ for all $1 \leq t \leq n$, $s > 0$ and $Y_{k}^{n} = 0$. It is easy to see that in case of an algebraically closed field of zero characteristic the algebra $𝒪 {(𝒩_{n}^{m})}^{{GL}_{n}}$ is generated by $tr (Y_{i_{1}} \dots Y_{i_{r}})$ (for example, see [5]), but in the general case generators for $𝒪 {(𝒩_{n}^{m})}^{{GL}_{n}}$ are not known for $n, m \geq 2$. Moreover, over a finite field $𝔽$ generators for the algebras $𝒪 {(M_{n}^{m})}^{{GL}_{n}}$ and $𝔽 {[M_{n}^{m}]}^{{GL}_{n}}$ are not known either.

Working over an algebraically closed field of zero characteristic Cavalcante and Lopatin in [5] showed that the set

$$S_{2, m} = \{tr (Y_{i} Y_{j}), \ 1 \leq i < j \leq m; \ tr (Y_{i} Y_{j} Y_{k}), \ 1 \leq i < j < k \leq m\}$$

is a minimal generating set and a minimal separating set for the algebra $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ for $m > 0$. Minimal generating sets and minimal separating sets for the algebras $𝒪 {(𝒩_{3}^{2})}^{{GL}_{3}}$ and $𝒪 {(𝒩_{3}^{3})}^{{GL}_{3}}$ were also obtained in [5] in case $p = 0$.

1.5. Results

For $𝔽 = 𝔽_{q}$ in Sec. 2 we extend the results from [28] given in Sec. 1.2 to the algebra $𝒪 {(W)}^{G}$ of G-invariant polynomial functions on a G-invariant subset $W \subset V$ (see Theorem 2.1). Note that a separating set for $𝒪 {(W)}^{G}$ separates all G-orbits on W in case $𝔽$ is finite (for example, it follows from the proof of Theorem 2.1).

In Sec. 3, working over an arbitrary field, we explicitly describe a minimal set of representatives of all ${GL}_{2}$ -orbits on $𝒩_{2}^{m}$ in Theorem 3.4. As a consequence, in case of $𝔽 = 𝔽_{q}$ we explicitly calculate the number of orbits in Corollary 3.5 as well as the least possible number of elements for a separating set for $𝒪 {(𝒩_{2}^{2})}^{{GL}_{2}}$ . We formulate Conjecture 3.7 about the number of ${GL}_{n}$ -orbits on $𝒩_{n}^{m}$ .

To formulate results about separating sets, introduce some notations. For an arbitrary field $𝔽$ denote

$$S_{2, m}^{(2)} = \{tr (Y_{i} Y_{j}), \ 1 \leq i < j \leq m\}.$$

Now assume that $𝔽$ is finite. For all $1 \leq i, j \leq m$ and $α \in 𝔽^{\times}$ define $ζ (Y_{i})$ and $η_{α} (Y_{i}, Y_{j})$ from $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ as follows:

$$ζ (A) = \begin{cases} 0 & \text{if } A \neq 0, \\ 1 & \text{if } A = 0, \end{cases} \tag{3}$$

$$η_{α} (A, B) = \begin{cases} 0 & \text{if } α A \neq B, \\ 1 & \text{if } α A = B, \end{cases} \tag{4}$$

where $A, B \in 𝒩_{2}$. The presentations of $ζ (Y_{i})$ and $η_{α} (Y_{i}, Y_{j})$ as polynomials in $\{y_{i j} (k)\}$ are explicitly given in Sec. 4. In Theorem 4.5, we establish that the set

$$H_{2, m}^{(2)} = S_{2, m}^{(2)} ⊔ \{ζ (Y_{i}), \ 1 \leq i \leq m; \ η_{α} (Y_{i}, Y_{j}), \ 1 \leq i < j \leq m, \ α \in 𝔽 ∖ \{0, 1\}\}$$

is a minimal separating set for $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ in case $char 𝔽 = 2$ and the set

$$H_{2, m} = S_{2, m} ⊔ \{ζ (Y_{i}), \ 1 \leq i \leq m; \ η_{α} (Y_{i}, Y_{j}), \ 1 \leq i < j \leq m, \ α \in 𝔽 ∖ \{0, 1\}\}$$

is a minimal separating set for $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ in case $char 𝔽 > 2$.

In Theorem 5.4, we describe a minimal separating set in case of an infinite field. Note that over an infinite field the functions $ζ (Y_{i})$ and $η_{α} (Y_{i}, Y_{j})$ do not belong to $𝒪 (𝒩_{2}^{m})$ . Some corollaries are given in Sec. 6.
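That the functions $ζ$ and $η_{α}$ from (3) and (4) are constant on ${GL}_{2}$-orbits can be confirmed by exhaustive search over a small field. The following sketch is illustrative only: it assumes $𝔽 = 𝔽_{3}$ (the constant `P`), treats $ζ$ and $η_{α}$ as set-theoretic functions rather than as the polynomials of Sec. 4, and all helper names are ours.

```python
from itertools import product

P = 3  # assumed small prime; the field is F_P
ZERO = ((0, 0), (0, 0))


def mul(A, B):
    """Product of two 2x2 matrices over F_P."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) % P for j in range(2))
        for i in range(2)
    )


def all_matrices():
    return [((a, b), (c, d)) for a, b, c, d in product(range(P), repeat=4)]


def det(g):
    return (g[0][0] * g[1][1] - g[0][1] * g[1][0]) % P


def inverse(g):
    """Inverse of an invertible 2x2 matrix via the adjugate."""
    (a, b), (c, d) = g
    s = pow(det(g), P - 2, P)  # inverse of det by Fermat's little theorem
    return ((d * s % P, -b * s % P), (-c * s % P, a * s % P))


def conj(g, A):
    """The action g . A = g A g^{-1}."""
    return mul(mul(g, A), inverse(g))


def zeta(A):
    """Formula (3): indicator of the zero matrix."""
    return 1 if A == ZERO else 0


def eta(alpha, A, B):
    """Formula (4): indicator of alpha * A == B."""
    scaled = tuple(tuple(alpha * x % P for x in row) for row in A)
    return 1 if scaled == B else 0


def invariance_holds():
    """Check zeta and eta_alpha on every g in GL_2 and all nilpotent A, B."""
    nil = [A for A in all_matrices() if mul(A, A) == ZERO]
    group = [g for g in all_matrices() if det(g) != 0]
    for g in group:
        for A in nil:
            if zeta(conj(g, A)) != zeta(A):
                return False
            for B in nil:
                for alpha in range(1, P):
                    if eta(alpha, conj(g, A), conj(g, B)) != eta(alpha, A, B):
                        return False
    return True


if __name__ == "__main__":
    print(invariance_holds())
```

The check relies only on the fact that conjugation is linear and bijective, so the same reasoning applies over any field; the search itself is of course limited to tiny fields.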

1.6. Notations

If for $\underset{̲}{A}, \underset{̲}{B} \in M_{n}^{m}$ there exists $g \in {GL}_{n}$ such that $g \cdot \underset{̲}{A} = \underset{̲}{B}$ , then we write $\underset{̲}{A} \sim \underset{̲}{B}$ and say that $\underset{̲}{A}$ , $\underset{̲}{B}$ are similar. We say that $\underset{̲}{A}$ has no zeros if $A_{i}$ is non-zero for all i. Denote by $wz (\underset{̲}{A})$ the result of elimination of all zero matrices from $\underset{̲}{A}$ . As an example, for $\underset{̲}{A} = (0, B, 0, C, D)$ with non-zero $B, C, D \in M_{n}$ , we have $wz (\underset{̲}{A}) = (B, C, D)$ . For a permutation $σ \in 𝒮_{m}$ , we write ${\underset{̲}{A}}_{σ}$ for $(A_{σ (1)}, \dots, A_{σ (m)})$ .

Denote by E the identity matrix and by $E_{i j}$ the matrix such that the $(i, j)$th entry is equal to one and the remaining entries are zero. Denote by $𝔽^{\times}$ the set of all non-zero elements of $𝔽$. For short, we write $\underset{̲}{0}$ for $(0, \dots, 0) \in M_{n}^{m}$. Given $A \in M_{n}$ and $r \geq 0$, we write $A^{(r)}$ for $(\underbrace{A, \dots, A}_{r}) \in M_{n}^{r}$.

2. Minimal Separating Invariants for $𝒪 {(W)}^{G}$ over a Finite Field

In this section, we assume that $𝔽 = 𝔽_{q}$ , $W \subset V$ is a G-invariant subset of V, and $dim V = n$ . Denote $γ = γ (q, κ) = ⌈ {log}_{q} (κ) ⌉$ , where $κ$ stands for the number of G-orbits on W. For any vector $w = (w_{1}, \dots, w_{n}) \in W$ consider the following polynomial function from $𝒪 (W)$ :

$$f_{w} = {(- 1)}^{n} \prod_{α \in 𝔽 ∖ \{w_{1}\}} (y_{1} - α) \ \dots \prod_{α \in 𝔽 ∖ \{w_{n}\}} (y_{n} - α).$$

Note that $f_{w} (w) = {(- 1)}^{n} {(\prod_{α \in 𝔽^{\times}} α)}^{n} = 1$ and for each $v \in W$ we have

$$f_{w} (v) = \begin{cases} 1, & v = w, \\ 0, & \text{otherwise.} \end{cases}$$
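The indicator property of $f_{w}$ can be tested numerically. The sketch below assumes a prime field $𝔽_{p}$ and takes $W = V = 𝔽_{p}^{n}$ for simplicity; the function name `f_w` mirrors the notation of the text, and its argument order is our choice.

```python
from itertools import product


def f_w(w, v, p):
    """Evaluate (-1)^n * prod_i prod_{alpha != w_i} (v_i - alpha) in F_p."""
    n = len(w)
    value = (-1) ** n
    for w_i, v_i in zip(w, v):
        for alpha in range(p):
            if alpha != w_i:
                value = value * (v_i - alpha) % p
    return value % p


def is_indicator(p, n):
    """True if f_w(v) is 1 for v == w and 0 otherwise, for all v, w in F_p^n."""
    points = list(product(range(p), repeat=n))
    return all(
        f_w(w, v, p) == (1 if v == w else 0) for w in points for v in points
    )


if __name__ == "__main__":
    for p, n in [(2, 2), (3, 2), (5, 1)]:
        print(p, n, is_indicator(p, n))
```

The value at $v = w$ equals one because the product of all non-zero elements of $𝔽_{p}$ is $- 1$, so the two signs cancel; at any other point some factor $v_{i} - α$ with $α = v_{i} \neq w_{i}$ vanishes.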

We present W as a union of G-orbits: $W = U_{1} ⊔ \dots ⊔ U_{κ}$. For every $1 \leq j \leq κ$ define

$$f_{j} = \sum_{u \in U_{j}} f_{u}. \tag{5}$$

Then for each $v \in W$, we have

$$f_{j} (v) = \begin{cases} 1 & v \in U_{j}, \\ 0 & \text{otherwise}. \end{cases} \tag{6}$$

Therefore, $f_{1}, \dots, f_{κ}$ belong to $𝒪 {(W)}^{G}$ and they form a separating set for $𝒪 {(W)}^{G}$. By the definition of $γ$ we have that $q^{γ - 1} < κ \leq q^{γ}$. Thus, there exist $κ$ pairwise different vectors in $𝔽^{γ}$, which we may put together in a matrix $(α_{i j})$ over $𝔽$ of size $γ \times κ$.

Theorem 2.1.

(1) Every separating set for $𝒪 {(W)}^{G}$ contains at least $γ (q, κ)$ elements.

(2) Let $(α_{i j}) \in 𝔽^{γ \times κ}$ be a matrix whose columns are pairwise different and define

$$h_{i} = α_{i 1} f_{1} + \dots + α_{i κ} f_{κ} \quad (1 \leq i \leq γ).$$

Then $\{h_{1}, \dots, h_{γ}\}$ is a minimal separating set for $𝒪 {(W)}^{G}$ consisting of (non-homogeneous) elements of degree less than or equal to $n (q - 1)$.

Proof. Consider some representatives of G-orbits on W: $u_{1} \in U_{1}, \dots, u_{κ} \in U_{κ}$ .

(1) Assume that $\{l_{1}, \dots, l_{r}\}$ is a separating set for $𝒪 {(W)}^{G}$. Since any two distinct G-orbits on W are separated by $f_{1}, \dots, f_{κ}$, they are also separated by $\{l_{1}, \dots, l_{r}\}$, so the column vectors

$$\left( \begin{matrix} l_{1} (u_{j}) \\ \vdots \\ l_{r} (u_{j}) \end{matrix} \right) \quad \text{with } 1 \leq j \leq κ$$

are pairwise different. Hence, $κ \leq q^{r}$ and $γ \leq r$ follows from the definition of $γ$.

(2) Applying formula (6), we obtain that

$$\left( \begin{matrix} h_{1} (u_{j}) \\ \vdots \\ h_{γ} (u_{j}) \end{matrix} \right) = \left( \begin{matrix} α_{1 j} \\ \vdots \\ α_{γ j} \end{matrix} \right) \quad \text{for all } 1 \leq j \leq κ.$$

By the conditions of the theorem, these column vectors are pairwise different, hence $\{h_{1}, \dots, h_{γ}\}$ is a separating set. The minimality follows from part (1). □

3. Classification of ${GL}_{2}$ -orbits on $𝒩_{2}^{m}$

In this section we assume that $𝔽$ is an arbitrary field.

Lemma 3.1. If $\underset{̲}{A} \in 𝒩_{2}^{2}$ has no zeros, then $\underset{̲}{A} \sim (E_{12}, α E_{12})$ or $\underset{̲}{A} \sim (E_{12}, α E_{21})$ for some non-zero $α \in 𝔽$ .

Proof. For each field $𝔽$ we have that $A_{1} \sim E_{12}$ . Therefore, without loss of generality, we can assume that $A_{1} = E_{12}$ . Denote $A_{2} = (\begin{matrix} a_{1} & a_{2} \\ a_{3} & - a_{1} \end{matrix})$ for some $a_{1}, a_{2}, a_{3} \in 𝔽$ and assume that $g = (\begin{matrix} g_{1} & g_{2} \\ 0 & g_{1} \end{matrix})$ lies in ${GL}_{2}$ . Then we have that $g \cdot A_{1} = A_{1}$ and

$$g \cdot A_{2} = \begin{pmatrix} h_{1} & h_{2} \\ a_{3} & - h_{1} \end{pmatrix},$$

for $h_{1} = a_{1} + \frac{g_{2}}{g_{1}} a_{3}$ and $h_{2} = (a_{2} g_{1}^{2} - 2 a_{1} g_{1} g_{2} - a_{3} g_{2}^{2}) / g_{1}^{2}$.

Assume that $a_{3} \neq 0$ . Then we take $g_{1} = 1$ and $g_{2} = - \frac{a_{1}}{a_{3}}$ to obtain that $h_{1} = 0$ . Since $g \cdot A_{2}$ is nilpotent, we obtain $h_{2} = 0$ and $g \cdot A_{2} = a_{3} E_{21}$ .

Assume that $a_{3} = 0$ . Since $A_{2}$ is nilpotent, we obtain $A_{2} = a_{2} E_{12}$ . □
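The statement of Lemma 3.1 can be sanity-checked by brute force over small prime fields. The sketch below is only an illustration under stated assumptions: it works over $𝔽_{p}$ with p prime (not a general $𝔽_{q}$), stores a matrix as a tuple `(a11, a12, a21, a22)`, and all function names are hypothetical helpers that do not come from the paper.

```python
from itertools import product

E12 = (0, 1, 0, 0)  # matrices are stored row-wise as (a11, a12, a21, a22)

def mat_mul(a, b, p):
    """Product of two 2x2 matrices over F_p."""
    return (
        (a[0] * b[0] + a[1] * b[2]) % p,
        (a[0] * b[1] + a[1] * b[3]) % p,
        (a[2] * b[0] + a[3] * b[2]) % p,
        (a[2] * b[1] + a[3] * b[3]) % p,
    )

def gl2_with_inverses(p):
    """All pairs (g, g^{-1}) with g in GL_2(F_p), p prime."""
    pairs = []
    for g in product(range(p), repeat=4):
        det = (g[0] * g[3] - g[1] * g[2]) % p
        if det:
            d = pow(det, -1, p)  # modular inverse, needs Python 3.8+
            g_inv = (g[3] * d % p, -g[1] * d % p, -g[2] * d % p, g[0] * d % p)
            pairs.append((g, g_inv))
    return pairs

def nonzero_nilpotent(p):
    """Non-zero 2x2 matrices A over F_p with A^2 = 0."""
    zero = (0, 0, 0, 0)
    return [a for a in product(range(p), repeat=4)
            if a != zero and mat_mul(a, a, p) == zero]

def lemma_3_1_holds(p):
    """Check that every pair of non-zero nilpotent matrices is conjugate to
    (E12, alpha*E12) or (E12, alpha*E21) for some non-zero alpha."""
    group = gl2_with_inverses(p)
    targets = {(0, alpha, 0, 0) for alpha in range(1, p)}
    targets |= {(0, 0, alpha, 0) for alpha in range(1, p)}
    for a1, a2 in product(nonzero_nilpotent(p), repeat=2):
        if not any(
            mat_mul(mat_mul(g, a1, p), g_inv, p) == E12
            and mat_mul(mat_mul(g, a2, p), g_inv, p) in targets
            for g, g_inv in group
        ):
            return False
    return True

print([lemma_3_1_holds(p) for p in (2, 3)])
```

The search conjugates by every element of ${GL}_{2}$ rather than only by the upper triangular matrices g used in the proof, so it tests the statement of the lemma and not the particular choice of g.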

For $β, γ, δ \in 𝔽$ denote

$$D (β, γ, δ) = \begin{pmatrix} β & γ \\ δ & - β \end{pmatrix}.$$

Remark 3.2. Assume that $\underset{̲}{A}, {\underset{̲}{A}}^{'} \in 𝒩_{2}^{3}$ satisfy $A_{1} = A_{1}^{'} = E_{12}$ , $A_{2} = α E_{21}$ , and $A_{2}^{'} = α^{'} E_{21}$ for some $α, α^{'} \in 𝔽^{\times}$ . Assume that $t_{i j} : = tr (A_{i} A_{j}) - tr (A_{i}^{'} A_{j}^{'}) = 0$ for all $1 \leq i < j \leq 3$ and $t_{123} : = tr (A_{1} A_{2} A_{3}) - tr (A_{1}^{'} A_{2}^{'} A_{3}^{'}) = 0$ . Then $\underset{̲}{A} = {\underset{̲}{A}}^{'}$ .

Proof. Denote $A_{3} = D (β, γ, δ)$ and $A_{3}^{'} = D (β^{'}, γ^{'}, δ^{'})$ for some $β, γ, δ, β^{'}, γ^{'}, δ^{'}$ from $𝔽$. Since $t_{12} = 0$, we have $α = α^{'}$. Applying the equalities $t_{13} = 0$, $t_{23} = 0$, $t_{123} = 0$ in turn, we obtain that $δ = δ^{'}$, $α γ = α γ^{'}$, $α β = α β^{'}$. Since $α \neq 0$, it follows that $A_{3} = A_{3}^{'}$. □
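For reference, the traces used in this proof can be computed directly from $A_{1} = E_{12}$, $A_{2} = α E_{21}$ and $A_{3} = D (β, γ, δ)$. The following is only a worked expansion of the step above and introduces no new assumptions:

```latex
\begin{align*}
A_{1} A_{2} &= \alpha E_{11}, & \operatorname{tr}(A_{1} A_{2}) &= \alpha, \\
A_{1} A_{3} &= \begin{pmatrix} \delta & -\beta \\ 0 & 0 \end{pmatrix}, & \operatorname{tr}(A_{1} A_{3}) &= \delta, \\
A_{2} A_{3} &= \alpha \begin{pmatrix} 0 & 0 \\ \beta & \gamma \end{pmatrix}, & \operatorname{tr}(A_{2} A_{3}) &= \alpha \gamma, \\
A_{1} A_{2} A_{3} &= \alpha \begin{pmatrix} \beta & \gamma \\ 0 & 0 \end{pmatrix}, & \operatorname{tr}(A_{1} A_{2} A_{3}) &= \alpha \beta.
\end{align*}
```

The same formulas with primed symbols hold for ${\underset{̲}{A}}^{'}$, so the four conditions $t_{12} = t_{13} = t_{23} = t_{123} = 0$ compare $α$, $δ$, $α γ$ and $α β$ with their primed counterparts.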

Lemma 3.3. If $\underset{̲}{A} \in 𝒩_{2}^{m}$ has no zeros, then $\underset{̲}{A}$ is similar to one and only one of the following elements:

(a)	$(E_{12}, α_{2} E_{12}, \dots, α_{m} E_{12})$ , where $α_{2}, \dots, α_{m} \in 𝔽^{\times}$ ;
(b)	$(E_{12}, α_{2} E_{12}, \dots, α_{r} E_{12}, α_{r + 1} E_{21}, D_{1}, \dots, D_{s})$ , where $1 \leq r \leq m - 1$ , $s = m - r - 1$ , $α_{2}, \dots, α_{r + 1} \in 𝔽^{\times}$ , $D_{i} \in 𝒩_{2}$ is non-zero for every i.

Proof. In case $m = 1$, we have $A_{1} \sim E_{12}$ and the claim is proven.

Assume $m \geq 2$ . By Lemma 3.1 either $(A_{1}, A_{2}) \sim (E_{12}, α_{2} E_{12})$ or $(A_{1}, A_{2}) \sim (E_{12}, α_{2} E_{21})$ for some $α_{2} \in 𝔽^{\times}$ . In the first case we have $A_{1} = A_{2} ∕ α_{2}$ and applying Lemma 3.1 to the pair $(A_{1}, A_{3})$ we can see that either $(A_{1}, A_{2}, A_{3}) \sim (E_{12}, α_{2} E_{12}, α_{3} E_{12})$ or $(A_{1}, A_{2}, A_{3}) \sim (E_{12}, α_{2} E_{12}, α_{3} E_{21})$ for some $α_{3} \in 𝔽^{\times}$ . Repeating this procedure, we finally obtain that either case (a) or (b) holds.

To prove uniqueness, consider some $\underset{̲}{A}, {\underset{̲}{A}}^{'} \in 𝒩_{2}^{m}$ of type (a) or (b) satisfying the condition $\underset{̲}{A} \sim {\underset{̲}{A}}^{'}$. Note that $\underset{̲}{A}$ has type (a) if and only if for every $2 \leq i \leq m$ there exists $α_{i} \in 𝔽^{\times}$ such that $A_{1} = A_{i} / α_{i}$. Since this property of $\underset{̲}{A}$ is not changed by the action of ${GL}_{2}$, we have one of the following two cases.

(1) Both $\underset{̲}{A}$ and ${\underset{̲}{A}}^{'}$ have type (a), i.e.,

$$\underset{̲}{A} = (E_{12}, α_{2} E_{12}, \dots, α_{m} E_{12}) \quad \text{and} \quad {\underset{̲}{A}}^{'} = (E_{12}, α_{2}^{'} E_{12}, \dots, α_{m}^{'} E_{12}),$$

where $α_{2}, \dots, α_{m}, α_{2}^{'}, \dots, α_{m}^{'} \in 𝔽^{\times}$. Consider $g \in {GL}_{2}$ such that $g \cdot \underset{̲}{A} = {\underset{̲}{A}}^{'}$. Since $g \cdot A_{i} = A_{i}^{'}$ for all i, we obtain that $g \cdot E_{12} = E_{12}$ and $α_{i} = α_{i}^{'}$ for $2 \leq i \leq m$. Therefore, $\underset{̲}{A} = {\underset{̲}{A}}^{'}$.

(2) Both $\underset{̲}{A}$ and ${\underset{̲}{A}}^{'}$ have type (b), i.e.,

$$\underset{̲}{A} = (E_{12}, α_{2} E_{12}, \dots, α_{r} E_{12}, α_{r + 1} E_{21}, D_{1}, \dots, D_{s}) \quad \text{and} \quad {\underset{̲}{A}}^{'} = (E_{12}, α_{2}^{'} E_{12}, \dots, α_{r^{'}}^{'} E_{12}, α_{r^{'} + 1}^{'} E_{21}, D_{1}^{'}, \dots, D_{s^{'}}^{'}),$$

where $r, r^{'} \geq 1$, $s, s^{'} \geq 0$, $α_{2}, \dots, α_{r + 1}, α_{2}^{'}, \dots, α_{r^{'} + 1}^{'} \in 𝔽^{\times}$, and $D_{i}, D_{i}^{'} \in 𝒩_{2}$ are non-zero for every i. By the definition of similarity we have that $t_{i j} := \operatorname{tr} (A_{i} A_{j}) - \operatorname{tr} (A_{i}^{'} A_{j}^{'}) = 0$ and $t_{i j k} := \operatorname{tr} (A_{i} A_{j} A_{k}) - \operatorname{tr} (A_{i}^{'} A_{j}^{'} A_{k}^{'}) = 0$ for all $1 \leq i, j, k \leq m$. Since

$$\begin{aligned} \min \{2 \leq v \leq m \mid \operatorname{tr} (A_{v - 1} A_{v}) \neq 0\} &= r + 1, \\ \min \{2 \leq v \leq m \mid \operatorname{tr} (A_{v - 1}^{'} A_{v}^{'}) \neq 0\} &= r^{'} + 1, \end{aligned}$$

we obtain that $r = r^{'}$ and $s = s^{'}$. Consider $1 \leq i \leq s$. Applying Remark 3.2 to the pair of triples $(A_{1}, A_{r + 1}, A_{r + i + 1})$ and $(A_{1}^{'}, A_{r + 1}^{'}, A_{r + i + 1}^{'})$, we obtain that $α_{r + 1} = α_{r + 1}^{'}$ and $A_{r + i + 1} = A_{r + i + 1}^{'}$. In case $r \geq 2$ we use $t_{j, r + 1} = 0$ to obtain that $α_{j} = α_{j}^{'}$ for all $2 \leq j \leq r$. Therefore, $\underset{̲}{A} = {\underset{̲}{A}}^{'}$. □

Lemma 3.3 implies the following description of ${GL}_{2}$ -orbits on $𝒩_{2}^{m}$ .

Theorem 3.4. Each ${GL}_{2}$ -orbit on $𝒩_{2}^{m}$ contains one and only one element of the following types:

(0)	$(0, \dots, 0)$;

(1)	$(α_{1} E_{12}, \dots, α_{m} E_{12})$, where

	•	at least one element from the list $α_{1}, \dots, α_{m} \in 𝔽$ is non-zero,
	•	$α_{v} = 1$ for $v = \min \{1 \leq i \leq m \mid α_{i} \neq 0\}$;

(2)	$(α_{1} E_{12}, \dots, α_{r} E_{12}, α_{r + 1} E_{21}, D_{1}, \dots, D_{s})$, where

	•	$1 \leq r \leq m - 1$, $s = m - r - 1$,
	•	at least one element from the set $\{α_{1}, \dots, α_{r}\}$ is non-zero and $α_{r + 1} \in 𝔽^{\times}$,
	•	$α_{v} = 1$ for $v = \min \{1 \leq i \leq r \mid α_{i} \neq 0\}$,
	•	$D_{i} \in 𝒩_{2}$ for every $1 \leq i \leq s$.
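For small prime fields the classification in Theorem 3.4 can be checked exhaustively. The sketch below lists the tuples of types (0), (1), (2), computes their orbits under simultaneous conjugation, and tests that these orbits are pairwise disjoint and cover all of $𝒩_{2}^{m}$. It assumes $𝔽 = 𝔽_{p}$ with p prime and uses hypothetical helper names; it illustrates the statement and is not part of the paper's proof.

```python
from itertools import product

ZERO = (0, 0, 0, 0)  # matrices are stored row-wise as (a11, a12, a21, a22)

def mat_mul(a, b, p):
    """Product of two 2x2 matrices over F_p."""
    return (
        (a[0] * b[0] + a[1] * b[2]) % p,
        (a[0] * b[1] + a[1] * b[3]) % p,
        (a[2] * b[0] + a[3] * b[2]) % p,
        (a[2] * b[1] + a[3] * b[3]) % p,
    )

def gl2_with_inverses(p):
    """All pairs (g, g^{-1}) with g in GL_2(F_p), p prime."""
    pairs = []
    for g in product(range(p), repeat=4):
        det = (g[0] * g[3] - g[1] * g[2]) % p
        if det:
            d = pow(det, -1, p)  # modular inverse, needs Python 3.8+
            pairs.append((g, (g[3] * d % p, -g[1] * d % p,
                              -g[2] * d % p, g[0] * d % p)))
    return pairs

def nilpotent(p):
    """All 2x2 matrices A over F_p with A^2 = 0 (the zero matrix included)."""
    return [a for a in product(range(p), repeat=4) if mat_mul(a, a, p) == ZERO]

def normalized_vectors(r, p):
    """Non-zero vectors in F_p^r whose first non-zero coordinate equals 1."""
    return [v for v in product(range(p), repeat=r)
            if any(v) and next(x for x in v if x) == 1]

def normal_forms(m, p):
    """Tuples of types (0), (1), (2) from Theorem 3.4 over F_p."""
    nil = nilpotent(p)
    forms = [(ZERO,) * m]
    forms += [tuple((0, a, 0, 0) for a in v) for v in normalized_vectors(m, p)]
    for r in range(1, m):
        for v in normalized_vectors(r, p):
            head = tuple((0, a, 0, 0) for a in v)
            for alpha in range(1, p):
                for tail in product(nil, repeat=m - r - 1):
                    forms.append(head + ((0, 0, alpha, 0),) + tail)
    return forms

def orbit(tup, group, p):
    """Orbit of an m-tuple under simultaneous conjugation."""
    return frozenset(
        tuple(mat_mul(mat_mul(g, a, p), g_inv, p) for a in tup)
        for g, g_inv in group
    )

def theorem_3_4_holds(m, p):
    """True if the orbits of the normal forms are disjoint and cover N_2^m."""
    group = gl2_with_inverses(p)
    covered = set()
    for form in normal_forms(m, p):
        orb = orbit(form, group, p)
        if not covered.isdisjoint(orb):
            return False  # two normal forms would lie in one orbit
        covered |= orb
    return len(covered) == len(nilpotent(p)) ** m

print(theorem_3_4_holds(2, 3), len(normal_forms(2, 3)))
```

The length of `normal_forms(m, p)` is then the number of orbits for the chosen m and p.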

For every integer k denote

$$S_{k} (q) = \begin{cases} q^{k} + q^{k - 1} + \cdots + 1 = \dfrac{q^{k + 1} - 1}{q - 1}, & \text{if } k \geq 0, \\ 0, & \text{if } k < 0. \end{cases}$$

Corollary 3.5. Assume $𝔽 = 𝔽_{q}$ and $m \geq 1$ . Then

(a)	the number of ${GL}_{2}$ -orbits on $𝒩_{2}^{m}$ is equal to

$$κ = 1 + \frac{(q^{m} - 1) (q^{m - 1} + q)}{q^{2} - 1};$$

(b)	$κ$ has the following presentation as a polynomial in q with non-negative integer coefficients:

$$κ = κ (q) = \begin{cases} 1 + S_{m - 1} (q) + (q^{m - 1} - 1) S_{\frac{m - 2}{2}} (q^{2}) & \text{if } m \text{ is even}, \\ 1 + S_{m - 1} (q) + (q^{m} - 1) S_{\frac{m - 3}{2}} (q^{2}) & \text{if } m \text{ is odd}; \end{cases}$$

(c)	the least possible number of elements for a separating set for $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ is

$$γ = \begin{cases} 1 & \text{if } m = 1, \\ 3 & \text{if } m = q = 2, \\ 2 m - 2 & \text{otherwise.} \end{cases}$$

Proof. (a) Denote by $κ_{0} (m)$, $κ_{1} (m)$, $κ_{2} (m)$ the number of ${GL}_{2}$ -orbits on $𝒩_{2}^{m}$ of type (0), (1), (2), respectively (see Theorem 3.4). Then $κ_{0} (m) = 1$, $κ_{1} (m) = S_{m - 1} (q)$, and

$$κ_{2} (m) = \sum_{r = 1}^{m - 1} κ_{1} (r) (q - 1) | 𝒩_{2} |^{m - r - 1} = \sum_{r = 1}^{m - 1} (q^{r} - 1) q^{2 m - 2 r - 2},$$

where we used the equality $| 𝒩_{n} | = q^{n (n - 1)}$, which was proven by Fine and Herstein [22]. Hence,

$$κ_{2} (m) = q^{2 m - 2} \left( \sum_{r = 0}^{m - 1} q^{- r} - \sum_{r = 0}^{m - 1} q^{- 2 r} \right) = q^{2 m - 2} \left( S_{m - 1} (q^{- 1}) - S_{m - 1} (q^{- 2}) \right),$$

and we obtain

$$κ_{2} (m) = \frac{(q^{m} - 1) (q^{m - 1} - 1)}{q^{2} - 1}.$$

Therefore,

$$κ = κ_{0} (m) + κ_{1} (m) + κ_{2} (m) = 1 + S_{m - 1} (q) + \frac{(q^{m} - 1) (q^{m - 1} - 1)}{q^{2} - 1}, \tag{7}$$

and the claim of part (a) easily follows.
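The two steps summarized by "we obtain" and "easily follows" can be expanded as below. This is a routine calculation that uses only the definition of $S_{k}$ and formula (7); it is included as a worked check.

```latex
\begin{align*}
q^{2m-2} S_{m-1}(q^{-1}) &= q^{2m-2} \cdot \frac{1 - q^{-m}}{1 - q^{-1}}
  = \frac{q^{m-1} (q^{m} - 1)}{q - 1}, \\
q^{2m-2} S_{m-1}(q^{-2}) &= q^{2m-2} \cdot \frac{1 - q^{-2m}}{1 - q^{-2}}
  = \frac{q^{2m} - 1}{q^{2} - 1}, \\
\kappa_{2}(m) &= \frac{(q^{m} - 1) \bigl( q^{m-1} (q + 1) - (q^{m} + 1) \bigr)}{q^{2} - 1}
  = \frac{(q^{m} - 1)(q^{m-1} - 1)}{q^{2} - 1}, \\
\kappa &= 1 + \frac{q^{m} - 1}{q - 1} + \frac{(q^{m} - 1)(q^{m-1} - 1)}{q^{2} - 1}
  = 1 + \frac{(q^{m} - 1) \bigl( (q + 1) + (q^{m-1} - 1) \bigr)}{q^{2} - 1}
  = 1 + \frac{(q^{m} - 1)(q^{m-1} + q)}{q^{2} - 1}.
\end{align*}
```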

(b) If $m = 1$ , then $κ = 2$ and part (b) holds. Assume $m \geq 2$ . Considering the case of even m and the case of odd m, applying formula (7), we prove the claim of part (b).

(c) By Theorem 2.1, the least possible number of elements for a separating set for $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ is $γ = ⌈ {log}_{q} (κ) ⌉$ .

If $m = 1$ , then $κ = 2$ and $γ = 1$ .

If $m = q = 2$ , then $κ = 5$ and $γ = 3$ .

Assume $m \geq 2$ and $m, q$ are not simultaneously equal to 2. By part (a), the claim that $γ = 2 m - 2$ is equivalent to the inequalities

$$q^{2 m - 3} < 1 + \frac{(q^{m} - 1) (q^{m - 1} + q)}{q^{2} - 1} \leq q^{2 m - 2}.$$

The left inequality follows from

$$q^{2 m - 3} < \frac{(q^{m} - 1) (q^{m - 1} + q)}{q^{2} - 1}.$$

The right inequality is equivalent to

$$(q^{2 m} - q^{2 m - 1} - q^{2 m - 2} - q^{m + 1}) + (q^{m - 1} - q^{2}) + q + 1 \geq 0. \tag{8}$$

In case $m = 2$ and $q \geq 3$ , inequality (8) is equivalent to $q^{2} (q^{2} - 2 q - 2) + 2 q + 1 \geq 0$ , which holds since $q^{2} - 2 q - 2 \geq 0$ for $q \geq 3$ .

In case $m \geq 3$ we have $q^{2 m} = q^{2 m - 2} q^{2} \geq q^{2 m - 2} q + q^{2 m - 2} + q^{2 m - 2} \geq q^{2 m - 1} + q^{2 m - 2} + q^{m + 1}$ , since $q^{2} \geq q + 2$ for $q \geq 2$ ; inequality (8) follows. □
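Part (c) can also be checked numerically for a range of prime powers q and values of m. The sketch below computes $κ$ from part (a), takes $⌈ {log}_{q} (κ) ⌉$ with integer arithmetic only, and compares the result with the piecewise formula for $γ$. The function names are hypothetical, and the check covers only the finite range tested, so it complements the proof rather than replacing it.

```python
def kappa(q, m):
    """Number of orbits from Corollary 3.5(a); the division is exact."""
    numerator = (q**m - 1) * (q**(m - 1) + q)
    assert numerator % (q * q - 1) == 0
    return 1 + numerator // (q * q - 1)

def ceil_log(q, n):
    """Smallest integer g >= 0 with q**g >= n, i.e. the ceiling of log_q(n)."""
    g, power = 0, 1
    while power < n:
        power *= q
        g += 1
    return g

def gamma_piecewise(q, m):
    """Piecewise value of gamma stated in Corollary 3.5(c)."""
    if m == 1:
        return 1
    if m == 2 and q == 2:
        return 3
    return 2 * m - 2

def part_c_holds(q_values, max_m):
    """Compare ceil(log_q(kappa)) with the piecewise formula on a finite range."""
    return all(
        ceil_log(q, kappa(q, m)) == gamma_piecewise(q, m)
        for q in q_values
        for m in range(1, max_m + 1)
    )

print(part_c_holds([2, 3, 4, 5, 7, 8, 9, 11, 13, 16], 12))
```

Floating-point logarithms are avoided on purpose, since $κ$ lies close to a power of q and rounding could change the ceiling.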

Corollary 3.5 implies the following remark:

Remark 3.6. Assume $𝔽 = 𝔽_{q}$ . Then

•	for $m = 1$ we have $κ = 2$ ;
•	for $m = 2$ we have $κ = 2 q + 1$ ;
•	for $m = 3$ we have $κ = q^{3} + q^{2} + q + 1$ ;
•	for $m = 4$ we have $κ = q^{5} + 2 q^{3} + q + 1$ ;
•	for $m = 5$ we have $κ = q^{7} + q^{5} + q^{4} + q^{3} + q + 1$ .
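The values listed in Remark 3.6 can be reproduced from Corollary 3.5. The sketch below evaluates $κ$ through the closed formula of part (a) and through the presentation of part (b), and compares both with the polynomials of the remark for several q. The helper names are hypothetical.

```python
def S(k, q):
    """S_k(q) = q^k + ... + q + 1 for k >= 0 and 0 for k < 0."""
    return sum(q**i for i in range(k + 1)) if k >= 0 else 0

def kappa_a(q, m):
    """Closed formula of Corollary 3.5(a); the division is exact."""
    return 1 + (q**m - 1) * (q**(m - 1) + q) // (q * q - 1)

def kappa_b(q, m):
    """Presentation of Corollary 3.5(b), split by the parity of m."""
    if m % 2 == 0:
        return 1 + S(m - 1, q) + (q**(m - 1) - 1) * S((m - 2) // 2, q * q)
    return 1 + S(m - 1, q) + (q**m - 1) * S((m - 3) // 2, q * q)

# Polynomials listed in Remark 3.6, indexed by m.
REMARK_3_6 = {
    1: lambda q: 2,
    2: lambda q: 2 * q + 1,
    3: lambda q: q**3 + q**2 + q + 1,
    4: lambda q: q**5 + 2 * q**3 + q + 1,
    5: lambda q: q**7 + q**5 + q**4 + q**3 + q + 1,
}

def remark_3_6_holds(q_values):
    """Check that parts (a), (b) and the listed polynomials agree."""
    return all(
        kappa_a(q, m) == poly(q) == kappa_b(q, m)
        for q in q_values
        for m, poly in REMARK_3_6.items()
    )

print(remark_3_6_holds([2, 3, 4, 5, 7, 8, 9]))
```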

Corollary 3.5 implies that the following conjecture holds for $n = 2$ and $m \geq 1$ .

Conjecture 3.7. Assume $𝔽 = 𝔽_{q}$ . If we fix $n \geq 2$ and $m \geq 1$ , then the number of ${GL}_{n}$ -orbits on $𝒩_{n}^{m}$ is a polynomial in q with integer coefficients.

It was proven by Hua [24] that the number of ${GL}_{n}$ -orbits on $𝒩_{n}^{m}$ is a polynomial in q with rational coefficients. Hua also conjectured that the number of absolutely indecomposable ${GL}_{n}$ -orbits on $𝒩_{n}^{m}$ is a polynomial in q with non-negative integer coefficients (see [24, Conjecture 4.1]). This conjecture was shown to be true for $m = 2$ and $1 \leq n \leq 6$ .

4. Separating set for $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$

If $𝔽 = 𝔽_{q}$ and $A = (\begin{matrix} a_{1} & a_{2} \\ a_{3} & a_{4} \end{matrix})$ lies in $𝒩_{2}$ , then we have that $ζ (A) = a_{1}^{q - 1} - a_{2}^{q - 1} - a_{3}^{q - 1} + 1$ , since $a_{1}^{2} = - a_{2} a_{3}$ . In particular,

$$ζ (Y_{i}) = y_{11} (i)^{q - 1} - y_{12} (i)^{q - 1} - y_{21} (i)^{q - 1} + 1. \tag{9}$$

Similarly, we can see that

$$η_{α} (Y_{i}, Y_{j}) = \prod_{u, v \in \{1, 2\}} \left( (α \, y_{u v} (i) - y_{u v} (j))^{q - 1} - 1 \right). \tag{10}$$
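The right-hand sides of (9) and (10) can be evaluated on all nilpotent matrices over a small prime field. The sketch below does this for $q = p$ prime and confirms that the expression for $ζ$ equals 1 at the zero matrix and 0 at every other nilpotent matrix, and that the product in (10) equals 1 exactly when $α A = B$. Reading $ζ$ and $η_{α}$ as indicator functions in this way is an assumption here, since their definitions appear earlier in the paper; the function names are hypothetical.

```python
from itertools import product

def nilpotent_matrices(p):
    """2x2 matrices (a1, a2, a3, a4) over F_p with zero trace and determinant."""
    return [a for a in product(range(p), repeat=4)
            if (a[0] + a[3]) % p == 0 and (a[0] * a[3] - a[1] * a[2]) % p == 0]

def zeta(a, p):
    """Right-hand side of the displayed formula for zeta(A), with q = p."""
    a1, a2, a3, _ = a
    return (pow(a1, p - 1, p) - pow(a2, p - 1, p) - pow(a3, p - 1, p) + 1) % p

def eta(alpha, a, b, p):
    """Right-hand side of formula (10), with q = p."""
    value = 1
    for x, y in zip(a, b):
        value = value * (pow((alpha * x - y) % p, p - 1, p) - 1) % p
    return value

def formulas_behave_as_indicators(p):
    """Compare zeta and eta with the indicator functions they are assumed to be."""
    nil = nilpotent_matrices(p)
    zeta_ok = all(zeta(a, p) == (0 if any(a) else 1) for a in nil)
    eta_ok = all(
        eta(alpha, a, b, p)
        == (1 if all((alpha * x - y) % p == 0 for x, y in zip(a, b)) else 0)
        for alpha in range(1, p)
        for a in nil
        for b in nil
    )
    return zeta_ok and eta_ok

print([formulas_behave_as_indicators(p) for p in (2, 3, 5)])
```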

Remark 4.1. Let H be one of the following sets: $S_{2, m}$ , $S_{2, m}^{(2)}$ , $H_{2, m}$ , $H_{2, m}^{(2)}$ . Assume that $\underset{̲}{A}, \underset{̲}{B} \in 𝒩_{2}^{m}$ are not separated by H. Then

(a)	for any $σ \in 𝒮_{m}$ we have that ${\underset{̲}{A}}_{σ}, {\underset{̲}{B}}_{σ}$ are not separated by H;
(b)	for any ${\underset{̲}{A}}^{'}, {\underset{̲}{B}}^{'} \in 𝒩_{2}^{m}$ with $\underset{̲}{A} \sim {\underset{̲}{A}}^{'}$ and $\underset{̲}{B} \sim {\underset{̲}{B}}^{'}$ we have that ${\underset{̲}{A}}^{'}, {\underset{̲}{B}}^{'}$ are not separated by H.

Proof. Consider some $A_{1}, A_{2}, A_{3}$ from $𝒩_{2}$ . It is easy to see that $η_{α} (A_{1}, A_{2}) = η_{α^{- 1}} (A_{2}, A_{1})$ for all $α \in 𝔽^{\times}$ . Moreover, the Cayley–Hamilton theorem implies that

$$\operatorname{tr} (A_{1} A_{2} A_{3}) = - \operatorname{tr} (A_{1} A_{3} A_{2}). \tag{11}$$

Thus, part (a) is proven. Part (b) is trivial. □
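One way to see how the Cayley–Hamilton theorem yields (11) is sketched below. It is a standard argument and is not claimed to be the author's exact route. For a $2 \times 2$ matrix X with zero trace the theorem gives $X^{2} = - \det (X) I$; this is applied to $X = A_{2} + A_{3}$, which has zero trace but need not be nilpotent.

```latex
\begin{align*}
(A_{2} + A_{3})^{2} &= -\det(A_{2} + A_{3}) \, I, \\
A_{2} A_{3} + A_{3} A_{2} &= -\det(A_{2} + A_{3}) \, I
  \qquad \text{(since } A_{2}^{2} = A_{3}^{2} = 0 \text{)}, \\
\operatorname{tr}(A_{1} A_{2} A_{3}) + \operatorname{tr}(A_{1} A_{3} A_{2})
  &= -\det(A_{2} + A_{3}) \operatorname{tr}(A_{1}) = 0.
\end{align*}
```

The last line is obtained by multiplying the second line by $A_{1}$ on the left and taking traces, using $\operatorname{tr} (A_{1}) = 0$.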

Lemma 4.2. Assume that $\underset{̲}{A}, \underset{̲}{B} \in 𝒩_{2}^{m}$ are not separated by $S_{2, m}$ . Then there exists a permutation $σ \in 𝒮_{m}$ such that one of the next cases holds:

(a)

where $k, l, r, s \geq 0$, $k + l + r + s = m$, $α_{1}, \dots, α_{r + s}, β_{1}, \dots, β_{l + s} \in 𝔽^{\times}$; moreover, $α_{1} = 1$ in case $\underset{̲}{A} \neq \underset{̲}{0}$ and $β_{1} = 1$ in case $\underset{̲}{B} \neq \underset{̲}{0}$;

(b)	${\underset{̲}{A}}_{σ} \sim (0^{(k)}, E_{12}, α_{2} E_{12}, \dots, α_{r} E_{12}, α_{r + 1} E_{21}, D_{1}, \dots, D_{s})$ and $\underset{̲}{B} \sim \underset{̲}{A}$, where $r \geq 1$, $k, s \geq 0$, $k + r + s + 1 = m$, $α_{2}, \dots, α_{r + 1} \in 𝔽^{\times}$, and $D_{i} \in 𝒩_{2}$ is non-zero for every i.

Proof. We use Remark 4.1 without reference to it. Applying Lemma 3.3 to $\underset{̲}{A}$ together with $𝒮_{m}$ -action on $\underset{̲}{A}$ we can see that one of the next two cases holds.

(1) ${\underset{̲}{A}}_{σ} \sim (0^{(k)}, α_{1} E_{12}, \dots, α_{r} E_{12})$ , where $k, r \geq 0$ , $k + r = m$ , and $α_{1}, \dots, α_{r} \in 𝔽^{\times}$ with $α_{1} = 1$ in case $\underset{̲}{A} \neq \underset{̲}{0}$ . Applying Lemma 3.3 to ${\underset{̲}{B}}_{σ}$ together with $𝒮_{m}$ -action on $({\underset{̲}{A}}_{σ}, {\underset{̲}{B}}_{σ})$ we can see that one of the next two cases holds:

(1a)	there is $τ \in 𝒮_{m}$ such that $({\underset{̲}{A}}_{τ}, {\underset{̲}{B}}_{τ})$ satisfies case (a), possibly without the condition that $β_{1} = 1$ in case $\underset{̲}{B} \neq \underset{̲}{0}$. If this condition fails, we act by $(\begin{matrix} β_{1}^{- 1} & 0 \\ 0 & 1 \end{matrix}) \in {GL}_{2}$ on ${\underset{̲}{B}}_{τ}$ to obtain case (a).
(1b)	${\underset{̲}{A}}_{σ}$ is the same as above and ${\underset{̲}{B}}_{σ} \sim \underset{̲}{C}$ , where for some $i < j$ we have $C_{i} = E_{12}$ and $C_{j} = α E_{21}$ for $α \in 𝔽^{\times}$ . In this case, we have $tr (A_{σ (i)} A_{σ (j)}) = tr (C_{i} C_{j})$ ; hence, $α = 0$ , a contradiction.

(2) ${\underset{̲}{A}}_{σ} \sim (0^{(k)}, α_{1} E_{12}, \dots, α_{r} E_{12}, α_{r + 1} E_{21}, D_{1}, \dots, D_{s})$ , where $r \geq 1$ , $k, s \geq 0$ , $α_{1} = 1$ , $α_{2}, \dots, α_{r + 1} \in 𝔽^{\times}$ , and $D_{i} \in 𝒩_{2}$ is non-zero for every i. Applying Lemma 3.3 to ${\underset{̲}{B}}_{σ}$ we can see that one of the next two cases holds:

(2a)	$w z ({\underset{̲}{B}}_{σ}) \sim (β_{1} E_{12}, \dots, β_{l} E_{12})$ for some $l \geq 0$ and $β_{1}, \dots, β_{l} \in 𝔽^{\times}$, where $β_{1} = 1$ in case $\underset{̲}{B} \neq \underset{̲}{0}$. Then $B_{i} B_{j} = 0$ for all $1 \leq i, j \leq m$. On the other hand, $tr (A_{σ (k + r)} A_{σ (k + r + 1)}) = α_{r} α_{r + 1}$ is non-zero; a contradiction.

(2b)

$w z ({\underset{̲}{B}}_{σ}) \sim (β_{1} E_{12}, \dots, β_{r^{'}} E_{12}, β_{r^{'} + 1} E_{21}, D_{1}^{'}, \dots, D_{s^{'}}^{'})$ , where $r^{'} \geq 1$ , $s^{'} \geq 0$ , $β_{1} = 1$ , $β_{2}, \dots, β_{r^{'} + 1} \in 𝔽^{\times}$ , $D_{i}^{'} \in 𝒩_{2}$ is non-zero for every $1 \leq i \leq s^{'}$ . Therefore,

$${\underset{̲}{B}}_{σ} \sim (\dots, \underset{\hat{i}}{β_{r^{'}} E_{12}}, 0, \dots, 0, \underset{\hat{j}}{β_{r^{'} + 1} E_{21}}, \dots),$$

where $β_{r^{'}} E_{12}$ and $β_{r^{'} + 1} E_{21}$ stand in positions i and j, respectively, for some $1 \leq i < j \leq m$. Since

the conditions of the lemma imply that $j = k + r + 1$. For $1 \leq u \leq k + r$, we have

$$tr (B_{σ (u)} B_{σ (k + r + 1)}) = tr (A_{σ (u)} A_{σ (k + r + 1)}) = \begin{cases} α_{u - k} α_{r + 1} & \text{if } u > k, \\ 0 & \text{if } u \leq k. \end{cases} \tag{12}$$

Therefore, ${\underset{̲}{B}}_{σ} \sim (0^{(k)}, β_{1} E_{12}, \dots, β_{r} E_{12}, β_{r + 1} E_{21}, D_{1}^{″}, \dots, D_{s}^{″})$, where some of the matrices $D_{1}^{″}, \dots, D_{s}^{″} \in 𝒩_{2}$ can be zero. In particular, $r = r^{'}$. Since $α_{1} = β_{1} = 1$, equality (12) implies that $α_{r + 1} = β_{r + 1}$; therefore, $α_{i} = β_{i}$ for all $2 \leq i \leq r$. Applying Remark 3.2 to the pair of triples $(A_{σ (k + r)}, A_{σ (k + r + 1)}, A_{σ (k + r + 1 + u)})$ and $(B_{σ (k + r)}, B_{σ (k + r + 1)}, B_{σ (k + r + 1 + u)})$, where $1 \leq u \leq s$, we obtain that $D_{u} = D_{u}^{'}$. Therefore, ${\underset{̲}{A}}_{σ} \sim {\underset{̲}{B}}_{σ}$ and case (b) of the formulation of this lemma holds.

□

Remark 4.3. For $\underset{̲}{A} \in 𝒩_{2}^{2}$ , we have

Proof. Since $η_{α} (A_{1}, A_{2})$ and $tr (A_{1} A_{2})$ are constant on the ${GL}_{2}$-orbits on $𝒩_{2}^{2}$, by Lemma 3.1 we can assume that $\underset{̲}{A}$ lies in the following list: $(0, 0)$, $(0, E_{12})$, $(E_{12}, 0)$, $(E_{12}, γ E_{12})$, $(E_{12}, γ E_{21})$, where $γ \in 𝔽^{\times}$. The required statement follows from a case-by-case consideration. □

Lemma 4.4. Assume that $char 𝔽 = 2$ and $\underset{̲}{A}, \underset{̲}{B} \in 𝒩_{2}^{3}$ are not separated by $S_{3}^{(2)} = \{tr (Y_{1} Y_{2}), tr (Y_{1} Y_{3}), tr (Y_{2} Y_{3})\}$. Then $tr (A_{1} A_{2} A_{3}) = tr (B_{1} B_{2} B_{3})$.

Proof. If $A_{i} = B_{j} = 0$ for some $1 \leq i, j \leq 3$ , then $tr (A_{1} A_{2} A_{3}) = tr (B_{1} B_{2} B_{3}) = 0$ . Therefore, without loss of generality, we can assume that $A_{i}$ is non-zero for all i.

(1) Assume that $B_{j} = 0$ for some j. By formula (11), without loss of generality, we can assume that $B_{1} = 0$ . Applying Lemma 3.3 to $\underset{̲}{A}$ , we obtain that one of the following cases holds:

where $α_{2}, α_{3} \in 𝔽^{\times}$ and $D \in 𝒩_{2}$ is non-zero. In cases (a) and (c), we have $tr (A_{1} A_{2} A_{3}) = 0$ and the required holds. In case (b), we have $α_{2} = tr (A_{1} A_{2}) = tr (B_{1} B_{2}) = 0$; a contradiction.

(2) Assume that $B_{j}$ is non-zero for all j. Applying Lemma 3.3 to $\underset{̲}{A}$ , we obtain that one of the above cases (a), (b), (c) holds for $\underset{̲}{A}$ . Applying Lemma 3.3 to $\underset{̲}{B}$ , we obtain that one of the following cases holds:

where $β_{2}, β_{3} \in 𝔽^{\times}$ and $D^{'} \in 𝒩_{2}$ is non-zero. Since $tr (A_{1} A_{2}) = tr (B_{1} B_{2})$ and $tr (A_{1} A_{3}) = tr (B_{1} B_{3})$, we have one of the following three cases:

•	Cases (a) and (a′) hold. Then $tr (A_{1} A_{2} A_{3}) = tr (B_{1} B_{2} B_{3}) = 0$.
•	Cases (b) and (b′) hold. Since $tr (A_{1} A_{2}) = tr (B_{1} B_{2})$, we obtain that $α_{2} = β_{2}$. Denote $D = D (a_{1}, a_{2}, a_{3})$ and $D^{'} = D (b_{1}, b_{2}, b_{3})$. The equalities $tr (A_{1} A_{3}) = tr (B_{1} B_{3})$ and $tr (A_{2} A_{3}) = tr (B_{2} B_{3})$ imply that $a_{3} = b_{3}$ and $a_{2} = b_{2}$, respectively. It follows from the equalities $det (D) = det (D^{'}) = 0$ that $a_{1} = b_{1}$, since $char 𝔽 = 2$. Therefore, $\underset{̲}{A} \sim \underset{̲}{B}$ and the required follows.
•	Cases (c) and (c′) hold. Then $tr (A_{1} A_{2} A_{3}) = tr (B_{1} B_{2} B_{3}) = 0$.

□
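Over the prime field $𝔽_{2}$ the statement of Lemma 4.4 can also be checked by exhaustive search. The following is a minimal sketch, not part of the original argument: it stores a matrix as a 4-tuple of residues modulo a prime p, and all function names are illustrative assumptions. It covers only prime fields $𝔽_{p}$, not general fields of characteristic 2.

```python
from itertools import product


def nilpotent_matrices(p):
    """All nilpotent 2x2 matrices over F_p, stored as (a, b, c, d) for [[a, b], [c, d]]."""
    return [
        (a, b, c, d)
        for a, b, c, d in product(range(p), repeat=4)
        # a 2x2 matrix is nilpotent iff its trace and determinant both vanish
        if (a + d) % p == 0 and (a * d - b * c) % p == 0
    ]


def mul(x, y, p):
    """Matrix product modulo p."""
    a, b, c, d = x
    e, f, g, h = y
    return ((a * e + b * g) % p, (a * f + b * h) % p,
            (c * e + d * g) % p, (c * f + d * h) % p)


def tr(x, p):
    return (x[0] + x[3]) % p


def pair_traces(t, p):
    """Values of tr(Y1 Y2), tr(Y1 Y3), tr(Y2 Y3) on a triple t."""
    a1, a2, a3 = t
    return (tr(mul(a1, a2, p), p), tr(mul(a1, a3, p), p), tr(mul(a2, a3, p), p))


def triple_trace(t, p):
    a1, a2, a3 = t
    return tr(mul(mul(a1, a2, p), a3, p), p)


def pairwise_traces_determine_triple_trace(p):
    """True if equal pairwise traces force equal tr(A1 A2 A3) on nilpotent triples over F_p."""
    seen = {}
    for t in product(nilpotent_matrices(p), repeat=3):
        key = pair_traces(t, p)
        value = triple_trace(t, p)
        if seen.setdefault(key, value) != value:
            return False
    return True


print(pairwise_traces_determine_triple_trace(2))  # the situation of Lemma 4.4
print(pairwise_traces_determine_triple_trace(3))  # odd characteristic, cf. Claim 4
```

The same search with p = 3 finds triples with equal pairwise traces but different triple traces, which agrees with the role of $tr (Y_{1} Y_{2} Y_{3})$ in Claim 4 of the proof of Theorem 4.5.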

Theorem 4.5. Assume that $𝔽 = 𝔽_{q}$ is finite. Then

•	$H_{2, m}^{(2)}$ , in case $char 𝔽 = 2$ ;
•	$H_{2, m}$ , in case $char 𝔽 > 2$ ;

is a minimal separating set for the algebra of invariant polynomial functions on nilpotent matrices $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ for all $m > 0$ .

Proof. Denote by H the set $H_{2, m}^{(2)}$ in case $char 𝔽 = 2$ and the set $H_{2, m}$ in case $char 𝔽 > 2$. Assume that $\underset{̲}{A}, \underset{̲}{B} \in 𝒩_{2}^{m}$ are not separated by H. To prove that H is separating it is enough to show that $\underset{̲}{A} \sim \underset{̲}{B}$. By Remarks 4.1, 4.3 and Lemma 4.4, we obtain in both cases that $\underset{̲}{A}, \underset{̲}{B}$ are not separated by

$$H_{2, m}^{'} = H_{2, m} \cup \{η_{α} (Y_{i}, Y_{j}), \quad 1 \leq i, j \leq m, \; α \in 𝔽^{\times}\}.$$

Applying Lemma 4.2 together with the fact that $A_{i} = 0$ if and only if $B_{i} = 0$ ($1 \leq i \leq m$), we obtain that one of the following two cases holds:

•	${\underset{̲}{A}}_{σ} \sim (0^{(k)}, E_{12}, α_{k + 2} E_{12}, \dots, α_{m} E_{12})$ , ${\underset{̲}{B}}_{σ} \sim (0^{(k)}, E_{12}, β_{k + 2} E_{12}, \dots, β_{m} E_{12})$ , where $σ \in 𝒮_{m}$ , $0 \leq k < m$ , $α_{k + 2}, \dots, α_{m}, β_{k + 2}, \dots, β_{m} \in 𝔽^{\times}$ ;
•	$\underset{̲}{A} \sim \underset{̲}{B}$ .

Assume that the first case holds. For every $k + 2 \leq i \leq m$ and every $α \in 𝔽^{\times}$ we have

$$A_{σ (k + 1)} - α A_{σ (i)} = 0 \quad \text{if and only if} \quad B_{σ (k + 1)} - α B_{σ (i)} = 0,$$

hence $α_{i} = β_{i}$. Therefore, ${\underset{̲}{A}}_{σ} \sim {\underset{̲}{B}}_{σ}$ and H is separating.

Claims 1–4 (see below) show that H is a minimal separating set for all $m > 0$ .

Claim 1. The set $H_{2, 1} ∖ \{ζ (Y_{1})\} = \emptyset$ is not separating for $𝒪 {(𝒩_{2})}^{{GL}_{2}}$.

To prove this claim it is enough to consider $\underset{̲}{A} = (0)$ and $\underset{̲}{B} = (E_{12})$.

Claim 2. Given $β \in 𝔽 ∖ \{0, 1\}$, the set $H_{2, 2} ∖ \{η_{β} (Y_{1}, Y_{2})\}$ is not separating for $𝒪 {(𝒩_{2}^{2})}^{{GL}_{2}}$.

To prove this claim consider $\underset{̲}{A} = (E_{12}, E_{12})$ and $\underset{̲}{B} = (E_{12}, β E_{12})$ from $𝒩_{2}^{2}$. Then $tr (A_{1} A_{2}) = tr (B_{1} B_{2}) = 0$ and $η_{α} (A_{1}, A_{2}) = η_{α} (B_{1}, B_{2}) = 0$ for all $α \in 𝔽 ∖ \{0, 1, β\}$, but $η_{β} (A_{1}, A_{2}) \neq η_{β} (B_{1}, B_{2})$.

Claim 3. The set $H_{2, 2} ∖ \{tr (Y_{1} Y_{2})\}$ is not separating for $𝒪 {(𝒩_{2}^{2})}^{{GL}_{2}}$.

To prove this claim consider $\underset{̲}{A} = (E_{12}, E_{12})$ and $\underset{̲}{B} = (E_{12}, E_{21})$ from $𝒩_{2}^{2}$. Then $η_{α} (A_{1}, A_{2}) = η_{α} (B_{1}, B_{2}) = 0$ for all $α \in 𝔽 ∖ \{0, 1\}$, but $tr (A_{1} A_{2}) \neq tr (B_{1} B_{2})$.
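The two traces compared in Claim 3 can be written out explicitly. The worked computation below uses only the matrix units $E_{12}$ and $E_{21}$ and is valid over every field; it is included as an illustration, not as part of the original proof.

```latex
\[
E_{12} E_{12}
  = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}
    \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}
  = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix},
\qquad
E_{12} E_{21}
  = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}
    \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}
  = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix},
\]
\[
\operatorname{tr}(A_{1} A_{2}) = \operatorname{tr}(E_{12} E_{12}) = 0
\neq 1 = \operatorname{tr}(E_{12} E_{21}) = \operatorname{tr}(B_{1} B_{2}).
\]
```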

Claim 4. Let $m = 3$ and $char 𝔽 \neq 2$. Then $H_{2, 3} ∖ \{tr (Y_{1} Y_{2} Y_{3})\}$ is not separating for $𝒪 {(𝒩_{2}^{3})}^{{GL}_{2}}$.

To prove this claim we consider $\underset{̲}{A} = (E_{12}, E_{21}, A_{3})$ and $\underset{̲}{B} = (E_{12}, E_{21}, B_{3})$ from $𝒩_{2}^{3}$ , where

$$A_{3} = \begin{pmatrix} 1 & 1 \\ - 1 & - 1 \end{pmatrix} \quad \text{and} \quad B_{3} = \begin{pmatrix} - 1 & 1 \\ - 1 & 1 \end{pmatrix}.$$

Thus, $η_{α} (A_{i}, A_{j}) = η_{α} (B_{i}, B_{j}) = 0$ for all $1 \leq i < j \leq 3$ and $α \in 𝔽 ∖ \{0, 1\}$. Moreover, $tr (A_{1} A_{2}) = tr (B_{1} B_{2}) = 1$, $tr (A_{1} A_{3}) = tr (B_{1} B_{3}) = - 1$, $tr (A_{2} A_{3}) = tr (B_{2} B_{3}) = 1$. On the other hand, $tr (A_{1} A_{2} A_{3}) = 1$ and $tr (B_{1} B_{2} B_{3}) = - 1$ are different, since $char 𝔽 \neq 2$. □
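The trace values quoted in Claim 4 can be reproduced with integer arithmetic, since the entries of all matrices involved are 0 and ±1 and reduce to any field. The following is a minimal sketch; the helper names `mul`, `tr`, `pair_traces` and `triple_trace` are illustrative assumptions, and the invariants $η_{α}$ are not evaluated here.

```python
def mul(x, y):
    """Product of 2x2 integer matrices given as ((a, b), (c, d))."""
    return tuple(
        tuple(sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def tr(x):
    return x[0][0] + x[1][1]


ZERO = ((0, 0), (0, 0))
E12 = ((0, 1), (0, 0))
E21 = ((0, 0), (1, 0))
A3 = ((1, 1), (-1, -1))
B3 = ((-1, 1), (-1, 1))

A = (E12, E21, A3)
B = (E12, E21, B3)


def pair_traces(t):
    """Values of tr(Y1 Y2), tr(Y1 Y3), tr(Y2 Y3) on a triple t."""
    return (tr(mul(t[0], t[1])), tr(mul(t[0], t[2])), tr(mul(t[1], t[2])))


def triple_trace(t):
    return tr(mul(mul(t[0], t[1]), t[2]))


# A3 and B3 square to zero, so both triples consist of nilpotent matrices.
assert mul(A3, A3) == ZERO and mul(B3, B3) == ZERO

print(pair_traces(A), pair_traces(B))    # the pairwise traces coincide
print(triple_trace(A), triple_trace(B))  # 1 and -1, equal only when 2 = 0
```

The triple traces are 1 and -1, so the two tuples are separated by $tr (Y_{1} Y_{2} Y_{3})$ exactly when the characteristic is different from 2.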

5. The Case of Infinite Field

In this section we assume that $𝔽$ is infinite. We say that a G-invariant subset $W \subset V$ is a G-invariant cone if the conditions $w \in W$ and $α \in 𝔽^{\times}$ imply that $α w \in W$ .

Remark 5.1. If $W \subset V$ is a G-invariant cone, then the algebra $𝒪 {(W)}^{G}$ has $ℕ$ -grading by degrees.

Note that $𝒩_{n}^{m}$ is a multi-cone, i.e., if $\underset{̲}{A} \in 𝒩_{n}^{m}$ and $α_{1}, \dots, α_{m} \in 𝔽^{\times}$ , then $(α_{1} A_{1}, \dots, α_{m} A_{m}) \in 𝒩_{n}^{m}$ . Thus, the following extension of Remark 5.1 to the case of multidegree holds.

Remark 5.2. The algebra $𝒪 {(𝒩_{n}^{m})}^{{GL}_{n}}$ has $ℕ^{m}$ -grading by multidegrees, where the multidegree of $f \in 𝒪 (𝒩_{n}^{m}) ≃ 𝔽 [M_{n}^{m}] ∕ I (𝒩_{n}^{m})$ is defined as the minimal multidegree (with respect to some fixed lexicographical order) of a polynomial $h \in 𝔽 [M_{n}^{m}]$ with $f = h + I (𝒩_{n}^{m})$ .

Lemma 5.3. If $f \in 𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ is $ℕ^{m}$ -homogeneous and $f \notin 𝔽$ , then $f (α_{1} E_{12}, \dots, α_{m} E_{12}) = 0$ for all $α_{1}, \dots, α_{m} \in 𝔽$ .

Proof. Denote the multidegree of f by $\underset{̲}{δ} = (δ_{1}, \dots, δ_{m})$, where $δ_{1} + \dots + δ_{m} > 0$. Then

$$f = β \, y_{12} {(1)}^{δ_{1}} \dots y_{12} {(m)}^{δ_{m}} + \sum_{h \in Ω} β_{h} h,$$

for some $β, β_{h} \in 𝔽$, where $Ω$ is the set of all monomials in $\{y_{i j} (k) \mid i, j \in \{1, 2\}, 1 \leq k \leq m\}$ different from $y_{12} {(1)}^{d_{1}} \dots y_{12} {(m)}^{d_{m}}$ for all $d_{1}, \dots, d_{m} \geq 0$.

For $g = (\begin{matrix} γ & 0 \\ 0 & 1 \end{matrix}) \in {GL}_{2}$ with $γ \in 𝔽^{\times}$ , we have

$$\begin{aligned} β &= f (E_{12}, \dots, E_{12}) = f (g \cdot (E_{12}, \dots, E_{12})) \\ &= f (γ E_{12}, \dots, γ E_{12}) = β \, γ^{δ_{1} + \dots + δ_{m}} \end{aligned}$$

for all $γ \in 𝔽^{\times}$. Hence, $β = 0$, since $𝔽$ is infinite. Therefore, $f (α_{1} E_{12}, \dots, α_{m} E_{12}) = β α_{1}^{δ_{1}} \dots α_{m}^{δ_{m}} = 0$. □
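Two steps of the proof of Lemma 5.3 can be spelled out. The computation below assumes that ${GL}_{2}$ acts on matrices by conjugation, $g \cdot A = g A g^{- 1}$, as stated for the simultaneous conjugation action in the abstract; the abbreviation $d = δ_{1} + \dots + δ_{m}$ is introduced here only for illustration.

```latex
% Conjugation by g = diag(\gamma, 1) rescales E_{12}:
\[
g E_{12} g^{-1}
  = \begin{pmatrix} \gamma & 0 \\ 0 & 1 \end{pmatrix}
    \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}
    \begin{pmatrix} \gamma^{-1} & 0 \\ 0 & 1 \end{pmatrix}
  = \begin{pmatrix} 0 & \gamma \\ 0 & 0 \end{pmatrix}
  = \gamma E_{12}.
\]
% Vanishing of \beta, with d = \delta_1 + \dots + \delta_m > 0:
\[
\beta = \beta \gamma^{d} \ \text{for all } \gamma \in \mathbb{F}^{\times}
\quad \Longrightarrow \quad
\beta (\gamma^{d} - 1) = 0 \ \text{for all } \gamma \in \mathbb{F}^{\times}.
\]
% The polynomial x^d - 1 has at most d roots in F, while F^\times is infinite,
% so \gamma^d \neq 1 for some \gamma, and therefore \beta = 0.
```

Over a finite field $𝔽_{q}$ this argument breaks down for $d$ divisible by $q - 1$, which is consistent with the finite and infinite cases being treated separately.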

Theorem 5.4. Assume that $𝔽$ is infinite. Then

•	$S_{2, m}^{(2)}$ , in case $char 𝔽 = 2$ ;
•	$S_{2, m}$ , in case $char 𝔽 > 2$ ;

is a minimal separating set for the algebra of invariant polynomial functions on nilpotent matrices $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ for all $m > 0$ .

Proof. Denote by S the set $S_{2, m}^{(2)}$ or $S_{2, m}$ , respectively, in case $char 𝔽 = 2$ or $char 𝔽 > 2$ , respectively. Assume that $\underset{̲}{A}, \underset{̲}{B} \in 𝒩_{2}^{m}$ are not separated by S and $f \in 𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ . To prove that S is separating we have to show that $f (\underset{̲}{A}) = f (\underset{̲}{B})$ . By Remark 5.2, without loss of generality, we can assume that f is $ℕ^{m}$ -homogeneous of multidegree $\underset{̲}{δ}$ . Moreover, without loss of generality, we can assume that $f (\underset{̲}{A}) \neq 0$ . By Lemmas 4.2 and 4.4, one of the next two cases holds:

•

where $σ \in 𝒮_{m}$, $k, l, r, s \geq 0$, $α_{1}, \dots, α_{r + s}, β_{1}, \dots, β_{l + s} \in 𝔽^{\times}$; moreover, $α_{1} = 1$ in case $\underset{̲}{A} \neq 0$ and $β_{1} = 1$ in case $\underset{̲}{B} \neq 0$;
•	$\underset{̲}{A} \sim \underset{̲}{B}$.

In the second case, we have $f (\underset{̲}{A}) = f (\underset{̲}{B})$ . Assume that the first case holds.

Define $f_{σ^{- 1}}$ as the result of substitutions $y_{i j} (t) \to y_{i j} (σ^{- 1} (t))$ in f for all $1 \leq t \leq m$ , $1 \leq i, j \leq 2$ . Then $f_{σ^{- 1}} \in 𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ and $f_{σ^{- 1}} ({\underset{̲}{C}}_{σ}) = f (\underset{̲}{C})$ for all $\underset{̲}{C} \in 𝒩_{2}^{m}$ . Therefore, to prove that $f (\underset{̲}{A}) = f (\underset{̲}{B})$ it is enough to show that $f_{σ^{- 1}} ({\underset{̲}{A}}_{σ}) = f_{σ^{- 1}} ({\underset{̲}{B}}_{σ})$ . Note that ${\underset{̲}{A}}_{σ}$ , ${\underset{̲}{B}}_{σ}$ are not separated by S by Remark 4.1. Therefore, without loss of generality, we can assume that $σ \in 𝒮_{m}$ is the trivial permutation.

If $δ_{i} \neq 0$ for some $1 \leq i \leq k + l$, then $f (\underset{̲}{A}) = 0$; a contradiction. Otherwise, $δ_{1} = \dots = δ_{k + l} = 0$. Hence, f is a polynomial in $\{y_{i j} (k + l + 1), \dots, y_{i j} (m) \mid i, j \in \{1, 2\}\}$. Therefore, $f (E_{12}^{(k + l)}, α_{1} E_{12}, \dots, α_{r + s} E_{12}) = f (\underset{̲}{A})$ is non-zero; a contradiction to Lemma 5.3. Thus, S is separating.

Taking $\underset{̲}{A}, \underset{̲}{B}$ from Claims 3, 4 from the proof of Theorem 4.5 we obtain that S is a minimal separating set for all $m > 0$ . □

6. Corollaries

As in Sec. 1.1, assume that V is an n-dimensional vector space over $𝔽$ , G is a subgroup of $GL (V)$ , and $W \subset V$ is a G-invariant subset of V. We say that an $m_{0}$ -tuple $\underset{̲}{j} \in ℕ^{m_{0}}$ is m-admissible if $1 \leq j_{1} < \dots < j_{m_{0}} \leq m$ . For any m-admissible $\underset{̲}{j} \in ℕ^{m_{0}}$ and $f \in 𝒪 {(W^{m_{0}})}^{G}$ we define the invariant polynomial function $f^{(\underset{̲}{j})} \in 𝒪 {(W^{m})}^{G}$ as the result of the following substitution of variables in f:

$$y_{1, i} \to y_{j_{1}, i}, \; \dots, \; y_{m_{0}, i} \to y_{j_{m_{0}}, i} \quad \text{(for all } 1 \leq i \leq n\text{)}.$$

Given a set $S \subset 𝒪 {(W^{m_{0}})}^{G}$, we define its expansion $S^{[m]} \subset 𝒪 {(W^{m})}^{G}$ by

$$S^{[m]} = \{f^{(\underset{̲}{j})} \mid f \in S \text{ and } \underset{̲}{j} \in ℕ^{m_{0}} \text{ is } m\text{-admissible}\}. \tag{13}$$
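The index bookkeeping behind the expansion (13) can be sketched in a few lines. The m-admissible tuples are exactly the strictly increasing $m_{0}$-tuples with entries in 1, ..., m. The encoding of an invariant by a tuple of indices, for example (1, 2, 3) for $tr (Y_{1} Y_{2} Y_{3})$, and the function names are assumptions made only for this illustration.

```python
from itertools import combinations


def admissible_tuples(m0, m):
    """All m-admissible m0-tuples, i.e. 1 <= j_1 < ... < j_{m0} <= m."""
    return list(combinations(range(1, m + 1), m0))


def expand_labels(generators, m0, m):
    """Symbolic version of S^[m]: relabel Y_1..Y_{m0} by Y_{j_1}..Y_{j_{m0}}.

    Each generator is a tuple of indices in 1..m0 (hypothetical encoding),
    e.g. (1, 2, 3) stands for tr(Y_1 Y_2 Y_3).
    """
    return [
        tuple(j[i - 1] for i in gen)
        for gen in generators
        for j in admissible_tuples(m0, m)
    ]


print(admissible_tuples(3, 4))
print(expand_labels([(1, 2, 3)], 3, 4))
```

For $m_{0} = 3$ and $m = 4$ there are four admissible tuples, so a single invariant of three matrices expands to four invariants of four matrices.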

Remark 6.1 (Cf. [10, Remark 1.3]). Assume that $S_{1}$ and $S_{2}$ are separating sets for $𝒪 {(V^{m_{0}})}^{G}$ and assume that $m > m_{0}$ . Then $S_{1}^{[m]}$ is separating for $𝒪 {(V^{m})}^{G}$ if and only if $S_{2}^{[m]}$ is separating for $𝒪 {(V^{m})}^{G}$ .

Denote by $σ sep (𝒪 (W), G)$ the minimal number $m_{0}$ such that the expansion of some separating set S for $𝒪 {(W^{m_{0}})}^{G}$ produces a separating set for $𝒪 {(W^{m})}^{G}$ for all $m \geq m_{0}$ . It immediately follows from the main result of [33] that $σ sep (𝒪 (V), 𝒮_{n}) \leq ⌊ \frac{n}{2} ⌋ + 1$ over an arbitrary field $𝔽$ , where $𝒮_{n}$ acts on V by the permutation of the coordinates. Moreover, $σ sep (𝒪 (V), 𝒮_{n}) \leq ⌊ {log}_{2} (n) ⌋ + 1$ in case $𝔽 = 𝔽_{2}$ (see [28, Corollary 4.12]).

Corollary 6.2. Assume that $𝔽 = 𝔽_{q}$ is finite and $m \geq 2$ . Then

•	$β sep (𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}) \leq 2$ , in case $q = 2$ ;
•	$β sep (𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}) \leq 4 (q - 1)$ , in case $q > 2$ .

Proof. See Theorem 4.5 and formulas (9) and (10). □

Corollary 6.3. Assume that $𝔽$ is infinite and $m \geq 2$ . Then

•	$β sep (𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}) \leq 2$ , in case $char 𝔽 = 2$ or $m = 2$ ;
•	$β sep (𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}) \leq 3$ , in case $char 𝔽 \neq 2$ and $m > 2$ .

Proof. See Theorem 5.4. □

Corollary 6.4. Assume that $𝔽$ is an arbitrary field. Then

$$σ sep (𝒪 (𝒩_{2}), {GL}_{2}) = \begin{cases} 2 & \text{if } char 𝔽 = 2, \\ 3 & \text{if } char 𝔽 \neq 2. \end{cases}$$

Proof. The upper bound on $σ sep (𝒪 (𝒩_{2}), {GL}_{2})$ follows from Theorems 4.5 and 5.4. To obtain the lower bound on $σ sep (𝒪 (𝒩_{2}), {GL}_{2})$ we consider $\underset{̲}{A}, \underset{̲}{B}$ from Claims 3 and 4 of the proof of Theorem 4.5. □

Corollary 6.5. Assume that $𝔽 = 𝔽_{q}$ . Then a minimal separating set

•	$H_{2, m}^{(2)}$ , in case $char 𝔽 = 2$ ;
•	$H_{2, m}$ , in case $char 𝔽 > 2$ ;

for $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ contains the least possible number of elements for a separating set for $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ if and only if $m = 1$ or $m = q = 2$ .

Proof. The set from the formulation of the corollary is a minimal separating set for $𝒪 {(𝒩_{2}^{m})}^{{GL}_{2}}$ by Theorem 4.5. We have

$$| H_{2, m}^{(2)} | = m + \binom{m}{2} (q - 1) \quad \text{and} \quad | H_{2, m} | = | H_{2, m}^{(2)} | + \binom{m}{3},$$

where the binomial coefficient $\binom{m}{k}$ is zero in case $m < k$.

Assume $m = 1$ . Then Corollary 3.5 implies that $γ = | H_{2, m}^{(2)} | = | H_{2, m} | = 1$ ; i.e., the required is proven.

Assume $m = q = 2$ . Then $char 𝔽 = 2$ and Corollary 3.5 implies that $γ = | H_{2, m}^{(2)} | = 3$ ; i.e., the required is proven.

Assume $m = 2$ and $q \geq 3$ . Then Corollary 3.5 implies that $γ = 2$ , but $| H_{2, m} | = | H_{2, m}^{(2)} | = q + 1 > γ$ .

Assume $m \geq 3$. Then Corollary 3.5 implies that $γ = 2 m - 2$, but $| H_{2, m} | > | H_{2, m}^{(2)} | = m + \binom{m}{2} (q - 1) > γ$, since $\binom{m}{2} \geq m$. □
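The case analysis in the proof of Corollary 6.5 can be tabulated numerically. In the sketch below, the cardinalities follow the displayed formulas, while `gamma` simply transcribes the values of $γ$ quoted in the proof from Corollary 3.5 (which is not restated in this section); the function names are illustrative assumptions, and q is assumed to be a prime power.

```python
from math import comb


def size_H2(m, q):
    """|H_{2,m}^{(2)}| = m + C(m, 2) * (q - 1)."""
    return m + comb(m, 2) * (q - 1)


def size_H(m, q):
    """|H_{2,m}| = |H_{2,m}^{(2)}| + C(m, 3); comb returns 0 when m < 3."""
    return size_H2(m, q) + comb(m, 3)


def gamma(m, q):
    """Least possible size of a separating set, as quoted in the proof."""
    if m == 1:
        return 1
    if m == 2:
        return 3 if q == 2 else 2
    return 2 * m - 2


def minimal_set_is_least(m, q):
    """Compare the minimal separating set of Theorem 4.5 with gamma."""
    # q is a prime power, so char F = 2 exactly when q is even
    size = size_H2(m, q) if q % 2 == 0 else size_H(m, q)
    return size == gamma(m, q)


for q in (2, 3, 4, 5):
    print(q, [minimal_set_is_least(m, q) for m in range(1, 6)])
```

Equality holds only for m = 1 or m = q = 2 in the tested range, in agreement with the statement of the corollary.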

Example 6.6. Assume $𝔽 = 𝔽_{2}$ and $m = 2$ . By Theorem 3.4, each ${GL}_{2}$ -orbit on $𝒩_{2}^{2}$ contains one and only one element from the following set:

$$(0, 0), \; (E_{12}, 0), \; (E_{12}, E_{12}), \; (0, E_{12}), \; (E_{12}, E_{21}).$$

Corollary 6.5 implies that the set

$$H_{2, 2}^{(2)} = \{tr (Y_{1} Y_{2}), \, ζ (Y_{1}), \, ζ (Y_{2})\}$$

is a separating set for $𝒪 {(𝒩_{2}^{2})}^{{GL}_{2}}$ containing the least possible number of elements.
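Example 6.6 is small enough to be checked by enumeration. The sketch below lists the ${GL}_{2} (𝔽_{2})$-orbits on pairs of nilpotent matrices under simultaneous conjugation and evaluates the three functions on them. The definition of $ζ$ is not restated in this section, so the code assumes that $ζ$ is the indicator of a non-zero matrix (over $𝔽_{2}$ any function separating 0 from $E_{12}$ separates in the same way); all names are illustrative.

```python
from itertools import product

P = 2  # work over the field F_2


def mul(x, y):
    """Product of 2x2 matrices modulo P."""
    return tuple(
        tuple(sum(x[i][k] * y[k][j] for k in range(2)) % P for j in range(2))
        for i in range(2)
    )


def det(x):
    return (x[0][0] * x[1][1] - x[0][1] * x[1][0]) % P


def tr(x):
    return (x[0][0] + x[1][1]) % P


IDENTITY = ((1, 0), (0, 1))
ZERO = ((0, 0), (0, 0))
ALL = [((a, b), (c, d)) for a, b, c, d in product(range(P), repeat=4)]
GL2 = [g for g in ALL if det(g) != 0]
NILP = [x for x in ALL if tr(x) == 0 and det(x) == 0]


def inverse(g):
    """Inverse found by search; fine for a group with six elements."""
    return next(h for h in GL2 if mul(g, h) == IDENTITY)


def conj(g, x):
    return mul(mul(g, x), inverse(g))


def orbit(pair):
    return frozenset((conj(g, pair[0]), conj(g, pair[1])) for g in GL2)


def zeta(x):
    # ASSUMPTION: zeta is taken to be the indicator of a non-zero matrix
    return 0 if x == ZERO else 1


def signature(pair):
    """Values of tr(Y1 Y2), zeta(Y1), zeta(Y2) on a pair."""
    return (tr(mul(pair[0], pair[1])), zeta(pair[0]), zeta(pair[1]))


orbits = {orbit(pair) for pair in product(NILP, repeat=2)}
values = [{signature(pair) for pair in o} for o in orbits]

print(len(orbits))                       # number of orbits
print(sorted(len(o) for o in orbits))    # orbit sizes
print(sorted(min(v) for v in values))    # one signature per orbit
```

Each orbit receives a single signature and different orbits receive different ones. Since two functions with values in $𝔽_{2}$ take at most four value pairs, fewer than three functions cannot separate five orbits.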

Acknowledgment

The work was supported by FAPESP 2018/23690-6.

ORCID

Artem Lopatin https://orcid.org/0000-0003-2495-6050

References

[1] V. Baranovsky , The variety of pairs of commuting nilpotent matrices is irreducible, Transform. Groups 6(1) (2001) 3–8. Crossref, Google Scholar
[2] R. Basili , On the irreducibility of commuting varieties of nilpotent matrices, J. Algebra 268 (2003) 58–80. Crossref, Web of Science, Google Scholar
[3] R. Basili and A. Iarrobino , Pairs of commuting nilpotent matrices, and Hilbert function, J. Algebra 320 (2008) 1235–1254. Crossref, Web of Science, Google Scholar
[4] V. Bondarenko, V. Futorny, A. Petravchuk and V. Sergeichuk , Pairs of commuting nilpotent operators with one-dimensional intersection of kernels and matrices commuting with a Weyr matrix, Linear Algebra Appl. 612 (2021) 188–205. Crossref, Web of Science, Google Scholar
[5] F. B. Cavalcante and A. Lopatin , Separating invariants of three nilpotent $3 \times 3$ matrices, Linear Algebra Appl. 607 (2020) 9–28. Crossref, Google Scholar
[6] H. Derksen and G. Kemper , Computational Invariant Theory, Encyclopaedia of Mathematical Sciences, Vol. 130 (Springer-Verlag, Berlin, 2002), pp. x+268. Crossref, Google Scholar
[7] H. Derksen and G. Kemper , Computational Invariant Theory, 2nd edn., Encyclopaedia of Mathematical Sciences, Vol. 130 (Springer-Verlag, Berlin, Heidelberg, 2015), pp. xxii+366. Crossref, Google Scholar
[8] D. Đoković , Poincaré series of some pure and mixed trace algebras of two generic matrices, J. Algebra 309 (2007) 654–671. Crossref, Web of Science, Google Scholar
[9] D. Đoković , On orthogonal and special orthogonal invariants of a single matrix of small order, Lin. Multilin. Algebra 57(4) (2009) 345–354. Crossref, Web of Science, Google Scholar
[10] M. Domokos , Typical separating invariants, Transform. Groups 12 (2007) 49–63. Crossref, Web of Science, Google Scholar
[11] M. Domokos, S. G. Kuzmin and A. N. Zubkov , Rings of matrix invariants in positive characteristic, J. Pure Appl. Algebra 176 (2002) 61–80. Crossref, Web of Science, Google Scholar
[12] H. Derksen and V. Makam , Polynomial degree bounds for matrix semi-invariants, Adv. Math. 310 (2017) 44–63. Crossref, Web of Science, Google Scholar
[13] H. Derksen and V. Makam , Generating invariant rings of quivers in arbitrary characteristic, J. Algebra 489 (2017) 435–445. Crossref, Web of Science, Google Scholar
[14] H. Derksen and V. Makam , Algorithms for orbit closure separation for invariants and semi-invariants of matrices, Algebra Number Theory 14(10) (2020) 2791–2813. Crossref, Web of Science, Google Scholar
[15] H. Derksen and V. Makam , Weyl’s polarization theorem in positive characteristic, Transform. Groups 26 (2021) 1241–1260. Crossref, Web of Science, Google Scholar
[16] M. Domokos , Characteristic free description of semi-invariants of $2 \times 2$ matrices, J. Pure Appl. Algebra 224(5) (2020) 106220. Crossref, Google Scholar
[17] M. Domokos, Addendum to characteristic free description of semi-invariants of $2 \times 2$ matrices, [J. Pure Appl. Algebra 224(5) (2020) 106220], J. Pure Appl. Algebra 224(6) (2020) 106270. Google Scholar
[18] S. Donkin , Invariants of several matrices, Invent. Math. 110 (1992) 389–401. Crossref, Web of Science, Google Scholar
[19] V. Drensky and L. Sadikova , Generators of invariants of two $4 \times 4$ matrices, C. R. Acad. Bulgare Sci. 59(5) (2006) 477–484. Google Scholar
[20] J. Elmer , The separating variety for $2 \times 2$ matrix invariants, Linear and Multilinear Algebra 72(3) (2024) 389–411. Crossref, Google Scholar
[21] J. Elmer , The separating variety for matrix semi-invariants, Linear Algebra Its Appl. 674 (2023) 466–492. Crossref, Web of Science, Google Scholar
[22] N. J. Fine and I. U. Herstein , The probability that a matrix be nilpotent, Illinois J. Math. 2 (1958) 499–504. Crossref, Google Scholar
[23] J. Hua, Counting representations of quivers over finite fields, J. Algebra 226 (2000) 1011–1033.
[24] J. Hua, On numbers of tuples of nilpotent matrices over finite fields under simultaneous conjugation (2021), arXiv:2102.07290.
[25] J. Hua, Canonical forms for a class of pairs of commuting nilpotent matrices under simultaneous similarity (2023), arXiv:2305.00176.
[26] J. Hua, Canonical forms for pairs of commuting nilpotent $4 \times 4$ matrices under simultaneous similarity (2023), arXiv:2303.15862.
[27] I. Kaygorodov, A. Lopatin and Y. Popov, Separating invariants for $2 \times 2$ matrices, Linear Algebra Its Appl. 559 (2018) 114–124.
[28] G. Kemper, A. Lopatin and F. Reimers, Separating invariants over finite fields, J. Pure Appl. Algebra 226 (2022) 106904.
[29] A. A. Lopatin, The invariant ring of triples of $3 \times 3$ matrices over a field of arbitrary characteristic, Sibirsk. Mat. Zh. 45(3) (2004) 624–633 (Russian). English translation: Sib. Math. J. 45(3) (2004) 513–521.
[30] A. A. Lopatin, The algebra of invariants of $3 \times 3$ matrices over a field of arbitrary characteristic, Commun. Algebra 32(7) (2004) 2863–2883.
[31] A. A. Lopatin, Relatively free algebras with the identity $x^{3} = 0$, Commun. Algebra 33(10) (2005) 3583–3605.
[32] A. A. Lopatin, Minimal generating set for semi-invariants of quivers of dimension two, Linear Algebra Its Appl. 434(8) (2011) 1920–1944.
[33] A. Lopatin and F. Reimers, Separating invariants for multisymmetric polynomials, Proc. Amer. Math. Soc. 149 (2021) 497–508.
[34] A. Lopatin and A. N. Zubkov, Separating $G_{2}$-invariants of several octonions, Algebra Number Theory 18(12) (2024) 2157–2177.
[35] C. Procesi, The invariant theory of $n \times n$ matrices, Adv. Math. 19 (1976) 306–381.
[36] C. Procesi, Computing with $2 \times 2$ matrices, J. Algebra 87 (1984) 342–359.
[37] F. Reimers, Separating invariants for two copies of the natural $S_{n}$-action, Commun. Algebra 48 (2020) 1584–1590.
[38] K. S. Sibirskii, Algebraic invariants of a system of matrices, Sibirsk. Mat. Zh. 9(1) (1968) 152–164 (Russian). English translation: Soviet Math. Dokl. 8 (1967) 36–40.
[39] R. J. Souza Ferreira and A. Lopatin, Minimal generating and separating sets for $O(3)$-invariants of several matrices, Oper. Matrices 17(3) (2023) 639–651.
[40] Y. Teranishi, The ring of invariants of matrices, Nagoya Math. J. 104 (1986) 149–161.