The pseudovariety Com∗D viewed as a power pseudovariety
Abstract
The objective of this paper is to document that the pseudovariety equation PX=Com∗D admits a solution U. This solution U is a pseudovariety raised up by Almeida in his book on finite semigroups and universal algebra as a prospective candidate for such a solution. This result achieved in the present paper resolves an open problem posed by Almeida in his book a long time ago.
Communicated: J. Almeida
1. Introduction
The semidirect product on the level of pseudovarieties of semigroups has been widely studied in the past. See [2, Chap. 10] for a survey of results obtained until mid-nineties in this direction. Likewise the power operator on the level of pseudovarieties of semigroups has been the subject of intensive study in the past. See [2, Chap. 11] for an overview of results obtained by that time in this area. From time to time these two routes of research have met up with each other. One particular instance of such a fruitful encounter is represented by the study of locally commutative power pseudovarieties, whose results are summarized in [2, Sec. 11.9]. In this context, the pseudovariety Com∗D has arisen, which has been previously the object of concern of researchers in the domain of semidirect products of pseudovarieties of semigroups. See [2, Sec. 10.7] in this respect, for example.
Recall yet that Com is the pseudovariety of all commutative semigroups, D is the pseudovariety of all definite semigroups, that is, the pseudovariety of all semigroups whose idempotents act as right zeros and Com∗D is the pseudovariety of semigroups generated by the class of all semidirect products of commutative semigroups by definite semigroups. An effective characterization of the pseudovariety Com∗D has been provided by Thérien and Weiss in [10]. Their result actually says that Com∗D is the pseudovariety of semigroups determined by the pseudoidentity exfyezf≏ezfyexf. See also [2, Sec. 10.7] once again in this regard.
Let LCom be the pseudovariety of all locally commutative semigroups. That is, LCom consists of all finite semigroups S having the property that, for every idempotent e in S, the submonoid eSe of S is a commutative monoid. Yet otherwise stated, LCom is the pseudovariety of semigroups determined by the pseudoudentity exeye≏eyexe. In this context, Almeida has proposed in [1] and in [2, Sec. 11.9] to consider the subpseudovariety U of the pseudovariety LCom determined within LCom by the additional pseudoidentity (ef)ωexf≏exf. It is easy to check that, within LCom, the pseudoidentity (ef)ωexf≏exf is equivalent to the pseudoidentity exf(ef)ω≏exf, which shows that the pseudovariety U is self-dual. In fact, it is straightforward to infer hence that the pseudovariety U consists of all finite semigroups S having the property that, for every idempotents e,f in S, the subsemigroup eSf of S is a commutative monoid which admits the element (ef)ω for its identity.
Furthermore, Almeida considered in [1] and in [2, Sec. 11.9] the power pseudovariety PU. Recall that PU is the pseudovariety generated by the class of all power semigroups 𝒫(S), where S are arbitrary semigroups from the pseudovariety U. Having this pseudovariety in view, Almeida has shown in [1] and in [2, Sec. 11.9] that the pseudovariety Com∗D provides an upper bound for the power pseudovariety PU. That is, he has established the inclusion PU⊆Com∗D, and he has done so by verifying directly that the power semigroups 𝒫(S), for all semigroups S in the pseudovariety U, do satisfy the above-mentioned pseudoidentity exfyezf≏ezfyexf. He has also shown, in addition, that, for any pseudovariety V of semigroups, the condition PV⊆Com∗D is equivalent to the condition V⊆U. Thus U is the largest pseudovariety for which the inclusion PU⊆Com∗D holds. However, the question whether the equality PU=Com∗D actually holds here has been left open in [2, Sec. 11.9].
The purpose of this paper is to establish the missing inclusion Com∗D⊆PU. This is the inclusion that is opposite relative to the one discussed above. In this way, it eventually emerges that, in reality, the equality PU=Com∗D does hold. Therefore the pseudovariety Com∗D is indeed a power pseudovariety. It is equal to the pseudovariety PU, where U is the pseudovariety introduced above.
In order to achieve the appointed goal, we shall take advantage of the tools furnished by formal language theory. It is a celebrated result in the theory of varieties of recognizable languages attributed to Eilenberg that there are mutually inverse order preserving one-to-one correspondences between the lattice of all pseudovarieties of semigroups and the lattice of all varieties of languages. See [3, Sec. VII.3] for Eilenberg’s original treatment of this subject. Having this in view, we shall let 𝒱 be the variety of languages corresponding to the pseudovariety Com∗D, and we shall let 𝒲 be the variety of languages corresponding to the pseudovariety PU. Then, in our efforts to establish the inclusion Com∗D⊆PU, we shall be done if we can show that we have 𝒱⊆𝒲. This is exactly what we shall strive for in this paper.
The other tools from formal language theory that we shall make use of include Straubing’s wreath product principle, expounded in detail by Pin and Weil in [7], which makes it possible to underpin the languages appertaining to the variety 𝒱 corresponding to the pseudovariety Com∗D, and Straubing’s characterization of languages recognized by power semigroups, obtained in [9], which allows us to handle the languages belonging to the variety 𝒲 corresponding to the power pseudovariety PU.
2. Preliminaries
All semigroups appearing in this paper, with the exception of free semigroups, will be finite. For every semigroup S, we shall denote by E(S) the set of all idempotents in S. We shall denote by S1 the monoid which is equal to S provided that S already is a monoid, and which is obtained from S by adjoining an identity to it otherwise.
By a pseudovariety of semigroups we shall mean any class of finite semigroups which is closed under the formation of direct products of finite families of semigroups and under the formation of subsemigroups and homomorphic images of semigroups. Such pseudovarieties can be determined by collections of pseudoidentities, as it has been the case on several instances in the previous section. We shall not need the concept of pseudoidentity in its full generality, and so we shall not embark on this subject here. We only mention that, in such a pseudoidentity u≏v, apart from letters like x,y,z,… which are reserved for variables that may be substituted by arbitrary elements from a semigroup S in which the pseudoidentity u≏v is evaluated, also letters like e,f,g,… may appear which are reserved for constituents that may be substituted only by idempotents from E(S).
We have met already in the previous section the pseudovariety D of all definite semigroups. This pseudovariety consists of all finite semigroups S having the property that every idempotent in E(S) acts as a right zero in S. Thus D is the pseudovariety determined by the pseudoidentity xe≏e. For every positive integer m, let Dm be the pseudovariety determined by the pseudoidentity xy1y2⋯ym≏y1y2⋯ym. Then it turns out that D=⋃∞m=1Dm. Later on, we shall also need the pseudovariety K which is dual with respect to the pseudovariety D. That is, K consists of all finite semigroups S having the property that every idempotent in E(S) acts as a left zero in S. Thus K is the pseudovariety determined by the pseudoidentity ex≏e. For every positive integer m, let Km be the pseudovariety determined by the pseudoidentity x1x2⋯xmy≏x1x2⋯xm. Then it turns out that K=⋃∞m=1Km.
We shall have to deal in this paper with semidirect products of pseudovarieties of semigroups. To start with, we shall have to work with semidirect products of individual semigroups of the form S∗T, where S and T are arbitrary semigroups. We shall observe here the custom introduced by Almeida at the beginning of [2, Sec. 10.1], according to which, even if T is viewed as a semigroup, if T happens coincidentally to be a monoid, then the underlying left action of T on S giving rise to the semidirect product S∗T should be left unitary. With that, if V and W are any pseudovarieties of semigroups, then the semidirect product V∗W of these two pseudovarieties is defined to be the pseudovariety of semigroups generated by the class of all semidirect products of the form S∗T, where S is a semigroup from V and T is a semigroup from W, observing the custom regarding the underlying left action of T on S mentioned above. It is easy to verify that this semidirect product V∗W consists, in fact, of all divisors of the semidirect products S∗T just described.
Aside from ordinary semigroups, we shall have to deal also with transformation semigroups. To begin with, let P be a finite set. We shall denote by 𝒯P the monoid of all transformations of the set P. Note that we shall let the transformations from 𝒯P act on the elements of the set P on the right. That is, given an element p∈P and a transformation τ of the set P, we shall denote by pτ the result of the application of τ on p. Therefore the composition of transformations from 𝒯P should be interpreted accordingly, that is, it has to be read from the left to the right. Having this at hand, by a transformation semigroup we shall mean any pair X=(P,S) where P is a finite set and S is a subsemigroup of the monoid 𝒯P of all transformations of the set P. The set P will be called the underlying set of X and the semigroup S will be called the action semigroup of X.
In accordance with [3, Sec. I.10] or [2, Sec. 10.1], we introduce the operation of wreath product of transformation semigroups in the following manner. Let X=(P,S) and Y=(Q,T) be transformation semigroups. The wreath product X∘Y of these transformation semigroups is the transformation semigroup having for its underlying set the cartesian product P×Q and having for its action semigroup the semidirect product SQ∗T of the semigroup SQ of all functions of Q into S by the semigroup T determined by the left action of T on SQ given by the following formula. For every transformation τ∈T and for every function f:Q→S, the function τf:Q→S is defined by the rule q(τf)=(qτ)f, for every q∈Q, where functions are written on the right of their arguments. In order to keep the promise that X∘Y so determined should indeed be a transformation semigroup, it remains to observe that the action semigroup SQ∗T may be viewed as a subsemigroup of the monoid 𝒯P×Q in the following way. For every function f:Q→S and for every transformation τ∈T, the element (f,τ) of SQ∗T determines a transformation ϱ(f,τ) of the set P×Q sending every pair (p,q) from P×Q to the pair (p(qf),qτ). Then the assignment (f,τ)↦ϱ(f,τ) is an embedding of SQ∗T into 𝒯P×Q. Therefore the semigroup SQ∗T may indeed be treated as a subsemigroup of the monoid 𝒯P×Q. In this way, X∘Y becomes again a transformation semigroup.
However, the operation of wreath product X∘Y of transformation semigroups X=(P,S) and Y=(Q,T) so introduced has a certain defect. It appears when the semigroup T is a monoid, but the transformation representing the identity of T is not equal to the identity on the set Q. In such a case, the left action of T on SQ determined above need not be left unitary. And this is in conflict with our previous requirements regarding the underlying left action of the semigroup T on SQ, if the semidirect product SQ∗T has to be properly defined. Having this circumstance in view, we have introduced in [4] the concept of proper transformation semigroups. A transformation semigroup X=(P,S) is said to be proper if either S is not a monoid, or S is a monoid and the identity of S acts as the identity on the set P. If we restrict the scope of the above definition of wreath products only to proper transformation semigroups, then the difficulty that we have encountered above disappears. And it is not too hard to verify that the wreath product X∘Y of two proper transformation semigroups X and Y is a proper transformation semigroup again.
Most frequent examples of proper transformation semigroups are the right regular representations of ordinary semigroups. For any semigroup S and for every element s∈S, we may consider the transformation ρs:S1→S1 which maps every element t∈S1 to the element ts. Then the mapping S→𝒯S1 given by the formula s↦ρs is an embedding of the semigroup S into 𝒯S1. In this way, a proper transformation semigroup (S1,S) arises, which is called the right regular representation of S. Thus, for any semigroups S and T, we may consider the wreath product (S1,S)∘(T1,T) of the right regular representations of these semigroups. This wreath product is again a proper transformation semigroup and its action semigroup ST1∗T is just the standard wreath product S∘T of the semigroups S and T. It is a familiar fact that, for any semigroups S and T and for every left action of T on S, where the custom regarding the left actions that has been introduced above is observed, the semidirect product S∗T determined by this left action can be embedded into the wreath product S∘T.
As a consequence, we see that, for any two pseudovarieties of semigroups V and W, the semidirect product V∗W of these pseudovarieties is equal to the pseudovariety generated by the class of all wreath products of the form S∘T, where S is a semigroup in V and T is a semigroup in W. In fact, this semidirect product V∗W can be obtained as the class of all divisors of the wreath products S∘T just described. Finally note that, on the other hand, the semidirect product V∗W contains all action semigroups SQ∗T of the wreath products X∘Y of arbitrary proper transformation semigroups X=(P,S) and Y=(Q,T), where the semigroup S belongs to V and the semigroup T belongs to W.
We close this section with a few remarks on power semigroups and power pseudovarieties. For any semigroup S, we denote by 𝒫(S) the set of all subsets of S equipped with the usual multiplication. In this way, 𝒫(S) becomes a semigroup, called the power semigroup of S. For any pseudovariety V of semigroups, we let PV be the pseudovariety generated by all semigroups of the form 𝒫(S), where S stands for arbitrary semigroups from V. Note that, if V contains a non-trivial monoid, then PV can be actually obtained as the class of all semigroups T that divide some semigroup of the form 𝒫(S), where S is any semigroup in V. The pseudovariety PV arising in this way from V is called the power pseudovariety of V.
3. Formal Language Theory
Let A be a finite set. Within the context of the theory of formal languages, we shall say that A is an alphabet. Elements of this set A will be called letters. By A∗ we shall denote the free monoid on the set A, and by A+ we shall denote the free semigroup on the set A. Elements of A∗ will be called words in the alphabet A. By 1 we shall denote the empty word, that is, the identity of the monoid A∗. By a language over the alphabet A we shall mean any set of words in the alphabet A, that is, any subset L of the free monoid A∗. Later on, however, we shall have to deal with languages L that will be subsets of the free semigroup A+, that is, such languages L will not contain the empty word 1.
By an automaton we shall mean any triplet 𝒜=(Q,A,⋅) where Q is a finite set of states, A is an alphabet and ⋅ is a function Q×A→Q. One can extend this function ⋅ to a function Q×A∗→Q, also denoted by ⋅, by putting successively q⋅1=q and q⋅(wa)=(q⋅w)⋅a, for all q∈Q, w∈A∗ and a∈A.
Let L⊆A∗ be a language and let 𝒜=(Q,A,⋅) be an automaton. We shall say that the language L is recognized by the automaton 𝒜 if there exist a state q0∈Q, called the initial state and a subset F⊆Q of states, called the set of terminal states, such that L={w∈A∗:q0⋅w∈F}. Languages that are recognized by some automaton are called recognizable languages.
If we have to deal only with languages L⊆A+, then a slightly different notion of recognizability of such languages by means of automata suggests itself. Let again 𝒜=(Q,A,⋅) be an automaton. Then we may say that the language L is recognized by the automaton 𝒜 if there exist an initial state q0∈Q and a subset F⊆Q of terminal states such that L={w∈A+:q0⋅w∈F}. But if a language L⊆A+ is recognized by an automaton 𝒜 in this latter sense, then it can also be recognized by an automaton ¯𝒜 in the former sense, where the automaton ¯𝒜 is only slightly adapted from 𝒜. For languages L⊆A+, we shall usually have in view just this latter altered notion of recognizability.
One can consider quite a lot of operations on the languages over the given alphabet A. In the first place these are the standard Boolean operations, namely the union K∪L and the intersection K∩L of any two languages K,L⊆A∗, and the complement A∗\L of a language L⊆A∗. Further operations that one may consider are the product KL of two languages K,L⊆A∗ and the star L∗ of a language L⊆A∗. The product KL is defined to be the language KL={uw∈A∗:u∈K,w∈L}. The star L∗ is defined to be the submonoid of A∗ generated by the language L.
We proceed to introduce another family of languages, namely the so-called rational languages. The class of all rational languages over the alphabet A is defined to be the smallest subclass of the class of all languages over the alphabet A which contains all finite languages and is closed under the formation of unions of pairs of languages, of products of pairs of languages and of stars of languages. Now the classical Kleene’s theorem asserts that a language over an alphabet A is recognizable if and only if it is rational. See [5, Sec. 6.3], for example.
Given an automaton 𝒜=(Q,A,⋅), every word w∈A∗ determines a transformation ζw of the set Q given by the rule q↦q⋅w, for every q∈Q. The transformations ζw of the set Q, for all words w∈A∗, form a submonoid of the monoid 𝒯Q of all transformations of the set Q. This submonoid, which we shall denote by M(𝒜), is called the transition monoid of the automaton 𝒜. Note that this monoid M(𝒜) is generated by the set of all transformations of the set Q of the form ζa, for arbitrary elements a∈A. If we take in the above considerations only the transformations ζw of the set Q such that w∈A+, then these transformations will form a subsemigroup of the monoid 𝒯Q. This subsemigroup, which we shall denote by S(𝒜), is called the transition semigroup of the automaton 𝒜. Note also that then the pair (Q,S(𝒜)) forms a transformation semigroup. Later on, by the transition semigroup of the automaton 𝒜 we shall often mean just this transformation semigroup.
From now on, we shall confine ourselves to languages L⊆A+. So let L⊆A+ be such a language, let S be a finite semigroup, and let φ:A+→S be a homomorphism of semigroups. We shall say that the language L is recognized by the homomorphism φ if there exists a subset P⊆S such that L=φ−1(P). If this is the case, then we shall also say that the language L is recognized by the semigroup S. It turns out that a language L⊆A+ is recognized by a semigroup in this way if and only if L is recognized by an automaton, as described above. Indeed, if the language L⊆A+ is recognized by a semigroup S and if φ:A+→S is the underlying homomorphism of semigroups, then L is recognized by the automaton 𝒜S=(S1,A,⋅), where the function ⋅ is determined, for any s∈S1 and any a∈A, by the formula s⋅a=sφ(a). On the other hand, if the language L⊆A+ is recognized by an automaton 𝒜=(Q,A,⋅), then L is recognized by the transition semigroup S(𝒜) of this automaton.
The possibility to treat in this manner the notion of recognizability of languages over an alphabet A in terms of homomorphisms of the free semigroup A+ into finite semigroups makes it possible to verify easily the following statements. If L⊆A+ is a recognizable language, then its complement A+\L is also a recognizable language. If K,L⊆A+ are recognizable languages, then their union K∪L and their intersection K∩L are also recognizable languages.
It is also worth noticing that if L⊆A+ is a language that can be recognized by a semigroup S and if T is a semigroup such that S divides T, then L can also be recognized by T.
Let again L⊆A+ be a language. We shall say that this language L is recognized by a transformation semigroup X=(Q,S) if there exist a homomorphism of semigroups φ:A+→S, a state q0∈Q, called the initial state, and a subset F⊆Q of states, called the set of terminal states, such that L={w∈A+:q0φ(w)∈F}. As before, it turns out that a language L⊆A+ is recognized by a transformation semigroup in the way just specified if and only if L is recognized by an automaton in the way described above. Namely, if the language L⊆A+ is recognized by a transformation semigroup X=(Q,S) and if φ:A+→S is the underlying homomorphism of semigroups, then L is recognized by the automaton 𝒜X=(Q,A,⋅), where the function ⋅ is determined, for any q∈Q and any a∈A, by the formula q⋅a=qφ(a). On the other hand, if the language L⊆A+ is recognized by an automaton 𝒜=(Q,A,⋅), then L is recognized by the transition semigroup (Q,S(𝒜)) of this automaton.
Also the following notes on the various notions of recognizability will come in handy. If a language L⊆A+ is recognized by a transformation semigroup X=(Q,S), then L is recognized by the transformation semigroup (S1,S). The latter semigroup, which is the right regular representation of the semigroup S, is itself a proper transformation semigroup. It is also easy to see that a language L⊆A+ is recognized by a semigroup S if and only if L is recognized by its right regular representation (S1,S).
If X=(Q,S) is a transformation semigroup and if L⊆A+ is a language that can be recognized by the semigroup S, then L is a Boolean combination of languages over the alphabet A that can be recognized by X. Indeed, if φ:A+→S is a homomorphism and if P⊆S is a subset such that L=φ−1(P), then it turns out that L=⋃τ∈P⋂q∈Q{w∈A+:qφ(w)=qτ}. The latter languages are clearly recognized by X.
There are plenty of other operations on the languages over various alphabets A. For example, given a language L⊆A+ and a word u∈A+, one can define the left quotient u−1L and the right quotient Lu−1 of the language L relative to the word u as the languages u−1L={w∈A+:uw∈L} and Lu−1={w∈A+:wu∈L}. As previously, if L⊆A+ is a recognizable language and u∈A+ is any word, then the left quotient u−1L and the right quotient Lu−1 are recognizable languages, as well.
Another operation on languages over distinct alphabets A and B is that of taking inverse images of languages over B with respect to various homomorphisms of free semigroups A+→B+. Thus let L⊆B+ be a language and let ψ:A+→B+ be a homomorphism of free semigroups. Then one may consider the inverse image ψ−1(L) of the language L with respect to ψ. Then, of course, ψ−1(L)⊆A+. And once again, if L is a recognizable language over B, then ψ−1(L) turns out to be a recognizable language over A.
Let L⊆A+ be any language. We shall say that a congruence ≈ on the free semigroup A+ saturates the language L if L is a union of certain classes of the partition A+/≈. There exists the coarsest congruence ∼L on A+ which saturates the language L. We will call this congruence ∼L the syntactic congruence of the language L. It is given, for all words u,v∈A+, by the formula
Now we are ready to evoke the celebrated Eilenberg’s theorem relating to each other pseudovarieties of finite semigroups and varieties of recognizable languages. Recall once again that we confine ourselves here to recognizable languages L⊆A+, for various alphabets A. Within this framework, this theorem has been presented already by Eilenberg in [3, Sec. VII.3]. Another exposition of this result has been provided by Pin in [6, Sec. 2.2]. The other version of this theorem involving pseudovarieties of finite monoids and recognizable languages L⊆A∗ has been furnished, aside from [3] and [6], also by Lallement in [5, Sec. 6.5].
We begin with yet another definition. By a class of recognizable languages we shall mean any mapping 𝒞 assigning to every alphabet A a set A+𝒞 of recognizable languages over A. (All of these languages are assumed to be subsets of A+.) If 𝒞 and 𝒟 are two classes of recognizable languages, then we shall write 𝒞⊆𝒟 if, for every alphabet A, we have the inclusion A+𝒞⊆A+𝒟. In this way, a partial order on the family of all classes of recognizable languages is introduced.
By a variety of languages we shall mean any class 𝒱 of recognizable languages such that the following conditions are satisfied:
(1) | For every alphabet A, the set A+𝒱 is a Boolean algebra, that is, A+𝒱 is closed with respect to the operation of set theoretical complement of languages (relative to the base set A+) and with respect to the operations of set theoretical union and intersection of pairs of languages. | ||||
(2) | For every alphabet A, for every language L in A+𝒱 and for every element a∈A, also the languages a−1L and La−1 belong to A+𝒱. | ||||
(3) | Whenever A and B are alphabets and ψ:A+→B+ is a homomorphism of free semigroups, then the preimage ψ−1(L) of any language L from B+𝒱 belongs to A+𝒱. |
Let now V be any pseudovariety of semigroups. We assign to V a class of recognizable languages 𝒱 in the following way. For every alphabet A, we let A+𝒱 be the set of all languages L⊆A+ having the property that the syntactic semigroup A+/∼L belongs to V. Then 𝒱 is a variety of languages.
We may also proceed the other way round. Let 𝒱 be any variety of languages. We assign to 𝒱 a pseudovariety V of semigroups as follows. We let V be the pseudovariety generated by all syntactic semigroups of the form A+/∼L, where A is any alphabet and L is any language from A+𝒱.
Then we may formulate the announced Eilenberg’s theorem.
Theorem 3.1. The assignments V↦𝒱 and 𝒱↦V described above determine mutually inverse order preserving one-to-one correspondences between the lattice of all pseudovarieties of semigroups and the partially ordered family of all varieties of languages.
Of course, it hence ensues immediately that also the partially ordered family of all varieties of languages forms a lattice.
In the continuation of this section, we proceed to explain the aforementioned Straubing’s wreath product principle making it possible to handle languages recognized by semigroups that belong to the semidirect products of couples of pseudovarieties of semigroups. As the original source of these ideas, the paper [8] of Straubing is usually being quoted. But it is not straightforward to identify this principle in that paper. Our exposition in this paper will follow closely that of Pin and Weil in [7, Sec. 3], where they have drawn it up for ordered semigroups. The unordered version of these results hence follows as a special case. Besides, in the course of our exposition that we offer here, we shall try to fix some shortcomings that have appeared in [7].
By a sequential transducer we shall mean a machine which can be formally described as any sextuplet 𝒯=(Q,A,B,q0,⋅,∗), where Q is a finite set of states, A is an alphabet called the input alphabet, B is another alphabet called the output alphabet, q0∈Q is the initial state, ⋅ is a function Q×A→Q called the transition function, and ∗ is a function Q×A→B∗ called the output function. Thus the triplet 𝒜=(Q,A,⋅) is an automaton, and hence the transition function ⋅ can be extended to a function Q×A∗→Q, also denoted by ⋅, in the manner described previously. In addition, the output function ∗ can be extended to a function Q×A∗→B∗, also denoted by ∗, by putting successively q∗1=1 and q∗(wa)=(q∗w)((q⋅w)∗a), for all q∈Q, w∈A∗ and a∈A. The function realized by the given sequential transducer 𝒯 is the function ς:A∗→B∗ defined by the formula ς(w)=q0∗w, for all words w∈A∗. Finally, given two alphabets A and B, by a sequential function we shall mean any function σ:A∗→B∗ which can be realized by a sequential transducer of the form 𝒯=(Q,A,B,q0,⋅,∗). Having all that at hand, and arguing in quite the same way as in [7, Sec. 3.1], we arrive at the following result.
Theorem 3.2. Let A and B be any alphabets and let σ:A∗→B∗ be a sequential function realized by a sequential transducer 𝒯=(Q,A,B,q0,⋅,∗). Let Y=(Q,T) be the transition semigroup of the underlying automaton 𝒜=(Q,A,⋅). Let Z=(Q,U) be any proper transformation semigroup such that T⊆U. If L⊆B+ is a language recognized by a proper transformation semigroup X=(P,S), then the language σ−1(L)⊆A+ is recognized by the wreath product X∘Z of the transformation semigroups X and Z.
Let A be an alphabet, let T be a finite semigroup, and let φ:A+→T be a homomorphism of semigroups. We shall associate with this homomorphism φ a sequential transducer 𝒯φ which we shall construct in the following way. We set BT=T1×A. Then we put 𝒯φ=(T1,A,BT,1,⋅,∗), where the transition function ⋅:T1×A→T1 is given, for each t∈T1 and a∈A, by the formula t⋅a=tφ(a), and the output function ∗:T1×A→B∗T is given, again for each t∈T1 and a∈A, by the formula t∗a=(t,a). The sequential function σφ:A∗→B∗T realized by this sequential transducer 𝒯φ will be called the sequential function associated with the homomorphism φ. Then it is easy to verify that, for any positive integer h and for arbitrary elements a1,a2,…,ah∈A, the equality
Note that, when constructing the sequential transducer 𝒯φ, in comparison with the opening passage of [7, Sec. 3.2], we do not assume here the homomorphism φ:A+→T to be surjective. Therefore also the transition semigroup of the underlying automaton 𝒜φ=(T1,A,⋅) need not be equal to the entire transformation semigroup (T1,T). But these properties are not exploited in the subsequent reasonings in [7, Sec. 3.2]. Thus we can manage without these properties.
Let now X=(P,S) and Y=(Q,T) be two proper transformation semigroups, let X∘Y=(P×Q,SQ∗T) be the wreath product of these semigroups, and let L⊆A+ be a language recognized by the transformation semigroup X∘Y. This means that there exist a state (p0,q0)∈P×Q, a subset F⊆P×Q and a homomorphism η:A+→SQ∗T such that L={w∈A+:(p0,q0)η(w)∈F}. Let π denote the natural projection from SQ∗T onto T, which is given by π(f,t)=t, for all f:Q→S and all t∈T. Then put φ=π∘η. Thus φ is a homomorphism of A+ into T. Now, as in the last paragraph but one, we may associate with φ the sequential transducer 𝒯φ=(T1,A,BT,1,⋅,∗) and the sequential function σφ:A∗→B∗T realized by this sequential transducer. In this situation, the arguments assembled in [7, Sec. 3.2] can be applied. It is, however, important to note in this connection that the identity of the monoid T1 can be required to act as the identity on the set Q. The fact that this is a feasible requirement follows from our assumption that Y=(Q,T) is a proper transformation semigroup. With this amendment, proceeding otherwise as in [7, Sec. 3.2], for the above language L we obtain the following statement.
Theorem 3.3. The language L⊆A+ is a finite union of languages of the form M∩σ−1φ(N), where M⊆A+ is a language recognized by the homomorphism φ and N⊆B+T is a language recognized by the transformation semigroup X.
From Theorems 3.2 and 3.3 it is easy to deduce the following consequence. This result, which will be of crucial importance in the following section, has also its counterpart in [7, Sec. 3.2]. However, since the respective arguments in [7] must be viewed as virtually flawed, we provide a full proof below.
Corollary 3.4. Let V and W be two pseudovarieties of semigroups, let 𝒱 and 𝒲 be the varieties of languages associated with the pseudovarieties V and W, respectively, and let 𝒵 be the variety of languages associated with the semidirect product V∗W of the pseudovarieties V and W. Then, for every alphabet A, the collection of languages A+𝒵 is the smallest Boolean algebra containing all languages from A+𝒲 and all languages of the form σ−1φ(N), where φ:A+→T is a homomorphism of A+ into a semigroup T∈W,σφ is the sequential function associated with φ, and N is any language from B+T𝒱.
Proof. Clearly, the collection A+𝒵 is a Boolean algebra and it contains all languages from A+𝒲. We shall show that it also contains all languages of the above form σ−1φ(N). Let 𝒯φ=(T1,A,BT,1,⋅,∗) be the sequential transducer associated with φ. Then σφ:A∗→B∗T is the sequential function realized by this sequential transducer. Let Y=(T1,R) be the transition semigroup of the underlying automaton 𝒜φ=(T1,A,⋅). Then R is a subsemigroup of T, and the right regular representation Z=(T1,T) of the semigroup T is a proper transformation semigroup. Recall also that T∈W. The language N is recognized by a proper transformation semigroup X=(P,S) such that S∈V. Now, applying Theorem 3.2, we come to the conclusion that the language σ−1φ(N) is recognized by the wreath product X∘Z. Consequently, this language σ−1φ(N) is recognized by the action semigroup of the wreath product X∘Z. But this action semigroup is equal to the semidirect product ST1∗T, and so it belongs to the pseudovariety V∗W. Therefore the language σ−1φ(N) indeed belongs to A+𝒵.
On the other hand, we have seen in the previous section (see the last paragraph but one in that section) that the semidirect product V∗W of the given pseudovarieties is obtained as the class of all divisors of the action semigroups SQ∗T of the wreath products X∘Y of arbitrary proper transformation semigroups X=(P,S) and Y=(Q,T) such that S∈V and T∈W. Furthermore, according to what we have seen previously in this section, every language over the alphabet A which is recognized by any divisor of such an action semigroup SQ∗T must be recognized already by this action semigroup itself, and every language over the alphabet A which is recognized by such an action semigroup SQ∗T is a Boolean combination of languages over the alphabet A that can be recognized by the initial wreath product X∘Y. Thus it remains to show that every language over the alphabet A which can be recognized by any wreath product X∘Y of the mentioned form belongs to the Boolean algebra generated by the languages from A+𝒲 and by the languages of the above form σ−1φ(N). But this follows immediately from Theorem 3.3. □
Now we approach the characterization of languages which can be recognized by the semigroups in the power pseudovariety PV, for any given pseudovariety of semigroups V. The result that we are going to present here has been obtained by Straubing in [9].
Let A and B be two alphabets and let 𝜗:A+→B+ be a homomorphism of semigroups. We say that this homomorphism 𝜗 is length preserving if the inclusion 𝜗(A)⊆B holds. Then, for every word u∈A+, the length of the word 𝜗(u) is the same as that of u. Then the following statement holds.
Theorem 3.5. Let V be any pseudovariety of semigroups, let 𝒱 be the variety of languages associated with V, and let 𝒲 be the variety of languages associated with the power pseudovariety PV. Then, for every alphabet B, the collection of languages B+𝒲 is the smallest Boolean algebra containing all languages of the form 𝜗(M), where M is any language from A+𝒱 for some alphabet A, and 𝜗:A+→B+ is a length preserving homomorphism.
We conclude this section with a characterization of languages that are recognized by commutative semigroups. That is, we shall characterize the languages belonging to the variety of languages 𝒞om that is associated to the pseudovariety Com of all commutative semigroups. This result comes from [6, Sec. 2.3].
Let A be an alphabet, let w∈A+ be a word and let a∈A be a letter. We denote by |w|a the number of occurrences of the letter a in the word w. For every positive integer n and for every k∈{0,1,2,…,n−1}, consider the language
Furthermore, for every non-negative integer r, consider the language
From these notes and from the last corollary in [6, Sec. 2.3] we infer the following description of the recognizable languages forming the variety of languages that is associated to the pseudovariety of all commutative semigroups.
Proposition 3.6. For every alphabet A, the collection of languages is the smallest Boolean algebra containing all languages over A of the form K(a,k,n), where n is a positive integer, and and all languages over A of the form L(a,r), where and r is a non-negative integer.
4. The Equality of Pseudovarieties
As we have already explained in the introduction, the purpose of this paper is to establish the equality of pseudovarieties given in the heading of this section. The inclusion has been verified already by Almeida in [1] and in [2, Sec. 11.9] using purely algebraic arguments. Thus it remains to establish also the opposite inclusion . This once, we shall do so by means of language theoretical methods. We have recalled in Theorem 3.1 the celebrated result of Eilenberg, according to which there are mutually inverse order preserving one-to-one correspondences between the lattice of all pseudovarieties of semigroups and the lattice of all varieties of languages. As in the introduction, we shall let be the variety of languages corresponding to the pseudovariety , and we shall let be the variety of languages corresponding to the pseudovariety . Then, in order to establish the inclusion , we shall struggle to verify that we have . That is, we shall strive to show that, for every alphabet A, we have the inclusion .
Once again, let be the variety of languages corresponding to the pseudovariety . Let further be the variety of languages corresponding to the pseudovariety , and let be the variety of languages corresponding to the pseudovariety . Since clearly , we have , which means that, for every alphabet A, we have the inclusion . Next we turn to the pseudovariety and to its corresponding variety of languages .
According to Straubing’s wreath product principle captured in Corollary 3.4, for every alphabet A, the collection of languages is the smallest Boolean algebra containing all languages from and all languages of the form , where is a homomorphism of into a semigroup , is the sequential function associated with , and J is any language from . Since clearly , we have , which means that, for every alphabet A, we have the inclusion . We have seen in the previous paragraph that we also have the inclusion . Therefore, for every alphabet A, we get the inclusion . Thus it remains to look after the languages of the form specified above. We need to show that also these languages belong to the collection .
Thus let be a homomorphism of into a semigroup , let be the sequential function associated with , and let J be any language from . Since and , there exists a positive integer m such that . Therefore the semigroup T satisfies the pseudoidentity . Recall that the sequential function is given, for every positive integer h and for arbitrary elements , by the formula
According to Proposition 3.6, the language J from is a Boolean combination of languages over the alphabet of the form , where is a letter in , so that and , n is a positive integer and , and of languages of the form , where is again a letter in , so that and and r is a non-negative integer. We have to show that then the language belongs to the collection of languages . Since the operator commutes with the Boolean operations, we see that it will be enough if we show that the languages of the forms and belong to . We already know from Theorem 3.2 that these languages are recognizable languages over the alphabet A.
We start with the languages of the form , where , , n is a positive integer, and . We shall see that, in this case, these languages actually belong already to the subset of . For this purpose, we shall show that the syntactic semigroups of these languages belong to the pseudovariety . We shall document this statement by verifying that these syntactic semigroups satisfy the pseudoidentities and .
Lemma 4.1. The syntactic semigroup of the language satisfies the pseudoidentity .
Proof. Let us denote briefly by the syntactic congruence of the language . Thus we wish to show that the syntactic semigroup satisfies the pseudoidentity . So let and be arbitrary words such that the class is an idempotent in . We need to show that then we have . By the definition of the syntactic congruence , this means to check that, for arbitrary words , we have
Now we have to return to our previous considerations regarding the computation of the word , for arbitrary elements , and to apply them to the above words and . Since , the first m letters of the alphabet in the words and occur within the initial segment of these words, and therefore they are the same. As far as the other letters of in the words and are concerned, we have seen that these letters arise from segments of the words and of length . Once again, since , such segments must appear as segments in one of the words , , and . In particular, if such a segment hits the word u in , then it must be a segment of the word . But then it is also a segment of the same word in . The same holds true of the segments of length of the words and which hit the word v. This consideration reveals that the number of occurrences of each individual letter of in both words and is the same. Hence, by the definition of the language , we certainly have , for arbitrary words , which yields that , as required. □
Lemma 4.2. The syntactic semigroup of the language satisfies the pseudoidentity .
Proof. As before, let us denote briefly by the syntactic congruence of the language . Thus we wish to show that the syntactic semigroup satisfies the pseudoidentity . So let and be arbitrary words such that the classes and are idempotents in . Let g be any positive integer such that the class is an idempotent in . We need to show that then we have . By the definition of the syntactic congruence , this means to check that, for arbitrary words , we have
Once again, we have to return to our previous considerations regarding the computation of the word , for arbitrary elements A, and to apply them to the above words and . Since , the first m letters of the alphabet appearing in these two words occur within the initial segment of these words, and therefore they are the same. Thus we have to look after the other appearances of the letters of the alphabet in these two words. We have seen that these letters arise from segments of the words and of length . Since and , the segments of length which may occur anywhere in the word , beyond those occurrences of such segments which appear already in the word , must be segments of the word , not counting the occurrences of these segments in the last factor of this word. Moreover, every such segment in the word must be a segment of either of the words or . Hence, since the exponent g is divisible by n, the number of these occurrences of every such segment in the word must be divisible by n. These considerations reveal that the numbers of occurrences of each individual letter of in the words and are congruent modulo n. Therefore, by the definition of the language , we have , for arbitrary words , which yields that , as required. □
We continue by verifying that the languages of the form , where , and r is a non-negative integer, belong to . We begin with a few preliminary considerations. So let be an arbitrary word. This means that . However, by the definition of the language , this is equivalent to the condition that the letter of the alphabet has exactly r appearances in the word . Assume, for the time being, that the length of the word w is greater than m. Let be the number of appearances of the mentioned letter in the initial segment of the word of length m. Then we can write for some non-negative integer . From our earlier considerations we know that the appearances of the letter in the word which do not occur in the initial segment of of length m arise from some segments of the word w of length . There may be several such pairwise distinct segments in the word w. So let be any of these segments of the word w of length which give rise to the letter . Let have s occurrences in the word w. Then . (We permit also the possibility that .) Consider now all s occurrences of this segment in the word w. Some of these occurrences of may overlap in w or they may touch each other. Let be all maximal segments of the word w which are covered completely by the mentioned occurrences of the segment in w. Then and for some words and . Let be an alphabet disjoint with A whose letters are in a one-to-one correspondence with the letters of A. Consider next the language . (If , and so , then take the language instead.) This language is a rational language over the alphabet , and therefore, by Kleene’s theorem, it is a recognizable language.
We shall have to deal with all possible languages over the alphabet of the form just mentioned. Every such language is completely determined by the given -tuple of words . Recall that such -tuples are characterized by the following properties. First of all, there is a word over the alphabet A of length which gives rise to the letter of the alphabet . According to what we have already specified above, this means that, if where , then and . Thereafter, the mentioned -tuple consists of non-empty words over the alphabet A such that the total number of occurrences of the segment in any of the words is equal to s and, at the same time, every letter of any of the words is covered by some occurrence of the segment in this word. Then certainly and there are evidently only finitely many possibilities how such -tuples may look like. Therefore there are only finitely many languages of the form which may arise in this way. (If , and so , then there is only the language which may so arise.) Let now be the finite union of all of these languages so determined. Then is again a recognizable language over the alphabet .
We are going to show that the recognizable language just designated belongs to the collection of languages which the variety of languages assigns to the alphabet . We intend to attain this goal by checking that the syntactic semigroup of this language belongs to the pseudovariety . We shall achieve this aim by verifying that this syntactic semigroup satisfies the pseudoidentities and .
For the sake of notational ease, let us denote briefly by the syntactic congruence of the language . Then the syntactic semigroup of the language is of the form . Recall that the language is a finite union of recognizable languages of the form . Let be those finitely many languages of the form just mentioned whose union constitutes the language . Let be the syntactic congruences of these languages. Then the syntactic semigroups of these languages are of the forms , . Consider the subdirect product of these syntactic semigroups consisting of all z-tuples of the form , where . Then this subdirect product recognizes the language , and so the syntactic semigroup of this language is a canonical homomorphic image of the mentioned subdirect product of syntactic semigroups of the languages . This canonical homomorphism maps every z-tuple of the form to the element .
Lemma 4.3. The syntactic semigroup of the language satisfies the pseudoidentity .
Proof. So let and be any words such that the class is an idempotent in the semigroup . We need to show that then we have . The element is the image under the above canonical homomorphism of the z-tuple . Since our syntactic semigroups are finite semigroups, there exists a positive integer such that the z-tuple is an idempotent. Thus, for every , the element is an idempotent, and so we have . By the definition of the syntactic congruence , for every words , we get . However, having in view that the language is of the form , we come to the following conclusion. If the word contained some elements from the alphabet A and if, for some words , we had , then we would certainly have . However, this means that the above condition would be violated. Thus, whenever , then, for any words , we must have . Therefore, in particular, in such a case, for any words , we must have and . However, by the definition of the syntactic congruence , this means that . This relation must hold for all . By the reasonings preceding this lemma, it hence eventually ensues that . Since here the element is an idempotent, and so it is equal to the element , we hence lastly infer that , as desired. Thus it remains to treat the case when .
We have to verify that, in this case, we also have . By the definition of the syntactic congruence , this means to check that, for any words , we have . Thus assume that . Since is a finite union of languages of the form , it hence ensues that we must have for some -tuple . Since we now have , the three appearances of the segment in the word must fall into some of the factors of the form or in the language . If , then the whole segment must fall into the same factor of the form or . But then, clearly, we get that , and so . The same conclusion follows if . Thus we may further assume that and . Then there exist indices satisfying such that the segment comes from the language and the segment comes from the language , and they are the leftmost and the rightmost positions of the factors of the form in these two languages that the appearances of the segment in the words and come from. But then, clearly, the word is an element of the language
Lemma 4.4. The syntactic semigroup of the language satisfies the pseudoidentity .
Proof. To start with, let us to recall the considerations performed in the paragraph preceding Lemma 4.3 and the notations introduced in that paragraph. Thus let now and be any words such that the classes and are idempotents in . Let g be any positive integer such that the class is an idempotent in . We need to show that then we have . The element is the image of the z-tuple , and the element is the image of the z-tuple . Then there exists a positive integer such that the z-tuple is an idempotent, and the z-tuple is an idempotent. Thus, for every , the element is an idempotent, which means that we have . By the definition of the syntactic congruence , for every words , we get . However, since the language is of the form , we can infer the following conclusion. If the word contained some elements from the alphabet A and if, for some words , we had , then we would certainly have . But this would contradict the above condition . Thus, whenever , then, for any words , we must have . Therefore, in particular, in such a case, for any words , we must have and . But, by the definition of the syntactic congruence , this means that . This relation must hold for all . By the reasonings performed in the paragraph preceding Lemma 4.3, it hence eventually ensues that . Since here the elements and are idempotents, and so they are equal to the elements and , respectively, we hence deduce, in the end, that , as desired. We have arrived at this conclusion under the assumption that . Likewise, we could come to the same conclusion if we went from the assumption that . Thus it remains to deal with the case when and .
We have to verify that, in this case, we also have . By the definition of the syntactic congruence , this means to check that, for any words , we have . Thus assume that . Since is a finite union of languages of the form , it hence ensues that we must have for some -tuple . Since we now have and , we see that the whole segment in the word must fall into a single factor of the form or in the language . However, then it is evident that we also get , and so we obtain . In a similar manner, it is possible to show that if , then also . These findings assure that we indeed have , again as desired. □
Let further be the language over the alphabet A consisting of all words which one may obtain from the words of the language by replacing in them every occurrence of any letter from by its respective letter form A. It is clear that then the language comprises exactly those words over the alphabet A which contain at least s occurrences of the segment . Now we wish to sort out of these words those ones which contain exactly s occurrences of the segment . That is, we intend to remove from those words which contain more than s occurrences of . For this purpose, we must be aware that, up to now, we have performed all our considerations with an arbitrary non-negative integer s. Thus we may replace everywhere in the foregoing reasonings this integer s with the integer . In this way, we come up with the languages and . Then, of course, the language comprises exactly those words over the alphabet A which contain at least occurrences of the segment . And then the language consists precisely of those words over the alphabet A which contain exactly s occurrences of the segment .
We have shown in Lemmas 4.3 and 4.4 that the syntactic semigroups of the languages over the alphabet , for whichever non-negative integers s, belong to the pseudovariety . We have considered subsequently the languages , again for any non-negative integers s, over the alphabet A, whose creation from the languages can also be expressed in the following terms. Let us consider the homomorphism of free semigroups determined by the requirements that should map every element of A to itself and should map every element of the disjoint copy of A to its respective counterpart in A. Then is a length preserving homomorphism. Then we have . By Theorem 3.5, we know that then are again recognizable languages which are recognized by suitable semigroups from the power pseudovariety . Therefore these languages belong to the collection of languages . Of course, then also the language belongs to this collection . We have next seen above that the language consists precisely of those words over the alphabet A which contain exactly s occurrences of the segment . Recall that has been a word in the alphabet A of length giving rise to the letter of the alphabet . There may be several such distinct words which give rise to the same letter in this way. So let be all words enjoying this property. Let further be any non-negative integers. Then for every , we may consider the language . Then the language consists of those words which, for every , contain exactly occurrences of the segment . Denote this last language by . This language then again belongs to the collection . Let finally be any non-negative integer. Consider all -tuples of non-negative integers such that . Then we may form the language . This language, which we shall denote briefly by , then also belongs to the collection .
It should now be clear that the language consists precisely of those words which enjoy the following property. We focus first on the words whose length is greater than m. Then such a word w belongs to the language if and only if the number of those appearances of the letter in the word which do not occur in the initial segment of of length m is equal to . As far as the words whose length does not exceed m are concerned, the following holds true. If , then all such words w belong to the language . If, on the contrary, , then the language contains no words w of this kind.
Let be a non-negative integer such that . There are only finitely many words u in the alphabet A of length equal to m. Let be the set of all words u of this kind having the property that the number of appearances of the letter in the word is equal to . Then is a finite language over the alphabet A. Consider next the language . Then is a recognizable language. It is a familiar fact— see [6, Sec. 2.3]— that this language can be recognized by a semigroup from the pseudovariety . Since clearly , and as , we see that this language can be recognized by a semigroup from the pseudovariety , and hence it belongs to the collection of languages . Note that this language contains no words w whose length does not exceed m. Take next any non-negative integer and consider the language . This language then also belongs to the collection and it consists of all those words w in the alphabet A of length greater that m that have the property that, in the word , there are appearances of the letter in the initial segment of this word of length m and there are appearances of the letter in the rest of this word. Finally, for every non-negative integer r, consider all pairs of non-negative integers such that . Then we may consider the language . This last language then again belongs to the collection and it consists of those words w in the alphabet A whose length is greater than m and which possess the property that, in the word , there are exactly r appearances of the letter .
At last, given a non-negative integer r, we may consider the set of all words v in the alphabet A of length not exceeding m and having the property that, in the word , there are exactly r appearances of the letter . Then is a finite language over the alphabet A, which can therefore be recognized by a nilpotent semigroup, and so it can be certainly recognized by a semigroup from the pseudovariety . Hence this language belongs to the collection of languages . Finally we may consider the language . This last language then again belongs to the collection and it consists of all those words w in the alphabet A which possess the property that, in the word , there are exactly r appearances of the letter . But this property means that . Consequently, the last-mentioned language is precisely our language . Therefore this language belongs to the collection , as desired.