Loading [MathJax]/jax/element/mml/optable/BasicLatin.js
World Scientific
Skip main navigation

Cookies Notification

We use cookies on this site to enhance your user experience. By continuing to browse the site, you consent to the use of our cookies. Learn More
×

System Upgrade on Tue, May 28th, 2024 at 2am (EDT)

Existing users will be able to log into the site and access content. However, E-commerce and registration of new users may not be available for up to 12 hours.
For online purchase, please visit us again. Contact us at customercare@wspc.com for any enquiries.

The pseudovariety ComD viewed as a power pseudovariety

    https://doi.org/10.1142/S0218196725500043Cited by:0 (Source: Crossref)

    Abstract

    The objective of this paper is to document that the pseudovariety equation PX=ComD admits a solution U. This solution U is a pseudovariety raised up by Almeida in his book on finite semigroups and universal algebra as a prospective candidate for such a solution. This result achieved in the present paper resolves an open problem posed by Almeida in his book a long time ago.

    1. Introduction

    The semidirect product on the level of pseudovarieties of semigroups has been widely studied in the past. See [2, Chap. 10] for a survey of results obtained until mid-nineties in this direction. Likewise the power operator on the level of pseudovarieties of semigroups has been the subject of intensive study in the past. See [2, Chap. 11] for an overview of results obtained by that time in this area. From time to time these two routes of research have met up with each other. One particular instance of such a fruitful encounter is represented by the study of locally commutative power pseudovarieties, whose results are summarized in [2, Sec. 11.9]. In this context, the pseudovariety ComD has arisen, which has been previously the object of concern of researchers in the domain of semidirect products of pseudovarieties of semigroups. See [2, Sec. 10.7] in this respect, for example.

    Recall yet that Com is the pseudovariety of all commutative semigroups, D is the pseudovariety of all definite semigroups, that is, the pseudovariety of all semigroups whose idempotents act as right zeros and ComD is the pseudovariety of semigroups generated by the class of all semidirect products of commutative semigroups by definite semigroups. An effective characterization of the pseudovariety ComD has been provided by Thérien and Weiss in [10]. Their result actually says that ComD is the pseudovariety of semigroups determined by the pseudoidentity exfyezfezfyexf. See also [2, Sec. 10.7] once again in this regard.

    Let LCom be the pseudovariety of all locally commutative semigroups. That is, LCom consists of all finite semigroups S having the property that, for every idempotent e in S, the submonoid eSe of S is a commutative monoid. Yet otherwise stated, LCom is the pseudovariety of semigroups determined by the pseudoudentity exeyeeyexe. In this context, Almeida has proposed in [1] and in [2, Sec. 11.9] to consider the subpseudovariety U of the pseudovariety LCom determined within LCom by the additional pseudoidentity (ef)ωexfexf. It is easy to check that, within LCom, the pseudoidentity (ef)ωexfexf is equivalent to the pseudoidentity exf(ef)ωexf, which shows that the pseudovariety U is self-dual. In fact, it is straightforward to infer hence that the pseudovariety U consists of all finite semigroups S having the property that, for every idempotents e,f in S, the subsemigroup eSf of S is a commutative monoid which admits the element (ef)ω for its identity.

    Furthermore, Almeida considered in [1] and in [2, Sec. 11.9] the power pseudovariety PU. Recall that PU is the pseudovariety generated by the class of all power semigroups 𝒫(S), where S are arbitrary semigroups from the pseudovariety U. Having this pseudovariety in view, Almeida has shown in [1] and in [2, Sec. 11.9] that the pseudovariety ComD provides an upper bound for the power pseudovariety PU. That is, he has established the inclusion PUComD, and he has done so by verifying directly that the power semigroups 𝒫(S), for all semigroups S in the pseudovariety U, do satisfy the above-mentioned pseudoidentity exfyezfezfyexf. He has also shown, in addition, that, for any pseudovariety V of semigroups, the condition PVComD is equivalent to the condition VU. Thus U is the largest pseudovariety for which the inclusion PUComD holds. However, the question whether the equality PU=ComD actually holds here has been left open in [2, Sec. 11.9].

    The purpose of this paper is to establish the missing inclusion ComDPU. This is the inclusion that is opposite relative to the one discussed above. In this way, it eventually emerges that, in reality, the equality PU=ComD does hold. Therefore the pseudovariety ComD is indeed a power pseudovariety. It is equal to the pseudovariety PU, where U is the pseudovariety introduced above.

    In order to achieve the appointed goal, we shall take advantage of the tools furnished by formal language theory. It is a celebrated result in the theory of varieties of recognizable languages attributed to Eilenberg that there are mutually inverse order preserving one-to-one correspondences between the lattice of all pseudovarieties of semigroups and the lattice of all varieties of languages. See [3, Sec. VII.3] for Eilenberg’s original treatment of this subject. Having this in view, we shall let 𝒱 be the variety of languages corresponding to the pseudovariety ComD, and we shall let 𝒲 be the variety of languages corresponding to the pseudovariety PU. Then, in our efforts to establish the inclusion ComDPU, we shall be done if we can show that we have 𝒱𝒲. This is exactly what we shall strive for in this paper.

    The other tools from formal language theory that we shall make use of include Straubing’s wreath product principle, expounded in detail by Pin and Weil in [7], which makes it possible to underpin the languages appertaining to the variety 𝒱 corresponding to the pseudovariety ComD, and Straubing’s characterization of languages recognized by power semigroups, obtained in [9], which allows us to handle the languages belonging to the variety 𝒲 corresponding to the power pseudovariety PU.

    2. Preliminaries

    All semigroups appearing in this paper, with the exception of free semigroups, will be finite. For every semigroup S, we shall denote by E(S) the set of all idempotents in S. We shall denote by S1 the monoid which is equal to S provided that S already is a monoid, and which is obtained from S by adjoining an identity to it otherwise.

    By a pseudovariety of semigroups we shall mean any class of finite semigroups which is closed under the formation of direct products of finite families of semigroups and under the formation of subsemigroups and homomorphic images of semigroups. Such pseudovarieties can be determined by collections of pseudoidentities, as it has been the case on several instances in the previous section. We shall not need the concept of pseudoidentity in its full generality, and so we shall not embark on this subject here. We only mention that, in such a pseudoidentity uv, apart from letters like x,y,z, which are reserved for variables that may be substituted by arbitrary elements from a semigroup S in which the pseudoidentity uv is evaluated, also letters like e,f,g, may appear which are reserved for constituents that may be substituted only by idempotents from E(S).

    We have met already in the previous section the pseudovariety D of all definite semigroups. This pseudovariety consists of all finite semigroups S having the property that every idempotent in E(S) acts as a right zero in S. Thus D is the pseudovariety determined by the pseudoidentity xee. For every positive integer m, let Dm be the pseudovariety determined by the pseudoidentity xy1y2ymy1y2ym. Then it turns out that D=m=1Dm. Later on, we shall also need the pseudovariety K which is dual with respect to the pseudovariety D. That is, K consists of all finite semigroups S having the property that every idempotent in E(S) acts as a left zero in S. Thus K is the pseudovariety determined by the pseudoidentity exe. For every positive integer m, let Km be the pseudovariety determined by the pseudoidentity x1x2xmyx1x2xm. Then it turns out that K=m=1Km.

    We shall have to deal in this paper with semidirect products of pseudovarieties of semigroups. To start with, we shall have to work with semidirect products of individual semigroups of the form ST, where S and T are arbitrary semigroups. We shall observe here the custom introduced by Almeida at the beginning of [2, Sec. 10.1], according to which, even if T is viewed as a semigroup, if T happens coincidentally to be a monoid, then the underlying left action of T on S giving rise to the semidirect product ST should be left unitary. With that, if V and W are any pseudovarieties of semigroups, then the semidirect product VW of these two pseudovarieties is defined to be the pseudovariety of semigroups generated by the class of all semidirect products of the form ST, where S is a semigroup from V and T is a semigroup from W, observing the custom regarding the underlying left action of T on S mentioned above. It is easy to verify that this semidirect product VW consists, in fact, of all divisors of the semidirect products ST just described.

    Aside from ordinary semigroups, we shall have to deal also with transformation semigroups. To begin with, let P be a finite set. We shall denote by 𝒯P the monoid of all transformations of the set P. Note that we shall let the transformations from 𝒯P act on the elements of the set P on the right. That is, given an element pP and a transformation τ of the set P, we shall denote by pτ the result of the application of τ on p. Therefore the composition of transformations from 𝒯P should be interpreted accordingly, that is, it has to be read from the left to the right. Having this at hand, by a transformation semigroup we shall mean any pair X=(P,S) where P is a finite set and S is a subsemigroup of the monoid 𝒯P of all transformations of the set P. The set P will be called the underlying set of X and the semigroup S will be called the action semigroup of X.

    In accordance with [3, Sec. I.10] or [2, Sec. 10.1], we introduce the operation of wreath product of transformation semigroups in the following manner. Let X=(P,S) and Y=(Q,T) be transformation semigroups. The wreath product XY of these transformation semigroups is the transformation semigroup having for its underlying set the cartesian product P×Q and having for its action semigroup the semidirect product SQT of the semigroup SQ of all functions of Q into S by the semigroup T determined by the left action of T on SQ given by the following formula. For every transformation τT and for every function f:QS, the function τf:QS is defined by the rule q(τf)=(qτ)f, for every qQ, where functions are written on the right of their arguments. In order to keep the promise that XY so determined should indeed be a transformation semigroup, it remains to observe that the action semigroup SQT may be viewed as a subsemigroup of the monoid 𝒯P×Q in the following way. For every function f:QS and for every transformation τT, the element (f,τ) of SQT determines a transformation ϱ(f,τ) of the set P×Q sending every pair (p,q) from P×Q to the pair (p(qf),qτ). Then the assignment (f,τ)ϱ(f,τ) is an embedding of SQT into 𝒯P×Q. Therefore the semigroup SQT may indeed be treated as a subsemigroup of the monoid 𝒯P×Q. In this way, XY becomes again a transformation semigroup.

    However, the operation of wreath product XY of transformation semigroups X=(P,S) and Y=(Q,T) so introduced has a certain defect. It appears when the semigroup T is a monoid, but the transformation representing the identity of T is not equal to the identity on the set Q. In such a case, the left action of T on SQ determined above need not be left unitary. And this is in conflict with our previous requirements regarding the underlying left action of the semigroup T on SQ, if the semidirect product SQT has to be properly defined. Having this circumstance in view, we have introduced in [4] the concept of proper transformation semigroups. A transformation semigroup X=(P,S) is said to be proper if either S is not a monoid, or S is a monoid and the identity of S acts as the identity on the set P. If we restrict the scope of the above definition of wreath products only to proper transformation semigroups, then the difficulty that we have encountered above disappears. And it is not too hard to verify that the wreath product XY of two proper transformation semigroups X and Y is a proper transformation semigroup again.

    Most frequent examples of proper transformation semigroups are the right regular representations of ordinary semigroups. For any semigroup S and for every element sS, we may consider the transformation ρs:S1S1 which maps every element tS1 to the element ts. Then the mapping S𝒯S1 given by the formula sρs is an embedding of the semigroup S into 𝒯S1. In this way, a proper transformation semigroup (S1,S) arises, which is called the right regular representation of S. Thus, for any semigroups S and T, we may consider the wreath product (S1,S)(T1,T) of the right regular representations of these semigroups. This wreath product is again a proper transformation semigroup and its action semigroup ST1T is just the standard wreath product ST of the semigroups S and T. It is a familiar fact that, for any semigroups S and T and for every left action of T on S, where the custom regarding the left actions that has been introduced above is observed, the semidirect product ST determined by this left action can be embedded into the wreath product ST.

    As a consequence, we see that, for any two pseudovarieties of semigroups V and W, the semidirect product VW of these pseudovarieties is equal to the pseudovariety generated by the class of all wreath products of the form ST, where S is a semigroup in V and T is a semigroup in W. In fact, this semidirect product VW can be obtained as the class of all divisors of the wreath products ST just described. Finally note that, on the other hand, the semidirect product VW contains all action semigroups SQT of the wreath products XY of arbitrary proper transformation semigroups X=(P,S) and Y=(Q,T), where the semigroup S belongs to V and the semigroup T belongs to W.

    We close this section with a few remarks on power semigroups and power pseudovarieties. For any semigroup S, we denote by 𝒫(S) the set of all subsets of S equipped with the usual multiplication. In this way, 𝒫(S) becomes a semigroup, called the power semigroup of S. For any pseudovariety V of semigroups, we let PV be the pseudovariety generated by all semigroups of the form 𝒫(S), where S stands for arbitrary semigroups from V. Note that, if V contains a non-trivial monoid, then PV can be actually obtained as the class of all semigroups T that divide some semigroup of the form 𝒫(S), where S is any semigroup in V. The pseudovariety PV arising in this way from V is called the power pseudovariety of V.

    3. Formal Language Theory

    Let A be a finite set. Within the context of the theory of formal languages, we shall say that A is an alphabet. Elements of this set A will be called letters. By A we shall denote the free monoid on the set A, and by A+ we shall denote the free semigroup on the set A. Elements of A will be called words in the alphabet A. By 1 we shall denote the empty word, that is, the identity of the monoid A. By a language over the alphabet A we shall mean any set of words in the alphabet A, that is, any subset L of the free monoid A. Later on, however, we shall have to deal with languages L that will be subsets of the free semigroup A+, that is, such languages L will not contain the empty word 1.

    By an automaton we shall mean any triplet 𝒜=(Q,A,) where Q is a finite set of states, A is an alphabet and ⋅ is a function Q×AQ. One can extend this function ⋅ to a function Q×AQ, also denoted by ⋅, by putting successively q1=q and q(wa)=(qw)a, for all qQ, wA and aA.

    Let LA be a language and let 𝒜=(Q,A,) be an automaton. We shall say that the language L is recognized by the automaton 𝒜 if there exist a state q0Q, called the initial state and a subset FQ of states, called the set of terminal states, such that L={wA:q0wF}. Languages that are recognized by some automaton are called recognizable languages.

    If we have to deal only with languages LA+, then a slightly different notion of recognizability of such languages by means of automata suggests itself. Let again 𝒜=(Q,A,) be an automaton. Then we may say that the language L is recognized by the automaton 𝒜 if there exist an initial state q0Q and a subset FQ of terminal states such that L={wA+:q0wF}. But if a language LA+ is recognized by an automaton 𝒜 in this latter sense, then it can also be recognized by an automaton ¯𝒜 in the former sense, where the automaton ¯𝒜 is only slightly adapted from 𝒜. For languages LA+, we shall usually have in view just this latter altered notion of recognizability.

    One can consider quite a lot of operations on the languages over the given alphabet A. In the first place these are the standard Boolean operations, namely the union KL and the intersection KL of any two languages K,LA, and the complement A\L of a language LA. Further operations that one may consider are the product KL of two languages K,LA and the star L of a language LA. The product KL is defined to be the language KL={uwA:uK,wL}. The star L is defined to be the submonoid of A generated by the language L.

    We proceed to introduce another family of languages, namely the so-called rational languages. The class of all rational languages over the alphabet A is defined to be the smallest subclass of the class of all languages over the alphabet A which contains all finite languages and is closed under the formation of unions of pairs of languages, of products of pairs of languages and of stars of languages. Now the classical Kleene’s theorem asserts that a language over an alphabet A is recognizable if and only if it is rational. See [5, Sec. 6.3], for example.

    Given an automaton 𝒜=(Q,A,), every word wA determines a transformation ζw of the set Q given by the rule qqw, for every qQ. The transformations ζw of the set Q, for all words wA, form a submonoid of the monoid 𝒯Q of all transformations of the set Q. This submonoid, which we shall denote by M(𝒜), is called the transition monoid of the automaton 𝒜. Note that this monoid M(𝒜) is generated by the set of all transformations of the set Q of the form ζa, for arbitrary elements aA. If we take in the above considerations only the transformations ζw of the set Q such that wA+, then these transformations will form a subsemigroup of the monoid 𝒯Q. This subsemigroup, which we shall denote by S(𝒜), is called the transition semigroup of the automaton 𝒜. Note also that then the pair (Q,S(𝒜)) forms a transformation semigroup. Later on, by the transition semigroup of the automaton 𝒜 we shall often mean just this transformation semigroup.

    From now on, we shall confine ourselves to languages LA+. So let LA+ be such a language, let S be a finite semigroup, and let φ:A+S be a homomorphism of semigroups. We shall say that the language L is recognized by the homomorphism φ if there exists a subset PS such that L=φ1(P). If this is the case, then we shall also say that the language L is recognized by the semigroup S. It turns out that a language LA+ is recognized by a semigroup in this way if and only if L is recognized by an automaton, as described above. Indeed, if the language LA+ is recognized by a semigroup S and if φ:A+S is the underlying homomorphism of semigroups, then L is recognized by the automaton 𝒜S=(S1,A,), where the function ⋅ is determined, for any sS1 and any aA, by the formula sa=sφ(a). On the other hand, if the language LA+ is recognized by an automaton 𝒜=(Q,A,), then L is recognized by the transition semigroup S(𝒜) of this automaton.

    The possibility to treat in this manner the notion of recognizability of languages over an alphabet A in terms of homomorphisms of the free semigroup A+ into finite semigroups makes it possible to verify easily the following statements. If LA+ is a recognizable language, then its complement A+\L is also a recognizable language. If K,LA+ are recognizable languages, then their union KL and their intersection KL are also recognizable languages.

    It is also worth noticing that if LA+ is a language that can be recognized by a semigroup S and if T is a semigroup such that S divides T, then L can also be recognized by T.

    Let again LA+ be a language. We shall say that this language L is recognized by a transformation semigroup X=(Q,S) if there exist a homomorphism of semigroups φ:A+S, a state q0Q, called the initial state, and a subset FQ of states, called the set of terminal states, such that L={wA+:q0φ(w)F}. As before, it turns out that a language LA+ is recognized by a transformation semigroup in the way just specified if and only if L is recognized by an automaton in the way described above. Namely, if the language LA+ is recognized by a transformation semigroup X=(Q,S) and if φ:A+S is the underlying homomorphism of semigroups, then L is recognized by the automaton 𝒜X=(Q,A,), where the function ⋅ is determined, for any qQ and any aA, by the formula qa=qφ(a). On the other hand, if the language LA+ is recognized by an automaton 𝒜=(Q,A,), then L is recognized by the transition semigroup (Q,S(𝒜)) of this automaton.

    Also the following notes on the various notions of recognizability will come in handy. If a language LA+ is recognized by a transformation semigroup X=(Q,S), then L is recognized by the transformation semigroup (S1,S). The latter semigroup, which is the right regular representation of the semigroup S, is itself a proper transformation semigroup. It is also easy to see that a language LA+ is recognized by a semigroup S if and only if L is recognized by its right regular representation (S1,S).

    If X=(Q,S) is a transformation semigroup and if LA+ is a language that can be recognized by the semigroup S, then L is a Boolean combination of languages over the alphabet A that can be recognized by X. Indeed, if φ:A+S is a homomorphism and if PS is a subset such that L=φ1(P), then it turns out that L=τPqQ{wA+:qφ(w)=qτ}. The latter languages are clearly recognized by X.

    There are plenty of other operations on the languages over various alphabets A. For example, given a language LA+ and a word uA+, one can define the left quotient u1L and the right quotient Lu1 of the language L relative to the word u as the languages u1L={wA+:uwL} and Lu1={wA+:wuL}. As previously, if LA+ is a recognizable language and uA+ is any word, then the left quotient u1L and the right quotient Lu1 are recognizable languages, as well.

    Another operation on languages over distinct alphabets A and B is that of taking inverse images of languages over B with respect to various homomorphisms of free semigroups A+B+. Thus let LB+ be a language and let ψ:A+B+ be a homomorphism of free semigroups. Then one may consider the inverse image ψ1(L) of the language L with respect to ψ. Then, of course, ψ1(L)A+. And once again, if L is a recognizable language over B, then ψ1(L) turns out to be a recognizable language over A.

    Let LA+ be any language. We shall say that a congruence on the free semigroup A+ saturates the language L if L is a union of certain classes of the partition A+/. There exists the coarsest congruence L on A+ which saturates the language L. We will call this congruence L the syntactic congruence of the language L. It is given, for all words u,vA+, by the formula

    uLvif and only iffor allp,qA:puqLpvqL.
    In this way, the quotient semigroup A+/L arises. This quotient semigroup A+/L will be called the syntactic semigroup of the language L. In general, this semigroup A+/L need not be finite. However, it turns out that the language LA+ is recognizable if and only if its syntactic semigroup A+/L is finite. If this is the case, then the language L is recognized by its syntactic semigroup A+/L. Moreover, whenever S is any semigroup such that L is recognized by S, then the syntactic semigroup A+/L divides S.

    Now we are ready to evoke the celebrated Eilenberg’s theorem relating to each other pseudovarieties of finite semigroups and varieties of recognizable languages. Recall once again that we confine ourselves here to recognizable languages LA+, for various alphabets A. Within this framework, this theorem has been presented already by Eilenberg in [3, Sec. VII.3]. Another exposition of this result has been provided by Pin in [6, Sec. 2.2]. The other version of this theorem involving pseudovarieties of finite monoids and recognizable languages LA has been furnished, aside from [3] and [6], also by Lallement in [5, Sec. 6.5].

    We begin with yet another definition. By a class of recognizable languages we shall mean any mapping 𝒞 assigning to every alphabet A a set A+𝒞 of recognizable languages over A. (All of these languages are assumed to be subsets of A+.) If 𝒞 and 𝒟 are two classes of recognizable languages, then we shall write 𝒞𝒟 if, for every alphabet A, we have the inclusion A+𝒞A+𝒟. In this way, a partial order on the family of all classes of recognizable languages is introduced.

    By a variety of languages we shall mean any class 𝒱 of recognizable languages such that the following conditions are satisfied:

    (1)

    For every alphabet A, the set A+𝒱 is a Boolean algebra, that is, A+𝒱 is closed with respect to the operation of set theoretical complement of languages (relative to the base set A+) and with respect to the operations of set theoretical union and intersection of pairs of languages.

    (2)

    For every alphabet A, for every language L in A+𝒱 and for every element aA, also the languages a1L and La1 belong to A+𝒱.

    (3)

    Whenever A and B are alphabets and ψ:A+B+ is a homomorphism of free semigroups, then the preimage ψ1(L) of any language L from B+𝒱 belongs to A+𝒱.

    Let now V be any pseudovariety of semigroups. We assign to V a class of recognizable languages 𝒱 in the following way. For every alphabet A, we let A+𝒱 be the set of all languages LA+ having the property that the syntactic semigroup A+/L belongs to V. Then 𝒱 is a variety of languages.

    We may also proceed the other way round. Let 𝒱 be any variety of languages. We assign to 𝒱 a pseudovariety V of semigroups as follows. We let V be the pseudovariety generated by all syntactic semigroups of the form A+/L, where A is any alphabet and L is any language from A+𝒱.

    Then we may formulate the announced Eilenberg’s theorem.

    Theorem 3.1. The assignments V𝒱 and 𝒱V described above determine mutually inverse order preserving one-to-one correspondences between the lattice of all pseudovarieties of semigroups and the partially ordered family of all varieties of languages.

    Of course, it hence ensues immediately that also the partially ordered family of all varieties of languages forms a lattice.

    In the continuation of this section, we proceed to explain the aforementioned Straubing’s wreath product principle making it possible to handle languages recognized by semigroups that belong to the semidirect products of couples of pseudovarieties of semigroups. As the original source of these ideas, the paper [8] of Straubing is usually being quoted. But it is not straightforward to identify this principle in that paper. Our exposition in this paper will follow closely that of Pin and Weil in [7, Sec. 3], where they have drawn it up for ordered semigroups. The unordered version of these results hence follows as a special case. Besides, in the course of our exposition that we offer here, we shall try to fix some shortcomings that have appeared in [7].

    By a sequential transducer we shall mean a machine which can be formally described as any sextuplet 𝒯=(Q,A,B,q0,,), where Q is a finite set of states, A is an alphabet called the input alphabet, B is another alphabet called the output alphabet, q0Q is the initial state, ⋅ is a function Q×AQ called the transition function, and ∗ is a function Q×AB called the output function. Thus the triplet 𝒜=(Q,A,) is an automaton, and hence the transition function ⋅ can be extended to a function Q×AQ, also denoted by ⋅, in the manner described previously. In addition, the output function ∗ can be extended to a function Q×AB, also denoted by ∗, by putting successively q1=1 and q(wa)=(qw)((qw)a), for all qQ, wA and aA. The function realized by the given sequential transducer 𝒯 is the function ς:AB defined by the formula ς(w)=q0w, for all words wA. Finally, given two alphabets A and B, by a sequential function we shall mean any function σ:AB which can be realized by a sequential transducer of the form 𝒯=(Q,A,B,q0,,). Having all that at hand, and arguing in quite the same way as in [7, Sec. 3.1], we arrive at the following result.

    Theorem 3.2. Let A and B be any alphabets and let σ:AB be a sequential function realized by a sequential transducer 𝒯=(Q,A,B,q0,,). Let Y=(Q,T) be the transition semigroup of the underlying automaton 𝒜=(Q,A,). Let Z=(Q,U) be any proper transformation semigroup such that TU. If LB+ is a language recognized by a proper transformation semigroup X=(P,S), then the language σ1(L)A+ is recognized by the wreath product XZ of the transformation semigroups X and Z.

    Let A be an alphabet, let T be a finite semigroup, and let φ:A+T be a homomorphism of semigroups. We shall associate with this homomorphism φ a sequential transducer 𝒯φ which we shall construct in the following way. We set BT=T1×A. Then we put 𝒯φ=(T1,A,BT,1,,), where the transition function :T1×AT1 is given, for each tT1 and aA, by the formula ta=tφ(a), and the output function :T1×ABT is given, again for each tT1 and aA, by the formula ta=(t,a). The sequential function σφ:ABT realized by this sequential transducer 𝒯φ will be called the sequential function associated with the homomorphism φ. Then it is easy to verify that, for any positive integer h and for arbitrary elements a1,a2,,ahA, the equality

    σφ(a1a2ah)=(1,a1)(φ(a1),a2)(φ(a1a2),a3)(φ(a1a2ah1),ah)
    holds.

    Note that, when constructing the sequential transducer 𝒯φ, in comparison with the opening passage of [7, Sec. 3.2], we do not assume here the homomorphism φ:A+T to be surjective. Therefore also the transition semigroup of the underlying automaton 𝒜φ=(T1,A,) need not be equal to the entire transformation semigroup (T1,T). But these properties are not exploited in the subsequent reasonings in [7, Sec. 3.2]. Thus we can manage without these properties.

    Let now X=(P,S) and Y=(Q,T) be two proper transformation semigroups, let XY=(P×Q,SQT) be the wreath product of these semigroups, and let LA+ be a language recognized by the transformation semigroup XY. This means that there exist a state (p0,q0)P×Q, a subset FP×Q and a homomorphism η:A+SQT such that L={wA+:(p0,q0)η(w)F}. Let π denote the natural projection from SQT onto T, which is given by π(f,t)=t, for all f:QS and all tT. Then put φ=πη. Thus φ is a homomorphism of A+ into T. Now, as in the last paragraph but one, we may associate with φ the sequential transducer 𝒯φ=(T1,A,BT,1,,) and the sequential function σφ:ABT realized by this sequential transducer. In this situation, the arguments assembled in [7, Sec. 3.2] can be applied. It is, however, important to note in this connection that the identity of the monoid T1 can be required to act as the identity on the set Q. The fact that this is a feasible requirement follows from our assumption that Y=(Q,T) is a proper transformation semigroup. With this amendment, proceeding otherwise as in [7, Sec. 3.2], for the above language L we obtain the following statement.

    Theorem 3.3. The language LA+ is a finite union of languages of the form Mσ1φ(N), where MA+ is a language recognized by the homomorphism φ and NB+T is a language recognized by the transformation semigroup X.

    From Theorems 3.2 and 3.3 it is easy to deduce the following consequence. This result, which will be of crucial importance in the following section, has also its counterpart in [7, Sec. 3.2]. However, since the respective arguments in [7] must be viewed as virtually flawed, we provide a full proof below.

    Corollary 3.4. Let V and W be two pseudovarieties of semigroups, let 𝒱 and 𝒲 be the varieties of languages associated with the pseudovarieties V and W, respectively, and let 𝒵 be the variety of languages associated with the semidirect product VW of the pseudovarieties V and W. Then, for every alphabet A, the collection of languages A+𝒵 is the smallest Boolean algebra containing all languages from A+𝒲 and all languages of the form σ1φ(N), where φ:A+T is a homomorphism of A+ into a semigroup TW,σφ is the sequential function associated with φ, and N is any language from B+T𝒱.

    Proof. Clearly, the collection A+𝒵 is a Boolean algebra and it contains all languages from A+𝒲. We shall show that it also contains all languages of the above form σ1φ(N). Let 𝒯φ=(T1,A,BT,1,,) be the sequential transducer associated with φ. Then σφ:ABT is the sequential function realized by this sequential transducer. Let Y=(T1,R) be the transition semigroup of the underlying automaton 𝒜φ=(T1,A,). Then R is a subsemigroup of T, and the right regular representation Z=(T1,T) of the semigroup T is a proper transformation semigroup. Recall also that TW. The language N is recognized by a proper transformation semigroup X=(P,S) such that SV. Now, applying Theorem 3.2, we come to the conclusion that the language σ1φ(N) is recognized by the wreath product XZ. Consequently, this language σ1φ(N) is recognized by the action semigroup of the wreath product XZ. But this action semigroup is equal to the semidirect product ST1T, and so it belongs to the pseudovariety VW. Therefore the language σ1φ(N) indeed belongs to A+𝒵.

    On the other hand, we have seen in the previous section (see the last paragraph but one in that section) that the semidirect product VW of the given pseudovarieties is obtained as the class of all divisors of the action semigroups SQT of the wreath products XY of arbitrary proper transformation semigroups X=(P,S) and Y=(Q,T) such that SV and TW. Furthermore, according to what we have seen previously in this section, every language over the alphabet A which is recognized by any divisor of such an action semigroup SQT must be recognized already by this action semigroup itself, and every language over the alphabet A which is recognized by such an action semigroup SQT is a Boolean combination of languages over the alphabet A that can be recognized by the initial wreath product XY. Thus it remains to show that every language over the alphabet A which can be recognized by any wreath product XY of the mentioned form belongs to the Boolean algebra generated by the languages from A+𝒲 and by the languages of the above form σ1φ(N). But this follows immediately from Theorem 3.3. □

    Now we approach the characterization of languages which can be recognized by the semigroups in the power pseudovariety PV, for any given pseudovariety of semigroups V. The result that we are going to present here has been obtained by Straubing in [9].

    Let A and B be two alphabets and let 𝜗:A+B+ be a homomorphism of semigroups. We say that this homomorphism 𝜗 is length preserving if the inclusion 𝜗(A)B holds. Then, for every word uA+, the length of the word 𝜗(u) is the same as that of u. Then the following statement holds.

    Theorem 3.5. Let V be any pseudovariety of semigroups, let 𝒱 be the variety of languages associated with V, and let 𝒲 be the variety of languages associated with the power pseudovariety PV. Then, for every alphabet B, the collection of languages B+𝒲 is the smallest Boolean algebra containing all languages of the form 𝜗(M), where M is any language from A+𝒱 for some alphabet A, and 𝜗:A+B+ is a length preserving homomorphism.

    We conclude this section with a characterization of languages that are recognized by commutative semigroups. That is, we shall characterize the languages belonging to the variety of languages 𝒞om that is associated to the pseudovariety Com of all commutative semigroups. This result comes from [6, Sec. 2.3].

    Let A be an alphabet, let wA+ be a word and let aA be a letter. We denote by |w|a the number of occurrences of the letter a in the word w. For every positive integer n and for every k{0,1,2,,n1}, consider the language

    K(a,k,n)={wA+:|w|akmod
    Then K(a,k,n) is a recognizable language. Indeed, let n be the additive group of integers modulo n and let φa:A+n be the homomorphism assigning to every word wA+ the number |w|a taken modulo n. Then we have K(a,k,n)=φa1(k). Since obviously nCom, we hence see that K(a,k,n)A+𝒞om.

    Furthermore, for every non-negative integer r, consider the language

    L(a,r)={wA+:|w|a=r}.
    Then L(a,r) is also a recognizable language. Indeed, let n be any positive integer greater than r and let 1,n be the cyclic monoid of order n+1 and period 1. This means that if g is the generator, then the elements of 1,n are 1,g,g2,,gn with the convention that gn+1=gn. Then let ψa:A+1,n be the homomorphism assigning to every word wA+ the element g|w|a. Here g0 is to be understood as 1. Then we have L(a,r)=ψa1(gr). Since clearly 1,nCom, we see that L(a,r)A+𝒞om.

    From these notes and from the last corollary in [6, Sec. 2.3] we infer the following description of the recognizable languages forming the variety of languages 𝒞om that is associated to the pseudovariety Com of all commutative semigroups.

    Proposition 3.6. For every alphabet A, the collection of languages A+𝒞om is the smallest Boolean algebra containing all languages over A of the form K(a,k,n), where aA, n is a positive integer, and k{0,1,2,,n1}, and all languages over A of the form L(a,r), where aA, and r is a non-negative integer.

    4. The Equality of Pseudovarieties PU=ComD

    As we have already explained in the introduction, the purpose of this paper is to establish the equality of pseudovarieties given in the heading of this section. The inclusion PUComD has been verified already by Almeida in [1] and in [2, Sec. 11.9] using purely algebraic arguments. Thus it remains to establish also the opposite inclusion ComDPU. This once, we shall do so by means of language theoretical methods. We have recalled in Theorem 3.1 the celebrated result of Eilenberg, according to which there are mutually inverse order preserving one-to-one correspondences between the lattice of all pseudovarieties of semigroups and the lattice of all varieties of languages. As in the introduction, we shall let 𝒱 be the variety of languages corresponding to the pseudovariety ComD, and we shall let 𝒲 be the variety of languages corresponding to the pseudovariety PU. Then, in order to establish the inclusion ComDPU, we shall struggle to verify that we have 𝒱𝒲. That is, we shall strive to show that, for every alphabet A, we have the inclusion A+𝒱A+𝒲.

    Once again, let 𝒞om be the variety of languages corresponding to the pseudovariety Com. Let further 𝒟 be the variety of languages corresponding to the pseudovariety D, and let 𝒰 be the variety of languages corresponding to the pseudovariety U. Since clearly UPU, we have 𝒰𝒲, which means that, for every alphabet A, we have the inclusion A+𝒰A+𝒲. Next we turn to the pseudovariety ComD and to its corresponding variety of languages 𝒱.

    According to Straubing’s wreath product principle captured in Corollary 3.4, for every alphabet A, the collection of languages A+𝒱 is the smallest Boolean algebra containing all languages from A+𝒟 and all languages of the form σφ1(J), where φ:A+T is a homomorphism of A+ into a semigroup TD, σφ is the sequential function associated with φ, and J is any language from BT+𝒞om. Since clearly DU, we have 𝒟𝒰, which means that, for every alphabet A, we have the inclusion A+𝒟A+𝒰. We have seen in the previous paragraph that we also have the inclusion A+𝒰A+𝒲. Therefore, for every alphabet A, we get the inclusion A+𝒟A+𝒲. Thus it remains to look after the languages of the form σφ1(J) specified above. We need to show that also these languages belong to the collection A+𝒲.

    Thus let φ:A+T be a homomorphism of A+ into a semigroup TD, let σφ be the sequential function associated with φ, and let J be any language from BT+𝒞om. Since TD and D=m=1Dm, there exists a positive integer m such that TDm. Therefore the semigroup T satisfies the pseudoidentity xy1y2ymy1y2ym. Recall that the sequential function σφ is given, for every positive integer h and for arbitrary elements a1,a2,,ahA, by the formula

    σφ(a1a2ah)=(1,a1)(φ(a1),a2)(φ(a1a2),a3)(φ(a1a2ah1),ah).
    Consider any index i{1,2,,h} such that m<i. Then we have
    (φ(a1a2ai1),ai)=(φ(a1)φ(a2)φ(ai1),ai)=(φ(aim)φ(aim+1)φ(ai1),ai)=(φ(aimaim+1ai1),ai).
    This finding shows that, when computing the word σφ(a1a2ah), apart from the initial m segments a1, a1a2,,a1a2am of the word a1a2ah, they are only the segments of the word a1a2ah of length m+1 which do matter. Thus, in the word σφ(a1a2ah), only the initial letters (1,a1), (φ(a1),a2), (φ(a1a2),a3), …, (φ(a1a2am1),am) of the alphabet BT will appear first, and then only the letters of the form (φ(ajaj+1aj+m1),aj+m), for all j{1,2,,hm}, of the alphabet BT will appear subsequently.

    According to Proposition 3.6, the language J from BT+𝒞om is a Boolean combination of languages over the alphabet BT=T1×A of the form K((t,a),k,n), where (t,a) is a letter in BT, so that tT1 and aA, n is a positive integer and k{0,1,2,,n1}, and of languages of the form L((t,a),r), where (t,a) is again a letter in BT, so that tT1 and aA and r is a non-negative integer. We have to show that then the language σφ1(J) belongs to the collection of languages A+𝒲. Since the operator σφ1 commutes with the Boolean operations, we see that it will be enough if we show that the languages of the forms σφ1(K((t,a),k,n)) and σφ1(L((t,a),r)) belong to A+𝒲. We already know from Theorem 3.2 that these languages are recognizable languages over the alphabet A.

    We start with the languages of the form σφ1(K((t,a),k,n)), where tT1, aA, n is a positive integer, and k{0,1,2,,n1}. We shall see that, in this case, these languages actually belong already to the subset A+𝒰 of A+𝒲. For this purpose, we shall show that the syntactic semigroups of these languages belong to the pseudovariety U. We shall document this statement by verifying that these syntactic semigroups satisfy the pseudoidentities exeyeeyexe and (ef)ωexfexf.

    Lemma 4.1. The syntactic semigroup of the language σφ1(K((t,a),k,n)) satisfies the pseudoidentity exeyeeyexe.

    Proof. Let us denote briefly by K the syntactic congruence of the language σφ1(K((t,a),k,n)). Thus we wish to show that the syntactic semigroup A+/K satisfies the pseudoidentity exeyeeyexe. So let u,vA+ and 𝜀A+ be arbitrary words such that the class [𝜀]K is an idempotent in A+/K. We need to show that then we have 𝜀u𝜀v𝜀K𝜀v𝜀u𝜀. By the definition of the syntactic congruence K, this means to check that, for arbitrary words p,qA, we have

    p𝜀u𝜀v𝜀qσφ1(K((t,a),k,n))p𝜀v𝜀u𝜀qσφ1(K((t,a),k,n)).
    Equivalently, this means to verify that, for any words p,qA, we have
    σφ(p𝜀u𝜀v𝜀q)K((t,a),k,n)σφ(p𝜀v𝜀u𝜀q)K((t,a),k,n).
    Since the class [𝜀]K is an idempotent in A+/K, we have 𝜀K𝜀2, which says that, for arbitrary words c,dA, we have c𝜀dσφ1(K((t,a),k,n))c𝜀2dσφ1(K((t,a),k,n)), or, which is the same, σφ(c𝜀d)K((t,a),k,n)σφ(c𝜀2d)K((t,a),k,n). But this entails that, in the formulae displayed above, we could replace every occurrence of the word 𝜀 with an arbitrarily large positive power of 𝜀. Therefore we may assume, in addition, that the length |𝜀| of the word 𝜀 is greater than the value m drawn above.

    Now we have to return to our previous considerations regarding the computation of the word σφ(a1a2ah), for arbitrary elements a1,a2,,ahA, and to apply them to the above words σφ(p𝜀u𝜀v𝜀q) and σφ(p𝜀v𝜀u𝜀q). Since |𝜀|>m, the first m letters of the alphabet BT in the words σφ(p𝜀u𝜀v𝜀q) and σφ(p𝜀v𝜀u𝜀q) occur within the initial segment σφ(p𝜀) of these words, and therefore they are the same. As far as the other letters of BT in the words σφ(p𝜀u𝜀v𝜀q) and σφ(p𝜀v𝜀u𝜀q) are concerned, we have seen that these letters arise from segments of the words p𝜀u𝜀v𝜀q and p𝜀v𝜀u𝜀q of length m+1. Once again, since |𝜀|>m, such segments must appear as segments in one of the words p𝜀, 𝜀u𝜀, 𝜀v𝜀 and 𝜀q. In particular, if such a segment hits the word u in p𝜀u𝜀v𝜀q, then it must be a segment of the word 𝜀u𝜀. But then it is also a segment of the same word 𝜀u𝜀 in p𝜀v𝜀u𝜀q. The same holds true of the segments of length m+1 of the words p𝜀u𝜀v𝜀q and p𝜀v𝜀u𝜀q which hit the word v. This consideration reveals that the number of occurrences of each individual letter of BT in both words σφ(p𝜀u𝜀v𝜀q) and σφ(p𝜀v𝜀u𝜀q) is the same. Hence, by the definition of the language K((t,a),k,n), we certainly have σφ(p𝜀u𝜀v𝜀q)K((t,a),k,n)σφ(p𝜀v𝜀u𝜀q)K((t,a),k,n), for arbitrary words p,qA, which yields that 𝜀u𝜀v𝜀K𝜀v𝜀u𝜀, as required. □

    Lemma 4.2. The syntactic semigroup of the language σφ1(K((t,a),k,n)) satisfies the pseudoidentity (ef)ωexfexf.

    Proof. As before, let us denote briefly by K the syntactic congruence of the language σφ1(K((t,a),k,n)). Thus we wish to show that the syntactic semigroup A+/K satisfies the pseudoidentity (ef)ωexfexf. So let uA+ and 𝜀,ϰA+ be arbitrary words such that the classes [𝜀]K and [ϰ]K are idempotents in A+/K. Let g be any positive integer such that the class [(𝜀ϰ)g]K is an idempotent in A+/K. We need to show that then we have (𝜀ϰ)g𝜀uϰK𝜀uϰ. By the definition of the syntactic congruence K, this means to check that, for arbitrary words p,qA, we have

    p(𝜀ϰ)g𝜀uϰqσφ1(K((t,a),k,n))p𝜀uϰqσφ1(K((t,a),k,n)).
    Equivalently, this means to verify that, for any words p,qA, we have
    σφ(p(𝜀ϰ)g𝜀uϰq)K((t,a),k,n)σφ(p𝜀uϰq)K((t,a),k,n).
    The same arguments as previously show that, in the above formulae, we could replace every occurrence of any of the words 𝜀 and ϰ with an arbitrarily large positive power of this word. Therefore we may assume, in addition, that the lengths |𝜀| and |ϰ| of the words 𝜀 and ϰ are both greater than the value m drawn above. Furthermore, virtually the same reasoning also reveals that, in the above formulae, we could replace the word (𝜀ϰ)g with an arbitrary positive power of this word. Therefore we may assume, in addition, that the exponent g is divisible by the positive integer n.

    Once again, we have to return to our previous considerations regarding the computation of the word σφ(a1a2ah), for arbitrary elements a1,a2,,ah A, and to apply them to the above words σφ(p(𝜀ϰ)g𝜀uϰq) and σφ(p𝜀uϰq). Since |𝜀|>m, the first m letters of the alphabet BT appearing in these two words occur within the initial segment σφ(p𝜀) of these words, and therefore they are the same. Thus we have to look after the other appearances of the letters of the alphabet BT in these two words. We have seen that these letters arise from segments of the words p(𝜀ϰ)g𝜀uϰq and p𝜀uϰq of length m+1. Since |𝜀|>m and |ϰ|>m, the segments of length m+1 which may occur anywhere in the word p(𝜀ϰ)g𝜀uϰq, beyond those occurrences of such segments which appear already in the word p𝜀uϰq, must be segments of the word (𝜀ϰ)g𝜀, not counting the occurrences of these segments in the last factor 𝜀 of this word. Moreover, every such segment in the word (𝜀ϰ)g𝜀 must be a segment of either of the words 𝜀ϰ or ϰ𝜀. Hence, since the exponent g is divisible by n, the number of these occurrences of every such segment in the word (𝜀ϰ)g𝜀 must be divisible by n. These considerations reveal that the numbers of occurrences of each individual letter of BT in the words σφ(p(𝜀ϰ)g𝜀uϰq) and σφ(p𝜀uϰq) are congruent modulo n. Therefore, by the definition of the language K((t,a),k,n), we have σφ(p(𝜀ϰ)g𝜀uϰq)K((t,a),k,n)σφ(p𝜀uϰq)K((t,a),k,n), for arbitrary words p,qA, which yields that (𝜀ϰ)g𝜀uϰK𝜀uϰ, as required. □

    We continue by verifying that the languages of the form σφ1(L((t,a),r)), where tT1, aA and r is a non-negative integer, belong to A+𝒲. We begin with a few preliminary considerations. So let wσφ1(L((t,a),r)) be an arbitrary word. This means that σφ(w)L((t,a),r). However, by the definition of the language L((t,a),r), this is equivalent to the condition that the letter (t,a) of the alphabet BT has exactly r appearances in the word σφ(w). Assume, for the time being, that the length |w| of the word w is greater than m. Let r0 be the number of appearances of the mentioned letter (t,a) in the initial segment of the word σφ(w) of length m. Then we can write r=r0+r̄ for some non-negative integer r̄. From our earlier considerations we know that the appearances of the letter (t,a) in the word σφ(w) which do not occur in the initial segment of σφ(w) of length m arise from some segments of the word w of length m+1. There may be several such pairwise distinct segments in the word w. So let ξ be any of these segments of the word w of length m+1 which give rise to the letter (t,a). Let ξ have s occurrences in the word w. Then sr̄. (We permit also the possibility that s=0.) Consider now all s occurrences of this segment ξ in the word w. Some of these occurrences of ξ may overlap in w or they may touch each other. Let ϖ1,ϖ2,,ϖ be all maximal segments of the word w which are covered completely by the mentioned occurrences of the segment ξ in w. Then s and w=υ0ϖ1υ1ϖ2υ2υ1ϖυ for some words υ0,υA and υ1,υ2,,υ1A+. Let A¯ be an alphabet disjoint with A whose letters are in a one-to-one correspondence with the letters of A. Consider next the language A¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯. (If s=0, and so =0, then take the language A¯+ instead.) This language is a rational language over the alphabet AA¯, and therefore, by Kleene’s theorem, it is a recognizable language.

    We shall have to deal with all possible languages over the alphabet AA¯ of the form just mentioned. Every such language is completely determined by the given -tuple of words (ϖ1,ϖ2,,ϖ). Recall that such -tuples are characterized by the following properties. First of all, there is a word ξ over the alphabet A of length m+1 which gives rise to the letter (t,a) of the alphabet BT. According to what we have already specified above, this means that, if ξ=c0c1c2cm where c0,c1,c2,,cmA, then φ(c0c1cm1)=t and cm=a. Thereafter, the mentioned -tuple (ϖ1,ϖ2,,ϖ) consists of non-empty words over the alphabet A such that the total number of occurrences of the segment ξ in any of the words ϖ1,ϖ2,,ϖ is equal to s and, at the same time, every letter of any of the words ϖ1,ϖ2,,ϖ is covered by some occurrence of the segment ξ in this word. Then certainly s and there are evidently only finitely many possibilities how such -tuples (ϖ1,ϖ2,,ϖ) may look like. Therefore there are only finitely many languages of the form A¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯ which may arise in this way. (If s=0, and so =0, then there is only the language A¯+ which may so arise.) Let now Lξ(s) be the finite union of all of these languages so determined. Then Lξ(s) is again a recognizable language over the alphabet AA¯.

    We are going to show that the recognizable language Lξ(s) just designated belongs to the collection of languages which the variety of languages 𝒰 assigns to the alphabet AA¯. We intend to attain this goal by checking that the syntactic semigroup of this language Lξ(s) belongs to the pseudovariety U. We shall achieve this aim by verifying that this syntactic semigroup satisfies the pseudoidentities exeyeeyexe and (ef)ωexfexf.

    For the sake of notational ease, let us denote briefly by L the syntactic congruence of the language Lξ(s). Then the syntactic semigroup of the language Lξ(s) is of the form (AA¯)+/L. Recall that the language Lξ(s) is a finite union of recognizable languages of the form A¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯. Let L1,L2,,Lz be those finitely many languages of the form just mentioned whose union constitutes the language Lξ(s). Let L1,L2,,Lz be the syntactic congruences of these languages. Then the syntactic semigroups of these languages are of the forms (AA¯)+/L1, (AA¯)+/L2,,(AA¯)+/Lz. Consider the subdirect product of these syntactic semigroups consisting of all z-tuples of the form ([w]L1,[w]L2,,[w]Lz), where w(AA¯)+. Then this subdirect product recognizes the language Lξ(s), and so the syntactic semigroup (AA¯)+/L of this language Lξ(s) is a canonical homomorphic image of the mentioned subdirect product of syntactic semigroups of the languages L1,L2,,Lz. This canonical homomorphism maps every z-tuple of the form ([w]L1,[w]L2,,[w]Lz) to the element [w]L.

    Lemma 4.3. The syntactic semigroup (AA¯)+/L of the language Lξ(s) satisfies the pseudoidentity exeyeeyexe.

    Proof. So let 𝜀(AA¯)+ and u,v(AA¯)+ be any words such that the class [𝜀]L is an idempotent in the semigroup (AA¯)+/L. We need to show that then we have 𝜀u𝜀v𝜀L𝜀v𝜀u𝜀. The element [𝜀]L is the image under the above canonical homomorphism of the z-tuple ([𝜀]L1,[𝜀]L2,,[𝜀]Lz). Since our syntactic semigroups are finite semigroups, there exists a positive integer η such that the z-tuple ([𝜀η]L1,[𝜀η]L2,,[𝜀η]Lz) is an idempotent. Thus, for every i{1,2,,z}, the element [𝜀η]Li is an idempotent, and so we have 𝜀ηLi𝜀2η. By the definition of the syntactic congruence Li, for every words c,d(AA¯), we get c𝜀ηdLic𝜀2ηdLi. However, having in view that the language Li is of the form A¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯, we come to the following conclusion. If the word 𝜀 contained some elements from the alphabet A and if, for some words c,d(AA¯), we had c𝜀ηdLi, then we would certainly have c𝜀2ηdLi. However, this means that the above condition c𝜀ηdLic𝜀2ηdLi would be violated. Thus, whenever 𝜀A¯+, then, for any words c,d(AA¯), we must have c𝜀ηdLi. Therefore, in particular, in such a case, for any words p,q(AA¯), we must have p𝜀ηu𝜀ηv𝜀ηqLi and p𝜀ηv𝜀ηu𝜀ηqLi. However, by the definition of the syntactic congruence Li, this means that 𝜀ηu𝜀ηv𝜀ηLi𝜀ηv𝜀ηu𝜀η. This relation must hold for all i{1,2,,z}. By the reasonings preceding this lemma, it hence eventually ensues that 𝜀ηu𝜀ηv𝜀ηL𝜀ηv𝜀ηu𝜀η. Since here the element [𝜀]L is an idempotent, and so it is equal to the element [𝜀η]L, we hence lastly infer that 𝜀u𝜀v𝜀L𝜀v𝜀u𝜀, as desired. Thus it remains to treat the case when 𝜀A¯+.

    We have to verify that, in this case, we also have 𝜀u𝜀v𝜀L𝜀v𝜀u𝜀. By the definition of the syntactic congruence L, this means to check that, for any words p,q(AA¯), we have p𝜀u𝜀v𝜀qLξ(s)p𝜀v𝜀u𝜀qLξ(s). Thus assume that p𝜀u𝜀v𝜀qLξ(s). Since Lξ(s) is a finite union of languages of the form A¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯, it hence ensues that we must have p𝜀u𝜀v𝜀qA¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯ for some -tuple (ϖ1,ϖ2,,ϖ). Since we now have 𝜀A¯+, the three appearances of the segment 𝜀 in the word p𝜀u𝜀v𝜀q must fall into some of the factors of the form A¯ or A¯+ in the language A¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯. If uA¯+, then the whole segment 𝜀u𝜀 must fall into the same factor of the form A¯ or A¯+. But then, clearly, we get that p𝜀v𝜀u𝜀qA¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯, and so p𝜀v𝜀u𝜀qLξ(s). The same conclusion follows if vA¯+. Thus we may further assume that uA¯+ and vA¯+. Then there exist indices i,j,k{1,2,,} satisfying ij<k such that the segment 𝜀u𝜀 comes from the language A¯+ϖiA¯+ϖi+1A¯+A¯+ϖjA¯+ and the segment 𝜀v𝜀 comes from the language A¯+ϖj+1A¯+ϖj+2A¯+A¯+ϖkA¯+, and they are the leftmost and the rightmost positions of the factors of the form A¯+ in these two languages that the appearances of the segment 𝜀 in the words 𝜀u𝜀 and 𝜀v𝜀 come from. But then, clearly, the word p𝜀v𝜀u𝜀q is an element of the language

    A¯ϖ1A¯+ϖ2A¯+A¯+ϖi1A¯+ϖj+1A¯+ϖj+2A¯+A¯+ϖkA¯+ϖiA¯+ϖi+1A¯+A¯+ϖjA¯+ϖk+1A¯+ϖk+2A¯+A¯+ϖA¯.
    However, this language is also one of the constituents of the language Lξ(s). Therefore we see that we again have p𝜀v𝜀u𝜀qLξ(s). Similarly we can show that if p𝜀v𝜀u𝜀qLξ(s), then also p𝜀u𝜀v𝜀qLξ(s). This finding verifies that we indeed have 𝜀u𝜀v𝜀L𝜀v𝜀u𝜀, again as desired. □

    Lemma 4.4. The syntactic semigroup (AA¯)+/L of the language Lξ(s) satisfies the pseudoidentity (ef)ωexfexf.

    Proof. To start with, let us to recall the considerations performed in the paragraph preceding Lemma 4.3 and the notations introduced in that paragraph. Thus let now 𝜀,ϰ(AA¯)+ and u(AA¯)+ be any words such that the classes [𝜀]L and [ϰ]L are idempotents in (AA¯)+/L. Let g be any positive integer such that the class [(𝜀ϰ)g]L is an idempotent in (AA¯)+/L. We need to show that then we have (𝜀ϰ)g𝜀uϰL𝜀uϰ. The element [𝜀]L is the image of the z-tuple ([𝜀]L1,[𝜀]L2,,[𝜀]Lz), and the element [ϰ]L is the image of the z-tuple ([ϰ]L1,[ϰ]L2,,[ϰ]Lz). Then there exists a positive integer η such that the z-tuple ([𝜀η]L1,[𝜀η]L2,,[𝜀η]Lz) is an idempotent, and the z-tuple ([ϰη]L1,[ϰη]L2,,[ϰη]Lz) is an idempotent. Thus, for every i{1,2,,z}, the element [𝜀η]Li is an idempotent, which means that we have 𝜀ηLi𝜀2η. By the definition of the syntactic congruence Li, for every words c,d(AA¯), we get c𝜀ηdLic𝜀2ηdLi. However, since the language Li is of the form A¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯, we can infer the following conclusion. If the word 𝜀 contained some elements from the alphabet A and if, for some words c,d(AA¯), we had c𝜀ηdLi, then we would certainly have c𝜀2ηdLi. But this would contradict the above condition c𝜀ηdLic𝜀2ηdLi. Thus, whenever 𝜀A¯+, then, for any words c,d(AA¯), we must have c𝜀ηdLi. Therefore, in particular, in such a case, for any words p,q(AA¯), we must have p(𝜀ηϰη)g𝜀ηuϰηqLi and p𝜀ηuϰηqLi. But, by the definition of the syntactic congruence Li, this means that (𝜀ηϰη)g𝜀ηuϰηLi𝜀ηuϰη. This relation must hold for all i{1,2,,z}. By the reasonings performed in the paragraph preceding Lemma 4.3, it hence eventually ensues that (𝜀ηϰη)g𝜀ηuϰηL𝜀ηuϰη. Since here the elements [𝜀]L and [ϰ]L are idempotents, and so they are equal to the elements [𝜀η]L and [ϰη]L, respectively, we hence deduce, in the end, that (𝜀ϰ)g𝜀uϰL𝜀uϰ, as desired. We have arrived at this conclusion under the assumption that 𝜀A¯+. Likewise, we could come to the same conclusion if we went from the assumption that ϰA¯+. Thus it remains to deal with the case when 𝜀A¯+ and ϰA¯+.

    We have to verify that, in this case, we also have (𝜀ϰ)g𝜀uϰL𝜀uϰ. By the definition of the syntactic congruence L, this means to check that, for any words p,q(AA¯), we have p(𝜀ϰ)g𝜀uϰqLξ(s)p𝜀uϰqLξ(s). Thus assume that p(𝜀ϰ)g𝜀uϰqLξ(s). Since Lξ(s) is a finite union of languages of the form A¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯, it hence ensues that we must have p(𝜀ϰ)g𝜀uϰqA¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯ for some -tuple (ϖ1,ϖ2,,ϖ). Since we now have 𝜀A¯+ and ϰA¯+, we see that the whole segment (𝜀ϰ)g𝜀 in the word p(𝜀ϰ)g𝜀uϰq must fall into a single factor of the form A¯ or A¯+ in the language A¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯. However, then it is evident that we also get p𝜀uϰqA¯ϖ1A¯+ϖ2A¯+A¯+ϖA¯, and so we obtain p𝜀uϰqLξ(s). In a similar manner, it is possible to show that if p𝜀uϰqLξ(s), then also p(𝜀ϰ)g𝜀uϰqLξ(s). These findings assure that we indeed have (𝜀ϰ)g𝜀uϰL𝜀uϰ, again as desired. □

    Let further L̂ξ(s) be the language over the alphabet A consisting of all words which one may obtain from the words of the language Lξ(s) by replacing in them every occurrence of any letter from A¯ by its respective letter form A. It is clear that then the language L̂ξ(s) comprises exactly those words over the alphabet A which contain at least s occurrences of the segment ξ. Now we wish to sort out of these words those ones which contain exactly s occurrences of the segment ξ. That is, we intend to remove from L̂ξ(s) those words which contain more than s occurrences of ξ. For this purpose, we must be aware that, up to now, we have performed all our considerations with an arbitrary non-negative integer s. Thus we may replace everywhere in the foregoing reasonings this integer s with the integer s+1. In this way, we come up with the languages Lξ(s+1) and L̂ξ(s+1). Then, of course, the language L̂ξ(s+1) comprises exactly those words over the alphabet A which contain at least s+1 occurrences of the segment ξ. And then the language L̂ξ(s)\L̂ξ(s+1) consists precisely of those words over the alphabet A which contain exactly s occurrences of the segment ξ.

    We have shown in Lemmas 4.3 and 4.4 that the syntactic semigroups of the languages Lξ(s) over the alphabet AA¯, for whichever non-negative integers s, belong to the pseudovariety U. We have considered subsequently the languages L̂ξ(s), again for any non-negative integers s, over the alphabet A, whose creation from the languages Lξ(s) can also be expressed in the following terms. Let us consider the homomorphism λ:(AA¯)+A+ of free semigroups determined by the requirements that λ should map every element of A to itself and λ should map every element of the disjoint copy A¯ of A to its respective counterpart in A. Then λ is a length preserving homomorphism. Then we have L̂ξ(s)=λ(Lξ(s)). By Theorem 3.5, we know that then L̂ξ(s) are again recognizable languages which are recognized by suitable semigroups from the power pseudovariety PU. Therefore these languages belong to the collection of languages A+𝒲. Of course, then also the language L̂ξ(s)\L̂ξ(s+1) belongs to this collection A+𝒲. We have next seen above that the language L̂ξ(s)\L̂ξ(s+1) consists precisely of those words over the alphabet A which contain exactly s occurrences of the segment ξ. Recall that ξ has been a word in the alphabet A of length m+1 giving rise to the letter (t,a) of the alphabet BT. There may be several such distinct words ξ which give rise to the same letter (t,a) in this way. So let ξ1,ξ2,,ξμ be all words enjoying this property. Let further s1,s2,,sμ be any non-negative integers. Then for every i{1,2,,μ}, we may consider the language L̂ξi(si)\L̂ξi(si+1). Then the language i=1μ(L̂ξi(si)\L̂ξi(si+1)) consists of those words which, for every i{1,2,,μ}, contain exactly si occurrences of the segment ξi. Denote this last language by L¯ξ1,,ξμs1,,sμ. This language then again belongs to the collection A+𝒲. Let finally r̄ be any non-negative integer. Consider all μ-tuples (s1,s2,,sμ) of non-negative integers such that s1+s2++sμ=r̄. Then we may form the language s1++sμ=r̄L¯ξ1,,ξμs1,,sμ. This language, which we shall denote briefly by L¯(r̄), then also belongs to the collection A+𝒲.

    It should now be clear that the language L¯(r̄) consists precisely of those words wA+ which enjoy the following property. We focus first on the words wA+ whose length |w| is greater than m. Then such a word w belongs to the language L¯(r̄) if and only if the number of those appearances of the letter (t,a) in the word σφ(w) which do not occur in the initial segment of σφ(w) of length m is equal to r̄. As far as the words wA+ whose length |w| does not exceed m are concerned, the following holds true. If r̄=0, then all such words w belong to the language L¯(r̄). If, on the contrary, r̄>0, then the language L¯(r̄) contains no words w of this kind.

    Let r0 be a non-negative integer such that r0m. There are only finitely many words u in the alphabet A of length |u| equal to m. Let M(r0) be the set of all words u of this kind having the property that the number of appearances of the letter (t,a) in the word σφ(u) is equal to r0. Then M(r0) is a finite language over the alphabet A. Consider next the language M¯(r0)=M(r0)A+. Then M¯(r0) is a recognizable language. It is a familiar fact see [6, Sec. 2.3] that this language M¯(r0) can be recognized by a semigroup from the pseudovariety K. Since clearly KU, and as UPU, we see that this language M¯(r0) can be recognized by a semigroup from the pseudovariety PU, and hence it belongs to the collection of languages A+𝒲. Note that this language contains no words w whose length |w| does not exceed m. Take next any non-negative integer r̄ and consider the language M¯(r0)L¯(r̄). This language then also belongs to the collection A+𝒲 and it consists of all those words w in the alphabet A of length |w| greater that m that have the property that, in the word σφ(w), there are r0 appearances of the letter (t,a) in the initial segment of this word of length m and there are r̄ appearances of the letter (t,a) in the rest of this word. Finally, for every non-negative integer r, consider all pairs (r0,r̄) of non-negative integers such that r0+r̄=r. Then we may consider the language r0+r̄=r(M¯(r0)L¯(r̄)). This last language then again belongs to the collection A+𝒲 and it consists of those words w in the alphabet A whose length |w| is greater than m and which possess the property that, in the word σφ(w), there are exactly r appearances of the letter (t,a).

    At last, given a non-negative integer r, we may consider the set N¯(r) of all words v in the alphabet A of length |v| not exceeding m and having the property that, in the word σφ(v), there are exactly r appearances of the letter (t,a). Then N¯(r) is a finite language over the alphabet A, which can therefore be recognized by a nilpotent semigroup, and so it can be certainly recognized by a semigroup from the pseudovariety PU. Hence this language belongs to the collection of languages A+𝒲. Finally we may consider the language N¯(r)r0+r̄=r(M¯(r0)L¯(r̄)). This last language then again belongs to the collection A+𝒲 and it consists of all those words w in the alphabet A which possess the property that, in the word σφ(w), there are exactly r appearances of the letter (t,a). But this property means that σφ(w)L((t,a),r). Consequently, the last-mentioned language is precisely our language σφ1(L((t,a),r)). Therefore this language belongs to the collection A+𝒲, as desired.