Some methods for constructing solutions of cluster analysis problems based on processing a set of solutions for the cluster analysis problem by a set of heuristic clustering algorithms, as well as regression analysis problems using a set of different supervised classification algorithms, are considered. The solution to the problem of cluster analysis is defined as the construction of a set of equivalent information matrices. Two information matrices are called equivalent to each other if they are equaled up to a permutation of the columns. The construction of the optimal committee solution of the problem of cluster analysis by a team of algorithms is reduced to solving one discrete optimization problem on permutations. An upper bound for the variation of the basic functional is obtained. An effective (polynomial) algorithm is proposed for its minimization. Heuristic methods for constructing collective clustering are considered.
The problem of restoring a function (regression recovery) as an application of some function (corrector) to the set of solutions of special supervised classification problems is considered. A preset training sample of feature descriptions of objects is assumed. In this case, all the problems associated with the heterogeneity of the features, their different informative or incomplete sampling, are failed on the classification problems. Based on this sample and the values of the dependent variable, a random collective of classification tasks is constructed. For a recognizable object, the class is computed, to which it belongs according to each algorithm. Then the corrector found earlier is applied to the class numbers. Two types of correctors are considered: the Bayesian and the average linear corrector. In the Bayesian corrector, the naive Bayesian method is used. The average linear corrector consists of calculating a linear combination of the mean values of the dependent value from the calculated values of the classes.