Please login to be able to save your searches and receive alerts for new content matching your search criteria.
Recently, the research interest in multi-objective optimization has increased remarkably. Most of the proposed methods use a population of solutions that are simultaneously improved trying to approximate them to the Pareto-optimal front. When the population size increases, the quality of the solutions tends to be better, but the runtime is higher. This paper presents how to apply parallel processing to enhance the convergence to the Pareto-optimal front, without increasing the runtime. In particular, we present an island-based parallelization of five multi-objective evolutionary algorithms: NSGAII, SPEA2, PESA, msPESA, and a new hybrid version we propose. Experimental results in some test problems denote that the quality of the solutions tends to improve when the number of islands increases.
Feature selection has become a powerful dimensional reduction strategy and an effective tool in handling high-dimensional data. Feature selection aims to reduce the dimension of the feature space, to speed up and reduce the cost of the learning model and that by selecting the most relevant feature subset to data mining and machine learning tasks. The selection of optimal feature subset is an optimization problem that proved to be NP-hard. Metaheuristics are traditionally used to deal with NP-hard problems since they are well known for solving complex and real-world problems in reasonable period of time. Genetic algorithm (GA) is one of the most popular metaheuristics algorithms, which proved to be effective for an accurate feature selection task. However, in the last few decades, data have become progressively larger in both numbers of instances and features. This paradigm is being popularly termed as Big Data. With the tremendous growth of dataset sizes, most current feature selection algorithms and exceptionally GA become unscalable. To improve the scalability of a feature selection algorithm on big data, the distributed computing strategy is always adopted such as MapReduce model and Hadoop system. In this paper, we first present a review for the most recent works which handle the use of Parallel Genetic algorithm in large datasets. Then, we will propose a new Parallel Genetic algorithm based on the Coarse-grained parallelization model (island model). The parallelization of the process and the distribution of the partitioning of data will be performed using Hadoop system with an Amazon cluster. The performance and the scalability of the proposed method were theoretically and empirically compared to the existing feature selection methods when handling large-scale datasets and results confirm the effectiveness of our proposed method.