CLASSIFICATION OF TUMORS BASED ON GENETIC EXPRESSIONS
Abstract
This paper analyzes the ability of different machine learning algorithms to find patterns in the levels of gene expression for the correct classification of the five different types of tumors: breast, colon, kidney, lung, and prostate. The machine learning techniques were selected according to the most used algorithms in the related works: Bayesian method, Decision Trees, and K-Nearest Neighbors. Three metrics were applied to test the performance of the classifiers: Precision, Recall, and F1-score. The results of Precision of the algorithms were 95.03% (Bayesian), 96.73% (Decision Trees), and 99.52% (K-Nearest Neighbors). A software prototype was developed to classify tumors based on genetic expressions utilizing these three algorithms with satisfactory results.
Publisher's Note:
There is an update on 2nd author’s affiliation. This information has been updated on 16th November 2022.