Identifying Non-Redundant Gene Markers from Microarray Data: A Multiobjective Variable Length PSO-Based Approach


Abstract

Identifying relevant genes which are responsible for various types of cancer is an important problem. In this context, important genes refer to the marker genes which change their expression level in correlation with the risk or progression of a disease, or with the susceptibility of the disease to a given treatment. Gene expression profiling by microarray technology has been successfully applied to classification and diagnostic prediction of cancers. However, extracting these marker genes from a huge set of genes contained by the microarray data set is a major problem. Most of the existing methods for identifying marker genes find a set of genes which may be redundant in nature. Motivated by this, a multiobjective optimization method has been proposed which can find a small set of non-redundant disease related genes providing high sensitivity and specificity simultaneously. In this article, the optimization problem has been modeled as a multiobjective one which is based on the framework of variable length particle swarm optimization. Using some real-life data sets, the performance of the proposed algorithm has been compared with that of other state-of-the-art techniques.