On the stochastic gradient descent matrix factorization in application to the supervised classification of microarrays

 pdf (776K)  / List of references

Microarray datasets are highly dimensional, with a small number of collected samples in comparison to thousands of features. This poses a significant challenge that affects the interpretation, applicability and validation of the analytical results. Matrix factorizations have proven to be a useful method for describing data in terms of a small number of meta-features, which reduces noise, while still capturing the essential features of the data. Three novel and mutually relevant methods are presented in this paper: 1) gradient-based matrix factorization with two adaptive learning rates (in accordance with the number of factor matrices) and their automatic updates; 2) nonparametric criterion for the selection of the number of factors; and 3) nonnegative version of the gradient-based matrix factorization which doesn't require any extra computational costs in difference to the existing methods. We demonstrate effectiveness of the proposed methods to the supervised classification of gene expression data.

Keywords: matrix factorization, unsupervised learning, number of factors, nonnegativity, bioinformatics, leaveone-out, classification
Citation in English: Nikulin V.N. On the stochastic gradient descent matrix factorization in application to the supervised classification of microarrays // Computer Research and Modeling, 2013, vol. 5, no. 2, pp. 131-140
Citation in English: Nikulin V.N. On the stochastic gradient descent matrix factorization in application to the supervised classification of microarrays // Computer Research and Modeling, 2013, vol. 5, no. 2, pp. 131-140
DOI: 10.20537/2076-7633-2013-5-2-131-140
Citations: 4 (RSCI).

Full-text version of the journal is also available on the web site of the scientific electronic library eLIBRARY.RU

The journal is included in the Russian Science Citation Index

The journal is included in the List of Russian peer-reviewed journals publishing the main research results of PhD and doctoral dissertations.

International Interdisciplinary Conference "Mathematics. Computing. Education"

The journal is included in the RSCI

Indexed in Scopus