The R package vrnmf
implements a set of methods to
perform non-negative matrix decomposition with minimum volume
constraints. A general problem is to decompose a non-negative matrix
in a product of non-negative matrix
and matrix
of lower rank r:
.
In case of additional non-negativity constraints on the matrix
, the
problem is known as NMF.
This problem, and NMF as a particular case, is not identifiable in the general case, meaning that there are potentially many different solutions that deliver the same decomposition quality [1]. This both makes interpretation of factorized matrices challenging and limits applications of NMF to instrumental dimensionality reduction. However, recent theoretical advances have shown that the issue can be overcome under a relatively mild assumption based on “spread”. That is, the column vectors of C are known as “sufficiently spread”[2-3] if the matrix C is non-negative and the matrix C has sufficiently spread column vectors then the volume minimization of a matrix D delivers a correct and unique, up to a scale and permutation, solution (C, D).
The AnchorFree approach enables efficient estimation of matrix C by reformulating the problem in the covariance domain using the application of the volume minimization criterion [4]. A short walkthrough can be found here
A more general formulation of the problem that accounts for noise in matrix X, such that only approximately , is called volume-regularized NMF (vrnmf). To balance goodness of matrix approximation and matrix D volume, vrnmf minimizes the following objective function [5-6]:
We provide implementation of vrnmf
approach and devise
its reformulation in covariance domain.
To install the stable version from CRAN, use:
install.packages('vrnmf')
To install the latest version of vrnmf, use:
install.packages('devtools')
::install_github('kharchenkolab/vrnmf') devtools
The R package can be cited as:
Ruslan Soldatov, Peter Kharchenko, Viktor Petukhov, and Evan Biederstedt (2021). vrnmf:
Volume-regularized structured matrix factorization. R package version
1.0.2. https://github.com/kharchenkolab/vrnmf
If you find this software useful for your research, please cite the corresponding paper:
Vladimir B. Seplyarskiy Ruslan A. Soldatov, et al.
Population sequencing data reveal a compendium of mutational processes in the human germ line.
Science, 12 Aug 2021. doi: 10.1126/science.aba7408
[1] K. Huang, N. D. Sidiropoulos and A. Swami. Non-Negative Matrix Factorization Revisited: Uniqueness and Algorithm for Symmetric Decomposition. IEEE Transactions on Signal Processing, vol. 62, no. 1, pp. 211-224, Jan.1, 2014, doi: 10.1109/TSP.2013.2285514.
[2] C.-H. Lin, W.-K. Ma, W.-C. Li, C.-Y. Chi, and A. Ambikapathi. Identifiability of the simplex volume minimization criterion for blind hyperspectral unmixing: The no-pure-pixel case. IEEE Trans. Geosci. Remote Sens., vol. 53, no. 10, pp. 5530–5546, Oct 2015.
[3] X. Fu, W.-K. Ma, K. Huang, and N. D. Sidiropoulos. Blind separation of quasi-stationary sources: Exploiting convex geometry in covariance domain. IEEE Trans. Signal Process., vol. 63, no. 9, pp. 2306–2320, May 2015.
[4] X. Fu, K. Huang, N. D. Sidiropoulos, Q. Shi, and M. Hong. Anchorfree correlated topic modeling. IEEE Trans. Patt. Anal. Machine Intell., vol. to appear, DOI: 10.1109/TPAMI.2018.2827377, 2018.
[5] X. Fu, K. Huang, B. Yang, W. Ma and N. D. Sidiropoulos. Robust Volume Minimization-Based Matrix Factorization for Remote Sensing and Document Clustering. IEEE Transactions on Signal Processing, vol. 64, no. 23, pp. 6254-6268, 1 Dec.1, 2016, doi: 10.1109/TSP.2016.2602800.
[6] A. M. S. Ang and N. Gillis. Algorithms and Comparisons of Nonnegative Matrix Factorizations With Volume Regularization for Hyperspectral Unmixing. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 12, no. 12, pp. 4843-4853, Dec. 2019, doi: 10.1109/JSTARS.2019.2925098.