We started with a representation of the basic tensor-tensor product. We were inspired by the elegance of the convolution-like structure of block-circulant matrices, which become block-diagonal under a Fourier transform. That resembled the duality we expected to see when shifting from one space to its dual. We wondered whether the other components needed to form an algebra could be defined, and whether decompositions could be built with their help.
We realized it was possible to generalize our framework, from the Fourier transform to any invertible transformation, and to prove the optimality of the resulting approximate representations.
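To make the diagonalization idea concrete, here is a minimal NumPy sketch, not code from the paper, of a tensor-tensor product computed in the Fourier domain: transform along the third dimension, multiply the frontal slices independently, and transform back. The function name `t_product` and the toy dimensions are our own; swapping the FFT for any invertible transform applied along the tube fibers gives the generalized product mentioned above.

```python
import numpy as np

def t_product(A, B):
    """Tensor-tensor product of A (m x p x n) and B (p x q x n).

    Block-circulant multiplication in the original domain becomes
    independent matrix products on the frontal slices after an FFT
    along the third (tube) dimension.
    """
    n = A.shape[2]
    Ah = np.fft.fft(A, axis=2)          # move to the Fourier domain
    Bh = np.fft.fft(B, axis=2)
    Ch = np.empty((A.shape[0], B.shape[1], n), dtype=complex)
    for k in range(n):                  # slice-wise (block-diagonal) products
        Ch[:, :, k] = Ah[:, :, k] @ Bh[:, :, k]
    return np.real(np.fft.ifft(Ch, axis=2))  # back to the original domain

# Replacing the FFT above with any invertible transform applied along the
# tube fibers yields the generalized product described in the text.
A = np.random.rand(4, 3, 5)
B = np.random.rand(3, 2, 5)
C = t_product(A, B)                     # C has shape (4, 2, 5)
```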
One challenge of high-dimensional constructs is the heavy computational demands they impose. Many applications today operate on high-dimensional datasets, and handling their ever-increasing size requires scalable algorithms.
Our new tensor algebra offers a way to manipulate tensor structures while preserving their dimensional integrity, and it provides favorable algebraic properties that until now were found only in matrix algebra. Our results indicate that truncating a decomposition in the new tensor algebra yields optimal approximations, mirroring the way the Eckart–Young theorem guarantees optimality for matrix representations.
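To make the analogy concrete, here is a paraphrase of the Eckart–Young-like statement. The notation is illustrative (we write * for the tensor-tensor product and t-rank for the number of retained terms); the precise statement and the conditions on the transform are in the paper.

```latex
% Tensor SVD under the new product and its k-term truncation (sketch):
\mathcal{A} = \mathcal{U} * \mathcal{S} * \mathcal{V}^{\top},
\qquad
\mathcal{A}_k = \sum_{i=1}^{k} \mathcal{U}(:,i,:) * \mathcal{S}(i,i,:) * \mathcal{V}(:,i,:)^{\top}

% Eckart--Young analog: the k-term truncation is a best approximation,
% in the Frobenius norm, among all tensors of t-rank at most k:
\|\mathcal{A} - \mathcal{A}_k\|_F
  = \min_{\operatorname{t\text{-}rank}(\mathcal{B}) \le k}
    \|\mathcal{A} - \mathcal{B}\|_F
```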
We based this result on a spectral analysis of the residual error of the truncated tensor-tensor representation, compared against a matrix representation spanning a space of equal dimension. Our analysis also indicates that the tensor representation is at worst comparable to the matrix representation, and often superior.
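For intuition only, the following sketch contrasts the mechanics of the two truncations on a toy array: a k-term tensor truncation carried out slice-wise in the Fourier domain versus the classical rank-k matrix truncation of a flattening. The function names and dimensions are illustrative, and this toy comparison does not reproduce the dimension-matched analysis in the paper.

```python
import numpy as np

def tensor_truncate(A, k):
    """k-term truncation under the Fourier-based tensor-tensor product:
    rank-k SVD of each frontal slice in the Fourier domain."""
    Ah = np.fft.fft(A, axis=2)
    Bh = np.zeros_like(Ah)
    for j in range(A.shape[2]):
        U, s, Vt = np.linalg.svd(Ah[:, :, j], full_matrices=False)
        Bh[:, :, j] = U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]
    return np.real(np.fft.ifft(Bh, axis=2))

def matrix_truncate(M, k):
    """Classical Eckart-Young rank-k truncation of a matrix."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]

# Toy comparison of residual errors (illustrative only; the paper's
# comparison is made at matched representation sizes).
A = np.random.rand(30, 40, 20)
k = 5
err_tensor = np.linalg.norm(A - tensor_truncate(A, k))
M = A.reshape(30, -1)
err_matrix = np.linalg.norm(M - matrix_truncate(M, k))
print(f"tensor residual: {err_tensor:.3f}, matrix residual: {err_matrix:.3f}")
```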
To compare the new truncated tensor approximation with other tensor frameworks, we show that several popular tensor frameworks can be viewed as special cases of the new algebra and expressed within the new framework. In the paper, we demonstrate that the proposed decomposition in the new tensor algebra outperforms these counterparts.
Apart from applying the results to improve the data analysis by autonomous vehicles, our research could also be of use for other AI applications. For example, one could create an end-to-end neural network that ingests tensorial objects and performs tensorial computation. Numerical (multi)linear algebra that involves computation on matrices and tensors is at the heart of many domains including scientific computing, machine learning and AI, quantum computing, signal processing, data analysis, and others. We are confident that our results can take high-dimensional data processing to a whole new level, unveiling latent insights and opening windows to new dimensions.
This research was done in collaboration with Misha Kilmer, the William Walker Professor of Mathematics and Adjunct Professor of Computer Science at Tufts University; Haim Avron, associate professor in the Department of Applied Mathematics at Tel Aviv University; and Elizabeth Newman, distinguished visiting assistant professor in Scientific Computing at Emory University.