Deep canonical correlation analysis (DCCA) is often applied to paired data samples from diverse sources to extract meaningful common information. However, when the data sources are heterogeneous, some of the useful information may be complementary but not exactly common. In spite of this fact, existing techniques learn maximally correlated representations from multiple views and are formulated so that they aim to yield identical latent subspaces for each view. This approach is sub-optimal in estimating the true signal subspaces for heterogeneous data sources. We propose a residual relaxation for deep canonical correlation analysis (RDCCA) based on a subspace distance metric, which generalizes the existing problem formulation and extracts representations that are better estimates of the actual, non-identical subspaces. We demonstrate that when using such a relaxation, the learned representations are closer to the true ones and that RDCCA outperforms CCA and DCCA in scenarios with heterogeneous data.
Published: April 5, 2024
Citation
Kuschel M., T.P. Marrinan, and T. Hasija. 2023.Geodesic-based relaxation for deep canonical correlation analysis. In IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP 2023), September 17-20, 2023, Rome, Italy, 1-6. Piscataway, New Jersey:IEEE.PNNL-SA-188975.doi:10.1109/MLSP55844.2023.10285937