Vol. 35 No. 3 (2023)
Articles

From plain to sparse correspondence analysis: a generalized SVD approach

Hervé Abdi
School of Behavioral and Brain Sciences, The University of Texas at Dallas, Richardson, TX
Vincent Guillemot
Institut Pasteur, Université Paris Cité, Bioinformatics and Biostatistics Hub, Paris
Ruiping Liu
School of Applied Science, Beijing Information Science and Technology Univer- sity, Beijing
Ndeye Niang
Cedric Lab, Conservatoire national des arts et métiers, Paris
Gilbert Saporta
Cedric Lab, Conservatoire national des arts et métiers, Paris
Ju-Chi Yu
Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto

Published 2024-07-26

Keywords

  • Sparsity,
  • Generalized singular value decomposition,
  • Correspondence analysis,
  • LASSO,
  • Penalized matrix decomposition

How to Cite

Abdi, H., Guillemot, V., Liu, R., Niang, N., Saporta, G., & Yu, J.-C. (2024). From plain to sparse correspondence analysis: a generalized SVD approach. Italian Journal of Applied Statistics, 35(3), 301–338. https://doi.org/10.26398/IJAS.0035-014

Abstract

Correspondence Analysis (CA) is the method of choice to analyze contingency matrices and is widely applied in text analysis, psychometrics, chemometrics etc. But CA becomes difficult to interpret when the number of rows or columns is large, a configuration routinely found in contemporary statistical practice. For principal component analysis (PCA), this interpretation problem has been traditionally handled with rotation and more recently with sparsification methods such as the LASSO. Curiously, despite the strong connections between CA and PCA, sparsifying correspondence analysis remains essentially unexplored. In this paper, we derive an extension of the Penalized Matrix Decomposition (a method based on the singular value decomposition) to sparsify CA. We present some theoretical results and properties of the resulting sparse correspondence analysis and illustrate this new method with an analysis of the causes of deaths in the United States in 2019.