Taylor & Francis Group
DOCUMENT
MCCCA_app.pdf (122.16 kB)
DATASET
section3data.csv (3.72 kB)
DATASET
section3data_cls.csv (2.95 kB)
DATASET
section3data_ext.csv (3.72 kB)
4 files

Visualizing Class Specific Heterogeneous Tendencies in Categorical Data

Version 2 2022-03-28, 19:20
Version 1 2022-02-02, 21:20
Dataset posted on 2022-03-28, 19:20, authored by Mariko Takagishi and Michel van de Velden

In multiple correspondence analysis, both individuals (observations) and categories can be represented in a biplot that jointly depicts the relationships among categories, the relationships among individuals, and the associations between the two. Additional information about the individuals can aid interpretation. One example is class information whose interdependencies are not of immediate concern but which facilitates interpretation of the plot with respect to the relationships between individuals and categories. This article proposes a new method, which we call multiple-class cluster correspondence analysis, that identifies clusters specific to classes. The proposed method constructs a biplot that depicts heterogeneous tendencies of individual members, as well as their relationships with the original categorical variables. A simulation study investigating the performance of the proposed method and an application to data on road accidents in the United Kingdom confirm the viability of this approach. Supplementary materials for this article are available online.

Funding

This work was supported by the Japan Society for the Promotion of Science KAKENHI grant 20K19755.
