Open Conference Systems, CLADAG2023

Font Size: 
Clustering for sparsely sampled longitudinal data based on basis expansions
Michio Yamamoto, Yoshikazu Terada

Last modified: 2023-06-06

Abstract


In longitudinal data, the observations often occur at different time points for each subject. In such a case, ordinary clustering algorithms like K-means clustering cannot be applied directly. Instead, one may apply a smoothing technique to get individual continuous trajectories, followed by finding groups among the trajectories using some clustering algorithm. However, this is inappropriate when each subject’s data are observed at only a few time points. Thus, we develop a new clustering algorithm for sparsely sampled longitudinal data, which can be considered a natural extension of the K-means clustering. We show the consistency of the proposed estimator under mild regularity conditions. We also evaluate its performance through simulation studies and data applications.