Clustering Higher-Order Data
Paul McNicholas, McMaster University
12-1pm 17th Oct 2019
Abstract
There is an extensive body of literature on clustering univariate and multivariate data. However, attention the use of multidimensional arrays for clustering has thus far been limited to two-dimensional arrays, i.e., matrices or order-two tensors. Work on clustering data matrices, or three-way data, is presented before an approach for clustering multi-way data is introduced. The latter is based on a finite mixture of multidimensional arrays., i.e., a finite mixture of d-dimensional arrays, for d>2. For both matrix- and tensor-variate approaches, the Gaussian component approach is introduced first but approaches that use non-Gaussian components are also discussed. Simulated and real data are used for illustration.
Short Bio
Paul McNicholas is the Canada Research Chair in Computational Statistics and an E.W.R. Steacie Memorial Fellow. He is a Professor and University Scholar in the Department of Mathematics and Statistics at McMaster University (Ontario, Canada), where he is also Director of the MacDATA Institute. He is an SCSS alumnus, having completed his Ph.D. in Statistics at Trinity College Dublin in 2007. He has published extensively in computational statistics, with the vast majority of his journal articles, and one of his monographs, focusing on mixture model-based clustering and related topics. He has been an associate or guest editor for several international journals, and is currently an associate editor for Journal of Multivariate Analysis, Journal of Classification, and Advances in Data Analysis and Classification. He is currently President of The Classification Society, a Senior Member of the IEEE, and a member of the College of the Royal Society of Canada.