You are here

Clustering Higher-Order Data

Paul McNicholas, McMaster University
12-1pm  17th Oct 2019

Abstract

There is an extensive body of literature on clustering univariate and multivariate data. However, attention the use of multidimensional arrays for clustering has thus far been limited to two-dimensional arrays, i.e., matrices or order-two tensors. Work on clustering data matrices, or three-way data, is presented before an approach for clustering multi-way data is introduced. The latter is based on a finite mixture of multidimensional arrays., i.e., a finite mixture of d-dimensional arrays, for d>2. For both matrix- and tensor-variate approaches, the Gaussian component approach is introduced first but approaches that use non-Gaussian components are also discussed. Simulated and real data are used for illustration.

Short Bio

Paul McNicholas is the Canada Research Chair in Computational Statistics and an E.W.R. Steacie Memorial Fellow. He is a Professor and University Scholar in the Department of Mathematics and Statistics at McMaster University (Ontario, Canada), where he is also Director of the MacDATA Institute. He is an SCSS alumnus, having completed his Ph.D. in Statistics at Trinity College Dublin in 2007. He has published extensively in computational statistics, with the vast majority of his journal articles, and one of his monographs, focusing on mixture model-based clustering and related topics. He has been an associate or guest editor for several international journals, and is currently an associate editor for Journal of Multivariate AnalysisJournal of Classification, and Advances in Data Analysis and Classification. He is currently President of The Classification Society, a Senior Member of the IEEE, and a member of the College of the Royal Society of Canada.

Venue

Large Conference Room, O'Reilly Institute