Doctor of Philosophy (Ph.D.), Bowling Green State University, 2023, Data Science
Deep Gaussian Mixture Models (DGMMs) are probabilistic models that combine multiple layers of latent variables (Viroli and McLachlan, 2019). DGMMs adeptly capture intricate, non-linear interactions among variables, facilitating efficient unsupervised learning. This study strives to improve DGMMs by introducing a method to systematically choose optimal covariance structures for each DGMM layer.
Our proposal involves a closed multiple-testing procedure that utilizes the likelihood-ratio test to select the most suitable covariance structure from a set of candidate structures. Our proposal is inspired by the likelihood-ratio method proposed by Greselin and Punzo (2013). Typically, information criteria are widely used in the context of model selection, but they have different properties and may be more appropriate for different types of data or modeling situations. Additionally, the selection of a specifc information criterion is subjective and many practitioners tend to use a particular method routinely, which can limit the potential for discovering the best covariance structure for the data at hand (Greselin and Punzo (2013), Punzo et al. (2016)).
The proposed method draws inspiration from McNicholas and Murphy (2008), in the context of mixture factor analyzers, where constraints are applied to covariance structures. In Deep Gaussian Mixture Models (DGMMs), these covariance structures can be defned at each layer, creating a range of complexities. To aid covariance structure selection in DGMMs, it is assumed that each cluster within a layer shares the same covariance structure. The chosen structure achieves a balance between model complexity, enhancing performance and predictive accuracy. The method employs a closed multiple-testing approach based on the likelihood ratio test, comparing likelihoods of different covariance structures for the DGMM.
We conduct a series of simulations considering multiple heteroscedasticity configurations that represent different cova (open full item for complete abstract)
Committee: Junfeng Shang Ph.D. (Committee Chair); Lauren Maziarz Ph.D. (Other); Hanfeng Chen Ph.D. (Committee Member); Rob Green Ph.D. (Committee Member)
Subjects: Statistics