SDS Seminar Series - Jonathan Huggins, Boston University
Oct
24
2025
Oct
24
2025
Description
The Fall 2025 SDS Seminar Series continues on October 24th from 2:00 p.m. to 3:00 p.m. with Dr. Jonathan Huggins (Assistant Professor, Department of Mathematics & Statistics, Boston University). This event is in-person in POB 6.304.
Title: Robust Model Selection for Discovery of Latent Mechanistic Processes
Abstract: When learning interpretable latent structures using model-based approaches, even small deviations from modeling assumptions can lead to inferential results that are not mechanistically meaningful. For example, many latent structures consist of K mechanistic processes (with K unknown). When the model is misspecified, likelihood-based model selection methods can substantially overestimate K as the sample size grows, while nonparametric methods can be overly conservative no matter how large the sample size. Hence, there is need for model selection methods that combine the precision of likelihood-based approaches with the robustness of nonparametrics. To address this need in a principled manner, we first formalize the problem of robust model selection in latent variable models designed for mechanistic understanding as requiring an estimator for K to satisfy a robust model selection consistency property. The definition of robust model selection consistency motivates a particular family of model selection procedures, which rely on plug-in estimates of a component-wise discrepancy measure we call the accumulated cutoff discrepancy criterion (ACDC). We provide a method for constructing mechanistically meaningful component-wise discrepancies for a class of latent variable models that includes unsupervised and supervised variants of probabilistic matrix factorization (including factor analysis) and mixture models. We prove that ACDC provides robust model selection consistency for unsupervised matrix factorization and mixture models. Numerical results show that in practice our approach reliably identifies a physically meaningful number of latent processes in four illustrative applications, outperforming widely used model selection methods. An in-depth case study of cell type discovery using single-cell RNA sequencing data demonstrates ACDC outperforms two widely used software packages designed specifically for single-cell data analysis.
Other Events in This Series
Sep
5
2025
SDS Seminar Series – Sarah Coleman, University of Texas at Austin
A Linear Mixed Effects Model for Evaluating Synthetic Gene Circuits
2:00 pm – 3:00 pm • In Person
Speaker(s): Sarah Coleman
Sep
12
2025
SDS Seminar Series – Lydia Lucchesi, University of Texas at Austin
Visual Documentation for Data Preprocessing in R and Python
2:00 pm – 3:00 pm • In Person
Speaker(s): Lydia Lucchesi
Sep
19
2025
SDS Seminar Series – Tuan Pham, University of Texas at Austin
Time-uniform Bounds for Iterated Algorithms
2:00 pm – 3:00 pm • In Person
Speaker(s): Tuan Pham
Sep
26
2025
SDS Seminar Series - Ryan Giordano, University of California, Berkeley
Local Weighting--Based Diagnostics for Bayesian Multilevel Regression with Poststratification
2:00 pm – 3:00 pm • In Person
Speaker(s): Ryan Giordano
Oct
3
2025
SDS Seminar Series – Rafael Campello de Alcantara, University of Texas at Austin
Searching for Parallel Trends: A Decision Tree Algorithm for Discovering Conditional Diff-in-Diff Estimators
2:00 pm – 3:00 pm • In Person
Speaker(s): Rafael Campello de Alcantara
Oct
10
2025
SDS Seminar Series – Michele Guindani, University of California, Los Angeles
Embracing Heterogeneity: Bayesian Clustering Methods for Neuroscience Data
2:00 pm – 3:00 pm • In Person
Speaker(s): Michele Guindani
Oct
17
2025
SDS Seminar Series – Wenyi Wang, MD Anderson Cancer Center
Deciphering Tumor Heterogeneity for Benefits from Immunotherapy in Cancer
2:00 pm – 3:00 pm • In Person
Speaker(s): Wenyi Wang
Oct
31
2025
SDS Seminar Series – Max Goplerud, University of Texas at Austin
Generalized Bilinear Mixed Models and Variational Inference
2:00 pm – 3:00 pm • In Person
Speaker(s): Max Goplerud
Nov
7
2025
SDS Seminar Series – Jeffrey Miller, Harvard University
Bayesian Model Criticism Using Uniform Parametrization Checks
2:00 pm – 3:00 pm • In Person
Speaker(s): Jeffrey Miller