SDS Seminar Series – Dr. Purna Sarkar
Mar
29
2024
Description
The Spring 2024 SDS Seminar Series continues on March 29th from 2:00 p.m. to 3:00 p.m. with Dr. Purna Sarkar (Statistics and Data Sciences, University of Texas at Austin). This event is in-person.
Title: Some New Results for Streaming Principal Component Analysis
Abstract: Streaming PCA, also known as Oja's algorithm, with roots going back to 1949, has attracted much attention in Statistics and Computer Science in the last decade. In this talk, I will discuss two of our works that consider this problem under slight departures from the setup considered widely in the literature.
Our first work looks at data streams generated from a Markov chain. While streaming PCA is typically analyzed under the IID data model, in many applications like distributed optimization, data points are sampled from a Markov chain and, therefore, are dependent. The naive approach of dropping data leads to a suboptimal rate. We use a novel linearization argument to remove the logarithmic dependence on the number of samples n.
Typically, the analysis of Oja's algorithm assumes that the effective rank of the covariance matrix is much smaller than n. Our second work examines online sparse PCA, where the effective rank is comparable to n. This differs from previously studied settings because the Oja vector does not concentrate on the true population eigenvector. Here, we show that a simple thresholding yields a consistent estimate of the population eigenvector. Both are joint work with Syamantak Kumar.
Location
Peter O’Donnell Jr. Building (POB) 2.302
Other Events in This Series
Mar
1
2024
SDS Seminar Series – Dr. Laura Hatfield
Predict, Correct, Select: A New General Identification Strategy for Controlled Pre-Post Designs
2:00 pm – 3:00 pm • Virtual
Speaker(s): Laura Hatfield
Mar
22
2024
SDS Seminar Series – Dr. Sivaraman Balakrishnan
Statistical Inference for Optimal Transport
2:00 pm – 3:00 pm • In Person
Speaker(s): Sivaraman Balakrishnan
Apr
12
2024
SDS Seminar Series – Dr. Daniela Witten
Data Thinning and Its Applications
2:00 pm – 3:00 pm • In Person
Apr
19
2024
SDS Seminar Series – Dr. William Rosenberger
Design and Inference for Enrichment Trials with a Continuous Biomarker
2:00 pm – 3:00 pm • In Person
Speaker(s): William Rosenberger
Apr
26
2024
SDS Seminar Series – Dr. Bodhisattva Sen
Extending the Scope of Nonparametric Empirical Bayes
2:00 pm – 3:00 pm • In Person
Speaker(s): Bodhisattva Sen
Sep
6
2024
SDS Seminar Series – Christine Peterson, University of Texas MD Anderson Cancer Center
New Methods for Microbiome Data Integration
2:00 pm – 3:00 pm • In Person
Speaker(s): Christine Peterson
Sep
13
2024
SDS Seminar Series – Matthew Vanaman, University of Texas at Austin
Data Analysis from the Zoo to the Wild and Back
2:00 pm – 3:00 pm • In Person
Speaker(s): Matthew Vanaman
Sep
20
2024
SDS Seminar Series – Saptarshi Roy, University of Texas at Austin
On the Computational Complexity of Private High-dimensional Model Selection
2:00 pm – 3:00 pm • In Person
Speaker(s): Saptarshi Roy
Sep
27
2024
SDS Seminar Series – Abhra Sarkar, University of Texas at Austin
(Bayesian) Semiparametric Local Inference (and Other Stories)
2:00 pm – 3:00 pm • In Person
Speaker(s): Abhra Sarkar
Oct
4
2024
SDS Seminar Series – Huiyan Sang, Texas A&M University
GS-BART: Graph Split Additive Decision Trees for Spatial and Network Data
2:00 pm – 3:00 pm • In Person
Speaker(s): Huiyan Sang