SDS Seminar Series – Dr. Purna Sarkar
Mar
29
2024
Mar
29
2024
Description
The Spring 2024 SDS Seminar Series continues on March 29th from 2:00 p.m. to 3:00 p.m. with Dr. Purna Sarkar (Statistics and Data Sciences, University of Texas at Austin). This event is in-person.
Title: Some New Results for Streaming Principal Component Analysis
Abstract: Streaming PCA, also known as Oja's algorithm, with roots going back to 1949, has attracted much attention in Statistics and Computer Science in the last decade. In this talk, I will discuss two of our works that consider this problem under slight departures from the setup considered widely in the literature.
Our first work looks at data streams generated from a Markov chain. While streaming PCA is typically analyzed under the IID data model, in many applications like distributed optimization, data points are sampled from a Markov chain and, therefore, are dependent. The naive approach of dropping data leads to a suboptimal rate. We use a novel linearization argument to remove the logarithmic dependence on the number of samples n.
Typically, the analysis of Oja's algorithm assumes that the effective rank of the covariance matrix is much smaller than n. Our second work examines online sparse PCA, where the effective rank is comparable to n. This differs from previously studied settings because the Oja vector does not concentrate on the true population eigenvector. Here, we show that a simple thresholding yields a consistent estimate of the population eigenvector. Both are joint works with Syamantak Kumar.
Location
Peter O’Donnell Jr. Building (POB) 2.302
Share
Other Events in This Series
Oct
11
2024
SDS Seminar Series – Mingyuan Zhou, University of Texas at Austin
Building Faster, Better, and Safer Deep Generative Models via Score Identity Distillation
2:00 pm – 3:00 pm • In Person
Speaker(s): Mingyuan Zhou
Oct
18
2024
SDS Seminar Series – Sherry Zhang, University of Texas at Austin
Pivoting between Space and Time: Spatio-Temporal Analysis with Cubble
2:00 pm – 3:00 pm • In Person
Speaker(s): Sherry Zhang
Oct
25
2024
SDS Seminar Series – Matt Koslovsky, Colorado State University
Sparse Dirichlet-Multinomial Models
2:00 pm – 3:00 pm • In Person
Speaker(s): Matt Koslovsky
Nov
1
2024
SDS Seminar Series – Aaditya Ramdas, Carnegie Mellon University
A Game-Theoretic Theory of Statistical Evidence
2:00 pm – 3:00 pm • In Person
Speaker(s): Aaditya Ramdas
Nov
8
2024
SDS Seminar Series – Myungsoo Yoo, University of Texas at Austin
Dynamic Spatio-Temporal Model Integrating Physics for Fire Front Propagation
2:00 pm – 3:00 pm • In Person
Speaker(s): Myungsoo Yoo
Nov
15
2024
SDS Seminar Series – Rafael Irizarry, Harvard University
Twenty-Five Years of Data Science: Music, Genomics, and Public Health Surveillance
2:00 pm – 3:00 pm • In Person
Speaker(s): Rafael Irizarry
Mar
28
2025
SDS Seminar Series – Po-Ling Loh, University of Cambridge
TBA
2:00 pm – 3:00 pm • In Person
Speaker(s): Po-Ling Loh
Apr
18
2025
SDS Seminar Series – Richard Samworth, University of Cambridge
TBA
2:00 pm – 3:00 pm • In Person
Speaker(s): Richard Samworth