SDS Seminar Series – Dr. Purna Sarkar
Mar
29
2024

Description
The Spring 2024 SDS Seminar Series continues on March 29th from 2:00 p.m. to 3:00 p.m. with Dr. Purna Sarkar (Statistics and Data Sciences, University of Texas at Austin). This event is in-person.
Title: Some New Results for Streaming Principal Component Analysis
Abstract: Streaming PCA, also known as Oja's algorithm, with roots going back to 1949, has attracted much attention in Statistics and Computer Science in the last decade. In this talk, I will discuss two of our works that consider this problem under slight departures from the setup considered widely in the literature.
Our first work looks at data streams generated from a Markov chain. While streaming PCA is typically analyzed under the IID data model, in many applications like distributed optimization, data points are sampled from a Markov chain and, therefore, are dependent. The naive approach of dropping data leads to a suboptimal rate. We use a novel linearization argument to remove the logarithmic dependence on the number of samples n.
Typically, the analysis of Oja's algorithm assumes that the effective rank of the covariance matrix is much smaller than n. Our second work examines online sparse PCA, where the effective rank is comparable to n. This differs from previously studied settings because the Oja vector does not concentrate on the true population eigenvector. Here, we show that a simple thresholding yields a consistent estimate of the population eigenvector. Both are joint works with Syamantak Kumar.
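For readers unfamiliar with the method, the following is a minimal sketch of the textbook Oja update followed by a hard-thresholding step. It is an illustration only and is not the speakers' algorithm or analysis. The function names, the constant step size, the threshold `tau`, and the low-dimensional spiked-covariance toy data are all assumptions made for this example, and the toy data does not reproduce the high effective rank regime discussed in the abstract.

```python
import math
import random


def normalize(v):
    """Scale a vector (list of floats) to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def oja(stream, dim, step_size, rng):
    """Textbook Oja update: w <- normalize(w + step_size * x * (x^T w)).

    Processes one sample at a time and stores only the current vector w.
    """
    w = normalize([rng.gauss(0.0, 1.0) for _ in range(dim)])
    for x in stream:
        proj = sum(xi * wi for xi, wi in zip(x, w))  # x^T w
        w = normalize([wi + step_size * proj * xi for wi, xi in zip(w, x)])
    return w


def hard_threshold(w, tau):
    """Zero out entries with |w_i| < tau, then renormalize.

    tau is a hypothetical tuning parameter chosen for this illustration.
    """
    kept = [wi if abs(wi) >= tau else 0.0 for wi in w]
    if not any(kept):
        return list(w)  # nothing survived; fall back to the input
    return normalize(kept)


def spiked_stream(n, v, spike, rng):
    """Yield n IID samples with covariance I + spike * v v^T (assumed toy model)."""
    for _ in range(n):
        z = rng.gauss(0.0, 1.0)
        yield [math.sqrt(spike) * z * vi + rng.gauss(0.0, 1.0) for vi in v]


def demo(seed=0):
    """Run Oja on the toy stream, threshold, and report alignment with v."""
    rng = random.Random(seed)
    dim, support = 20, 4
    # Sparse unit-norm leading eigenvector: four equal nonzero entries.
    v = [1.0 / math.sqrt(support)] * support + [0.0] * (dim - support)
    stream = spiked_stream(5000, v, spike=4.0, rng=rng)
    w = oja(stream, dim, step_size=0.002, rng=rng)
    w_sparse = hard_threshold(w, tau=0.25)
    # The eigenvector is identified only up to sign, hence the absolute value.
    alignment = abs(sum(a * b for a, b in zip(w_sparse, v)))
    return w_sparse, alignment


if __name__ == "__main__":
    estimate, alignment = demo()
    print("alignment with true eigenvector:", alignment)
```

The update keeps a single vector in memory and touches each sample once, which is what makes it suitable for streaming data. The thresholding step shown here is the simplest possible variant; how the threshold should be chosen in the sparse setting is a question for the talk itself.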
Location
Peter O’Donnell Jr. Building (POB) 2.302
Other Events in This Series
Sep
12
2025
SDS Seminar Series – Lydia Lucchesi, University of Texas at Austin
Visual Documentation for Data Preprocessing in R and Python
2:00 pm – 3:00 pm • In Person
Speaker(s): Lydia Lucchesi
Sep
19
2025
SDS Seminar Series – Tuan Pham, University of Texas at Austin
Time-uniform Bounds for Iterated Algorithms
2:00 pm – 3:00 pm • In Person
Speaker(s): Tuan Pham
Sep
26
2025
SDS Seminar Series – Ryan Giordano, University of California, Berkeley
TBA
2:00 pm – 3:00 pm • In Person
Speaker(s): Ryan Giordano
Oct
3
2025
SDS Seminar Series – Rafael Campello de Alcantara, University of Texas at Austin
Searching for Parallel Trends: A Decision Tree Algorithm for Discovering Conditional Diff-in-Diff Estimators
2:00 pm – 3:00 pm • In Person
Speaker(s): Rafael Campello de Alcantara
Oct
10
2025
SDS Seminar Series – Michele Guindani, University of California, Los Angeles
TBA
2:00 pm – 3:00 pm • In Person
Speaker(s): Michele Guindani
Oct
17
2025
SDS Seminar Series – Wenyi Wang, MD Anderson Cancer Center
TBA
2:00 pm – 3:00 pm • In Person
Speaker(s): Wenyi Wang
Oct
31
2025
SDS Seminar Series – Max Goplerud, University of Texas at Austin
TBA
2:00 pm – 3:00 pm • In Person
Speaker(s): Max Goplerud
Nov
7
2025
SDS Seminar Series – Jeffrey Miller, Harvard University
TBA
2:00 pm – 3:00 pm • In Person
Speaker(s): Jeffrey Miller