Seminar Series - Dr. Baharan Mirzasoleiman
Mar
10
2023

Mar
10
2023
Description
The Spring 2023 SDS Seminar Series continues on Friday, March 10th from 2:00 p.m. to 3:00 p.m. with Dr. Baharan Mirzasoleiman (Assistant Professor at the University of California, Los Angeles). This event is virtual.
Title: Coresets for Efficient and Robust Learning from Massive Datasets
Abstract: Large datasets have been crucial to the success of modern machine learning models. However, training on massive data has two major limitations. First, it is contingent on exceptionally large and expensive computational resources, and incurs a substantial cost due to the significant energy consumption. Second, in many real-world applications such as medical diagnosis, self-driving cars, and fraud detection, big data contains highly imbalanced classes, noisy labels, and malicious data points. In such cases, training on the entire data does not result in a high-quality model.
In this talk, I will argue that we can address the above limitations by developing techniques that can identify and extract the most informative subsets for learning from massive datasets. Training on such subsets not only reduces the substantial costs of learning from big data, but also improves their accuracy, and robustness against noisy labels and data poisoning attacks. I will discuss how we can develop effective and theoretically rigorous techniques that provide strong guarantees for the learned models’ quality and robustness against noisy labels. I discuss this problem in both supervised and unsupervised settings
Share
Other Events in This Series
Feb
17
2023
Seminar Series - Dr. Yen-Chi Chen
Pattern Graphs: a Graphical Approach to Nonmonotone Missing Data
2:00 pm — 3:00 pm •Virtual
Speaker(s): Yen-Chi Chen
Feb
24
2023
Seminar Series - Dr. Antik Chakraborty
Bayesian inference on high-dimensional multivariate binary responses
2:00 pm — 3:00 pm •Virtual
Speaker(s): Antik Chakraborty
Mar
3
2023
Seminar Series - Dr. Connor Jerzak
Optimal Stochastic Interventions with High-Dimensional Factorial Experiments: Application to Conjoint Analysis
2:00 pm — 3:00 pm •In Person & Virtual
Speaker(s): Connor Jerzak
Mar
24
2023
Seminar Series - Dr. Lindsay Berry
The Department for Statistics and Data Sciences at UT Austin presents its Spring 23 Seminar Series with speaker Dr. Lindsay Berry
2:00 pm — 3:00 pm •In Person & Virtual
Speaker(s): Lindsay Berry
Mar
31
2023
Seminar Series - Dr. Mevin Hooten
Running on Empty: Recharge Dynamics from Animal Movement Data
2:00 pm — 3:00 pm •In Person & Virtual
Speaker(s): Mevin Hooten
Apr
7
2023
Ph.D. Program Poster Session
Ph.D. Poster Session
3:00 pm — 4:30 pm •In Person
Speaker(s): PhD Students
Apr
14
2023
Seminar Series - Dr. Eric Vance
Teaching Collaboration in Statistics and Data Science
2:00 pm — 3:00 pm •In Person & Virtual
Speaker(s): Eric Vance
Apr
21
2023
Seminar Series - Dr. Faming Liang
A Stochastic Neural Network Bridging from Linear Models to Deep Learning
2:00 pm — 3:00 pm •Virtual & In Person
Speaker(s): Faming Liang
May
5
2023
CANCELED: Seminar Series - Dr. Ana-Maria Staicu
Spatial functional principal component analysis
2:00 pm — 3:00 pm •In Person & Virtual
Speaker(s): Ana-Maria Staicu