SDS Seminar Series – Matthew Vanaman, University of Texas at Austin

Sep 13, 2024, 2:00 pm – 3:00 pm
In Person
Featured Speaker(s): Matthew Vanaman
Cost: Free
Data Analysis from the Zoo to the Wild and Back

Description

The Fall 2024 SDS Seminar Series will continue on September 13th from 2:00 p.m. to 3:00 p.m. with Dr. Matthew Vanaman (Postdoctoral Fellow, Department of Statistics and Data Sciences, University of Texas at Austin). This event will be held in person in CBA 4.348.

Title: Data Analysis from the Zoo to the Wild and Back

Abstract: Across the sciences, analysts are taught to analyze data in zoo-like classroom settings. In these settings, it is easy to distinguish the “well-trained” analysts from the feral ones: obey the zookeeper by identifying the appropriate model for your unrealistically clean data, properly understand how this model works under optimal conditions, interpret its output correctly and objectively, and report the results as a series of linear decisions with no apparent deviation from what was planned all along. Release this impressively domesticated analyst into the wild, and what could go wrong? They quickly find out. Confronted with entirely new challenges that the classroom can only imitate in their most idealized form, it becomes clear that real-world analysis requires an additional kind of expertise. I call this expertise analytic fluency. In asking one’s fellow analyst what these skills are, the answers often resemble something instinctual. Unfortunately, instincts cannot be taught, which is a problem given the litany of emerging data challenges facing new analysts, such as our massively increasing quantity of data, new kinds of measurements, replication crises, concerns about widespread use of questionable research practices, and a host of novel ethical challenges. It is therefore imperative that we elevate analytic fluency from the level of instincts to something explicit and formalized. In so doing, we prepare our next generation of analysts so that they do not have to rely on the slow-going instruction of experience. I take an initial step toward that end by reporting results from a qualitative pilot study probing what data analysts have learned from their experience. I consider what their testimonies imply about how we should teach data analysis, practice it, evaluate its success, and how our understanding of “good data analysis” might be revised to better reflect the conditions of the wild.

Location

CBA 4.348


Other Events in This Series

Mar 1, 2024

Seminar Series

SDS Seminar Series – Dr. Laura Hatfield

Predict, Correct, Select: A New General Identification Strategy for Controlled Pre-Post Designs

2:00 pm – 3:00 pm Virtual

Speaker(s): Laura Hatfield

Mar 22, 2024

Seminar Series

SDS Seminar Series – Dr. Sivaraman Balakrishnan

Statistical Inference for Optimal Transport

2:00 pm – 3:00 pm In Person

Speaker(s): Sivaraman Balakrishnan

Mar 29, 2024

Seminar Series

SDS Seminar Series – Dr. Purna Sarkar

Some New Results for Streaming Principal Component Analysis

2:00 pm – 3:00 pm In Person

Speaker(s): Purna Sarkar

Apr 12, 2024

Seminar Series

SDS Seminar Series – Dr. Daniela Witten

Data Thinning and Its Applications

2:00 pm – 3:00 pm In Person

Apr 19, 2024

Seminar Series

SDS Seminar Series – Dr. William Rosenberger

Design and Inference for Enrichment Trials with a Continuous Biomarker

2:00 pm – 3:00 pm In Person

Speaker(s): William Rosenberger

Apr 26, 2024

Seminar Series

SDS Seminar Series – Dr. Bodhisattva Sen

Extending the Scope of Nonparametric Empirical Bayes

2:00 pm – 3:00 pm In Person

Speaker(s): Bodhisattva Sen

Sep 6, 2024

Seminar Series

SDS Seminar Series – Christine Peterson, University of Texas MD Anderson Cancer Center

New Methods for Microbiome Data Integration

2:00 pm – 3:00 pm In Person

Speaker(s): Christine Peterson

Sep 20, 2024

Seminar Series

SDS Seminar Series – Saptarshi Roy, University of Texas at Austin

On the Computational Complexity of Private High-dimensional Model Selection

2:00 pm – 3:00 pm In Person

Speaker(s): Saptarshi Roy

Sep 27, 2024

Seminar Series

SDS Seminar Series – Abhra Sarkar, University of Texas at Austin

(Bayesian) Semiparametric Local Inference (and Other Stories)

2:00 pm – 3:00 pm In Person

Speaker(s): Abhra Sarkar

Oct 4, 2024

Seminar Series

SDS Seminar Series – Huiyan Sang, Texas A&M University

GS-BART: Graph Split Additive Decision Trees for Spatial and Network Data

2:00 pm – 3:00 pm In Person

Speaker(s): Huiyan Sang