Selected talks

Embracing SQL in data science workflows
Southern Data Science Conference, September 2022 Watch

A deep dive into the dbt manifest
NYC dbt meetup, June 2022 Slides

Parallel processing in Python: The current landscape
PyData Global, November 2020 Watch

Next-generation big data pipelines with Prefect and Dask
ODSC West, October 2020 Watch

Your data fits in RAM: How to avoid cluster computing
PyData Miami, January 2019 Watch

Dynamic healthcare dataset generation, curation, and quality with PySpark
Spark+AI Summit, June 2018 Watch

Bio and Headshot

Aaron Richter is a software developer with a passion for all things data. His work involves making sure data is clean and accessible, and that the tools to access it are at peak performance. Aaron is currently a data engineer at Robinhood. Previously, he supported the analytics platform at Squarespace, built the data warehouse at Modernizing Medicine, and worked as a data science advocate at Saturn Cloud. He has given talks at several major data conferences, including Data Council, PyData, Spark Summit, and ODSC. Aaron is based in NYC and holds a PhD in machine learning from Florida Atlantic University.

headshot-center.jpg

 The laundry list

Embracing SQL in data science workflows
Southern Data Science Conference, September 2022 Watch

A deep dive into the dbt manifest
NYC dbt meetup, June 2022 Slides

Panel: Building vs. buying when it comes to your data stack
DataOps Unleashed, Feb 2022 Watch

Jupyter notebooks for teams: Best practices for quality, reproducibility, and collaboration
ODSC East, March 2021 Watch

Data Science at Internet Scale with GPUs in the Cloud
Saturn Cloud webinar, March 2021 Watch

Machine Learning Without Limits: Snowflake and Python Together In The Cloud
Saturn Cloud x Snowflake webinar, January 2021 Watch

Accelerating XGBoost with Python
Saturn Cloud x Capital One webinar, December 2020 Watch

Parallel processing in Python: The current landscape
PyData Global, November 2020 Watch

Scaling machine learning in Python
Saturn Cloud workshop, November 2020 Watch

Data & AI Accessibility: The Democratization of Data Science
Saturn Cloud x Travis Oliphant webinar, October 2020 Watch

Next-generation big data pipelines with Prefect and Dask
ODSC West, October 2020 Watch

High Performance Jupyter: Faster workloads with Dask and RAPIDS
JupyterCon, October 2020 Watch

Watson Institute master course: Intro to machine learning
Watson Institute at Lynn University, April 2020 Watch

Running and analyzing machine learning experiments in the cloud
IEEE ICMLA, December 2019 Watch

Intro to machine learning + ML for mobile
South Florida Mobile Developers meetup, December 2019 Watch

Predicting melanoma risk from electronic health records with machine learning techniques
PhD dissertation defense, July 2019 Watch

Iā€™m a data scientist, ask me anything! (Live AMA)
Ironhack Miami, March 2019

Data @ modmed (Ironhack DataXperience)
Ironhack Miami, February 2019

Your data fits in RAM: How to avoid cluster computing
PyData Miami, January 2019 Watch

Shaping the future of data analytics
Panel member, Lynn University Business Symposium, November 2018

A modern big data architecture for healthcare research
FAU Big Data Science conference, October 2018

Doing big data with Spark
Data.miami bootcamp, July 2018 Watch

Dynamic healthcare dataset generation, curation, and quality with PySpark
Spark+AI Summit, June 2018 Watch

Predicting melanoma risk from electronic health record data
Miami Machine Learning meetup, May 2018

Feature learning with matrix factorization and neural networks
Ft. Lauderdale Machine Learning meetup, February 2018 Watch

Machine learning in the cloud with Amazon Web Services
Miami Machine Learning meetup, February 2018 Watch

Machine learning with big data using Spark
Ft. Lauderdale Machine Learning meetup, November 2015