Selected talks
Embracing SQL in data science workflows
Southern Data Science Conference, September 2022 Watch
A deep dive into the dbt manifest
NYC dbt meetup, June 2022 Slides
Parallel processing in Python: The current landscape
PyData Global, November 2020 Watch
Next-generation big data pipelines with Prefect and Dask
ODSC West, October 2020 Watch
Your data fits in RAM: How to avoid cluster computing
PyData Miami, January 2019 Watch
Dynamic healthcare dataset generation, curation, and quality with PySpark
Spark+AI Summit, June 2018 Watch
Bio and Headshot
Aaron Richter is a software developer with a passion for all things data. His work involves making sure data is clean and accessible, and that the tools to access it are at peak performance. Aaron is currently a data engineer at Robinhood. Previously, he supported the analytics platform at Squarespace, built the data warehouse at Modernizing Medicine, and worked as a data science advocate at Saturn Cloud. He has given talks at several major data conferences, including Data Council, PyData, Spark Summit, and ODSC. Aaron is based in NYC and holds a PhD in machine learning from Florida Atlantic University.
The laundry list
Embracing SQL in data science workflows
Southern Data Science Conference, September 2022 Watch
A deep dive into the dbt manifest
NYC dbt meetup, June 2022 Slides
Panel: Building vs. buying when it comes to your data stack
DataOps Unleashed, Feb 2022 Watch
Jupyter notebooks for teams: Best practices for quality, reproducibility, and collaboration
ODSC East, March 2021 Watch
Data Science at Internet Scale with GPUs in the Cloud
Saturn Cloud webinar, March 2021 Watch
Machine Learning Without Limits: Snowflake and Python Together In The Cloud
Saturn Cloud x Snowflake webinar, January 2021 Watch
Accelerating XGBoost with Python
Saturn Cloud x Capital One webinar, December 2020 Watch
Parallel processing in Python: The current landscape
PyData Global, November 2020 Watch
Scaling machine learning in Python
Saturn Cloud workshop, November 2020 Watch
Data & AI Accessibility: The Democratization of Data Science
Saturn Cloud x Travis Oliphant webinar, October 2020 Watch
Next-generation big data pipelines with Prefect and Dask
ODSC West, October 2020 Watch
High Performance Jupyter: Faster workloads with Dask and RAPIDS
JupyterCon, October 2020 Watch
Watson Institute master course: Intro to machine learning
Watson Institute at Lynn University, April 2020 Watch
Running and analyzing machine learning experiments in the cloud
IEEE ICMLA, December 2019 Watch
Intro to machine learning + ML for mobile
South Florida Mobile Developers meetup, December 2019 Watch
Predicting melanoma risk from electronic health records with machine learning techniques
PhD dissertation defense, July 2019 Watch
Iām a data scientist, ask me anything! (Live AMA)
Ironhack Miami, March 2019
Data @ modmed (Ironhack DataXperience)
Ironhack Miami, February 2019
Your data fits in RAM: How to avoid cluster computing
PyData Miami, January 2019 Watch
Shaping the future of data analytics
Panel member, Lynn University Business Symposium, November 2018
A modern big data architecture for healthcare research
FAU Big Data Science conference, October 2018
Doing big data with Spark
Data.miami bootcamp, July 2018 Watch
Dynamic healthcare dataset generation, curation, and quality with PySpark
Spark+AI Summit, June 2018 Watch
Predicting melanoma risk from electronic health record data
Miami Machine Learning meetup, May 2018
Feature learning with matrix factorization and neural networks
Ft. Lauderdale Machine Learning meetup, February 2018 Watch
Machine learning in the cloud with Amazon Web Services
Miami Machine Learning meetup, February 2018 Watch
Machine learning with big data using Spark
Ft. Lauderdale Machine Learning meetup, November 2015