Blog

Plotting the 2019 Tour de France
The Tour de France is a sporting event decided by mere minutes; to see exactly how those minutes were earned, read on for my plots!
Improving Wikipedia's Hour Record Plot
I love Wikipedia, I love cycling, and I love data! So today, I improve Wikipedia's Hour Record Plot! Come take a look!
Wayback Machine Archiver: Backup Pages with Python
The Internet Archive's Wayback Machine tries to keep a complete copy of the internet. With this script, you can submit pages for effortless indexing.
The Gender Pay Gap in Data Science Salaries
How do the salaries of women data scientists compare to those of men? This month we explore pay by gender and location.
Python Patterns: @total_ordering
Your classes can make use of the rich Python comparison operators just like the built-in classes. Here I'll show you how to do it while minimizing boilerplate.
Data Science Salaries
How do data scientists' salaries vary by experience and location? Read on to find out!
SWITRS: On What Days Do Cyclists Crash?
California accident data doesn't just cover cars; it covers bikes too! This time we look at when cyclists crash in California.
Python Patterns: Enum
Things often come in sets of specific items, like states, Pokémon, or playing cards. Python has an elegant way of representing them using enum.
Python Patterns: Named Tuples
Sometimes I need to store an ordered dataset, but reference specific members from it. Named tuples in Python provide a clean way to do this!
SWITRS: Accidents After Daylight Saving Time Ends
Daylight saving time leads to more traffic accidents, but what about when DST ends? Some researchers have found that it does lead to more accidents, so I take a look using California's SWITRS data.