Topic: Data Science

I am a data scientist by profession and personality. I love trying to answer a question by finding the right data and pulling the answer out of it. I write about the tools I use to do data science, the methods, and even some of my results. You can find my articles on the subject below:

Jupyter Notebook Templates for Data Science: Plotting
Jumpstart your visualizations with this Jupyter plotting notebook!
Jupyter Notebook Templates for Data Science
Jupyter notebooks are great for data exploration; jumpstart your work with this library of useful notebook templates!
The Gender Pay Gap in Data Science Salaries
How do the salaries of woman data scientists compare to those of men? This month we explore pay by gender and location.
Data Science Salaries
How do data scientists salaries vary by experience and location? Read on to find out!
SWITRS: On What Days Do Cyclists Crash?
California crash data doesn't just cover cars, it covers bikes too! This time we look at when cyclists crash in California.
SWITRS: Car Crashes After Daylight Saving Time Ends
Day light saving time leaves leads to more traffic collisions, but what about when DST ends? Some researchers have found that it does lead to more crashes, so I take a look using California's SWITRS data.
Fate Dice: Statistics Testing Is Hard
A few months ago I tested my Fate dice for biases. Now, I retest the "biased" set and see if it really is unlucky! Unfortunately, things aren't so clear...
Fate Dice Intervals
What does a "normal" distribution of rolls from a fair set of Fate dice look like? There are a lot of ways to estimate it. In this post I'll go through four methods.
Fate Dice Statistics
My friends and I played a Fate RPG for over two years. During that time we rolled a lot of dice and developed a lot of superstitions, but were any of them correct?
Visualizing Multiple Data Distributions
Need to compare a set of distributions of some variable? Histograms are OK, but try something fancier! Read on to learn about box, strip, swarm, and violin plots!
SWITRS: Car Crashes After Daylight Saving Time
Day light saving time leaves us drowsy and cranky at work, but it also leads to an increase in traffic collisions! Find out exactly how many more there are with this analysis!
SWITRS: On What Days Do Motorcycles Crash?
Motorcycles riders are a different breed, born to chase excitment! So when do they crash? Using California's SWITRS data I find out! I'll give you a hint: it is not on the way to their 9-5!
Software Testing for Data Science
Much of data science involves writing code; for data cleaning, parsing, and modeling. Software tests can ensure that your code does what you think it does!
SWITRS: On What Days Do People Crash?
What day of the year has the most car crashes? The fewest? Find out as I look at California's crash data! Hint: they're both holidays!
WhereTo.Photo: Using Data Science to Take Great Photos
Where is the best spot to take a photo in San Francisco? Learn how I answered this question with my Insight Data Science project!
Further Double-checking FiveThirtyEight's 2016 Primary Predictions
Is FiveThirtyEight's Polls Plus model biased against any candidate? I continue my double-checking their model by looking at each candidate individually.
Double-checking FiveThirtyEight's 2016 Primary Predictions
How well did FiveThirtyEight do in predicting the primary results? I Double-check FiveThirtyEight's Polls Plus model by comparing its predictions to the outcomes of the 2016 primaries.