Topic: Data Science
I am a data scientist by profession and personality. I love trying to answer a question by finding the right data and pulling the answer out of it. I write about the tools I use to do data science, the methods, and even some of my results. You can find my articles on the subject below:
California was put under a stay-at-home order in March 2020. As expected, traffic volume decreased, but what happened to the rate of fatal accidents? It skyrocketed!
From Analyst to Machine Learning
Data science has left the era of the Unicorn and entered the era of the team, but that means there is now a whole spectrum of data science jobs. Here is what they do.
From Unicorns to Teams
When data science started, the job covered everything from setting up databases to running experiments to making models. But finding Unicorns was impossible; something had to give.
When I released the SWITRS dataset, I had to find a place to host 5 GB of data. Here is what I learned.
Jumpstart your time series visualizations with this Jupyter plotting notebook!
Jumpstart your visualizations with this Jupyter plotting notebook!
Jupyter notebooks are great for data exploration; jumpstart your work with this library of useful notebook templates!
How do the salaries of women data scientists compare to those of men? This month we explore pay by gender and location.
How do data scientists' salaries vary by experience and location? Read on to find out!
California crash data doesn't just cover cars; it covers bikes too! This time we look at when cyclists crash in California.
Daylight saving time leads to more traffic collisions when it starts, but what about when DST ends? Some researchers have found that it also leads to more crashes, so I take a look using California's SWITRS data.
A few months ago I tested my Fate dice for biases. Now, I retest the "biased" set and see if it really is unlucky! Unfortunately, things aren't so clear...
What does a "normal" distribution of rolls from a fair set of Fate dice look like? There are a lot of ways to estimate it. In this post I'll go through four methods.
My friends and I played a Fate RPG for over two years. During that time we rolled a lot of dice and developed a lot of superstitions, but were any of them correct?
Need to compare a set of distributions of some variable? Histograms are OK, but try something fancier! Read on to learn about box, strip, swarm, and violin plots!
Daylight saving time leaves us drowsy and cranky at work, but it also leads to an increase in traffic collisions! Find out exactly how many more there are with this analysis!
Motorcycle riders are a different breed, born to chase excitement! So when do they crash? Using California's SWITRS data, I find out! I'll give you a hint: it is not on the way to their 9-5!
Much of data science involves writing code for data cleaning, parsing, and modeling. Software tests can ensure that your code does what you think it does!
What day of the year has the most car crashes? The fewest? Find out as I look at California's crash data! Hint: they're both holidays!
Where is the best spot to take a photo in San Francisco? Learn how I answered this question with my Insight Data Science project!
Is FiveThirtyEight's Polls Plus model biased against any candidate? I continue double-checking their model by looking at each candidate individually.
How well did FiveThirtyEight do in predicting the primary results? I double-check FiveThirtyEight's Polls Plus model by comparing its predictions to the outcomes of the 2016 primaries.