Topic: Machine Learning
Machine learning is what first attracted me to data science as I finished my PhD. The ability to train a machine to do a better job at certain tasks than a human was awe inspiring. My articles about machine learning can be found here:
Using Large Language Models To Clean Data
Manually fixing messy data is tedious and slow. But thankfully, LLMs are pretty good are piecing together mangled text. Read on to find out how!
Manually fixing messy data is tedious and slow. But thankfully, LLMs are pretty good are piecing together mangled text. Read on to find out how!
Machine Learning Deployment:
Return Actions, Not Scores
A poorly designed machine learning model API will leave you trapped. Properly hiding your implementation will make life much easier!
Return Actions, Not Scores
A poorly designed machine learning model API will leave you trapped. Properly hiding your implementation will make life much easier!
Claude Solves SAT Analogies
Word2Vec failed to solve SAT analogies, can modern language models do better? A small test of Anthropic's Claude LLM.
Word2Vec failed to solve SAT analogies, can modern language models do better? A small test of Anthropic's Claude LLM.
When Are Large Language Models Useful?
Large language models (LLMs) are incredibly valuable tools, but they're not for everything. Here's a simple rule to know when to use them and when to avoid them.
Large language models (LLMs) are incredibly valuable tools, but they're not for everything. Here's a simple rule to know when to use them and when to avoid them.
How I Write with ChatGPT
OpenAI's ChatGPT is viewed as entertaining but not useful because it makes up facts. But I find it incredibly valuable for writing. Here is how I use it.
OpenAI's ChatGPT is viewed as entertaining but not useful because it makes up facts. But I find it incredibly valuable for writing. Here is how I use it.
AI, Artists, and Technology
AI generated art took off with the open-source release of Stable Diffusion, leaving some artists worried. As an artist and machine learning engineer, here is my take.
AI generated art took off with the open-source release of Stable Diffusion, leaving some artists worried. As an artist and machine learning engineer, here is my take.
Using Scikit-learn Pipelines with Pandas Dataframes
Pandas and scikiet-learn are two important libraries for building machine learning models. Here is how to get them to work together.
Pandas and scikiet-learn are two important libraries for building machine learning models. Here is how to get them to work together.
Computing Machine Learning Features in Real-time
Models often derive great value from real-time features, but computing them is hard because it has to be done quickly. Here is one way I have done it successfully.
Models often derive great value from real-time features, but computing them is hard because it has to be done quickly. Here is one way I have done it successfully.
Machine Learning Deployment: Shadow Mode
Deploying machine learning models is hard; Shadow Mode is one way to make testing a little easier.
Deploying machine learning models is hard; Shadow Mode is one way to make testing a little easier.
Interview Question: What Machine Learning Metric to Use
One of my favorite questions to ask in an interview is "What metric should you use to decide if your model works?". Read on to find out what a good answer looks like!
One of my favorite questions to ask in an interview is "What metric should you use to decide if your model works?". Read on to find out what a good answer looks like!
SAT2Vec: Word2Vec Versus SAT Analogies
Could Word2Vec pass the SAT analogies section and get accepted to a good college? I take a pre-trained model and find out!
Could Word2Vec pass the SAT analogies section and get accepted to a good college? I take a pre-trained model and find out!