Polars vs Pandas: A Real-World Performance Comparison (Part 1/2)

Recently, I decided to experiment with the Python library Polars at work. I’ve been hearing a lot about it, and I wanted to see how it compares to Pandas, which we currently use for our data preprocessing. Here’s what I learned! Why Polars? Polars has been trending in the data engineering world, and I was curious to see if it lived up to the hype. Plus, I’m a firm believer in learning by doing, so I thought this would be a great opportunity to get hands-on experience with a new tool....

July 13, 2024 · 3 min · Me

Notes on ML in production

Today, I find myself working on another Machine Learning (ML) model with the goal of putting it in production. I thought a good first step would be to look back on my previous experiences and identify what (not) to do! I won’t explain what ML is in this article, or which algorithm(s) to pick according to different cases; I’ll go through good practices like: how to start, what not to forget about and little tricks to make your life easier....

January 22, 2022 · 7 min · Me

Introduction

Hello there / bonjour 👋 I am Thomas, a 26yo French Data Scientist and this is my first ever blog post ! This blog will be about everything, from what I learnt after 3 years working as a Data Scientist, to random thoughts about the world ! For years, I have been passionate about self improvement. Trying to find ways to combine pleasure, entertainment and self growth. Back in high school, I decided to better my English and picked-up the Hobbit in its original version....

January 10, 2022 · 1 min · Me