Armand Sauzay

404 Followers

Published in

Towards Data Science

·Oct 26

Which Data Format to Use For Your Big Data Project?

Pickle, Parquet, CSV, Feather, HDF5, ORC, JSON: which one should you be using and why? — Choosing the right data format is crucial in Data Science projects, impacting everything from data read/write speeds to memory consumption and interoperability. This article explores seven popular serialization/deserialization formats in Python, focusing on their speed and memory usage implications. Through the analysis, we’ll also see how we can use profiling…

Data

6 min read



Jun 5

Python Project Setup: A Step-by-Step Guide to Industry Best Practices

Learn how to properly set up and kickstart your Python project, along with the nitty-gritty details of which tools to use and how to configure them together, by building a simple CLI. — As you start working on your Python project, you’ll likely need to set it up in a consistent and collaboration-friendly way. In this article, I’ll describe a setup that works great for our projects at Turo, and also for my personal ones. It includes many industry best practices (semantic versioning, pre-commit…

Python

16 min read



Feb 14

Introduction to Cryptography: Understanding Hashing and Public-key Encryption with Code Examples

What’s cryptography? What do hashing and public-key encryption mean? And which tool can you use to start writing cryptography code? — You have probably heard the word cryptography a lot recently, especially during the cryptocurrency bull market of 2021, or with the recent FTX saga. But what does it really mean? Is it useful? And how can you start using some cryptography yourself? As always, we’ll go through a little bit of…

Crypto

8 min read



Jul 13, 2022

How Fair Are Your Machine Learning Models?

A quick introduction to the topic of fairness with hands-on coding. Evaluate your machine learning model’s fairness in just a few lines of code. — Are Machine Learning models “fair”? As more and more decisions are backed by ML algorithms, it becomes important to understand the biases they can create. But what does “fairness” mean? This is where it gets a little political (and mathematical)… To illustrate our thoughts, we’ll take the example of a machine…

Machine Learning

6 min read



Jun 26, 2022

SHAP values: Machine Learning interpretability and feature selection made easy.

Machine learning interpretability with hands-on SHAP code. — Machine Learning interpretability is becoming increasingly important, especially as ML algorithms get more complex. How good is your Machine Learning algorithm if it can’t be explained? Less performant but explainable models (like linear regression) are sometimes preferred over more performant but black-box models (like XGBoost or Neural Networks)…

Machine Learning

5 min read



Jun 16, 2022

5 tools I wish I knew when I started writing Machine Learning code

A few tools that will get you on the right track for your Machine Learning projects using Python. — A few years back, I first learnt how to write machine learning code when I took my first ML class while pursuing my graduate studies in Applied Mathematics. …

Machine Learning

4 min read



Jun 11, 2022

Hydra, the most efficient config handling library for your Python/ML code

Improve your Python code by using Hydra for configuration. — In this tutorial, we’ll go through some of the options you might encounter for config handling, then explain why Hydra is my favorite pick, and finally go through some code examples to highlight the key functionalities of Hydra. Context and available options: As one works on a Python project, especially for machine learning, the…

Machine Learning

5 min read



Jun 7, 2022

Add pre-commit checks to format your files and commit messages

Stop committing wrongly formatted code and start using pre-commit checks. — How many times have you seen a commit message like ‘test’ or ‘modif’ or ‘reran notebook’? Commit messages can be very useful, and a consistent format helps surface the relevant information at a glance. This is what Conventional Commits is trying to achieve: standardize the commit format to be…

Github

3 min read



Jun 5, 2022

Command Line 101: a Basic Guide to Using the Terminal

Get started using your terminal and get one step closer to being an experienced developer. — All of the code for this tutorial can be found here. Developers use the command line to navigate through files and perform operations. Once you get used to it, it is definitely the most efficient way to access files and perform operations. Also, when you start virtual machines on the…

Coding

3 min read



Published in

Dev Genius

·Jun 3, 2022

Using Conda environments for Python, all you need to know

A hands-on guide to creating and managing conda environments and env variables for your Python projects. — All of the code for this tutorial can be found here. Python code is great, but being able to reproduce the code is even better! …

Python

4 min read


Armand Sauzay

Applied Scientist at Turo - ex Airbnb - ex Amazon - UC Berkeley

Following
  • Foteini Ntelopoulou
  • Nouriel Gino Yazdinian
  • The Money Bull
  • Fabio Matricardi
  • SunDeep Mehra: Digital Entrepreneur
