In this repository, I'm uploading code, notebooks and articles from my personal blog : https://maelfabien.github.io/. Don't hesitate to the repo if you enjoy my work ! New articles are being published weekly !
I recently started a newsletter in which I gather some cool articles I wrote on a topic, interesting Github repositories, projects, papers and more! I’ll try to send 1 to 2 emails per month. If you want to stay in the loop, just click here : http://eepurl.com/gyYzi5
NEW: I'm looking for motivated Data Scientists to help me build high environmental impact algorithms (CV essentially). Please contact me if you're interested (from my website, contact section)
First of all, if you're not familiar with the key concepts of machine learrning, make sure to check this first article : https://maelfabien.github.io/machinelearning/ml_base/
The repository is organized the following way :
You would like to work on an article with me ? Or you would like me to work on a specific topic ? Feel free to reach out ! ([email protected])
For the moment, these cheat sheets are written manually. I'd like to create a visual content later that would both dive in the maths and illustrate clearly each algorithm.
I have made a series of projects, all of which are available on my blog : https://maelfabien.github.io/portfolio/#
SP - Voice Gender Detection web application: How to extract relevant features and build a voice gender detection application using MFCC, GMMs and a provided dataset.
SP - Sound Visualization (3/3): Dive into spectrograms, chromagrams, tempograms, spectral power density and more...
SP - Sound Feature Extraction (2/3): An overview with a Python implementation of the different sound features to extract.
SP - Introduction to Voice Processing in Python (1/3): Summary of the book "Voice Computing with Python" with concepts, code and examples.
SP - Building a Voice Activity Detection web application : Voice detection can be used to start a voice assistant or in emergency cases for example. Here's how to implement it using simple methods.
CV - Implementing YoloV3 for Object Detection : Learn how to implement YoloV3 and detect objects on your images and videos.
NLP - Easy Question Answering with AllenNLP : Understand the core concepts and create a simple example of Question Answering.
NLP - Data Augmentation in NLP : Details of the implementation of “Easy Data Augmentation” paper.
NLP - Character-level LSTMs to predict gender of first names : 90% accuracy on predictiong the gender of French and US first names.
NLP - Few Shot Text Classification : Implementation of a simple paper that leverages pre-trained models for few shot text classification.
NLP - Improved Few Shot Text Classification : Improving previous results with Data Augmentation and more complex models.
RL - Introduction to Reinforcement Learning : An introduction to the basic building blocks of reinforcement learning.
RL - Markov Decision Process : Overview of Markov Decision Process and Bellman Equation.
RL - Planning by Dynamic Programming : Introduction to Dynamic Programming, including Policy and Value Iteration.
NLP - I trained a Neural Network to speak like me : Having written over 100 articles, I trained a NN to write articles just like me.
DL - How do Neural Networks learn? : Dive into feedforward process and back-propagation.
See MoreArticle Title | Read Time | Article | Code Folder |
---|---|---|---|
The linear regression model (1/2) | 14mn | here | here |
The linear regression model (3/2) | 10mn | here | here |
Basics of Statistical Hypothesis Testing | 5mn | here | --- |
The Logistic Regression | 4mn | here | here |
Statistics in Matlab | 4mn | here | --- |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
The Basics of Machine Learning | 4mn | here | --- |
Bayes Classifier | 1mn | here | --- |
Linear Discriminant Analysis | 3mn | here | --- |
Adaboost and Boosting | 7mn | here | here |
Gradient Boosting Regression | 6mn | here | here |
Gradient Boosting Classification | 3mn | here | --- |
Large Scale Kernel Methods for SVM | 9mn | here | here |
Anomaly Detection | 3mn | here | --- |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
Introduction to Time Series | 4mn | here | here |
Key concepts of Time Series | 4mn | here | here |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
Markov Chains | 9mn | here | here |
Hidden Markov Models | 6mn | here | --- |
Build a language recognition app from scratch | 10mn | here | here |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
Introduction to Graph Mining | 5mn | here | here |
Graph Analysis | 4mn | here | here |
Graph Algorithms | 11mn | here | here |
Graph Learning | 8mn | here | here |
Graph Embedding | 4mn | here | here |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
GridSearch vs. Randomized Search | 2mn | here | --- |
AutoML with h2o | 6mn | here | --- |
Bayesian Hyperparameter Optimization | 7mn | here | here |
Machine Learning Explainability | 12mn | here | --- |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
Introduction to Data Viz | 12mn | here | --- |
Visual Recommendation System | 4mn | here | --- |
Interactive graphs in Python with Altair | 5mn | here | here |
Dynamic plots with BQ-Plot | --- | --- | here |
An interactive tool with Altair | --- | here | --- |
An interactive tool with D3.js | --- | here | --- |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
Introduction to Online Learning | 5mn | here | --- |
Linear Classification | 1mn | here | --- |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
The Rosenbaltt's Perceptron | 8mn | here | here |
Multilayer Perceptron (MLP) | 5mn | here | here |
Prevent Overfitting of Neural Netorks | 6mn | here | --- |
Full introduction to Neural Nets | 6mn | here | --- |
Convolutional Neural Network | 6mn | here | --- |
How do Neural Networks learn? | 3mn | here | --- |
Activation functions in DL | 3mn | here | here |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
Inception Architecture in Keras | 2mn | here | here |
Build an autoencoder using Keras functional API | 5mn | here | --- |
XCeption Architecture | 5mn | here | here |
GANs on the MNIST dataset | --- | --- | here |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
Build an Emotion Recognition WebApp from scratch | 8mn | here | here |
A full guide to Face, Mouth and Eyes Real Time detection | 16mn | here | here |
How to use OpenPose on MacOS ? | 3mn | here | --- |
Introduction to Computer Vision | 1mn | here | --- |
Image Filtering and Image Gradients | 5mn | here | here |
Advanced Filtering and Image Transformation | 5mn | here | --- |
Image Features, Panorama, Matching | 5mn | here | --- |
Implementing YoloV3 for Object Detection | 3mn | here | --- |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
Introduction to NLP | 1mn | here | --- |
Text Pre-Processing | 8mn | here | --- |
Text Embedding with BoW and Tf-Idf | 5mn | here | --- |
Text Embedding with Word2Vec | 6mn | here | --- |
I trained a Neural Network to speak like me | 8mn | here | here |
I trained a Neural Network to speak like me | 8mn | here | here |
Few Shot Text Classification | 10mn | here | here |
Improved Few Shot Text Classification | 9mn | here | here |
Predicting Gender of First Names | 7mn | here | here |
Data Augmentation in NLP | 3mn | here | --- |
Easy Question Answering with AllenNLP | 4mn | here | --- |
Article Title | Read Time | Article | Code Folder |
---|---|---|---|
Introduction to Reinforcement Learning | 6mn | here | --- |
Markov Decision Process | 7mn | here | --- |
Planning by Dynamic Programming | 4mn | here | --- |
Two general articles :
Understanding Computer Components (6mn read) https://maelfabien.github.io/bigdata/comp_components/
Useful Bash commands (1mn read) https://maelfabien.github.io/bigdata/Terminal/
Making your code production ready (1mn read) https://maelfabien.github.io/bigdata/Code/
Article Title | Read Time | Article |
---|---|---|
Introduction to Hadoop | 4mn | here |
MapReduce | 3mn | here |
HDFS | 2mn | here |
VMs in Virtual Box | 1mn | here |
Hadoop with the HortonWorks Sandbox | 2mn | here |
Load and move files to HDFS | 2mn | here |
Launch a MapReduce Job | 2mn | here |
MapReduce Jobs in Python | 3mn | here |
MapReduce Job in Python locally | 1mn | here |
Article Title | Read Time | Article |
---|---|---|
Introduction to Spark | 6mn | here |
Install Spark-Scala and PySpark | 1mn | here |
Discover Spark-Scala | 2mn | here |
Article Title | Read Time | Article |
---|---|---|
A No-SQL project from scratch | 8mn | here |
Big (Open) Data, the GDelt project | 2mn | here |
Install Zeppelin locally | 1mn | here |
Run Zeppelin on AWS EMR | 4mn | here |
Work with S3 buckets | 1mn | here |
Launch and access AWS EC2 instances | 2mn | here |
Install Apache Cassandra on EC2 Cluster | 2mn | here |
Install Zookeeper on EC2 instances | 3mn | here |
Build an ETL in Scala | 3mn | here |
Move Scala Dataframes to Cassandra | 2mn | here |
Move Scala Dataframes to Cassandra | 2mn | here |
Article Title | Read Time | Article |
---|---|---|
AWS Cloud Concepts | 2mn | here |
AWS Core Services | 1mn | here |
Article Title | Read Time | Article |
---|---|---|
TPU Survival Guide on Colab | 8mn | here |
Store files on Google Cloud and Colab | 1mn | here |
TPU Survival Guide on Colab | 8mn | here |
Introduction to GCP (Week 1 Module 1) | 6mn | here |
Lab - Instance VM + Cloud Storage | 3mn | here |
Lab - BigQuery Public Datasets | 1mn | here |
Introduction to Recommendation Systems (Week 1 Module 2) | 4mn | here |
Run Spark jobs on Cloud DataProc (Week 1 Module 2) | 2mn | here |
Lab - Recommend products using Cloud SQL and SparkML | 6mn | here |
Run ML models in SQL with BigQuery ML (Week 1 Module 3) | 6mn | here |
Article Title | Read Time | Article |
---|---|---|
Introduction to ElasticStack | 1mn | here |
Getting Started with ElasticSearch and Kibana | 7mn | here |
Install and run Kibana locally | 1mn | here |
Working with DevTools in ElasticSearch | 9mn | here |
Working with DevTools in ElasticSearch | 9mn | here |
Article Title | Read Time | Article |
---|---|---|
Introduction to Graph Databases | 1mn | here |
A day at Neo4J GraphTour | 7mn | here |
Who's the painter? - For explorium.ai : An illustration of how data enrichment and feature engineering can improve a model.
Machine Learning Interpretability and Explainability (1/2) - For explorium.ai : An introduction to interpretable models with code and examples.
Machine Learning Interpretability and Explainability (2/2) - For explorium.ai : An introduction to explainability in Machine Learning with code and examples.
A guide to Face Detection - For digitalminds.io : An overview of the different techniques face Face Detection in Python (with code).
Modéliser des distributions avec Python (French) - For Stat4Decision: Distribution fitting web application with Streamlit.
Introduction au Traitement Automatique de Language Naturel (TAL) (French) - For Stat4Decision
Boosting and Adaboost clearly explained : https://towardsdatascience.com/boosting-and-adaboost-clearly-explained-856e21152d3e
A guide to Face Detection in Python: https://towardsdatascience.com/a-guide-to-face-detection-in-python-3eab0f6b9fc1
Markov Chains and HMMs: https://towardsdatascience.com/markov-chains-and-hmms-ceaf2c854788
Introduction to Graphs (Part 1): https://towardsdatascience.com/introduction-to-graphs-part-1-2de6cda8c5a5
Graph Algorithms (Part 2): https://towardsdatascience.com/graph-algorithms-part-2-dce0b2734a1d
Graph Algorithms (Part 3): https://towardsdatascience.com/learning-in-graphs-with-python-part-3-8d5513eef62d
I trained a neural network to speak like me: https://towardsdatascience.com/i-trained-a-network-to-speak-like-me-9552c16e2396
Stay tuned :)