This notebook simulates the impact of early peeking on the results of a conversion rate AB test. Early peeking is loosely defined as the practice of checking and concluding the results of an AB test (i.e. based on its p-value, statistical significance, secondary metrics, etc.) before the target sample size and power are reached.
This notebook explores various simulations where we test for the difference in means of two independent gamma distributions, by sampling them and computing the mean of each sample. We will compare two main test methods: the t-test and the Mann-Whitney test.
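As a minimal illustration of the kind of comparison involved (the gamma parameters and sample sizes below are illustrative only, not those used in the notebook), both tests are available in scipy.stats:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Two independent gamma populations with slightly different means
# (mean = shape * scale: 2.0 * 1.0 = 2.0 versus 2.0 * 1.1 = 2.2)
a = rng.gamma(shape=2.0, scale=1.0, size=1000)
b = rng.gamma(shape=2.0, scale=1.1, size=1000)

# Parametric test on the difference in means (Welch's t-test)
t_stat, t_p = stats.ttest_ind(a, b, equal_var=False)

# Non-parametric rank-based alternative
u_stat, u_p = stats.mannwhitneyu(a, b, alternative='two-sided')

print(f"t-test p-value:       {t_p:.4f}")
print(f"Mann-Whitney p-value: {u_p:.4f}")
```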
Implementation of a Gaussian Mixture Model using the Expectation-Maximization algorithm: a vectorized implementation in Python Numpy, compared to the Sklearn implementation on a toy data set.
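A rough, self-contained sketch of the EM loop in Numpy (the initialisation and covariance regularisation choices here are my own simplifications, not necessarily those of the notebook); results can then be checked against sklearn.mixture.GaussianMixture:

```python
import numpy as np
from scipy.stats import multivariate_normal

def em_gmm(X, k, n_iter=50, seed=0):
    """Toy vectorized EM for a Gaussian Mixture Model with full covariances."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    pi = np.full(k, 1.0 / k)                       # mixture weights
    mu = X[rng.choice(n, k, replace=False)]        # random points as initial means
    sigma = np.array([np.cov(X.T) + 1e-6 * np.eye(d) for _ in range(k)])

    for _ in range(n_iter):
        # E-step: responsibilities r, shape (n, k)
        dens = np.column_stack([
            pi[j] * multivariate_normal.pdf(X, mu[j], sigma[j]) for j in range(k)
        ])
        r = dens / dens.sum(axis=1, keepdims=True)

        # M-step: update weights, means and covariances
        nk = r.sum(axis=0)
        pi = nk / n
        mu = (r.T @ X) / nk[:, None]
        for j in range(k):
            diff = X - mu[j]
            sigma[j] = (r[:, j, None] * diff).T @ diff / nk[j] + 1e-6 * np.eye(d)
    return pi, mu, sigma

# Sanity check on a toy 2D data set
from sklearn.datasets import make_blobs
X, _ = make_blobs(n_samples=500, centers=3, random_state=0)
pi, mu, sigma = em_gmm(X, k=3)
```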
This notebook explores the well-known AdaBoost M1 algorithm, which combines several weak classifiers to create a better overall classifier. The notebook consists of three main sections: a review of the AdaBoost M1 algorithm and an intuitive visualization of its inner workings; an implementation from scratch in Python, using a Sklearn decision tree stump as the weak classifier; and a discussion of the trade-off between the learning rate and the number of weak classifiers.
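For a flavour of the from-scratch section, here is a minimal sketch of the boosting loop with ±1 labels and Sklearn stumps; the learning rate simply scales each stump's weight, and the exact formulation in the notebook may differ:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def adaboost_m1(X, y, n_estimators=50, learning_rate=1.0):
    """Toy AdaBoost with decision stumps; y must be encoded as -1 / +1."""
    n = len(y)
    w = np.full(n, 1.0 / n)                  # uniform initial sample weights
    stumps, alphas = [], []
    for _ in range(n_estimators):
        stump = DecisionTreeClassifier(max_depth=1).fit(X, y, sample_weight=w)
        pred = stump.predict(X)
        err = np.clip(np.sum(w * (pred != y)), 1e-10, 1 - 1e-10)   # weighted error
        alpha = learning_rate * 0.5 * np.log((1 - err) / err)
        w = w * np.exp(-alpha * y * pred)    # up-weight misclassified points
        w = w / w.sum()
        stumps.append(stump)
        alphas.append(alpha)

    def predict(X_new):
        agg = sum(a * s.predict(X_new) for a, s in zip(alphas, stumps))
        return np.sign(agg)
    return predict

# Example usage on a toy data set
from sklearn.datasets import make_moons
X, y01 = make_moons(n_samples=300, noise=0.2, random_state=0)
y = 2 * y01 - 1
predict = adaboost_m1(X, y, n_estimators=100)
print("Training accuracy:", np.mean(predict(X) == y))
```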
This notebook explores chapter 8 of the book "Introduction to Statistical Learning" and aims to reproduce several of its key figures and discussion topics. Of particular interest are the graphviz library, used to visualize the resulting trees, and GridSearch from the Sklearn library, used to plot the validation curves.
Following the series on SVMs, we will now explore the theory and intuition behind kernels and feature maps, showing the link between the two as well as their advantages and disadvantages. The notebook is divided into two main sections: 1. Theory, derivations and pros and cons of the two concepts. 2. An intuitive and visual interpretation in 3 dimensions.
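As a tiny numerical illustration of the link between the two concepts, the degree-2 polynomial kernel in R^2 can be reproduced exactly by an explicit 3-dimensional feature map (the feature map below is one common choice, used purely for illustration):

```python
import numpy as np

def kernel(x, z):
    """Degree-2 polynomial kernel k(x, z) = (x . z)**2 for x, z in R^2."""
    return np.dot(x, z) ** 2

def feature_map(x):
    """Explicit map phi: R^2 -> R^3 such that phi(x) . phi(z) = k(x, z)."""
    x1, x2 = x
    return np.array([x1 ** 2, x2 ** 2, np.sqrt(2) * x1 * x2])

x = np.array([1.0, 2.0])
z = np.array([3.0, -1.0])

print(kernel(x, z))                            # 1.0, computed implicitly in R^2
print(np.dot(feature_map(x), feature_map(z)))  # 1.0, same value computed in R^3
```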
In this second notebook on SVMs we will walk through the implementation of both the hard margin and soft margin SVM algorithms in Python using the well-known CVXOPT library. While the algorithm in its mathematical form is rather straightforward, its implementation in matrix form using the CVXOPT API can be challenging at first. This notebook will show the steps required to derive the appropriate vectorized notation as well as the inputs needed for the API.
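A condensed sketch of the hard margin dual as a CVXOPT quadratic program; variable names and tolerances are illustrative, and it assumes linearly separable data with ±1 labels:

```python
import numpy as np
from cvxopt import matrix, solvers

def hard_margin_svm(X, y):
    """Hard margin SVM dual:  min 1/2 a^T P a - 1^T a
    s.t.  a_i >= 0  and  y^T a = 0,  with  P_ij = y_i y_j x_i . x_j."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)      # labels must be +1 / -1
    n = X.shape[0]

    K = X @ X.T                         # linear kernel Gram matrix
    P = matrix(np.outer(y, y) * K)
    q = matrix(-np.ones(n))
    G = matrix(-np.eye(n))              # encodes a_i >= 0 as -a_i <= 0
    h = matrix(np.zeros(n))
    A = matrix(y, (1, n))               # equality constraint y^T a = 0
    b = matrix(0.0)

    sol = solvers.qp(P, q, G, h, A, b)
    a = np.ravel(sol['x'])

    sv = a > 1e-5                       # support vectors have non-zero multipliers
    w = (a * y)[sv] @ X[sv]
    b0 = np.mean(y[sv] - X[sv] @ w)
    return w, b0, a
```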
In this first notebook on the topic of Support Vector Machines, we will explore the intuition behind the weights and coefficients by solving a simple SVM problem by hand.
Exploring the theory and implementation behind two well-known generative classification algorithms: Linear Discriminant Analysis (LDA) and Quadratic Discriminant Analysis (QDA). This notebook uses the Iris dataset as a case study for comparing and visualizing the prediction boundaries of the two algorithms.
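The Sklearn baselines that such a comparison rests on can be fit in a few lines; restricting to two features here is only so that the decision boundaries live in two dimensions:

```python
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import (LinearDiscriminantAnalysis,
                                           QuadraticDiscriminantAnalysis)

X, y = load_iris(return_X_y=True)
X = X[:, :2]                      # keep two features so boundaries can be plotted

lda = LinearDiscriminantAnalysis().fit(X, y)
qda = QuadraticDiscriminantAnalysis().fit(X, y)

print("LDA training accuracy:", lda.score(X, y))
print("QDA training accuracy:", qda.score(X, y))
```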
In this short notebook, we will reuse the Iris dataset example and instead implement a Gaussian Naive Bayes classifier using the Pandas, Numpy and Scipy.stats libraries. Results are then compared to the Sklearn implementation as a sanity check.
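A compact sketch of the idea, assuming each feature is modelled as an independent Gaussian per class (a simplified version for illustration, not the notebook's exact code):

```python
import numpy as np
from scipy.stats import norm
from sklearn.datasets import load_iris
from sklearn.naive_bayes import GaussianNB

X, y = load_iris(return_X_y=True)
classes = np.unique(y)

# Per-class priors, feature means and standard deviations
priors = np.array([np.mean(y == c) for c in classes])
mus    = np.array([X[y == c].mean(axis=0) for c in classes])
sigmas = np.array([X[y == c].std(axis=0) for c in classes])

def predict(X_new):
    # log posterior (up to a constant): log P(c) + sum_j log N(x_j | mu, sigma)
    log_post = np.log(priors) + np.array([
        norm.logpdf(X_new, mus[i], sigmas[i]).sum(axis=1)
        for i in range(len(classes))
    ]).T
    return classes[np.argmax(log_post, axis=1)]

print("Agreement with Sklearn:",
      np.mean(predict(X) == GaussianNB().fit(X, y).predict(X)))
```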
This notebook summarises the theory and the derivation of the optimal Bayes classifier. It then provides a comparison of the decision boundaries of the optimal and Naive Bayes classifiers.
The Multivariate Gaussian appears frequently in Machine Learning and this notebook aims to summarize the full derivation of its Maximum Likelihood Estimator.
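The end point of that derivation is the sample mean and the biased (1/N) sample covariance; a quick numerical sanity check of those estimators on simulated data:

```python
import numpy as np

rng = np.random.default_rng(0)
mu_true = np.array([1.0, -2.0])
sigma_true = np.array([[2.0, 0.6],
                       [0.6, 1.0]])
X = rng.multivariate_normal(mu_true, sigma_true, size=100_000)

# Maximum likelihood estimators: sample mean and 1/N (biased) covariance
mu_hat = X.mean(axis=0)
sigma_hat = (X - mu_hat).T @ (X - mu_hat) / len(X)

print(mu_hat)      # close to mu_true
print(sigma_hat)   # close to sigma_true (np.cov uses 1/(N-1) by default)
```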
Following the blog post where we derived the closed form solution for lasso coordinate descent, we will now implement it in Python Numpy and visualize the path taken by the coefficients as a function of lambda. Our results are also compared to the Sklearn implementation as a sanity check.
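A minimal sketch of the cyclic update with the soft-thresholding operator, written here for the objective 1/2 ||y - Xb||^2 + lambda * ||b||_1 without an intercept (Sklearn scales the penalty by 1/n, so its alpha corresponds to a rescaled lambda):

```python
import numpy as np

def soft_threshold(rho, lam):
    """S(rho, lam) = sign(rho) * max(|rho| - lam, 0)."""
    return np.sign(rho) * np.maximum(np.abs(rho) - lam, 0.0)

def lasso_coordinate_descent(X, y, lam, n_iter=100):
    """Cyclic coordinate descent for 1/2 ||y - Xb||^2 + lam * ||b||_1."""
    n, p = X.shape
    beta = np.zeros(p)
    z = (X ** 2).sum(axis=0)                    # x_j^T x_j for each coordinate
    for _ in range(n_iter):
        for j in range(p):
            # Partial residual excluding coordinate j
            r_j = y - X @ beta + X[:, j] * beta[j]
            rho_j = X[:, j] @ r_j
            beta[j] = soft_threshold(rho_j, lam) / z[j]
    return beta
```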
This short notebook offers a visual intuition of the similarities and differences between Ridge and Lasso regression. In particular, we will plot the contours of the Ordinary Least Squares (OLS) cost function, together with the L2 and L1 cost functions.
This post describes how to derive the solution to the Lasso regression problem when using coordinate gradient descent. It also provides intuition and a summary of the main properties of subdifferentials and subgradients. Code to generate the figure is in Python.
Description of the algorithm and derivation of the implementation of Coordinate descent for linear regression in Python. Visualization of the "staircase" steps using surface and contour plots as well as a simple animation. This implementation will serve as a step towards more complex use cases such as Lasso.
This notebook is the first of a series exploring regularization for linear regression, and in particular ridge and lasso regression. We will focus here on ridge regression, with some notes on the background theory, the mathematical derivations and a Python Numpy implementation. Finally, we will provide visualizations of the cost functions with and without regularization to help build intuition as to why ridge regression is a remedy for poor conditioning and numerical instability.
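As a small illustration of the conditioning argument, the closed-form ridge estimate simply shifts the eigenvalues of X^T X away from zero; the data below is synthetic and deliberately near-collinear:

```python
import numpy as np

def ridge_closed_form(X, y, lam):
    """Ridge estimate b = (X^T X + lam * I)^(-1) X^T y (no intercept)."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))
X[:, 4] = X[:, 3] + 1e-6 * rng.normal(size=50)     # nearly collinear columns
y = X @ np.array([1.0, 0.0, 0.0, 2.0, 2.0]) + rng.normal(size=50)

print(np.linalg.cond(X.T @ X))                     # huge: ill-conditioned
print(np.linalg.cond(X.T @ X + 1.0 * np.eye(5)))   # much smaller with regularization
print(ridge_closed_form(X, y, lam=1.0))
```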
In this notebook we explore some methods for selecting subsets of predictors, including best subset and stepwise selection procedures. Code and figures are inspired by chapter 6 of the book ISLR, converted into Python.
Animation of gradient descent in Python using Matplotlib for contour and 3D plots. This particular example uses polynomial regression with ridge regularization
Introduction, theory and mathematical derivation of a vectorized implementation of Loess regression. Comparison of different implementations in Python and visualization of the results on a noisy sine wave.
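A naive, loop-based sketch of the local weighted fit at the heart of Loess (the notebook's version is vectorized), using a tricube kernel and a noisy sine wave similar in spirit to the example:

```python
import numpy as np

def loess(x, y, frac=0.3):
    """Naive local linear (Loess-style) smoother with a tricube kernel."""
    n = len(x)
    k = max(2, int(frac * n))                    # points in each local neighbourhood
    y_hat = np.empty(n)
    A = np.column_stack([np.ones(n), x])         # design matrix for a local line a + b*x
    for i, x0 in enumerate(x):
        d = np.abs(x - x0)
        h = np.sort(d)[k - 1]                    # local bandwidth
        w = np.clip(1 - (d / h) ** 3, 0, 1) ** 3 # tricube weights
        W = np.diag(w)
        beta = np.linalg.solve(A.T @ W @ A, A.T @ W @ y)   # weighted least squares
        y_hat[i] = beta[0] + beta[1] * x0
    return y_hat

# Noisy sine wave example
x = np.linspace(0, 2 * np.pi, 200)
rng = np.random.default_rng(0)
y = np.sin(x) + rng.normal(scale=0.3, size=x.size)
smooth = loess(x, y, frac=0.3)
```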
Introductory optimization algorithms implemented in Python Numpy and their corresponding visualizations using Matplotlib. A case study comparison between Gradient Descent and Newton's method using the Rosenbrock function.
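A bare-bones version of the case study: gradient descent with a fixed step size against pure Newton steps on the Rosenbrock function (step size, starting point and iteration counts below are illustrative):

```python
import numpy as np

def rosenbrock(p):
    x, y = p
    return (1 - x) ** 2 + 100 * (y - x ** 2) ** 2

def grad(p):
    x, y = p
    return np.array([-2 * (1 - x) - 400 * x * (y - x ** 2),
                     200 * (y - x ** 2)])

def hessian(p):
    x, y = p
    return np.array([[2 - 400 * y + 1200 * x ** 2, -400 * x],
                     [-400 * x,                     200.0]])

start = np.array([-1.0, 1.0])

p_gd = start.copy()
for _ in range(10_000):
    p_gd = p_gd - 1e-3 * grad(p_gd)              # gradient descent, fixed step size

p_newton = start.copy()
for _ in range(10):
    p_newton = p_newton - np.linalg.solve(hessian(p_newton), grad(p_newton))

print("Gradient descent:", p_gd)      # crawls along the valley towards (1, 1)
print("Newton's method: ", p_newton)  # reaches (1, 1) in a handful of iterations
```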
Statistical inference on multiple linear regression in Python using Numpy, Statsmodels and Sklearn. Implementation of model selection, study of multicollinearity and residuals analysis.
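As a short illustration of the tooling involved, Statsmodels provides both the full inference summary and variance inflation factors for diagnosing multicollinearity; the data below is synthetic, with one deliberately collinear column:

```python
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
X[:, 2] = X[:, 0] + 0.05 * rng.normal(size=100)   # deliberately collinear predictor
y = 2 + X @ np.array([1.0, -0.5, 0.0]) + rng.normal(size=100)

X_const = sm.add_constant(X)
model = sm.OLS(y, X_const).fit()
print(model.summary())                            # coefficients, t-stats, p-values, R^2

# Variance inflation factors flag the collinear predictors
print([variance_inflation_factor(X_const, i) for i in range(1, X_const.shape[1])])
```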
Implementation of statistical inference in Python using Numpy, Statsmodels and Sklearn. Detailed breakdown of the formulae used and the main assumptions behind the model. Some nice graphs on the leverage and influence of observations.