A theoretical discussion of learning theory, specifically the construction of a hypothesis space in ML/DL, and how it can be formulated as a Reproducing Kernel Hilbert Space.
A mathematical overview and practical application of the ADMM optimization algorithm, a useful alternative to Stochastic Gradient Descent (SGD) as a deep learning optimizer.
GPUs can be used effectively for massively parallel, distributed computation, but GPU usage needs to be tailored to your model architecture.
Normalizing vectors of log probabilities is a common task in statistical modeling, but it can result in underflow or overflow when exponentiating large values. The log-sum-exp trick provides a numerically stable way to resolve this issue.
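The trick can be sketched in a few lines of NumPy: subtracting the maximum before exponentiating keeps every exponent at or below zero, and the shift cancels in the final result (the function name `log_softmax` here is illustrative, not from the original):

```python
import numpy as np

def log_softmax(logp):
    """Normalize a vector of log probabilities via the log-sum-exp trick.

    Computes logp - log(sum(exp(logp))) without overflow by shifting
    by the maximum: log(sum(exp(x))) = m + log(sum(exp(x - m))).
    """
    m = np.max(logp)
    # All arguments to exp are <= 0 after the shift, so no overflow occurs.
    lse = m + np.log(np.sum(np.exp(logp - m)))
    return logp - lse

# Naive exponentiation of values like 1000 overflows to inf,
# but the shifted computation is exact.
x = np.array([1000.0, 1000.0])
print(log_softmax(x))  # each entry is log(0.5) ≈ -0.6931
```

Exponentiating the result recovers a properly normalized probability vector, which the naive computation `np.exp(x) / np.exp(x).sum()` cannot produce here because `np.exp(1000.0)` overflows.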