Notation
This document provides the formal notation used in this site. The main goal is to elegantly differentiate between notational clashes across different math topics. For example,
We mostly follow the notations in the Deep Learning Book, which is based on this LaTex file. Differences between our notation and theirs will be highlighted in
This notation elegantly differentiates between scalars
Numbers and Arrays
Notation | Description |
---|---|
A scalar (integer or real) | |
A vector | |
A matrix | |
A tensor | |
Identity matrix with |
|
Identity matrix with dimensionality implied by context | |
Standard basis vector |
|
A square, diagonal matrix with diagonal entries given by |
|
A scalar random variable | |
A vector-valued random variable | |
A matrix-valued random variable |
Sets and Graphs
Notation | Description |
---|---|
A set | |
The set of real numbers | |
The set containing 0 and 1 | |
The set of all integers between |
|
The real interval including |
|
The real interval excluding |
|
Set subtraction, i.e., the set containing the elements of |
|
A graph | |
The parents of |
Indexing
Notation | Description |
---|---|
Element |
|
All elements of vector |
|
Element |
|
Row |
|
Column |
|
Element |
|
2-D slice of a 3-D tensor | |
Element |
Linear Algebra Operations
Notation | Description |
---|---|
Transpose of matrix |
|
Moore-Penrose pseudoinverse of |
|
Element-wise (Hadamard) product of |
|
Determinant of |
Calculus
Notation | Description |
---|---|
Derivative of |
|
Partial derivative of |
|
Gradient of |
|
Matrix derivatives of |
|
Tensor containing derivatives of |
|
Jacobian matrix |
|
The Hessian matrix |
|
Definite integral over the entire domain of |
|
Definite integral with respect to |
Probability and Information Theory
Notation | Description |
---|---|
The random variables |
|
They are conditionally independent given |
|
A probability distribution over a discrete variable | |
A probability distribution over a continuous variable, or over a variable whose type has not been specified | |
Random variable |
|
Expectation of |
|
Variance of |
|
Covariance of |
|
Shannon entropy of the random variable |
|
Kullback-Leibler divergence of P and Q | |
Gaussian distribution over |
Functions
Notation | Description |
---|---|
The function |
|
Composition of the functions |
|
A function of |
|
Natural logarithm of |
|
Logistic sigmoid, |
|
Softplus, |
|
Positive part of |
|
is 1 if the condition is true, 0 otherwise |
Sometimes we use a function
Datasets and Distributions
Notation | Description |
---|---|
The data generating distribution | |
The empirical distribution defined by the training set | |
A set of training examples | |
The |
|
The target associated with |
|
The |