Notation
This document provides the formal notation used in this site. The main goal is to elegantly differentiate between notational clashes across different math topics. For example,
We mostly follow the notations in the Deep Learning Book, which is based on this LaTex file. Differences between our notation and theirs will be highlighted in
This notation elegantly differentiates between scalars
Numbers and Arrays
| Notation | Description |
|---|---|
| A scalar (integer or real) | |
| A vector | |
| A matrix | |
| A tensor | |
| Identity matrix with |
|
| Identity matrix with dimensionality implied by context | |
| Standard basis vector |
|
| A square, diagonal matrix with diagonal entries given by |
|
| A scalar random variable | |
| A vector-valued random variable | |
| A matrix-valued random variable |
Sets and Graphs
| Notation | Description |
|---|---|
| A set | |
| The set of real numbers | |
| The set containing 0 and 1 | |
| The set of all integers between |
|
| The real interval including |
|
| The real interval excluding |
|
| Set subtraction, i.e., the set containing the elements of |
|
| A graph | |
| The parents of |
Indexing
| Notation | Description |
|---|---|
| Element |
|
| All elements of vector |
|
| Element |
|
| Row |
|
| Column |
|
| Element |
|
| 2-D slice of a 3-D tensor | |
| Element |
Linear Algebra Operations
| Notation | Description |
|---|---|
| Transpose of matrix |
|
| Moore-Penrose pseudoinverse of |
|
| Element-wise (Hadamard) product of |
|
| Determinant of |
Calculus
| Notation | Description |
|---|---|
| Derivative of |
|
| Partial derivative of |
|
| Gradient of |
|
| Matrix derivatives of |
|
| Tensor containing derivatives of |
|
| Jacobian matrix |
|
| The Hessian matrix |
|
| Definite integral over the entire domain of |
|
| Definite integral with respect to |
Probability and Information Theory
| Notation | Description |
|---|---|
| The random variables |
|
| They are conditionally independent given |
|
| A probability distribution over a discrete variable | |
| A probability distribution over a continuous variable, or over a variable whose type has not been specified | |
| Random variable |
|
| Expectation of |
|
| Variance of |
|
| Covariance of |
|
| Shannon entropy of the random variable |
|
| Kullback-Leibler divergence of P and Q | |
| Gaussian distribution over |
Functions
| Notation | Description |
|---|---|
| The function |
|
| Composition of the functions |
|
| A function of |
|
| Natural logarithm of |
|
| Logistic sigmoid, |
|
| Softplus, |
|
| Positive part of |
|
| is 1 if the condition is true, 0 otherwise |
Sometimes we use a function
Datasets and Distributions
| Notation | Description |
|---|---|
| The data generating distribution | |
| The empirical distribution defined by the training set | |
| A set of training examples | |
| The |
|
| The target associated with |
|
| The |