Maximum Likelihood Estimation (MLE)
Derivation
Optimization
The following training goal:
corresponds to the Negative Log-Likelihood (NLL) loss function:
which can be optimized based on its gradients:
The following training goal:
corresponds to the Negative Log-Likelihood (NLL) loss function:
which can be optimized based on its gradients: