In practice
In LLMs the most used one is cross-entropy on next tokens. The loss value shown during training is the top signal to check whether the model is converging or there is a bug. A flat curve almost always means data or hyperparameter issues.
Related terms
Seen in the wild
0 entries mentioning itNo archive entry mentions it explicitly. Appears in broader contexts.