██FR█████ █INTELL███████████
frenchintelligence.org
cross-entropy-loss
The Impact of Data Size on Transformer Training: Overfitting & Loss Dynamics
June 21, 2025
Empirical Results: GPT-2 Analysis of Transformer Memorization & Loss
June 21, 2025