██FR█████ █INTELL███████████
frenchintelligence.org
transformer-models
Exploring Alternative Architectures for Multi-Token LLM Prediction
July 20, 2025
The Impact of Data Size on Transformer Training: Overfitting & Loss Dynamics
June 21, 2025
Empirical Results: GPT-2 Analysis of Transformer Memorization & Loss
June 21, 2025