December 2025 · Machine Learning

The Geometry of Attention

A geometric intuition for why attention mechanisms work so well across domains, from NLP to vision to audio.

November 2025 · Reinforcement Learning

Understanding GRPO

Group Relative Policy Optimization (GRPO) eliminates the learned value function. Here's why that matters for RL training.