Yifan's Blog

Notes on Language Models and Deep Learning.

Latest Posts

ShortSWA Is the Next-Generation N-gram Embedding

Yifan Zhang

Yifan's Blog, January 12, 2026

Revisiting Variance Reduction in Policy Gradients for LLM Reinforcement Learning

Yifan Zhang, Quanquan Gu

Yifan's Blog, December 27, 2025

Matrix Exponential Attention

Yifan Zhang

Yifan's Blog, December 15, 2025

More posts coming soon.