Posts

Anatomy of an LLM: Internals of DeepSeek-V3's Attention Implementation

Math Is Eating the World—And Software Is the Accelerant

Why Are Sines and Cosines Used For Positional Encoding?

Why Is Symmetry So Important in Particle Physics?

Why Tensors? A Beginner's Perspective