Blog — Saran

Tags: FounderOS, agents, ai, building, data-engineering, design, engineering, founder, frameworks, gpt-2, linguistics, llama, llm, mindset, ml, model-atlas, mozhi-ai, nlp, product, pytorch, reinforcement-learning, tamil, training, transformers

Building Llama 3.2 From Scratch (How Modern LLMs Improved on GPT-2)
Week 2 of the model-atlas series: rebuild a Llama-style decoder block in PyTorch and see why RoPE, RMSNorm, GQA, and SwiGLU became the modern default.

→ Apr 4, 2026

Building GPT-2 From Scratch (and Loading Real Weights)
Week 1 of a 24-model series: implement every layer in PyTorch, load OpenAI's checkpoint, and see why today's LLMs are still this architecture.

→ Mar 30, 2026

Seven Hidden Faults in Every Tamil NLP Pipeline
Unicode fragmentation, mojibake, agglutination explosions, and the romanized web your model never saw: a field audit of what goes wrong before training.

→ Mar 28, 2026

Building a Tiny Tamil GPT From Scratch
What I learned training a decoder-only Transformer on my own data.

→ Mar 15, 2026

The Anatomy of an RL Environment — How AI Agents Actually Learn to Write Better Code
Most people think training an AI agent is about feeding it data and hoping it gets smarter. It's not. Here's what a real RL environment looks like under the hood.

→ Mar 15, 2026

Building FounderOS - Agents
Sharing the journey is part of the process. Here's why I decided to document everything I build.

→ Mar 15, 2026

Why I'm building in public
Sharing the journey is part of the process. Here's why I decided to document everything I build.

→ Mar 10, 2026

Notes on product thinking
The mental models I keep coming back to when building products people actually want.