Nuranors

Deep technical series and architecture overviews. From first principles to production-level detail, with interactive visualizations throughout.

8-Part Series

LLM Internals

Demystifying how large language models actually work — from the mechanics of tokenization to the algorithms behind alignment and the systems that serve them at scale.

Tokenization · Attention · Training · LoRA · Alignment · Quantization · KV Cache · FlashAttention
8 articles · ~280 min · 30+ demos
8-Part Series

Diffusion & Flow Matching

Demystifying diffusion models and flow matching — from probability foundations and DDPMs to score-based models, SDEs, modern architectures, and state-of-the-art generation.

Generative Models · DDPMs · Score Matching · SDEs · Flow Matching · DiT · Samplers · Applications
8 articles · ~290 min · 30+ demos
8-Part Series

Vision-Language Models

How images and text merge into a single intelligence — from vision transformers and CLIP to multimodal fusion, visual instruction tuning, and spatial grounding.

Vision Foundations · ViT · CLIP · Multimodal Fusion · LLaVA · Training · Grounding · Applications
8 articles · ~280 min · 30+ demos
8-Part Series

Vision-Language-Action Models

Where perception meets physical action — from imitation learning and behavioral cloning to RT-2, OpenVLA, and the foundation models teaching robots to act.

Embodied AI · Imitation Learning · Robot Vision · Language Policies · RT-2 / OpenVLA · Sim-to-Real · Planning · Deployment
8 articles · ~280 min · 30+ demos
Deep-Dive · Field Manual

The Robot Learning Stack

A teardown of the architectures, losses, and training recipes that move modern manipulators — from behavior cloning’s first sin to flow-matched VLAs and the pixel-RL renaissance.

Behavior Cloning · Diffusion Policy · ACT · VLAs · PPO / SAC · Sim-to-Real · World Models · Deployment
25 sections · ~90 min · 15+ demos
Deep-Dive · Deployment

Scaling, Optimizing & Deploying Foundation Models

VLMs, VLAs, and World Models — from architecture internals to quantization, inference acceleration, and production serving. Includes autonomous driving perception/planning and five portfolio projects.

VLM / VLA · World Models · Scaling Laws · Quantization · KV Cache · vLLM / TensorRT · Autonomous Driving · Portfolio Projects
14 sections · ~55 min · 11+ demos
Thinking Machines · May 2026

The Long-Running Agent Stack

The four-lane map: managed harnesses, frameworks, specialized agents, and durable execution. MCP + A2A protocols, 43 platforms compared, 12 concepts shown working.

MCP · A2A · Durable Execution · Agent Harnesses · HITL · Sandboxing
6 chapters · 18 simulations · 43 platforms
Thinking Machines · May 2026

Interaction Models

Real-time human-AI collaboration via 200 ms micro-turns, encoder-free fusion, and streaming sessions.

Micro-turns · dMel · Flow Matching · Batch Invariance · MoE
9 chapters · 10 canvases · 11 papers
Thinking Machines · Oct 2025

On-Policy Distillation

Combining on-policy sampling with dense teacher supervision for 9–30x cheaper training than RL.

Reverse KL · RLHF · Continual Learning · Math Reasoning
8 chapters · 7 canvases
Thinking Machines · Sep 2025

LoRA Without Regret

When LoRA matches full fine-tuning, why all layers matter, and the information theory of RL capacity.

LoRA · PEFT · eNTK · Information Theory · RL
8 chapters · 7 canvases
Thinking Machines · Sep 2025

Modular Manifolds

Constraining weight matrices to Stiefel manifolds with spectral-norm budgeting across layers.

Manifold Optimization · Muon · SVD · Lipschitz
8 chapters · 6 canvases
Thinking Machines · Sep 2025

Defeating Nondeterminism

Why LLM inference is nondeterministic (kernels whose results vary with batch size, not atomic-add races) and how to make it bitwise reproducible.

FP Arithmetic · Batch Invariance · FlashDecoding · On-Policy RL
8 chapters · 6 canvases
12 Interactive Overviews

Modern AI Architecture Atlas

A bird's-eye map of every major AI architecture — interactive diagrams, intuitive explanations, and the research context to navigate the field.

Transformer · Mamba / SSM · Diffusion · Flow Matching · VAE / VQ-VAE · GAN · CLIP / Contrastive · VLM · VLA · World Models · NeRF / 3DGS · Reward / Alignment
12 overviews · ~180 min · 48+ diagrams