Topic: AI Engineering
All essays filed under "AI Engineering".
-
Open Weights Is Not the Same as Open Source AI
A practical distinction between open-weight AI models and truly open source AI systems, and why the difference matters when choosing local LLMs.
-
What LLMs Do at Inference: A Deep Dive Under the Hood
Updated:A step-by-step, reference-backed explanation of what happens during LLM inference: tokenization, embeddings, prefill & decode phases, KV caching, decoding strategies, bottlenecks and optimizations like quantization, FlashAttention and speculative decoding.
-
GPU vs TPU - Decoding the Battle of AI Accelerators in 2025
Updated:Decoding the Battle of AI Accelerators in 2025
-
Why Does Retrieval-Augmented Generation (RAG) Exist?
Updated:In the rapidly evolving world of artificial intelligence, large language models (LLMs) like GPT-4 or Grok have transformed how we interact with technology.