Subscribe
Sign in
Home
Notes
LLM Engineer's Handbook
Free Courses
Perks
Archive
About
LLM Engineer's Handbook is now thriving!
A framework for building LLM and RAG apps
Oct 22, 2024
•
Paul Iusztin
115
17
Recent posts
View all
Introducing Decoding AI Magazine
New name, same mission: building real-world AI.
Sep 25
•
Paul Iusztin
30
1
AI Agents in 5 Levels of Difficulty
With full code implementation
Sep 23
•
Paolo Perrone
47
3
The 5-Star Lie: You Are Doing AI Evals Wrong
Why binary evals are better than likert scales
Sep 20
•
Hamel Husain
35
9
The Mirage of Generic AI Metrics
Why off-the-shelf evals sabotage your AI product
Sep 13
•
Hamel Husain
49
7
See all
Join for proven content on production-grade AI, GenAI, and information retrieval systems
Subscribe
AI Engineering
View all
The 5-Star Lie: You Are Doing AI Evals Wrong
Why binary evals are better than likert scales
Sep 20
•
Hamel Husain
35
9
The Mirage of Generic AI Metrics
Why off-the-shelf evals sabotage your AI product
Sep 13
•
Hamel Husain
49
7
Getting Agent Architecture Right
Designing real-world AI agents with MCP without burning your time, money, and SANITY.
Sep 6
•
Anca Ioana Muscalagiu
60
8
Build with MCP Like a Real Engineer
Hands-on code for an enterprise AI PR Reviewer by unifying GitHub, Slack, and Asana through MCP.
Aug 30
•
Anca Ioana Muscalagiu
95
6
Why MCP Breaks Old Enterprise AI Architectures
Build from scratch an AI PR reviewer integrated with GitHub, Slack and Asana that scales within your organization.
Aug 19
•
Anca Ioana Muscalagiu
104
10
Information Retrieval
View all
The King of Multi-Modal RAG: ColPali
Query your rich visual PDFs under a RAG web app
Jun 5
•
Juan Ovalle
89
Stop using text-to-SQL for search. Here's why.
Multi-attribute vectors + natural language are changing how we find products online
Jan 16
•
Paul Iusztin
35
Using LLMs to build TikTok-like recommenders
The fifth and final lesson of the “Hands-On Real-time Personalized Recommender” open-source course — a free course that will teach you how to build and…
Jan 9
•
Paolo Perrone
33
1
Deploy scalable TikTok-like recommenders
Ship to the real world an H&M recommender using KServe
Dec 26, 2024
•
Paul Iusztin
27
2
Forget text-to-SQL: Use this natural query instead
Tabular semantic search engine on e-commerce using natural queries
Dec 19, 2024
•
Paul Iusztin
66
4
ML System Design
View all
Stop building AI demos that die
E2E MLOps architecture guide: Real-time fraud detection use case
Mar 22
•
Paul Iusztin
32
2
Monolith vs micro: The $1M ML design decision
The weight of your ML serving architectural choice
Jan 23
•
Paul Iusztin
32
6
ML serving 101: Core architectures
Choose the right architecture for your AI/ML app
Nov 2, 2024
•
Paul Iusztin
34
Build a semantic news search engine with 0 delay
How to build a real-time news search engine using Kafka, vector DBs, RAG and streaming engines.
Sep 21, 2024
•
Paul Iusztin
31
1
Real-time feature pipelines for RAG
RAG hybrid search with transformers-based sparse vectors. CDC tech stack for event-driven architectures.
Aug 17, 2024
•
Paul Iusztin
16
MLOps
View all
The GitHub Issue AI Butler on Kubernetes
A guide to THE production AI stack: LangGraph, K8s, Docker, Guardrails, Qdrant, AWS & CDK
Aug 5
•
Benito Martin
35
2
Deploying LLMs: Cloud, Metal, Serverless
DeepSeek-R1: GCP, Latitude.sh, Modal
May 15
•
Paul Iusztin
and
Louis-François Bouchard
39
2
The 6 MLOps foundational principles
The core MLOps guidelines for production ML
Sep 28, 2024
•
Paul Iusztin
36
2
Experiment Tracking Essentials: Finding the Right Tool
Gradio’s Custom Dashboards vs Wandb’s Built-In Tools for Training Diffusion Models
Sep 7, 2024
•
Anca Ioana Muscalagiu
11
The LLM-Twin Free Course on Production-Ready RAG applications.
Learn how to build a full end-to-end LLM & RAG production-ready system, follow and code along each component by yourself.
Jun 20, 2024
•
Alex Razvant
14
Join for proven content on production-grade AI, GenAI, and information retrieval systems
Subscribe
Newsletter
View all
Introducing Decoding AI Magazine
New name, same mission: building real-world AI.
Sep 25
•
Paul Iusztin
30
1
The Open-Source Stack for AI Agents
9 tool buckets you must know about
Jul 29
•
Paolo Perrone
96
I read Claude’s prompt. Here are 5 tips to master prompt engineering.
The simplest way to find the right LLMs. Copying code from ChatGPT won't make you an AI Engineer.
Jul 19
•
Paul Iusztin
32
3
Why Most AI Agents Fail in Production
And How to Build Ones That Don't
Jul 17
•
Paolo Perrone
48
4
3 things NOT to learn as an AI Engineer
Architecting the observability pipeline of an AI agent
Jul 12
•
Paul Iusztin
32
1
Guests
View all
AI Agents in 5 Levels of Difficulty
With full code implementation
Sep 23
•
Paolo Perrone
47
3
The Real Battle-Tested RAG Playbook
7-steps trusted by OpenAI, Anthropic & Google
Aug 12
•
Jason Liu
89
3
The GitHub Issue AI Butler on Kubernetes
A guide to THE production AI stack: LangGraph, K8s, Docker, Guardrails, Qdrant, AWS & CDK
Aug 5
•
Benito Martin
35
2
Why Most AI Agents Fail in Production
And How to Build Ones That Don't
Jul 17
•
Paolo Perrone
48
4
Stop Building AI Agents
Here’s what you should build instead
Jun 26
•
Hugo Bowne-Anderson
180
12
Join for proven content on production-grade AI, GenAI, and information retrieval systems
Subscribe