0% found this document useful (0 votes)

81 views23 pages

Module 4 - RAG (Retrieval Augmented Generation) - PEC GenAI Course

The PEC Generative AI Training Program focuses on Retrieval-Augmented Generation (RAG), which combines retrieval systems with generative AI models to enhance response accuracy and relevance. The program covers key components of RAG, including retrievers, generators, and feedback loops, alongside various RAG workflows such as Standard, Corrective, Speculative, and Agentic RAG. Additionally, it compares RAG with fine-tuning methods, highlighting their respective strengths and weaknesses in application development.

Uploaded by

regata4

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views23 pages

Module 4 - RAG (Retrieval Augmented Generation) - PEC GenAI Course

Uploaded by

regata4

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

PEC Generative AI Training

Training Program
Module 4: Training Generative AI Application to Your Needs,
May 05, 2025
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Module 3: Building Generative AI Applications

To Your Needs
➢ What is RAG ?
➢ Why we need RAG
➢ Important Terminologies in RAG (Key Components)
➢ How RAG works ? (WorkFlow in RAG)
➢ Types
➢ Comparison
➢ Fine Tuning (Alternative Of RAG)
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

What is RAG?
➢ RAG stands for Retrieval-Augmented Generation.
➢ It combines retrieval systems with Generative AI
models to produce accurate and relevant
responses.
➢ It is particularly useful for applications that require
up-to-date, fact-based, or domain-specific
responses.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Why we need RAG ?

➢ Hallucination (Incorrect Information), when an AI model
generates incorrect or misleading results. This can
happen in any type of AI model, including natural
language processing (NLP) models and computer vision
models.
➢ Data Staleness The model's inability to provide updated
information because it was trained on a fixed dataset that
does not include newer data.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Important Terminologies in RAG (Key Components)

Retriever:
(But there is something which is done before, Let’s See that First)

➢ Searches for relevant information from external knowledge bases or

datasets.

Generator:

➢ Uses the retrieved information to create coherent and accurate responses.

Feedback Loop: (Optional)

➢ Optional mechanism to refine outputs iteratively.

Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Preprocessing Before Retrieval

1. Chunking
• What it is:
Breaking large documents or datasets into smaller,
manageable pieces (chunks).
• Why it’s needed:
• Large text blocks are difficult to process efficiently.
• Helps maintain context and relevance in retrieval.
• Example:
• A 10,000-word article might be divided into 500-word chunks.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

2. Tokenization
• What it is:
Splitting text into smaller units called tokens (e.g., words,
phrases, or characters).
• Why it’s needed:
• Allows text to be processed numerically for embedding and search.
• Prepares the text for the embedding model.
• Example:
• "Retrieval-Augmented Generation" →
["Retrieval", "-", "Augmented", "Generation"]
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

3. Embedding
• What it is:
Converting text chunks into dense numerical vectors using pre-
trained models (e.g., Sentence Transformers, OpenAI
Embedding API).
• Why it’s needed:
• Vectors represent semantic meaning, enabling efficient similarity
search.
• These embeddings capture the context of the text.
• Where it's stored:
• Store embeddings in vector databases (e.g., FAISS, Pinecone,
Weaviate, ChromaDB).
• These databases allow quick and efficient similarity searches.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Important Terminologies in RAG (Key Components)

Retriever
● The retriever is responsible for finding the most relevant
information from an external knowledge base, database, or
document store.
● It uses methods like vector similarity search (e.g., FAISS,
ElasticSearch) or traditional keyword matching to locate data
relevant to the input query.
● Why it’s important:
○ Ensures the generative model has access to accurate and
contextually appropriate information to base its response.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Important Terminologies in RAG (Key Components)

Generator
● The generator is a pre-trained language model (e.g., GPT, BERT,
T5 or from Groq) that creates responses by incorporating the
retrieved information.
● It synthesizes retrieved data and transforms it into human-like,
coherent text.
● Why it’s important:
○ Acts as the "voice" of the system, converting raw retrieved
data into usable, conversational, or actionable outputs.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Important Terminologies in RAG (Key Components)

Feedback Loop (Optional)

● A mechanism to iteratively refine the output by re-querying the
retriever or adjusting the generator’s response based on user
feedback or model evaluation.
● Why it’s important:
○ Helps improve the accuracy and relevance of responses over
time.
○ Critical for applications requiring high precision, like healthcare
or legal advisory systems.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

How RAG works

( WorkFlow Diagram )
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Standard RAG
➢ Combines retrieval with generation in a straightforward manner.

Workflow:

1. Input query.
2. Retrieve relevant documents.
3. Generate response using retrieved documents.

Use Case:
● Question answering using enterprise knowledge bases

(Already Seen Above)

Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Corrective RAG
➢ Enhances response accuracy by correcting errors in real-time.

Workflow:

1. Generate an initial response.

2. Identify errors using retrieval.
3. Correct errors based on retrieved facts.

Use Case:
● Customer support chatbots with high accuracy requirements.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Corrective RAG
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Speculative RAG
➢ Prioritizes efficiency by speculating which documents are relevant
without full retrieval.
Workflow:

1. Model predicts relevance without actual retrieval.

2. Generates speculative output.

Advantages:
● Faster responses at the cost of potential accuracy.

Use Case:
● Real-time conversational AI with high-speed requirements.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Speculative RAG
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Agentic RAG
➢ Adds decision-making capabilities to the RAG model.

Workflow:

1. Retrieve information.
2. Evaluate context and goals.
3. Generate adaptive and strategic responses.

Use Case:
● Virtual assistants for decision-making tasks.
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Agentic RAG
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Comparison

Technique Focus Strengths Weaknesses

Standard RAG Simplicity Easy to implement Limited adaptability

Corrective RAG Accuracy Error correction in real- Slower responses

time

Speculative RAG Efficiency Faster responses Risk of inaccuracies

Agentic RAG Decision-making Strategic outputs Higher complexity

Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Fine Tuning VS RAG

Aspect Fine Tuning RAG
Definition Modifies a pre-trained model by Combines a pre-trained model with
training it on new data. external knowledge retrieval

Purpose Customizes the model for a specific Enhances responses dynamically with
task external information.

Data Dependency Requires training on task-specific Uses external data stored in a vector
data. database or index.

Flexibility Requires retraining for updates or Dynamically updates responses without

new data. retraining.

Computational Cost High, due to additional training Low, as it uses pre-trained models with
requirements, High GPU, CPU req. retrieval.

Example Use Case Creating a specialized application for Answering questions about frequently
a specific domain e.g (health care) updated knowledge (e.g., news, chatbot).
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Practical
Demo
RAG
Leading Engineers Forward: PEC Generative AI Training Program - Cohort 2

Trainer: Sajjad Ahmad, Inam ur Rehman

LinkedIn:
https://www.linkedin.com/in/muhammmad-talha/

Natural Language Processing
No ratings yet
Natural Language Processing
11 pages
Transcript For Explaining Retrieval-Augmented Generation (RAG) To Colleagues
No ratings yet
Transcript For Explaining Retrieval-Augmented Generation (RAG) To Colleagues
6 pages
RAG (Generative AI) - A "Rags To Riches" Moment For Artificial Intelligence - by Kanishk Khatter - Medium
No ratings yet
RAG (Generative AI) - A "Rags To Riches" Moment For Artificial Intelligence - by Kanishk Khatter - Medium
12 pages
Advanced Gen-AI Development
No ratings yet
Advanced Gen-AI Development
57 pages
GenAI PDF
No ratings yet
GenAI PDF
34 pages
Understanding Retrieval-Augmented Generation (RAG)
No ratings yet
Understanding Retrieval-Augmented Generation (RAG)
12 pages
Building Blocks of Rag Ebook Final
100% (2)
Building Blocks of Rag Ebook Final
9 pages
RAG vs GPT: A Comprehensive Guide
No ratings yet
RAG vs GPT: A Comprehensive Guide
8 pages
A Comprehensive Guide To Building Agentic RAG Systems With LangGraph
No ratings yet
A Comprehensive Guide To Building Agentic RAG Systems With LangGraph
23 pages
Retrieval Augmented Generation (RAG) For Everyone
No ratings yet
Retrieval Augmented Generation (RAG) For Everyone
57 pages
Developing Retrieval Augmented Generation (RAG) Based LLM Systems From Pdfs - An Expert Report
No ratings yet
Developing Retrieval Augmented Generation (RAG) Based LLM Systems From Pdfs - An Expert Report
36 pages
Challenge
No ratings yet
Challenge
8 pages
WWW Cohesity Com Glossary Retrieval-Augmented-Generation-Rag
No ratings yet
WWW Cohesity Com Glossary Retrieval-Augmented-Generation-Rag
5 pages
Rag
No ratings yet
Rag
10 pages
Minor Proj
No ratings yet
Minor Proj
15 pages
Privacy First RAG Closed-Loop LLMs For Industrial Data Security
No ratings yet
Privacy First RAG Closed-Loop LLMs For Industrial Data Security
12 pages
Github - Blog - Ai and ML - Generative Ai - What Is Retrieval Augmented Generation and What Does It Do For Generative Ai
No ratings yet
Github - Blog - Ai and ML - Generative Ai - What Is Retrieval Augmented Generation and What Does It Do For Generative Ai
14 pages
RAG for NLP Experts
No ratings yet
RAG for NLP Experts
2 pages
RAG Retrieval-Augmented Generation
No ratings yet
RAG Retrieval-Augmented Generation
12 pages
RAG Architecture
100% (11)
RAG Architecture
52 pages
Document 2
No ratings yet
Document 2
12 pages
What Is Retrieval-Augmented Generation (RAG)
No ratings yet
What Is Retrieval-Augmented Generation (RAG)
12 pages
Blue Futuristic Artificial Intelligence Presentation
No ratings yet
Blue Futuristic Artificial Intelligence Presentation
8 pages
Chapters
No ratings yet
Chapters
7 pages
RAG Slide ENG
No ratings yet
RAG Slide ENG
41 pages
RAG - Genai
No ratings yet
RAG - Genai
11 pages
Retrieval Augmented Generation - A Simple Introduction
No ratings yet
Retrieval Augmented Generation - A Simple Introduction
82 pages
WWW Oracle Com in Artificial-Intelligence Generative-Ai Retrieval-Augmented-Generation-Rag
No ratings yet
WWW Oracle Com in Artificial-Intelligence Generative-Ai Retrieval-Augmented-Generation-Rag
7 pages
Understanding RAG AI
No ratings yet
Understanding RAG AI
6 pages
RAG Understanding PDF
No ratings yet
RAG Understanding PDF
12 pages
RAG - The Future of LLMs - LinkedIn
No ratings yet
RAG - The Future of LLMs - LinkedIn
7 pages
Survey on Retrieval-Augmented Generation for AI Content
No ratings yet
Survey on Retrieval-Augmented Generation for AI Content
22 pages
The Complete Guide To RAG
No ratings yet
The Complete Guide To RAG
27 pages
Retrieval-Augmented Generation For AI-Generated Content A Survey
No ratings yet
Retrieval-Augmented Generation For AI-Generated Content A Survey
28 pages
Understanding the RAG Model in NLP
No ratings yet
Understanding the RAG Model in NLP
4 pages
CrateDB and LangChain
No ratings yet
CrateDB and LangChain
14 pages
AGENTIC RAG-Tech Stack
No ratings yet
AGENTIC RAG-Tech Stack
18 pages
RAG - A Simple Introduction
100% (6)
RAG - A Simple Introduction
75 pages
Tyjt
No ratings yet
Tyjt
2 pages
RAG and Its Variants - Graph RAG Light RAG and Agentic RAG
No ratings yet
RAG and Its Variants - Graph RAG Light RAG and Agentic RAG
16 pages
12 Essential RAG Types 1735544647
No ratings yet
12 Essential RAG Types 1735544647
29 pages
Practical RAG
No ratings yet
Practical RAG
127 pages
(Retrieval Augmented Generation) : by Uttam Grade
No ratings yet
(Retrieval Augmented Generation) : by Uttam Grade
6 pages
RAG Deep-Dive Research Report
No ratings yet
RAG Deep-Dive Research Report
46 pages
Title
No ratings yet
Title
2 pages
GenAI RAG: A Comprehensive Overview
No ratings yet
GenAI RAG: A Comprehensive Overview
12 pages
A Powerful Technique For Improved Text Generation and Efficiency
No ratings yet
A Powerful Technique For Improved Text Generation and Efficiency
14 pages
Medium
No ratings yet
Medium
22 pages
Model Training and Fine Tuning
No ratings yet
Model Training and Fine Tuning
11 pages
RAG Cheat Sheet-2
No ratings yet
RAG Cheat Sheet-2
29 pages
LangChain & RAG - U1
No ratings yet
LangChain & RAG - U1
32 pages
Advanced RAG Techniques for LLM Apps
No ratings yet
Advanced RAG Techniques for LLM Apps
54 pages
RAG Detailed Overview
No ratings yet
RAG Detailed Overview
3 pages
RAG and Vector Database Guide
No ratings yet
RAG and Vector Database Guide
18 pages
Learning: Gen Ai
No ratings yet
Learning: Gen Ai
6 pages
Generative AI PPT Final
No ratings yet
Generative AI PPT Final
34 pages
How To Build AI Driven Knowledge Assistants
100% (1)
How To Build AI Driven Knowledge Assistants
24 pages
RAG First Month Assessment GenAI
No ratings yet
RAG First Month Assessment GenAI
3 pages
Zhao Et Al (2024) - Retrieval-Augmented Generation For AI-Generated Content
No ratings yet
Zhao Et Al (2024) - Retrieval-Augmented Generation For AI-Generated Content
21 pages
PEC GenAI Course - Making A Streamlit Application and Deploying On Huggingface1
No ratings yet
PEC GenAI Course - Making A Streamlit Application and Deploying On Huggingface1
15 pages
PEC Generative AI Training - Week 03 Day 03 Tahir Ali Bhutto
No ratings yet
PEC Generative AI Training - Week 03 Day 03 Tahir Ali Bhutto
4 pages
Poster_Scientific Research and Innovation_IIoT (1)
No ratings yet
Poster_Scientific Research and Innovation_IIoT (1)
1 page
SDP Poster for Solar Powered Gas Monitoring Drone
No ratings yet
SDP Poster for Solar Powered Gas Monitoring Drone
1 page
Module 1 - Intro To GenAI - PEC - Gen - AI - Training
No ratings yet
Module 1 - Intro To GenAI - PEC - Gen - AI - Training
49 pages
Mega 3 Days 17-03-2024 @LuLu Jeddah, Tabuk & Yanbu
No ratings yet
Mega 3 Days 17-03-2024 @LuLu Jeddah, Tabuk & Yanbu
6 pages
Ramdan Season Vol 3 at LuLu Jeddah, Tabuk & Yanbu
No ratings yet
Ramdan Season Vol 3 at LuLu Jeddah, Tabuk & Yanbu
45 pages
Pha-F Project Peshawar Residencia
No ratings yet
Pha-F Project Peshawar Residencia
60 pages
PA My Assessments - Ideagist Online Assessments
No ratings yet
PA My Assessments - Ideagist Online Assessments
3 pages
PEC Generative AI Training - Week 04 Day 01 Tahir Ali Bhutto
No ratings yet
PEC Generative AI Training - Week 04 Day 01 Tahir Ali Bhutto
4 pages
Education and Training Policy Handbook Ver1 2014-2015
No ratings yet
Education and Training Policy Handbook Ver1 2014-2015
40 pages
1-4 Answers
No ratings yet
1-4 Answers
2 pages
Class Substitution Form - Blank
No ratings yet
Class Substitution Form - Blank
3 pages
Bank Islami Branch Locations in Karachi
No ratings yet
Bank Islami Branch Locations in Karachi
11 pages
Ii-A-Class Time Table-Gs
No ratings yet
Ii-A-Class Time Table-Gs
1 page
Weather and Season Worksheet
100% (4)
Weather and Season Worksheet
3 pages
Project Feasibility Study For The Establishment of Footwear and Other Accessories
100% (1)
Project Feasibility Study For The Establishment of Footwear and Other Accessories
12 pages
Ch-1 Numbers Beyond 999
100% (2)
Ch-1 Numbers Beyond 999
20 pages
555 Timer Astable Oscillator Lab Guide
No ratings yet
555 Timer Astable Oscillator Lab Guide
3 pages
Whole Cracknell Theisis Inc Pub Mat
No ratings yet
Whole Cracknell Theisis Inc Pub Mat
301 pages
Algorithms for CS Students
100% (1)
Algorithms for CS Students
35 pages
Quant Roadmap (Ultimate Edition) 双语对照版
100% (1)
Quant Roadmap (Ultimate Edition) 双语对照版
148 pages
UNIT 4 (MCQS)
No ratings yet
UNIT 4 (MCQS)
13 pages
Digital Circuits Problem Solving
No ratings yet
Digital Circuits Problem Solving
9 pages
An Ensemble Features Aware Machine Learning Model For Detection and Staging of Dyslexia
No ratings yet
An Ensemble Features Aware Machine Learning Model For Detection and Staging of Dyslexia
10 pages
Class 10 Unit 3 Evaluating Models Answerkey of Book Exercise
No ratings yet
Class 10 Unit 3 Evaluating Models Answerkey of Book Exercise
7 pages
Linear Algebra For Image Processing
No ratings yet
Linear Algebra For Image Processing
31 pages
Face Recognition Seminar
No ratings yet
Face Recognition Seminar
23 pages
Icai23 Abstract Book
No ratings yet
Icai23 Abstract Book
66 pages
Forward and Backward Propagation Que
No ratings yet
Forward and Backward Propagation Que
5 pages
Efficient Image Colorization
No ratings yet
Efficient Image Colorization
8 pages
Assignment 01
No ratings yet
Assignment 01
3 pages
CS6018 - Image Processing Module 5 Notes
No ratings yet
CS6018 - Image Processing Module 5 Notes
15 pages
Python Daily Report
No ratings yet
Python Daily Report
5 pages
Solutions To All DIY Questions
No ratings yet
Solutions To All DIY Questions
15 pages
Exponential Functions Guide
No ratings yet
Exponential Functions Guide
10 pages
Anwar Butt on Artificial Neural Networks
No ratings yet
Anwar Butt on Artificial Neural Networks
39 pages
Adaptive Dynamic Programming With Applications in Optimal Control 1st Edition Derong Liu - Quickly Download The Ebook To Read Anytime, Anywhere
100% (2)
Adaptive Dynamic Programming With Applications in Optimal Control 1st Edition Derong Liu - Quickly Download The Ebook To Read Anytime, Anywhere
59 pages
Distributed Control Systems (DCS)
25% (4)
Distributed Control Systems (DCS)
2 pages
R23 DS Unit 1-1
100% (1)
R23 DS Unit 1-1
11 pages
Deep Learning for Seizure Prediction
No ratings yet
Deep Learning for Seizure Prediction
10 pages
EE6150 Tutorial 6 2022
No ratings yet
EE6150 Tutorial 6 2022
2 pages
DFA Minimization
No ratings yet
DFA Minimization
5 pages
Transportation and Assignment Models Guide
No ratings yet
Transportation and Assignment Models Guide
46 pages
Crypto++ FAQ Guide
No ratings yet
Crypto++ FAQ Guide
15 pages
Statistical Test Selection Guide
No ratings yet
Statistical Test Selection Guide
1 page
Romberg Rule of Integration: Major: All Engineering Majors Authors: Autar Kaw, Charlie Barker
No ratings yet
Romberg Rule of Integration: Major: All Engineering Majors Authors: Autar Kaw, Charlie Barker
27 pages
Case 15 Pacific Healthcare - B - Student - 6th Edition
No ratings yet
Case 15 Pacific Healthcare - B - Student - 6th Edition
1 page
Sem 4 PGCA 1976 Machine Learning and Data Analytics Using Python - Watermark
No ratings yet
Sem 4 PGCA 1976 Machine Learning and Data Analytics Using Python - Watermark
2 pages