0% found this document useful (0 votes)

138 views14 pages

Github - Blog - Ai and ML - Generative Ai - What Is Retrieval Augmented Generation and What Does It Do For Generative Ai

Uploaded by

uma5b3

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

138 views14 pages

Github - Blog - Ai and ML - Generative Ai - What Is Retrieval Augmented Generation and What Does It Do For Generative Ai

Uploaded by

uma5b3

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

/ Blog Changelog Docs Customer stories Try GitHub Copilot Contact sales

AI & ML Developer skills Engineering Enterprise software News & insights Open Source Security

Home / AI & ML / Generative AI

What is retrieval-augmented generation,

and what does it do for generative AI?
Here’s how retrieval-augmented generation, or RAG, uses a variety of data sources to keep AI models fresh with up-to-date
information and organizational knowledge.

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
Nicole Choi · @nicchoi29
Share:
April 4, 2024 | 9 minutes

One of the hottest topics in AI right now is RAG, or retrieval-augmented AI Insights generative AI

generation, which is a retrieval method used by some AI tools to improve the GitHub Copilot Enterprise
quality and relevance of their outputs.

Organizations want AI tools that use RAG because it makes those tools aware
More on AI Insights
of proprietary data without the effort and expense of custom model training.
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
RAG also keeps models up to date. When generating an answer without RAG,
models can only draw upon data that existed when they were trained. With RAG, Survey: The AI wave
continues to grow on
on the other hand, models can leverage a private database of newer information
software development
for more informed responses. teams
We surveyed 2,000 people on
We talked to GitHub Next’s Senior Director of Research, Idan Gazit, and
software development teams at
Software Engineer, Colin Merkel, to learn more about RAG and how it’s used in enterprises in the U.S., Brazil,
generative AI tools. India, and Germany about the
use, experience, and
expectations around generative
Why everyone’s talking about RAG AI tools in software development.
Kyle Daigle & GitHub Staff
One of the reasons you should always verify outputs from a generative AI tool is
because its training data has a knowledge cut-off date. While models are able
to produce outputs that are tailored to a request, they can only reference Unlocking the power of
information that existed at the time of their training. But with RAG, an AI tool can unstructured data with
use data sources beyond its model’s training data to generate an output. RAG
Unstructured data holds valuable
The difference between RAG and fine-tuning information about codebases,
organizational best practices,
Most organizations currently don’t train their own AI models. Instead, they
and customer feedback. Here are
customize pre-trained models to their specific needs, often using RAG or fine- some ways you can leverage it
tuning. Here’s a quick breakdown of how these two strategies differ. with RAG, or retrieval-augmented
generation.
Fine-tuning requires adjusting a model’s weights, which results in a highly Nicole Choi
customized model that excels at a specific task. It’s a good option for
organizations that rely on codebases written in a specialized language,
especially if the language isn’t well-represented in the model’s original training
data.

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
RAG, on the other hand, doesn’t require weight adjustment. Instead, it retrieves
and gathers information from a variety of data sources to augment a prompt,
which results in an AI model generating a more contextually relevant response
for the end user.

Some organizations start with RAG and then fine-tune their models to
accomplish a more specific task. Other organizations find that RAG is a
sufficient method for AI customization alone.

How AI models use context

In order for an AI tool to generate helpful responses, it needs the right context.
This is the same dilemma we face as humans when making a decision or solving
a problem. It’s hard to do when you don’t have the right information to act on.

So, let’s talk more about context in the context ( 😉) of generative AI:
Today’s generative AI applications are powered by large language models
(LLMs) that are structured as transformers, and all transformer LLMs have a
context window— the amount of data that they can accept in a single
prompt. Though context windows are limited in size, they can and will
continue to grow larger as more powerful models are released.

Input data will vary depending on the AI tool’s capabilities. For instance,
when it comes to GitHub Copilot in the IDE, input data comprises all of the
code in the file that you’re currently working on. This is made possible
because of our Fill-in-the-Middle (FIM) paradigm, which makes GitHub
Copilot aware of both the code before your cursor (the prefix) and after your
cursor (the suffix).

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
GitHub Copilot also processes code from your other open tabs (a process we
call neighboring tabs) to potentially find and add relevant information to the
prompt. When there are a lot of open tabs, GitHub Copilot will scan the most
recently reviewed ones.

Because of the context window’s limited size, the challenge of ML engineers

is to figure out what input data to add to the prompt and in what order to
generate the most relevant suggestion from the AI model. This task is known
as prompt engineering.

How RAG enhances an AI model’s contextual

understanding
With RAG, an LLM can go beyond training data and retrieve information from a
variety of data sources, including customized ones.

When it comes to GitHub Copilot Chat within GitHub.com and in the IDE, input
data can include your conversation with the chat assistant, whether it’s code or
natural language, through a process called in-context learning. It can also
include data from indexed repositories (public or private), a collection of
Markdown documentation across repositories (that we refer to as knowledge
bases), and results from integrated search engines. From these other sources,
RAG will retrieve additional data to augment the initial prompt. As a result, it can
generate a more relevant response.

The type of input data used by GitHub Copilot will depend on which GitHub
Copilot plan you’re using.

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
RAG and semantic search
Unlike keyword search or Boolean search operators, an ML-powered semantic
search system uses its training data to understand the relationship between
your keywords. So, rather than view, for example, “cats” and “kittens” as
independent terms as you would in a keyword search, a semantic search system
can understand, from its training, that those words are often associated with
cute videos of the animal. Because of this, a search for just “cats and kittens”
might rank a cute animal video as a top search result.

How does semantic search improve the quality of RAG retrievals? When using
a customized database or search engine as a RAG data source, semantic
search can improve the context added to the prompt and overall relevance of
the AI-generated output.

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
The semantic search process is at the heart of retrieval. “It surfaces great
examples that often elicit great results,” Gazit says.

0:00

Developers can use Copilot Chat on GitHub.com to ask questions and receive
answers about a codebase in natural language, or surface relevant
documentation and existing solutions.

RAG data sources: Where RAG uses semantic search

You’ve probably read dozens of articles (including some of our own) that talk
about RAG, vector databases, and embeddings. And even if you haven’t, here’s
something you should know: RAG doesn’t require embeddings or vector
databases.

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
A RAG system can use semantic search to retrieve relevant documents, whether
from an embedding-based retrieval system, traditional database, or search
engine. The snippets from those documents are then formatted into the model’s
prompt. We’ll provide a quick recap of vector databases and then, using GitHub
Copilot Enterprise as an example, cover how RAG retrieves data from a variety
of sources.

Vector databases
Vector databases are optimized for storing embeddings of your repository code
and documentation. They allow us to use novel search parameters to find
matches between similar vectors.

To retrieve data from a vector database, code and documentation are converted
into embeddings, a type of high-dimensional vector, to make them searchable
by a RAG system.

Here’s how RAG retrieves data from vector databases: while you code in your
IDE, algorithms create embeddings for your code snippets, which are stored in a
vector database. Then, an AI coding tool can search that database by
embedding similarity to find snippets from across your codebase that are
related to the code you’re currently writing and generate a coding suggestion.
Those snippets are often highly relevant context, enabling an AI coding assistant
to generate a more contextually relevant coding suggestion. GitHub Copilot
Chat uses embedding similarity in the IDE and on GitHub.com, so it finds code
and documentation snippets related to your query.

Embedding similarity is incredibly powerful because it identifies code that has

subtle relationships to the code you’re editing.

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
“Embedding similarity might surface code that uses the same APIs, or code that
performs a similar task to yours but that lives in another part of the codebase,”
Gazit explains. “When those examples are added to a prompt, the model’s
primed to produce responses that mimic the idioms and techniques that are
native to your codebase—even though the model was not trained on your code.”

General text search and search engines

With a general text search, any documents that you want to be accessible to the
AI model are indexed ahead of time and stored for later retrieval. For instance,
RAG in GitHub Copilot Enterprise can retrieve data from files in an indexed
repository and Markdown files across repositories.

Learn more about GitHub Copilot Enterprise features

RAG can also retrieve information from external and internal search engines.
When integrated with an external search engine, RAG can search and retrieve
information from the entire internet. When integrated with an internal search
engine, it can also access information from within your organization, like an
internal website or platform. Integrating both kinds of search engines
supercharges RAG’s ability to provide relevant responses.

For instance, GitHub Copilot Enterprise integrates both Bing, an external search
engine, and an internal search engine built by GitHub into Copilot Chat on
GitHub.com. Bing integration allows GitHub Copilot Chat to conduct a web
search and retrieve up-to-date information, like about the latest Java release.
But without a search engine searching internally, ”Copilot Chat on GitHub.com
cannot answer questions about your private codebase unless you provide a

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
specific code reference yourself,” explains Merkel, who helped to build GitHub’s
internal search engine from scratch.

Here’s how this works in practice. When a developer asks a question about a
repository to GitHub Copilot Chat in GitHub.com, RAG in Copilot Enterprise
uses the internal search engine to find relevant code or text from indexed files to
answer that question. To do this, the internal search engine conducts a
semantic search by analyzing the content of documents from the indexed
repository, and then ranking those documents based on relevance. GitHub
Copilot Chat then uses RAG, which also conducts a semantic search, to find
and retrieve the most relevant snippets from the top-ranked documents. Those
snippets are added to the prompt so GitHub Copilot Chat can generate a
relevant response for the developer.

Key takeaways about RAG

RAG offers an effective way to customize AI models, helping to ensure outputs

are up to date with organizational knowledge and best practices, and the latest
information on the internet.

GitHub Copilot uses a variety of methods to improve the quality of input data
and contextualize an initial prompt, and that ability is enhanced with RAG.
What’s more, the RAG retrieval method in GitHub Copilot Enterprise goes
beyond vector databases and includes data sources like general text search and
search engine integrations, which provides even more cost-efficient retrievals.

Context is everything when it comes to getting the most out of an AI tool. To

improve the relevance and quality of a generative AI output, you need to improve

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
the relevance and quality of the input.

As Gazit says, “Quality in, quality out.”

Looking to bring the power of GitHub Copilot Enterprise to your

organization? Learn more about GitHub Copilot Enterprise or
get started now.

Tags: AI Insights generative AI GitHub Copilot Enterprise

Written by

Nicole Choi
@nicchoi29

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
Related posts

AI & ML AI & ML AI & ML

5 tips and tricks when using GitHub How students teamed up to decode What are AI agents and why do they
Copilot Workspace 2,000-year-old texts using AI matter?
GitHub Next launched the technical preview Students used GitHub Copilot to decode Learn how AI agents and agentic AI systems
for GitHub Copilot Workspace in April 2024. ancient texts buried in Mount Vesuvius, use generative AI models and large language
Since then, we’ve been listening to the achieving a groundbreaking historical models to autonomously perform tasks on
community, learning, and have some tips to breakthrough. This is their journey, the behalf of end users.
share on how to get the most out of it! technology behind it, and the power of Aaron Winston
Chris Reddington & Cole Bemis collaboration.
Juan Pablo Flores Cortés

Explore more from GitHub

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
Docs The ReadME Project GitHub Copilot Work at GitHub!
Everything you need to Stories and voices from the Don’t fly solo. Try 30 days Check out our current job
master GitHub, all in one developer community. for free. openings.
place.

Go to Docs Learn more Learn more

We do newsletters, too Your email address Subscribe

Discover tips, technical guides, and best practices in
our biweekly newsletter just for devs. Yes please, I’d like GitHub and affiliates to use my information for personalized
communications, targeted advertising and campaign effectiveness. See the GitHub
Privacy Statement for more details.

PDFmyURL converts web pages and even full websites to PDF easily and quickly.
Product Platform Support Company

Features Developer API Docs About

Security Partners Community Forum Blog

Enterprise Atom Training Careers

Customer Stories Electron Status Press

Pricing GitHub Desktop Contact Shop

Resources

PDFmyURL converts web pages and even full websites to PDF easily and quickly.

WWW Cohesity Com Glossary Retrieval-Augmented-Generation-Rag
No ratings yet
WWW Cohesity Com Glossary Retrieval-Augmented-Generation-Rag
5 pages
Minor Proj
No ratings yet
Minor Proj
15 pages
RAG Architectures on AWS for AI Solutions
No ratings yet
RAG Architectures on AWS for AI Solutions
34 pages
Advanced Gen-AI Development
No ratings yet
Advanced Gen-AI Development
57 pages
RAG vs GPT: A Comprehensive Guide
No ratings yet
RAG vs GPT: A Comprehensive Guide
8 pages
Building Blocks of Rag Ebook Final
100% (2)
Building Blocks of Rag Ebook Final
9 pages
The Complete Guide To RAG
No ratings yet
The Complete Guide To RAG
27 pages
WWW Oracle Com in Artificial-Intelligence Generative-Ai Retrieval-Augmented-Generation-Rag
No ratings yet
WWW Oracle Com in Artificial-Intelligence Generative-Ai Retrieval-Augmented-Generation-Rag
7 pages
Developing Retrieval Augmented Generation (RAG) Based LLM Systems From Pdfs - An Expert Report
No ratings yet
Developing Retrieval Augmented Generation (RAG) Based LLM Systems From Pdfs - An Expert Report
36 pages
Natural Language Processing
No ratings yet
Natural Language Processing
11 pages
Module 4 - RAG (Retrieval Augmented Generation) - PEC GenAI Course
No ratings yet
Module 4 - RAG (Retrieval Augmented Generation) - PEC GenAI Course
23 pages
RAG - The Future of LLMs - LinkedIn
No ratings yet
RAG - The Future of LLMs - LinkedIn
7 pages
1 - Build A Complete OpenSource LLM RAG QA Chatbot - An In-Depth Journey (Introduction) - by Marco Bertelli - Level Up Coding
No ratings yet
1 - Build A Complete OpenSource LLM RAG QA Chatbot - An In-Depth Journey (Introduction) - by Marco Bertelli - Level Up Coding
12 pages
GenAI RAG: A Comprehensive Overview
No ratings yet
GenAI RAG: A Comprehensive Overview
12 pages
RAG for NLP Experts
No ratings yet
RAG for NLP Experts
2 pages
RAG App Guide for Beginners
100% (1)
RAG App Guide for Beginners
16 pages
How To Build AI Driven Knowledge Assistants
100% (1)
How To Build AI Driven Knowledge Assistants
24 pages
What Is Retrieval-Augmented Generation (RAG)
No ratings yet
What Is Retrieval-Augmented Generation (RAG)
12 pages
Challenge
No ratings yet
Challenge
8 pages
What Is Retrieval-Augmented Generation, Aka RAG?: Rick Merritt
No ratings yet
What Is Retrieval-Augmented Generation, Aka RAG?: Rick Merritt
9 pages
The Practical Applications of Retrieval Augmented
No ratings yet
The Practical Applications of Retrieval Augmented
8 pages
GenAI PDF
No ratings yet
GenAI PDF
34 pages
Advanced RAG Architecture. What Is RAG - Advanced Topics & - by Uğur Özker - Medium
No ratings yet
Advanced RAG Architecture. What Is RAG - Advanced Topics & - by Uğur Özker - Medium
21 pages
Code RAG 101 1737004806
No ratings yet
Code RAG 101 1737004806
8 pages
New Ebook The Ultimate Guide To RAG For AI 1730277314
No ratings yet
New Ebook The Ultimate Guide To RAG For AI 1730277314
15 pages
Implementing RAG in Business Strategy
No ratings yet
Implementing RAG in Business Strategy
19 pages
Retrieval Augmented Generation - A Simple Introduction
No ratings yet
Retrieval Augmented Generation - A Simple Introduction
82 pages
RAG Architecture
100% (11)
RAG Architecture
52 pages
AI-Powered Enterprise Transformation
No ratings yet
AI-Powered Enterprise Transformation
15 pages
CrateDB and LangChain
No ratings yet
CrateDB and LangChain
14 pages
Document 2
No ratings yet
Document 2
12 pages
RAG (Generative AI) - A "Rags To Riches" Moment For Artificial Intelligence - by Kanishk Khatter - Medium
No ratings yet
RAG (Generative AI) - A "Rags To Riches" Moment For Artificial Intelligence - by Kanishk Khatter - Medium
12 pages
DOM Graph RAG: Advanced AI Architecture
No ratings yet
DOM Graph RAG: Advanced AI Architecture
30 pages
Introduction To Generative AI
No ratings yet
Introduction To Generative AI
5 pages
Retrieval-Augmented Generation (RAG) : Michael Klesel H. Felix Wittmann
No ratings yet
Retrieval-Augmented Generation (RAG) : Michael Klesel H. Felix Wittmann
11 pages
Generative AI Curriculum
No ratings yet
Generative AI Curriculum
2 pages
Tyjt
No ratings yet
Tyjt
2 pages
Transcript For Explaining Retrieval-Augmented Generation (RAG) To Colleagues
No ratings yet
Transcript For Explaining Retrieval-Augmented Generation (RAG) To Colleagues
6 pages
Advanced RAG Techniques for LLM Apps
No ratings yet
Advanced RAG Techniques for LLM Apps
54 pages
Rag
No ratings yet
Rag
10 pages
(Retrieval Augmented Generation) : by Uttam Grade
No ratings yet
(Retrieval Augmented Generation) : by Uttam Grade
6 pages
Agentic RAG: Survey on AI Advancements
No ratings yet
Agentic RAG: Survey on AI Advancements
39 pages
RAG Basics for AI Developers
No ratings yet
RAG Basics for AI Developers
2 pages
RAG - A Simple Introduction
100% (6)
RAG - A Simple Introduction
75 pages
Survey on Retrieval-Augmented Generation for AI Content
No ratings yet
Survey on Retrieval-Augmented Generation for AI Content
22 pages
Annisa Reiny HF - UKSW - Summary 2
No ratings yet
Annisa Reiny HF - UKSW - Summary 2
3 pages
Retrieval-Augmented Generation For AI-Generated Content A Survey
No ratings yet
Retrieval-Augmented Generation For AI-Generated Content A Survey
28 pages
Retrieval Augmented Generation (RAG) For Everyone
No ratings yet
Retrieval Augmented Generation (RAG) For Everyone
57 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
20 pages
Rag PDF
No ratings yet
Rag PDF
10 pages
NVIDIA RAG Whitepaper
No ratings yet
NVIDIA RAG Whitepaper
7 pages
Building A Generative AI Platform
No ratings yet
Building A Generative AI Platform
26 pages
Model Training and Fine Tuning
No ratings yet
Model Training and Fine Tuning
11 pages
Architecting Scalable AI RAG Systems
No ratings yet
Architecting Scalable AI RAG Systems
32 pages
RAG Retrieval-Augmented Generation
No ratings yet
RAG Retrieval-Augmented Generation
12 pages
Understanding Retrieval-Augmented Generation (RAG)
No ratings yet
Understanding Retrieval-Augmented Generation (RAG)
12 pages
WWW Analyticsvidhya Com Blog 2023 09 Retrieval-Augmented-Generation-Rag-In-Ai
No ratings yet
WWW Analyticsvidhya Com Blog 2023 09 Retrieval-Augmented-Generation-Rag-In-Ai
20 pages
A Taxonomy of Retrieval Augmented Generation
100% (5)
A Taxonomy of Retrieval Augmented Generation
56 pages
IAI Sp2025 Session 15 - Improving LLMs
No ratings yet
IAI Sp2025 Session 15 - Improving LLMs
34 pages
42 Amazing AI SEO Tools in 2024 - To Level-Up Your Content Writing Game
No ratings yet
42 Amazing AI SEO Tools in 2024 - To Level-Up Your Content Writing Game
36 pages
The Use of XML in A Video Digital Librar
No ratings yet
The Use of XML in A Video Digital Librar
320 pages
Lab 02.1 - ELSFOO
No ratings yet
Lab 02.1 - ELSFOO
9 pages
Holistic Reputation Management
No ratings yet
Holistic Reputation Management
141 pages
Automatic Answer Checker Using Machine Learning
No ratings yet
Automatic Answer Checker Using Machine Learning
7 pages
Meta Tags
No ratings yet
Meta Tags
4 pages
Web and Social Media Analytics Lab
No ratings yet
Web and Social Media Analytics Lab
34 pages
Evaluating Online Information Sources
No ratings yet
Evaluating Online Information Sources
19 pages
DOJ v. Google
No ratings yet
DOJ v. Google
286 pages
MCQ - Digital Marketing
88% (8)
MCQ - Digital Marketing
109 pages
Inspec Ei Village Version 1
No ratings yet
Inspec Ei Village Version 1
35 pages
Minteer 2022 Tips For Improving The Visibility of Research Publications
No ratings yet
Minteer 2022 Tips For Improving The Visibility of Research Publications
2 pages
CA Notes Unit - IV
No ratings yet
CA Notes Unit - IV
8 pages
224 Assignment
No ratings yet
224 Assignment
14 pages
SOP 002 Perform An On-Page SEO Audit On A Page
No ratings yet
SOP 002 Perform An On-Page SEO Audit On A Page
10 pages
Seo and Its Types: E-Commerce Notes SEO (Search Engine Optimization) By: Hafiza Maham Sattar
No ratings yet
Seo and Its Types: E-Commerce Notes SEO (Search Engine Optimization) By: Hafiza Maham Sattar
2 pages
Memoria
No ratings yet
Memoria
6 pages
Internet Connectivity and Web Browsers
No ratings yet
Internet Connectivity and Web Browsers
8 pages
Module 4 Techniques in Big Data Analytics
No ratings yet
Module 4 Techniques in Big Data Analytics
46 pages
Better Experience - Complete AéPiot User Guide - Master RSS Management, Backlink Creation & SEO Optimization
No ratings yet
Better Experience - Complete AéPiot User Guide - Master RSS Management, Backlink Creation & SEO Optimization
23 pages
Internet Setup & Web Services Guide
No ratings yet
Internet Setup & Web Services Guide
3 pages
AI and The Evolution of Search
No ratings yet
AI and The Evolution of Search
43 pages
AEO Vs
No ratings yet
AEO Vs
10 pages
Mastering Google Search Operators
No ratings yet
Mastering Google Search Operators
41 pages
Algorithms of Oppression - How Search Engines Reinforce Racism
75% (8)
Algorithms of Oppression - How Search Engines Reinforce Racism
231 pages
Mobile Website
No ratings yet
Mobile Website
161 pages
Internet and Email
0% (1)
Internet and Email
9 pages
Ai Tools!
No ratings yet
Ai Tools!
48 pages
Publishers Sue Google for Monopoly
No ratings yet
Publishers Sue Google for Monopoly
102 pages
English 7 SLR-Q2-B1
No ratings yet
English 7 SLR-Q2-B1
32 pages