How to Use Private Data in Generative AI: End-to-End
Solution for RAG with CrateDB and LangChain
Marija Selakovic, Developer Advocate, CrateDB
Christian Kurze, VP Product, CrateDB
What is Generative AI?
Generative AI is a set of artificial intelligence methodologies that produce novel
content resembling the training data they were exposed to.
That content can be anything from synthesized text and generated code to realistic
images, music, and more.
[Diagram: a foundational model with billions of parameters is trained on text, images, audio, and video; given a prompt or text instructions, it generates new text, images, audio, and video.]
Huge Potential, but also Challenges of Generative AI
• Quality & Reliability: hallucinations, accuracy, timely input data
• Ethical & Societal: deepfakes, misinformation, and bias in AI-generated content require policies and controls
• Computational Costs & Environmental Impact: high power required to run large generative AI models
• Intellectual Property & Copyright: generated content resembles human-created work
• Managing & Governing AI: frameworks to manage the development and deployment of generative AI technologies
The Importance of Current, Accurate, Private Data
• Current & Accurate: the most recent information must be available for meaningful answers
• Private data: internal, confidential, sensitive, subject to privacy regulations
• Utilizing it with LLMs:
• Improves accuracy (fewer hallucinations)
• Enhances personalization (better user experience)
• Yields richer data insights (documentation, support tickets, legal documents)
The AI Dilemma: Fine-tuning vs. RAG
Fine-tuning:
• Advantages:
• Updates knowledge with domain-specific data
• More cost-effective than full model retraining
• Challenges:
• Still a need for frequent data updates
• Static knowledge (overfitting risk)
• May still produce hallucinations
• Resource-intensive
Retrieval Augmented Generation (RAG)
Benefits of RAG
• Advantages:
• Access control: knowledge is not incorporated into the LLM itself
• Real-time data is available
• Reduced training needs
• Flexibility when integrating different data sources and formats
• Flexibility in choosing embedding algorithms and LLMs
• Challenges:
• Depends on the efficiency of the underlying search system
• LLMs can only consider a limited amount of context
• Hallucinations can be reduced, but may still happen
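The retrieve-then-generate flow behind these trade-offs can be sketched in a few lines of plain Python. This is a toy illustration, not a product API: the keyword-overlap retriever stands in for a real vector search, and the resulting prompt would be passed to an LLM of your choice.

```python
# Minimal RAG flow sketch: retrieve relevant context, then build an
# augmented prompt for the LLM. The document store and the scoring
# function are illustrative stand-ins for a real vector search.

def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Naive keyword-overlap retriever standing in for a vector search."""
    query_terms = set(query.lower().split())
    scored = sorted(
        documents,
        key=lambda doc: len(query_terms & set(doc.lower().split())),
        reverse=True,
    )
    return scored[:k]

def build_prompt(query: str, context: list[str]) -> str:
    """Augment the user question with the retrieved context."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {query}"

docs = [
    "CrateDB supports vector storage and similarity search via SQL.",
    "LangChain provides text splitters and pre-built chains.",
    "Fine-tuning updates model weights with domain data.",
]
query = "How does CrateDB support vector search?"
prompt = build_prompt(query, retrieve(query, docs))
```

Because the knowledge lives outside the model, swapping documents in or out (or enforcing access control on them) requires no retraining, which is exactly the advantage listed above.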
How are the semantics of language captured? Vectors!
Vectors / embeddings are numerical representations of data objects (words, phrases, entire
documents, images, audio, etc.) in a high-dimensional space. They enable systematic access
to unstructured data, for example via similarity search, and thus allow text to be processed
and understood in mathematical form.
[Diagram: text, images, audio, and video pass through an embedding model of your choice, producing vectors / embeddings such as [0.2, 0.3, 0.1, …], which are kept in a vector store.]
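Similarity between embeddings is typically measured with cosine similarity. The 3-dimensional vectors below are made up for illustration; real embeddings come from a model and have hundreds or thousands of dimensions.

```python
import math

# Cosine similarity over toy embedding vectors: the closer to 1.0,
# the more similar the underlying objects are assumed to be.

def cosine_similarity(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Made-up 3-dim embeddings for illustration only.
cat = [0.2, 0.3, 0.1]
kitten = [0.25, 0.28, 0.12]   # semantically close -> high similarity
invoice = [0.9, 0.05, 0.7]    # unrelated -> lower similarity

assert cosine_similarity(cat, kitten) > cosine_similarity(cat, invoice)
```

This ordering property is what a vector store exploits: a nearest-neighbor query over stored embeddings returns the objects most semantically similar to the query.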
Knowledge Assistants – Architecture
[Diagram: data sources (people directory, internal wiki, knowledge APIs) feed a landing and processing stage (parse, chunk, vectorize) that fills the context data stores: vector store, embeddings model, glossary, and classifiers. The chatbot backend serves a web frontend through a request handler, input guardrails, a query improver, intent recognition, data-source routing with access control, a retriever with context enrichment, response formation (context parser, prompt responses), output guardrails, and an output handler. It is backed by an LLM service gateway, configuration and operational stores (conversation history, feedback database), and monitoring/reporting (LLM usage, cost, and data logging reports).]
Source: Quantum Black: [Link]
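The ingestion path in this architecture (parse, chunk, vectorize) usually splits documents into overlapping chunks so that each embedding fits the model's context window. A sketch of a fixed-size character splitter follows; the sizes are arbitrary defaults for illustration, not recommendations.

```python
# Fixed-size text splitter with overlap, mimicking the "chunk" step of
# the ingestion pipeline. Overlap keeps sentences that straddle a chunk
# boundary partially present in both neighboring chunks.

def split_text(text: str, chunk_size: int = 100, overlap: int = 20) -> list[str]:
    """Split `text` into chunks of at most `chunk_size` characters,
    each sharing `overlap` characters with the previous chunk."""
    if chunk_size <= overlap:
        raise ValueError("chunk_size must exceed overlap")
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap
    return chunks

chunks = split_text("abcdefghij" * 25, chunk_size=100, overlap=20)
```

In a LangChain-based pipeline this role is played by a text splitter component; each resulting chunk is then vectorized and written to the vector store.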
Knowledge Assistants – Open Source
[Diagram: the same architecture as on the previous slide, annotated with "Potential for LangChain" and "Potential for CrateDB" to mark where the two open-source components could be applied.]
Why CrateDB and LangChain?
The challenge:
• Data comes in many formats: structured, semi-structured, unstructured; typical databases can only cope with one type of data and come with custom APIs
• 80% of data is unstructured (Gartner)
• Generative AI requires efficient data management, especially for contextualization
• Foundational models are only trained on public data
• (Too?) many alternatives regarding embedding models and LLMs

CrateDB: robust data management. A distributed, highly scalable database natively supporting tables, time-series, geospatial, full-text, and vector data, accessible via standard SQL.

LangChain: a comprehensive set of building blocks and swappable libraries to access models, vector stores, text splitters, output parsers, and pre-built chains; covering development, serving, and observability; available in Python and JavaScript.
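CrateDB's vector support is exposed through plain SQL. The sketch below builds the relevant statements as strings, assuming CrateDB 5.5 or later (which added the FLOAT_VECTOR type and the KNN_MATCH function); the table name, the 1536-dimension figure, and the toy query vector are illustrative choices, and in practice you would execute these with a client library and parametrized queries rather than string interpolation.

```python
# Sketch of CrateDB's SQL-native vector support (assumes CrateDB >= 5.5).
# Statements are built as strings for illustration; execute them with a
# client of your choice, using query parameters instead of interpolation.

DDL = """
CREATE TABLE IF NOT EXISTS knowledge_chunks (
    id TEXT PRIMARY KEY,
    content TEXT,
    embedding FLOAT_VECTOR(1536)  -- match your embedding model's dimension
)
"""

def knn_query(query_vector: list[float], k: int = 3) -> str:
    """Build a similarity-search query for the top-k nearest chunks."""
    vec = "[" + ", ".join(str(v) for v in query_vector) + "]"
    return (
        "SELECT id, content, _score "
        "FROM knowledge_chunks "
        f"WHERE KNN_MATCH(embedding, {vec}, {k}) "
        "ORDER BY _score DESC"
    )

sql = knn_query([0.2, 0.3, 0.1], k=3)
```

Because retrieval is just SQL, the same table can hold structured metadata next to the embeddings, and a LangChain retriever can sit on top of this query to feed context into a chain.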
Demo: Chat for Support Knowledge Base
CrateDB + LangChain for RAG
[Link]
Demo
Get Started Today!
marija@[Link]
[Link]@[Link]
LangChain: [Link]
LangChain Docs: [Link]
CrateDB Cloud: [Link]
CrateDB Community: [Link]
CrateDB Docs: [Link]