Python + AI
🧠 3/11: LLMs
↖️ 3/13: Vector embeddings
🔍 3/18: RAG
3/20: Vision models
3/25: Structured outputs
3/27: Quality & Safety
Register @ aka.ms/PythonAI/series
Python + AI
↖️ Vector embeddings
Pamela Fox
Python Cloud Advocate
www.pamelafox.org
Today we'll cover...
• What are vector embeddings?
• Vector similarity space
• Vector search
• Vector distance metrics
• Vector quantization
• Dimension reduction
Vector embeddings 101
Want to follow along?
1. Open this GitHub repository:
https://github.com/pamelafox/vector-embeddings-demos
2. Use the "Code" button to create a GitHub Codespace
3. Wait a few minutes for the Codespace to start up
Vector embeddings
An embedding encodes an input as a list of floating-point numbers.
"dog" → [0.017198, -0.007493, -0.057982,…]
Different embedding models output different embeddings, with varying lengths.

Embedding model               | Encodes                  | Vector length | MTEB Avg.
word2vec                      | words                    | 300           | -
SBERT (Sentence-Transformers) | text (up to ~400 words)  | 768           | -
OpenAI text-embedding-ada-002 | text (up to 8191 tokens) | 1536          | 61.0%
OpenAI text-embedding-3-small | text (up to 8191 tokens) | 256 - 1536    | 62.3%
OpenAI text-embedding-3-large | text (up to 8191 tokens) | 256 - 3072    | 64.6%

MTEB: https://huggingface.co/spaces/mteb/leaderboard
Generating an embedding with the OpenAI SDK

Use the OpenAI SDK with OpenAI.com, Azure, Ollama, or GitHub Models:
import os
import openai

openai_client = openai.OpenAI(
    base_url="https://models.inference.ai.azure.com",
    api_key=os.environ["GITHUB_TOKEN"]
)

Generate embeddings for single or multiple inputs:

embeddings_response = openai_client.embeddings.create(
    model="text-embedding-3-small",
    dimensions=1536,
    input="hello world"
)
print(embeddings_response.data[0].embedding)
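To embed multiple inputs in one call, pass a list of strings (a minimal sketch, mirroring the single-input call above):

# One API call, one embedding per input, returned in the same order
embeddings_response = openai_client.embeddings.create(
    model="text-embedding-3-small",
    dimensions=1536,
    input=["hello world", "goodbye world"]
)
for item in embeddings_response.data:
    print(item.embedding[:3])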
Notebook: generate_embedding.ipynb
Vector embeddings vary across models

Embeddings of "queen" from three models:

word2vec-google-news-300 (300 dimensions):
[0.0052490234375, -0.1435546875, -0.0693359375, ...]

text-embedding-ada-002 (1536 dimensions):
[-0.00449855113402009, -0.006737332791090012, -0.002418933203443885, ...]

text-embedding-3-small (1536 dimensions):
[0.04379640519618988, -0.03982372209429741, -0.044741131365299225, ...]
Notebook: comparison.ipynb
Vector similarity
We compute embeddings so that we can calculate similarity between inputs.
The most common distance measurement is cosine similarity.
def cosine_similarity(v1, v2):
    dot_product = sum(
        [a * b for a, b in zip(v1, v2)])
    magnitude = (
        sum([a**2 for a in v1]) *
        sum([a**2 for a in v2])) ** 0.5
    return dot_product / magnitude
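A quick check with two short vectors (hypothetical values, just to exercise the function):

v1 = [0.1, 0.2, 0.3]
v2 = [0.12, 0.21, 0.29]
print(cosine_similarity(v1, v2))  # ~0.998: nearly identical directions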
Notebook: similarity.ipynb
Similarity space varies across models

Cosine similarity to "dog":

text-embedding-ada-002      text-embedding-3-small (1536)
word     cosine             word     cosine
dog      1.0000             dog      1.0000
animal   0.8855             animal   0.6619
god      0.8660             cat      0.6502
cat      0.8635             car      0.6185
fish     0.8566             horse    0.5927
bird     0.8555             boat     0.5737
Similarity values range across models
Cosine similarity of "dog" to 1000 other words across two models.
(Chart: distribution of similarity values for text-embedding-ada-002 vs. text-embedding-3-small (1536).)
Business uses for vector similarity
Recommendation system:
https://learn.microsoft.com/azure/postgresql/flexible-server/generative-ai-recommendation-system
Fraud detection:
https://www.redpanda.com/blog/fraud-detection-pipeline-redpanda-pinecone
Vector search
Vector search
1. Compute the embedding vector for the query
2. Find the K closest vectors for the query vector
   • Search exhaustively or using approximations

Query → compute embedding vector (e.g. OpenAI create embedding) → Query vector → search existing vectors → K closest vectors

"tortoise" → [-0.003335318, -0.0176891904, …] → search → [["snake", [-0.122, ..]], ["frog", [-0.045, ..]]]
Exhaustive vector search in Python
An exhaustive search checks every single vector for the closest one.

def exhaustive_search(query_vector, vectors):
    similarities = []
    for title, vector in vectors.items():
        similarity = cosine_similarity(query_vector, vector)
        similarities.append((title, similarity))
    similarities.sort(key=lambda x: x[1], reverse=True)
    return similarities
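For example (hypothetical toy vectors, just to show the expected input shape of a dict mapping titles to vectors):

vectors = {
    "snake": [0.9, 0.1, 0.0],
    "frog": [0.8, 0.2, 0.1],
    "car": [0.0, 0.1, 0.9],
}
query_vector = [0.85, 0.15, 0.05]  # e.g. the embedding of "tortoise"
print(exhaustive_search(query_vector, vectors))  # most similar titles first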
Notebook: search.ipynb
ANN (Approximate Nearest Neighbor) search

There are multiple ANN search algorithms that can speed up search time.

Algorithm | Python package | Example database support
HNSW      | hnswlib        | PostgreSQL pgvector extension, Azure AI Search, Chromadb, Weaviate
DiskANN   | diskannpy      | Cosmos DB
IVFFlat   | faiss          | PostgreSQL pgvector extension
Faiss     | faiss          | None, in-memory index only
HNSW: Hierarchical Navigable Small Worlds

The HNSW algorithm is great for situations where your index may be frequently updated, and it scales logarithmically even with large indexes.

import hnswlib

p = hnswlib.Index(space='cosine', dim=1536)
p.init_index(
    max_elements=len(movies),
    ef_construction=200,
    M=16)
vectors = list(movies.values())
ids = list(range(len(vectors)))
p.add_items(vectors, ids)
p.set_ef(50)
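Once the index is built, querying it might look like this (a sketch; query_vector is assumed to be a 1536-dimension embedding generated as shown earlier):

# Find the 5 nearest neighbors of the query embedding
labels, distances = p.knn_query(query_vector, k=5)
# labels are the ids passed to add_items; map them back to movie titles
titles = list(movies.keys())
print([(titles[i], d) for i, d in zip(labels[0], distances[0])])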
From HNSW research paper:
https://github.com/nmslib/hnswlib
Business use: Retrieval Augmented Generation

Vector search can greatly improve the retrieval step in RAG.

Azure OpenAI + Azure AI Search + Azure AI Vision + Azure App Service

Code: aka.ms/ragchat
Demo: aka.ms/ragchat/demo
Join upcoming stream on RAG on 3/18! aka.ms/PythonAI/series
Vector distance metrics
Common distance metrics
Four common distance metrics between two vectors are:
1. Euclidean distance
2. Manhattan distance
3. Inner product
4. Cosine distance
The metric that we pick may depend on whether the vectors are unit vectors.
Notebook: distance_metrics.ipynb
Unit vectors
A unit vector is a vector with a magnitude of 1.
def magnitude(vector):
    return sum([a**2 for a in vector]) ** 0.5

Two vectors, each with a magnitude of 3.7416573867739413:
[1, 2, 3]
[3, 1, 2]

After normalization, two vectors, each with a magnitude of 1:
[0.26726124 0.53452248 0.80178373]
[0.80178373 0.26726124 0.53452248]
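The normalization step is a one-liner (a sketch using numpy; a normalize helper is not on the original slide):

import numpy as np

def normalize(vector):
    # Divide by the magnitude so the result has magnitude 1
    return np.array(vector) / magnitude(vector)

print(normalize([1, 2, 3]))  # [0.26726124 0.53452248 0.80178373]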
Euclidean distance
The straight-line distance between two points in Euclidean space.
def euclidean(v1, v2):
    return magnitude(v1 - v2)

euclidean(
    np.array([0.26726124, 0.53452248, 0.80178373]),
    np.array([0.80178373, 0.26726124, 0.53452248])
)
0.655
Manhattan distance
The "taxicab" distance between two points in Euclidean space.
def manhattan(v1, v2):
    return sum(abs(a - b)
               for a, b in zip(v1, v2))

manhattan(
    [0.26726124, 0.53452248, 0.80178373],
    [0.80178373, 0.26726124, 0.53452248]
)
1.07
Dot product
The sum of products of corresponding vector elements: x1*y1 + x2*y2 + x3*y3

def dot_product(v1, v2):
    return sum(a * b
               for a, b in zip(v1, v2))

dot_product(
    [0.26726124, 0.53452248, 0.80178373],
    [0.80178373, 0.26726124, 0.53452248]
)
0.786
Cosine distance
The complement of the cosine of the angle between two vectors in Euclidean space.

def cosine_similarity(v1, v2):
    return dot_product(v1, v2) / (magnitude(v1) * magnitude(v2))

def cosine_distance(v1, v2):
    return 1 - cosine_similarity(v1, v2)

cosine_distance(
    [0.26726124, 0.53452248, 0.80178373],
    [0.80178373, 0.26726124, 0.53452248]
)
0.21
Cosine similarity vs. Dot product
For unit vectors, the cosine similarity is the same as the dot
product.
>>> cosine_similarity(v1, v2) == dot_product(v1, v2)
True
>>> 1 - cosine_distance(v1, v2) == dot_product(v1, v2)
True
In some vector databases, the dot product operator will be slightly faster
than cosine distance operators, since it does not need to calculate the
magnitude.
If your embeddings are unit vectors, consider using dot product as the metric.
OpenAI embedding models currently all output unit vectors!
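A quick way to sanity-check that in the notebook (a sketch, reusing the magnitude() helper and the embeddings_response from the earlier slide):

embedding = embeddings_response.data[0].embedding
print(magnitude(embedding))  # ~1.0 for OpenAI embedding models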
Vector quantization
Vector quantization
Most vector embeddings are stored as floating point numbers (64-bit in
Python). We can use quantization to reduce the size of the embeddings.
Scalar quantization: Reduce each number to an integer
[0.03265173360705376, 0.01370371412485838, -0.017748944461345673, ...] → [53, 40, 20, ...]

Binary quantization: Reduce each number to a single bit

[0.03265173360705376, 0.01370371412485838, -0.017748944461345673, ...] → [1, 1, 0, ...]
Notebook: quantization.ipynb
Scalar quantization: The process

float32                                             →  int8
[0.03265173360705376, 0.01370371412485838, ...]     →  [53, 40, ...]
[-0.00786194484680891, -0.018985141068696976, ...]  →  [27, 19, ...]
[-0.0039056178648024797, 0.019039113074541092, ...] →  [29, 44, ...]

1. Calculate the min/max of all the embeddings
2. Normalize each embedding's values to the [0, 1] range
3. Map the normalized values into integer buckets from -128 to +127 (the observed min maps to roughly -128, the observed max to roughly +127)
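A minimal numpy sketch of those three steps (my own illustration, not the notebook's exact code; embeddings is assumed to be a list of float vectors):

import numpy as np

def scalar_quantize(embeddings):
    embeddings = np.array(embeddings)
    # 1. Min/max across all values in all embeddings
    min_val, max_val = embeddings.min(), embeddings.max()
    # 2. Normalize every value to the [0, 1] range
    normalized = (embeddings - min_val) / (max_val - min_val)
    # 3. Map normalized values into int8 buckets from -128 to +127
    return np.round(normalized * 255 - 128).astype(np.int8)

print(scalar_quantize([[0.032, 0.013, -0.017], [-0.007, -0.018, 0.019]]))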
Scalar quantization: Before & after

"Moana"
float32: [0.03265173360705376, 0.01370371412485838, -0.017748944461345673, ...]
→ quantization →
int8: [53, 40, 20, ...]
Scalar quantization: Effects on similarity

float32                                             →  int8
[0.03265173360705, 0.013703...]                     →  [53, 40, ...]
[-0.00786194484680891, -0.0189...]                  →  [27, 19, ...]
[-0.0039056178648024797, 0.0190...]                 →  [29, 44, ...]

float32                                 int8
movie                      similarity   movie                        similarity
Moana                      1.000000     Moana                        1.000000
Mulan                      0.546800     ✅ Mulan                     0.903532
Lilo & Stitch              0.502114     The Little Mermaid           0.894227
The Little Mermaid         0.498209     Lilo & Stitch                0.893718
Big Hero 6                 0.491800     ✅ Big Hero 6                0.890959
Monsters University        0.484857     ✅ Monsters University       0.890915
The Princess and the Frog  0.471984     ✅ The Princess and the Frog 0.889009
Finding Dory               0.471386     ✅ Finding Dory              0.888350
Maleficent                 0.461029     Ice Princess                 0.885539
Ice Princess               0.457817     Maleficent                   0.885364
Binary quantization: The process

float32                                             →  bit
[0.03265173360705376, 0.01370371412485838, ...]     →  [1, 1, ...]
[-0.00786194484680891, -0.018985141068696976, ...]  →  [0, 0, ...]
[-0.0039056178648024797, 0.019039113074541092, ...] →  [0, 1, ...]

1. Pick a center C based on the average, a sample, or offline knowledge
2. If a value is >= C, map it to 1; otherwise map it to 0
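A minimal sketch of binary quantization (my own illustration; here the center C defaults to the mean over all values, one of the options listed above):

import numpy as np

def binary_quantize(embeddings, center=None):
    embeddings = np.array(embeddings)
    # Pick the center C (default: the average over all values)
    if center is None:
        center = embeddings.mean()
    # Values >= C become 1, values below C become 0
    return (embeddings >= center).astype(np.uint8)

print(binary_quantize([[0.032, 0.013, -0.017], [-0.007, -0.018, 0.019]]))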
Binary quantization: Before & after

"Moana"
float32: [0.03265173360705376, 0.01370371412485838, -0.017748944461345673, ...]
→ quantization →
bit: [1, 1, 0, ...]
Binary quantization: Effects on similarity

float32                              →  bit
[0.03265173360705, 0.013703...]      →  [1, 1, ...]
[-0.00786194484680891, -0.0189...]   →  [0, 0, ...]
[-0.0039056178648024797, 0.0190...]  →  [0, 1, ...]

float32                                 bit
movie                      similarity   movie                        similarity
Moana                      1.000000     Moana                        1.000000
Mulan                      0.546800     ✅ Mulan                     0.686634
Lilo & Stitch              0.502114     The Little Mermaid           0.666260
The Little Mermaid         0.498209     The Princess and the Frog    0.659825
Big Hero 6                 0.491800     Lilo & Stitch                0.657599
Monsters University        0.484857     ❌ Big Hero 6                0.655869
The Princess and the Frog  0.471984     Ice Princess                 0.648046
Finding Dory               0.471386     ✅ Finding Dory              0.643830
Maleficent                 0.461029     The Lion King                0.643088
Ice Princess               0.457817     Maleficent                   0.642270
Quantization: effects on storage size

                               float32                       int8           bit
                               [0.03265173360705,...]        [53, 40,...]   [1, 1,...]
                               [-0.00786194484680891,...]    [27, 19, ...]  [0, 0, ...]
                               [-0.00390561786480247,...]    [29, 44, ...]  [0, 1, ...]

Python built-in number type    12728                         12728          12728
numpy typed arrays             12400                         1648           1648

Databases with vector storage support can often save more space with bits, using techniques such as bit packing.
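A sketch of how sizes like these might be measured for a single 1536-dimension vector (my own illustration; the slide does not show the measurement code):

import sys
import numpy as np

float_vector = np.random.rand(1536)                              # float64 array
int8_vector = np.random.randint(-128, 128, 1536, dtype=np.int8)  # scalar-quantized
bit_vector = np.random.randint(0, 2, 1536, dtype=np.int8)        # binary-quantized
print(sys.getsizeof(float_vector))  # ~12400
print(sys.getsizeof(int8_vector))   # ~1648
print(sys.getsizeof(bit_vector))    # ~1648, one full byte per bit without packing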
Quantization: effects on index size in AI Search

Azure AI Search supports quantization as a way to reduce the vector storage space needed.

                          float32                      int8                          bit
                          [0.03265173360705,...]       [53, 40,...]                  [1, 1,...]
                          [-0.00786194484680891,...]   [27, 19, ...]                 [0, 0, ...]
                          [-0.00390561786480247,...]   [29, 44, ...]                 [0, 1, ...]

Vector index size (MB)    1177.12                      298.519 (74.64% reduction!)   41.8636 (96.44% reduction!)

AI Search has two storage locations for vectors: the HNSW index used for searching, and the actual data storage. The stats above are for index size.

Learn more in the RAG Time series: https://aka.ms/rag-time/journey3
MRL dimension reduction

MRL: Matryoshka Representation Learning

MRL is a technique that lets you reduce the dimensions of a vector, while still retaining much of the original semantic representation.

The OpenAI text-embedding-3-large model has default dimensions of 3072, but can be truncated all the way down to 256 (3072 → 1024 → 512 → 256).

⚠️ Only some models support MRL!

You can truncate either:
• when first generating embeddings
• or when storing in a database (if supported)
Dimension reduction with OpenAI SDK
Specify dimensions when generating an embedding:
embeddings_response = openai_client.embeddings.create(
    model="text-embedding-3-small",
    input="hello world",
    dimensions=256
)
print(embeddings_response.data[0].embedding)
Notebook: dimension_reduction.ipynb
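If you instead truncate an embedding after generating it (the second option on the previous slide), the shortened vector should generally be re-normalized before using cosine or dot-product similarity. A minimal sketch, not from the notebook:

import numpy as np

def truncate_embedding(embedding, dim=256):
    truncated = np.array(embedding[:dim])
    # Re-normalize so the truncated vector is a unit vector again
    return truncated / np.linalg.norm(truncated)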
Dimension reduction: Before & after

"Moana"
dimensions=1536: [0.03265173360705376, 0.01370371412485838, -0.017748944461345673, ...]
dimensions=256:  [0.06316128373146057, 0.02650836855173111, -0.03433343395590782, ...]
Dimension reduction: Effects on similarity

dimensions=1536                              dimensions=256
[0.03265173360705376, 0.01370371412485838,   [0.03265173360705376, 0.01370371412485838,
 -0.017748944461345673,...]                   -0.017748944461345673,...]

dimensions=1536                         dimensions=256
movie                      similarity   movie                        similarity
Moana                      1.000000     Moana                        1.000000
Mulan                      0.546800     The Little Mermaid           0.587367
Lilo & Stitch              0.502114     Mulan                        0.583428
The Little Mermaid         0.498209     Lilo & Stitch                0.575990
Big Hero 6                 0.491800     ✅ Big Hero 6                0.574590
Monsters University        0.484857     ❌ The Princess and the Frog 0.568726
The Princess and the Frog  0.471984     Finding Dory                 0.549391
Finding Dory               0.471386     The Lion King                0.521125
Maleficent                 0.461029     Tangled                      0.513131
Ice Princess               0.457817     ❌ Maleficent                0.511412
Dimension reduction plus quantization

For maximum vector compression, combine both techniques!

1. MRL dimension reduction
2. Scalar or binary quantization

To keep high accuracy, only compress the vectors in the index, oversample when retrieving, and rescore using the originals. That's how Azure AI Search can handle billions of vectors.

Learn more in the RAG Time series: https://aka.ms/rag-time/journey3
Dive even deeper into vector embeddings!

Vector embeddings 101:
• Embedding projector
• Why are Cosine Similarities of Text embeddings almost always positive?
• Expected Angular Differences in Embedding Random Text?
• Embeddings: What they are and why they matter

ANN algorithms:
• HNSW tutorial
• Video: HNSW for Vector Search Explained

Distance metrics:
• Two Forms of the Dot Product
• Is Cosine-Similarity of Embeddings Really About Similarity?

Quantization:
• Scalar quantization 101
• Product quantization 101
• Binary and scalar quantization

MRL dimension reduction:
• Unboxing Nomic Embed v1.5: Resizable Production Embeddings with MRL
• MRL from the Ground Up
Next steps

Join upcoming streams!
🧠 3/11: LLMs
↖️ 3/13: Vector embeddings
🔍 3/18: RAG
3/20: Vision models
3/25: Structured outputs
3/27: Quality & Safety
Register @ aka.ms/PythonAI/series

Come to office hours on Thursdays in Discord: aka.ms/pythonai/oh

Get more Python AI resources: aka.ms/thesource/Python_AI
Thank you!