This page describes how to tune your indexes to achieve faster query performance and better recall in AlloyDB for PostgreSQL.
Analyze your queries
Use the EXPLAIN ANALYZE command to analyze your query insights as shown in the following example SQL query.
EXPLAIN ANALYZE SELECT result-column FROM my-table ORDER BY EMBEDDING_COLUMN <-> embedding('text-embedding-005', 'What is a database?')::vector LIMIT 1; The example response QUERY PLAN includes information such as the time taken, the number of rows scanned or returned, and the resources used.
Limit (cost=0.42..15.27 rows=1 width=32) (actual time=0.106..0.132 rows=1 loops=1) -> Index Scan using my-scann-index on my-table (cost=0.42..858027.93 rows=100000 width=32) (actual time=0.105..0.129 rows=1 loops=1) Order By: (embedding_column <-> embedding('text-embedding-005', 'What is a database?')::vector(768)) Limit value: 1 Planning Time: 0.354 ms Execution Time: 0.141 ms View vector index metrics
You can use vector index metrics to review performance of your vector index, identify areas for improvement, and tune your index based on the metrics, if needed. The pg_stat_ann_indexes view helps you understand the state of index utilization while the pg_stat_ann_index_creation view provides information about rows created at the time of index creation.
To view index utilization metrics, run the following command:
SELECT * FROM pg_stat_ann_indexes; You see output similar to the following:
-[ RECORD 1 ]----------+--------------------------------------------------------------------------- relid | 271236 indexrelid | 271242 schemaname | public relname | t1 indexrelname | t1_ix1 indextype | scann indexconfig | {num_leaves=100,quantizer=SQ8} indexsize | 832 kB indexscan | 0 insertcount | 250 deletecount | 0 updatecount | 0 partitioncount | 100 distribution | {"average": 3.54, "maximum": 37, "minimum": 0, "outliers": [37, 12, 11, 10, 10, 9, 9, 9, 9, 9]} distributionpercentile |{"10": { "num_vectors": 0, "num_partitions": 0 }, "25": { "num_vectors": 0, "num_partitions": 30 }, "50": { "num_vectors": 3, "num_partitions": 30 }, "75": { "num_vectors": 5, "num_partitions": 19 }, "90": { "num_vectors": 7, "num_partitions": 11 }, "95": { "num_vectors": 9, "num_partitions": 5 }, "99": { "num_vectors": 12, "num_partitions": 4 }, "100": { "num_vectors": 37, "num_partitions": 1 }} To view number of rows created at the time of index creation, run the following command:
SELECT * FROM pg_stat_ann_index_creation; You see output similar to the following:
-[ RECORD 1 ]----------+--------------------------------------------------------------------------- relid | 271236 indexrelid | 271242 schemaname | public relname | t1 indexrelname | t1_ix1 index_rows_at_creation_time | 262144 For more information about the complete list of metrics, see Vector index metrics.
What's next
- Maintain vector indexes.
- Learn about an example embedding workflow.