What is a vector embedding and what is it used for

A vector embedding is a list of real numbers that represents the semantic meaning of a piece of text, an image, or any other data. Two sentences with the same meaning produce vectors that are close together; two unrelated ones produce vectors that are far apart. Semantic search, RAG, and recommendation systems are all built on this principle.

July 5, 2026 5 min 50

Artificial Intelligence

RAG with Postgres and pgvector in production: from PoC to SLO

Embeddings, HNSW indexing, reranking, evaluation, context window, latency under load. Full stack with code and measurable SLOs.

June 9, 2026 10 min 308

Architecture

Hybrid RAG in 2026: the patterns that keep winning

Hybrid RAG in 2026 combines dense and lexical search fused with RRF, cross-encoder reranking over the top-50 candidates, structure-aware chunking, and continuous evaluation with Ragas or TruLens. It is the pattern that survives in serious production systems three years after the initial embeddings boom.

April 28, 2026 3 min 567 4.6

Architecture

Redis 8.2 and its vector support: when it actually makes sense

Redis 8.2 ships vector search as a native data type. The real question is whether it replaces a dedicated engine like Qdrant, Weaviate, or pgvector on workloads with millions of vectors and tight latency budgets, or only works as a bonus on top of the cache you already run.

August 4, 2025 5 min 224 4.6

Artificial Intelligence

RAG 2.0: knowledge graphs, vectors, and hybrid

El RAG de 2023 era búsqueda vectorial con un LLM detrás. El de 2025 es un sistema híbrido que combina vectores, búsqueda léxica y grafos de conocimiento. Qué ha cambiado, dónde funciona cada pieza y qué decisiones marcan la diferencia entre un RAG útil y uno decepcionante.

July 17, 2025 6 min 331 4.2

Artificial Intelligence

nomic-embed-text: Competitive Open Embeddings

nomic-embed-text-v1.5 from Nomic AI is an embedding model with weights, code and training data released under Apache 2.0: 137 million parameters, up to 8192 tokens of context, and an MTEB score of 62.4, almost matching the 62.3 of OpenAI's text-embedding-3-small, at 768 dimensions instead of 1536.

May 5, 2024 4 min 250 4.4

Artificial Intelligence

OpenAI text-embedding-3: What Changes vs the Previous One

OpenAI released text-embedding-3 on 25 January 2024 in two variants: small and large. It improves MTEB quality over ada-002, adds variable dimensions you can truncate without retraining, and lowers the price for small. Migration pays off for most serious RAG setups, but measure real recall on your own corpus before reindexing everything.

January 27, 2024 4 min 244 4.5

Architecture

pgvector in 2024: HNSW Indexes and Real Scaling

pgvector matured in 2023-2024 with the HNSW index type and parallel construction that arrived in version 0.6. For projects already running PostgreSQL, a dedicated vector database is not needed in most cases: this guide explains when PostgreSQL is enough, how to configure the index, and where it starts to fall short.

January 21, 2024 5 min 233 4.4

Artificial Intelligence

Cohere Embed v3: Multilingual and Enterprise-Oriented

Cohere Embed v3 is an embedding model that distinguishes queries from documents via the input_type parameter and scores intrinsic text quality, with multilingual support for over 100 languages at 1024 dimensions. It costs $0.10 per million tokens versus OpenAI's $0.02, and delivers better recall in multilingual RAG.

January 9, 2024 4 min 228 4.2

Architecture

Vector Databases: Qdrant, Pinecone, and Weaviate

Vector databases have gone from an experimental curiosity to the central component of most LLM-based products. This comparison covers Qdrant, Pinecone, and Weaviate: architecture, strengths, limitations, and a decision tree for choosing the right option based on your operational priorities and budget.

November 13, 2023 5 min 258 4.3

Architecture

pgvector: Semantic Search Without Leaving Postgres

pgvector turns PostgreSQL into a fully functional vector database without adding a separate service to the stack. It extends Postgres with the vector type, IVFFlat indexes for approximate nearest-neighbour search (ANN), and the ability to combine relational SQL filters with vector ranking in a single query. For most RAG projects and internal chatbots, those limits never become a problem.

November 1, 2023 6 min 227

Artificial Intelligence

Text Embeddings: Turning Words Into Useful Vectors

A text embedding is a numeric vector that encodes the meaning of a word or phrase, so that semantically similar pieces of text produce nearby vectors measured by cosine distance. The models most used in production are OpenAI ada-002, Sentence Transformers, and BGE, and they mainly serve semantic search, RAG systems, and text classification without training a classic classifier.

October 26, 2023 5 min 269 4.8

Architecture

Chroma: A Lightweight Vector Database for Embedding Prototypes

Chroma is the easiest vector database to get started with embeddings and semantic search: install it with pip install chromadb, no extra infrastructure required, and it exposes a minimal API (add, query, delete). It suits prototypes and mid-sized RAG systems well; past a few million vectors, Qdrant or Milvus scale better.

October 17, 2023 5 min 235 4.4

Artificial Intelligence

Vector Database Comparison: Qdrant, Pinecone, and Weaviate

Qdrant is the pick when full control and performance in self-hosted setups matter most; Pinecone wins for fully managed SaaS with zero operations; Weaviate stands out when native embeddings and hybrid search built into one pipeline add real value. This comparison covers architecture, quantisation, filtering, and RAG use cases to help you decide based on budget and control needs.

March 30, 2023 4 min 339 4.4