Techniques

RAG (Retrieval-Augmented Generation)

A technique that connects an LLM to external knowledge bases to improve accuracy and reduce hallucinations.

Detailed Explanation

Retrieval-Augmented Generation (RAG) is a framework that improves the quality of LLM-generated responses by grounding the model in external sources of knowledge. When a user asks a question, a RAG system first searches a data store (often a vector database) for relevant documents. It then passes the retrieved documents to the LLM alongside the user's prompt. Because the model can draw on current, domain-specific, or private data that was not in its training set, its answers are less prone to hallucination, though the benefit depends on the quality of what is retrieved.
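The retrieve-then-prompt flow described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a production design: the `embed` function is a toy bag-of-words stand-in for a real embedding model, the in-memory `DOCUMENTS` list stands in for a vector database, and all names (`embed`, `cosine`, `retrieve`, `build_prompt`) are hypothetical. The final call to an LLM is left out because it depends on the provider.

```python
import math
import re
from collections import Counter

# Hypothetical knowledge base; a real system would keep these in a vector database.
DOCUMENTS = [
    "The refund window is 30 days from delivery.",
    "Support is available on weekdays from 9 to 5.",
    "Shipping is free on orders over 50 dollars.",
]

def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(query_vec, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(query: str, context_docs: list[str]) -> str:
    """Combine the retrieved documents and the user's question into one prompt."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )

question = "What is the refund window?"
top_docs = retrieve(question, DOCUMENTS, k=1)
prompt = build_prompt(question, top_docs)
# The prompt would now be sent to an LLM of your choice.
print(prompt)
```

In practice the toy `embed` function would be replaced by a learned embedding model, which matches on meaning rather than on shared words, and the sorted scan over every document would be replaced by an index lookup in a vector database.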

Related Terms

Hallucination

When an AI model confidently generates false, nonsensical, or unverified information.

Large Language Model (LLM)

A type of foundation model trained on vast amounts of text to understand and generate human language.

Vector Database

A specialized database designed to store and search data represented as mathematical vectors (embeddings).
