What is Retrieval-Augmented Generation (RAG)?

Question

Accepted Answer

An AI architecture pattern that improves the accuracy of language model responses by retrieving relevant documents or data at query time and including them in the model's context. Retrieval-augmented generation, commonly called RAG, is an architecture pattern used to make AI language models more accurate and grounded in specific, up-to-date information. Rather than relying solely on what a model learned during training, RAG systems first retrieve relevant documents, records, or data from a knowledge base, then pass that retrieved content to the model as context alongside the user's query.