What is Context Window?

Question

Accepted Answer

The maximum amount of text a language model can process at once in a single interaction, including the conversation history, instructions, and any retrieved documents. The context window is the total amount of text, measured in tokens, that a language model can hold in working memory during a single interaction. Everything the model reads and responds to in one session, including the system prompt, the conversation history, any documents retrieved via RAG, and the user's current message, must fit within this limit. Content outside the context window is invisible to the model.