Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.openinterpreter.com/llms.txt

Use this file to discover all available pages before exploring further.

Most settings — like model architecture and GPU offloading — can be adjusted via your LLM providers like LM Studio. However, max_tokens and context_window should be set via Open Interpreter. For local mode, smaller context windows will use less RAM, so we recommend trying a much shorter window (~1000) if it’s is failing or if it’s slow.
interpreter --local --max_tokens 1000 --context_window 3000

Make sure max_tokens is less than context_window.