OS mode is a highly experimental mode that allows Open Interpreter to control the operating system visually through the mouse and keyboard. It provides a multimodal LLM like GPT-4V with the necessary tools to capture screenshots of the display and interact with on-screen elements such as text and icons. It will try to use the most direct method to achieve the goal, like using spotlight on Mac to open applications, and using query parameters in the URL to open websites with additional information. OS mode is a work in progress, if you have any suggestions or experience issues, please reach out on our Discord. To enable OS Mode, run the interpreter with theDocumentation Index
Fetch the complete documentation index at: https://docs.openinterpreter.com/llms.txt
Use this file to discover all available pages before exploring further.
--os flag:

