
DroidRun is a powerful framework for controlling Android and iOS devices through LLM agents. It allows you to automate device interactions using natural language commands. Check out our benchmark results.
- 🤖 Control Android and iOS devices with natural language commands
- 🔀 Supports multiple LLM providers (OpenAI, Anthropic, Gemini, Ollama, DeepSeek)
- 🧠 Planning capabilities for complex multi-step tasks
- 💻 Easy-to-use CLI with enhanced debugging features
- 🐍 Extendable Python API for custom automations
- 📸 Screenshot analysis for visual understanding of the device
- Execution tracing with Arize Phoenix
pip install droidrun
See our docs to get DroidRun up and running within seconds!
- Group Chat Summarization: Let DroidRun summarize an escalated group chat for you.
- Travel Search Assistant: Witness DroidRun look for the cheapest stay and share it with a colleague on Telegram.
- Automate TikTok Shopping: See how DroidRun looks for a Stanley cup on TikTok Shop and sends the product details via email.
- Automated UI testing of mobile applications
- Creating guided workflows for non-technical users
- Automating repetitive tasks on mobile devices
- Remote assistance for less technical users
- Exploring mobile UI with natural language commands
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
To ensure the security of the codebase, we have integrated security checks using bandit and safety. These tools help identify potential security issues in the code and dependencies.
Before submitting any code, please run the following security checks:
- Bandit: A tool to find common security issues in Python code.
  bandit -r droidrun
- Safety: A tool to check your installed dependencies for known security vulnerabilities.
  safety scan
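
If you prefer to run both checks in one step, the two commands above can be wrapped in a small Python helper. This is only a sketch and is not part of DroidRun: the names `SECURITY_CHECKS`, `run_security_checks`, `all_passed`, and `main` are hypothetical, and it assumes `bandit` and `safety` are installed and on your `PATH`.

```python
import shutil
import subprocess

# The two commands quoted in this section; nothing else is assumed about the tools.
SECURITY_CHECKS = [
    ["bandit", "-r", "droidrun"],
    ["safety", "scan"],
]


def run_security_checks(checks=SECURITY_CHECKS, runner=subprocess.call, which=shutil.which):
    """Run each check in order and return {command string: exit code}.

    A tool that is not found on PATH is recorded as None instead of raising.
    `runner` and `which` are injectable so the helper can be tested without
    the tools installed (hypothetical design choice for this sketch).
    """
    results = {}
    for cmd in checks:
        label = " ".join(cmd)
        if which(cmd[0]) is None:
            results[label] = None  # tool missing
            continue
        results[label] = runner(cmd)  # exit code of the tool
    return results


def all_passed(results):
    """True only if every check ran and exited with code 0."""
    return all(code == 0 for code in results.values())


def main():
    """Print one line per check and return a process exit code (0 = all passed)."""
    results = run_security_checks()
    for label, code in results.items():
        status = "not installed" if code is None else f"exit code {code}"
        print(f"{label}: {status}")
    return 0 if all_passed(results) else 1
```

To use it as a script, save it next to the repository root and end the file with `raise SystemExit(main())`. A missing tool is treated as a failure so that a skipped check is never mistaken for a clean one.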