This project provides two AI assistants:
- Webcam Assistant: Interact with an AI assistant that captures video using your webcam
- Desktop Assistant: Interact with an AI assistant that captures your desktop screen
You need the following API keys to run the project:
- OPENAI_API_KEY: Required for OpenAI GPT models
- GOOGLE_API_KEY: Optional, only needed if you use Google's AI models
Store these keys in a .env file in the root directory of the project or set them as environment variables.
- Python: Version 3.7 or later
- Apple Silicon Users: If you're running this on Apple Silicon, install PortAudio:
brew install portaudioClone the repository and navigate into the project directory:
git clone <repository_url>
cd <repository_directory>Create a virtual environment to keep dependencies isolated:
python3 -m venv .venvFor macOS/Linux:
source .venv/bin/activateFor Windows:
.venv\Scripts\activateEnsure you have the latest version of pip:
pip install --upgrade pipInstall all the required packages from requirements.txt:
pip install -r requirements.txtRun the assistant that uses your webcam:
python webcamAssistant.pyRun the assistant that captures your desktop:
python desktopAssistant.pyCreate a .env file in the root directory and add your API keys:
OPENAI_API_KEY=your_openai_api_key
GOOGLE_API_KEY=your_google_api_key
For both assistants, press the ESC key or q in the display window to quit.
- Virtual Environment: Always activate the virtual environment before running the assistants to ensure dependencies are available
- Desktop Assistant Features: The desktop assistant takes screenshots of your screen to provide context for the AI assistant
- Webcam Assistant Features: The webcam assistant uses live video feed from your webcam to interact with the AI assistant
- Audio Issues: Ensure your microphone and speakers are configured correctly for speech recognition and TTS (Text-to-Speech)
- Dependency Issues: If you encounter errors while installing packages, ensure you're inside the virtual environment and using Python 3.7 or later
- PortAudio on macOS: If
pyaudiofails to install, ensureportaudiois installed via Homebrew (brew install portaudio)
Feel free to fork this repository, open issues, or submit pull requests to improve the assistants or add new features.
This project is licensed under the MIT License.