Genie is an AI assistant that lives on your desktop. Press Cmd+G, say what you need, and watch Genie work across your apps autonomously.
How it works
Genie captures everything it needs from your screen and voice to understand and execute your request.
Press Cmd+G and say what you need in plain English. Genie captures your voice, your screen, and your cursor position for full context.
Genie sees what you see. It reads your screen, recognizes the app you're in, and understands exactly what you're pointing at. No copy-pasting or explaining.
Watch Genie work across your apps — browsing the web, editing documents, sending messages, running commands — all autonomously.
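One way to picture the three steps above is as a single bundle of captured inputs that is assembled before anything is executed. The sketch below is purely illustrative: `RequestContext`, its fields, and `describe` are hypothetical names chosen for this example, not Genie's internals.

```python
from dataclasses import dataclass


@dataclass
class RequestContext:
    """Hypothetical bundle of what is captured when Cmd+G is pressed."""
    transcript: str            # what the user said
    active_app: str            # app in the foreground
    screen_text: str           # text read from the screen
    cursor: tuple[int, int]    # cursor position in screen coordinates


def describe(ctx: RequestContext) -> str:
    """Flatten the context into one plain-text description of the request."""
    x, y = ctx.cursor
    return (
        f"Request: {ctx.transcript}\n"
        f"App: {ctx.active_app}\n"
        f"Cursor: ({x}, {y})\n"
        f"Screen text: {ctx.screen_text}"
    )
```

Keeping voice, screen, and cursor in one value is what would let a request like "summarize this" resolve without the user naming the app or pasting the text.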
Features
Built with native macOS APIs for speed, privacy, and seamless integration with your workflow.
Local speech-to-text via Whisper. No internet required for transcription. Just speak naturally and Genie listens.
Captures screenshots, reads text via OCR, and tracks your cursor — so it understands context just by looking at what you're looking at.
Controls your browser, reads PDFs, edits Google Docs, runs terminal commands, types and clicks — all without you lifting a finger.
Built on Tauri with Rust. 30 MB binary, 30 MB idle RAM. Direct macOS API access — no Electron, no bloat.
Use cases
Here are some things people say to Genie every day.
Download Genie, sign in, and you're ready. Onboarding walks you through entering your API key and granting permissions.
Apple Silicon (M1/M2/M3/M4). Requires macOS 12.3+. Intel build coming soon.
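To check a Mac against these requirements before downloading, a minimal sketch in Python follows. It assumes Python's standard `platform` module reports the architecture as `arm64` on Apple Silicon (a Python build running under Rosetta may report `x86_64` instead); `meets_requirements` is an illustrative name, not part of Genie.

```python
import platform

MIN_MACOS = (12, 3)  # from the stated requirement: macOS 12.3+


def parse_version(version: str) -> tuple[int, ...]:
    """Turn a string like '13.4.1' into (13, 4, 1)."""
    return tuple(int(part) for part in version.split(".") if part.isdigit())


def meets_requirements(machine: str, macos_version: str) -> bool:
    """Return True for Apple Silicon running macOS 12.3 or later."""
    if machine != "arm64" or not macos_version:
        return False
    return parse_version(macos_version) >= MIN_MACOS


if __name__ == "__main__":
    # platform.mac_ver()[0] is an empty string on non-macOS systems.
    ok = meets_requirements(platform.machine(), platform.mac_ver()[0])
    print("Supported" if ok else "Not supported")
```

Comparing versions as tuples of integers avoids the string-comparison trap where "12.10" would sort before "12.3".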
Sign in with Google, paste your Anthropic API key, and grant Microphone, Accessibility, and Screen Recording permissions when prompted. Then press Cmd+G to start.