InterfaceAI is a Chrome extension that turns natural language into action.
Ask it to “set up an AWS instance,” “book a flight,” or “open my calculus lecture,” and it will understand your intent, navigate webpages, and execute tasks directly on your screen.
By combining intent recognition, vision-based action execution, and contextual awareness, InterfaceAI goes beyond existing tools with autonomous webpage exploration and personalized digital profiles—building the next generation of AI agents that make technology work for you.
- 💬 Natural language task execution (e.g., “Set up AWS instance”)
- 👀 Vision-based screen understanding and action execution
- 🌐 Autonomous webpage exploration
- 🧠 Contextual awareness for multi-step tasks
- 🪪 Personalized digital user profiles
- 🔒 Secure and privacy-first agent design
- Frontend: React + Chrome Extension APIs
- Backend / Infra: Flask, SQL
- ML / AI: Google Gemini API, OCR / Vision Models
- Hosting / Deployment: GCP (Cloud Run to easily deploy our docker containers, BigTable, BigQuery)
| Photo | Contributor | Role & Tools | Notable Contributions |
|---|---|---|---|
![]() | Sachin Gupta Project Lead | Python · PyTorch · GCP | Ideation |
| Adrian Najmi AI | Python · PyTorch · Playwright · Tesseract OCR | Ideation | |
| Hassan Ibrahim Backend | Python · Flask | Database Design | |
| Luna Aljammal Backend + DB | Python · Flask · Redis · GCP | Database Integration | |
| Andrew Wu Frontend | JavaScript | UI & Frontend Development | |
| Mohit Gedela Vision AI | Python · Java · JavaScript | Vision AI Development | |
| Alex Melnbardis AI | Python · PyTorch · Playwright · LangGraph | AI Agents + Vision AI | |
| David Olejniczak CI/CD + Cloud | CGP · Docker · Github Actions | CI/CD | |
Quick Start Guide - Get started in minutes
DevOps Cheatsheet - Docker and local development
CI/CD Documentation - Tests, linters, and formatters
- frontend: Chrome Extension (
frontend) - backend: Flask API at
http://localhost:5000(backend/app/main.py) - playwright: Minimal Python worker (
playwright/worker.py) - vision-ai: Minimal Flask service (
vision-ai/service.py) - redis: Cache/queue at
localhost:6379 - postgres: DB at
localhost:5432with init scriptscripts/init_db.sql
docker compose up --buildHealth checks:
- Backend:
GET http://localhost:5000/health - Relay:
POST http://localhost:5000/api/relaywith{ "message": "hello" }
To stop:
docker compose downThis repo includes a Chrome extension UI in frontend/:
frontend/manifest.jsonfrontend/popup.htmlfrontend/popup.js
The popup sends a POST to http://localhost:5000/api/relay and displays the response. CORS and host_permissions are configured to allow this in development.
- Open Chrome and go to
chrome://extensions. - Toggle on "Developer mode" (top-right).
- Click "Load unpacked" and select the
frontend/directory. - Click the extension icon (puzzle piece) → pin "InterfaceAI MVP".
- Open the popup, type a message, and click "Send to backend".
If you see CORS issues, ensure Docker services are running and the backend is reachable at http://localhost:5000.
interface-ai/ ├── docker-compose.yml ├── backend/ │ ├── Dockerfile │ ├── requirements.txt │ └── app/ │ └── main.py ├── frontend/ │ ├── manifest.json │ ├── popup.html │ └── popup.js ├── playwright/ │ ├── Dockerfile │ └── worker.py ├── vision-ai/ │ ├── Dockerfile │ └── service.py └── scripts/ └── init_db.sql - Backend CORS allows
http://localhost:*andchrome-extension://*for simple dev flow. - Update
frontend/manifest.jsonhost_permissionsif backend URL changes. - Postgres credentials are development defaults; change for production.
