| --- |
| title: FARA - Browser Use Agent |
| emoji: π€ |
| colorFrom: blue |
| colorTo: purple |
| sdk: docker |
| pinned: true |
| license: mit |
| short_description: Microsoft Fara-7B Browser Use Demo inspired by CUA2 |
| app_port: 7860 |
| tags: |
| - computer-use |
| - browser-automation |
| - ai-agent |
| - vision-language-model |
| --- |
| |
| # π€ FARA - Computer Use Agent Demo |
|
|
| FARA (Fara Agent for Real-world Automation) is an AI agent that can browse the web and complete tasks autonomously. |
|
|
| ## Features |
|
|
| - π **Autonomous Web Navigation** - The agent can browse websites on its own |
| - π **Web Search** - Search for information across the web |
| - π **Form Filling** - Fill out forms automatically |
| - π±οΈ **Point and Click** - Click buttons, links, and elements |
| - β¨οΈ **Text Input** - Type text into fields |
| - π **Page Scrolling** - Scroll through content |
|
|
| ## How to Use |
|
|
| 1. Enter a task in natural language (e.g., "Search for the latest news about AI") |
| 2. Click "Run Task" and watch the agent work! |
| 3. View the screenshots to see each step the agent takes |
|
|
| ## Powered By |
|
|
| - **Microsoft Fara-7B** - Vision-Language Model for computer use |
| - **Playwright** - Browser automation framework |
| - **Modal** - Model hosting and inference |
|
|
| ## Links |
|
|
| - [GitHub Repository](https://github.com/microsoft/fara) |
|
|
| ## License |
|
|
| MIT License |