Spaces:

Sanjeev23oct
/

browser-use-sg

Running

App Files Files Community

browser-use-sg / README.md

Sanjeev23oct

Upload folder using huggingface_hub

f1d5e1c verified 10 days ago

preview code

raw

history blame contribute delete

9.59 kB

	---
	title: browser-use-sg
	app_file: webui.py
	sdk: gradio
	sdk_version: 5.23.1
	---
	<img src="./assets/web-ui.png" alt="Browser Use Web UI" width="full"/>

	<br/>

	[![GitHub stars](https://img.shields.io/github/stars/browser-use/web-ui?style=social)](https://github.com/browser-use/web-ui/stargazers)
	[![Discord](https://img.shields.io/discord/1303749220842340412?color=7289DA&label=Discord&logo=discord&logoColor=white)](https://link.browser-use.com/discord)
	[![Documentation](https://img.shields.io/badge/Documentation-📕-blue)](https://docs.browser-use.com)
	[![WarmShao](https://img.shields.io/twitter/follow/warmshao?style=social)](https://x.com/warmshao)

	This project builds upon the foundation of the [browser-use](https://github.com/browser-use/browser-use), which is designed to make websites accessible for AI agents.

	We would like to officially thank [WarmShao](https://github.com/warmshao) for his contribution to this project.

	WebUI: is built on Gradio and supports most of `browser-use` functionalities. This UI is designed to be user-friendly and enables easy interaction with the browser agent.

	Expanded LLM Support: We've integrated support for various Large Language Models (LLMs), including: Google, OpenAI, Azure OpenAI, Anthropic, DeepSeek, Ollama etc. And we plan to add support for even more models in the future.

	Custom Browser Support: You can use your own browser with our tool, eliminating the need to re-login to sites or deal with other authentication challenges. This feature also supports high-definition screen recording.

	Persistent Browser Sessions: You can choose to keep the browser window open between AI tasks, allowing you to see the complete history and state of AI interactions.

	<video src="https://github.com/user-attachments/assets/56bc7080-f2e3-4367-af22-6bf2245ff6cb" controls="controls">Your browser does not support playing this video!</video>

	## Installation Guide

	### Prerequisites
	- Python 3.11 or higher
	- Git (for cloning the repository)

	### Option 1: Local Installation

	Read the [quickstart guide](https://docs.browser-use.com/quickstart#prepare-the-environment) or follow the steps below to get started.

	#### Step 1: Clone the Repository
	```bash
	git clone https://github.com/browser-use/web-ui.git
	cd web-ui
	```

	#### Step 2: Set Up Python Environment
	We recommend using [uv](https://docs.astral.sh/uv/) for managing the Python environment.

	Using uv (recommended):
	```bash
	uv venv --python 3.11
	```

	Activate the virtual environment:
	- Windows (Command Prompt):
	```cmd
	.venv\Scripts\activate
	```
	- Windows (PowerShell):
	```powershell
	.\.venv\Scripts\Activate.ps1
	```
	- macOS/Linux:
	```bash
	source .venv/bin/activate
	```

	#### Step 3: Install Dependencies
	Install Python packages:
	```bash
	uv pip install -r requirements.txt
	```

	Install Browsers in Playwright:
	You can install specific browsers by running:
	```bash
	playwright install --with-deps chromium
	```

	To install all browsers:
	```bash
	playwright install
	```

	#### Step 4: Configure Environment
	1. Create a copy of the example environment file:
	- Windows (Command Prompt):
	```bash
	copy .env.example .env
	```
	- macOS/Linux/Windows (PowerShell):
	```bash
	cp .env.example .env
	```
	2. Open `.env` in your preferred text editor and add your API keys and other settings

	### Option 2: Docker Installation

	#### Prerequisites
	- Docker and Docker Compose installed
	- [Docker Desktop](https://www.docker.com/products/docker-desktop/) (For Windows/macOS)
	- [Docker Engine](https://docs.docker.com/engine/install/) and [Docker Compose](https://docs.docker.com/compose/install/) (For Linux)

	#### Installation Steps
	1. Clone the repository:
	```bash
	git clone https://github.com/browser-use/web-ui.git
	cd web-ui
	```

	2. Create and configure environment file:
	- Windows (Command Prompt):
	```bash
	copy .env.example .env
	```
	- macOS/Linux/Windows (PowerShell):
	```bash
	cp .env.example .env
	```
	Edit `.env` with your preferred text editor and add your API keys

	3. Run with Docker:
	```bash
	# Build and start the container with default settings (browser closes after AI tasks)
	docker compose up --build
	```
	```bash
	# Or run with persistent browser (browser stays open between AI tasks)
	CHROME_PERSISTENT_SESSION=true docker compose up --build
	```


	4. Access the Application:
	- Web Interface: Open `http://localhost:7788` in your browser
	- VNC Viewer (for watching browser interactions): Open `http://localhost:6080/vnc.html`
	- Default VNC password: "youvncpassword"
	- Can be changed by setting `VNC_PASSWORD` in your `.env` file

	## Usage

	### Local Setup
	1. Run the WebUI:
	After completing the installation steps above, start the application:
	```bash
	python webui.py --ip 127.0.0.1 --port 7788
	```
	2. WebUI options:
	- `--ip`: The IP address to bind the WebUI to. Default is `127.0.0.1`.
	- `--port`: The port to bind the WebUI to. Default is `7788`.
	- `--theme`: The theme for the user interface. Default is `Ocean`.
	- Default: The standard theme with a balanced design.
	- Soft: A gentle, muted color scheme for a relaxed viewing experience.
	- Monochrome: A grayscale theme with minimal color for simplicity and focus.
	- Glass: A sleek, semi-transparent design for a modern appearance.
	- Origin: A classic, retro-inspired theme for a nostalgic feel.
	- Citrus: A vibrant, citrus-inspired palette with bright and fresh colors.
	- Ocean (default): A blue, ocean-inspired theme providing a calming effect.
	- `--dark-mode`: Enables dark mode for the user interface.
	3. Access the WebUI: Open your web browser and navigate to `http://127.0.0.1:7788`.
	4. Using Your Own Browser(Optional):
	- Set `CHROME_PATH` to the executable path of your browser and `CHROME_USER_DATA` to the user data directory of your browser. Leave `CHROME_USER_DATA` empty if you want to use local user data.
	- Windows
	```env
	CHROME_PATH="C:\Program Files\Google\Chrome\Application\chrome.exe"
	CHROME_USER_DATA="C:\Users\YourUsername\AppData\Local\Google\Chrome\User Data"
	```
	> Note: Replace `YourUsername` with your actual Windows username for Windows systems.
	- Mac
	```env
	CHROME_PATH="/Applications/Google Chrome.app/Contents/MacOS/Google Chrome"
	CHROME_USER_DATA="/Users/YourUsername/Library/Application Support/Google/Chrome"
	```
	- Close all Chrome windows
	- Open the WebUI in a non-Chrome browser, such as Firefox or Edge. This is important because the persistent browser context will use the Chrome data when running the agent.
	- Check the "Use Own Browser" option within the Browser Settings.
	5. Keep Browser Open(Optional):
	- Set `CHROME_PERSISTENT_SESSION=true` in the `.env` file.

	### Docker Setup
	1. Environment Variables:
	- All configuration is done through the `.env` file
	- Available environment variables:
	```
	# LLM API Keys
	OPENAI_API_KEY=your_key_here
	ANTHROPIC_API_KEY=your_key_here
	GOOGLE_API_KEY=your_key_here

	# Browser Settings
	CHROME_PERSISTENT_SESSION=true # Set to true to keep browser open between AI tasks
	RESOLUTION=1920x1080x24 # Custom resolution format: WIDTHxHEIGHTxDEPTH
	RESOLUTION_WIDTH=1920 # Custom width in pixels
	RESOLUTION_HEIGHT=1080 # Custom height in pixels

	# VNC Settings
	VNC_PASSWORD=your_vnc_password # Optional, defaults to "vncpassword"
	```

	2. Platform Support:
	- Supports both AMD64 and ARM64 architectures
	- For ARM64 systems (e.g., Apple Silicon Macs), the container will automatically use the appropriate image

	3. Browser Persistence Modes:
	- Default Mode (CHROME_PERSISTENT_SESSION=false):
	- Browser opens and closes with each AI task
	- Clean state for each interaction
	- Lower resource usage

	- Persistent Mode (CHROME_PERSISTENT_SESSION=true):
	- Browser stays open between AI tasks
	- Maintains history and state
	- Allows viewing previous AI interactions
	- Set in `.env` file or via environment variable when starting container

	4. Viewing Browser Interactions:
	- Access the noVNC viewer at `http://localhost:6080/vnc.html`
	- Enter the VNC password (default: "vncpassword" or what you set in VNC_PASSWORD)
	- Direct VNC access available on port 5900 (mapped to container port 5901)
	- You can now see all browser interactions in real-time

	5. Container Management:
	```bash
	# Start with persistent browser
	CHROME_PERSISTENT_SESSION=true docker compose up -d

	# Start with default mode (browser closes after tasks)
	docker compose up -d

	# View logs
	docker compose logs -f

	# Stop the container
	docker compose down
	```

	## Changelog
	- [x] 2025/01/26: Thanks to @vvincent1234. Now browser-use-webui can combine with DeepSeek-r1 to engage in deep thinking!
	- [x] 2025/01/10: Thanks to @casistack. Now we have Docker Setup option and also Support keep browser open between tasks.[Video tutorial demo](https://github.com/browser-use/web-ui/issues/1#issuecomment-2582511750).
	- [x] 2025/01/06: Thanks to @richard-devbot. A New and Well-Designed WebUI is released. [Video tutorial demo](https://github.com/warmshao/browser-use-webui/issues/1#issuecomment-2573393113).