Ray

A privacy-respecting AI personal assistant. One agent, one browser UI. Runs entirely on your machine via Docker Compose — no data leaves without your permission.

Install (one-liner)

curl -fsSL https://raw.githubusercontent.com/Bigalan09/Ray/main/install.sh | bash

This downloads the compose file, seeds config, and pulls pre-built images from GHCR. You only need Docker installed.

Manual Setup

cp .env.example .env
# Set OPENAI_API_KEY in .env
docker compose up --build
open http://localhost:3000

On first run Ray guides you through a short onboarding conversation to set up your identity and preferences. Type /bootstrap done when finished.

Architecture

Browser :3000
    └─ ray-ui  (Bun static + /api/* proxy)
           └─ ray-api :8000  (FastAPI)
                  ├─ OpenAI Responses API  (primary, gpt-5-mini default)
                  ├─ Optional: Azure OpenAI, Ollama
                  ├─ Agent loop: tool calls, multi-round, retries
                  ├─ Slash commands  /help /tool /task /skill /exec /file ...
                  ├─ Built-in tools + MCP stdio servers
                  ├─ SQLite  (conversations, tasks)
                  ├─ ChromaDB  (vector memory)
                  ├─ Background tasks + cron scheduler  ← ray-worker
                  ├─ Webhooks + lifecycle hooks
                  └─ Auth, rate limiting, audit log

ray-worker      background tasks + cron
ray-redis       task queue + rate limiting
ray-chromadb    vector memory
ray-prometheus  metrics scraping
ray-loki        log aggregation
ray-promtail    log shipper (Docker socket)
ray-grafana     dashboards (localhost:3001 / grafana.bigalan.dev)

Core services: ray-ui, ray-api, ray-worker, ray-redis, ray-chromadb.
Observability: ray-prometheus, ray-loki, ray-promtail, ray-grafana — bundled in docker-compose.yml.
Optional: ray-ollama for local model inference (uncomment in docker-compose.yml).

Features

Feature	Status
Streaming chat (SSE)	✅
Multi-turn tool calls (agent loop, up to 10 rounds)	✅
Web search with citation cards	✅ `web_search` + `web_search_preview`
Persistent conversations (SQLite)	✅
Auto-generated conversation titles	✅
Vector memory (ChromaDB)	✅ store/search + proactive injection per turn
Proactive memory recall	✅ relevant facts injected before each response
Memory panel (browse/search/delete)	✅
Background tasks + WebSocket updates	✅
Cron-scheduled tasks + enable/disable	✅
Webhooks + internal hooks	✅ event listeners + HTTP webhooks
MCP tool servers (stdio, auto-restart)	✅
Exec guardrails (allowlist + approval card)	✅
Image upload + multimodal chat	✅ paste, drag-drop, or file picker
File/PDF RAG ingestion	✅ chunks embedded in ChromaDB, `document_search` tool
Model switching UI	✅ dropdown in header
Workspace file editors (Soul/User/Identity)	✅
Schedule enable/disable	✅
Settings panel	✅
API key management UI	✅
MCP server management form	✅
Mobile-responsive UI	✅ sidebar drawer, 44 px touch targets, dvh layout
Browser telemetry (RUM)	✅ batched events → structlog + Prometheus
Response timing display	✅ shown in status bar
Structured chat errors	✅ sanitised SSE errors with request IDs
URL fetching (`web_fetch`)	✅ HTML-to-text conversion
Workspace file search (`grep_files`, `glob_files`)	✅ regex + glob patterns
Interactive clarification (`ask_user`)	✅ structured questions with options
Sub-agent delegation (`spawn_agent`)	✅ single-task focused delegation

Slash Commands

Type / in the chat input for autocomplete.

Command	Description
`/help`	List all commands
`/new`	New session
`/clear`	Clear current session
`/clear all`	Delete all sessions
`/compact`	Summarise conversation to save tokens
`/status`	System status (MCP servers, tasks, scheduler)
`/tool [name] [args]`	Execute a tool or list tools
`/task [prompt]`	Create a background task
`/schedule [cron\|natural language] [prompt]`	Schedule a recurring task
`/schedule list`	List scheduled tasks
`/schedule remove [name]`	Remove a scheduled task
`/file read\|write\|list\|search`	Workspace file operations
`/skill [name] [input]`	Run a saved prompt template
`/exec <command>`	Run an allowlisted system command (requires approval)
`/exec list`	Show allowed commands
`/hook [list\|add\|remove\|test\|log\|events]`	Manage webhooks
`/agent [name]`	Switch to a named agent, or list available agents
`/bootstrap done\|reset\|status`	Manage first-run onboarding

Configuration

Environment (`.env`)

cp .env.example .env

Variable	Required	Description
`OPENAI_API_KEY`	Yes	OpenAI API key
`OPENAI_BASE_URL`	No	Override for compatible gateways
`RATE_LIMIT_ENABLED`	No	`true`/`false` (default `true`)
`RATE_LIMIT_RPM`	No	Requests per minute (default `1200`)
`RATE_LIMIT_BURST`	No	Burst size (default `200`)
`BRAVE_API_KEY`	No	Enables Brave Search instead of DuckDuckGo

Models (`config/models.yaml`)

Default model is gpt-5-mini (Azure OpenAI). Add or switch providers here — gpt-5-nano (OpenAI direct) and Ollama are also configured out of the box.

Workspace (`workspace/`)

Ray's personal state — not in git. Created from workspace-template/ on first run.

File	Purpose
`SOUL.md`	Personality and principles
`USER.md`	Your profile and preferences
`IDENTITY.md`	Ray's self-identity (written during bootstrap)
`MEMORY.md`	Curated long-term memory
`mcp_servers.json`	MCP server configuration

Back up workspace/ to preserve Ray's entire state.

MCP Tools

Configure external tool servers in workspace/mcp_servers.json:

{
  "servers": [
    {
      "name": "filesystem",
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/workspace"],
      "enabled": true
    }
  ]
}

MCP tools are passed to the LLM alongside built-in tools. Crashed servers are automatically restarted on the next tool call.

Exec Guardrails

Ray can run system commands with strict guardrails. Edit config/guardrails.yaml:

exec:
  enabled: true
  default_timeout: 30
  allow:
    - command: git
      args: ["status", "log", "diff"]
      description: "Git read-only operations"
      timeout: 15

Every command requires an explicit Approve/Deny from the user before it runs. Commands execute with shell=False, a stripped environment, and enforced timeouts.

Hooks

Ray has two hook systems:

Webhooks — HTTP callbacks to external URLs when events fire. Manage via the Webhooks panel in the sidebar or the /hook command.

Webhook events: message_received, command_executed, tool_executing, tool_executed, exec_approved, exec_denied, task_started, task_completed, task_failed, session_created, session_deleted, response_persisted.

Internal hooks — In-process Python event listeners registered via hook_engine.on(event, callback). Support glob patterns (e.g. command:*).

Internal events: gateway:startup, command, command:new, command:reset, command:stop, session:compact:before, session:compact:after, session:patch, agent:bootstrap, message:received, message:preprocessed, message:sent.

Skills

Define reusable prompt templates in config/skills.yaml:

skills:
  - name: summarise
    description: Summarise text
    prompt: "Please summarise:\n\n{input}"
    agent: general

Invoke with /skill summarise <text>.

Security

API key: Disabled until generated. POST /api/auth/key creates a key stored in workspace/api_key. Pass as X-API-Key header.
Rate limiting: Configurable via .env. Defaults to 1200 req/min, 200 burst. Keys by API key → forwarded IP → socket IP.
Audit logging: Mutating requests logged to workspace/audit.db.
All ports bound to 127.0.0.1.
Path traversal protection on all /file and workspace operations.

Testing

# API unit + integration tests (169+ tests, live OpenAI auto-skipped if no key)
cd api && .venv/bin/python -m pytest tests/ -v

# E2E against local dev stack
cd tests && npm test

# E2E against live Docker stack (recommended for CI)
cd tests && npm run test:docker

# Full coverage suite against Docker stack
cd tests && npm run test:docker:full

# API-only (no browser)
cd tests && npm run test:api

Development

# Backend dependencies
cd api
python3.13 -m venv .venv
.venv/bin/python -m pip install -r requirements.txt

# API (hot reload)
.venv/bin/python -m uvicorn main:app --reload --port 8000

# UI (HMR)
cd ui && API_URL=http://localhost:8000 bun run dev

# Or from repo root
npm run ui:dev
npm run ui:build
npm run docker:up

Install dependencies first:

cd api && python3.13 -m venv .venv && .venv/bin/python -m pip install -r requirements.txt
cd ui && bun install
cd tests && npm install

Playwright prefers api/.venv/bin/python when it is healthy, then falls back to python3.13, python3.12, and finally python3. Python 3.14 is currently too new for the pinned ChromaDB stack. Override with PYTHON_BIN if you need a different interpreter.

Release

Docker images are published to GHCR on every version tag and manual workflow dispatch:

ghcr.io/bigalan09/ray-api:latest
ghcr.io/bigalan09/ray-ui:latest

To release a new version:

git tag v1.2.3
git push origin v1.2.3

The GitHub Actions workflow builds linux/amd64 and linux/arm64 images and pushes them to GHCR automatically.

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
.github/workflows		.github/workflows
api		api
config		config
tests		tests
ui		ui
workspace-template		workspace-template
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
ISSUES.md		ISSUES.md
README.md		README.md
ROADMAP.md		ROADMAP.md
docker-compose.ghcr.yml		docker-compose.ghcr.yml
docker-compose.yml		docker-compose.yml
install.sh		install.sh
package.json		package.json
ray-icon.ico		ray-icon.ico
ray-icon.svg		ray-icon.svg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ray

Install (one-liner)

Manual Setup

Architecture

Features

Slash Commands

Configuration

Environment (`.env`)

Models (`config/models.yaml`)

Workspace (`workspace/`)

MCP Tools

Exec Guardrails

Hooks

Skills

Security

Testing

Development

Release

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Ray

Install (one-liner)

Manual Setup

Architecture

Features

Slash Commands

Configuration

Environment (.env)

Models (config/models.yaml)

Workspace (workspace/)

MCP Tools

Exec Guardrails

Hooks

Skills

Security

Testing

Development

Release

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Environment (`.env`)

Models (`config/models.yaml`)

Workspace (`workspace/`)

Packages