Add non-code MCP mode (?codemode=false): search + invoke by RhysSullivan · Pull Request #1127 · RhysSullivan/executor

RhysSullivan · 2026-06-25T07:26:42Z

What

By default the MCP server runs in code mode: one execute tool the model writes TypeScript against, discovering connections through tools.search() / tools.describe.tool() and calling them inside a sandbox. Some clients can't drive a code sandbox and want to discover and call tools through plain MCP tool calls.

This adds ?codemode=false, which switches a session into non-code mode: instead of execute, it exposes two meta-tools, search and invoke.

search({ query, limit?, offset? }) ranks over the whole catalog and returns a bounded page, each hit carrying its own input schema so it can be called directly.
invoke({ name, arguments? }) runs a tool by name, reusing the same resolve / invoke / pause / resume path as code mode.

Code mode stays the default, so existing clients are unaffected.

Why search + invoke, not a tool dump

The obvious reading of the flag (dump every tool as an individual MCP tool, like Cloudflare's ?codemode=false) does not scale. The full Microsoft Graph connection alone is ~16,575 tools / ~640 MB of self-contained inlined schema. The server builds that in under a second, but the payload itself is the wall:

it exceeds the runtime's memory budget, and
no client can receive and validate a catalog that large in one tools/list (Codex does not paginate tools/list at all; the spec's cursor pagination only helps clients that loop on nextCursor).

search + invoke is the lazy-loading shape: the client pulls the handful of tools it needs rather than the whole catalog, so it works for any client and any catalog size.

How

New engine seam searchTools (bounded, ranked, paginated; reuses the existing discovery ranking and enriches only the returned page with schemas). The dump-only listTools seam is removed.
The non-code MCP host registers low-level ListTools / CallTools handlers advertising search / invoke (+ resume when an approval pauses); code mode keeps the high-level execute path.
A resumed invoke returns the tool's own result, unwrapped from the ToolResult envelope, the same shape it returns without pausing.

Evidence

codemode-off (e2e/scenarios/mcp-codemode-off.test.ts): a non-code session advertises search/invoke (not execute, not a dumped catalog); search finds a seeded connection's tools; invoke runs one and returns its real result; a second case drives an approval-gated tool through invoke → pause → approve → resume and asserts the resumed result is the unwrapped tool result. Green on self-host and the workerd Durable Object.
codemode-scale (e2e/scenarios/mcp-codemode-scale.test.ts): the full 16,575-tool Graph catalog is searched (bounded page) and invoked, with trace assertions that the catalog is never dumped, each invoke dispatches exactly once, and a single invoke neither searches nor rebuilds the catalog. Green on cloud.

Full gates pass: format:check, lint (0/0), typecheck (41/41), plus the touched packages' unit tests.

By default the MCP server runs in code mode: it advertises a single `execute` tool and the model writes TypeScript that calls `tools.search`, `tools.describe.tool`, and the connection tools. That keeps the tool list tiny, but clients that do lazy tool loading expect every tool enumerated directly so they can fetch schemas on demand. This adds a `?codemode=false` query parameter that switches the session into transparent mode: instead of `execute`, the server lists every directly-callable tool (connection tools plus the static core and plugin tools) with its own input schema, and routes `tools/call` straight to a single-tool invoke. Code mode stays the default. Threading: - New engine seams `listTools` / `invokeTool` / `invokeToolWithPause` alongside the existing code-execution methods, carried through every host (cloud, cloudflare, local, self-host) and the usage decorator. - The MCP host registers low-level `ListTools` / `CallTools` handlers in transparent mode and keeps the high-level `registerTool` path for code mode; the session reads the flag off the connection URL. A normalizer stamps `type: "object"` onto any advertised input schema whose root lacks one (a union-root tool such as add-server otherwise compiles to `anyOf` with no top-level type, which makes the MCP client reject the whole tools/list response). Covered by a cross-target e2e scenario that seeds an OpenAPI connection, opens a transparent session, asserts the tools are dumped directly, and makes a verifiable direct core-tool call. Green on self-host and on the workerd Durable Object path.

cloudflare-workers-and-pages · 2026-06-25T07:28:07Z

Deploying with Cloudflare Workers

The latest updates on your project. Learn more about integrating Git with Workers.

Status	Name	Latest Commit	Preview URL	Updated (UTC)
✅ Deployment successful! View logs	executor-marketing	`175edf5`	Commit Preview URL Branch Preview URL	Jun 25 2026, 05:53 PM

cloudflare-workers-and-pages · 2026-06-25T07:28:46Z

Deploying with Cloudflare Workers

The latest updates on your project. Learn more about integrating Git with Workers.

Status	Name	Latest Commit	Updated (UTC)
✅ Deployment successful! View logs	executor-cloud	`175edf5`	Jun 25 2026, 05:54 PM

github-actions · 2026-06-25T07:29:35Z

Cloudflare preview


Console	https://executor-preview-pr-1127.executor-e2e.workers.dev
MCP	`https://executor-preview-pr-1127.executor-e2e.workers.dev/mcp`
Deployed commit	`175edf5`

Sign-in is Cloudflare Access (one-time PIN to an allowed email). The preview has its own database and encryption key; it is destroyed when this PR closes.

pkg-pr-new · 2026-06-25T07:32:31Z

Open in StackBlitz

@executor-js/cli

npm i https://pkg.pr.new/@executor-js/cli@1127

@executor-js/config

npm i https://pkg.pr.new/@executor-js/config@1127

@executor-js/execution

npm i https://pkg.pr.new/@executor-js/execution@1127

@executor-js/sdk

npm i https://pkg.pr.new/@executor-js/sdk@1127

@executor-js/codemode-core

npm i https://pkg.pr.new/@executor-js/codemode-core@1127

@executor-js/runtime-quickjs

npm i https://pkg.pr.new/@executor-js/runtime-quickjs@1127

@executor-js/plugin-file-secrets

npm i https://pkg.pr.new/@executor-js/plugin-file-secrets@1127

@executor-js/plugin-graphql

npm i https://pkg.pr.new/@executor-js/plugin-graphql@1127

@executor-js/plugin-keychain

npm i https://pkg.pr.new/@executor-js/plugin-keychain@1127

@executor-js/plugin-mcp

npm i https://pkg.pr.new/@executor-js/plugin-mcp@1127

@executor-js/plugin-onepassword

npm i https://pkg.pr.new/@executor-js/plugin-onepassword@1127

@executor-js/plugin-openapi

npm i https://pkg.pr.new/@executor-js/plugin-openapi@1127

executor

npm i https://pkg.pr.new/executor@1127

commit: 175edf5

A direct tool call in transparent mode unwraps the tool's `ToolResult` envelope (renders `data` natively, sets `isError` on failures). The `resume` path is shared with code mode and formatted every resumed completion with the code-mode `execute` envelope, so a transparent-mode tool that paused for approval and then resumed came back wrapped in `{ status, result, logs }` instead of the tool's own result, unlike the same tool when it did not pause. Pick the resume completion formatter by session mode: a paused execution can only have originated from the tool this session registered (`execute` in code mode, a direct single-tool invoke in transparent mode), so format the resumed completion the same way that origin tool formats a non-paused completion. Covered by a second case in the codemode-off scenario that drives the approval-gated `policies.create` through pause, approve, and resume in a transparent session and asserts the resumed structured content is the policy itself, not the execute envelope. Green on self-host and workerd; the assertion fails against the pre-fix formatter.

`?codemode=false` previously dumped every tool into one `tools/list`. That does not scale: the full Microsoft Graph connection is ~16.5k tools and ~640 MB of self-contained schema, which no client can load in a single response and which exceeds the runtime's memory budget. The server builds it fast, but the payload itself is the wall, and no client (Codex does not paginate tools/list; the spec's cursor pagination only helps clients that do) can usefully receive a catalog that large. Switch non-code mode to a fixed two-tool surface instead: - `search({ query, limit?, offset? })` ranks over the whole catalog and returns only a bounded page, each hit carrying its own input schema so it can be called directly. - `invoke({ name, arguments? })` runs a tool by name, reusing the same resolve/invoke/pause/resume path (the resumed result stays unwrapped). This is the lazy-loading shape: the client pulls the handful of tools it needs rather than the whole catalog, so it works for any client and any catalog size. It is essentially code mode's own search/invoke primitives exposed as flat MCP tools instead of behind the `execute` sandbox. Engine: add a bounded `searchTools` seam (reuses the existing discovery ranking, enriches the page with schemas) and drop the now-unused `listTools` seam that backed the dump. Covered end to end: - codemode-off: a non-code session advertises search/invoke (not execute, not a dumped catalog); search finds a seeded connection's tools; invoke runs one and returns its real result; the pause/resume shape guard still holds. Green on self-host and the workerd DO. - codemode-scale: the full 16.5k Graph catalog is searched (bounded page) and invoked, with trace assertions that the catalog is never dumped, each invoke dispatches once, and a single invoke neither searches nor rebuilds the catalog. Green on cloud.

The non-code mode no longer dumps the catalog, so nothing enumerates tools with their schemas anymore. Revert `tools.list`'s `includeSchemas` branch (the bulk self-contained-schema enrichment) to the original projected-only listing, drop the `ToolListFilter.includeSchemas` field, and fold the now-single-use `ToolListing` type into `ToolSearchResult`. Also refresh the doc comments that still described the dump.

RhysSullivan added 2 commits June 25, 2026 00:51

RhysSullivan changed the title ~~Add transparent MCP connection mode (?codemode=false)~~ Add non-code MCP mode (?codemode=false): search + invoke Jun 25, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add non-code MCP mode (?codemode=false): search + invoke#1127

Add non-code MCP mode (?codemode=false): search + invoke#1127
RhysSullivan wants to merge 4 commits into
mainfrom
claude/kind-elion-73a9fb

RhysSullivan commented Jun 25, 2026 •

edited

Loading

Uh oh!

cloudflare-workers-and-pages Bot commented Jun 25, 2026 •

edited

Loading

Uh oh!

cloudflare-workers-and-pages Bot commented Jun 25, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 25, 2026 •

edited

Loading

Uh oh!

pkg-pr-new Bot commented Jun 25, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

RhysSullivan commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Why search + invoke, not a tool dump

How

Evidence

Uh oh!

cloudflare-workers-and-pages Bot commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying with Cloudflare Workers

Uh oh!

cloudflare-workers-and-pages Bot commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying with Cloudflare Workers

Uh oh!

github-actions Bot commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Cloudflare preview

Uh oh!

pkg-pr-new Bot commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

RhysSullivan commented Jun 25, 2026 •

edited

Loading

cloudflare-workers-and-pages Bot commented Jun 25, 2026 •

edited

Loading

cloudflare-workers-and-pages Bot commented Jun 25, 2026 •

edited

Loading

github-actions Bot commented Jun 25, 2026 •

edited

Loading

pkg-pr-new Bot commented Jun 25, 2026 •

edited

Loading