Changelog

What's new in ZenSearch

Recent features, integrations, and improvements — grouped by month. Link out to the deeper write-ups where we have them.

April 2026

  • Feature

    Agent cost control: pre-flight ceilings and soft-limit pause

    Agent runs now go through a five-layer budget system: complexity-aware tiers, per-sub-task wall-clock timeouts, a pre-flight dollar ceiling that rejects expensive plans before any tokens are spent, mid-run cost enforcement, and a soft-limit pause that saves a resumable checkpoint with a 7-day TTL when any budget exhausts.

    Read the write-up
  • Feature

    Procedural memory — self-improving agent workflows

    After successful multi-tool sessions (≥5 tool calls, ≥0.7 synthesis confidence), agents now distill the workflow into a reusable procedure and progressively disclose it to future runs via the new view_procedure tool.

    Read the write-up
  • Feature

    Observational memory for long conversations

An LLM-based observer extracts key findings, tool usage, and pending work from long sessions into a compact summary. Typically 80%+ token compression on 50+ message conversations; injected as a cacheable system-prompt prefix.

    Read the write-up
  • Improvement

    Recovery recipes for transient tool failures

    Timeouts, rate limits, and upstream 5xx errors are now auto-retried once before escalation. Retries don't count against tool-call budgets. Opt out via AGENT_RECOVERY_ENABLED=false.

  • Improvement

    Live cost meter in the chat UI

A cost_update SSE event now streams every few iterations, so users see real dollars ticking up against the per-run cap while the agent is still running. Cadence is configurable via AGENT_COST_UPDATE_INTERVAL.
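The pre-flight ceiling and soft-limit pause from the budget-system entry above can be sketched as follows. This is a minimal illustration, not the ZenSearch implementation — `RunBudget`, `preflight_check`, and `record_spend` are hypothetical names:

```python
from dataclasses import dataclass

@dataclass
class RunBudget:
    dollar_ceiling: float   # pre-flight cap: plans above this are rejected outright
    soft_limit: float       # pause-and-checkpoint threshold
    spent: float = 0.0

def preflight_check(budget: RunBudget, estimated_cost: float) -> None:
    """Reject an expensive plan before any tokens are spent."""
    if estimated_cost > budget.dollar_ceiling:
        raise ValueError(
            f"plan estimated at ${estimated_cost:.2f} exceeds "
            f"${budget.dollar_ceiling:.2f} ceiling"
        )

def record_spend(budget: RunBudget, cost: float) -> str:
    """Mid-run enforcement: 'ok', 'pause' (save a resumable checkpoint), or raise."""
    budget.spent += cost
    if budget.spent >= budget.dollar_ceiling:
        raise RuntimeError("budget exhausted")
    if budget.spent >= budget.soft_limit:
        return "pause"   # caller checkpoints the run (7-day TTL) instead of failing
    return "ok"
```

The real system layers these checks with complexity-aware tiers and per-sub-task timeouts; the sketch shows only the dollar-budget path.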
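The recovery-recipe behavior for transient failures amounts to a retry-once wrapper. A sketch under assumed names (`TransientToolError` and `call_with_recovery` are illustrative, not the ZenSearch API):

```python
class TransientToolError(Exception):
    """Timeout, rate limit, or upstream 5xx (hypothetical error type)."""

def call_with_recovery(tool_call, max_retries: int = 1):
    """Retry a transient tool failure once before escalating.

    The retry is 'free': it is not counted against the run's
    tool-call budget.
    """
    attempt = 0
    while True:
        try:
            return tool_call()
        except TransientToolError:
            if attempt >= max_retries:
                raise  # escalate to the agent's normal error handling
            attempt += 1
```

Permanent errors (bad arguments, auth failures) would not be wrapped this way — only the transient classes listed in the entry above.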
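For the live cost meter, a cost_update event in standard Server-Sent Events framing might look like this — the payload field names are assumptions for illustration:

```python
import json

def cost_update_event(run_id: str, spent: float, cap: float) -> str:
    """Render one cost_update frame in SSE wire format (sketch)."""
    data = json.dumps({"run_id": run_id, "spent_usd": spent, "cap_usd": cap})
    return f"event: cost_update\ndata: {data}\n\n"
```

A browser `EventSource` listener subscribed to `cost_update` can then update the meter as frames arrive.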

March 2026

  • Integration

    Google Workspace agent tools (18)

    Gmail, Calendar, Drive, Docs, and Sheets — read and write. OAuth 2.0 or Google Service Account with domain-wide delegation. Agents now act as the end user, with per-user OAuth tokens.

    Read the write-up
  • Integration

    Microsoft 365 agent tools (33+)

    OneDrive, Outlook, Teams, SharePoint, Planner, plus Graph-powered org-chart and calendar free-busy. OAuth through Azure AD with per-scope configuration.

    Read the write-up
  • Integration

    Zendesk, Airtable, and Notion agent tools

Zendesk (14 tools — ticket CRUD, help-center articles, users, organizations), Airtable (8 — bases, records, comments), Notion (12 — pages, databases, blocks, comments). All via per-user OAuth where supported.

  • Feature

    Custom webhook tools and MCP server support

    Admins can register internal HTTP endpoints as agent tools (name, URL, method, auth, parameter schema — encrypted credentials). For third-party tool servers, ZenSearch speaks Model Context Protocol (MCP) as a client — register an MCP server and its tools appear in the team's tool catalog.

  • Feature

    Slack and Microsoft Teams assistant surfaces

    Native assistants in both chat apps: Block Kit answers and approval cards in Slack, Adaptive Cards and threaded replies in Teams. Cross-surface continuity — the same conversation follows the user between web, Slack, and Teams.

    Read the write-up
  • Integration

    Model Gateway providers: Groq, OpenRouter, Azure AI Foundry, Amazon Bedrock

    Every AI call routes through a central gateway. Swap providers with one env var — Groq for speed, OpenRouter for unified billing across 100+ upstream models, Azure for per-tenant OpenAI, Bedrock for AWS-native Claude and Nova. Ollama still works via OpenAI-compatible endpoint for fully local deployment.

    Read the write-up
  • Improvement

    Prompt caching on chat calls

    Anthropic cache_control ephemeral markers get a 90% discount on cached reads; OpenAI and Groq prefix caching get ~50%. Tracked as cached_input_tokens and cache_creation_input_tokens on the usage ledger.

  • Improvement

    Multimodal chat: mixed text + image content

A ContentParts field on chat messages lets agents pass image inputs (file uploads, extracted document images) to vision-capable models. Token estimation accounts for ~765 tokens per image part.

  • Feature

    Incremental sync intelligence

    Content-hash dedup short-circuits unchanged documents before they enter the parse pipeline. A PageChangeDetector handles web-crawl delta. A DeletionDetector uses set-diff against each connector's seen source IDs to find removals, with a 50% safety threshold that refuses to act on suspiciously large deletion sets.

    Read the write-up
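A webhook tool registration from the custom-tools entry above might carry a shape like the following. The field names and endpoint are illustrative, not the ZenSearch admin API:

```python
# Hypothetical registration payload for an internal HTTP endpoint as an
# agent tool; credentials are stored encrypted and referenced indirectly.
webhook_tool = {
    "name": "lookup_order",
    "url": "https://internal.example.com/orders/lookup",
    "method": "POST",
    "auth": {"type": "bearer", "credential_ref": "secret://tools/lookup_order"},
    "parameters": {  # JSON Schema for the tool's arguments
        "type": "object",
        "properties": {"order_id": {"type": "string"}},
        "required": ["order_id"],
    },
}
```

An MCP server registration is simpler still — the server advertises its own tool list, so only the server address and auth need configuring.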
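The prompt-caching entry above refers to Anthropic's cache_control markers; the request shape looks roughly like this (model name and prompt text are placeholders):

```python
# Sketch of an Anthropic Messages API request with an ephemeral
# cache_control marker on the stable system-prompt block, so subsequent
# calls read it from cache at the discounted rate.
request = {
    "model": "<model-id>",
    "system": [
        {
            "type": "text",
            "text": "<long, stable system prompt>",
            "cache_control": {"type": "ephemeral"},
        }
    ],
    "messages": [{"role": "user", "content": "..."}],
}
```

OpenAI and Groq need no explicit marker — their prefix caching applies automatically when the leading tokens of a request repeat.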
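The set-diff-plus-safety-threshold logic in the DeletionDetector entry above can be sketched in a few lines. Function and parameter names are illustrative:

```python
def detect_deletions(seen_before: set[str], seen_now: set[str],
                     safety_threshold: float = 0.5) -> set[str]:
    """Return source IDs that disappeared since the last sync.

    Refuses to act when the deletion set is suspiciously large, since a
    huge diff usually means a failed crawl rather than real deletions.
    """
    missing = seen_before - seen_now
    if seen_before and len(missing) / len(seen_before) > safety_threshold:
        raise RuntimeError(
            f"{len(missing)}/{len(seen_before)} sources missing; "
            "refusing to delete"
        )
    return missing
```

Below the threshold, the returned IDs feed the normal deletion path; above it, the sync flags the connector for review instead.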

February 2026

  • Feature

    Chrome browser extension with side-panel chat

    AI assistant in a browser side panel, aware of the current page context. Keyboard shortcuts, session persistence, and identity federation with the web app.

  • Feature

    Canvas artifacts — versioned, persistent AI outputs

    Agents produce structured outputs (code, reports, plans) into a side panel with diff view, version history, and user editing. Content types: markdown, code (Python, Go, TypeScript, etc.), JSON.

  • Improvement

    Answer confidence scoring and citation grounding

    Composite confidence score (0.0–1.0) from coverage ratio, shape confidence, and violation-free signals; surfaced as High/Medium/Low confidence in the chat UI. Post-synthesis verification semantically checks citations against sources and corrects misattributions.
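The composite score from the confidence entry above combines three signals into a 0.0–1.0 value and buckets it for the UI. The weights and bucket boundaries here are illustrative assumptions, not the shipped values:

```python
def confidence_label(coverage: float, shape: float, violation_free: float) -> str:
    """Weighted blend of the three signals, bucketed to High/Medium/Low.

    Inputs are each in [0.0, 1.0]; weights below are assumed for
    illustration only.
    """
    score = 0.5 * coverage + 0.3 * shape + 0.2 * violation_free
    if score >= 0.75:
        return "High"
    if score >= 0.45:
        return "Medium"
    return "Low"
```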

January 2026

  • Integration

    PostgreSQL, MySQL, ClickHouse, SQL Server via NL-to-SQL

    Natural-language-to-SQL over all four databases. Schema discovery, read-only query execution, and automatic error correction via LLM-friendly error messages.

    Read the write-up
  • Feature

    Document-level RBAC synced from source systems

    Permission metadata imported from Confluence, SharePoint, Google Drive, and identity providers on every sync. Applied as a query-time filter — restricted documents never enter the ranking pipeline.

    Read the write-up
  • Feature

    Guardrails — input and output validation

    Prompt injection and PII detection on inputs; hallucination (lexical/semantic/hybrid), toxicity, and relevance checks on outputs. Configurable per team with hot-reload.

    Read the write-up
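The query-time RBAC filter described above is, in essence, a permission check applied before any document reaches ranking. A sketch with an assumed document shape (`allowed` as a set of principals, empty meaning unrestricted):

```python
def rbac_filter(candidates: list[dict], user_principals: set[str]) -> list[dict]:
    """Drop documents the user cannot read before ranking sees them.

    The {'id', 'allowed'} document shape and empty-set-means-public
    convention are illustrative assumptions.
    """
    return [
        doc for doc in candidates
        if not doc["allowed"] or doc["allowed"] & user_principals
    ]
```

Because the filter runs before ranking rather than after, restricted documents cannot leak through snippets, citations, or result counts.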
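To make the guardrails entry concrete, here is a toy input check. The real detectors are model-based; these two regexes only illustrate the kind of signal an input guardrail flags:

```python
import re

def check_input(text: str) -> list[str]:
    """Return guardrail findings for a user input (toy sketch).

    Patterns below are deliberately naive stand-ins for model-based
    prompt-injection and PII detection.
    """
    findings = []
    if re.search(r"ignore (all )?previous instructions", text, re.IGNORECASE):
        findings.append("prompt_injection")
    if re.search(r"\b\d{3}-\d{2}-\d{4}\b", text):  # US SSN-shaped number
        findings.append("pii")
    return findings
```

Output-side checks (hallucination, toxicity, relevance) run the same way in the opposite direction, against the model's answer and its cited sources.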