Changelog
What's new in ZenSearch
Recent features, integrations, and improvements, grouped by month. Entries link to deeper write-ups where available.
May 2026
- Feature
Agentic default chat — every question reasons across your sources
Default chat is no longer a single-shot search. Every question now runs a budget-tiered agent loop over your connected sources — knowledge base, databases, the web, and read tools like Jira, Slack, and GitHub — and cross-checks them instead of tunneling into one. Pick the depth per message: Chat (knowledge base only), Auto (agentic default), or Research (full planning + approval-gated write tools).
- Feature
Agents take action in your connected systems
Beyond reading, agents can now act: post and update Slack messages, react, pin, and DM; create and update Salesforce and SAP records; open and update ServiceNow changes and incidents; create HubSpot records, associations, and notes; log Jira worklogs and set assignees; comment on Confluence; and open, review, and merge GitHub pull requests. Every write is risk-tier gated and pauses for approval, and automation-authored writes carry a transparency footnote.
- Feature
Multi-agent delegation
An agent can hand a subtask to a specialist drawn from your team's delegatable automations and fold the result back into its answer — up to two levels deep. A research- or automation-tier parent can even delegate an approval-gated write: the parent pauses for a single approval, then resumes the specialist to perform it. Per-run fan-out is bounded so a run can't spawn unbounded sub-agents.
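The depth and fan-out limits described above could be enforced with a guard along these lines. This is a minimal sketch, not ZenSearch's implementation: the names `RunState`, `can_delegate`, and `delegate` are hypothetical, the fan-out value of 8 is an invented placeholder, and it assumes "two levels deep" means the top-level agent sits at depth 0 with specialists at depths 1 and 2.

```python
from dataclasses import dataclass

MAX_DELEGATION_DEPTH = 2   # from the changelog: "up to two levels deep"
MAX_FAN_OUT_PER_RUN = 8    # hypothetical value; the actual bound is not published


@dataclass
class RunState:
    """Tracks how many sub-agents a single run has spawned so far."""
    spawned: int = 0


def can_delegate(parent_depth: int, run: RunState) -> bool:
    """Return True if a parent at `parent_depth` may spawn one more specialist.

    Depth 0 is the top-level agent, so its specialists sit at depth 1 and
    their specialists at depth 2, the deepest level allowed here.
    """
    if parent_depth + 1 > MAX_DELEGATION_DEPTH:
        return False
    if run.spawned >= MAX_FAN_OUT_PER_RUN:
        return False
    return True


def delegate(parent_depth: int, run: RunState) -> int:
    """Record a delegation and return the child's depth, or raise if refused."""
    if not can_delegate(parent_depth, run):
        raise RuntimeError("delegation refused: depth or fan-out limit reached")
    run.spawned += 1
    return parent_depth + 1
```

Keeping the counter on the run rather than on each agent means the fan-out bound holds across the whole delegation tree, not just per parent.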
- Feature
Ask the agent to build and manage your automations
In research mode you can describe an automation in plain language and the agent drafts the schedule, prompt, and tools, then proposes it for one-click approval — and can list, update, pause, or delete existing ones the same way. Off by default; a team admin opts the team in. Deletes are soft: an automation is hidden and stops firing while its run history is retained for audit.
- Improvement
Per-automation cost controls
Each automation can pin its own model, set per-run and per-day spend caps, and show a live cost estimate as you configure it. A per-automation cost dashboard charts recent spend so you can see exactly where the budget goes.
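As an illustration of how a per-run and a per-day cap might combine, here is a small sketch. The `SpendCaps` type and `may_spend` function are hypothetical names, and the rule that a step is allowed only if it fits under both caps is an assumption about how the two limits interact.

```python
from dataclasses import dataclass


@dataclass
class SpendCaps:
    """Spend limits for one automation, in US dollars."""
    per_run_usd: float
    per_day_usd: float


def may_spend(caps: SpendCaps, run_spent: float, day_spent: float,
              next_cost: float) -> bool:
    """Return True if `next_cost` fits under both the per-run and per-day cap."""
    within_run = run_spent + next_cost <= caps.per_run_usd
    within_day = day_spent + next_cost <= caps.per_day_usd
    return within_run and within_day
```

For example, with caps of $1 per run and $5 per day, a $0.25 step is allowed when the run has spent $0.50 and the day $4.00, but refused when the day has already reached $4.90.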
- Improvement
Inline agent reasoning, retries, and a context-window meter in chat
The chat transcript now renders the agent's reasoning, tool retries, and delegation hand-offs as they happen, plus a context-window usage meter so you can see how much of the model's memory a long conversation is using. Default-tier agent activity and per-turn latency now show up in Analytics.
- Integration
MCP: Streamable HTTP and stdio transports, plus resources and prompts
Connect Model Context Protocol servers over SSE or Streamable HTTP (both cloud-usable), or over stdio as a local subprocess for self-hosted deployments. Agents can now read MCP resources and prompts, not just call tools, and stdio server environments support per-key editing.
- Improvement
Org-aware approval routing and richer Slack link previews
Approvals route to the right person automatically — the on-call engineer for the relevant service (via PagerDuty / Opsgenie), then the requester's manager from your org chart, then team owners. And shared ZenSearch links now unfurl in Slack as permission-aware Work Objects with a per-viewer detail pane.
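The routing order above (on-call engineer, then the requester's manager, then team owners) is a first-match fallback chain. A minimal sketch of that pattern follows; the in-memory dictionaries stand in for PagerDuty / Opsgenie and the org chart, and every name here is hypothetical rather than part of the ZenSearch API.

```python
from typing import Callable, Optional, Sequence

# Each resolver takes a request and returns an approver ID, or None if it has no answer.
Resolver = Callable[[dict], Optional[str]]


def route_approval(request: dict, resolvers: Sequence[Resolver]) -> Optional[str]:
    """Try each resolver in priority order and return the first approver found."""
    for resolve in resolvers:
        approver = resolve(request)
        if approver:
            return approver
    return None


# Hypothetical lookups standing in for the on-call service and the org chart.
ON_CALL = {"payments": "alice"}
MANAGERS = {"bob": "carol"}
TEAM_OWNERS = {"search": "dave"}


def on_call_engineer(req: dict) -> Optional[str]:
    return ON_CALL.get(req.get("service"))


def requester_manager(req: dict) -> Optional[str]:
    return MANAGERS.get(req.get("requester"))


def team_owner(req: dict) -> Optional[str]:
    return TEAM_OWNERS.get(req.get("team"))


CHAIN = [on_call_engineer, requester_manager, team_owner]
```

Calling `route_approval({"service": "payments", "requester": "bob", "team": "search"}, CHAIN)` picks the on-call engineer; if the service has no on-call entry, the same request falls through to the manager and then to the team owner.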
April 2026
- Feature
Agent cost control: pre-flight ceilings and soft-limit pause
Agent runs now go through a five-layer budget system: complexity-aware tiers, per-sub-task wall-clock timeouts, a pre-flight dollar ceiling that rejects expensive plans before any tokens are spent, mid-run cost enforcement, and a soft-limit pause that saves a resumable checkpoint with a 7-day TTL when any budget is exhausted.
Read the write-up
- Feature
Procedural memory — self-improving agent workflows
After successful multi-tool sessions (≥5 tool calls, ≥0.7 synthesis confidence), agents now distill the workflow into a reusable procedure and progressively disclose it to future runs via the new view_procedure tool.
Read the write-up
- Feature
Observational memory for long conversations
An LLM-based observer extracts key findings, tool usage, and pending work from long sessions into a compact summary. It typically achieves 80%+ token compression on conversations of 50+ messages, and the summary is injected as a cacheable system-prompt prefix.
Read the write-up
- Improvement
Recovery recipes for transient tool failures
Timeouts, rate limits, and upstream 5xx errors are now auto-retried once before escalation. Retries don't count against tool-call budgets. Opt out via AGENT_RECOVERY_ENABLED=false.
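The retry-once behaviour described above might look roughly like this. It is a sketch under stated assumptions: `TransientToolError`, `call_with_recovery`, and `FlakyTool` are hypothetical names, the budget is modelled as a plain dictionary, and the way `AGENT_RECOVERY_ENABLED` is parsed (anything other than "false" leaves recovery on) is a guess.

```python
import os


class TransientToolError(Exception):
    """Stand-in for a timeout, rate limit, or upstream 5xx response."""


def recovery_enabled() -> bool:
    # Assumed parsing: only the literal value "false" turns recovery off.
    return os.environ.get("AGENT_RECOVERY_ENABLED", "true").strip().lower() != "false"


def call_with_recovery(tool, budget: dict, enabled: bool = True):
    """Call `tool()`, retrying once on a transient error.

    Only the first attempt is charged to the tool-call budget; the retry is free.
    """
    budget["tool_calls"] += 1      # charge the initial attempt
    try:
        return tool()
    except TransientToolError:
        if not enabled:
            raise                  # recovery disabled: escalate immediately
        return tool()              # single retry; a second failure propagates


class FlakyTool:
    """Demo tool that fails `failures` times before succeeding."""

    def __init__(self, failures: int):
        self.failures = failures
        self.calls = 0

    def __call__(self):
        self.calls += 1
        if self.calls <= self.failures:
            raise TransientToolError("simulated timeout")
        return "ok"
```

With `FlakyTool(failures=1)` the call succeeds on the retry and the budget is charged once; with `failures=2` the second error propagates to the caller, which corresponds to escalation.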
- Improvement
Live cost meter in the chat UI
A cost_update SSE event now streams every few iterations, so users see real dollars ticking up against the per-run cap while the agent is still running. The cadence is configurable via AGENT_COST_UPDATE_INTERVAL.
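For readers building against the stream, a cost_update frame could be produced roughly as follows. The frame layout (`event:` and `data:` lines ending in a blank line) is standard Server-Sent Events; the payload field names `spent_usd` and `cap_usd`, the default interval of 3, and the function names are assumptions, since the changelog does not specify them.

```python
import json


def format_sse(event: str, payload: dict) -> str:
    """Serialize one Server-Sent Event frame: event name plus JSON data."""
    return f"event: {event}\ndata: {json.dumps(payload)}\n\n"


def cost_updates(iteration_costs, cap_usd: float, interval: int = 3):
    """Yield a cost_update frame every `interval` agent iterations.

    `interval` plays the role of AGENT_COST_UPDATE_INTERVAL; the default
    and the payload fields are illustrative assumptions.
    """
    spent = 0.0
    for i, cost in enumerate(iteration_costs, start=1):
        spent += cost
        if i % interval == 0:
            yield format_sse(
                "cost_update",
                {"spent_usd": round(spent, 4), "cap_usd": cap_usd},
            )
```

Six iterations at an interval of 3 produce two frames, each carrying the running total rather than the per-iteration delta, so a client that misses a frame can still render the correct amount.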
March 2026
- Integration
Google Workspace agent tools (18)
Gmail, Calendar, Drive, Docs, and Sheets — read and write. OAuth 2.0 or Google Service Account with domain-wide delegation. Agents now act as the end user, with per-user OAuth tokens.
Read the write-up
- Integration
Microsoft 365 agent tools (33+)
OneDrive, Outlook, Teams, SharePoint, Planner, plus Graph-powered org-chart and calendar free-busy. OAuth through Azure AD with per-scope configuration.
Read the write-up
- Integration
Zendesk, Airtable, and Notion agent tools
Zendesk (14 tools — ticket CRUD, help-center articles, users, organisations), Airtable (8 — bases, records, comments), Notion (12 — pages, databases, blocks, comments). All via per-user OAuth where supported.
- Feature
Custom webhook tools and MCP server support
Admins can register internal HTTP endpoints as agent tools (name, URL, method, auth, parameter schema — encrypted credentials). For third-party tool servers, ZenSearch speaks Model Context Protocol (MCP) as a client — register an MCP server and its tools appear in the team's tool catalog.
- Feature
Slack and Microsoft Teams assistant surfaces
Native assistants in both chat apps: Block Kit answers and approval cards in Slack, Adaptive Cards and threaded replies in Teams. Cross-surface continuity — the same conversation follows the user between web, Slack, and Teams.
Read the write-up
- Integration
Model Gateway providers: Groq, OpenRouter, Azure AI Foundry, Amazon Bedrock
Every AI call routes through a central gateway. Swap providers with one env var: Groq for speed, OpenRouter for unified billing across 100+ upstream models, Azure for per-tenant OpenAI, Bedrock for AWS-native Claude and Nova. Ollama still works via an OpenAI-compatible endpoint for fully local deployment.
Read the write-up
- Improvement
Prompt caching on chat calls
Anthropic cache_control ephemeral markers get a 90% discount on cached reads; OpenAI and Groq prefix caching get ~50%. Tracked as cached_input_tokens and cache_creation_input_tokens on the usage ledger.
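To see what those discounts mean in dollars, the arithmetic can be sketched as below. The function name and the $4 per million input tokens used in the example are hypothetical; the sketch covers only discounted cached reads and does not model how cache-creation tokens are billed.

```python
def input_cost_usd(uncached_tokens: int, cached_tokens: int,
                   price_per_mtok: float, cached_discount: float) -> float:
    """Cost of input tokens when `cached_tokens` are billed at a discount.

    `cached_discount` is a fraction: 0.9 for a 90% discount, 0.5 for 50%.
    """
    full = uncached_tokens * price_per_mtok / 1_000_000
    cached = cached_tokens * price_per_mtok * (1 - cached_discount) / 1_000_000
    return full + cached
```

At a hypothetical $4 per million input tokens, a 1M-token prompt with 750k tokens served from cache costs about $1.30 at a 90% discount and $2.50 at a 50% discount, against $4.00 with no caching.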
- Improvement
Multimodal chat: mixed text + image content
ContentParts on chat messages lets agents pass image inputs (file uploads, extracted document images) to vision-capable models. Token estimation accounts for ~765 tokens per image part.
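A token estimate over mixed content might be computed along these lines. Only the 765-tokens-per-image figure comes from the entry above; the part shape (`type` and `text` keys), the function name, and the four-characters-per-token heuristic for text are assumptions made for illustration.

```python
TOKENS_PER_IMAGE = 765  # per-image estimate quoted in the changelog entry


def estimate_tokens(parts: list) -> int:
    """Estimate prompt tokens for a message made of text and image parts."""
    total = 0
    for part in parts:
        if part["type"] == "image":
            total += TOKENS_PER_IMAGE
        else:
            # Rough heuristic (assumption): about 4 characters per token.
            total += max(1, len(part["text"]) // 4)
    return total
```

A message with 40 characters of text and two images comes out at 10 + 2 × 765 = 1,540 estimated tokens, which shows how quickly images dominate the budget.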
- Feature
Incremental sync intelligence
Content-hash dedup short-circuits unchanged documents before they enter the parse pipeline. A PageChangeDetector handles web-crawl delta. A DeletionDetector uses set-diff against each connector's seen source IDs to find removals, with a 50% safety threshold that refuses to act on suspiciously large deletion sets.
Read the write-up
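The set-diff and safety threshold in the incremental sync entry above can be sketched as follows. This is an illustration, not the DeletionDetector itself: the function and exception names are hypothetical, and whether the real check refuses at exactly 50% or only above it is not stated, so this version refuses only when removals exceed the threshold.

```python
class SuspiciousDeletionError(Exception):
    """Raised when a sync would delete an implausibly large share of documents."""


def detect_deletions(indexed_ids: set, seen_ids: set,
                     max_ratio: float = 0.5) -> set:
    """Return IDs that are indexed but were not seen in the latest sync.

    Refuses to act (raises) when the removals exceed `max_ratio` of the
    index, which more likely signals a connector failure than real deletions.
    """
    missing = indexed_ids - seen_ids          # set difference finds removals
    if indexed_ids and len(missing) / len(indexed_ids) > max_ratio:
        raise SuspiciousDeletionError(
            f"{len(missing)} of {len(indexed_ids)} documents missing; refusing to delete"
        )
    return missing
```

If a connector returns only one of four previously indexed documents, the check raises instead of deleting three quarters of the index; a single missing document passes through as a normal deletion.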
February 2026
- Feature
Chrome browser extension with side-panel chat
AI assistant in a browser side panel, aware of the current page context. Keyboard shortcuts, session persistence, and identity federation with the web app.
- Feature
Canvas artifacts — versioned, persistent AI outputs
Agents produce structured outputs (code, reports, plans) into a side panel with diff view, version history, and user editing. Content types: markdown, code (Python, Go, TypeScript, etc.), JSON.
- Improvement
Answer confidence scoring and citation grounding
Composite confidence score (0.0–1.0) from coverage ratio, shape confidence, and violation-free signals; surfaced as High/Medium/Low confidence in the chat UI. Post-synthesis verification semantically checks citations against sources and corrects misattributions.
January 2026
- Integration
PostgreSQL, MySQL, ClickHouse, SQL Server via NL-to-SQL
Natural-language-to-SQL over all four databases. Schema discovery, read-only query execution, and automatic error correction via LLM-friendly error messages.
Read the write-up
- Feature
Document-level RBAC synced from source systems
Permission metadata imported from Confluence, SharePoint, Google Drive, and identity providers on every sync. Applied as a query-time filter — restricted documents never enter the ranking pipeline.
Read the write-up
- Feature
Guardrails — input and output validation
Prompt injection and PII detection on inputs; hallucination (lexical/semantic/hybrid), toxicity, and relevance checks on outputs. Configurable per team with hot-reload.
Read the write-up