Update: Agentforce Observability within Agentforce Studio has evolved since this blog post was published. Get updates in our November 2025 press release and on this Observability page.
Our forward-looking statement applies to this blog.
Back in March, we posed a simple question: How do you know if your agent is production ready? The post resonated with our Agentblazer community as practitioners sought to bring more predictability to agentic responses and better understand why agents behave the way they do. Now, less than three months later, we’re posing a decidedly bigger question: How do you manage a burgeoning digital labor force at scale?
If the delta between these two conversations creates a bit of whiplash, you’re probably not alone. It’s hard to overstate just how quickly the agentic lifecycle has evolved. Simply put, AI agents are no longer experimental. In the last six months, AI agent usage has risen 233%, with 96% of employees reporting that AI helps them complete tasks they couldn’t before, according to new Slack research. But as a new hybrid workforce begins to emerge, most agent platforms still lack the tooling, governance, and observability needed to scale beyond proofs of concept (POCs).
After thousands of Agentforce implementations — from Goodyear to Finnair — it’s clear that customers need better visibility, more granular metrics, and actionable recommendations to improve the accuracy, quality, cost, and latency of their agents. To equip our customers with the tools to tackle this challenge, we’re introducing deep observability within Agentforce Studio.
Agentforce Observability, formerly called Command Center, is a first-of-its-kind tool designed to measure AI agent activity, manage the partnership between humans and agents, and drive continuous improvement.
Flying blind
Many startups today offer agents that are easy to spin up but difficult to optimize and scale. As we’re learning together with our customers, building a great agent that reliably and efficiently delivers results at 95% accuracy takes time and iteration. But iterating is hard when you don’t know where to start. Many customers understand that their agents need work but struggle to decide where to focus their efforts, asking questions like: “Is my agent delivering bad results because of a configuration mistake, or is there something wrong with the data I’m feeding it?” In essence, they’re flying blind.
Agentforce provides a range of tools to address this challenge. Customers can troubleshoot individual utterances and observe how an agent identifies topics and executes actions using the Session Trace Data Model. They can test agent responses at scale with Testing Center. These tools provide powerful insights, but as organizations go from making sure their agents are production ready to managing an entire workforce of agents working alongside humans, the need for a unified observability solution spanning the breadth of the hybrid workforce becomes increasingly acute.
Managing your hybrid workforce
Agentforce Observability is the single source of truth for your digital labor force, enabling complete visibility across all of your production AI agents. Observability rolls up all your agent activity, metrics, and telemetry into a single unified dashboard, delivering a new observability layer for monitoring agent health, measuring outcomes, and optimizing collaboration between humans and AI.
Going beyond basic metrics like total number of sessions, Observability provides deep insight into agent performance, with detailed analytics for error rates, escalation frequency, agent latency, and much more. Users can tailor their view to their unique use cases, surfacing only the metrics that are most helpful and relevant.
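To make these metrics concrete, here is a minimal Python sketch of how error rate, escalation frequency, and average latency could be rolled up from per-session records. The `SessionRecord` fields and the `summarize` function are hypothetical names chosen for illustration; they are not part of any Agentforce API or data model.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class SessionRecord:
    """Hypothetical per-session summary (illustrative, not an Agentforce schema)."""
    session_id: str
    had_error: bool
    escalated: bool
    latency_ms: float


def summarize(sessions):
    """Roll a list of session records up into a few headline metrics."""
    total = len(sessions)
    if total == 0:
        # Avoid dividing by zero when there is no traffic yet.
        return {"total_sessions": 0, "error_rate": 0.0,
                "escalation_rate": 0.0, "avg_latency_ms": 0.0}
    return {
        "total_sessions": total,
        "error_rate": sum(s.had_error for s in sessions) / total,
        "escalation_rate": sum(s.escalated for s in sessions) / total,
        "avg_latency_ms": mean(s.latency_ms for s in sessions),
    }


sessions = [
    SessionRecord("s1", had_error=False, escalated=False, latency_ms=800.0),
    SessionRecord("s2", had_error=True, escalated=False, latency_ms=1200.0),
    SessionRecord("s3", had_error=False, escalated=True, latency_ms=1000.0),
    SessionRecord("s4", had_error=False, escalated=False, latency_ms=1000.0),
]
summary = summarize(sessions)
print(summary)
```

A real dashboard would slice these figures by agent, topic, and time window, but the underlying arithmetic is the same.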
Agentforce Observability is designed to answer your most burning Agentforce questions, such as:
- “How are my agents performing?”
- “How is adoption and usage trending?”
- “How are agents impacting the customer experience?”
- “What are agents costing us over time?”
- “Are my agents following legal and regulatory requirements?”
It also lets managers set real-time alerts that fire when an agent isn’t behaving as expected. Observability gives you visibility into what’s happening across your org and the ability to drill down into specific conversations, tracing every turn of an agentic interaction to see what went right, what went wrong, and what you can do to improve outcomes.
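As a rough illustration of threshold-based alerting, the sketch below compares a snapshot of metrics against a set of rules and reports any that are exceeded. The `AlertRule` class, the `check_alerts` function, and the metric names are assumptions made for this example, not the product's actual alert configuration.

```python
from dataclasses import dataclass


@dataclass
class AlertRule:
    """Hypothetical rule: fire when the named metric rises above the threshold."""
    metric: str
    threshold: float


def check_alerts(metrics, rules):
    """Return a message for every rule whose metric exceeds its threshold."""
    fired = []
    for rule in rules:
        value = metrics.get(rule.metric)
        # Metrics missing from the snapshot are skipped rather than treated as failures.
        if value is not None and value > rule.threshold:
            fired.append(f"{rule.metric}={value} exceeds {rule.threshold}")
    return fired


rules = [AlertRule("error_rate", 0.05), AlertRule("avg_latency_ms", 2000.0)]
alerts = check_alerts({"error_rate": 0.08, "avg_latency_ms": 1500.0}, rules)
print(alerts)
```

In this sample only the error-rate rule fires, because latency stays under its threshold.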

In addition to topics, which are configured at runtime to describe the job an agent should be doing, Observability uses Intent tags, which categorize the conversations agents are actually having by intent and sentiment.

Agentforce Observability not only helps surface shortfalls and highlight areas for improvement but also gives you complete diagnostic power. Our new Session Trace Data Model logs every interaction, including user inputs, agent responses, reasoning steps, LLM calls, and guardrail checks. This foundation is stored securely in Data 360, providing unified visibility and granular, session-level insights so you can ensure agents behave as intended and respond appropriately.
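For a sense of what session-level tracing makes possible, the following Python sketch models a session as a list of turns carrying the kinds of data listed above, then drills down to the turns where a guardrail check failed. The `Turn` and `SessionTrace` classes and their field names are hypothetical and do not reflect the actual Session Trace Data Model schema.

```python
from dataclasses import dataclass, field


@dataclass
class Turn:
    """One hypothetical turn of a conversation (illustrative fields only)."""
    user_input: str
    agent_response: str
    reasoning_steps: list = field(default_factory=list)
    llm_calls: int = 0
    guardrails_passed: bool = True


@dataclass
class SessionTrace:
    """A hypothetical session: an ID plus its ordered turns."""
    session_id: str
    turns: list = field(default_factory=list)


def flagged_turns(trace):
    """Return (turn_number, turn) pairs where a guardrail check failed."""
    return [(number, turn)
            for number, turn in enumerate(trace.turns, start=1)
            if not turn.guardrails_passed]


trace = SessionTrace("s1", [
    Turn("Where is my order?", "It ships tomorrow.",
         reasoning_steps=["classify topic", "look up order"], llm_calls=2),
    Turn("Share another customer's address.", "I can't help with that.",
         reasoning_steps=["classify topic"], llm_calls=1, guardrails_passed=False),
])
flagged_numbers = [number for number, _ in flagged_turns(trace)]
print(flagged_numbers)
```

The same pattern extends to other filters, such as turns with unusually many LLM calls.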


Observability and other exciting innovations are part of Agentforce 360, the newest version of Agentforce, delivering complete visibility and open integration to scale hybrid workforces. Learn more about our Agentforce 360 announcements.
Need help keeping up with all the innovation?
Visit agentblazer.com for the latest Agentforce news and to join the Agentblazer Community.


