VIBEPASS, a new benchmark, reveals a fundamental weakness in modern AI coding assistants: even with near-perfect scores on code generation tasks, frontier models falter when it comes to finding and fixing subtle bugs…
AI agents that rely on web search are vulnerable to “well poisoning” attacks, where adversaries publish fabricated but authoritative-sounding content designed to be retrieved during search. Think “AI Slop” for agents. Our research…
AI agents are increasingly used today to automate complex enterprise tasks, ranging from customer service interactions to sophisticated data analysis and workflow management. Their importance lies in their ability to drive significant efficiency…
In the rapidly accelerating world of Artificial Intelligence, the bridge between academic theory and industry application is more vital than ever. At Salesforce AI Research, we believe that the most transformative breakthroughs happen…
Introducing MoiraiAgent Accurate forecasting is the backbone of strategic decision-making in everything from global finance to climate science. In an enterprise setting, it powers the early warning systems for telemetry, optimizes inventory through…
The era of software engineering agents is underway. Benchmarks and real-world usage (e.g., tools like Cursor and Claude Code) illustrate that LLMs can be incredibly effective at writing code for real-world use-cases. Over…
Why Forecasting Matters—and How to Unlock More from Foundation Models Forecasting is critical to how many large organizations, including Salesforce, manage their global cloud infrastructure.. Reliable projections of compute, storage, usage, and cost…