A new approach to enterprise graphical user interface (GUI) automation learns from one human demonstration and replays workflows with deterministic precision— delivering the reliability and privacy that enterprise operations demand. Ask any veteran…
Introduction Reinforcement Learning from Human or AI Feedback (RLHF, RLAIF) has become the standard recipe for aligning large language models (LLMs). But as we push into the agentic era — where models call…
VIBEPASS, a new benchmark, reveals a fundamental weakness in modern AI coding assistants: even with near-perfect scores on code generation tasks, frontier models falter when it comes to finding and fixing subtle bugs…
AI agents that rely on web search are vulnerable to “well poisoning” attacks, where adversaries publish fabricated but authoritative-sounding content designed to be retrieved during search. Think “AI Slop” for agents. Our research…
AI agents are increasingly used today to automate complex enterprise tasks, ranging from customer service interactions to sophisticated data analysis and workflow management. Their importance lies in their ability to drive significant efficiency…
In the rapidly accelerating world of Artificial Intelligence, the bridge between academic theory and industry application is more vital than ever. At Salesforce AI Research, we believe that the most transformative breakthroughs happen…