Salesforce AI Research will present 21 accepted papers at ICLR 2026, the Fourteenth International Conference on Learning Representations. The conference runs April 23–27 at the Riocentro Convention and Event Center in Rio de…
As codebases grow to millions of lines of code, can AI agents still understand, reason, and code effectively? LoCoBench-Agent delivers the answer: a comprehensive benchmark for evaluating AI coding assistants across contexts ranging…
Our 10th annual State of Marketing Report surveyed 4,450 marketers worldwide — here's what the data said: SMBs can outpace the competition with agentic AI.
A new approach to enterprise graphical user interface (GUI) automation learns from one human demonstration and replays workflows with deterministic precision, delivering the reliability and privacy that enterprise operations demand. Ask any veteran…
Reinforcement Learning from Human or AI Feedback (RLHF, RLAIF) has become the standard recipe for aligning large language models (LLMs). But as we push into the agentic era, where models call…
VIBEPASS, a new benchmark, reveals a fundamental weakness in modern AI coding assistants: even with near-perfect scores on code generation tasks, frontier models falter when it comes to finding and fixing subtle bugs…