The era of software engineering agents is underway. Benchmarks and real-world usage (e.g., tools like Cursor and Claude Code) illustrate that LLMs can be incredibly effective at writing code for real-world use-cases. Over…
Why Forecasting Matters—and How to Unlock More from Foundation Models Forecasting is critical to how many large organizations, including Salesforce, manage their global cloud infrastructure.. Reliable projections of compute, storage, usage, and cost…
As 2025 comes to a close, I’m struck by a paradox: the AI industry has never been more capable—yet the discourse has never been more confused. The loudest debates right now center on…
Imagine the not-too-distant future where AI agents routinely negotiate on our behalf—personal assistants handling returns with retailers, procurement agents negotiating with suppliers, healthcare advocates coordinating with billing departments. Whether it’s a consumer’s personal…
What is FINDAP? The FINDAP Framework is a cutting-edge approach to fine-tune large language models (LLMs) specifically for the finance industry. While LLMs like ChatGPT or Bard are excellent general-purpose tools, specialized domains…
Large language models (LLMs) have become foundational to AI code understanding and generation, powering a wide range of enterprise AI workflows — from software synthesis to automated reasoning over symbolic sequences. Despite this…
In the world of AI agents that click, scroll, execute and automate — we’re moving fast from “just understand text” to “actually use software for you.” The new benchmark SCUBA tackles exactly that:…