
What are Large Language Models (LLMs)?

Large language models (LLMs) are AI tools trained on vast amounts of data to understand and generate text, automating key business tasks.

Large language models vs small language models

Large language models process massive amounts of data to solve complex reasoning challenges across broad domains. Small language models, by contrast, use fewer parameters to run specialized tasks on local devices.

| Evaluation criteria | Large language models | Small language models |
| --- | --- | --- |
| Scope of use | Solves complex reasoning challenges and generates creative content across broad domains | Handles specialized, targeted tasks with high accuracy and speed |
| Infrastructure | Uses robust cloud computing to support extensive processing requirements | Operates efficiently on local devices, edge servers, or minimal cloud setups |
| Core strengths | Manages ambiguous prompts and multifaceted conversations | Excels at predictable, repetitive functions with low latency |
| Architecture | Leverages billions or trillions of parameters to capture deep contextual nuance | Runs quickly with fewer parameters to produce immediate outputs |
| Adaptability | Provides broad foundational knowledge that applies to nearly any scenario | Updates quickly to incorporate new data or changing project requirements |

Large Language Models (LLMs) FAQs

What are large language models?
Large Language Models (LLMs) are a type of artificial intelligence model trained on vast amounts of text data, enabling them to understand, generate, and process human language.

How do LLMs work?
LLMs use deep learning architectures, particularly transformers, to identify patterns, grammar, and context within massive datasets, allowing them to predict the next word in a sequence.
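To make the "predict the next word" objective concrete, here is a deliberately tiny sketch. It is not a transformer: it simply counts which word follows which in a toy corpus (a bigram model) and predicts the most frequent continuation. The corpus and helper names are illustrative only.

```python
from collections import Counter, defaultdict

# Toy corpus; a real LLM trains on billions of words, not one sentence.
corpus = "the cat sat on the mat the cat ate the fish".split()

# Count, for each word, which words follow it and how often.
next_word_counts = defaultdict(Counter)
for current, following in zip(corpus, corpus[1:]):
    next_word_counts[current][following] += 1

def predict_next(word):
    """Return the most frequent word seen after `word`, or None if unseen."""
    counts = next_word_counts[word]
    return counts.most_common(1)[0][0] if counts else None

print(predict_next("the"))  # "cat" follows "the" more often than "mat" or "fish"
```

A transformer replaces the raw counts with learned parameters and attends to the whole preceding context rather than just one word, but the training objective it optimizes is this same next-token prediction.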

What can LLMs do?
Key capabilities include text generation, summarization, translation, question answering, content creation, and code generation, often based on a given prompt.

How do LLMs learn?
LLMs learn through pre-training on enormous collections of text, followed by fine-tuning on more specific datasets to adapt them for particular tasks.
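The pre-train-then-fine-tune recipe can be sketched with a toy stand-in for the model: a two-weight logistic classifier trained by plain gradient descent. The datasets and the `train` helper are hypothetical; the point is only that fine-tuning continues training the *same* weights on a smaller, task-specific dataset rather than starting from scratch.

```python
import math

def train(weights, data, lr=0.1, epochs=200):
    """Plain gradient-descent loop; `data` is a list of (features, label)."""
    for _ in range(epochs):
        for x, y in data:
            z = sum(w * xi for w, xi in zip(weights, x))
            p = 1 / (1 + math.exp(-z))          # sigmoid prediction
            grad = p - y                        # gradient of the log loss
            weights = [w - lr * grad * xi for w, xi in zip(weights, x)]
    return weights

# Phase 1: "pre-train" on a broad, generic dataset.
generic_data = [([1.0, 0.0], 1), ([0.0, 1.0], 0)]
weights = train([0.0, 0.0], generic_data)

# Phase 2: "fine-tune" -- keep training the SAME weights on a smaller,
# task-specific dataset, nudging the model toward the new task.
task_data = [([1.0, 1.0], 1)]
weights = train(weights, task_data, epochs=50)
```

Real fine-tuning works the same way in spirit: the pre-trained parameters are the starting point, and a comparatively small task dataset shifts them, which is far cheaper than repeating pre-training.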

What are the benefits of LLMs?
Benefits include automating content creation, enhancing customer service (chatbots), improving data analysis, personalizing communications, and accelerating research.

What are common applications of LLMs?
Applications range from writing articles and emails to powering intelligent chatbots, generating creative content, assisting with programming, and summarizing lengthy documents.

What challenges do LLMs face?
Challenges include the potential for "hallucinations" (generating false information), ensuring factual accuracy, addressing biases inherited from training data, and managing computational costs.