Few-Shot Learning with LLMs: In-Context Learning Explained

One of the key mechanisms that lets large language models translate, summarize, and solve complex tasks on demand is Few-Shot Learning — an approach in which a model picks up a new task after being shown only a few examples of how to solve it.

In LLMs, few-shot learning usually takes the form of In-Context Learning, where the model learns directly while interacting with the user, using the provided examples as context. This approach eliminates the need for separate fine-tuning, making work with LLMs more flexible and interactive.

Key Takeaways

  • In-context prompts let models adapt quickly without retraining.
  • Zero-shot works for general queries; complex tasks need example-driven prompts.
  • Transfer and meta techniques make rapid adaptation viable at scale.
  • Good task design reduces overfitting and boosts real-world performance.

What is Few-Shot Learning?

Few-shot learning is a machine learning method in which a model learns to perform a task from only a small number of labeled examples. Unlike traditional training on large datasets, few-shot approaches teach the model the correct behavior from just a handful of demonstrations.

In the context of LLMs, few-shot learning is often implemented through in-context learning (ICL). The user provides a few examples of the task in a query, and the model uses them to understand the expected response pattern or type. For example, to classify the sentiment of a text, you can provide 3–5 sentences labeled “positive” or “negative,” after which the model can classify a new sentence by analogy with these examples. Few-shot learning occupies an intermediate position between two approaches:

  • Zero-Shot Learning – the model performs the task without any examples, relying solely on instructions.
  • Few-Shot Learning – the model receives a few examples, which helps improve accuracy and reliability.
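The sentiment example above can be sketched as a small prompt-building helper. This is a minimal illustration, not a specific provider's API: the instruction wording, the "Sentence:/Sentiment:" layout, and the demo sentences are all illustrative assumptions.

```python
# Pack labeled demonstrations into a few-shot prompt (illustrative format).
def build_few_shot_prompt(examples, query):
    """examples: list of (text, label) pairs shown to the model as context."""
    lines = ["Classify the sentiment of each sentence as positive or negative.", ""]
    for text, label in examples:
        lines.append(f"Sentence: {text}")
        lines.append(f"Sentiment: {label}")
        lines.append("")
    lines.append(f"Sentence: {query}")
    lines.append("Sentiment:")  # the model is expected to complete this line
    return "\n".join(lines)

demo = [
    ("I loved this movie.", "positive"),
    ("The service was terrible.", "negative"),
    ("What a wonderful day!", "positive"),
]
prompt = build_few_shot_prompt(demo, "The food was bland and cold.")
print(prompt)
```

With an empty `examples` list the same helper produces a zero-shot prompt, and with a single pair a one-shot prompt, which mirrors the spectrum described above.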

Few-Shot Learning vs One-Shot and Zero-Shot Learning

| Approach | Context Examples | Description | Advantages | Disadvantages |
| --- | --- | --- | --- | --- |
| Zero-Shot Learning | None | The model performs a task based solely on instructions, without any examples. | No need to prepare examples; fast to apply. | Lower accuracy; the model may misinterpret the task. |
| One-Shot Learning | One example | The model uses in-context learning (ICL) with a single example to demonstrate the task. | Provides better context than zero-shot; slightly improved accuracy. | One example may not be representative; sensitive to example selection. |
| Few-Shot Learning | 2–10+ examples | The model receives several examples in the context to learn the task without retraining. | Higher accuracy and generalization; better control through example selection. | Requires careful selection of examples; too many can overload the context. |

In-Context Learning with LLMs: How Prompted Examples Drive Model Use

In-context learning is an LLM mechanism that allows the model to perform tasks using examples provided directly in the user’s query. The model does not adjust its weights; instead, it learns on the fly by drawing on the few-shot examples in its context.

When a user provides several examples for a task, such as text translation or emotion classification in sentences, the model analyzes these examples and generates new responses in the correct format. The model’s performance depends largely on the selection of examples: choosing relevant and diverse examples helps it better understand the task. The ICL mechanism allows for different training approaches:

  • Zero-Shot – the model receives only instructions without examples.
  • One-Shot – the model receives one example in context.
  • Few-Shot – the model receives a few examples in context, which improves the accuracy and generalizability of the results.

Thanks to ICL, language models can quickly adapt to new tasks without separate fine-tuning, leveraging the power of carefully selected prompt examples.


N-Way-K-Shot Episodes: Support and Query Sets Explained

N-Way refers to the number of classes or categories in a task. For example, if the task involves classifying emotions in a text and you need to distinguish between three emotions (joy, sadness, and anger), this would be a 3-way task.

K-Shot refers to the number of examples per class in the support set. If 5 examples are provided for each emotion, this is 5-shot learning.

In an N-Way-K-Shot scenario, examples are divided into two sets:

  • Support Set – contains a few-shot set of examples that the model uses for in-context learning (ICL). These are demonstration examples that show the model what the correct answer looks like for each class.
  • Query Set – contains new examples that the model must classify or process based on information from the support set. The results on the query set allow us to assess how well the model has learned the task from the provided examples.

The key point in N-Way-K-Shot learning lies in the selection of examples for the support set: representative and diverse examples critically affect the model’s accuracy on the query set. Proper organization of the support and query sets enables effective testing of models in zero-shot and few-shot scenarios, assessing their ability to generalize and quickly adapt to new tasks.
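The episode structure above can be sketched as a sampling routine. This is a toy sketch: the `emotions` dataset and the split sizes are illustrative assumptions, and in practice the support examples would go into the prompt while the query examples are held out for evaluation.

```python
import random

def sample_episode(dataset, n_way, k_shot, q_queries, seed=0):
    """Sample an N-way-K-shot episode: k_shot support examples and
    q_queries query examples per class, drawn without overlap.
    dataset: dict mapping class label -> list of examples."""
    rng = random.Random(seed)
    classes = rng.sample(sorted(dataset), n_way)  # pick the N classes
    support, query = [], []
    for label in classes:
        items = rng.sample(dataset[label], k_shot + q_queries)
        support += [(x, label) for x in items[:k_shot]]   # shown in context
        query   += [(x, label) for x in items[k_shot:]]   # held out for evaluation
    return support, query

emotions = {
    "joy":     ["I got the job!", "Best day ever.", "So happy right now.",
                "This made me smile.", "Pure bliss."],
    "sadness": ["I miss her so much.", "Everything feels grey.", "I cried all night.",
                "Such a heavy loss.", "I feel empty."],
    "anger":   ["This is outrageous!", "I'm furious.", "How dare they!",
                "Absolutely unacceptable.", "They lied again."],
}
support, query = sample_episode(emotions, n_way=3, k_shot=2, q_queries=1)
print(len(support), len(query))  # 6 support examples, 3 query examples
```

Sampling support and query from the same class pool without overlap is what makes the query-set accuracy a fair measure of generalization rather than memorization.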

Meta Learning Methods: Learning to Learn Across Tasks

| Approach | Description | How Used with LLMs | Advantages | Disadvantages |
| --- | --- | --- | --- | --- |
| Model-Agnostic Meta-Learning (MAML) | The model learns to quickly adapt to new tasks by minimizing errors across multiple tasks during training. | Can be combined with in-context learning (ICL), where the LLM uses few-shot examples for fast generalization. | Fast adaptation; task-agnostic; flexible. | Requires complex fine-tuning; sensitive to example selection. |
| Metric-Based Meta-Learning | The model learns to compare new examples with already-seen examples (the support set) using similarity metrics. | The LLM can select the most relevant few-shot examples in context to generate correct outputs. | Simple to implement; works well for N-Way-K-Shot tasks. | Limited generalization on complex tasks; depends on the quality of the metric. |
| Memory-Based Meta-Learning | The model stores information about previous tasks in “memory” to use for new tasks. | The LLM can integrate previously prompted examples as context for new queries. | Good for sequential tasks; fast adaptation without global retraining. | Requires additional memory resources; harder to scale. |
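The metric-based idea in the table can be illustrated with a tiny example-selection routine. Token-overlap (Jaccard) similarity here is a deliberately simple stand-in for the learned embedding metrics used in practice; the review snippets are made up for the demo.

```python
def jaccard(a, b):
    """Token-overlap similarity; a stand-in for a learned embedding metric."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    return len(sa & sb) / len(sa | sb) if sa | sb else 0.0

def select_examples(pool, query, k):
    """Pick the k pool examples most similar to the query for the prompt."""
    return sorted(pool, key=lambda ex: jaccard(ex[0], query), reverse=True)[:k]

pool = [
    ("The battery life is amazing.", "positive"),
    ("The battery died within an hour.", "negative"),
    ("Great camera and screen.", "positive"),
    ("Shipping took forever.", "negative"),
]
chosen = select_examples(pool, "How long does the battery last?", k=2)
print([text for text, _ in chosen])
```

Because both battery-related reviews share tokens with the query, they outrank the unrelated ones, giving the model in-context examples that are actually relevant to the new input.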

Data Strategies: Augmentation, Synthetic Data, and Generative Models

  • Data augmentation. Augmentation involves creating variations of existing data without altering its essence. This may include rephrasing sentences, changing word order, adding noise, or using synonyms. Using augmented examples in context helps the model better recognize various formats of few-shot examples and increases the model’s robustness to new examples.
  • Synthetic data. Synthetic data is generated automatically using algorithms or pre-trained models. It can supplement real, prompted examples, providing additional examples for example selection in the support set. Synthetic data enables the model to learn patterns more quickly and perform tasks when real data is limited.
  • Generative models. Generative models can independently create new examples for training or testing. Using generative models enables the creation of a rich set of few-shot examples and experimentation with various scenarios without additional data collection.
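The augmentation strategy above can be sketched with a minimal synonym-substitution pass. The synonym table here is a toy assumption; in practice it might come from WordNet or a paraphrase model.

```python
import random

# Toy synonym table; a real pipeline would use a larger lexical resource.
SYNONYMS = {"great": ["excellent", "fantastic"], "bad": ["poor", "awful"]}

def augment(sentence, rng):
    """Create a variant by swapping known words for a random synonym."""
    words = [rng.choice(SYNONYMS[w]) if w in SYNONYMS else w
             for w in sentence.split()]
    return " ".join(words)

rng = random.Random(42)
original = "the food was great but the coffee was bad"
variants = {augment(original, rng) for _ in range(10)}
print(sorted(variants))
```

Each variant keeps the sentence's meaning and label while changing its surface form, which is exactly what makes augmented examples useful as extra few-shot demonstrations.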

FAQ

What is Few-Shot Learning?

Few-Shot Learning is a method in which LLMs learn to perform tasks using only a small number of examples. It improves performance compared to zero-shot approaches by providing context within the prompt.

How does In-Context Learning (ICL) work?

ICL allows a model to learn from prompted examples in the input without changing its weights. The model uses these examples to generalize and produce correct outputs for new queries.

What is the difference between Zero-Shot and Few-Shot Learning?

In zero-shot, the model receives only instructions with no examples, while in few-shot, it uses a few examples in the prompt. Few-shot generally yields higher accuracy due to the contextual guidance.

Why is Example Selection important in ICL?

Careful selection of examples ensures that the support examples are representative and diverse. This significantly affects the model’s ability to generalize and correctly solve new queries.

What is One-Shot Learning?

One-Shot Learning is a type of in-context learning (ICL) where the model receives only a single example for each class. It provides minimal context, improving accuracy slightly over zero-shot, but may still be limited by the quality of the example.

What are N-Way-K-Shot Episodes?

N-Way-K-Shot episodes structure few-shot examples into a support set (for context) and a query set (for evaluation). N refers to the number of classes, and K is the number of examples per class.

What is the purpose of the Support Set?

The support set contains the few-shot examples that the model uses for in-context learning (ICL). It provides guidance on handling the classes or tasks that appear in the query set.

How do Meta-Learning Methods help in LLMs?

Meta-learning teaches models to “learn to learn” across tasks, improving adaptation with a minimal number of few-shot examples. It enhances ICL by leveraging knowledge from prior tasks to accelerate generalization.

What are common data strategies for Few-Shot Learning?

Strategies include data augmentation, synthetic data, and generative models, which create or modify few-shot examples to improve model performance. These strategies support better example selection and richer in-context learning.

Why are Generative Models useful for Few-Shot Learning?

Generative models can produce additional prompted examples for the support set. This expands the diversity of few-shot examples, helping the model generalize better during in-context learning (ICL).