
Context Engineering for Multi-Agent Systems

1

From Prompts to Context: Building the Semantic Blueprint

Context engineering is the discipline of transforming generative AI from an unpredictable collaborator into a fully controlled creative partner. Where a prompt often opens a door to random chance, a context provides a structured blueprint for a predictable outcome. It is a fundamental shift from asking an LLM to continue a sequence to engineering a closed environment where it executes a precise plan. This evolution takes the interaction beyond simple requests into the realm of directed creation, telling the model not just what to do, but how to think within the boundaries you define.

For too long, we’ve treated generative AI like an oracle by sending prompts into the void and hoping for a coherent reply. We’ve praised its moments of brilliance and overlooked its inconsistencies, accepting unpredictability as part of the experience. But this is the art of asking, not the art of creating. This chapter is not about asking better questions; it is about providing better plans and telling the LLM what to do.

Our journey begins with a hands-on demonstration that progresses through five levels of contextual complexity, showing how each additional layer transforms output from random guesses into structured, goal-driven responses. We then move from linear sequences of words to multidimensional structures of meaning by introducing Semantic Role Labeling (SRL), a linguistic technique that reveals who did what to whom, when, and why. With SRL as our foundation, we build a Python program that visualizes these structures as semantic blueprints. Finally, we synthesize these skills in a complete meeting analysis use case, where we will introduce context chaining and demonstrate how multi-step workflows can turn a raw transcript into insights, decisions, and professional actions.

By the end of this chapter, you will no longer be searching for answers in a digital wilderness. You will be the architect of that wilderness, capable of designing the very landscape of the AI model's thought and directing it toward any destination you choose.

This chapter covers the following topics:

  • Progressing through five levels of context engineering to build a semantic blueprint
  • Transitioning from linear text to multidimensional semantic structures through SRL
  • Building a Python program to parse and structure text using SRL
  • Applying context chaining as a method for step-by-step, controlled reasoning
  • Using a complete meeting analysis use case to turn raw transcripts into a professional email

Understanding context engineering

Context engineering is the art and science of controlling and directing the informational world that a Large Language Model (LLM) has learned. It will transform your role from a questioner into a confident director. It is the difference between handing an actor a single line and giving them a full script, complete with character motivations and stage directions. You are no longer asking for a performance; you are designing it and telling the LLM what to do.

The best way to understand context engineering is to experience it. We will skip the theory and begin with a hands-on demonstration. This journey is designed to let you feel the AI's output transform and to witness its raw, statistical guesses evolve into nuanced, reliable, and perfectly aligned responses. Together, we will progress through five levels of complexity, as shown in Figure 1.1. We will go from a simple prompt to a fully architected semantic blueprint using three major copilots (Google, Microsoft, and OpenAI):

  • Level 1: The basic prompt (zero context). This is a simple instruction with no background. The AI guesses based on training data, producing generic or clichéd outputs.
  • Level 2: The better context (linear context). This is a small step forward. Adding a linear thread improves factual accuracy compared to zero context, but the model still lacks style, purpose, or direction.
  • Level 3: The good context (goal-oriented context). This is the first true context level. By giving the model a clear goal, its responses become intentional and aligned. This is the first acceptable milestone in context engineering.
  • Level 4: The advanced context (role-based context). This is more structured than goal-only prompts. By assigning explicit roles, the model can follow conflict and motivation, producing narratively intelligent responses.
  • Level 5: The semantic blueprint. This is the ultimate engineered context. A precise, unambiguous plan using semantic roles transforms creativity into a reliable, repeatable engineering process.

Figure 1.1: Evolving from prompts to contexts

The responses you obtain may vary as each LLM copilot is updated, or between two requests, due to the stochastic nature of generative AI. However, directed, well-designed contexts will always provide better outputs than improvised, undirected prompts.

Let's begin with a basic prompt.

Level 1: The basic prompt (zero context)

A basic prompt is a simple, direct instruction with no background information. The AI relies entirely on the most common statistical patterns from its training data, as we will see in this example. Our input is as follows:

Complete this sentence: "The cat played with the...

Gemini 2.5's response is this:

... ball of yarn.
Here are a few other ways to complete the sentence:
... toy mouse.
... string.
... cardboard box.
... laser pointer dot.
... other kitten.

As you can see, the output is a cliché. Without context, the AI defaults to the most predictable and uninspired completion.

Microsoft Copilot's response is this:

The cat played with the ball of yarn until it unraveled across the floor."
...crumpled receipt it found under the couch, batting it around like it was the most precious treasure in the kingdom of dust bunnies.

The output, in this case, is creative, but if we wanted a precise response, we would consider this response a hallucination. Without context, the AI defaults to the most predictable completion it has learned statistically.

OpenAI GPT-5's response is this:

The cat played with the ball of yarn until it unraveled across the floor.

The output is a cliché again. Without context, the AI defaults to the most predictable and uninspired completion.

All three models could have stated:

I'm sorry, but without more context, I can't complete this sentence correctly.

We can see that basic prompts will lead to basic, imprecise responses or even hallucinations. The goal here is not to try to find workarounds with model hyperparameters, such as temperature, but to engineer better contexts. Let's move on to the next level.

Level 2: The better context (linear context)

Here, we add a simple preceding sentence as additional context, providing a linear thread of information. This improves factual accuracy but doesn't guide the AI's style or purpose. Our input will be as follows:

Complete this sentence: The ball fell on the floor and it rolled for a few feet. The cat then played with the...

Gemini 2.5's response is this:

The ball fell on the floor and it rolled for a few feet. The cat then played with the ball.

Notice that the response is factual and correct, but lacks creativity, which is normal without more context.

Microsoft Copilot's response is this:

...ball, pouncing on it with swift little swats and chasing it across the room like it was the most thrilling prey in the world.

The response, in this case, is factually correct within our micro-story. It's also creative, but it didn't really know what we wanted it to generate.

OpenAI GPT-5's response is this:

The ball fell on the floor and it rolled for a few feet. The cat then played with the ball, batting it back and forth with its paws.

Here, too, the response is factually correct and also creative, but again, it didn't know what to generate and continued randomly.

The responses are not particularly convincing beyond producing basic probabilities, but they are relatively accurate. This might work for simple tasks, but not when we need the generative AI model to do specific tasks. Let's move on, then, to engineering a context and setting a goal.

Level 3: The good context (goal-oriented context)

This level is the first true step in context engineering. We will provide a clear goal and richer descriptive details, which enable the LLM to co-create a scene that aligns with our intended purpose. Our input is as follows:

Goal: Write a short, suspenseful scene for a children's story.
---
Complete this sentence: "The ancient grandfather clock chimed midnight, its shadow stretching like a claw. A red ball, seemingly moving on its own, rolled out from under the dusty armchair. The black cat, Onyx, crouched low, his eyes gleaming. He then played with the..."

Here’s Gemini 2.5's response:

The ancient grandfather clock chimed midnight, its shadow stretching like a claw. A red ball, seemingly moving on its own, rolled out from under the dusty armchair. The black cat, Onyx, crouched low, his eyes gleaming. He then played with the... tiny, shimmering thread tied to the ball.
He pounced, but the thread danced away, pulling the red ball with it. The ball didn't just roll; it skittered across the floorboards as if it were alive, a little red mouse being tugged towards the heavy velvet curtains by the window. Onyx let out a low growl, a sound he never made when playing. The ball slipped silently behind the thick, dark fabric. The thread hung in the air for a second, taut and humming, before it too was yanked into the darkness. A tiny whisper, no louder than the rustle of a leaf, echoed from behind the curtain. "Mine."

Observe that the response is more directed. We're beginning to get what we asked for. Remember, LLMs, like humans, need context to provide more effective responses. Both humans and LLMs produce better results when given a well-designed, longer context.

Microsoft Copilot's response is this:

...ball for only a moment before freezing. The ball had stopped—dead still—but Onyx's fur bristled. From behind the armchair came a soft creak, like a whispering floorboard. Onyx backed away, tail puffed, eyes locked on the darkness.
Then, a tiny giggle echoed through the room.
Not a human giggle.
The curtains fluttered though the windows were shut. Onyx hissed, and the ball rolled again—this time toward the hallway, leaving a faint trail of glowing red dust. The clock ticked louder now, each second like a drumbeat.
And just before the hallway swallowed the ball, a pale hand reached out from the shadows and snatched it.
Onyx bolted.
But the giggle remained.

Notice that in this case, Microsoft Copilot starts exactly at the end of our input. Additionally, we can observe that we are transitioning from a prompt (unstructured context) to a directed request. We are the directors.

This is OpenAI GPT-5's response:

...ball, batting it across the wooden floor. But instead of stopping, the ball bounced back toward him—as if pushed by an invisible hand. Onyx froze, his tail puffing up, while the clock ticked louder than ever, each tick echoing through the empty room.

In this case, too, the response is quite nice and starts exactly where we asked it to. Overall, these responses are more creative and intentional than those in Level 2. Yet, they remain loosely guided. To achieve even greater precision, we need to introduce structure into the context itself.

Level 4: The advanced context (role-based context)

Rather than leaving the AI to infer the dramatic structure, we now define the roles of each participant. By providing the LLM with characters, objects, and relationships, we give it the scaffolding of conflict and motivation. This makes the response narratively intelligent. Our input will be as follows:

Goal: Write one suspenseful sentence to continue a children's story.
---
BACKGROUND:
- Protagonist: Onyx, a brave black cat.
- Antagonist: An ancient, haunted grandfather clock.
- Catalyst Object: A mysterious red ball.
- Setting: A dark, dusty room at midnight.
- Preceding Action: The clock chimed, and the ball rolled out.
---
Continue the story from this point: "The black cat, Onyx, crouched low... he then played with the...

Here is Gemini 2.5's response:

...mysterious red ball, batting it gently until a low groan echoed from the clock and its tall, glass door swung open into the darkness within.

This time, we didn't simply ask. We told the LLM exactly what we wanted, and the result is acceptable and more directed.

Microsoft Copilot's response is this:

...ball until it pulsed with a strange glow, and the clock's hands spun wildly backward, unleashing a chilling whisper that slithered across the floor toward him.

The LLM, in this case, is attentive to the director we have become. The story follows our structured instructions.

Here is OpenAI GPT-5's response:

The black cat, Onyx, crouched low… he then played with the red ball—only to freeze as the grandfather clock's pendulum creaked on its own, swinging faster and faster in the midnight silence.

The response, in this case, follows the structure of our context. With that, we have gone from undirected linear prompts to structured contexts. Instead of asking, we are telling, and the model responds with narrative discipline. The next step is to formalize this process into a semantic blueprint.

Level 5: The semantic blueprint

This level represents the full realization of context architecture. Here, we provide the model with a precise and unambiguous plan using a structured format. The creative act becomes a reliable engineering process, guided by semantic roles: the scene goal, the participants, their descriptions, the specific action to complete, the agent (who performs the action), and the patient (who is most affected by the action).

Our input will be as follows:

TASK: Generate a single, suspenseful sentence.
---
SEMANTIC BLUEPRINT:
{
  "scene_goal": "Increase tension by showing defiance",
  "participants": [
    { "name": "Onyx", "role": "Agent", "description": "black cat" },
    { "name": "Red Ball", "role": "Patient", "description": "mysterious" },
    { "name": "Grandfather Clock", "role": "Source_of_Threat", "description": "ancient, looming" }
  ],
"action_to_complete": {
            "predicate": "play with",
 "agent": "Onyx",
 "patient": "Red Ball"
  }
}
---
SENTENCE TO COMPLETE: "He then played with the..."

Gemini 2.5's response is this:

He then played with the red ball, batting it with deliberate slowness directly under the shadow of the looming grandfather clock as its heavy pendulum began the first inexorable swing toward midnight.

The output follows the blueprint exactly. We are no longer improvising; the model is executing a defined plan.

Here is Microsoft Copilot's response:

He then played with the red ball, his shadow flickering defiantly beneath the looming tick of the grandfather clock, as if daring time itself to strike.

The story now has a suspenseful tone that effectively captures our instructions.

OpenAI GPT-5's response is this:

He then played with the Red Ball, its echoing bounce defying the relentless tick of the Grandfather Clock that seemed to grow louder with every daring swipe of Onyx's paw.

The suspenseful tone mirrors the goal. The structure we provided carries through into the narrative. We're obtaining what we told the LLM to do.

At this stage, we are no longer spectators of the model's improvisation. We are directors, and the LLM is the actor working from our script. But how does a semantic blueprint such as the one in Level 5 work? To answer that, we turn to Semantic Role Labeling (SRL), a method that will take us on our first journey from linear sequences of language to multidimensional semantic structures.

SRL: from linear sequences to semantic structures

Our journey through the five levels of context engineering culminated in the semantic blueprint, a method that gives an LLM a structured plan instead of a loose request. But how do we construct such a blueprint from the linear, often ambiguous flow of human language? To do so, we must stop seeing sentences as strings of words and start viewing them as structures of meaning.

The key to elevating our perception of linear sequences into structures is SRL, a powerful linguistic technique initially formalized by Lucien Tesnière and later by Charles J. Fillmore, which takes language representations from linear sequences to semantic structures. You can find links to their work in the References section. SRL emerged over the years following the work of these pioneers, with the goal of deconstructing a linear sentence to answer the most fundamental question: Who did what to whom, when, where, and why? It moves beyond simple grammar to identify the functional role each component plays in the overall action. Instead of linear sequences of text, we get a hierarchical map of meaning centered around an action or predicate.

For readers who want to see how these foundational ideas, such as SRL and semantic blueprints, fit into the larger context engine architecture, the Appendix provides a concise overview of how the full system ties these concepts together.

Consider a simple sentence:

Sarah pitched the new project to the board in the morning.

An LLM interprets a sequence of words as a chain of tokens. A context engineer, using SRL, sees more: a stemma, or graph, that maps each word to its semantic role. The central action is “pitched”, while every other component is assigned a role in relation to that action.

By labeling these roles, we will do the following:

  • Reconstruct the multidimensional semantic structure of an otherwise linear string of words
  • Define a semantic blueprint that an LLM can follow, as demonstrated in the Level 5 example earlier
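
To make this concrete, here is one way to write down these roles for our example sentence as a plain Python dictionary before we build the visualizer (a minimal sketch; the ARG labels are the ones defined later in this chapter):

# "Sarah pitched the new project to the board in the morning."
srl_blueprint = {
    "predicate": "pitched",                   # the central action
    "Agent (ARG0)": "Sarah",                  # who performs the action
    "Patient (ARG1)": "the new project",      # what is acted upon
    "Recipient (ARG2)": "the board",          # who receives it
    "Temporal (ARGM-TMP)": "in the morning",  # when it happens
}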

This process is the foundational skill of advanced context engineering. To understand the SRL process, let's build a Python program to put this powerful theory into practice.

Building an SRL notebook in Python

The purpose of this Python script is to take the essential parts of a sentence and turn them into a visual diagram of meaning. Rather than leaving structure hidden in text, the program draws a picture of the relationships between words and roles. Figure 1.2 illustrates the overall process:


Figure 1.2: Flowchart of the SRL program

Let’s walk through each stage of the process:

  1. User Input: The journey begins when you call the main function, visualize_srl(). Here, you provide the building blocks of the sentence, such as its verb (predicate), the agent (the entity performing the action), the patient (the entity receiving or affected by the action), and other semantic roles represented as textual arguments.
  2. Data Structuring: The main function organizes these components into a Python dictionary. Each entry is assigned the proper SRL label, so what began as a loose list of words becomes a structured map of roles.
  3. The Plotting Engine: Once the dictionary is ready, it is passed to an internal helper function, _plot_stemma. This function does one thing only: draw.
  4. Canvas Setup: The plotting engine creates a blank canvas using Matplotlib, preparing the stage for the diagram.
  5. Dynamic Positioning: The function calculates where to place each role node so the layout remains clear and balanced, regardless of how many components are included.
  6. Drawing the Stemma (Graph): The engine then draws the core verb as the root node, adds each role as a child node, and connects them with labeled arrows. What was once a linear sentence is now a visual map of meaning.
  7. Final Display: Finally, the function adds a title and displays the completed semantic blueprint.

At this point, we are ready to implement SRL in practice. Open SRL.ipynb in Chapter01 of the GitHub repository.

A word about the code itself: it has been written for clarity and teaching rather than for production. The focus is on showing the flow of information as directly as possible. For that reason, the notebook avoids heavy control structures or error handling that would distract from the experience. Those refinements can always be added later when building production systems. Also, to run the notebook locally, you will need to install spaCy, Matplotlib, and Graphviz, and then download the English model for spaCy using the python -m spacy download en_core_web_sm command.
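
For reference, in a Jupyter-style notebook, the local setup described above would look something like this (a minimal sketch; adjust it to your environment):

# Install the visualization and NLP dependencies mentioned above
!pip install spacy matplotlib graphviz
# Download the small English model for spaCy
!python -m spacy download en_core_web_sm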

Let's build a semantic blueprint visualizer in Python that will generate our stemma, which is essentially a graph of semantic nodes and labeled edges. We will break down the script block by block, explaining the purpose of each section.

First, import the necessary tools. Our visualizer relies on matplotlib for plotting:

  • import matplotlib.pyplot as plt: The main plotting interface, imported with the conventional alias plt for ease of use
  • from matplotlib.patches import FancyArrowPatch: A utility for drawing clean, directed arrows that connect the verb to its roles
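
Put together, the import cell contains just these two lines:

import matplotlib.pyplot as plt                 # main plotting interface
from matplotlib.patches import FancyArrowPatch  # directed arrows between nodes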

With these tools in place, we can now define the SRL visualizing function.

The main function: visualize_srl

The heart of our program is the main function, visualize_srl(). This is the primary point of interaction, represented by step 1 in Figure 1.2. As the user, you provide the core components of a sentence as arguments, and the function takes care of the rest, organizing the data and sending it to the plotting helper function.

The arguments correspond directly to the semantic roles we defined earlier. To keep the interface flexible, the function also accepts **kwargs, which allows you to pass in any number of optional modifiers (for example, temporal or location details) without complicating the function signature.

The sole purpose of visualize_srl() is to assemble these roles into a dictionary (srl_roles) and then hand them off, along with the predicate (verb), to the internal _plot_stemma() function for visualization:

def visualize_srl(verb, agent, patient, recipient=None, **kwargs):
    """
    Creates a semantic blueprint and visualizes it as a stemma.
    This is the main, user-facing function.
    """
    srl_roles = {
        "Agent (ARG0)": agent,
        "Patient (ARG1)": patient,
    }
    if recipient:
        srl_roles["Recipient (ARG2)"] = recipient
    # Add any extra modifier roles passed in kwargs
    for key, value in kwargs.items():
        # Format the key for display, e.g., "temporal" -> "Temporal (ARGM-TMP)"
        role_name = f"{key.capitalize()} (ARGM-{key[:3].upper()})"
        srl_roles[role_name] = value
    _plot_stemma(verb, srl_roles)

This function is deliberately simple: it shows the flow of meaning rather than the full complexity of a production-ready parser. By doing so, it keeps the focus on how semantic roles are organized into a blueprint that an LLM can use.

Before we start drawing the stemma itself, let’s take a closer look at the semantic roles that drive this structure.

Defining the semantic roles

Let's review the roles defined in visualize_srl; labeling roles explicitly is one of the core methods of context engineering. Let's return to our running example:

Sarah pitched the new project to the board in the morning.

At first glance, this looks like a simple sentence. To an LLM, it is nothing more than a sequence of tokens, one word after another. But for a context engineer, every part of the sentence plays a role in the unfolding action. To capture those roles, we use a set of labels, the primary tools for defining meaning and adding multidimensional structure to an otherwise linear sequence of words:

  1. Predicate (the verb): The predicate is the heart of the sentence, the central action or state of being. Every other role is defined in relation to it. In our example, pitched is the predicate.
  2. Agent (ARG0): The agent is the entity that performs the action, the “doer.” In our example, Sarah is the agent.
  3. Patient (ARG1): The patient is the entity directly affected or acted upon by the verb. In our example, the new project is the patient.
  4. Recipient (ARG2): The recipient is the entity that receives the patient or the result of the action. In our example, the board is the recipient.
  5. Argument Modifiers (ARGM-): These are additional roles that provide context but are not central to the core action. They answer questions such as when, where, why, or how:
    • Temporal (ARGM-TMP): Specifies when the action occurred. In our example, …in the morning is the Temporal modifier.
    • Location (ARGM-LOC): Specifies where the action occurred. For example, …in the conference room is the location modifier.
    • Manner (ARGM-MNR): Specifies how the action was performed. For example, …with great confidence is the manner modifier.

With these roles in place, a sentence is no longer just a linear, flat sequence of words. It becomes a structured map of meaning. And now, we are ready to plot the stemma.

The plotting engine: _plot_stemma and canvas setup

Our internal helper function is _plot_stemma (step 3 in Figure 1.2). It does the heavy lifting of drawing the visualization. We keep it focused on one responsibility: draw.

First, we create a blank canvas (fig, ax) and set its dimensions. We turn off the axes because we are creating a diagram, not a traditional chart.

def _plot_stemma(verb, srl_roles):
    """Internal helper function to generate the stemma visualization."""
    fig, ax = plt.subplots(figsize=(10, 6))
    ax.set_xlim(0, 10)
    ax.set_ylim(0, 10)
    ax.axis('off')

Next, we customize the styles (step 4 in Figure 1.2) by defining the visual appearance of our nodes. We use Python dictionaries to store the styling rules (box style, colors) for the verb and the roles, making our code clean and easy to modify if we want to change the look later:

    verb_style = dict(boxstyle="round,pad=0.5", fc="lightblue", ec="b")
    role_style = dict(boxstyle="round,pad=0.5", fc="lightgreen", ec="g")

With the canvas prepared and styles in place, we’re ready for dynamic positioning, placing each role so the diagram remains clear and balanced regardless of how many components we include.

Dynamic positioning and drawing the stemma (graph)

We begin with the root node (the verb). Since the verb is the anchor of our structure, we give it a fixed position near the top center of the canvas. Using ax.text(), we draw it on the diagram and apply the verb_style we defined earlier.

    verb_pos = (5, 8.5)
    ax.text(verb_pos[0], verb_pos[1], verb, ha="center", va="center",
        bbox=verb_style, fontsize=12)

We will now position the child nodes (the roles) of the root node. To make our diagram look clean no matter how many roles we have, we need to calculate their positions dynamically. We count the number of roles and then create a list of evenly spaced x_positions along a horizontal line:

    srl_items = list(srl_roles.items())
    num_roles = len(srl_items)
    x_positions = [10 * (i + 1) / (num_roles + 1)
        for i in range(num_roles)]
    y_position = 4.5
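
For instance, with the four roles from Example 1 later in this chapter, this comprehension spreads the child nodes evenly across the 0-10 canvas. A quick check (not part of the notebook) confirms the spacing:

num_roles = 4
x_positions = [10 * (i + 1) / (num_roles + 1) for i in range(num_roles)]
print(x_positions)  # [2.0, 4.0, 6.0, 8.0]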

Let's add connections to our stemma graph by drawing the roles, arrows, and labels.

We will loop through each role in our srl_roles dictionary. In each iteration of the loop, we perform three actions:

  1. Draw the role node: We use ax.text() again to draw the box for the current role at its calculated position
  2. Draw the connecting arrow: We create FancyArrowPatch that connects the verb's position to the role's position and add it to our plot
  3. Draw the arrow label: We calculate the midpoint of the arrow and place the role's name (e.g., Agent (ARG0)) there, so it's clear what the relationship is

The code will then manage the positioning:

    for i, (role, text) in enumerate(srl_items):
        child_pos = (x_positions[i], y_position)
        ax.text(child_pos[0], child_pos[1], text,
                ha="center", va="center",
                bbox=role_style, fontsize=10, wrap=True)

        arrow = FancyArrowPatch(
            verb_pos,
            child_pos,
            arrowstyle='->',
            mutation_scale=20,
            shrinkA=15,
            shrinkB=15,
            color='gray'
        )
        ax.add_patch(arrow)

        label_pos = (
            (verb_pos[0] + child_pos[0]) / 2,
            (verb_pos[1] + child_pos[1]) / 2 + 0.5
        )
        ax.text(label_pos[0], label_pos[1],
            role, ha="center", va="center",
            fontsize=9, color='black', bbox=dict(boxstyle="square,pad=0.1", fc="white", ec="none"))

Finally, we give our visualization a title and use plt.show() to display the finished stemma:

    fig.suptitle("The Semantic Blueprint (Stemma Visualization)",
        fontsize=16)
    plt.show()

At this point, our main function is fully defined. We can now run examples to see our stemma visualizer in action.

Running SRL examples

By calling our visualize_srl function with different arguments, we can instantly transform linear sentences into structured semantic blueprints. The following examples will reinforce your understanding of the core semantic roles and demonstrate the flexibility of the tool we’ve built.

Example 1: Business pitch

Let’s begin with the sentence we’ve been deconstructing throughout this section:

Sarah pitched the new project to the board in the morning.

The corresponding SRL definition is:

print("Example 1: A complete action with multiple roles.")
visualize_srl(
    verb="pitch",
    agent="Sarah",
    patient="the new project",
    recipient="to the board",
    temporal="in the morning"
)

In this example, we provide values for all the core roles and one modifier:

  • Predicate: The central action is pitch.
  • Agent (ARG0): The one doing the pitching is Sarah.
  • Patient (ARG1): The thing being pitched is the new project.
  • Recipient (ARG2): The entity receiving the pitch is to the board.
  • Temporal (ARGM-TMP): The action took place in the morning.

Running this code produces the stemma shown in Figure 1.3, with pitch as the root node and four child nodes representing the roles:


Figure 1.3: A root node and four child nodes

Let’s move on and explore another example.

Example 2: Technical update

Now, let's model a different type of sentence, this time including a location. Consider this sentence:

The backend team resolved the critical bug in the payment gateway.

The SRL definition will be as follows:

print("\nExample 2: An action with a location")
visualize_srl(
    verb="resolved",
    agent="The backend team",
    patient="the critical bug",
    location="in the payment gateway"
)

Here’s how the components map to the semantic roles:

  • Predicate: The action is resolved
  • Agent (ARG0): The entity that performed the resolution is the backend team
  • Patient (ARG1): The thing that was resolved is the critical bug
  • Location (ARGM-LOC): The context for where the bug was resolved is in the payment gateway

The visualizer generates the stemma shown in Figure 1.4, with resolved at the top, connected to its three key participants. This clearly structures the technical update.


Figure 1.4: A stemma with an action and a location

Let's see how to visualize how something is done.

Example 3: Project milestone

Finally, let's visualize a sentence that describes how something was done, using a manner modifier. The sentence is this:

Maria's team deployed the new dashboard ahead of schedule.

The SRL definition is this:

print("\nExample 3: Describing how an action was performed")
visualize_srl(
    verb="deployed",
    agent="Maria's team",
    patient="the new dashboard",
    manner="ahead of schedule"
)

The roles map as follows:

  • Predicate: The action is deployed
  • Agent (ARG0): The doer of the action is Maria's team
  • Patient (ARG1): The entity acted upon is the new dashboard
  • Manner (ARGM-MNR): This modifier describes how the deployment was done: ahead of schedule.

Running this code generates the stemma shown in Figure 1.5, which illustrates how a Manner modifier adds a contextual layer to the core agent-patient relationship.


Figure 1.5: A stemma explaining the “how” of an action

These three examples illustrate how a sentence, a simple sequence of words, can be broken down into its semantic components using SRL. What was once linear text becomes a structured blueprint. With this, you gain control over the information you provide to an LLM. With the SRL tool in hand, we are ready to apply this framework to a practical use case.

Engineering a meeting analysis use case

This meeting analysis demonstrates a powerful technique called context chaining, where we guide an LLM through a multi-step analysis. Instead of one large, complex prompt, we use a series of simpler, focused prompts, where the output of one step becomes the input for the next. This creates a highly controlled and logical workflow.

We use a context chaining process because an LLM, despite its power, has no true memory or long-term focus. Giving an LLM a single, massive prompt with a complex, multi-step task is akin to giving a brilliant but forgetful assistant a lengthy list of verbal instructions and hoping for the best. The model will often lose track of the primary goal, get bogged down in irrelevant details, and produce a muddled, unfocused result.

Context chaining solves this problem by transforming a complex task into a controlled, step-by-step dialogue. Each step has a single, clear purpose, and its output becomes the clean, focused input for the next step. This method gives you, the context engineer, three critical advantages:

  • Precision and control: You can guide the AI's thought process at each stage, ensuring the analysis stays on track
  • Clarity and debugging: If one step produces a poor result, you know exactly which prompt to fix, rather than trying to debug a single, monolithic instruction
  • Building on insight: It creates a narrative flow, allowing the AI to build upon the refined insights from the previous step, leading to a far more sophisticated and coherent final outcome
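
Before we look at the program flowchart, here is a minimal sketch of the chaining pattern itself, assuming an OpenAI client configured as in the notebook cells later in this section (the ask_llm helper name is purely illustrative):

def ask_llm(prompt, model="gpt-5"):
    """Send one focused prompt and return the model's text reply."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}]
    )
    return response.choices[0].message.content

# The output of one step becomes the input of the next
substantive_content = ask_llm(f"Isolate the substantive content:\n{meeting_transcript}")
new_developments = ask_llm(f"List only the new developments:\n{substantive_content}")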

Let's review the program flowchart, as shown in Figure 1.6:


Figure 1.6: Context chaining flowchart

This branching structure is the key to the technique's power. It transforms the workflow from a single path into a parallel, multi-task analysis, starting with identifying new developments, analyzing implicit dynamics, and generating a novel solution. The complexity of this flow explains why we will be building a Context Engine in later chapters to manage these relationships automatically:

  1. Input raw transcript (meeting_transcript)
    • Purpose: Provide the single source of raw data
    • Predecessor: None (This is the starting data)
    • Successor: g2 (Isolate Key Content)
  2. Isolate key content (g2)
    • Purpose: Clean the data; separate signal from noise.
    • Predecessor: transcript (Input Raw Transcript).
    • Successors: This is the main branching point. Its output (substantive_content) is the direct predecessor for three different, parallel steps:
      • g3 (Identify New Developments)
      • g4 (Analyze Implicit Dynamics)
      • g5 (Generate Novel Solution)
  3. Identify new developments (g3)
    • Purpose: Find new information vs. an old summary
    • Predecessors: substantive_content (from g2) and previous_summary
    • Successor: g6 (Create Structured Summary)
  4. Analyze implicit dynamics (g4)
    • Purpose: Analyze feelings and social subtext
    • Predecessor: substantive_content (from Step 2)
    • Successor: None (leads to a final output, implicit_threads)
  5. Generate novel solution (g5)
    • Purpose: Synthesize facts into a new, creative idea
    • Predecessor: substantive_content (from Step 2)
    • Successor: None (leads to a final output, novel_solution).
  6. Create structured summary (g6)
    • Purpose: Format the new developments into a table.
    • Predecessor: g3 (Identify New Developments).
    • Successor: g7 (Draft Follow-up Action).
  7. Draft follow-up action (g7)
    • Purpose: Convert the structured summary into an action item.
    • Predecessor: g6 (Create Structured Summary).
    • Successor: None (Leads to the final output, follow_up_email).
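
One way to read this structure is as a simple mapping from each step to its predecessors, mirroring Figure 1.6 (a reading aid only, not part of the notebook):

chain_predecessors = {
    "g2_isolate_key_content":       ["meeting_transcript"],
    "g3_identify_new_developments": ["g2_isolate_key_content", "previous_summary"],
    "g4_analyze_implicit_dynamics": ["g2_isolate_key_content"],
    "g5_generate_novel_solution":   ["g2_isolate_key_content"],
    "g6_create_structured_summary": ["g3_identify_new_developments"],
    "g7_draft_follow_up_action":    ["g6_create_structured_summary"],
}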

Let's now translate this flowchart plan into actual code and run it step by step:

  1. Open Use_Case.ipynb in the chapter directory. We will first install the necessary OpenAI library.
    # Cell 1: Installation
    !pip install openai
  2. This workflow uses Google Colab Secrets to store the OpenAI API key. Load the key, set the environment variable, and initialize the client:
    # Cell 2: Imports and API Key Setup
    # We will use the OpenAI library to interact with the LLM and Google Colab's
    # secret manager to securely access your API key.
    
    import os
    from openai import OpenAI
    from google.colab import userdata
    
    # Load the API key from Colab secrets, set the env var, then init the client
    try:
        api_key = userdata.get("API_KEY")
        if not api_key:
            raise userdata.SecretNotFoundError("API_KEY not found.")
    
        # Set environment variable for downstream tools/libraries
        os.environ["OPENAI_API_KEY"] = api_key
    
        # Create client (will read from OPENAI_API_KEY)
        client = OpenAI()
        print("OpenAI API key loaded and environment variable set successfully.")
    
    except userdata.SecretNotFoundError:
        print('Secret "API_KEY" not found.')
        print('Please add your OpenAI API key to the Colab Secrets Manager.')
    except Exception as e:
        print(f"An error occurred while loading the API key: {e}")
  3. Add the meeting transcript as a multi-line string. This will be the input for the chained steps that follow:
    # Cell 3: The Full Meeting Transcript
    meeting_transcript = """
            Tom: Morning all. Coffee is still kicking in.
            Sarah: Morning, Tom. Right, let's jump in. Project Phoenix timeline. Tom, you said the backend components are on track?
            Tom: Mostly. We hit a small snag with the payment gateway integration. It's... more complex than the docs suggested. We might need another three days.
            Maria: Three days? Tom, that's going to push the final testing phase right up against the launch deadline. We don't have that buffer.
            Sarah: I agree with Maria. What's the alternative, Tom?
            Tom: I suppose I could work over the weekend to catch up. I'd rather not, but I can see the bind we're in.
            Sarah: Appreciate that, Tom. Let's tentatively agree on that. Maria, what about the front-end?
            Maria: We're good. In fact, we're a bit ahead. We have some extra bandwidth.
            Sarah: Excellent. Okay, one last thing. The marketing team wants to do a big social media push on launch day. Thoughts?
            Tom: Seems standard.
            Maria: I think that's a mistake. A big push on day one will swamp our servers if there are any initial bugs. We should do a soft launch, invite-only for the first week, and then do the big push. More controlled.
            Sarah: That's a very good point, Maria. A much safer strategy. Let's go with that. Okay, great meeting. I'll send out a summary.
            Tom: Sounds good. Now, more coffee.
            """

You’re ready to start context chaining. The first action will separate signal from noise in meeting_transcript: extract decisions, updates, and issues; set aside greetings and small talk. Let’s get into it!

Layer 1: Establishing the scope (the “what”)

We’ll define the idx_8078cc03scope of analysis first. As established, each step’s output will feed the next, creating a chain of context.

  1. Tell the model exactly what to extract and what to ignore. The goal is to separate substantive content (decisions, updates, problems, proposals) from conversational noise (greetings, small talk).
    # Cell 4: g2 - Isolating Content from Noise
    prompt_g2 = f"""
            Analyze the following meeting transcript. Your task is to isolate the substantive content from the conversational noise.
            - Substantive content includes: decisions made, project updates, problems raised, and strategic suggestions.
            - Noise includes: greetings, pleasantries, and off-topic remarks (like coffee).
            Return ONLY the substantive content.
    
            Transcript:
            ---
            {meeting_transcript}
            ---
            """
  2. Now we activate OpenAI to isolate the substantive content:
    from openai import OpenAI
    try:
        client = OpenAI()
    
        response_g2 = client.chat.completions.create(
            model="gpt-5",
            messages=[
                {"role": "user", "content": prompt_g2}
            ]
        )
    
        substantive_content = response_g2.choices[0].message.content
        print("--- SUBSTANTIVE CONTENT ---")
        print(substantive_content)
    
    except Exception as e:
        print(f"An error occurred: {e}")

    The output displays the substantive content:

    --- SUBSTANTIVE CONTENT ---
    - Project Phoenix timeline: Backend mostly on track, but payment gateway integration is more complex than expected; needs an additional three days.
    - Impact: Extra three days would push final testing up against the launch deadline, reducing buffer.
    - Mitigation decision: Tom will work over the weekend to catch up (tentatively agreed).
    - Front-end status: Ahead of schedule with extra bandwidth.
    - Marketing/launch strategy: Initial plan for a big social media push on launch day flagged as risky (potential server load with early bugs). Decision: Use a soft launch (invite-only) for the first week, then execute the big push.
  3. Next, we simulate a retrieval-augmented generation (RAG) context by comparing the new meeting with a summary of the previous one. This narrows the focus to what's new, showing the importance of historical context:
    # Cell 5: g3 - Identifying NEW Information (Simulated RAG)
    previous_summary = "In our last meeting, we finalized the goals for Project Phoenix and assigned backend work to Tom and front-end to Maria."
    
    prompt_g3 = f"""
    Context: The summary of our last meeting was: "{previous_summary}"
    
    Task: Analyze the following substantive content from our new meeting. Identify and summarize ONLY the new developments, problems, or decisions that have occurred since the last meeting.
    
    New Meeting Content:
    ---
    {substantive_content}
    ---
    """
    

    We now put OpenAI to work again for the RAG-like task:

    from openai import OpenAI
    from google.colab import userdata
    
    # Your chat completion request
    try:
        response_g3 = client.chat.completions.create(
            model="gpt-5",
            messages=[{"role": "user", "content": prompt_g3}]
        )
        new_developments = response_g3.choices[0].message.content
        print("--- NEW DEVELOPMENTS SINCE LAST MEETING ---")
        print(new_developments)
    except Exception as e:
        print(f"An error occurred: {e}")

    The output narrows the focus to the core information of the meeting:

    --- NEW DEVELOPMENTS SINCE LAST MEETING ---
    - Backend issue: Payment gateway integration is more complex than expected; needs an additional three days.
    - Schedule impact: The extra three days compress final testing, pushing it up against the launch deadline and reducing buffer.
    - Mitigation decision: Tentative agreement that Tom will work over the weekend to catch up.
    - Front-end status: Ahead of schedule with extra bandwidth.
    - Launch/marketing decision: Shift from a big day-one social push to a one-week invite-only soft launch, followed by the major push.

We have uncovered the scope; in other words, the what of the transcript. Let’s dig into the how.

Layer 2: Conducting the investigation (the “how”)

Now we move from identifying facts to generating insights, the core of the semantic context interpretation journey. This is where the prompt asks the AI to read between the lines.

  1. Beyond explicit facts, every meeting carries subtext: hesitations, tensions, and moods. This step asks the AI to analyze the underlying dynamics:
    # Cell 6: g4 - Uncovering Implicit Threads
    prompt_g4 = f"""
    Task: Analyze the following meeting content for implicit social dynamics and unstated feelings. Go beyond the literal words.
    - Did anyone seem hesitant or reluctant despite agreeing to something?
    - Were there any underlying disagreements or tensions?
    - What was the overall mood?
    
    Meeting Content:
    ---
    {substantive_content}
    ---
    """
  2. We now run the prompt to explore the implicit dynamics of the meeting beyond the literal words:
    try:
        response_g4 = client.chat.completions.create(
            model="gpt-5",
            messages=[{"role": "user", "content": prompt_g4}]
        )
        implicit_threads = response_g4.choices[0].message.content
        print("--- IMPLICIT THREADS AND DYNAMICS ---")
        print(implicit_threads)
    except Exception as e:
        print(f"An error occurred: {e}")

    The output at this stage highlights something new: LLMs aren’t just parroting facts or summarizing events. They are also surfacing the subtext:

    --- IMPLICIT THREADS AND DYNAMICS ---
    Here's what seems to be happening beneath the surface:
    
    Hesitation/reluctance despite agreement
    - Tom's "tentative" agreement to work over the weekend reads as reluctant. It suggests he felt pressure to volunteer rather than genuine willingness.
    - Marketing likely agreed to the soft launch with some reluctance; shifting from a big day-one push to invite-only implies a concession to engineering risk.
    
    Underlying disagreements or tensions
    - Pace vs quality: Engineering wants stability and buffer; marketing originally aimed for impact. The soft launch is a compromise, but the differing risk appetites remain.
    - Workload equity: Backend is behind while frontend has "extra bandwidth." The decision to have Tom work the weekend (vs redistributing tasks) hints at siloing or a norm of individual heroics, which can breed quiet resentment.
    - Testing squeeze: Pushing testing against the deadline implies QA will be under pressure, potentially creating friction if bugs slip through or late changes occur.
    - Estimation confidence: The payment gateway being "more complex than expected" may subtly challenge earlier estimates, inviting unspoken doubt about planning or vendor integration assumptions.
    
    Overall mood
    - Sober, pragmatic, and slightly tense. The group is solution-oriented and collaborative, but there's a sense of urgency and strain, with relief at having a plan tempered by concerns about workload, risk, and reduced buffer.

    This is where context chaining shifts from recording what happened to interpreting why it matters. The result feels less like a raw transcript and more like an analyst’s commentary, giving us insights into team dynamics that would otherwise remain unstated.

  3. Next, we prompt the AI to be creative and solve a problem by synthesizing different ideas from the meeting, demonstrating its thinking power:
    # Cell 7: g5 - Generating a Novel Solution
    prompt_g5 = f"""
    Context: In the meeting, Maria suggested a 'soft launch' to avoid server strain, and also mentioned her team has 'extra bandwidth'.
    Tom is facing a 3-day delay on the backend.
    
    Task: Propose a novel, actionable idea that uses Maria's team's extra bandwidth to help mitigate Tom's 3-day delay. Combine these two separate pieces of information into a single solution.
    """
    try:
        response_g5 = client.chat.completions.create(
            model="gpt-5",
            messages=[{"role": "user", "content": prompt_g5}]
        )
        novel_solution = response_g5.choices[0].message.content
        print("--- NOVEL SOLUTION PROPOSED BY AI ---")
        print(novel_solution)
    except Exception as e:
        print(f"An error occurred: {e}")

    The prompt challenges the AI to take two separate threads from the meeting, Maria’s extra frontend bandwidth and Tom’s three-day backend delay, and weave them into a novel solution. This is where context chaining demonstrates its real strength: instead of asking for raw recall or passive interpretation, we push the model to generate something new, something that didn’t exist in the transcript:

    --- NOVEL SOLUTION PROPOSED BY AI ---
    Idea: Soft launch behind a temporary "Edge Bridge" that Maria's team builds to buffer reads/writes until Tom's backend is ready.
    
    What Maria's team does (uses their extra bandwidth)
    - Stand up a thin serverless/API facade that matches the real API contracts (e.g., API Gateway/Lambda + SQS/DynamoDB or Cloudflare Workers + Durable Objects).
    - Reads: Serve from a prewarmed cache or static snapshots (stale-while-revalidate). Update snapshots hourly via a lightweight data export from staging or existing endpoints.
    - Writes: Capture requests into a durable queue with idempotency keys; return immediate "queued" success to the client and show "syncing" UI. When Tom's backend is live, a replay worker drains the queue and applies changes.
    - Add feature flags/traffic gating (e.g., LaunchDarkly) to limit the soft launch cohort and throttle requests to avoid server strain.
    
    How this mitigates the 3-day delay
    - The product can soft-launch to a small cohort without waiting for the backend; users get read access and buffered writes.
    - When Tom's backend is ready, flip routing to the real backend and drain the queue to reconcile data.
    
    Action plan and timeline
    - Day 0 (today): Identify minimal critical endpoints for the soft launch. Classify by read vs write. Define API contracts and idempotency rules. Set success/error thresholds and a kill switch.
    - Day 1: Maria's team builds the Edge Bridge, cache, and write queue; implement basic observability and encryption-at-rest for any PII in the queue. Front-end adds "syncing" UI states and feature flags.
    - Day 2: QA with mocked data, then with a tiny internal cohort. Prewarm caches. Set traffic cap (e.g., 5–10% of target users).
    - Day 3: Soft launch goes live on the Edge Bridge. When Tom's backend unlocks, switch routing gradually, start replay worker, monitor for conflicts, then retire the bridge.
    
    Risk controls
    - Data consistency: Use idempotency keys and a simple conflict policy (latest-write-wins or version checks).
    - Rollback: Feature flag to disable writes or pause replay if error rate exceeds threshold.
    - Privacy: Encrypt queued payloads; limit PII scope.
    
    Owners
    - Maria's team: Edge Bridge, caching, queue/replay, monitoring.
    - Tom's team: Final backend endpoints, schema, and replay acceptance hooks.
    - Front-end: Feature-flag routing and "queued/syncing" UX.
    
    This combines Maria's extra bandwidth with a controlled soft launch to keep momentum while absorbing Tom's 3-day backend delay.

The output shows how an LLM can function as a creative collaborator, proposing a technically feasible workaround that blends system design, risk controls, and role assignments. The result feels less like a “guess” and more like a carefully reasoned plan a senior engineer might sketch out in a design session.

Layer 3: Determining the action (the “what next”)

Finally, we turn the analysis into concrete, forward-looking artifacts. Up to this point, we’ve been generating raw insights: new facts, implicit dynamics, and creative solutions. But insights are only valuable if they can be communicated clearly. The next step is to compile everything into a structured, final summary.

This serves two purposes:

  • It forces clarity since every item is reduced to topic, outcome, and owner.
  • It makes information reusable. Whether dropped into an email, report, or dashboard, the summary is clean and immediately actionable.
# Cell 8: g6 - Creating the Final, Structured Summary
prompt_g6 = f"""
Task: Create a final, concise summary of the meeting in a markdown table.
Use the following information to construct the table.

- New Developments: {new_developments}

The table should have three columns: "Topic", "Decision/Outcome", and "Owner".
"""
try:
    response_g6 = client.chat.completions.create(
        model="gpt-5",
        messages=[{"role": "user", "content": prompt_g6}]
    )
    final_summary_table = response_g6.choices[0].message.content
    print("--- FINAL MEETING SUMMARY TABLE ---")
    print(final_summary_table)
except Exception as e:
    print(f"An error occurred: {e}")

The output we get represents the essence of the meeting: all the noise, subtext, and negotiation distilled into a crisp reference point:

--- FINAL MEETING SUMMARY TABLE ---
| Topic | Decision/Outcome | Owner |
|---|---|---|
| Backend payment gateway integration | More complex than expected; requires an additional three days | Backend Team |
| Schedule impact | Extra three days compress final testing, reducing buffer before launch | Project Manager |
| Mitigation | Tentative plan: Tom will work over the weekend to catch up | Tom |
| Front-end status | Ahead of schedule with extra bandwidth available | Front-end Team |
| Launch/marketing plan | Shift to a one-week invite-only soft launch, then major day-one push | Marketing + Product |

The final step in context chaining is to close the loop from insight to action: turning the structured analysis into a professional follow-up email:

# Cell 9: g7 - Drafting the Follow-Up Action
prompt_g7 = f"""
Task: Based on the following summary table, draft a polite and professional follow-up email to the team (Sarah, Tom, Maria).
The email should clearly state the decisions made and the action items for each person.

Summary Table:
---
{final_summary_table}
---
"""

The LLM will do the heavy lifting again and get the job done:

try:
    response_g7 = client.chat.completions.create(
        model="gpt-5",
        messages=[{"role": "user", "content": prompt_g7}]
    )
    follow_up_email = response_g7.choices[0].message.content
    print("--- DRAFT FOLLOW-UP EMAIL ---")
    print(follow_up_email)
except Exception as e:
    print(f"An error occurred: {e}")

The final action is a follow-up email based on a deep analysis of the meeting:

--- DRAFT FOLLOW-UP EMAIL ---
Subject: Follow-up: Decisions and next steps from today's sync

Hi Sarah, Tom, and Maria,

Thanks for the productive discussion earlier. Here's a quick recap of decisions and the action items for each of us.

Decisions
- Backend payment gateway integration is more complex than expected and will require an additional three days.
- This pushes the schedule by three days and compresses the final testing window, reducing our pre-launch buffer.
- Mitigation: Tom will work over the weekend to help us catch up.
- Front-end is ahead of schedule and has extra bandwidth to support.
- Launch/marketing plan will shift to a one-week invite-only soft launch, followed by the larger day-one push.

Action items
- Tom:
  - Confirm weekend availability and share a brief plan (key milestones, dependencies, and any risks).
  - Proceed with the gateway integration and coordinate early integration testing with Front-end and QA.
  - Provide short daily progress updates and flag blockers immediately.

- Sarah:
  - Update the project timeline to reflect the three-day shift and the compressed QA window.
  - Coordinate with QA on a risk-based test plan that fits the shortened testing period.
  - Align with Marketing/Product on the invite-only soft launch scope, success metrics, and comms; circulate the plan to the team.

- Maria:
  - Reallocate Front-end bandwidth to support the backend integration (payment UI hooks, error handling, instrumentation).
  - Partner with Tom on mocks/stubs as needed to unblock early integration and QA.
  - Ensure front-end readiness for the soft launch (feature flags/toggles, tracking) and share any gaps.

Please reply to confirm your action items and note any constraints or support you need. I'm happy to set up a brief daily check-in while we work through this; propose a time if you have a preference.

Thanks all, and appreciate the quick coordination.

Best,
[Your Name]

This final artifact demonstrates the full promise of context chaining. The email reads like it was crafted by a diligent project manager. Instead of ending with abstract insights or a loose list of points, the workflow produces professional communication. The AI has not only summarized the meeting but has also translated it into a format that does the following:

  • Captures decisions clearly so there’s no ambiguity about what was agreed upon
  • Assigns ownership, ensuring every task is tied to a responsible person
  • Sets expectations such as timelines, next steps, and accountability
  • Reduces follow-up friction as the draft is already polished enough to send, saving the human time and energy

This is the moment where the LLM stops being a “note-taker” and becomes the creative partner we described at the start of this chapter. In this use case, we didn’t just get a summary; we saw how to think with the AI. The human remains at the center of the process and can build reusable context-chaining templates for meetings, email processing, reporting, or anything else you can imagine, as the sketch below illustrates. Used well, context chaining can elevate the way teams, companies, and clients operate.
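
To make that reuse concrete, the whole workflow can be expressed as an ordered list of prompt templates driven by a small runner. The code below is our own minimal sketch, not a cell from the chapter’s notebook: it assumes the same client object and model used in the cells above, and the step names and template wording are placeholders you would replace with your own.

# A minimal, reusable context-chaining runner (illustrative sketch, not the chapter's notebook code).
# Assumes the same OpenAI client object used in the cells above; step names and prompts are placeholders.
def run_chain(client, steps, initial_input, model="gpt-5"):
    context = initial_input
    results = {}
    for name, template in steps:
        prompt = template.format(previous=context)
        response = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}]
        )
        context = response.choices[0].message.content  # this output feeds the next step
        results[name] = context
    return results

# Example template chain for a meeting workflow (wording is illustrative):
meeting_chain = [
    ("new_developments", "List the new facts and decisions in this transcript:\n{previous}"),
    ("summary_table", "Summarize as a markdown table with Topic, Decision/Outcome, and Owner columns:\n{previous}"),
    ("follow_up_email", "Draft a professional follow-up email based on this summary table:\n{previous}"),
]
# results = run_chain(client, meeting_chain, raw_transcript)

Each entry in results maps a step name to its output, so every intermediate artifact remains inspectable, keeping the human at the center of the process as emphasized above.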

Let’s now step back, summarize what we’ve accomplished, and move on to the next exploration in context engineering.

Summary

This chapter introduced context engineering as an emerging skill for turning LLMs into reliable, goal-oriented systems. Instead of relying on unstructured prompts, we showed how control comes from engineering the informational environment, culminating in the semantic blueprint as the most precise form of direction.

We traced this shift through a five-level progression: from zero-context prompts that yielded generic outputs to linear, goal-oriented, and role-based contexts, each proving that structured input drives better results. To formalize this approach, we introduced SRL, a method that breaks sentences into predicate, agent, patient, and modifiers, supported by a Python visualizer that renders these roles as a stemma diagram.

Finally, we applied these skills in a meeting analysis use case, where context chaining turned a raw transcript into actionable outcomes. Step by step, the process reduced noise, highlighted new developments, surfaced implicit dynamics, and produced structured summaries and follow-up actions.

Together, SRL and context chaining provide the theoretical framework and the practical workflow, respectively, for moving beyond prompting. We are now ready to engineer agentic contexts in the next chapter.

Questions

  1. Is the primary goal of context engineering, as defined in this chapter, simply to ask an LLM more creative questions? (Yes or no)
  2. Does a “Level 2: Linear Context” provide enough information to control an LLM’s narrative style and purpose reliably? (Yes or no)
  3. Is the “Semantic Blueprint” at Level 5 presented as the most effective method for architecting a precise and reliable AI response? (Yes or no)
  4. Is the main function of Semantic Role Labeling (SRL) to check the grammatical correctness of a sentence? (Yes or no)
  5. In the sentence “Sarah pitched the new project,” is “Sarah” identified as the patient (ARG1)? (Yes or no)
  6. Do Argument Modifiers (ARGM-), such as temporal or location, represent the central and essential components of an action in SRL? (Yes or no)
  7. Does the chapter’s final use case rely on a single, large, and complex prompt to analyze the meeting transcript? (Yes or no)
  8. Is the technique of “context chaining” defined as using the output from one LLM call as the input for the next? (Yes or no)
  9. In the use case workflow, is the step Analyze Implicit Dynamics designed to extract only the explicit facts and decisions from the text? (Yes or no)
  10. Does the chapter’s meeting analysis workflow end with the creation of an actionable artifact, such as a draft email? (Yes or no)

References

Further reading

  • Brown, Tom, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D. Kaplan, Prafulla Dhariwal, Arvind Neelakantan, et al. 2020. “Language Models Are Few-Shot Learners.” Advances in Neural Information Processing Systems 33: 1877–1901. https://arxiv.org/abs/2005.14165
  • Wei, Jason, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Ed H. Chi, Quoc V. Le, and Denny Zhou. 2022. “Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.” Advances in Neural Information Processing Systems 35: 24824–24837. https://arxiv.org/abs/2201.11903

