# Chapter 1: Your First LLM Call
- File(s) to edit: `src/mock.rs`
- Test to run: `cargo test -p mini-claw-code-starter test_mock_`
- Estimated time: 15 min
Before building an agent, you need to talk to an LLM. In this chapter you will implement a `MockProvider` — a fake LLM that returns canned responses. No API key, no HTTP, no network. Just the protocol.
## The nouns
Before any code, a one-line glossary of the types you'll meet in chapters 1–3. They're all already defined in `src/types.rs` — this list is just so the names aren't strangers. Chapter 4 is the deep dive; for now, a sentence each is enough:
| Type | What it is |
|---|---|
| `Message` | Enum of conversation entries: `System`, `User`, `Assistant`, `ToolResult`, `Attachment`, `Progress`. |
| `AssistantTurn` | What the LLM returns: optional text, a `Vec<ToolCall>`, a `StopReason`, optional `TokenUsage`. |
| `StopReason` | `Stop` (the LLM is done) or `ToolUse` (it wants to call tools). |
| `ToolCall` | The LLM's request to call a tool: id, name, JSON arguments. |
| `ToolDefinition` | JSON-Schema description of a tool, sent to the LLM so it knows what's available. |
| `Tool` | Trait with `definition()` and `call()` — implement it to give the agent a new capability. |
| `ToolSet` | A `HashMap<String, Box<dyn Tool>>` for dispatching tool calls by name. |
| `Provider` | Trait with one `chat()` method — the abstraction over "an LLM that responds to messages." |
If any of these feel fuzzy later, come back here. Chapter 4 rebuilds all of them from scratch with full commentary.
## Goal
Implement `MockProvider` so that:

- You create it with a `VecDeque<AssistantTurn>` of canned responses.
- Each call to `chat()` returns the next response in sequence.
- If all responses have been consumed, it returns an error.
## The protocol
Every LLM interaction follows the same pattern:
```mermaid
sequenceDiagram
    participant C as Your Code
    participant L as LLM
    C->>L: messages + tool definitions
    L-->>C: text and/or tool calls + stop reason
```
You send messages and a list of available tools. The LLM responds with text, tool calls, or both — plus a `StopReason` telling you what to do next.
In Rust, that is one trait with one method:
```rust
pub trait Provider: Send + Sync {
    fn chat(
        &self,
        messages: &[Message],
        tools: &[&ToolDefinition],
    ) -> impl Future<Output = anyhow::Result<AssistantTurn>> + Send;
}
```
## The core types
Open `mini-claw-code-starter/src/types.rs`. These types are already defined for you — read them to understand the protocol:
```mermaid
classDiagram
    class Provider {
        <<trait>>
        +chat(messages, tools) AssistantTurn
    }
    class AssistantTurn {
        text: Option~String~
        tool_calls: Vec~ToolCall~
        stop_reason: StopReason
        usage: Option~TokenUsage~
    }
    class StopReason {
        <<enum>>
        Stop
        ToolUse
    }
    class Message {
        <<enum>>
        System(String)
        User(String)
        Assistant(AssistantTurn)
        ToolResult
    }
    Provider --> AssistantTurn : returns
    Provider --> Message : receives
    AssistantTurn --> StopReason
    AssistantTurn --> ToolCall : contains 0..*
```
The LLM responds with an `AssistantTurn`:
```rust
pub struct AssistantTurn {
    pub text: Option<String>,      // what the LLM said
    pub tool_calls: Vec<ToolCall>, // tools it wants to call
    pub stop_reason: StopReason,   // Stop or ToolUse
    pub usage: Option<TokenUsage>, // token counts (optional)
}
```
Two outcomes (see the sketch after this list):

- `StopReason::Stop` — the LLM is done, read `text` for the answer
- `StopReason::ToolUse` — the LLM wants to call tools, read `tool_calls`
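In calling code, that branch is a single `match`. A minimal sketch, assuming the types from `src/types.rs` are in scope; `handle_tool_call` and the `name` field access are hypothetical placeholders, not part of the starter crate:

```rust
// Sketch only: assumes AssistantTurn, StopReason, and ToolCall from
// src/types.rs are in scope.
fn handle_turn(turn: &AssistantTurn) {
    match turn.stop_reason {
        // The LLM is done: the answer (if any) is in `text`.
        StopReason::Stop => {
            if let Some(text) = &turn.text {
                println!("{text}");
            }
        }
        // The LLM wants tools run: dispatch each requested call.
        StopReason::ToolUse => {
            for call in &turn.tool_calls {
                handle_tool_call(call);
            }
        }
    }
}

// Hypothetical stand-in for the dispatch a real agent would do.
fn handle_tool_call(call: &ToolCall) {
    println!("would call tool: {}", call.name);
}
```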
That's it. Every coding agent — Claude Code, Cursor, Copilot — runs on this exact protocol.
## Key Rust concept: Mutex for interior mutability
The `Provider` trait takes `&self` (not `&mut self`) because providers are shared across async tasks. But `MockProvider` needs to mutate its response queue. The solution is `Mutex<VecDeque<AssistantTurn>>` — it lets you mutate the queue through a shared reference.
```rust
pub struct MockProvider {
    responses: Mutex<VecDeque<AssistantTurn>>,
}
```
This pattern — `Mutex` around shared state in a `&self` method — appears throughout async Rust.
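To see the pattern in isolation, here is a self-contained sketch using only `std` (nothing from the starter crate): a counter that mutates through `&self` by locking a `Mutex`.

```rust
use std::sync::Mutex;

struct Counter {
    count: Mutex<u32>,
}

impl Counter {
    // Takes `&self`, not `&mut self`, yet still increments the inner value.
    fn bump(&self) -> u32 {
        let mut count = self.count.lock().unwrap(); // exclusive access while held
        *count += 1;
        *count
    }
}

fn main() {
    let c = Counter { count: Mutex::new(0) };
    assert_eq!(c.bump(), 1);
    assert_eq!(c.bump(), 2); // same shared `&c`, state still changed
}
```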
## The implementation
Open `src/mock.rs`. You'll see the struct definition and two stubs.
### Step 1: `new()`
Wrap the `VecDeque` in a `Mutex`:
```rust
pub fn new(responses: VecDeque<AssistantTurn>) -> Self {
    Self {
        responses: Mutex::new(responses),
    }
}
```
### Step 2: `chat()`
Lock the mutex, pop the front response, and convert `None` to an error:
```rust
async fn chat(
    &self,
    _messages: &[Message],
    _tools: &[&ToolDefinition],
) -> anyhow::Result<AssistantTurn> {
    self.responses
        .lock()
        .unwrap()
        .pop_front()
        .ok_or_else(|| anyhow::anyhow!("MockProvider: no more responses"))
}
```
Three lines of logic. The mock ignores messages and tools entirely — it just returns the next canned response.
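Here is a hedged usage sketch, assuming the starter crate's types are in scope and tokio is the async test runtime (your workspace's test setup may differ):

```rust
use std::collections::VecDeque;

// Assumes MockProvider, AssistantTurn, StopReason, and Provider are in
// scope, and that tokio provides the test runtime.
#[tokio::test]
async fn mock_roundtrip() -> anyhow::Result<()> {
    let provider = MockProvider::new(VecDeque::from([AssistantTurn {
        text: Some("Hello!".to_string()),
        tool_calls: vec![],
        stop_reason: StopReason::Stop,
        usage: None,
    }]));

    // First call pops the canned turn in FIFO order...
    let turn = provider.chat(&[], &[]).await?;
    assert_eq!(turn.text.as_deref(), Some("Hello!"));

    // ...then the queue is empty, so the next call errors.
    assert!(provider.chat(&[], &[]).await.is_err());
    Ok(())
}
```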
## Run the tests
```bash
cargo test -p mini-claw-code-starter test_mock_
```
These tests verify your mock:

- `test_mock_returns_text` — basic text response
- `test_mock_returns_tool_calls` — response with tool calls
- `test_mock_steps_through_sequence` — FIFO order across multiple calls
- `test_mock_empty_responses_exhausted` — error when queue is empty
- `test_mock_ignores_messages_and_tools` — mock doesn't look at inputs
- `test_mock_long_sequence` — 10 responses consumed in order
## What just happened
You implemented the `Provider` trait — the interface every LLM backend must satisfy. The `MockProvider` is your testing workhorse. Every test in this entire course uses it instead of calling a real API.
Later (Chapter 5b) you'll see `OpenRouterProvider`, which makes real HTTP calls. But the trait is the same. Swap the provider, and the rest of the code doesn't change.
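As a sketch of what that buys you, consider a hypothetical helper written only against the trait (`answer` is not in the starter crate); nothing in it names a concrete provider, so `MockProvider` and `OpenRouterProvider` slot in interchangeably:

```rust
// Hypothetical helper: generic over any Provider implementation.
async fn answer(provider: &impl Provider, prompt: &str) -> anyhow::Result<AssistantTurn> {
    let messages = [Message::User(prompt.to_string())];
    // No tools offered here; the provider just answers the prompt.
    provider.chat(&messages, &[]).await
}
```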
## Key takeaway
An LLM is a function: messages in → (text, tool_calls, stop_reason) out. Everything else is plumbing.