JSON Generation and Structured Outputs

This document covers the agent-swarm-kit's JSON generation and structured output system, which enables AI agents to produce well-formatted, validated JSON responses according to predefined schemas. The system provides a framework for defining output formats, validating generated content, and ensuring consistency across multiple generation attempts.

For information about AI model integration and completion services, see Completion Adapters. For details about agent tool execution and function calling, see Model Context Protocol (MCP).

Core Concepts

The JSON generation system is built around outline schemas that define the expected structure, validation rules, and generation parameters for structured outputs. Each outline schema specifies a target JSON format and the prompts needed to guide AI models in producing compliant responses.

Outline Schema Structure

The IOutlineSchema interface defines the complete configuration for JSON generation:

interface IOutlineSchema {
  outlineName: OutlineName;
  format: IOutlineFormat;
  prompt: string | ((outlineName: OutlineName) => Promise<string> | string);
  system?: string[] | ((outlineName: OutlineName) => Promise<string[]> | string[]);
  completion: CompletionName;
  validations?: IOutlineValidationFn[];
  getOutlineHistory: (args: IOutlineArgs) => Promise<void>;
  callbacks?: Partial<IOutlineCallbacks>;
}

Format Definitions

The system supports two format types defined by IOutlineFormat:

Schema Format (IOutlineSchemaFormat): Full JSON schema specification

interface IOutlineSchemaFormat {
  type: string;
  json_schema: object;
}

Object Format (IOutlineObjectFormat): Simplified property-based definition

interface IOutlineObjectFormat {
  type: string;
  required: string[];
  properties: {
    [key: string]: {
      type: string;
      description: string;
      enum?: string[];
    };
  };
}

System Architecture

The JSON generation system consists of several interconnected components that handle schema management, generation attempts, and validation.

Component Overview

Mermaid Diagram

Service Integration

Mermaid Diagram

JSON Generation Flow

The json function orchestrates the entire generation process through multiple attempts to produce valid structured output.

Generation Process

Mermaid Diagram

Attempt Management

Each generation attempt follows this pattern:

History Creation: Fresh OutlineHistory instance for each attempt
Prompt Assembly: Combines base prompt, system messages, and completion flags
History Population: Calls getOutlineHistory to add context-specific messages
Completion Request: Sends message history to AI provider with format specification
JSON Parsing: Attempts to parse response as valid JSON
Validation: Runs all validation functions against parsed data
Result Handling: Returns success or continues to next attempt

Schema Definition and Management

The OutlineSchemaService provides centralized management of outline schemas with support for context-specific registries and validation.

Schema Registration

Mermaid Diagram

Schema Validation

The service validates schemas on registration to ensure:

outlineName is a non-empty string
getOutlineHistory is a callable function
validations array contains valid validation functions or objects
Required properties are present and correctly typed

// Schema validation example
private validateSchema(schema: IOutlineSchema): void {
  if (!schema.outlineName || typeof schema.outlineName !== 'string') {
    throw new Error('Invalid outlineName');
  }
  if (typeof schema.getOutlineHistory !== 'function') {
    throw new Error('getOutlineHistory must be a function');
  }
  // Additional validation logic...
}

Validation and Error Handling

The validation system ensures generated JSON meets specified requirements through configurable validation functions.

Validation Function Types

type IOutlineValidationFn = 
  | ((args: IOutlineValidationArgs) => Promise<void> | void)
  | {
      validate: (args: IOutlineValidationArgs) => Promise<void> | void;
      name?: string;
    };

Validation Arguments

Each validation function receives:

attempt: Current attempt number
format: Expected output format
param: Input parameters
data: Parsed JSON data
history: Message history for context

Error Recovery

When validation fails or JSON parsing errors occur:

Error Logging: Detailed error information logged with attempt context
Retry Logic: Automatic retry up to maxAttempts (default 5)
Final Failure: Returns IOutlineResult with isValid: false and error details
History Preservation: Full message history available for debugging

// Error handling example from json function
try {
  const data = JSON.parse(output.content) as IOutlineData;
  // Run validations...
  return { isValid: true, data, /* ... */ };
} catch (error) {
  errorMessage = getErrorMessage(error);
  console.error(`agent-swarm outline error outlineName=${outlineName}`, {
    param, errorMessage
  });
}

Integration with Completion Services

The JSON generation system integrates with the completion service architecture to leverage different AI providers and their specific capabilities.

Completion Service Integration

Mermaid Diagram

Format Conversion

The toJsonSchema helper function converts IOutlineObjectFormat to OpenAI-compatible JSON schema format:

export const toJsonSchema = (name: string, schema: IOutlineObjectFormat): IOutlineSchemaFormat => ({
  type: "json_schema",
  json_schema: {
    name,
    strict: true,
    schema,
  },
});

Provider-Specific Handling

Different completion providers handle structured output differently:

OpenAI: Uses json_schema format with strict validation
Ollama: Uses simplified object format specification
Claude: Relies on prompt engineering for structure guidance

The system abstracts these differences through the completion service layer, allowing outline schemas to work consistently across providers.

Usage Patterns

The JSON generation system supports various usage patterns for different structured output requirements.

Basic Usage

// Simple data extraction
const result = await json<UserData>("extract_user_info", {
  text: "John Doe, age 30, works as engineer"
});

if (result.isValid) {
  console.log(result.data); // Typed UserData object
}

Complex Validation

// Multi-step validation with custom rules
const schema: IOutlineSchema = {
  outlineName: "analyze_sentiment",
  format: {
    type: "object",
    required: ["sentiment", "confidence", "keywords"],
    properties: {
      sentiment: { type: "string", enum: ["positive", "negative", "neutral"] },
      confidence: { type: "number", description: "0-1 confidence score" },
      keywords: { type: "array", description: "Key phrases" }
    }
  },
  validations: [
    // Confidence range validation
    ({ data }) => {
      if (data.confidence < 0 || data.confidence > 1) {
        throw new Error("Confidence must be 0-1");
      }
    },
    // Keyword validation
    {
      name: "keyword_validator",
      validate: ({ data }) => {
        if (!Array.isArray(data.keywords) || data.keywords.length === 0) {
          throw new Error("Keywords required");
        }
      }
    }
  ]
};

History Context Management

// Dynamic history population
const schema: IOutlineSchema = {
  outlineName: "contextual_analysis",
  getOutlineHistory: async ({ param, history, attempt }) => {
    // Add context based on input
    await history.push({
      role: "system",
      content: `Context: ${param.context}`
    });
    
    // Add examples on retry attempts
    if (attempt > 0) {
      await history.push({
        role: "system", 
        content: "Previous attempt failed. Here's an example..."
      });
    }
  }
};

JSON Generation and Structured Outputs

Core Concepts

Outline Schema Structure

Format Definitions

System Architecture

Component Overview

Service Integration

JSON Generation Flow

Generation Process

Attempt Management

Schema Definition and Management

Schema Registration

Schema Validation

Validation and Error Handling

Validation Function Types

Validation Arguments

Error Recovery

Integration with Completion Services

Completion Service Integration

Format Conversion

Provider-Specific Handling

Usage Patterns

Basic Usage

Complex Validation

History Context Management

Settings

On This Page