crmne · kieranklaassen · Apr 18, 2025
diff --git a/CLAUDE.md b/CLAUDE.md
@@ -0,0 +1,23 @@
+# CLAUDE.md
+
+This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
+
+## Build & Test Commands
+- Build: `bundle exec rake build`
+- Install dependencies: `bundle install`
+- Run all tests: `bundle exec rspec`
+- Run specific test: `bundle exec rspec spec/ruby_llm/chat_spec.rb`
+- Run specific test by description: `bundle exec rspec -e "description"`
+- Re-record VCR cassettes: `bundle exec rake vcr:record[all]` or `bundle exec rake vcr:record[openai,anthropic]`
+- Check style: `bundle exec rubocop`
+- Auto-fix style: `bundle exec rubocop -A`
+
+## Code Style Guidelines
+- Follow [Standard Ruby](https://github.com/testdouble/standard) style
+- Use frozen_string_literal comment at the top of each file
+- Follow model naming conventions from CONTRIBUTING.md when adding providers
+- Use RSpec for tests with descriptive test names that form clean VCR cassettes
+- Handle errors with specific error classes from RubyLLM::Error
+- Use method keyword arguments with Ruby 3+ syntax
+- Document public APIs with YARD comments
+- Maintain backward compatibility for minor version changes
diff --git a/README.md b/README.md
@@ -15,12 +15,9 @@ A delightful Ruby way to work with AI. No configuration madness, no complex call
   <img src="https://upload.wikimedia.org/wikipedia/commons/e/ec/DeepSeek_logo.svg" alt="DeepSeek" height="40" width="120">
 </div>
 
-<a href="https://badge.fury.io/rb/ruby_llm"><img src="https://badge.fury.io/rb/ruby_llm.svg" alt="Gem Version" /></a>
-<a href="https://github.com/testdouble/standard"><img src="https://img.shields.io/badge/code_style-standard-brightgreen.svg" alt="Ruby Style Guide" /></a>
-<a href="https://rubygems.org/gems/ruby_llm"><img alt="Gem Downloads" src="https://img.shields.io/gem/dt/ruby_llm"></a>
-<a href="https://codecov.io/gh/crmne/ruby_llm"><img src="https://codecov.io/gh/crmne/ruby_llm/branch/main/graph/badge.svg" alt="codecov" /></a>
+<a href="https://badge.fury.io/rb/ruby_llm"><img src="https://badge.fury.io/rb/ruby_llm.svg" alt="Gem Version" /></a> <a href="https://github.com/testdouble/standard"><img src="https://img.shields.io/badge/code_style-standard-brightgreen.svg" alt="Ruby Style Guide" /></a> <a href="https://rubygems.org/gems/ruby_llm"><img alt="Gem Downloads" src="https://img.shields.io/gem/dt/ruby_llm"></a> <a href="https://codecov.io/gh/crmne/ruby_llm"><img src="https://codecov.io/gh/crmne/ruby_llm/branch/main/graph/badge.svg" alt="codecov" /></a>
 
-🤺 Battle tested at [💬  Chat with Work](https://chatwithwork.com)
+🤺 Battle tested at [💬 Chat with Work](https://chatwithwork.com)
 
 ## The problem with AI libraries
 
@@ -36,6 +33,7 @@ RubyLLM fixes all that. One beautiful API for everything. One consistent format.
 - 🖼️ **Image generation** with DALL-E and other providers
 - 📊 **Embeddings** for vector search and semantic analysis
 - 🔧 **Tools** that let AI use your Ruby code
+- 📝 **Structured Output** with JSON schema validation
 - 🚂 **Rails integration** to persist chats and messages with ActiveRecord
 - 🌊 **Streaming** responses with proper Ruby patterns
 
@@ -83,6 +81,28 @@ class Weather < RubyLLM::Tool
 end
 
 chat.with_tool(Weather).ask "What's the weather in Berlin? (52.5200, 13.4050)"
+
+# Get structured output with JSON schema validation
+schema = {
+  type: "object",
+  properties: {
+    name: { type: "string" },
+    age: { type: "integer" },
+    interests: {
+      type: "array",
+      items: { type: "string" }
+    }
+  },
+  required: ["name", "age", "interests"]
+}
+
+# The response's content is a parsed Hash instead of plain text
+user_data = chat.with_response_format(schema).ask("Create a profile for a Ruby developer")
+
+# Access the structured data using hash keys
+puts "Name: #{user_data.content['name']}"              # => "Jane Smith"
+puts "Age: #{user_data.content['age']}"                # => 32
+puts "Interests: #{user_data.content['interests'].join(', ')}"  # => "Ruby, Rails, API design"
 ```
 
 ## Installation
@@ -214,6 +234,7 @@ Check out the guides at https://rubyllm.com for deeper dives into conversations
 We welcome contributions to RubyLLM!
 
 See [CONTRIBUTING.md](CONTRIBUTING.md) for detailed instructions on how to:
+
 - Run the test suite
 - Add new features
 - Update documentation

diff --git a/docs/_data/navigation.yml b/docs/_data/navigation.yml
@@ -19,6 +19,8 @@
       url: /guides/image-generation
     - title: Embeddings
       url: /guides/embeddings
+    - title: Structured Output
+      url: /guides/structured-output
     - title: Error Handling
       url: /guides/error-handling
     - title: Models

diff --git a/docs/guides/index.md b/docs/guides/index.md
@@ -33,6 +33,9 @@ Learn how to generate images using DALL-E and other providers.
 ### [Embeddings]({% link guides/embeddings.md %})
 Explore how to create vector embeddings for semantic search and other applications.
 
+### [Structured Output]({% link guides/structured-output.md %})
+Learn how to use JSON schemas to get validated structured data from LLMs.
+
 ### [Error Handling]({% link guides/error-handling.md %})
 Master the techniques for robust error handling in AI applications.
 

diff --git a/docs/guides/rails.md b/docs/guides/rails.md
@@ -25,6 +25,7 @@ After reading this guide, you will know:
 *   How to set up ActiveRecord models for persisting chats and messages.
 *   How to use `acts_as_chat` and `acts_as_message`.
 *   How chat interactions automatically persist data.
+*   How to work with structured output in your Rails models.
 *   A basic approach for integrating streaming responses with Hotwire/Turbo Streams.
 
 ## Setup
@@ -174,6 +175,89 @@ system_message = chat_record.messages.find_by(role: :system)
 puts system_message.content # => "You are a concise Ruby expert."
 ```
 
+## Working with Structured Output
+
+RubyLLM 1.3.0+ supports structured output with JSON schema validation. This works with the Rails integration, so you can request structured data from AI models and persist it.
+
+### Database Considerations
+
+For best results with structured output, use a database that supports JSON data natively:
+
+```ruby
+# For PostgreSQL, use jsonb for the content column
+class CreateMessages < ActiveRecord::Migration[7.1]
+  def change
+    create_table :messages do |t|
+      t.references :chat, null: false, foreign_key: true
+      t.string :role
+      t.jsonb :content # Use jsonb instead of text for PostgreSQL
+      # ...other fields...
+    end
+  end
+end
+```
+
+For databases without native JSON support, you can use text columns with serialization:
+
+```ruby
+# app/models/message.rb
+class Message < ApplicationRecord
+  acts_as_message
+  serialize :content, coder: JSON # For text columns (Rails 7.1+; older versions use `serialize :content, JSON`)
+end
+```
+
+### Using Structured Output
+
+The `with_response_format` method is available on your `Chat` model thanks to `acts_as_chat`:
+
+```ruby
+# Make sure to use a model that supports structured output
+chat_record = Chat.create!(model_id: 'gpt-4.1-nano')
+
+# Define your JSON schema
+schema = {
+  type: "object",
+  properties: {
+    name: { type: "string" },
+    version: { type: "string" },
+    features: {
+      type: "array",
+      items: { type: "string" }
+    }
+  },
+  required: ["name", "version"]
+}
+
+begin
+  # Get structured data instead of plain text
+  response = chat_record.with_response_format(schema).ask("Tell me about Ruby")
+
+  # The response content is a Hash (or serialized JSON in text columns)
+  response.content # => {"name"=>"Ruby", "version"=>"3.2.0", "features"=>["Blocks", "Procs"]}
+
+  # You can access the persisted message as usual
+  message = chat_record.messages.where(role: 'assistant').last
+  message.content['name'] # => "Ruby"
+
+  # In your views, you can easily display structured data:
+  # <%= message.content['name'] %> <%= message.content['version'] %>
+  # <ul>
+  #   <% message.content['features'].each do |feature| %>
+  #     <li><%= feature %></li>
+  #   <% end %>
+  # </ul>
+rescue RubyLLM::UnsupportedStructuredOutputError => e
+  # Handle case where the model doesn't support structured output
+  puts "This model doesn't support structured output: #{e.message}"
+rescue RubyLLM::InvalidStructuredOutput => e
+  # Handle case where the model returns invalid JSON
+  puts "The model returned invalid JSON: #{e.message}"
+end
+```
+
+With this approach, you can build robust data-driven applications that leverage the structured output capabilities of AI models while properly handling errors.
+
 ## Streaming Responses with Hotwire/Turbo
 
 You can combine `acts_as_chat` with streaming and Turbo Streams for real-time UI updates. The persistence logic works seamlessly alongside the streaming block.
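
Note on the text-column fallback in the `docs/guides/rails.md` changes above: structured content stored in a text column is held as a JSON string. The sketch below approximates that round trip using only Ruby's standard `json` library. The `json_round_trip` helper is hypothetical and is not part of RubyLLM or Rails; it only illustrates that symbol keys come back as string keys, which is consistent with the guide reading `message.content['name']`.

```ruby
require "json"

# Hypothetical helper: approximates saving a Hash to a text column
# through a JSON coder and loading it back.
def json_round_trip(content)
  stored = JSON.generate(content) # Hash -> JSON String (what the column would hold)
  JSON.parse(stored)              # JSON String -> Hash with String keys
end

original = { name: "Ruby", version: "3.2.0", features: ["Blocks", "Procs"] }
loaded = json_round_trip(original)

puts loaded["name"]            # prints "Ruby"
puts loaded.key?(:name)        # prints "false": symbol keys do not survive
puts loaded["features"].length # prints 2
```

If symbol access is preferred, `JSON.parse(stored, symbolize_names: true)` is one option, at the cost of diverging from what the persisted record returns.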

diff --git a/docs/guides/structured-output.md b/docs/guides/structured-output.md
@@ -0,0 +1,160 @@
+---
+layout: default
+title: Structured Output
+parent: Guides
+nav_order: 7
+---
+
+# Structured Output
+
+RubyLLM allows you to request structured data from language models by providing a JSON schema. When you use the `with_response_format` method, RubyLLM asks the model for JSON matching your schema instead of free-form text and parses the reply into a Hash.
+
+## Schema-Based Output (Recommended)
+
+We recommend providing a schema for structured data:
+
+```ruby
+# Define a JSON schema
+schema = {
+  type: "object",
+  properties: {
+    name: { type: "string" },
+    age: { type: "integer" },
+    interests: { type: "array", items: { type: "string" } }
+  },
+  required: ["name", "age", "interests"]
+}
+
+response = RubyLLM.chat(model: "gpt-4o")
+  .with_response_format(schema)
+  .ask("Create a profile for a Ruby developer")
+```
+
+RubyLLM intelligently handles your schema based on the model's capabilities:
+
+- For models with native schema support (like GPT-4o): Uses API-level schema validation
+- For models without schema support: Raises `UnsupportedStructuredOutputError` in the default strict mode; with `strict: false`, adds schema instructions to the system message instead
+
+## Simple JSON Mode (Alternative)
+
+For cases where you just need well-formed JSON:
+
+```ruby
+response = RubyLLM.chat(model: "gpt-4.1-nano")
+  .with_response_format(:json)
+  .ask("Create a profile for a Ruby developer")
+```
+
+This uses OpenAI's `response_format: {type: "json_object"}` parameter, works with most OpenAI models, and guarantees valid JSON without enforcing a specific structure.
+
+## Strict and Non-Strict Modes
+
+By default, RubyLLM operates in "strict mode" which only allows models that officially support the requested output format. If you try to use a schema with a model that doesn't support schema validation, RubyLLM will raise an `UnsupportedStructuredOutputError`.
+
+For broader compatibility, you can disable strict mode:
+
+```ruby
+# Use schema with a model that doesn't currently support schema validation on RubyLLM
+response = RubyLLM.chat(model: "gemini-2.0-flash")
+  .with_response_format(schema, strict: false)
+  .ask("Create a profile for a Ruby developer")
+```
+
+In non-strict mode:
+
+- RubyLLM doesn't validate if the model supports the requested format
+- The schema is automatically added to the system message
+- JSON parsing is handled automatically
+- Works with most models that can produce JSON output, including Claude and Gemini
+
+This allows you to use schema-based output with a wider range of models, though without API-level schema validation.
+
+## Error Handling
+
+RubyLLM provides two main error types for structured output:
+
+1. **UnsupportedStructuredOutputError**: Raised when using schema-based output with a model that doesn't support it in strict mode.
+2. **InvalidStructuredOutput**: Raised if the model returns invalid JSON.
+
+Note: RubyLLM checks that responses are valid JSON but doesn't verify conformance to the schema structure. For full schema validation, use a library like `json-schema`.
+
+## With ActiveRecord and Rails
+
+The structured output feature works seamlessly with RubyLLM's Rails integration. Message content can be either a String or a Hash.
+
+If you're storing message content in your database, ensure your messages table can store JSON. PostgreSQL's `jsonb` column type is ideal:
+
+```ruby
+# In a migration
+create_table :messages do |t|
+  t.references :chat
+  t.string :role
+  t.jsonb :content # Use jsonb for efficient JSON storage
+  # other fields...
+end
+```
+
+If you have an existing application with a text-based content column, add serialization:
+
+```ruby
+# In your Message model
+class Message < ApplicationRecord
+  serialize :content, coder: JSON # Rails 7.1+; older versions use `serialize :content, JSON`
+  acts_as_message
+end
+```
+
+## Tips for Effective Schemas
+
+1. **Be specific**: Provide clear property descriptions to guide the model.
+2. **Start simple**: Begin with basic schemas and add complexity gradually.
+3. **Include required fields**: Specify which properties are required.
+4. **Use appropriate types**: Match JSON Schema types to your expected data.
+5. **Validate locally**: Consider using a gem like `json-schema` for additional validation.
+6. **Test model compatibility**: Different models have different levels of schema support.
+
+## Example: Complex Schema
+
+```ruby
+schema = {
+  type: "object",
+  properties: {
+    products: {
+      type: "array",
+      items: {
+        type: "object",
+        properties: {
+          name: { type: "string" },
+          price: { type: "number" },
+          in_stock: { type: "boolean" },
+          categories: {
+            type: "array",
+            items: { type: "string" }
+          }
+        },
+        required: ["name", "price", "in_stock"]
+      }
+    },
+    total_products: { type: "integer" },
+    store_info: {
+      type: "object",
+      properties: {
+        name: { type: "string" },
+        location: { type: "string" }
+      }
+    }
+  },
+  required: ["products", "total_products"]
+}
+
+inventory = chat.with_response_format(schema)  # Let RubyLLM handle the schema formatting
+  .ask("Create an inventory for a Ruby gem store")
+```
+
+### Limitations
+
+- Schema validation is only available at the API level for certain OpenAI models
+- No enforcement of required fields or data types without external validation
+- For full schema validation, use a library like `json-schema` to verify the output
+
+RubyLLM handles all the complexity of supporting different model capabilities, so you can focus on your application logic.
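
Note on the Limitations section of `docs/guides/structured-output.md` above: the guide says responses are checked for valid JSON but not for conformance to the schema, and points to the `json-schema` gem for full validation. As a rough illustration of what such a local check involves, here is a minimal standard-library sketch. It is a deliberately simplified stand-in, not the `json-schema` gem and not RubyLLM's own code: `SchemaMismatch`, `TYPE_CHECKS`, `schema_problems`, and `parse_and_check` are all hypothetical names, and only top-level `required` keys and primitive `type` values are checked.

```ruby
require "json"

# Hypothetical error class for this sketch; not part of RubyLLM.
class SchemaMismatch < StandardError; end

# Maps a few JSON Schema primitive types to Ruby checks.
# Simplified: nested "properties"/"items", formats, and enums are ignored.
TYPE_CHECKS = {
  "string"  => ->(v) { v.is_a?(String) },
  "integer" => ->(v) { v.is_a?(Integer) },
  "number"  => ->(v) { v.is_a?(Numeric) },
  "boolean" => ->(v) { v == true || v == false },
  "array"   => ->(v) { v.is_a?(Array) },
  "object"  => ->(v) { v.is_a?(Hash) }
}.freeze

# Returns a list of human-readable problems; an empty list means the data passed.
def schema_problems(data, schema)
  return ["expected a JSON object"] unless data.is_a?(Hash)

  problems = []
  schema.fetch(:required, []).each do |key|
    problems << "missing required key: #{key}" unless data.key?(key.to_s)
  end
  schema.fetch(:properties, {}).each do |name, spec|
    key = name.to_s
    next unless data.key?(key)

    check = TYPE_CHECKS[spec[:type]]
    next if check.nil? || check.call(data[key])

    problems << "#{key} should be #{spec[:type]}"
  end
  problems
end

# Step 1: parse (raises JSON::ParserError on malformed JSON).
# Step 2: check the parsed data against the schema.
def parse_and_check(raw_json, schema)
  data = JSON.parse(raw_json)
  problems = schema_problems(data, schema)
  raise SchemaMismatch, problems.join("; ") unless problems.empty?

  data
end

schema = {
  type: "object",
  properties: { name: { type: "string" }, age: { type: "integer" } },
  required: ["name", "age"]
}

p schema_problems(JSON.parse('{"name":"Jane","age":"32"}'), schema)
# prints ["age should be integer"]
```

For anything beyond this level (nested objects, array items, formats), a dedicated validator such as the `json-schema` gem mentioned in the guide is the more realistic choice.
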
diff --git a/docs/index.md b/docs/index.md
@@ -58,6 +58,7 @@ RubyLLM fixes all that. One beautiful API for everything. One consistent format.
 - 🖼️ **Image generation** with DALL-E and other providers
 - 📊 **Embeddings** for vector search and semantic analysis
 - 🔧 **Tools** that let AI use your Ruby code
+- 📝 **Structured Output** with JSON schema validation
 - 🚂 **Rails integration** to persist chats and messages with ActiveRecord
 - 🌊 **Streaming** responses with proper Ruby patterns
 
@@ -105,6 +106,23 @@ class Weather < RubyLLM::Tool
 end
 
 chat.with_tool(Weather).ask "What's the weather in Berlin? (52.5200, 13.4050)"
+
+# Get structured output with JSON schema validation
+schema = {
+  type: "object",
+  properties: {
+    name: { type: "string" },
+    age: { type: "integer" },
+    interests: {
+      type: "array",
+      items: { type: "string" }
+    }
+  },
+  required: ["name", "age", "interests"]
+}
+
+# The response's content is a parsed Hash instead of plain text
+user_data = chat.with_response_format(schema).ask("Create a profile for a Ruby developer")
 ```
 
 ## Quick start