Observability Integrations
Observability is crucial for understanding, debugging, and optimizing your AI-powered applications. The AI SDK provides built-in telemetry support that integrates with leading observability platforms, enabling you to monitor model performance, track costs, analyze user interactions, and identify issues in production.

Why Observability Matters
As AI applications grow in complexity, observability becomes essential for:

- Performance Monitoring - Track response times, token usage, and throughput
- Cost Management - Monitor API costs across different models and providers
- Quality Assurance - Evaluate output quality and identify issues
- Debugging - Trace errors and unexpected behaviors through your application
- User Analytics - Understand how users interact with your AI features
- Compliance - Maintain audit trails for sensitive applications
How It Works
The AI SDK emits telemetry data through OpenTelemetry-compatible traces and spans. Observability providers can consume this data to provide insights into your application’s behavior. Most integrations work by configuring a telemetry exporter in your application:
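For example, an individual call can opt into telemetry with the AI SDK's `experimental_telemetry` option. A minimal sketch, assuming an OpenTelemetry exporter is already registered; the model and prompt are placeholders:

```ts
import { generateText } from 'ai';
import { openai } from '@ai-sdk/openai';

const { text } = await generateText({
  model: openai('gpt-4o'), // placeholder model
  prompt: 'What is observability?',
  // Opt this call into telemetry; spans flow to whatever
  // OpenTelemetry exporter the application has registered.
  experimental_telemetry: { isEnabled: true },
});

console.log(text);
```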
Available Observability Providers

Several LLM observability providers offer integrations that consume AI SDK telemetry data:

Production-Ready Platforms
- Axiom - Serverless log analytics and observability
- Braintrust - LLM evaluation and observability platform
- Confident AI - LLM testing and evaluation
- Helicone - Open-source LLM observability
- Langfuse - Open-source LLM engineering platform
- LangSmith - LangChain’s observability platform
- Laminar - LLM observability and analytics
- LangWatch - Conversation analytics for LLMs
- MLflow - Open-source ML lifecycle platform
- Maxim - AI quality and safety platform
- Scorecard - LLM evaluation and monitoring
- SigNoz - Open-source observability platform
- Traceloop - OpenTelemetry for LLM applications
- Weave - W&B’s toolkit for LLM applications
Additional Integrations
- HoneyHive - LLM evaluation and monitoring
- Sentry - Error tracking with AI SDK support
- Literal AI - LLM monitoring through model wrappers
Key Features by Provider
Tracing & Logging
All listed providers support comprehensive tracing and logging of AI SDK operations, including:

- Request/response logging
- Token usage tracking
- Latency measurements
- Error tracking
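As one concrete example, the token usage that these platforms track is also exposed on each AI SDK result, which is handy for spot-checking what a dashboard reports. A sketch with a placeholder model; the exact usage field names vary across SDK versions:

```ts
import { generateText } from 'ai';
import { openai } from '@ai-sdk/openai';

const result = await generateText({
  model: openai('gpt-4o'), // placeholder model
  prompt: 'Summarize observability in one sentence.',
  experimental_telemetry: { isEnabled: true },
});

// The same counts recorded on the telemetry spans are available locally.
console.log(result.usage);
```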
Evaluation & Testing
Some providers specialize in evaluation:

- Braintrust - Automated evaluations and prompt optimization
- Confident AI - LLM testing frameworks
- Scorecard - Quality scoring and monitoring
- Weave - Evaluation workflows
Analytics & Dashboards
Providers offering rich analytics:

- Axiom - Real-time log analytics
- Langfuse - Session tracking and user analytics
- LangWatch - Conversation analytics
- Helicone - Cost analytics and dashboards
Open Source Options
If you prefer self-hosted solutions:

- Helicone - Open-source observability
- Langfuse - Open-source LLM engineering platform
- MLflow - ML lifecycle management
- SigNoz - Open-source APM and observability
Choosing a Provider
When selecting an observability provider, consider:

- Deployment Model - Cloud-hosted vs. self-hosted
- Pricing - Based on volume, features, or seats
- Features - Tracing, evaluation, analytics, or all three
- Integration Complexity - SDK configuration vs. proxy setup
- Data Privacy - Where your data is stored and processed
- Supported Models - Compatibility with your AI providers
Getting Started
Most observability integrations follow these steps:

1. Sign up for the observability platform
2. Install the provider’s SDK or configure telemetry export
3. Add configuration to your AI SDK application (see the sketch after this list)
4. Deploy and start monitoring
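As a sketch of step 3 in a Next.js app, the `@vercel/otel` helper registers a tracer in `instrumentation.ts`; many providers instead document their own exporter or a plain OTLP endpoint, so treat the helper and service name here as assumptions:

```ts
// instrumentation.ts
import { registerOTel } from '@vercel/otel';

export function register() {
  // Registers an OpenTelemetry tracer so AI SDK spans are exported.
  // 'my-ai-app' is a placeholder service name.
  registerOTel({ serviceName: 'my-ai-app' });
}
```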
Telemetry Data Structure
The AI SDK emits structured telemetry including:

- Spans - Individual operations (generate, stream, embed)
- Attributes - Model ID, provider, settings, token counts
- Events - Tool calls, errors, completions
- Metrics - Duration, token usage, costs
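The `functionId` and `metadata` options on `experimental_telemetry` let you attach your own identifiers to these spans as attributes. A minimal sketch; the function ID and metadata values are placeholders:

```ts
import { generateText } from 'ai';
import { openai } from '@ai-sdk/openai';

const { text } = await generateText({
  model: openai('gpt-4o'), // placeholder model
  prompt: 'Classify this support ticket.',
  experimental_telemetry: {
    isEnabled: true,
    functionId: 'ticket-classifier',  // names the spans for this operation
    metadata: { userId: 'user-123' }, // recorded as span attributes
  },
});
```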
Multiple Providers
You can use multiple observability providers simultaneously for different purposes:
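One way to do this is to register one span processor per backend, so every trace fans out to each platform. A sketch using the OpenTelemetry Node SDK; the endpoint URLs are placeholders, and some platforms ship their own processor or exporter instead:

```ts
import { NodeSDK } from '@opentelemetry/sdk-node';
import { BatchSpanProcessor } from '@opentelemetry/sdk-trace-base';
import { OTLPTraceExporter } from '@opentelemetry/exporter-trace-otlp-http';

const sdk = new NodeSDK({
  spanProcessors: [
    // e.g. a tracing and analytics platform
    new BatchSpanProcessor(
      new OTLPTraceExporter({ url: 'https://provider-a.example.com/v1/traces' }),
    ),
    // e.g. a second platform focused on evaluation
    new BatchSpanProcessor(
      new OTLPTraceExporter({ url: 'https://provider-b.example.com/v1/traces' }),
    ),
  ],
});

sdk.start();
```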
Best Practices

- Start Simple - Begin with basic tracing before adding advanced features
- Sample in Production - Use sampling to manage costs at scale (see the sampler sketch after this list)
- Set Alerts - Configure alerts for errors, latency, and costs
- Review Regularly - Analyze traces and metrics weekly
- Test Evaluations - Validate evaluation criteria before relying on them
- Document Baselines - Track performance changes over time
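For the sampling recommendation above, OpenTelemetry's ratio-based sampler is one option. A sketch; the 10% rate is an arbitrary example:

```ts
import { NodeSDK } from '@opentelemetry/sdk-node';
import { TraceIdRatioBasedSampler } from '@opentelemetry/sdk-trace-base';

const sdk = new NodeSDK({
  // Record roughly 10% of traces to bound telemetry volume at scale.
  sampler: new TraceIdRatioBasedSampler(0.1),
  // ...plus the exporter configuration for your provider
});

sdk.start();
```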
Contributing
Do you have an observability integration that supports the AI SDK and has an integration guide? Please open a pull request to add it to the list.
Learn More
- AI SDK Telemetry Documentation
- OpenTelemetry Documentation
- Individual provider documentation pages (linked above)