
LangWatch


Get started | Scenarios | Instrument with MCP | Documentation

Welcome to LangWatch, an open-source platform for building and shipping reliable LLM-powered agents.

LangWatch combines observability, evaluation, and scenario-based testing to help teams understand agent behavior across real workflows — from production traces to simulated failures.

It empowers domain experts to review and score conversations, developers to debug and evaluate agents end-to-end, and business teams to track quality, usage, and cost with custom analytics.


You can sign up and start integrating on our free tier by following the guides below:

🚀 Quick Start

Ship safer agents in minutes. Create a free account, then dive into these guides:

🔑 Key Projects

  • LangWatch: The core open-source platform for observing, evaluating, and testing LLM-powered agents.

  • Scenarios: End-to-end simulations for multi-step, tool-using agents across real workflows.

  • Better Agents: A standard and tooling ecosystem for building production-grade AI agents.

  • Docs: Source for LangWatch documentation.

🤝 Contributing

Open source is at the heart of LangWatch. We welcome issues, pull requests, and discussions of new ideas.

Please read our Contribution Guidelines for details on our code of conduct and contribution process.

🛟 Support

Need help or want to get involved?

Popular repositories

  1. langwatch Public

    The platform for LLM evaluations and AI agent testing

    TypeScript | 2.8k stars | 254 forks

  2. better-agents Public

    Standards for building agents, better

    TypeScript | 1.5k stars | 152 forks

  3. scenario Public

    Agentic testing for agentic codebases

    TypeScript | 780 stars | 53 forks

  4. langevals Public

    LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores and LLM guardrails, for you to protect and benchmark your LLM…

    70 stars | 10 forks

  5. data-simulator Public

    Synthetic Data Generation

    Jupyter Notebook | 9 stars | 1 fork

  6. cookbooks Public

    Example projects that use LangWatch's features.

    Jupyter Notebook | 9 stars | 1 fork

Repositories

Showing 10 of 29 repositories
  • langwatch Public

    The platform for LLM evaluations and AI agent testing

    TypeScript | 2,833 stars | 254 forks | 225 issues (1 issue needs help) | 111 pull requests | Updated Feb 22, 2026
  • better-agents Public

    Standards for building agents, better

    TypeScript | 1,474 stars | MIT license | 152 forks | 11 issues | 3 pull requests | Updated Feb 22, 2026
  • claude-resume Public
    TypeScript | 1 star | 0 forks | 0 issues | 0 pull requests | Updated Feb 21, 2026
  • scenario Public

    Agentic testing for agentic codebases

    TypeScript | 780 stars | MIT license | 53 forks | 25 issues | 15 pull requests | Updated Feb 20, 2026
  • langwatch-nebius Public

    LangWatch x Nebius: Comparing LLM models for AI agent quality using agent simulations

    Python | 1 star | 0 forks | 0 issues | 0 pull requests | Updated Feb 18, 2026
  • bank-example Public
    Python | 1 star | 0 forks | 0 issues | 3 pull requests | Updated Feb 17, 2026
  • claude-remote Public
    Shell | 2 stars | MIT license | 1 fork | 0 issues | 0 pull requests | Updated Feb 17, 2026
  • docs Public

    Docs for LangWatch LLM Ops Platform

    MDX | 3 stars | 3 forks | 0 issues | 4 pull requests | Updated Feb 17, 2026
  • langevals Public

    LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores and LLM guardrails, for you to protect and benchmark your LLM models and pipelines.

    70 stars | 10 forks | 3 issues (1 issue needs help) | 15 pull requests | Updated Feb 15, 2026
  • openclaw-phone-assistant Public

    Real-time voice assistant for OpenClaw — talk to Snaps over WebRTC

    Python | 2 stars | 0 forks | 0 issues | 0 pull requests | Updated Feb 11, 2026

