Observability, Testing & Evaluation

Monitor, debug, and evaluate your AI agents. Discover essential LLMOps and AgentOps tooling, such as LangSmith, for reliable deployment.

An evals and observability platform designed for building reliable AI agents.

Galileo AI offers evaluation and monitoring solutions for generative AI applications.

AgentOps provides monitoring, evaluation, and debugging tools for AI agent development, helping teams improve agent performance and reliability.
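
As a rough illustration, here is a minimal sketch of wiring AgentOps instrumentation into a Python agent. It assumes the `agentops` package is installed; the API key value is a placeholder.

```python
import agentops

# Initialize a monitored session; calls made through supported LLM client
# libraries (e.g. the OpenAI SDK) are then recorded for later inspection
# in the AgentOps dashboard.
agentops.init(api_key="your-agentops-api-key")  # placeholder key

# ... run the agent's normal logic here ...
```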

An experimentation and human annotation platform designed for AI development teams.

An open-source platform for managing and enhancing LLM chatbots.

LangSmith is a platform designed to streamline the development and deployment of LLM applications from prototype to production.
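
For a concrete sense of the workflow, here is a minimal sketch of tracing a single agent step with LangSmith's Python SDK. It assumes the `langsmith` package is installed and `LANGSMITH_API_KEY` is set in the environment; the `classify_ticket` function and its logic are hypothetical stand-ins for an LLM call.

```python
import os
from langsmith import traceable

# Enable tracing (assumes LANGSMITH_API_KEY is already set in the environment).
os.environ.setdefault("LANGSMITH_TRACING", "true")

@traceable(name="classify_ticket")  # "classify_ticket" is a hypothetical step name
def classify_ticket(text: str) -> str:
    # Placeholder logic standing in for an LLM call; the traced inputs and
    # outputs appear as a run in the LangSmith UI for debugging and evaluation.
    return "billing" if "invoice" in text.lower() else "general"

if __name__ == "__main__":
    print(classify_ticket("Where is my invoice for March?"))
```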

HoneyHive is a unified LLMOps platform offering AI evaluation, testing, observability, and collaborative prompt management for teams building LLM applications.