Skip to main content
Draft. Not yet editorially reviewed.
Langfuse is the observability layer of the Agentic Data Stack. It records what the agent did in LibreChat, so you can debug it, measure quality, and track cost. Built on OpenTelemetry, Langfuse runs on ClickHouse.

Trace and inspect every run

Every conversation is captured as a Langfuse trace: the prompts, each tool call (including the SQL the agent ran), and the response. Each trace also records token usage, cost, and latency. Open a run to see what the agent did and where it failed. Sort by user and session to see who is spending the most.

Score output quality

Model output is nondeterministic, so Langfuse lets you measure quality instead of guessing at it. Score runs with human annotation or an automated LLM-as-a-judge evaluator, for example to flag when an answer is wrong or an analysis is unhelpful.

In the bundle

The bundle wires LibreChat to Langfuse, so every run is traced automatically, with no instrumentation to add. Langfuse stores its data in the stack’s ClickHouse. To run it as part of the stack, see the Docker setup guide. To send traces from a standalone LibreChat instance, or to use a regional or HIPAA Langfuse endpoint, see the Langfuse companion guide. For Langfuse on ClickHouse generally, see the Langfuse overview.
Prefer a managed experience? Langfuse Cloud is a fully managed deployment powered by a managed ClickHouse cluster — no infrastructure to run.
Last modified on June 12, 2026