Draft. Not yet editorially reviewed.
Trace and inspect every run
Every conversation is captured as a Langfuse trace: the prompts, each tool call (including the SQL the agent ran), and the response. Each trace also records token usage, cost, and latency. Open a run to see what the agent did and where it failed. Sort by user and session to see who is spending the most.Score output quality
Model output is nondeterministic, so Langfuse lets you measure quality instead of guessing at it. Score runs with human annotation or an automated LLM-as-a-judge evaluator, for example to flag when an answer is wrong or an analysis is unhelpful.In the bundle
The bundle wires LibreChat to Langfuse, so every run is traced automatically, with no instrumentation to add. Langfuse stores its data in the stack’s ClickHouse. To run it as part of the stack, see the Docker setup guide. To send traces from a standalone LibreChat instance, or to use a regional or HIPAA Langfuse endpoint, see the Langfuse companion guide. For Langfuse on ClickHouse generally, see the Langfuse overview.Prefer a managed experience? Langfuse Cloud is a fully managed deployment powered by a managed ClickHouse cluster — no infrastructure to run.