Claude Code Skills·Claude Skills·The open SKILL.md registry for Claude

Langfuse vs Braintrust

Two eval and observability tools for the Claude Code ecosystem: side-by-side metadata, tag overlap, install commands, and when to reach for each.

Langfuse

OSS, self-hostable LLM observability — traces, evals, prompts, and datasets.

License: MIT
Install: pip package
Website: https://langfuse.com
GitHub: https://github.com/langfuse/langfuse
pip install langfuse

Braintrust

Eval-focused platform for LLM applications — datasets, scoring, A/B comparisons.

License: proprietary
Install: pip package
Website: https://www.braintrust.dev
GitHub: https://github.com/braintrustdata/braintrust-proxy
pip install braintrust
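
After running either install command above, it can be useful to confirm which of the two packages actually landed in the active environment. The following is a minimal sketch using only the Python standard library; the helper name `installed_version` is hypothetical and is not part of either SDK.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a pip distribution, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Distribution names taken from the install commands on this page.
    for name in ("langfuse", "braintrust"):
        version = installed_version(name)
        print(f"{name}: {version or 'not installed'}")
```

This only checks that the distribution is installed; it does not verify credentials or connectivity to either service.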

Shared tags

eval, observability

Only in Langfuse

open-source, tracing

Only in Braintrust

ab-testing
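
The three tag groups above correspond to a set intersection and two set differences. As a rough sketch, assuming each space-separated word in the listings is a separate tag (the page does not state this explicitly), the grouping could be reproduced like this; `compare_tags` is a hypothetical helper, not part of the registry.

```python
from typing import Dict, List, Set

# Tag sets inferred from the listings on this page (an assumption).
LANGFUSE_TAGS: Set[str] = {"eval", "observability", "open-source", "tracing"}
BRAINTRUST_TAGS: Set[str] = {"eval", "observability", "ab-testing"}


def compare_tags(a: Set[str], b: Set[str]) -> Dict[str, List[str]]:
    """Split two tag sets into shared tags and tags unique to each side."""
    return {
        "shared": sorted(a & b),   # tags present in both tools
        "only_a": sorted(a - b),   # tags only the first tool has
        "only_b": sorted(b - a),   # tags only the second tool has
    }


if __name__ == "__main__":
    result = compare_tags(LANGFUSE_TAGS, BRAINTRUST_TAGS)
    for group, tags in result.items():
        print(f"{group}: {', '.join(tags)}")
```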

When to reach for which

Langfuse: reach for it when you need an open-source (MIT), self-hostable tool that covers traces, evals, prompts, and datasets.

Braintrust: reach for it when evaluation is the priority (datasets, scoring, A/B comparisons) and a proprietary platform is acceptable.

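Both sit in the Eval & Observability category, so they are substitutes rather than complements. Both install as pip packages, so the choice comes down to license (MIT vs proprietary), hosting model (Langfuse is self-hostable), and which non-shared tags matter most to your stack: open-source and tracing for Langfuse, ab-testing for Braintrust.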

ClaudSkills Arcade · CC BY 4.0