P50 / P95 / P99 (Percentiles)

/'pee.fifty slash 'pee.ninetyfive slash 'pee.ninetynine per'seh.nteyelz/Latency percentiles used to describe typical vs. tail performance. Tail latency (P95/P99) often matters more for user experience than the average. (noun)

“P95 jumped after we enabled a heavier retrieval step.”

Related Observability terms

Active observability

•

AI observability

•

Alert / threshold

•

Dashboard

•

Data flywheel

•

Deep search

•

Drift

•

Error rate

•

Feedback loop

•

Logs

•

Model drift

•

Online evaluation (production scoring)

•

Sampling rate

•

Service Level Indicator (SLI)

•

Service Level Objective (SLO)

•

Time-to-first-token (TTFT)

•

Token usage / cost tracking

•

Topics

From the docs

Observe your application

•

View your logs

•

Monitor with dashboards

•

Glossary

Get started with Evals

Braintrust is the AI observability and eval platform for production AI. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.

Start building

← Online evaluation (production scoring)

Manifesto

Pairwise evaluation →