3 docs tagged with "Evals"

Engineering

Build, evaluate, observe, and govern AI systems. The leap from chart to live operation.

The AI Engineer's Stack

The discipline of engineering everything around the model so it survives production — harness, inference economics, reliability, retrieval, evals, observability, and safety.