Evaluation Tutorials#

Use these tutorials to become familiar with evaluation with NeMo Microservices.

Before You Start#

Set up NeMo Microservices Quickstart for the following tutorials.

Run a benchmark evaluation

Learn how to run an evaluation with a built-in benchmark.

Run a Benchmark Evaluation
Run an LLM Judge Eval

Learn how to evaluate a fine-tuned model using the LLM Judge metric with a custom dataset.

Evaluate Response Quality with LLM-as-a-Judge