Visualize classification with Confusion Matrix
getWeather
, lookupOrder
, summarizeReport
). Each tool or function is treated as a class, and you can use a confusion matrix to analyze accuracy and errors for each function.
Add a Metric in Prompt Workbench
Set metric Property
Configure as Confusion Matrix
Create a Test Case
Add an Assertion Using Your Metric
Run Your Tests
Review Results in the Session Page