Runners Overview
Runners execute test datasets against AI systems and collect responses for evaluation.Why Use Runners?
Automated Testing
Execute tests against any AI system automatically
Multiple Backends
Support for Alquimia, LLMs, and custom systems
Async Execution
Efficient asynchronous batch processing
Detailed Metrics
Track execution times and success rates
Installation
Quick Start
Available Runners
| Runner | Use Case | Requirements |
|---|---|---|
AlquimiaRunner | Alquimia AI agents | Alquimia API credentials |
| Custom runners | Any AI system | Implement BaseRunner interface |
Execution Modes
LLM Mode (Lambda)
Execute tests directly against any LangChain-compatible LLM:Alquimia Mode
Execute tests against Alquimia AI agents:Workflow
Execution Summary
Each run returns a summary with metrics:Next Steps
AlquimiaRunner
Learn about the Alquimia runner
Custom Runners
Create your own runner
Storage
Save and load results