ArtemisKit is built around a few key concepts that apply across both the CLI and SDK. Understanding these will help you get the most out of the toolkit.
Scenarios
Test suites containing prompts and expectations for evaluating LLM outputs.
Expectations
Matchers that define how to evaluate LLM responses against expected behavior.
Providers
LLM provider configurations for OpenAI, Anthropic, Azure, and more.
Evaluators
The evaluation engine that runs scenarios and produces results.
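To make the relationship between these four concepts concrete, here is a minimal TypeScript sketch. Every type and function name in it (`Scenario`, `TestCase`, `Provider`, `evaluate`, `stubProvider`) is hypothetical and chosen for illustration; none of it is ArtemisKit's actual API, and the provider is a canned stub rather than a real LLM call.

```typescript
// Hypothetical types for illustration only; these are not ArtemisKit's actual API.

// An expectation decides whether a response is acceptable.
type Expectation = (response: string) => boolean;

// A scenario groups test cases: each pairs a prompt with an expectation.
interface TestCase {
  name: string;
  prompt: string;
  expected: Expectation;
}

interface Scenario {
  name: string;
  cases: TestCase[];
}

// A provider turns a prompt into a response. A real provider would call an
// LLM API asynchronously; this one is synchronous to keep the sketch short.
interface Provider {
  complete(prompt: string): string;
}

interface CaseResult {
  name: string;
  passed: boolean;
  response: string;
}

// The evaluator sends every prompt to the provider and checks the response.
function evaluate(scenario: Scenario, provider: Provider): CaseResult[] {
  return scenario.cases.map((c) => {
    const response = provider.complete(c.prompt);
    return { name: c.name, passed: c.expected(response), response };
  });
}

// Stub provider with canned answers, standing in for a real LLM.
const stubProvider: Provider = {
  complete: (prompt) =>
    prompt.includes("capital of France") ? "Paris" : "I don't know",
};

const geography: Scenario = {
  name: "geography",
  cases: [
    {
      name: "france",
      prompt: "What is the capital of France?",
      expected: (r) => r.includes("Paris"),
    },
    {
      name: "peru",
      prompt: "What is the capital of Peru?",
      expected: (r) => r.includes("Lima"),
    },
  ],
};

const results = evaluate(geography, stubProvider);
console.log(results.map((r) => `${r.name}: ${r.passed ? "pass" : "fail"}`).join("\n"));
```

In this sketch the scenario is plain data, the provider is swappable behind an interface, and the evaluator is the only piece that touches both. That separation is what would let the same scenario run against different providers.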
┌─────────────────────────────────────────────────────────────┐
│                         ArtemisKit                          │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│  ┌─────────────┐    ┌─────────────┐    ┌─────────────┐      │
│  │  Scenarios  │───▶│ Evaluators  │───▶│   Results   │      │
│  │  (YAML/TS)  │    │ (Matchers)  │    │  (Reports)  │      │
│  └─────────────┘    └─────────────┘    └─────────────┘      │
│         │                  │                  │             │
│         ▼                  ▼                  ▼             │
│  ┌─────────────┐    ┌─────────────┐    ┌─────────────┐      │
│  │  Provider   │    │  Guardian   │    │   Storage   │      │
│  │  (LLM API)  │    │ (Protection)│    │  (History)  │      │
│  └─────────────┘    └─────────────┘    └─────────────┘      │
│                                                             │
└─────────────────────────────────────────────────────────────┘

These concepts work the same way whether you’re using the CLI (akit run) or the SDK (kit.run()):
| Concept | CLI | SDK |
|---|---|---|
| Scenarios | YAML files | YAML files or scenario() builder |
| Expectations | YAML expected field | YAML or contains(), exact(), etc. |
| Providers | Config file or --provider flag | ArtemisKit({ provider: '...' }) |
| Results | Terminal output + reports | RunResult object |
| Storage | artemis-output/ directory | Configurable via storage option |
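
The Expectations row says the same expectation can come from a YAML expected field or from helpers such as contains() and exact(). One way this could work is for both routes to produce the same plain data, which a single matcher then evaluates. The sketch below illustrates that idea only: the `Expected` shape, its field names (`type`, `values`, `value`), and the `matches` function are assumptions, not ArtemisKit's documented schema or API.

```typescript
// Hypothetical sketch: the shapes and helper names below are assumptions,
// not ArtemisKit's documented API.

// Plain-data form of an expectation, as a YAML parser might produce it.
type Expected =
  | { type: "contains"; values: string[] }
  | { type: "exact"; value: string };

// Builder helpers that produce the same plain data from code.
function contains(...values: string[]): Expected {
  return { type: "contains", values };
}

function exact(value: string): Expected {
  return { type: "exact", value };
}

// One matcher handles both origins, since the data is identical.
function matches(expected: Expected, response: string): boolean {
  switch (expected.type) {
    case "contains":
      // Every listed value must appear somewhere in the response.
      return expected.values.every((v) => response.includes(v));
    case "exact":
      // The response must equal the expected string exactly.
      return response === expected.value;
  }
}

// What a parsed YAML `expected` block could look like (assumed field names).
const fromYaml: Expected = { type: "contains", values: ["Paris"] };
// The equivalent expectation built in code.
const fromCode: Expected = contains("Paris");

console.log(JSON.stringify(fromYaml) === JSON.stringify(fromCode));
console.log(matches(fromCode, "The capital is Paris."));
console.log(matches(exact("Paris"), "The capital is Paris."));
```

Treating expectations as data rather than functions is a design choice in this sketch: it keeps them serializable, so they could be stored alongside results or written by hand in a config file.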