ArtemisKit Documentation

ArtemisKit is the open-source toolkit for testing LLM applications. One CLI. Three capabilities:

  • Test — Evaluate outputs with scenario-based quality gates
  • Secure — Red team your LLM before attackers do
  • Stress — Know your latency limits before users find them

Quick Start

From zero to your first test in 5 minutes. Install, configure, run.

Get started →

Core Concepts

Understand scenarios, expectations, providers, and evaluators.

Learn concepts →

CLI (@artemiskit/cli)

Run evaluations from the terminal, integrate with CI/CD, generate reports. Most teams start here.

CLI Documentation →

SDK (@artemiskit/sdk)

Embed evaluations in your code. Includes Guardian Mode for runtime protection and Jest/Vitest matchers.

SDK Documentation →

# Install
npm install -g @artemiskit/cli
# Run a scenario
akit run scenarios/hello-world.yaml
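
For CI/CD use, the run command above can be wrapped in a small script that executes every scenario file and fails the job if any run fails. This is a minimal sketch, not an official recipe: it assumes that `akit run` exits with a non-zero status when a scenario fails and that scenarios are stored as `*.yaml` files in one directory. The `run_scenarios` function and the `AKIT_BIN` variable are hypothetical names introduced for illustration.

```shell
#!/usr/bin/env bash
# Sketch: run every scenario in a directory and report an overall status.
# Assumptions (not confirmed by the ArtemisKit docs): `akit run` exits
# non-zero on failure, and scenarios are *.yaml files in a single directory.

# Hypothetical override so a different binary (e.g. artemiskit) can be used.
AKIT_BIN="${AKIT_BIN:-akit}"

run_scenarios() {
  local dir="${1:-scenarios}"
  local failed=0
  local file
  for file in "$dir"/*.yaml; do
    # Skip the unexpanded pattern when the directory has no YAML files.
    [ -e "$file" ] || continue
    echo "Running $file"
    if ! "$AKIT_BIN" run "$file"; then
      echo "FAILED: $file" >&2
      failed=1
    fi
  done
  # 0 if every scenario passed (or none were found), 1 otherwise.
  return "$failed"
}
```

In a pipeline step this might be called as `run_scenarios scenarios || exit 1`. The loop continues past a failing scenario so that a single run reports every failure rather than only the first.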

| Package | npm Name | Description |
| --- | --- | --- |
| CLI | @artemiskit/cli | Command-line interface (artemiskit, akit) |
| SDK | @artemiskit/sdk | Programmatic API, Guardian Mode, test matchers |
| Core | @artemiskit/core | Core evaluation engine |
| Reports | @artemiskit/reports | HTML/JSON report generation |
| Red Team | @artemiskit/redteam | Security testing and mutations |
| OpenAI Adapter | @artemiskit/adapter-openai | OpenAI and Azure OpenAI adapter |
| Anthropic Adapter | @artemiskit/adapter-anthropic | Anthropic Claude adapter |
| Vercel AI Adapter | @artemiskit/adapter-vercel-ai | Vercel AI SDK adapter |
| LangChain Adapter | @artemiskit/adapter-langchain | LangChain chains and agents |
| DeepAgents Adapter | @artemiskit/adapter-deepagents | DeepAgents multi-agent systems |

Examples & Cookbook

Recipes for CI/CD integration, security testing, and regression detection.

View examples →

Guardian Mode

Runtime protection with semantic validation and injection detection.

Learn more →