Eval datasets

Curated benchmarks distilled from SME-validated eval runs. Versioned, hashed, licensable. The Atlas IP layer.

Datasets

1 available · 1 beta · 1 roadmap

Curated items

9,430

SME-validated, hashed

Source runs

29,000

Production traffic origins

Avg SME validation

96%

Inter-rater agreement, weighted

Registry · 3 datasets · Sorted by recency

Dataset	ID	Version	Lang	Items	Source runs	SME val.	License	Price	Status
Tagalog Healthcare Adversarial Prompts — 2026 Q2	atlas-ds-tl-healthcare-2026q2	v0.4.0-beta	tl (Tagalog)	4,280	12,900	97%	Eval + finetune	$28k	beta
Korean Clinical Register & Safety Set — 2026 Q1	atlas-ds-ko-clinical-2026q1	v1.2.0	ko (Korean)	3,140	9,700	96%	Eval + finetune	$32k	available
Japanese Keigo-Stability Adversarial Set	atlas-ds-ja-keigo-2026q2	v0.1.0-roadmap	ja (Japanese)	2,010	6,400	95%	Eval-only	tbd	roadmap
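
The summary figures at the top of the page (9,430 curated items, 29,000 source runs, 96% average SME validation) can be checked against the registry rows. The sketch below is illustrative only: the `DatasetRow` type and `summarize` function are hypothetical names, not part of any Atlas API, and weighting the average by item count is an assumption. The page says only that the figure is weighted by inter-rater agreement, and the per-dataset agreement scores are not shown here. A plain unweighted mean of the three percentages also gives 96.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetRow:
    """One registry row (hypothetical structure, values copied from the table)."""
    dataset_id: str
    items: int
    source_runs: int
    sme_val_pct: float


REGISTRY = [
    DatasetRow("atlas-ds-tl-healthcare-2026q2", 4280, 12900, 97.0),
    DatasetRow("atlas-ds-ko-clinical-2026q1", 3140, 9700, 96.0),
    DatasetRow("atlas-ds-ja-keigo-2026q2", 2010, 6400, 95.0),
]


def summarize(rows):
    """Total the items and source runs, and average SME validation.

    Assumption: the average is weighted by item count. The real
    dashboard may weight differently (e.g. by inter-rater agreement).
    """
    total_items = sum(r.items for r in rows)
    total_runs = sum(r.source_runs for r in rows)
    weighted_pct = sum(r.items * r.sme_val_pct for r in rows) / total_items
    return {
        "items": total_items,
        "source_runs": total_runs,
        "avg_sme_val_pct": weighted_pct,
    }


summary = summarize(REGISTRY)
print(summary["items"], summary["source_runs"], round(summary["avg_sme_val_pct"]))
# prints: 9430 29000 96
```

The item-weighted value is about 96.24 before rounding, so both the weighted and unweighted readings are consistent with the 96% shown on the card.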


Atlas IP · 15 runs in lineage graph
