Skip to main content

Open public goods

The four public goods

We're publishing the evaluation infrastructure alongside the model — openly, so any Canadian AI builder, evaluator, or procurer can reference, reproduce, and audit the same baseline. The standard is the point. Links go live as each artifact lands.

Dataset

Canadian Bilingual Legal Corpus

The open dataset flash-1-mini is fine-tuned on, with full provenance. Bilingual English and French, Canadian legal context — released with flash-1-pro in March 2027, held during active training to preserve the specialization edge.

Coming soon
Evaluation suite · Preview

CBLRE Evaluation Suite

The Canadian Bilingual Legal & Regulatory Evaluation — six tracks, bilingual ground truth, reproducible scoring. In preview, pending subject-matter-expert validation.

Read the post
Methodology · v1.0

Canadian AI Evaluation Methodology

How to evaluate AI for Canadian regulated workflows — the framework behind the CBLRE tracks.

Read the post
Methodology · v1.0

Model Benchmarking Methodology

How we measured what we measured — the reproducibility protocol that makes every published number checkable.

Read the post

Maintained, versioned, and updated by the SimpleDirect team. Reference them in RFPs and procurement scoring; cite them in academic work.

See the model these standards measured

Go to flash-1-mini