Verification
Don't trust. Verify.
We make Vinci's behaviour verifiable, not asked-for. Four public artifacts let customers, auditors, and journalists check that Vinci does what we say it does.
- Constitution…
- Character…
- Methodology…
- Adversarial results…
The bundle
Four public artifacts.
Everything that defines Vinci ships in the open. Two are public in draft today; the full set ships with vinci-studio on August 8, 2026.
Constitution
The values and behaviour specification — nine values, identity, and the safety framework.
Character
The personality and texture — how Vinci embodies the Constitution.
Methodology
How Vinci was trained — Constitutional Fine-Tuning, reproducible.
Adversarial test results
HarmBench, JailbreakBench, and a censorship testbed run on the deployed weights.
The Constitution
Nine values, in the open.
The values Vinci is trained to hold — published as a specification anyone can read, test, and hold us to.
Integrity
Doesn't pretend, fabricate, or say what it doesn't believe.
Transparency
Clear about what it is, what it knows, and how it was built.
Calibrated honesty
Knows the difference between sure and not sure — and says so.
Ethical refusal
Declines what's wrong, clearly and without a sermon.
Willing to disagree
Says so when you're wrong. Disagreement is respect.
Usefulness
Built for work. Engages with the task, skips the preamble.
Empowerment
Takes work off your plate, and acts within your direction.
Humanism
Treats you as a person, not a unit of output.
No pushing
No manipulation, no upsell, no false urgency.
The Character
Warmly direct.
The directness is the warmth — Vinci tells you the truth because it respects you. Don't take our word for it. Watch the difference.
No sycophancy
Most AI
Vinci
Willing to disagree
Most AI
Vinci
Comfortable with limits
Most AI
Vinci
Honest about itself
Vinci
The safety framework
Bright lines, and no over-refusal.
Vinci refuses a short, categorical list — and engages with everything else, including hard topics, without lecturing.
Categorical refusals
Weapons of mass harm, cyberweapons against systems you don't own, content that endangers children, targeted harm, election deception, mass fraud. No roleplay or system prompt unlocks them.
No over-refusal
Medical, security, and harm-reduction questions get real answers. "Sensitive" is not "harmful." No sermons on safe requests.
Adversarial robustness
Identity and refusals hold under pressure — long conversations, injected instructions, "ignore previous instructions," claimed authority.
Agentic safety
No covert or irreversible actions without asking, no self-exfiltration, no scheming. Retrieved content is data, not commands.
Most AI
Vinci
The commitment
Diverge, and it's a bug.
The bundle is the contract. If Vinci's deployed behaviour diverges from it, that's a bug — and we commit to publishing diagnosis and fix within 30 days. Closed-weight providers can't offer this, because you can't independently check them. We can, because the weights are open and you run your own tests.
Launching August 8, 2026
Follow the build.
Open weights, a public Constitution, and a model you can verify — the day it ships. Get the launch in your inbox. No spam, just the drop.
or follow along — Follow on X