flash-1-mini — coming end of May 2026
AI you can own.
Open-weight models that run on your computer.
Your data never leaves. No API key. No subscription.
Ownership
What ownership actually means
You have the weights.
The model files live on your machine. They are files, not a service.
It runs without internet.
Once downloaded, inference happens entirely on your hardware. Airplane mode works.
Nobody can revoke access.
No account to suspend. No API key to rotate. If SimpleDirect disappeared tomorrow, your model would still work.
Your data stays local.
Every prompt, every document, every conversation stays on your hardware. Privacy is architectural, not a setting.
No per-token billing.
You pay once (or download for free). Run it a million times. The only running cost is your electricity.
No vendor lock-in.
Models ship as GGUF files, compatible with Ollama, LM Studio, llama.cpp, and any other runtime that reads GGUF.
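Because the models are ordinary files, you can inspect them yourself. The sketch below reads the fixed-size header at the start of a GGUF file: the 4-byte magic `GGUF`, a version number, a tensor count, and a metadata entry count. It assumes a little-endian file in GGUF version 2 or later (version 1 used narrower counts). The function name `read_gguf_header` is illustrative and not part of any SimpleDirect tooling.

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF header.

    Assumes little-endian layout and GGUF version 2 or later.
    """
    with open(path, "rb") as f:
        header = f.read(24)
    if len(header) < 24 or header[:4] != GGUF_MAGIC:
        raise ValueError(f"{path} does not look like a GGUF file")
    # uint32 version, uint64 tensor count, uint64 metadata key/value count
    version, tensor_count, kv_count = struct.unpack("<IQQ", header[4:24])
    return version, tensor_count, kv_count
```

Pointing this at a downloaded model file (whatever it ends up being named) is a quick sanity check that the download is a GGUF file before you hand it to a runtime.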
The models
Three models. One principle: yours.
flash-1-mini
Coming end of May 2026
- Size: 4 billion parameters
- Download: ~2.7 GB
- Runs on: Any laptop (4 GB RAM)
- Best for: Personal AI, quick tasks, mobile
Free — forever
flash-1
Coming July 2026
- Size: 9 billion parameters
- Download: ~5.5 GB
- Runs on: 8 GB RAM or entry GPU
- Best for: Business workloads, RAG, daily driver
$99 one-time
flash-1-pro
Coming September 2026
- Size: 27 billion parameters
- Download: ~16 GB
- Runs on: 24+ GB VRAM or 32+ GB RAM
- Best for: Enterprise, defense, complex reasoning
$499 one-time
All models ship as GGUF with multiple quantization levels. Bilingual English / French. Citation-grounded. Trained for function calling and RAG.
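To see what the download sizes imply about quantization, you can divide file size by parameter count. The sketch below does that arithmetic with the figures listed above. It assumes 1 GB means 10^9 bytes and ignores metadata overhead in the file. Which quantization level each listed download corresponds to is not stated here, so treat the result as a rough estimate.

```python
# (parameter count, approximate download size in bytes), from the model list.
# Assumption: 1 GB = 10**9 bytes; file metadata overhead is ignored.
MODELS = {
    "flash-1-mini": (4e9, 2.7e9),
    "flash-1": (9e9, 5.5e9),
    "flash-1-pro": (27e9, 16e9),
}


def bits_per_weight(params, size_bytes):
    """Average number of bits stored per parameter."""
    return size_bytes * 8 / params


for name, (params, size_bytes) in MODELS.items():
    print(f"{name}: ~{bits_per_weight(params, size_bytes):.1f} bits per weight")
```

All three come out at roughly 5 bits per weight, well below the 16 bits of an unquantized half-precision model, which is why the downloads fit on ordinary hardware.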
Where it fits
You already have the hardware.
Now you need the brain.
SimpleDirect models run at every level of the ownership ladder — from the laptop on your desk to the rack in your colo. Same models. Same weights. Your choice of home.
Tier 1
Your laptop
- Hardware: Mac Mini, MacBook, any laptop with 4+ GB RAM
- Model: flash-1-mini (free)
- Good for: Solo founders, consultants, creators
Your first private AI. Zero cost after the download.
Tier 2
Your office
- Hardware: Mac Studio, workstation with 32–96+ GB RAM
- Model: flash-1 ($99) or flash-1-pro ($499)
- Good for: Law firms, accounting practices, clinics, small teams
Serious capability on your desk. Power draw: three light bulbs.
Tier 3
Your cloud
- Hardware: Bare-metal GPU in a datacenter you choose
- Model: flash-1-pro ($499) + vLLM for multi-user
- Good for: Growing businesses, regulated industries, multi-office teams
Pick the jurisdiction. Own the stack. Move providers in a weekend.
Tier 4
Your rack
- Hardware: NVIDIA GPU in a colocation facility
- Model: flash-1-pro ($499) + deployment support
- Good for: Enterprise, defense, critical operations
Full ownership. Air-gapped if needed. No external dependencies. Ever.
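As a rough way to place your own machine on the ladder, the sketch below maps system RAM to the largest model whose stated minimum fits. The thresholds come from the "Runs on" figures in the model list. The selection rule and the name `largest_model_for` are hypothetical, and the GPU paths (entry GPU, 24+ GB VRAM) are left out for brevity.

```python
# Stated minimum system RAM per model, largest model first.
# The rule below is an illustration, not SimpleDirect's actual logic.
MIN_RAM_GB = [
    ("flash-1-pro", 32),
    ("flash-1", 8),
    ("flash-1-mini", 4),
]


def largest_model_for(ram_gb):
    """Return the largest model whose stated RAM minimum fits, or None."""
    for name, min_ram in MIN_RAM_GB:
        if ram_gb >= min_ram:
            return name
    return None
```

A 16 GB laptop lands on flash-1 under this rule, and a 64 GB workstation on flash-1-pro. In practice the operating system and other apps also need memory, so the minimums are a floor rather than a comfortable target.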
Who it's for
Built for people who handle things that matter
Founders
You're paying $20–200/month for AI that isn't yours. Download flash-1-mini and own it in three minutes.
Lawyers
Client privilege doesn't survive a ChatGPT prompt. Run your AI locally. No data leaves your machine.
Accountants & advisors
Client financial data on someone else's server is a liability. This is the fix.
Clinics & healthcare
PIPEDA, PHIPA, HIPAA: whichever applies to you, local inference keeps patient data on hardware you control, so no third party ever processes it.
CTOs & engineering
Open-weight GGUF models. RAG-optimized. Function calling. Deploy on your hardware, air-gapped if needed.
Defense & government
Models that run with no external dependencies. Commercial licensing and deployment support available.
The app — September 2026
One download. Then yours.
SimpleDirect is a native desktop app. Install it, open it, start talking to your AI.
Double-click. It works.
No terminal, no Python, no Docker. The app detects your hardware, picks the right model, and handles everything.
Point it at your folders.
Google Drive, Dropbox, project directories. It indexes your documents locally. Ask questions grounded in your actual files.
Nothing transmitted.
Your conversations, documents, and preferences stay on your machine. Nothing logged. Nothing sent anywhere.
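To make "indexes your documents locally" concrete, here is a minimal sketch of a local index that never touches the network. It is a plain keyword index built with the standard library, used as a simple stand-in: how the SimpleDirect app actually indexes and retrieves documents is not described here. The function names and the file-type filter are assumptions for illustration.

```python
import re
from collections import defaultdict
from pathlib import Path


def build_index(folder, suffixes=(".txt", ".md")):
    """Map each lowercase word to the set of files under `folder` containing it."""
    index = defaultdict(set)
    for path in Path(folder).rglob("*"):
        if path.is_file() and path.suffix.lower() in suffixes:
            text = path.read_text(encoding="utf-8", errors="ignore")
            for word in set(re.findall(r"\w+", text.lower())):
                index[word].add(path)
    return index


def search(index, query):
    """Return files containing every word in `query`, sorted by path."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return []
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)
```

The files returned by `search` are what a local model would then be given as context, so answers can cite documents that never left the machine.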
Join the waitlist. Be first when it ships.
One email when the app ships. No marketing blasts.



