Sovereign AI hosting in France
Run open-source models on dedicated LPUs hosted in France, billed at a predictable flat rate. The performance you need for production inference, with EU sovereignty and a bill your CFO can forecast.
Why host AI in France?
A French company with no US parent falls outside the reach of the US CLOUD Act, so your prompts, embeddings and fine-tunes are not exposed to foreign jurisdiction. Citadea hosts inference entirely in the EU, aligned with the EU AI Act, and is on a documented path to SecNumCloud qualification (in progress).
Flat-rate inference, no token shock
Per-token pricing punishes high-utilisation workloads: a single agent reasoning loop can produce a frightening invoice. Citadea Neural rents a dedicated LPU slice at a fixed monthly price. Above roughly 30% sustained utilisation the flat rate works out cheaper than per-token billing, and the bill never surprises you.
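For engineers who want to check the break-even point against their own numbers, here is a minimal sketch of the arithmetic. Every figure in it is a hypothetical placeholder, not a Citadea price or a measured throughput: the tokens-per-second rate, the per-million-token price and the flat monthly fee are assumptions chosen only to illustrate how a break-even near 30% can arise. Substitute your own values.

```python
"""Break-even between flat-rate and per-token inference pricing (illustrative)."""

SECONDS_PER_DAY = 86_400


def monthly_token_capacity(tokens_per_second: float, days: int = 30) -> float:
    """Tokens a dedicated slice could produce in a month at 100% utilisation."""
    if tokens_per_second <= 0 or days <= 0:
        raise ValueError("tokens_per_second and days must be positive")
    return tokens_per_second * SECONDS_PER_DAY * days


def per_token_cost(utilisation: float, capacity_tokens: float,
                   price_per_million: float) -> float:
    """Monthly per-token bill for the same volume of tokens.

    utilisation is the sustained fraction of capacity actually used (0.0 to 1.0).
    """
    if not 0.0 <= utilisation <= 1.0:
        raise ValueError("utilisation must be between 0 and 1")
    return utilisation * capacity_tokens / 1_000_000 * price_per_million


def break_even_utilisation(flat_monthly_price: float, capacity_tokens: float,
                           price_per_million: float) -> float:
    """Utilisation above which the flat rate is cheaper than per-token billing."""
    full_load_cost = capacity_tokens / 1_000_000 * price_per_million
    if full_load_cost <= 0:
        raise ValueError("capacity and per-token price must be positive")
    return flat_monthly_price / full_load_cost


if __name__ == "__main__":
    # Hypothetical inputs, for illustration only.
    capacity = monthly_token_capacity(tokens_per_second=500.0)
    threshold = break_even_utilisation(
        flat_monthly_price=780.0,   # assumed flat fee per month
        capacity_tokens=capacity,
        price_per_million=2.0,      # assumed per-token price per million tokens
    )
    print(f"Break-even utilisation: {threshold:.1%}")
    for u in (0.1, 0.3, 0.6):
        print(f"{u:.0%} utilisation -> per-token bill {per_token_cost(u, capacity, 2.0):.2f}")
```

The break-even point is simply the flat fee divided by what the same slice would cost at full load under per-token billing, so it shifts with model throughput and with the per-token price you are comparing against.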
Built for production, measured for engineers
Dedicated, non-shared LPUs serve Llama, Mistral and other open-source models with real-time streaming and sub-20 ms edge latency, drawing up to 70% less energy than comparable GPU inference. Memory adds sovereign sub-millisecond vector search for RAG.
Build on a cloud that respects you
Join the waitlist or talk to an engineer about your web, web3 or agent workloads.