Where are the models hosted?

Entirely in France and the EU, on dedicated LPUs operated by Citadea, never on US infrastructure.

Are you EU AI Act-aligned?

Yes, inference is designed to align with the EU AI Act, with EU data residency and a SecNumCloud path in progress.

Which models can I run?

Open-source models such as Llama 3 and Mistral, served via an OpenAI-compatible API.

Sovereign AI hosting in France | Citadea

< 20 ms

edge latency (P99)

-70%

energy vs GPU

100%

hosted in the EU

Why host AI in France?

A French company with no US parent falls outside the reach of the US CLOUD Act, so your prompts, embeddings and fine-tunes are not exposed to foreign jurisdiction. Citadea hosts inference entirely in the EU, aligned with the EU AI Act, with a documented SecNumCloud path (in progress).

Flat-rate inference, no token shock

Per-token pricing punishes high-utilisation workloads, a single agent reasoning loop can produce a frightening invoice. Citadea Neural rents a dedicated LPU slice at a fixed monthly price. Above roughly 30% sustained utilisation it is simply cheaper, and it never surprises you.

Built for production, measured for engineers

Dedicated, non-shared LPUs serve Llama, Mistral and other open-source models with real-time streaming and sub-20 ms edge latency, drawing up to 70% less energy than comparable GPU inference. Memory adds sovereign sub-millisecond vector search for RAG.

Products