Live Pricing Calculator
Build your AI infrastructure
cost estimate
Real specs, real pricing — compare Tonomia across every service layer.
01 — GPU Platform
NVIDIA
B300 Blackwell Ultra
4.20 EUR/GPU-hr
288 GB HBM3e · 8 TB/s
~5 PFLOPS FP8 · 15 PFLOPS FP4
NVLink 5 · ConnectX-8 800Gb/s
~5 PFLOPS FP8 · 15 PFLOPS FP4
NVLink 5 · ConnectX-8 800Gb/s
AMD
Instinct MI355X
2.90 EUR/GPU-hr
288 GB HBM3E · 8 TB/s
~5 PFLOPS FP8 · 10 PFLOPS MXFP4
Infinity Fabric · Pensando 400G
~5 PFLOPS FP8 · 10 PFLOPS MXFP4
Infinity Fabric · Pensando 400G
02 — Configuration
Number of GPUs8 GPUs
Utilization80%
Billing Period
03 — Workload
—
Total PFLOPS (FP8)
—
Total HBM Memory
—
Power Draw (kW)
Estimated GPU Cost
NVIDIA B300 · 4.20 EUR/GPU-hr
Tonomia Price
EUR 0
per month
GPU compute—
TonoFabric orchestrationIncluded
800 Gbps networkingIncluded
NVMe storage—
24/7 SOC + SLAIncluded
RL efficiency saved—
vs Hyperscalers
Tonomia
—
Hyperscaler 1
—
Hyperscaler 2
—
Hyperscaler 3
—
—vs hyperscalers
PUE ~1.05 vs 1.3-1.6 for hyperscalers. TonoFabric RL scheduler adds ≥15% efficiency over time.
Estimation Mode
Select model → adjust volume below
Volume & Billing
Tokens per Month10M tokens
Billing
≤8ms
p99 API Latency SLA
99.9%
API Uptime SLA
GDPR
EU Sovereign
Token Cost Estimate
EUR/MT = Euro per Million Tokens
Tonomia Cost
EUR 0
per year
Token processing—
Model—
Smart MaaS routingIncluded
Audit logsIncluded
GDPR complianceIncluded
Tonomia vs Reference
Tonomia
—
Hyperscaler
—
— cheaperYou save —
TonoFabric MaaS marketplace routes each request to the optimal node. Full version control, no egress fees.
Choose Plan
Standard
TOOMI PRO
5 EUR
/ user / month
Model agnostic
Text & document translation
File Q&A · Audio/video transcription
Image analysis
Image generation (optional)
Report generation · Code execution
RAG grounded answers
Premium
TOOMI PRO+
9 EUR
/ user / month
Everything in TOOMI PRO
Image generation & edit included
Natural Continuous Conversation
Advanced bot customization
Workspace prompts/templates
Speech transcription · Extensions
Configuration
Number of Users100 users
Billing
Feature Comparison
| Feature | TOOMI Chat | 5EUR TOOMI PRO | 9EUR TOOMI PRO+ |
|---|---|---|---|
| Model agnostic | S | S | S |
| Text translation | S | S | S |
| Document translation | intact fmt | S | S |
| File Q&A | S | S | S |
| Audio/video transcription | S | S | S |
| Image analysis | S | S | S |
| Image generation & edit | S | O | S |
| Report generation | S | S | S |
| Code execution | S | S | S |
| Bot customization | S | S | S |
| RAG grounded answers | S | S | S |
| Multi-language chat | S | S | S |
| Enterprise data isolation | on-prem | S | S |
| Built-in web search | S | S | S |
| Speech transcription | S | S | S |
| Natural Continuous Conv. | S | O | S |
| Public API / connectors | X | — | — |
S = Included · O = Optional · X = Not available
-80%
vs market avg (25EUR/user)
≤30s
Failover · ≥95% recovery
GDPR
EU Sovereign native
Chatbot Cost Estimate
TOOMI PRO · 5EUR/user/month
Tonomia Total
EUR 0
per month
User licenses—
PlanTOOMI PRO
Model servingIncluded
GDPR data residencyIncluded
Tonomia vs Hyperscaler (25EUR/user)
Tonomia
—
Hyperscaler
—
–% cheaperYou save —
Powered by Toomi on TonoFabric — EU sovereign, ≤30s failover, 99.9% uptime.
H100 Reference Price (your cloud rate)
H100 SXM5 EUR/GPU-hr
EUR
Tonomia H100 reference: 2.70 EUR/GPU-hr
Precision – MI355X
Precision – B300
NVIDIA – Reference
H100 SXM5 (FP8)
TFLOPS FP81,979 TFLOPS
Memory80 GB HBM3
Bandwidth3.35 TB/s
EUR / 1000 TFLOPS/hr—
AMD – Tonomia 2.90 EUR/hr
MI355X (MXFP4)
TFLOPS MXFP410,000 TFLOPS
Memory288 GB HBM3E
Bandwidth8.0 TB/s
EUR / 1000 TFLOPS/hr—
NVIDIA – Tonomia 4.20 EUR/hr
B300 Blackwell Ultra (FP4)
TFLOPS FP415,000 TFLOPS
Memory288 GB HBM3e
Bandwidth8.0 TB/s
EUR / 1000 TFLOPS/hr—
Cost Efficiency vs H100
MI355X cheaper per 1000 TFLOPS—
B300 cheaper per 1000 TFLOPS—
EUR / 1000 TFLOPS/hr — lower is better
H100 FP8
—
MI355X
—
B300
—
Lower EUR/TFLOP = more raw AI compute per euro — directly impacting training time and cost per token.
DeepSeek V3 685B — Est. Cost per Million Tokens
Cost per MT from GPU-hour rate, minimum GPUs to fit model, bandwidth-weighted throughput. H100 price adjustable above.
| GPU | Precision | Min GPUs | EUR/GPU-hr | Cluster/hr | Throughput | EUR/MT | vs H100 |
|---|
* H100: 80 GB, min 17 GPUs (FP8). MI355X/B300: 288 GB, min 5 (FP8), 3 (FP4). Throughput: bandwidth-limited decode model.
GPU: AMD MI355X 2.90 EUR/GPU-hr · NVIDIA B300 4.20 EUR/GPU-hr · H100 ref 2.70 EUR/GPU-hr. Token pricing from Tonomia reference table. Chatbot: TOOMI PRO 5EUR · TOOMI PRO+ 9EUR/user/month. Sources: tonomia.com/pricing · tonomia.com/tonoforge. Estimates are indicative — contact for a tailored proposal.
Generate Letter of Intent
Contact Details
Configuration Tested
Loading…
Letter of Intent Preview
Fill in your details on the left to generate the letter…
By submitting, your LOI is saved to Tonomia’s CRM and a confirmation is sent to your email.
