Compute Pricing Calculator — Tonomia
Live Pricing Calculator

Build your AI infrastructure cost estimate

Real specs, real pricing — compare Tonomia across every service layer.

01 — GPU Platform
NVIDIA
B300 Blackwell Ultra
4.20 EUR/GPU-hr
288 GB HBM3e · 8 TB/s
~5 PFLOPS FP8 · 15 PFLOPS FP4
NVLink 5 · ConnectX-8 800Gb/s
4nm Blackwell · NVL72 rack
AMD
Instinct MI355X
2.90 EUR/GPU-hr
288 GB HBM3E · 8 TB/s
~5 PFLOPS FP8 · 10 PFLOPS MXFP4
Infinity Fabric · Pensando 400G
3nm CDNA 4 · 64-GPU rack
02 — Configuration
Number of GPUs: 8 GPUs
Utilization: 80%
Billing Period
03 — Workload
LLM Inference
Training
Fine-Tuning
RAG / Search
Vision / Multimodal
HPC / Simulation
Total PFLOPS (FP8)
Total HBM Memory
Power Draw (kW)

Estimated GPU Cost

NVIDIA B300 · 4.20 EUR/GPU-hr

Tonomia Price
EUR 0
per month
GPU compute
TonoFabric orchestration: Included
800 Gbps networking: Included
NVMe storage
24/7 SOC + SLA: Included
RL efficiency saved
vs Hyperscalers
Tonomia
Hyperscaler 1
Hyperscaler 2
Hyperscaler 3
vs hyperscalers
PUE of ~1.05, compared with 1.3 to 1.6 for hyperscalers. The TonoFabric RL scheduler adds ≥15% efficiency over time.
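As a rough illustration of how the GPU estimate could be derived from the inputs above, the sketch below multiplies the listed EUR/GPU-hr rate by the number of GPUs, the hours in a month, and the utilization. The 730-hour month, the idea that utilization scales the bill linearly, and all function and class names are assumptions for illustration, not the calculator's documented logic. Storage and power draw are left out because the page gives no figures for them.

```python
from dataclasses import dataclass

HOURS_PER_MONTH = 730  # assumption: 8760 hours per year / 12


@dataclass(frozen=True)
class Gpu:
    name: str
    eur_per_gpu_hr: float  # rate listed on the platform card
    hbm_gb: int            # HBM capacity listed on the platform card
    pflops_fp8: float      # approximate FP8 figure listed on the card


B300 = Gpu("NVIDIA B300", 4.20, 288, 5.0)
MI355X = Gpu("AMD MI355X", 2.90, 288, 5.0)


def monthly_gpu_cost(gpu: Gpu, num_gpus: int, utilization: float) -> float:
    """Estimated GPU compute cost in EUR for one month.

    Assumes billing scales linearly with utilization (0.0 to 1.0).
    """
    return gpu.eur_per_gpu_hr * num_gpus * HOURS_PER_MONTH * utilization


def cluster_totals(gpu: Gpu, num_gpus: int) -> dict:
    """Aggregate FP8 compute and HBM capacity for the selected GPU count."""
    return {
        "pflops_fp8": gpu.pflops_fp8 * num_gpus,
        "hbm_gb": gpu.hbm_gb * num_gpus,
    }


if __name__ == "__main__":
    cost = monthly_gpu_cost(B300, 8, 0.80)
    print(f"{B300.name}: {cost:,.2f} EUR/month")  # 19,622.40 EUR/month
    print(cluster_totals(B300, 8))
```

With the default inputs shown on the page (8 GPUs, 80% utilization), this gives about 19,622 EUR per month for the B300 and about 13,549 EUR per month for the MI355X under the stated assumptions.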
Estimation Mode
Select model → adjust volume below
Volume & Billing
Tokens per Month: 10M tokens
Billing
≤8ms
p99 API Latency SLA
99.9%
API Uptime SLA
GDPR
EU Sovereign

Token Cost Estimate

EUR/MT = Euro per Million Tokens

Tonomia Cost
EUR 0
per year
Token processing
Model
Smart MaaS routing: Included
Audit logs: Included
GDPR compliance: Included
Tonomia vs Reference
Tonomia
Hyperscaler
— cheaper · You save —
The TonoFabric MaaS marketplace routes each request to the optimal node, with full version control and no egress fees.
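Since EUR/MT is a price per million tokens, the yearly token estimate can be sketched as monthly volume divided by one million, times the rate, times twelve. The per-model rates are not shown on this page, so the 0.50 and 2.00 EUR/MT values below are placeholders, and the function names are hypothetical.

```python
def token_cost_eur(tokens_per_month: int, eur_per_mt: float, months: int = 12) -> float:
    """Token processing cost in EUR over the billing period."""
    return tokens_per_month / 1_000_000 * eur_per_mt * months


def savings(own_eur: float, reference_eur: float) -> tuple:
    """Absolute saving in EUR and percentage cheaper than the reference."""
    saved = reference_eur - own_eur
    pct = 100 * saved / reference_eur if reference_eur else 0.0
    return saved, pct


if __name__ == "__main__":
    # Placeholder rates: NOT taken from the Tonomia reference table.
    own = token_cost_eur(10_000_000, eur_per_mt=0.50)
    reference = token_cost_eur(10_000_000, eur_per_mt=2.00)
    saved, pct = savings(own, reference)
    print(f"{own:.2f} EUR/year, {pct:.0f}% cheaper, saving {saved:.2f} EUR")
```

Substituting the actual EUR/MT rate of the selected model for the placeholder yields the figure the calculator would show for that volume.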
Choose Plan
Standard
TOOMI PRO
5 EUR
/ user / month
Model agnostic
Text & document translation
File Q&A · Audio/video transcription
Image analysis
Image generation (optional)
Report generation · Code execution
RAG grounded answers
Premium
TOOMI PRO+
9 EUR
/ user / month
Everything in TOOMI PRO
Image generation & edit included
Natural Continuous Conversation
Advanced bot customization
Workspace prompts/templates
Speech transcription · Extensions
Configuration
Number of Users: 100 users
Billing
Feature Comparison
Feature                     TOOMI Chat   TOOMI PRO (5 EUR)   TOOMI PRO+ (9 EUR)
Model agnostic              S            S                   S
Text translation            S            S                   S
Document translation        intact fmt   S                   S
File Q&A                    S            S                   S
Audio/video transcription   S            S                   S
Image analysis              S            S                   S
Image generation & edit     S            O                   S
Report generation           S            S                   S
Code execution              S            S                   S
Bot customization           S            S                   S
RAG grounded answers        S            S                   S
Multi-language chat         S            S                   S
Enterprise data isolation   on-prem      S                   S
Built-in web search         S            S                   S
Speech transcription        S            S                   S
Natural Continuous Conv.    S            O                   S
Public API / connectors: X
S = Included · O = Optional · X = Not available
-80%
vs market avg (25 EUR/user)
≤30s
Failover · ≥95% recovery
GDPR
EU Sovereign native

Chatbot Cost Estimate

TOOMI PRO · 5 EUR/user/month

Tonomia Total
EUR 0
per month
User licenses
Plan: TOOMI PRO
Model serving: Included
GDPR data residency: Included
Tonomia vs Hyperscaler (25 EUR/user)
Tonomia
Hyperscaler
–% cheaper · You save —
Powered by Toomi on TonoFabric — EU sovereign, ≤30s failover, 99.9% uptime.
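The chatbot estimate reduces to simple per-seat arithmetic, sketched below using the plan prices and the 25 EUR/user reference quoted on this page. The function and dictionary names are hypothetical.

```python
PLAN_EUR_PER_USER = {"TOOMI PRO": 5, "TOOMI PRO+": 9}  # EUR per user per month
REFERENCE_EUR_PER_USER = 25                            # reference price quoted on the page


def chatbot_estimate(plan: str, users: int) -> dict:
    """Monthly licence total and saving versus the 25 EUR/user reference."""
    if users <= 0:
        raise ValueError("users must be positive")
    total = PLAN_EUR_PER_USER[plan] * users
    reference = REFERENCE_EUR_PER_USER * users
    saved = reference - total
    return {
        "total": total,
        "reference": reference,
        "saved": saved,
        "pct_cheaper": 100 * saved / reference,
    }


if __name__ == "__main__":
    print(chatbot_estimate("TOOMI PRO", 100))
    # {'total': 500, 'reference': 2500, 'saved': 2000, 'pct_cheaper': 80.0}
```

For 100 users on TOOMI PRO this gives 500 EUR per month against a 2,500 EUR reference, which matches the -80% figure shown above. TOOMI PRO+ at the same seat count comes to 900 EUR per month, or 64% below the reference.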
H100 Reference Price (your cloud rate)
H100 SXM5 EUR/GPU-hr
EUR
Tonomia H100 reference: 2.70 EUR/GPU-hr
Precision – MI355X
Precision – B300
NVIDIA – Reference
H100 SXM5 (FP8)
TFLOPS FP8: 1,979 TFLOPS
Memory: 80 GB HBM3
Bandwidth: 3.35 TB/s
EUR / 1000 TFLOPS/hr
AMD – Tonomia 2.90 EUR/hr
MI355X (MXFP4)
TFLOPS MXFP4: 10,000 TFLOPS
Memory: 288 GB HBM3E
Bandwidth: 8.0 TB/s
EUR / 1000 TFLOPS/hr
NVIDIA – Tonomia 4.20 EUR/hr
B300 Blackwell Ultra (FP4)
TFLOPS FP4: 15,000 TFLOPS
Memory: 288 GB HBM3e
Bandwidth: 8.0 TB/s
EUR / 1000 TFLOPS/hr
Cost Efficiency vs H100
MI355X cheaper per 1000 TFLOPS
B300 cheaper per 1000 TFLOPS
EUR / 1000 TFLOPS/hr — lower is better
H100 FP8
MI355X
B300
Lower EUR/TFLOP = more raw AI compute per euro — directly impacting training time and cost per token.
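The EUR / 1000 TFLOPS/hr metric can be reproduced by dividing each hourly rate by the card's listed TFLOPS and scaling by 1000, as in the sketch below. It uses the rates and TFLOPS figures from the cards above, with the H100 at the 2.70 EUR reference rate. Note that, as on the page, the comparison mixes precisions (FP8 for the H100, FP4/MXFP4 for the others). The function names are hypothetical.

```python
def eur_per_1000_tflops_hr(eur_per_gpu_hr: float, tflops: float) -> float:
    """Hourly price normalised to 1000 TFLOPS of listed peak compute."""
    return eur_per_gpu_hr / tflops * 1000


def pct_cheaper(candidate: float, baseline: float) -> float:
    """How much cheaper the candidate is than the baseline, in percent."""
    return 100 * (1 - candidate / baseline)


# (EUR/GPU-hr, listed TFLOPS) taken from the cards above
CARDS = {
    "H100 SXM5 (FP8)": (2.70, 1_979),
    "MI355X (MXFP4)": (2.90, 10_000),
    "B300 (FP4)": (4.20, 15_000),
}

if __name__ == "__main__":
    baseline = eur_per_1000_tflops_hr(*CARDS["H100 SXM5 (FP8)"])
    for name, (rate, tflops) in CARDS.items():
        value = eur_per_1000_tflops_hr(rate, tflops)
        print(f"{name}: {value:.3f} EUR per 1000 TFLOPS/hr, "
              f"{pct_cheaper(value, baseline):.1f}% cheaper than H100")
```

At the default reference rate this works out to roughly 1.36 EUR for the H100, 0.29 EUR for the MI355X, and 0.28 EUR for the B300 per 1000 TFLOPS/hr, so both are close to 79% cheaper on this metric.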
DeepSeek V3 685B — Est. Cost per Million Tokens

Cost per MT is derived from the GPU-hour rate, the minimum number of GPUs needed to fit the model, and a bandwidth-weighted throughput estimate. The H100 price is adjustable above.

GPU · Precision · Min GPUs · EUR/GPU-hr · Cluster/hr · Throughput · EUR/MT · vs H100
* H100: 80 GB, min 17 GPUs (FP8). MI355X/B300: 288 GB, min 5 GPUs (FP8), 3 GPUs (FP4). Throughput: bandwidth-limited decode model.
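The table columns can be related to each other with a short sketch: Cluster/hr is the minimum GPU count times the hourly rate, and EUR/MT follows from dividing that by the tokens produced per hour. The page does not publish its throughput model, so the constant below (decode tokens per second per TB/s of aggregate memory bandwidth) is a made-up placeholder, as are the function names. Only the FP8 rows are shown, because this naive model does not account for the smaller weights at FP4.

```python
# Hypothetical constant: decode tokens/s per TB/s of aggregate memory bandwidth.
# It is a placeholder for illustration, NOT the calculator's actual model.
TOKENS_PER_SEC_PER_TBPS = 100.0


def cluster_eur_per_hr(num_gpus: int, eur_per_gpu_hr: float) -> float:
    """Hourly cost of the smallest cluster that fits the model."""
    return num_gpus * eur_per_gpu_hr


def bandwidth_limited_throughput(num_gpus: int, bandwidth_tb_s: float) -> float:
    """Tokens per second, assumed proportional to aggregate memory bandwidth."""
    return num_gpus * bandwidth_tb_s * TOKENS_PER_SEC_PER_TBPS


def eur_per_mt(cluster_rate: float, tokens_per_sec: float) -> float:
    """EUR per million tokens from hourly cluster cost and throughput."""
    return cluster_rate / (tokens_per_sec * 3600) * 1_000_000


# (GPU, min GPUs at FP8, EUR/GPU-hr, TB/s per GPU) from the footnote and cards
FP8_CONFIGS = [
    ("H100 SXM5", 17, 2.70, 3.35),
    ("MI355X", 5, 2.90, 8.0),
    ("B300", 5, 4.20, 8.0),
]

if __name__ == "__main__":
    for name, n, rate, bw in FP8_CONFIGS:
        hourly = cluster_eur_per_hr(n, rate)
        tps = bandwidth_limited_throughput(n, bw)
        print(f"{name}: {hourly:.2f} EUR/hr, {eur_per_mt(hourly, tps):.2f} EUR/MT")
```

The Cluster/hr figures depend only on numbers stated on the page (45.90 EUR for 17 H100s, 14.50 EUR for 5 MI355X, 21.00 EUR for 5 B300). The EUR/MT values depend entirely on the placeholder throughput constant and should be read as relative, not absolute.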
GPU: AMD MI355X 2.90 EUR/GPU-hr · NVIDIA B300 4.20 EUR/GPU-hr · H100 ref 2.70 EUR/GPU-hr. Token pricing from Tonomia reference table. Chatbot: TOOMI PRO 5 EUR · TOOMI PRO+ 9 EUR/user/month. Sources: tonomia.com/pricing · tonomia.com/tonoforge. Estimates are indicative; contact Tonomia for a tailored proposal.