Compact components for PowerPoint/Keynote slides
| Model | GPU | VRAM | TFLOPS | Price |
|---|---|---|---|---|
| AI Cube Pro TOP | RTX PRO 6000 | 96 GB | 125 | from €13,599 |
| AI Cube Custom | Multi-GPU | ∞ | ∞ | On request |
| ☁️ Cloud (ChatGPT etc.) | 🖥️ AI Cube | |
|---|---|---|
| Data Privacy | ❌ Data with third party | ✓ 100% local |
| GDPR | ❌ Problematic | ✓ Fully compliant |
| Costs | ❌ Per token / month | ✓ One-time |
| Availability | ❌ Internet dependent | ✓ Always available |
| Vendor Lock-in | ❌ Yes | ✓ No |
| Specification | Basic | Pro |
|---|---|---|
| GPU | RTX PRO 4000 | RTX PRO 6000 |
| VRAM | 24 GB GDDR7 | 96 GB GDDR7 |
| TFLOPS | 46.9 | 125 |
| Max. Model Size | ~20B Parameters | 120B+ Parameters |
| SSD | 1 TB NVMe | 2 TB NVMe |
| Price | from €4,299 | from €13,599 |
On-Premise AI Server for Enterprises
Local LLM Inference • No token costs • 100% data privacy
The On-Premise AI Server for GDPR-compliant enterprises
GPT-5 mini • Input:Output = 3:1
GPT-5 mini @ 500 Output Tokens/s (24/7 Operation)
GPT-5 mini vs. AI Cube Pro
Full-stack IT services for modern businesses
Become an AI hardware provider
End-to-end full-service
Designed for environments with strict requirements
| Model | AI Cube Pro (96GB) | AI Cube Pro (96GB) |
|---|---|---|
| GPT-OSS 20B | 50 t/s1 User | 200 t/s1 User |
| GPT-OSS 120B | —not enough VRAM | 150 t/s 1 User 1050 t/s 20 Users, small ctx 300-500 t/s 20 Users, mixed ctx |
| Qwen3-30B-A3B FP8 | —not enough VRAM | 90-150 t/s 1 User, 1k ctx 650 t/s 10 Users, 1k ctx 22 t/s 1 User, 256k ctx 115 t/s 10 Users, 256k ctx 311 t/s 10 Users, 64k ctx 413 t/s 10 Users, 32k ctx |
| Model | AI Cube Pro | |
|---|---|---|
| LLaMA 3 (8B Q4_K_M) | 60 t/s1 User | 130 t/s1 User |
Screenshots of this page can be used directly in presentations.
Tip: Use browser DevTools (F12) → Device Toolbar for exact sizes