Enterprise Model
GDPR Compliant
NVIDIA RTX Blackwell
Made in Germany

AI Cube Pro – Premium performance for your AI infrastructure

High-end AI inference with the NVIDIA RTX PRO 6000 Blackwell: suited to large LLMs, RAG systems with millions of documents, training and fine-tuning, and models with up to 120B+ parameters.

230 V • 292×185×372 mm • Mini-ITX

View Basic Model

Technical Highlights RTX PRO 6000 Blackwell

Enterprise hardware with 96 GB VRAM for maximum performance

NVIDIA RTX PRO 6000 Blackwell

96 GB GDDR7 VRAM

Sufficient for models up to 120B+ parameters

125 TFLOPS FP32

24,064 CUDA Cores

Maximum performance for large LLMs
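As a rough plausibility check for the "up to 120B+ parameters" figure, the memory needed for model weights can be estimated as parameters times bits per weight. The sketch below is a back-of-the-envelope calculation, not a sizing tool: the function names and the `headroom_gb` value are assumptions made for illustration, and the estimate ignores KV cache growth, activations and framework overhead.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in decimal GB."""
    bytes_total = params_billion * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9


def fits_in_vram(params_billion: float, bits_per_weight: float,
                 vram_gb: float = 96.0, headroom_gb: float = 8.0) -> bool:
    """True if the weights fit while leaving headroom for KV cache and runtime.

    headroom_gb is a hypothetical placeholder, not a measured figure.
    """
    return weight_memory_gb(params_billion, bits_per_weight) + headroom_gb <= vram_gb


# Compare a few model sizes and quantization levels against 96 GB of VRAM.
for name, params, bits in [("120B at 4-bit", 120, 4),
                           ("70B at 8-bit", 70, 8),
                           ("70B at 16-bit", 70, 16)]:
    print(f"{name}: {weight_memory_gb(params, bits):.0f} GB weights, "
          f"fits in 96 GB: {fits_in_vram(params, bits)}")
```

Under these assumptions, a 120B model quantized to about 4 bits per weight needs roughly 60 GB for its weights and fits, while a 70B model at 16 bits needs roughly 140 GB and does not. Whether a given model fits in practice depends on its quantization and context length.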

Use cases for large LLMs and RAG systems

Internal Chatbots

Run AI assistants for customer service or internal knowledge bases – completely local and GDPR-compliant.

Code Assistance

Use models like Qwen or DeepSeek for code completion, review and documentation – without sending your codebase to the cloud.

Small to Medium Models

Llama 3.1 8B, Gemma 3, Mistral 7B, Phi-4 and many other models.

Large LLMs (70B-120B+)

Run models like Llama 3.1 70B, distilled DeepSeek-R1 variants or GPT-OSS 120B completely locally.

RAG Systems

Process knowledge bases with thousands to millions of documents.

Multi-Model Operation

Run multiple models in parallel – depending on hardware configuration.

Performance Benchmarks

Datacenter Performance for Your Office

Enterprise performance of AI Cube Pro with large open-source models

GPT-OSS 20B

~20 billion parameters

200 tokens/s

Batch Size 1

GPT-OSS 120B

~120 billion parameters

150 tokens/s

Batch Size 1

All values were measured with batch size 1 and represent inference speed for interactive use cases. Actual performance may vary depending on model configuration and prompt length. Higher batch sizes increase throughput for parallel requests.
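To put the batch-size-1 figures into perspective, the time to generate an answer is roughly the number of output tokens divided by the token rate. The sketch below applies that division to the two benchmark values above. The 600-token answer length is an arbitrary example, and the estimate deliberately ignores prompt processing and time to first token.

```python
def generation_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Approximate time to generate output_tokens at a steady token rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


# Token rates taken from the benchmark figures above (batch size 1).
BENCHMARKS = {"GPT-OSS 20B": 200.0, "GPT-OSS 120B": 150.0}

for model, rate in BENCHMARKS.items():
    seconds = generation_seconds(600, rate)
    print(f"{model}: about {seconds:.1f} s for a 600-token answer")
```

With these numbers, a 600-token answer takes about 3 seconds on GPT-OSS 20B and about 4 seconds on GPT-OSS 120B, before accounting for prompt length.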

Local AI Usage

Local GPT with our AI Cube

Use Open WebUI for a ChatGPT-like experience – completely local on your own hardware

On request, the AI Cube is delivered with Open WebUI, an intuitive, user-friendly interface that provides a local ChatGPT-like experience. No cloud dependency, no API keys, no token limits: just you and your AI models.

ChatGPT-like Interface

Familiar and intuitive user interface for natural conversations with your local AI models

Completely Local

All data and conversations stay on your hardware – no connection to external servers required

Multi-Model Support

Switch seamlessly between different AI models within the same interface

No Token Fees

Unlimited usage without pay-per-use fees or monthly API costs

Open WebUI can be pre-installed and delivered ready to use upon request. Simply plug in, power on, and immediately interact with your local AI models – like ChatGPT, but completely under your control.

Pre-installed
Ready to use immediately
100% local

Enterprise & Pro Service

On-Site Service for Maximum Security & Comfort

For our AI Cube Pro customers, we offer personal delivery and professional commissioning in Germany and the Netherlands. For Enterprise customers, this service is available Europe-wide.

Secure Delivery

Directly to your company premises or to your customers – personally

Physical Installation

Professional installation and cabling on-site

Initial Setup

Operating system, GPU drivers, container environment and security configuration (VPN, firewall, backup)

Validation & Acceptance

Performance test, stability check and GDPR compliance review before commissioning

All-Inclusive Package

For Enterprise & Pro Customers

Our on-site service ensures that your AI Cube runs optimally from the start – without you having to worry about installation or configuration.

Perfect for companies that value:

Highest quality standards
Compliance & Data Protection
Clean Integration
AI Cube Pro: DE & NL
Enterprise: Europe-wide

Enterprise advantages with AI Cube Pro and 96 GB VRAM

Maximum Performance

125 TFLOPS FP32 and 96 GB VRAM from NVIDIA's flagship Blackwell workstation GPU for local inference.

Enterprise Data Sovereignty

Even the largest models and extensive RAG systems remain completely in your network.

Future-proof

With 96 GB VRAM you're equipped for the coming years – even for future model generations.

Pro vs. Basic – which model fits?

Compare the two AI Cube models

YOU ARE HERE

AI Cube Pro

  • NVIDIA RTX PRO 6000 Blackwell
  • 96 GB VRAM
  • Models up to 120B+ parameters
  • Ideal for large LLMs, RAG & training

From €13,599.90

excl. VAT

ENTRY

AI Cube Basic

  • NVIDIA RTX PRO 4000 Blackwell
  • 24 GB VRAM
  • Models up to 20B parameters
  • Ideal for chatbots & code assistance

From €4,299.90

excl. VAT

View Basic Model

Case Study: Healthcare Facility

How a private clinic uses AI Cube Pro for medical knowledge bases

Challenge

A network of private psychiatric clinics needed an AI solution for the central knowledge base with medical protocols, SOPs and training materials. Sensitive patient data could not go to the cloud.

Solution with AI Cube Pro

  • RAG system with Llama 3.1 70B for complex medical queries
  • Integration with BookStack as knowledge source (custom development)
  • Completely local operation with managed service by WZ-IT
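The retrieval step of a RAG setup like the one listed above can be illustrated with a toy example. This is not the clinic's implementation: a production system would typically use an embedding model and a vector store, whereas this sketch swaps in simple word-overlap cosine similarity so that it runs with the standard library alone. The page titles and texts in `PAGES` are hypothetical stand-ins for knowledge base content.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> Counter:
    """Lowercase bag-of-words term counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def retrieve(query: str, documents: dict[str, str], k: int = 2) -> list[str]:
    """Return the titles of the k documents most similar to the query."""
    q = tokenize(query)
    ranked = sorted(documents,
                    key=lambda title: cosine(q, tokenize(documents[title])),
                    reverse=True)
    return ranked[:k]


def build_prompt(query: str, documents: dict[str, str], k: int = 2) -> str:
    """Assemble an LLM prompt from the retrieved pages and the question."""
    context = "\n\n".join(f"[{title}]\n{documents[title]}"
                          for title in retrieve(query, documents, k))
    return f"Answer using only the context below.\n\n{context}\n\nQuestion: {query}"


# Hypothetical pages standing in for a knowledge base export.
PAGES = {
    "Hand hygiene SOP": "Disinfect hands before and after every patient contact.",
    "Fire safety training": "Evacuation routes and fire extinguisher locations for all staff.",
    "Medication storage protocol": "Store medication in locked cabinets and log every access.",
}
```

Calling `build_prompt("How should medication be stored?", PAGES, k=1)` produces a prompt that contains only the medication storage page, which would then be sent to the locally hosted model.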

Result

Immediate access to relevant protocols

Cross-location knowledge consistency

Complete GDPR compliance

Technical Specifications

Graphics Card: NVIDIA RTX PRO 6000 Blackwell (96 GB GDDR7)
Network: 1 GbE (10 GbE optional)
Dimensions & Weight: 292×185×372 mm (H×W×D), approx. 8 kg
Certification: CE, RoHS, GDPR-compliant
Security: Secure Boot, TPM 2.0, WireGuard VPN

Included in Delivery

Pre-installed Software (Ollama, vLLM, Open WebUI)
Operating System & GPU Drivers
Setup Documentation
Root Access & Full Control
German Support
No recurring costs
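Since Ollama is among the pre-installed software, local models can also be queried from scripts rather than only through the web interface. The sketch below builds a request for Ollama's `/api/generate` endpoint. It assumes Ollama listens on its default port 11434 on the same machine and that a model tagged `llama3.1:70b` has been pulled; both are assumptions about the setup, not confirmed delivery defaults.

```python
import json
import urllib.request

# Assumption: Ollama runs locally on its default port; adjust host and port as needed.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(prompt: str,
                           model: str = "llama3.1:70b") -> urllib.request.Request:
    """Build a non-streaming generate request (the model tag is an assumption)."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt: str, model: str = "llama3.1:70b") -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_generate_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `ask("Summarize our onboarding checklist.")` returns the model's answer as a string. The request never leaves the machine, which matches the local-only operation described on this page.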

Frequently Asked Questions about AI Cube Pro

Ready for enterprise AI infrastructure?

Get free consultation

Let's Talk About Your Idea

Whether a specific IT challenge or just an idea – we look forward to the exchange. In a brief conversation, we'll evaluate together if and how your project fits with WZ-IT.

Trusted by leading companies

  • Keymate
  • SolidProof
  • Rekorder
  • Führerscheinmacher
  • ARGE
  • NextGym
  • Paritel
  • EVADXB
  • Boese VA
  • Maho Management
  • Aphy
  • Negosh
  • Millenium
  • Yonju
  • Mr. Clipart

Timo Wevelsiep & Robin Zins

CEOs of WZ-IT
