24.11.2025
GPT-OSS 120B on AI Cube Pro: Run OpenAI's Open-Source Model Locally
In August 2025, OpenAI released GPT-OSS 120B, its first open-weight model since GPT-2, and it's impressive. The model achieves near o4-mini performance but...

Qdrant is an open-source vector database for semantic search, RAG and AI applications. Documents, tickets, product data or knowledge bases become searchable as vectors.

We operate Qdrant as a production retrieval layer: with clean collection design, embedding pipelines, payload filters, backup strategy, monitoring and integration into AnythingLLM, Open WebUI, Langfuse and custom applications.
RAG rarely fails at the chat frontend. It fails because of poor data preparation, wrong embeddings, missing filters and unstable operations. This is exactly the layer we make reliable.
Qdrant is licensed under Apache-2.0, making it a strong fit for open, business-critical AI stacks without BSL traps.
Semantic search across documents, tickets, product catalogs or internal knowledge bases with modern embedding models.
Qdrant provides the relevant context for chatbots, assistants and internal AI applications.
Combine vector search with payload filters, metadata, tenant logic and domain-specific constraints.
We plan storage, snapshots, replicas, collection strategy and monitoring around data volume and response-time targets.
Run in controlled infrastructure with backups, access control, network separation and clean data classification.
Connect to AnythingLLM, Open WebUI, custom apps, ETL processes and embedding pipelines for production RAG systems.
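To make the combination of vector search with payload filters and tenant logic concrete, here is a minimal sketch of a request body for Qdrant's REST search endpoint (`POST /collections/<name>/points/search`). The payload keys `tenant_id` and `doc_type` and the helper name `build_search_request` are assumptions for illustration, not part of any existing setup; check the field names against the Qdrant version you run.

```python
import json


def build_search_request(vector, tenant_id, doc_type=None, limit=5):
    """Build a JSON-serializable body for a filtered Qdrant vector search.

    `tenant_id` and `doc_type` are hypothetical payload keys; use the keys
    your own collection actually stores.
    """
    # Every condition in "must" has to match for a point to be returned.
    must = [{"key": "tenant_id", "match": {"value": tenant_id}}]
    if doc_type is not None:
        must.append({"key": "doc_type", "match": {"value": doc_type}})
    return {
        "vector": list(vector),    # query embedding
        "filter": {"must": must},  # payload filter applied during search
        "limit": limit,            # number of hits to return
        "with_payload": True,      # include stored metadata in the response
    }


body = build_search_request([0.12, 0.87, 0.33], tenant_id="acme", doc_type="ticket", limit=3)
print(json.dumps(body, indent=2))
```

Putting the tenant condition into the filter of every request, rather than filtering results afterwards in the application, keeps one tenant's documents out of another tenant's result set even when the limit is small.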
Qdrant stores embeddings and metadata so AI applications can find relevant content quickly and in a controlled way.
Good retrieval decides whether a RAG system gives reliable answers or merely dresses up hallucinations.
We operate Qdrant with monitoring, snapshots, resource planning and clear processes for growing data sets.
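As a rough illustration of what retrieval over embeddings and metadata means, the following toy sketch ranks a few hand-written vectors by cosine similarity and keeps only entries whose metadata matches. It is plain Python and not Qdrant's API; the vectors, titles and the `team` field are invented for the example, and a real system would use an embedding model and an indexed store.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(points, query, limit=2, **required):
    """Rank points by similarity to `query`, keeping only points whose
    payload contains every key/value pair given in `required`."""
    hits = [
        (cosine(p["vector"], query), p["payload"]["title"])
        for p in points
        if all(p["payload"].get(k) == v for k, v in required.items())
    ]
    return sorted(hits, reverse=True)[:limit]


# Invented example data: three "documents" with 3-dimensional embeddings.
POINTS = [
    {"vector": [0.9, 0.1, 0.0], "payload": {"title": "VPN setup guide", "team": "it"}},
    {"vector": [0.8, 0.2, 0.1], "payload": {"title": "Firewall ticket", "team": "it"}},
    {"vector": [0.0, 0.1, 0.9], "payload": {"title": "Travel policy", "team": "hr"}},
]

results = search(POINTS, [1.0, 0.0, 0.0], team="it")
for score, title in results:
    print(f"{score:.3f}  {title}")
```

The metadata check runs before ranking, so a document from the wrong team never reaches the result list regardless of how similar its vector is.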
Enterprise-ready open source for production workloads: we run your applications with high security standards and enterprise support.
Open-source software for business-critical processes requires professional maintenance, continuous updates and enterprise-grade support. We handle hosting and operation of Qdrant on our GDPR-compliant infrastructure in Germany (or optionally in your cloud), including backups, SLAs, phone support and a personal point of contact, so that you can focus on your core business.
We also offer tailored hosting and development solutions for your specific requirements around Qdrant. Contact us for an individual quote.
From fully managed GPU servers to compact AI Cubes - we provide the ideal infrastructure for your local LLM applications.
Powerful GPU servers with dedicated hardware for compute-intensive LLM workloads. Fully managed, scalable, and optimized for maximum performance.
Compact AI workstation for local LLM inference. Perfect for office environments, with top-tier performance and absolute data sovereignty.
Good choice: we'll help you get started and support you with ongoing operations.
As a Managed Service customer at WZ-IT, you have access to our exclusive portal: Monitor your infrastructure in real-time, schedule maintenance, request quotes, and get direct support - all in one central location.

09.11.2025
In times of rising cloud costs, data sovereignty challenges and vendor lock-in, the topic of local AI inference is becoming increasingly important for companies. With...
08.11.2025
The use of Large Language Models (LLMs) such as GPT-4, Claude or Llama has evolved from experimental applications to mission-critical tools in recent years. However,...
Proof for production deployments, architecture decisions and ongoing operations around modern software stacks.
Whether a specific IT challenge or just an idea - we look forward to the exchange. In a brief conversation, we'll evaluate together if and how your project fits with WZ-IT.
Timo Wevelsiep & Robin Zins
Managing Directors of WZ-IT

