Lightweight, secure and ephemeral data platform.

Simple enough for developers, sophisticated enough for enterprises, open enough for innovation, governed enough for compliance.

What is Kubox?

Bring your own cloud, no Terraform or Kubernetes required.

The first ephemeral data platform: deploy customisable data infrastructure for AI/ML workloads in 5 minutes with enterprise security and 70% lower costs.

It is powered by Talos Linux and Pulumi.

Batteries Included

Frictionless start, no Kubernetes or professional services required for installation.

Short feedback loop

Build on laptops and deploy on clusters. Rinse and repeat.

Multi-Cloud

Run in any cloud, fog or edge. Burst from on-prem to the cloud.

Best-of-breed

Curated open source data & AI tools, working out of the box. Extend it with your preferred tools.

Air-gapped

Secure-by-default, self-hosted services, including LLMs (coming soon).

Compliant

IRAP-assessed (coming end of 2025).

Deploy a complete data stack in 5 minutes. It includes Kubernetes and curated tools like Dagster, Dask, and Ray for AI/ML and agentic workloads.

curl https://kubox.sh | sh

What is Kubox CLI?

A free command line tool.

Blazing-fast Kubernetes and a composable data and AI stack

An on-demand, repeatable and portable data platform as a service for everyone, from individual engineers to enterprises. It is unbelievably powerful and future-proofed.

Distributed Compute

Use Dask, Spark and federated query engines like Trino for distributed data processing.

Data Pipelines

Data pipelines written in Python with Dagster, tracked in version control and packaged as containers.

Model Serving

ML models deployed and served with Ray Serve, a framework-agnostic Python library that supports model composition, multiplexing, and fractional GPUs.

Vector Database

Use Qdrant, a self-hosted open-source vector database, for semantic search and information extraction from unstructured data.
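
For readers new to vector search, here is a minimal sketch of the idea behind it: rank stored vectors by cosine similarity to a query vector. It uses only the Python standard library and is not Qdrant's API. The function names, document IDs and three-dimensional "embeddings" are made up for illustration; real embeddings come from an embedding model and have far more dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def search(index, query_vector, top_k=2):
    """Return the top_k (doc_id, score) pairs, most similar first."""
    scored = [
        (doc_id, cosine_similarity(vector, query_vector))
        for doc_id, vector in index.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical toy index: document ID -> embedding vector.
index = {
    "invoice-policy": [0.9, 0.1, 0.0],
    "holiday-policy": [0.1, 0.9, 0.0],
    "gpu-runbook": [0.0, 0.2, 0.9],
}

results = search(index, [0.8, 0.2, 0.0], top_k=2)
for doc_id, score in results:
    print(doc_id, round(score, 3))
```

A vector database does the same ranking at scale, typically with approximate nearest-neighbour indexes rather than the full scan shown here.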

LLMs & RAG

Build self-hosted Retrieval Augmented Generation (RAG) pipelines to access your organisation's past and current knowledge in the form of multi-modal data.
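
As a rough sketch of what a RAG pipeline does, the example below retrieves the passage most relevant to a question and places it in the prompt that would be sent to an LLM. It assumes text-only data and scores relevance by simple word overlap, standing in for the embedding-based retrieval a real pipeline would use. The function names and sample documents are hypothetical, and the call to the LLM itself is left out.

```python
def tokenize(text):
    """Lower-case a text and split it into a set of whitespace-separated words."""
    return set(text.lower().split())


def retrieve(question, documents, top_k=1):
    """Rank documents by how many words they share with the question."""
    question_words = tokenize(question)
    ranked = sorted(
        documents,
        key=lambda doc: len(question_words & tokenize(doc)),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(question, passages):
    """Augment the question with the retrieved passages as context."""
    context = "\n".join(f"- {passage}" for passage in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )


# Hypothetical knowledge base for illustration only.
documents = [
    "Expense claims must be filed within 30 days.",
    "The staging cluster is rebuilt every night.",
]

question = "When is the staging cluster rebuilt?"
passages = retrieve(question, documents, top_k=1)
prompt = build_prompt(question, passages)
print(prompt)
```

The resulting prompt is what a self-hosted model would receive, so the answer is grounded in the organisation's own documents rather than in the model's training data alone.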

Anything else

If it runs on Kubernetes, it will most likely run on Kubox.

Make data infrastructure simple, secure and accessible to everyone, anywhere.

“There is no simple, composable and vendor-neutral data platform that runs anywhere. kubox is our solution to make data infrastructure simple and boring. So you spend less time on tedious configurations and focus instead on building awesome data products.”

Chinkit Patel

Founder, Kubox AI