Redpanda and Databricks

Open Lakehouse Streaming powered by Redpanda and Databricks

Get a high-performance, low-latency streaming and analytics powerhouse that turns streaming data into real-time insights, without the complexity of traditional Kafka infrastructure.

Logomark of Databricks
How it works

Redpanda and Databricks working together

Why it matters

Why use Redpanda with Databricks?

Redpanda captures streaming data the moment it’s created, while Databricks enriches, analyzes, and serves it at lakehouse scale. The result is an open lakehouse you can trust for machine-learning models, dashboards, and customer-facing apps.

Drop-in Kafka compatibility

Keep existing producers, consumers, and tooling—no refactoring required.

Lightweight, no-code connectors

YAML-based configs spin up pipelines in minutes while using up to 3× less compute.

Bring Your Own Cloud

Run a fully managed streaming service inside your own AWS, Azure, or GCP account for complete data sovereignty.

Iceberg Topics for instant SQL

Query live streams as Apache Iceberg tables - accessible from Databricks SQL, Spark, or any Iceberg-compatible engine.

Start streaming in minutes

Ready to build your streaming lakehouse?

Get a high-performance, low-latency streaming and analytics powerhouse that turns streaming data into real-time insights, without the complexity of traditional Kafka infrastructure.