Redpanda Unveils State of Streaming Data Report on Industry Trends, Ecosystem Components and Challenges

Based on an extensive, third-party survey of streaming data-savvy engineers, research shows real-time analytics and AI/ML are driving the majority of streaming data adoption

November 15, 2023

San Francisco, CA (November 15, 2023)

Streaming data pioneer Redpanda today released its inaugural State of Streaming Data Report to shed light on the trends, use cases, data volumes, technology stack and technical and business challenges in the rapidly growing streaming data ecosystem. The report shares what’s driving companies to migrate their systems from batch processing to real-time systems and the challenges they face. Based on a third-party survey of 300 engineering organizations familiar with streaming data, the report is the first comprehensive, independent study on the current state of the streaming data industry.

Organizations today increasingly look to enrich their applications, analytics platforms, and AI/ML models with real-time data, which requires a transformation from traditional batch processing to streaming data systems that process and analyze gigabytes of data per second. In these streaming data pipelines, real-time data flows continuously as it’s generated from sources such as sensors, devices and applications. However, streaming data adoption varies across industries and streaming data systems can be challenging to implement, manage and scale for the unprepared.

"At Redpanda, our mission has always been to make real-time data easier to consume," said Tristan Stevens, Director of Customer Success at Redpanda. “Toward that goal, this report offers insight into the transformative capabilities of streaming data. We commissioned the survey as a resource for comparison by organizations that have already adopted streaming data and a guide for those that are now evaluating this important technology.”

The report was based on responses from 300 engineering organizations familiar with streaming data, captured in a survey conducted by insights-driven strategy firm Material. Roughly 75% of survey participants were at various stages of adoption, providing a comprehensive overview of both established and emerging use cases.

"As we see in most industries, AI and machine learning are going to shake up how companies operate, and streaming data amplifies the leverage that can be gained from these powerful tools in real-time for bigger impact," said Hilary DeCamp, Material’s Chief Methodologist.

Key findings include:

  • Real-time analytics and AI are driving streaming data adoption – Survey respondents expressed that real-time analytics (71%) is the leading current use case for adopting streaming data systems. Looking forward, nearly three-in-four respondents cited that development of AI/ML systems will be the biggest driver of streaming data adoption in the next 12-24 months.
  • Data privacy and technical skills are barriers to adoption – Perceived technical challenges for adopting streaming data are led by concerns for data privacy (42%) and data consistency (35%). Perceived business challenges are centered around the cost (36%) of these systems and the in-house technical skills required to be successful with streaming data systems (34%).
  • Companies are running both analytical and transactional workloads – The majority (58%) of current streaming data users are running both transactional and analytical workloads. Nearly all users expect to see an increase in the amount of real-time data they stream for analytical (71%) and transactional (81%) workloads.
  • Streaming data environments are hybrid – Organizations consistently reported navigating multiple platforms, encompassing both Apache Kafka®-compatible and Kafka non-compatible solutions. More than half of current users stated that their data streaming infrastructure is hosted on VMs or containers and is located in a hybrid environment. AWS (57%) and Microsoft Azure (57%) were the most common cloud providers selected.

For a deeper dive into streaming data trends, download the full report at https://redpanda.com/state-of-streaming-data-report-2023-24.

Additional topics covered in the full report include:

  • Typical message throughput for transactional and analytical workloads
  • Daily volume of streaming data based on workload types
  • Data retention policies in use
  • Most popular core components of streaming data pipelines
  • Most used client libraries, data processing tools and formats

About Redpanda

Redpanda is the streaming data platform for developers. API-compatible with Apache Kafka®, Redpanda introduces a breakthrough architecture and disruptive capabilities that make it a simple, fast, reliable, and unified engine of record for both real-time and historical enterprise data. Innovators like Lacework, Jump Trading, Vodafone, Moody’s, Hotels Network and Alpaca rely on Redpanda to process hundreds of terabytes of data a day. Backed by premier venture investors Lightspeed, GV and Haystack VC, Redpanda is a diverse, people-first organization with teams distributed around the globe. To learn more, visit redpanda.com and follow the company on Twitter at @redpandadata.

About Material

Material is a global strategy partner to the world’s most recognized brands and innovators. Deeply connected to markets, culture and people through behavioral science and robust insights capabilities, Material helps companies realize critical outcomes across their growth and innovation agendas. We combine deep human insights with design and enabling technology – using a proprietary Science + Systems approach that speeds engagement and growth. We design and build customer-centric business models and experiences that create transformative relationships between businesses and the people they serve. Learn more at www.materialplus.io.

Explore more from Redpanda

Real-World Impact,
Proven Results

Learn how enterprises like and others across key industries use Redpanda to unlock performance, resilience, and cost savings.

Explore Redpanda solutions—from financial services and telecommunications to real-time analytics and event-driven architectures. Discover how Redpanda can help you build faster, more resilient, and cost-efficient streaming pipelines for your specific challenges.

View all customer stories
No items found.
Retail & E-Commerce
Retail & E-Commerce
Simplify real-time data streaming for retail and e-commerce

Unlock growth opportunities in retail and e-commerce with Redpanda’s high-speed, low-latency data streaming platform. Simplify inventory management, deliver personalized customer experiences, and drive revenue—without the complexities of Kafka.

Software & Technology
Software & Technology
Drive new revenue with real-time data. Powerful, simple, reliable.

Unlock real-time revenue opportunities with Redpanda’s powerful, Kafka-compatible streaming platform. Deliver personalized user experiences, automate billing, and simplify integrations—faster and more efficiently than ever.

Finance
Finance
Simplify real-time data streaming for financial services

Explore how Redpanda's powerful data streaming platform enables real-time insights, operational efficiency, and innovation in the financial services industry.

Use Case
Streaming Iceberg Tables

Revolutionize data management with Apache Iceberg Topics in Redpanda. Enable instant analytics on streaming data, simplify ETL, and integrate seamlessly with analytics tools.

Use Case
Snowflake Connector for Redpanda Connect

The Snowflake Connector for Redpanda Connect offers the easiest, no-code configuration-driven approach to stream data from anywhere into Snowflake using Snowpipe Streaming.

Use Case
Streaming Data Lakehouse

Build real-time cloud security solutions with Redpanda’s high-performance, Kafka-compatible platform for instant threat detection, analysis, and scalability.

READY TO START?

Take Redpanda for a spin!

Ready to supercharge performance and simplify data pipelines? Give Redpanda a try or get in touch with us to learn more!