When Flink Meets Fluss: The Future of Streaming Warehouse

Jark Wu

Chinese Session #streaming

Kafka and Flink have been widely used together in streaming processing scenarios, becoming a defacto standard paradigm for building streaming warehouses and real-time analytics. However, it still faces many challenging issues that are hard to resolve. This session will explore the challenges and problems this paradigm faces in streaming analytics.

We will first discuss the limitations and pain points of Kafka when used with Flink. Then we will introduce Fluss, a next-generation streaming storage designed for streaming analytics. We’ll walk through its architecture and core innovations, highlighting how it seamlessly integrates with Flink to power the next generation of streaming warehouses. You’ll discover the game-changing capabilities unlocked by combining Flink and Fluss, such as streaming column pruning, delta joins, union reads, and merge engines.

Finally, we’ll explore real-world use cases of Flink + Fluss, showcasing how this powerful combination delivers true benefits like reduced infrastructure costs, improved performance, and enhanced stability for large-scale streaming and batch workloads.

====================================================

NOTE FOR THE COMMITTEE: Fluss is currently an open-source project under the Alibaba organization and is planned to be donated to the Apache Software Foundation in May 2025. If all goes as planned, it will likely be an Apache incubating project by the time the conference takes place.

Speakers:


Jark Wu is a committer and PMC member of Apache Flink. He leads both the Fluss and Flink SQL teams at Alibaba Cloud. With a decade of experience in Flink, he has been deeply involved in developing and evolving Flink SQL from zero to now. Throughout this journey, he has also initiated and incubated Flink CDC and Fluss projects, further expanding the Flink ecosystem.