Supercharge Lakehouse Implementation with Apache Iceberg

Bill Zhang

English Session 2025-07-26 16:15 GMT+8 (ROOM : WanChun Hall) #datalake

The modern data lakehouse architecture combines the best of data lakes and data warehouses, enabling scalable analytics with ACID transactions, schema evolution, and performance optimizations. Apache Iceberg has emerged as a leading open-table format that supercharges lakehouse implementations by providing reliability, scalability, and seamless integration with popular open source compute engines like Spark, Flink, Doris, StarRocks, Impala, Hive, Nifi, Kafka and Trino.

In this session, we will explore how Apache Iceberg enhances lakehouse architectures by ensuring data reliability, optimizing performance, enabling multi-engine compatibility and simplifying maintenance, In addition, we’ll also discuss real-world use cases, best practices for migrating Hive tables to Iceberg tables, and how to leverage its features to build a high-performance, future-proof lakehouse.

Speakers:

Bill Zhang: Cloudera, Lakehouse , Iceberg Integration

Bill is Vice President of product strategies at Cloudera, responsible for Open Data Lakehouse product strategy and Apache Iceberg integration with all Cloudera Data Platform (CDP) form factors. Most recently, Bill also leads Apache Hive product roadmap and adoption.