Streaming to Apache Iceberg Tables
Alex Merced

Streaming data is at the heart of many modern data platforms, but building a robust, scalable streaming architecture can be complex. In this talk, we’ll explore how to design and implement efficient streaming pipelines with Apache Iceberg, an open table format that brings reliability and flexibility to data lakes.

You’ll learn how to:

  • Identify your organization’s streaming requirements and architecture patterns
  • Integrate key tools like Apache Flink, Apache Kafka, Debezium, Kafka Connect, and Apache Spark for end-to-end data movement and transformation
  • Manage compaction and delete files in Iceberg to maintain performance and consistency
  • Streamline real-time analytics, machine learning, and data lakehouse ingestion with Iceberg

Whether you’re modernizing legacy ETL or scaling up real-time data systems, this session offers best practices and design patterns for streaming to Apache Iceberg tables efficiently and reliably.

Alex Merced
Head of DevRel, Dremio

Alex Merced is Head of DevRel for Dremio and co-author of "Apache Iceberg: The Definitive Guide" from O'Reilly. He has worked as a developer and instructor for companies like GenEd Systems, Crossfield Digital, CampusGuard, and General Assembly. Alex is passionate about technology and has published tech content through blogs, videos, and his podcasts Datanation and Web Dev 101. He has also contributed a variety of libraries in the JavaScript and Python worlds, including SencilloDB, CoquitoJS, dremio-simple-query, and more.
