Flink changelog modes and why should you care about them?

Session Overview

Unlock the secrets of Flink's changelog modes to enhance SQL performance, optimize data pipelines, and reduce latency for real-time processing.

TL;DR

Understanding Flink's changelog modes is crucial because even minor SQL changes can significantly impact state size, latency, network use, and compute requirements. The session detailed how different operators use and produce changelog modes and provided strategies for optimizing data pipelines. By mastering these concepts, data streaming practitioners can enhance the efficiency and performance of their streaming applications.

Opening

Imagine running a simple SQL query that unexpectedly inflates your system's latency and state size. In Apache Flink, such surprises often stem from changelog modes—Flink's way of tracking table changes. Changelog modes are crucial yet often overlooked until they cause performance issues. This session shed light on these modes, providing insights into how to manage them effectively for optimal data streaming performance.

What You'll Learn (Key Takeaways)

Understanding Changelog Modes – Flink uses four changelog modes: insert, update before, update after, and delete. Each mode impacts how data is processed and stored.
Optimizing Data Pipelines – By understanding which operators consume and produce specific changelog modes, practitioners can optimize queries to reduce state size and network traffic.
Real-world Application – Implementing upsert modes can save network traffic by eliminating unnecessary updates, crucial for high-performance streaming applications.
Advanced Techniques – Recent improvements in Flink allow transitions between modes, offering greater flexibility in managing data streams.

Q&A Highlights

Q: Are there any failure patterns or pitfalls you've seen from customers? A: Customers often encounter confusing errors related to unsupported changelog modes in SQL queries. Understanding these modes helps interpret such errors and plan queries effectively.

Q: Will you be publishing this info in a blog? A: While there are no immediate plans, the interest is noted, and there may be future blog posts detailing individual operators and their internal workings.

Q: How can understanding Flink's internal operations help with state size issues? A: Knowing about intermediate operators like changelog normalize can explain unexpected state size growth and guide query optimization to mitigate costs.

About Speaker

Bonnie Varghese Bonnie Varghese is a Software Engineer at Confluent where he is part of the Flink SQL & Metastore team. He’s spent the last several years working on open-source big data systems and leveraged streamin...

Jim Hughes Jim Hughes applies training in mathematics and computer science to build distributed, scalable systems capable of supporting streaming data science and machine learning. Jim has joined Confluent and i...

Flink changelog modes and why should you care about them?

Session Overview

TL;DR

Opening

What You'll Learn (Key Takeaways)

Q&A Highlights

Related Resources

Announcing Data Streaming Summit 2026: The Data Streaming + Agent Infra Conference

Introducing Scalable Topics in Apache Pulsar 5.0

Introducing StreamNative Agent Skills: From Connectivity to Expertise for the Agentic Era

Make Your Data Ready and Safe for Agentic AI