Flink Backward - The Apache Flink Retrospective

November 3, 2021 - Johannes Moser

It has now been a month since the community released Apache Flink 1.14 into the wild. We had a comprehensive look at the enhancements, additions, and fixups in the release announcement blog post, and now we will look at the development cycle from a different angle. Based on feedback collected from contributors involved in this release, we will explore the experiences and processes behind it all. A retrospective on the release cycle: From the team, we collected emotions that have been attributed to points in time of the 1. ...

Continue reading »

Sort-Based Blocking Shuffle Implementation in Flink - Part One

October 26, 2021 - Yingjie Cao (Kevin), Daisy Tsang

Part one of this blog post will explain the motivation behind introducing sort-based blocking shuffle, present benchmark results, and provide guidelines on how to use this new feature. How data gets passed around between operators: Data shuffling is an important stage in batch processing applications and describes how data is sent from one operator to the next. In this phase, the output data of the upstream operator is spilled to persistent storage such as disk; the downstream operator then reads the corresponding data and processes it. ...

Continue reading »

Sort-Based Blocking Shuffle Implementation in Flink - Part Two

October 26, 2021 - Yingjie Cao (Kevin), Daisy Tsang

Part one of this blog post explained the motivation behind introducing sort-based blocking shuffle, presented benchmark results, and provided guidelines on how to use this new feature. Like sort-merge shuffle implemented by other distributed data processing frameworks, the whole sort-based shuffle process in Flink consists of several important stages, including collecting data in memory, sorting the collected data in memory, spilling the sorted data to files, and reading the shuffle data from these spilled files. ...

Continue reading »
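The four stages named in the Part Two excerpt above (collect in memory, sort in memory, spill to files, read back from the spilled files) can be illustrated with a toy sketch. This is not Flink's implementation: the function names, the `key % num_partitions` partitioner, the record-count buffer limit, and the use of `pickle` for the spill files are all assumptions made purely for illustration.

```python
import os
import pickle
import tempfile


def write_sorted_spills(records, num_partitions, buffer_capacity, spill_dir):
    """Collect (key, value) records in a bounded in-memory buffer, sort each
    full buffer by target partition, and spill it to its own file.
    Returns the list of spill file paths. (Hypothetical helper.)"""
    spill_paths = []
    buffer = []

    def spill():
        # Stage 2: sort in memory so each partition's records are contiguous.
        # list.sort is stable, so arrival order within a partition is kept.
        buffer.sort(key=lambda entry: entry[0])
        # Stage 3: spill the sorted buffer to a file.
        path = os.path.join(spill_dir, f"spill-{len(spill_paths)}.bin")
        with open(path, "wb") as f:
            pickle.dump(buffer, f)
        spill_paths.append(path)
        buffer.clear()

    for key, value in records:
        # Stage 1: collect records in memory, tagged with their partition.
        # Assumed partitioner: integer key modulo the partition count.
        buffer.append((key % num_partitions, (key, value)))
        if len(buffer) >= buffer_capacity:
            spill()
    if buffer:
        spill()
    return spill_paths


def read_partition(spill_paths, partition):
    """Stage 4: read one partition's records back from all spill files."""
    result = []
    for path in spill_paths:
        with open(path, "rb") as f:
            for target, record in pickle.load(f):
                if target == partition:
                    result.append(record)
    return result


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        data = [(k, f"v{k}") for k in range(10)]
        paths = write_sorted_spills(data, num_partitions=3,
                                    buffer_capacity=4, spill_dir=tmp)
        print(read_partition(paths, 0))
```

The sketch loads whole spill files and filters them, which keeps it short; a real implementation would avoid that by recording where each partition's data sits inside a file.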

Apache Flink 1.13.3 Released

October 19, 2021 - Chesnay Schepler

The Apache Flink community released the third bugfix version of the Apache Flink 1.13 series. This release includes 136 fixes and minor improvements for Flink 1.13.2. The list below includes bugfixes and improvements. For a complete list of all changes see: JIRA. We highly recommend that all users upgrade to Flink 1.13.3. Updated Maven dependencies:

    <dependency>
      <groupId>org.apache.flink</groupId>
      <artifactId>flink-java</artifactId>
      <version>1.13.3</version>
    </dependency>
    <dependency>
      <groupId>org.apache.flink</groupId>
      <artifactId>flink-streaming-java_2.11</artifactId>
      <version>1.13.3</version>
    </dependency>
    <dependency>
      <groupId>org.apache.flink</groupId>
      <artifactId>flink-clients_2.11</artifactId>
      <version>1.13.3</version>
    </dependency>

You can find the binaries on the updated Downloads page. ...

Continue reading »

Apache Flink 1.14.0 Release Announcement

September 29, 2021 - Stephan Ewen (@StephanEwen), Johannes Moser (@joemoeAT)

The Apache Software Foundation recently released its annual report and Apache Flink once again made it onto the list of the top 5 most active projects! This remarkable activity also shows in the new 1.14.0 release. Once again, more than 200 contributors worked on over 1,000 issues. We are proud of how this community is consistently moving the project forward. This release brings many new features and improvements in areas such as the SQL API, connector support, checkpointing, and PyFlink. ...

Continue reading »

Implementing a Custom Source Connector for Table API and SQL - Part One

September 7, 2021 - Ingo Buerk, Daisy Tsang

Part one of this tutorial will teach you how to build and run a custom source connector to be used with Table API and SQL, two high-level abstractions in Flink. The tutorial comes with a bundled docker-compose setup that lets you easily run the connector. You can then try it out with Flink’s SQL client. Introduction: Apache Flink is a data processing engine that aims to keep state locally in order to do computations efficiently. ...

Continue reading »

Implementing a Custom Source Connector for Table API and SQL - Part Two

September 7, 2021 - Ingo Buerk, Daisy Tsang

In part one of this tutorial, you learned how to build a custom source connector for Flink. In part two, you will learn how to integrate the connector with a test email inbox through the IMAP protocol and filter out emails using Flink SQL. Goals: Part two of the tutorial will teach you how to integrate a source connector which connects to a mailbox using the IMAP protocol; use Jakarta Mail, a Java library that can send and receive email via the IMAP protocol; and write Flink SQL and execute the queries in the Ververica Platform for a nicer visualization. You are encouraged to follow along with the code in this repository. ...

Continue reading »

Stateful Functions 3.1.0 Release Announcement

August 31, 2021 - Seth Wiesman (@sjwiesman), Igal Shilman (@IgalShilman), Tzu-Li (Gordon) Tai (@tzulitai)

Stateful Functions is a cross-platform stack for building Stateful Serverless applications, making it radically simpler to develop scalable, consistent, and elastic distributed applications. This new release brings various improvements to the StateFun runtime, a leaner way to specify StateFun module components, and a brand new GoLang SDK! The binary distribution and source artifacts are now available on the updated Downloads page of the Flink website, and the most recent Java SDK, Python SDK, and GoLang SDK distributions are available on Maven, PyPI, and GitHub, respectively. ...

Continue reading »

Help us stabilize Apache Flink 1.14.0 RC0

August 31, 2021 - Johannes Moser

Update, September 29: Apache Flink 1.14 has been released today. We would of course still like to hear your feedback. Dear Flink Community, we are excited to announce the first release candidate of Apache Flink 1.14. 🎉 A lot of features and fixes went into this release, including improvements to the unified batch and streaming experience, an increase in fault tolerance by reducing in-flight data, and more developments on connectors and components. ...

Continue reading »

Apache Flink 1.11.4 Released

August 9, 2021 - Xiaoling He

The Apache Flink community released the next bugfix version of the Apache Flink 1.11 series. This release includes 78 fixes and minor improvements for Flink 1.11.3. The list below details all fixes and improvements. We highly recommend that all users upgrade to Flink 1.11.4. Updated Maven dependencies:

    <dependency>
      <groupId>org.apache.flink</groupId>
      <artifactId>flink-java</artifactId>
      <version>1.11.4</version>
    </dependency>
    <dependency>
      <groupId>org.apache.flink</groupId>
      <artifactId>flink-streaming-java_2.11</artifactId>
      <version>1.11.4</version>
    </dependency>
    <dependency>
      <groupId>org.apache.flink</groupId>
      <artifactId>flink-clients_2.11</artifactId>
      <version>1.11.4</version>
    </dependency>

You can find the binaries on the updated Downloads page. ...

Continue reading »