[Pulsar Community Weekly Update] 2019-10-19 ~ 2019-10-25
This is the Pulsar community weekly update for 2019-10-19 ~ 2019-10-25. It helps you quickly catch up on Pulsar’s highlights and spot trends from the past week, while strengthening communication and connection within the Pulsar family.
[CI] ASF Jenkins is back to ‘normal’ after ‘Add default loader for latest pyyaml (#4974)’ was reverted in #5432. The problem came from the use of pyyaml under Python 2.7, which caused function workers to fail to start in integration tests. The committers have started merging pull requests again.
[PIP-43] The main logic that lets producers send messages with different schemas was merged. With this change, Pulsar can support event sourcing applications that use different schemas. Kudos to Yi Tang!
[Functions] Jerry Peng started refactoring the Functions runtime to make it pluggable. This will make adding a new runtime in the future easier and smoother.
[PIP-45] The second pull request for the pluggable metadata interface is out. It changes the implementation of ManagedLedger to use the MetadataStore interface.
[Key-Shared] Penghui kicked off the implementation of sticky consumers in key_shared subscriptions. This makes it possible to consume sub-streams of a given topic (partition) in order. It can be used in the Flink integration to support flexible scaling up and down.
[Transaction] The development of the transaction coordinator (TC) continues with the addition of a topic ownership listener for bootstrapping the coordinator when a coordination topic is owned.
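To make the Key-Shared item above more concrete, here is a minimal Python sketch of the idea behind sticky consumers: each consumer claims a fixed range of key hashes, so every message with a given key is routed to the same consumer and stays in order. The names `StickyRouter`, `key_slot`, and `HASH_RANGE_SIZE`, the range size of 65536, and the use of CRC32 as the hash function are assumptions made for illustration only. This is not Pulsar's client API or the broker implementation.

```python
import zlib

# Assumed size of the key-hash space (illustrative, not taken from the update).
HASH_RANGE_SIZE = 65536


def key_slot(key: str) -> int:
    """Map a message key to a slot in [0, HASH_RANGE_SIZE).

    CRC32 is a stand-in hash chosen because it ships with Python.
    """
    return zlib.crc32(key.encode("utf-8")) % HASH_RANGE_SIZE


class StickyRouter:
    """Hypothetical router: each consumer owns a fixed, inclusive hash range."""

    def __init__(self):
        self._ranges = []  # list of (start, end, consumer_name)

    def add_consumer(self, name: str, start: int, end: int) -> None:
        if not (0 <= start <= end < HASH_RANGE_SIZE):
            raise ValueError("range must lie within the hash space")
        # Reject ranges that overlap one already claimed by another consumer.
        for s, e, other in self._ranges:
            if start <= e and s <= end:
                raise ValueError(f"range overlaps with consumer {other}")
        self._ranges.append((start, end, name))

    def route(self, key: str):
        """Return the consumer that owns the key's slot, or None if unclaimed."""
        slot = key_slot(key)
        for s, e, name in self._ranges:
            if s <= slot <= e:
                return name
        return None


router = StickyRouter()
router.add_consumer("consumer-a", 0, 32767)
router.add_consumer("consumer-b", 32768, 65535)

# The same key always lands on the same consumer.
print(router.route("order-42"), router.route("order-42"))
```

In this sketch, scaling up or down amounts to redistributing the fixed ranges among a different number of consumers, which is the property that makes the approach attractive for an integration such as Flink.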
[Client][Java] Add support for partitioned topic consumer seek by time. (Release: 2.5.0)
[Functions] Make Function authentication provider pluggable. (Release: 2.5.0)
[Client][Java] Support setting the read position based on a timestamp. (Release: 2.5.0)
Notable Bug Fixes
[Client][CGo] Return message ID for produced messages. (Fixed, Release: 2.5.0)
[Broker] Fix potential deadlock that can occur in addConsumer. (Fixed, Release: 2.4.2 / 2.5.0)
[Client][Java] Avoid leak on publish failure on batch message. (Fixed, Release: 2.4.2 / 2.5.0)
[Broker] Fix a race condition that caused the dispatcher to fail to read more entries. (Fixed, Release: 2.4.2 / 2.5.0)
[Client][Java] Fix message corruption on OOM for batch messages. (Fixed, Release: 2.4.2 / 2.5.0)
Pulsar.Client 0.7.0 was released with TLS and token authentication support.
More Pulsar tools, integrations, and resources can also be found at https://github.com/streamnative/awesome-pulsar.
Apache Pulsar is getting more attention from giants like Splunk. On Oct 21, Splunk announced that it is acquiring Streamlio to accelerate its efforts in real-time stream processing and containerized multi-tenant cloud platform applications. Streamlio, which is powered by Apache Pulsar, specializes in designing and operating streaming data solutions at scale in demanding enterprise environments.
Paris Data Engineers (meetup)
The Paris Data Engineers meetup was held on Oct 22 in France. Quentin Adam talked about how CleverCloud uses Pulsar for scalable log processing.
Powering Tencent Billing Platform with Apache Pulsar (by Dezhi Liu)
How to use Apache Pulsar Manager with HerdDB (by Enrico Olivelli)
Why Nutanix Beam went ahead with Apache Pulsar instead of Apache Kafka? (by Yuvaraj Loganathan)
Basic Pulsar producer and consumer (by Thomas Memenga)