Eric Anderson (@ericmander) reunites with old colleagues Kenn Knowles (@KennKnowles) and Pablo Estrada (@polecitoem) for a conversation on Apache Beam, the open-source programming model for data processing. The trio once worked together at Google, and Beam was a turning point in the history of open-source there. Today, both Kenn and Pablo are members of the Beam PMC, and join the show with the inside scoop on Beam’s past, present and future.
In this episode we discuss:
- Transitioning Beam to the Apache Way
- How “inner source” works at Google
- Thoughts on the relationship between batch processing and streaming
- Some ways that community “power users” have contributed to Beam
- Information on Beam Summit 2022, the first onsite summit since COVID began
- The first few people to register can use code BEAM_POD_INV for a discount on tickets!
Links:
- Apache Beam
- Apache Spark
- Apache Flink
- Apache Nemo
- Apache Samza
- Apache Crunch
- MapReduce paper
- MillWheel paper
- FlumeJava paper
- Dataflow paper
- Beam Summit 2022 Website
Other episodes:
The podcast Contributor is embedded on this page from an open RSS feed. All files, descriptions, artwork and other metadata from the RSS-feed is the property of the podcast owner and not affiliated with or validated by Podplay.