Fundamentals, Implementation, and Operation of Streaming Applications
Fabian Hueske, Vasiliki Kalavri

#Apache
#Stream
#open_source
#ETL
#API
#DataStream
#Apache_Flink
Get started with Apache Flink, the open source framework that powers some of the world’s largest stream processing applications. With this practical book, you’ll explore the fundamental concepts of parallel stream processing and discover how this technology differs from traditional batch data processing.
Longtime Apache Flink committers Fabian Hueske and Vasia Kalavri show you how to implement scalable streaming applications with Flink’s DataStream API and continuously run and maintain these applications in operational environments. Stream processing is ideal for many use cases, including low-latency ETL, streaming analytics, and real-time dashboards as well as fraud detection, anomaly detection, and alerting. You can process continuous data of any kind, including user interactions, financial transactions, and IoT data, as soon as you generate them.
Table of Contents
Chapter 1. Introduction to Stateful Stream Processing
Chapter 2. Stream Processing Fundamentals
Chapter 3. The Architecture of Apache Flink
Chapter 4. Setting Up a Development Environment for Apache Flink
Chapter 5. The DataStream API (v1.7)
Chapter 6. Time-Based and Window Operators
Chapter 7. Stateful Operators and Applications
Chapter 8. Reading from and Writing to External Systems
Chapter 9. Setting Up Flink for Streaming Applications
Chapter 10. Operating Flink and Streaming Applications
Chapter 11. Where to Go from Here?
This book will teach you everything you need to know about stream processing with Apache Flink. It consists of 11 chapters that hopefully tell a coherent story. While some chapters are descriptive and aim to introduce high-level design concepts, others are more hands-on and contain many code examples.
While we intended for the book to be read in chapter order when we were writing it, readers familiar with a chapter’s content might want to skip it. Others more interested in writing Flink code right away might want to read the practical chapters first. In the following, we briefly describe the contents of each chapter, so you can directly jump to those chapters that interest you most.
About the Author
Fabian Hueske is a PMC member of the Apache Flink project and has been contributing to Flink since day one. Fabian is cofounder of data Artisans (now Ververica) and holds a PhD in computer science from TU Berlin.
Vasiliki (Vasia) Kalavri is a postdoctoral fellow in the Systems Group at ETH Zurich. Vasia is a PMC member of the Apache Flink project. An early contributor to Flink, she has worked on its graph processing library, Gelly, and on early versions of the Table API and streaming SQL.









