0
نام کتاب
Streaming Systems

The What, Where, When, and How of Large-Scale Data Processing

Tyler Akidau, Slava Chernyak, Reuven Lax

Paperback349 Pages
PublisherO'Reilly
Edition1
LanguageEnglish
Year2018
ISBN9781491983874
559
A6560
انتخاب نوع چاپ:
جلد سخت
669,000ت
0
جلد نرم
589,000ت
0
طلق پاپکو و فنر
599,000ت
0
مجموع:
0تومان
کیفیت متن:اورجینال انتشارات
قطع:B5
رنگ صفحات:دارای متن و کادر رنگی
پشتیبانی در روزهای تعطیل!
ارسال به سراسر کشور

#Streaming_Systems

#Large-Scale

#Data

#SQL

توضیحات

Streaming data is a big deal in big data these days. As more and more businesses seek to tame the massive unbounded data sets that pervade our world, streaming systems have finally reached a level of maturity sufficient for mainstream adoption. With this practical guide, data engineers, data scientists, and developers will learn how to work with streaming data in a conceptual and platform-agnostic way.


Expanded from Tyler Akidau’s popular blog posts "Streaming 101" and "Streaming 102", this book takes you from an introductory level to a nuanced understanding of the what, where, when, and how of processing real-time data streams. You’ll also dive deep into watermarks and exactly-once processing with co-authors Slava Chernyak and Reuven Lax.


You’ll explore:

  • How streaming and batch data processing patterns compare
  • The core principles and concepts behind robust out-of-order data processing
  • How watermarks track progress and completeness in infinite datasets
  • How exactly-once data processing techniques ensure correctness
  • How the concepts of streams and tables form the foundations of both batch and streaming data processing
  • The practical motivations behind a powerful persistent state mechanism, driven by a real-world example
  • How time-varying relations provide a link between stream processing and the world of SQL and relational algebra



Table of Contents

Part I The Beam Model

1 Streaming 101

2 The What, Where, When, and How of Data Processing

3 Watermarks 59

4 Advanced Windowing

5 Exactly-Once and Side Effects 

Part II Streams and Tables

6 Streams and Tables

7 The Practicalities of Persistent State 

8 Streaming SQL 

9 Streaming Joins 

10 The Evolution of Large-Scale Data Processing



About the Authors

Tyler Akidau is a senior staff software engineer at Google, where he is the technical lead for the Data Processing Languages & Systems group, responsible for Google's Apache Beam efforts, Google Cloud Dataflow, and internal data processing tools like Google Flume, MapReduce, and MillWheel. His also a founding member of the Apache Beam PMC. Though deeply passionate and vocal about the capabilities and importance of stream processing, he is also a firm believer in batch and streaming as two sides of the same coin, with the real endgame for data processing systems the seamless merging between the two. He is the author of the 2015 Dataflow Model paper and the Streaming 101 and Streaming 102 articles on the O’Reilly website. His preferred mode of transportation is by cargo bike, with his two young daughters in tow.


Slava Chernyak is a senior software engineer at Google Seattle. Slava spent over five years working on Google’s internal massive-scale streaming data processing systems and has since become involved with designing and building Windmill, Google Cloud Dataflow's next-generation streaming backend, from the ground up. Slava is passionate about making massive-scale stream processing available and useful to a broader audience. When he is not working on streaming systems, Slava is out enjoying the natural beauty of the Pacific Northwest.


Reuven Lax is a senior staff software engineer at Google Seattle, and has spent the past nine years helping to shape Google's data processing and analysis strategy. For much of that time he has focused on Google's low-latency, streaming data processing efforts, first as a long-time member and lead of the MillWheel team, and more recently founding and leading the team responsible for Windmill, the next-generation stream processing engine powering Google Cloud Dataflow. He's very excited to bring Google's data-processing experience to the world at large, and proud to have been a part of publishing both the MillWheel paper in 2013 and the Dataflow Model paper in 2015. When not at work, Reuven enjoys swing dancing, rock climbing, and exploring new parts of the world.


دیدگاه خود را بنویسید
نظرات کاربران (0 دیدگاه)
نظری وجود ندارد.
کتاب های مشابه
الگوریتم‌‌ها
1,045
Algorithms + Data Structures = Programs
632,000 تومان
Data
1,002
MCA Microsoft Certified Associate Azure Data Engineer Study Guide: Exa...
1,724,000 تومان
Data
458
In-Memory Analytics with Apache Arrow
658,000 تومان
Software Engineering
2,150
Fundamentals of Data Engineering
706,000 تومان
Data
981
Fundamentals of Data Observability
491,000 تومان
Data
748
Learning Apache Drill
568,000 تومان
Python
1,164
Data Engineering with Python
670,000 تومان
Data
531
Aerospike: Up and Running
434,000 تومان
Data
1,020
Modern Deep Learning for Tabular Data
1,370,000 تومان
Data
994
Delta Lake: Up & Running
491,000 تومان
قیمت
منصفانه
ارسال به
سراسر کشور
تضمین
کیفیت
پشتیبانی در
روزهای تعطیل
خرید امن
و آسان
آرشیو بزرگ
کتاب‌های تخصصی
هـر روز با بهتــرین و جــدیــدتـرین
کتاب های روز دنیا با ما همراه باشید
آدرس
پشتیبانی
مدیریت
ساعات پاسخگویی
درباره اسکای بوک
دسترسی های سریع
  • راهنمای خرید
  • راهنمای ارسال
  • سوالات متداول
  • قوانین و مقررات
  • وبلاگ
  • درباره ما
چاپ دیجیتال اسکای بوک. 2024-2022 ©