نام کتاب
Apache Iceberg

The Definitive Guide

Data Lakehouse Functionality, Performance, and Scalability on the Data Lake

Tomer Shiran, Jason Hughes, Alex Merced

Paperback344 Pages
PublisherO'Reilly
Edition1
LanguageEnglish
Year2024
ISBN9781098148621
478
A5643
انتخاب نوع چاپ:
جلد سخت
599,000ت
0
جلد نرم
539,000ت
0
طلق پاپکو و فنر
549,000ت
0
مجموع:
0تومان
کیفیت متن:اورجینال انتشارات
قطع:B5
رنگ صفحات:دارای متن و کادر رنگی
پشتیبانی در روزهای تعطیل!
ارسال به سراسر کشور

#Apache

#Data

#Lakehouse

#Data_Lake

#SQL

توضیحات

Traditional data architecture patterns are severely limited. To use these patterns, you have to ETL data into each tool—a cost-prohibitive process for making warehouse features available to all of your data. The lack of flexibility with these patterns requires you to lock into a set of priority tools and formats, which creates data silos and data drift. This practical book shows you a better way.

Apache Iceberg provides the capabilities, performance, scalability, and savings that fulfill the promise of an open data lakehouse. By following the lessons in this book, you'll be able to achieve interactive, batch, machine learning, and streaming analytics with this high-performance open source format. Authors Tomer Shiran, Jason Hughes, and Alex Merced from Dremio show you how to get started with Iceberg.


With this book, you'll learn:

  • The architecture of Apache Iceberg tables
  • What happens under the hood when you perform operations on Iceberg tables
  • How to further optimize Iceberg tables for maximum performance
  • How to use Iceberg with popular data engines such as Apache Spark, Apache Flink, and Dremio
  • Discover why Apache Iceberg is a foundational technology for implementing an open data lakehouse.


Table of Contents

Part I. Fundamentals of Apache Iceberg

Chapter 1. Introduction to Apache Iceberg

Chapter 2. The Architecture of Apache Iceberg

Chapter 3. Lifecycle of Write and Read Queries

Chapter 4. Optimizing the Performance of Iceberg Tables

Chapter 5. Iceberg Catalogs

Part II. Hands-on with Apache Iceberg

Chapter 6. Apache Spark

Chapter 7. Dremio's SQL Query Engine

Chapter 8. AWS Glue

Chapter 9. Apache Flink

Part III. Apache Iceberg in Practice

Chapter 10. Apache Iceberg in Production

Chapter 11. Streaming with Apache Iceberg

Chapter 12. Governance and Security

Chapter 13. Migrating to Apache Iceberg

Chapter 14. Real-World Use Cases of Apache Iceberg


About the Authors

Tomer Shiran is the Founder and Chief Product Officer of Dremio, an open data lakehouse platform that enables companies to run analytics in the cloud without the cost, complexity and lock-in of data warehouses. As the company's founding CEO, Tomer built a world-class organization that has raised over $400M and now serves hundreds of the world's largest enterprises, including 3 of the Fortune 5. Prior to Dremio, Tomer was the 4th employee and VP Product of MapR, a Big Data analytics pioneer. He also held numerous product management and engineering roles at Microsoft and IBM Research, founded several websites that have served millions of users and hundreds of thousands of paying customers, and is a successful author and presenter on a wide range of industry topics. He holds an MS in Computer Engineering from Carnegie Mellon University and a BS in Computer Science from Technion - Israel Institute of Technology.


Jason Hughes is the Director of Technical Advocacy at Dremio. Previously at Dremio, he's been a Product Director, Technical Director and a Senior Solutions Architect. He's been working in technology and data for over a decade, including roles as tech lead for the field at Dremio, the pre-sales and post-sales lead for Presto and QueryGrid for the Americas at Teradata, and leading the development, deployment, and management of a custom CRM system for multiple auto dealerships. He is passionate about making customers and individuals successful and self-sufficient. When he’s not working, he’s usually taking his dog to the dog park, playing hockey, or cooking (when he feels like it). He lives in San Diego, California.


Alex Merced is a developer advocate for Dremio and has worked as a developer and instructor for companies like GenEd Systems, Crossfield Digital, CampusGuard and General Assembly.


Alex is passionate about technology and has put out tech content on outlets such as blogs, videos and his podcasts Datanation and Web Dev 101. Alex Merced has contributed a variety of libraries in the Javascript & Python worlds including SencilloDB, CoquitoJS, dremio-simple-query and more.

دیدگاه خود را بنویسید
نظرات کاربران (0 دیدگاه)
نظری وجود ندارد.
کتاب های مشابه
Apache Spark
952
Advanced Analytics with PySpark
420,000 تومان
Apache Spark
1,434
Data Engineering with Scala and Spark
490,000 تومان
Apache Spark
911
Hands-on Guide to Apache Spark 3
605,000 تومان
Apache Spark
964
Scaling Machine Learning with Spark
484,000 تومان
Apache Spark
951
Spark GraphX in Action
471,000 تومان
Apache Spark
977
Spark in Action
955,000 تومان
for Beginners
944
Beginning Apache Spark 3
664,000 تومان
Apache Spark
1,043
Modern Data Engineering with Apache Spark
975,000 تومان
Apache Spark
1,187
Data Engineering with Apache Spark, Delta Lake, and Lakehouse
784,000 تومان
Apache Spark
883
Beginning Apache Spark Using Azure Databricks
470,000 تومان
قیمت
منصفانه
ارسال به
سراسر کشور
تضمین
کیفیت
پشتیبانی در
روزهای تعطیل
خرید امن
و آسان
آرشیو بزرگ
کتاب‌های تخصصی
هـر روز با بهتــرین و جــدیــدتـرین
کتاب های روز دنیا با ما همراه باشید
آدرس
پشتیبانی
مدیریت
ساعات پاسخگویی
درباره اسکای بوک
دسترسی های سریع
  • راهنمای خرید
  • راهنمای ارسال
  • سوالات متداول
  • قوانین و مقررات
  • وبلاگ
  • درباره ما
چاپ دیجیتال اسکای بوک. 2024-2022 ©