Book title
Dataproc Cookbook

Running Spark and Hadoop Workloads in Google Cloud

Narasimha Sadineni, Anuyogam Venkataraman

Paperback: 438 pages
Publisher: O'Reilly
Edition: 1
Language: English
Year: 2026
ISBN: 9781098157708
Text quality: publisher original
Trim size: B5
Page color: black and white


Description

Want to build big data solutions in Google Cloud? Dataproc Cookbook is your hands-on guide to mastering Dataproc and the essential GCP fundamentals—like networking, security, logging, monitoring, and cost optimization—that apply across Google Cloud services. Learn practical skills that not only fast-track your Dataproc expertise, but also help you succeed with a wide range of GCP technologies. Written by data experts Narasimha Sadineni and Anu Venkataraman, this cookbook tackles real-world use cases like serverless Spark jobs, Kubernetes-native deployments, and cost-optimized data lake workflows. You’ll learn how to create ephemeral and persistent Dataproc clusters, run secure data science workloads, implement monitoring solutions, and plan effective migration and optimization strategies.


• Create Dataproc clusters on Compute Engine and Kubernetes Engine

• Run data science and Spark workloads in serverless and cost-efficient ways

• Orchestrate workloads using Cloud Composer (Airflow) and Cloud Scheduler

• Manage metadata in a centralized metastore

• Secure, monitor, and troubleshoot jobs across hybrid and cloud-native setups

• Migrate from Hadoop to Dataproc with proven patterns and tooling support

• Understand billing components and learn cost optimization strategies


Table of Contents

Chapter 1. Creating a Dataproc Cluster

Chapter 2. Running Hive, Spark, and Sqoop Workloads

Chapter 3. Advanced Dataproc Cluster Configuration

Chapter 4. Serverless Spark and Ephemeral Dataproc Clusters

Chapter 5. Dataproc on Google Kubernetes Engine

Chapter 6. Dataproc Metastore

Chapter 7. Connecting from Dataproc to GCP Services

Chapter 8. Configuring Logging in Dataproc

Chapter 9. Setting Up Monitoring and Dashboards

Chapter 10. Dataproc Security

Chapter 11. Performance Tuning and Cost Optimization

Chapter 12. Orchestrating Dataproc Workloads

Chapter 13. Using Spark Notebooks on Dataproc

Chapter 14. Migrating from On-Premises and Public Cloud Services to GCP


About the Authors

Narasimha Sadineni is a senior data engineer at Google with over 15 years of experience helping organizations design, secure, and scale data pipelines using Hadoop and Google Cloud.


Anu Venkataraman is a former Googler and seasoned big data subject matter expert who brings a deep understanding of data platforms to enterprise technology transformation using Google Cloud and Microsoft Azure.
