نام کتاب
Data Quality Fundamentals

A Practitioner's Guide to Building Trustworthy Data Pipelines
Barr Moses, Lior Gavish, Molly Vorwerck

Paperback311 Pages
PublisherO'Reilly
Edition1
LanguageEnglish
Year2022
ISBN9781098112042
966
A1542
انتخاب نوع چاپ:
جلد سخت
563,000ت
0
جلد نرم
503,000ت
0
طلق پاپکو و فنر
513,000ت
0
مجموع:
0تومان
کیفیت متن:اورجینال انتشارات
قطع:B5
رنگ صفحات:دارای متن و کادر رنگی
پشتیبانی در روزهای تعطیل!
ارسال به سراسر کشور

#Data

#Data_Quality

#Pipelines

#SLA

#SLI

#SLO

توضیحات

Do your product dashboards look funky? Are your quarterly reports stale? Is the data set you're using broken or just plain wrong? These problems affect almost every team, yet they're usually addressed on an ad hoc basis and in a reactive manner. If you answered yes to these questions, this book is for you.


Many data engineering teams today face the "good pipelines, bad data" problem. It doesn't matter how advanced your data infrastructure is if the data you're piping is bad. In this book, Barr Moses, Lior Gavish, and Molly Vorwerck, from the data observability company Monte Carlo, explain how to tackle data 
quality and trust at scale by leveraging best practices and technologies used by some of the world's most innovative companies.


  • •  Build more trustworthy and reliable data pipelines
  • •  Write scripts to make data checks and identify broken pipelines with data observability
  • •  Learn how to set and maintain data SLAs, SLIs, and SLOs
  • •  Develop and lead data quality initiatives at your company
  • •  Learn how to treat data services and systems with the diligence of production software
  • •  Automate data lineage graphs across your data ecosystem
  • •  Build anomaly detectors for your critical data assets

    If you’ve experienced any of the following scenarios, raise your hand (or, you can just nod in solidarity):
  • •  Five thousand rows in a critical table suddenly turns into five hundred, with no rhyme or reason.
  • •  A broken dashboard causes an executive dashboard to spit null values.
  • •  A hidden schema change breaks a downstream pipeline.
  • •  And the list goes on.

    From the Preface

This book is for everyone who has suffered from unreliable data, silently or with muffled screams, and wants to do something about it. We expect that these individuals will come from data engineers, data analytics, or data science backgrounds, and be actively involved in building, scaling, and managing their company’s data pipelines.
 

On the surface, it may seem like Data Quality Fundamentals is a manual about how to clean, wrangle, and generally make sense of data—and it is. But more so, this book tackles best practices, technologies, and processes around building more reliable data systems and, in the process, cultivating data trust with your team and stakeholders.

Editorial Reviews

About the Author

Barr Moses is the CEO and co-founder of Monte Carlo, a data reliability company. In her decade-long career in data, Barr has served as commander of a data intelligence unit in the Israeli Air Force, a consultant at Bain & Company, and VP of Operations at Gainsight, where she built and led their data and analytics team. The instructor of O’Reilly first course on Data Observability, an emerging discipline in data engineering, Barr has worked with hundreds of data teams struggling with these problems. Inspired by her time in the analytics trenches, she is building a product literally dedicated to identifying, resolving, and preventing what she calls “data downtime,” periods of time when data is missing, erroneous, or otherwise inaccurate. In other words: bad data. In this book, she shares her experiences and learnings on how today’s data organizations can achieve high data quality at scale through technological, organization, and cultural best practices.

Lior Gavish is CTO and Co-Founder of Monte Carlo, a data reliability company backed by Accel, Redpoint, GGV, and other top Silicon Valley investors. Prior to Monte Carlo, Lior co-founded cybersecurity startup Sookasa, which was acquired by Barracuda in 2016. At Barracuda, Lior was SVP of Engineering, launching award-winning ML products for fraud prevention. Lior holds an MBA from Stanford and an MSC in Computer Science from Tel-Aviv University.

Molly Vorwerck is the Head of Content at Monte Carlo, a data reliability company. Prior to joining Monte Carlo, Molly served as editor-in-chief of the Uber Engineering Blog and lead program manager for Uber’s Technical Brand team, where she spent countless hours helping engineers, data scientists, and analysts write and edit content about their technical work and experiences. She also led internal communications for Uber’s Chief Technology Officer and strategy for Uber AI’s Research Review Program. In her spare time, she freelances for USA Today, reads up on all the latest trends in data, and volunteers for the California Historical Society.

دیدگاه خود را بنویسید
نظرات کاربران (0 دیدگاه)
نظری وجود ندارد.
کتاب های مشابه
AWS
1,289
AWS Certified Database – Specialty (DBS-C01) Certification Guide
680,000 تومان
Data
918
Building the Snowflake Data Cloud
692,000 تومان
Python
1,031
Data Visualization with Python and JavaScript
946,000 تومان
Data
964
CompTIA Data+ Study Guide
569,000 تومان
Data
981
Data and Computer Communications
1,329,000 تومان
Data
887
Pro Database Migration to Azure
540,000 تومان
Data
884
The Enterprise Data Catalog
401,000 تومان
Data
718
High Performance Spark
552,000 تومان
Data
432
The Definitive Guide to Data Integration
699,000 تومان
Data
883
Synthetic Data for Deep Learning
419,000 تومان
قیمت
منصفانه
ارسال به
سراسر کشور
تضمین
کیفیت
پشتیبانی در
روزهای تعطیل
خرید امن
و آسان
آرشیو بزرگ
کتاب‌های تخصصی
هـر روز با بهتــرین و جــدیــدتـرین
کتاب های روز دنیا با ما همراه باشید
آدرس
پشتیبانی
مدیریت
ساعات پاسخگویی
درباره اسکای بوک
دسترسی های سریع
  • راهنمای خرید
  • راهنمای ارسال
  • سوالات متداول
  • قوانین و مقررات
  • وبلاگ
  • درباره ما
چاپ دیجیتال اسکای بوک. 2024-2022 ©