0
نام کتاب
Learning Apache Drill

Query and Analyze Distributed Data Sources with SQL

Charles Givre, Paul Rogers

Paperback331 Pages
PublisherO'Reilly
Edition1
LanguageEnglish
Year2019
ISBN9781492032793
747
A4842
انتخاب نوع چاپ:
جلد سخت
648,000ت
0
جلد نرم
568,000ت
0
طلق پاپکو و فنر
578,000ت
0
مجموع:
0تومان
کیفیت متن:اورجینال انتشارات
قطع:B5
رنگ صفحات:دارای متن و کادر رنگی
پشتیبانی در روزهای تعطیل!
ارسال به سراسر کشور

#Apache

#Data

#SQL

#MongoDB

#CSV

#Parquet

#JSON

توضیحات

Get up to speed with Apache Drill, an extensible distributed SQL query engine that reads massive datasets in many popular file formats such as Parquet, JSON, and CSV. Drill reads data in HDFS or in cloud-native storage such as S3 and works with Hive metastores along with distributed databases such as HBase, MongoDB, and relational databases. Drill works everywhere: on your laptop or in your largest cluster.


In this practical book, Drill committers Charles Givre and Paul Rogers show analysts and data scientists how to query and analyze raw data using this powerful tool. Data scientists today spend about 80% of their time just gathering and cleaning data. With this book, you’ll learn how Drill helps you analyze data more effectively to drive down time to insight.


  • Use Drill to clean, prepare, and summarize delimited data for further analysis
  • Query file types including logfiles, Parquet, JSON, and other complex formats
  • Query Hadoop, relational databases, MongoDB, and Kafka with standard SQL
  • Connect to Drill programmatically using a variety of languages
  • Use Drill even with challenging or ambiguous file formats
  • Perform sophisticated analysis by extending Drill’s functionality with user-defined functions
  • Facilitate data analysis for network security, image metadata, and machine learning


Table of Contents

Chapter 1. Introduction to Apache Drill

Chapter 2. Installing and Running Drill

Chapter 3. Overview of Apache Drill

Chapter 4. Querying Delimited Data

Chapter 5. Analyzing Complex and Nested Data

Chapter 6. Connecting Drill to Data Sources

Chapter 7. Connecting to Drill

Chapter 8. Data Engineering with Drill

Chapter 9. Deploying Drill in Production

Chapter 10. Setting Up Your Development Environment

Chapter 11. Writing Drill User-Defined Functions

Chapter 12. Writing a Format Plug-in

Chapter 13. Unique Uses of Drill

Appendix A. List of Drill Functions

Appendix B. Drill Formatting Strings


About the Authors

Paul Rogers is an Apache Drill committer at MapR where he focuses on Drill’s execution engine. Paul has worked as a software architect at a number database and BI companies such as Oracle, Actuate and Informix. Paul was the early architect of the Eclipse BIRT project. His interests include making Drill even easier to use for end-users and plug-in developers.


Charles Givre is an Apache Drill committer and has worked as a Senior Lead Data Scientist for Booz Allen Hamilton for the last six years where he works in the intersection of cyber security and data science. Mr. Givre is passionate about teaching others data science and analytic skills and has taught data science classes all over the world at conferences, universities and for clients. Most recently, Mr. Givre taught a data science class at the BlackHat conference in Las Vegas and the Center for Research in Applied Cryptography and Cyber Security at Bar Ilan University. He is a sought-after speaker and has delivered presentations at major industry conferences such as Strata-Hadoop World, BlackHat, Open Data Science Conference and others.

دیدگاه خود را بنویسید
نظرات کاربران (0 دیدگاه)
نظری وجود ندارد.
کتاب های مشابه
SQL Server
1,080
SQL Server 2022 Query Performance Tuning
1,238,000 تومان
SQL Server
1,115
Pro T-SQL 2022
1,024,000 تومان
SQL Server
1,170
Pro SQL Server 2022 Wait Statistics
744,000 تومان
SQL Server
1,041
Expert T-SQL Window Functions in SQL Server 2019
436,000 تومان
SQL Server
1,177
Pro SQL Server Relational Database Design and Implementation
1,930,000 تومان
SQL Server
1,019
SQL Server T-SQL Recipes
1,477,000 تومان
SQL Server
949
Practical Database Auditing for Microsoft SQL Server and Azure SQL
545,000 تومان
SQL Server
1,418
Pro SQL Server 2022 Administration
1,727,000 تومان
SQL Server
1,080
SQL Server Advanced Troubleshooting and Performance Tuning
770,000 تومان
SQL Server
1,124
Pro SQL Server Internals
1,306,000 تومان
قیمت
منصفانه
ارسال به
سراسر کشور
تضمین
کیفیت
پشتیبانی در
روزهای تعطیل
خرید امن
و آسان
آرشیو بزرگ
کتاب‌های تخصصی
هـر روز با بهتــرین و جــدیــدتـرین
کتاب های روز دنیا با ما همراه باشید
آدرس
پشتیبانی
مدیریت
ساعات پاسخگویی
درباره اسکای بوک
دسترسی های سریع
  • راهنمای خرید
  • راهنمای ارسال
  • سوالات متداول
  • قوانین و مقررات
  • وبلاگ
  • درباره ما
چاپ دیجیتال اسکای بوک. 2024-2022 ©