نام کتاب
Practical Statistics for Data Scientists

50+ Essential Concepts Using R and Python
Peter Bruce, Andrew Bruce, Peter Gedeck

Paperback363 Pages
PublisherO'Reilly
Edition2
LanguageEnglish
Year2020
ISBN9781492072942
1K
A1919
انتخاب نوع چاپ:
جلد سخت
620,000ت
0
جلد نرم
560,000ت
0
طلق پاپکو و فنر
570,000ت
0
مجموع:
0تومان
کیفیت متن:اورجینال انتشارات
قطع:B5
رنگ صفحات:دارای متن و کادر رنگی
پشتیبانی در روزهای تعطیل!
ارسال به سراسر کشور

#Data_science

#Python

#statistical

#R_programming

#big_data

#machine_learning

توضیحات

Statistical methods are a key part of data science, yet few data scientists have formal statistical training. Courses and books on basic statistics rarely cover the topic from a data science perspective. The second edition of this popular guide adds comprehensive examples in Python, provides practical guidance on applying statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what’s important and what’s not.


Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you’re familiar with the R or Python programming languages and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.


With this book, you’ll learn:

  • •  Why exploratory data analysis is a key preliminary step in data science
  • •  How random sampling can reduce bias and yield a higher-quality dataset, even with big data
  • •  How the principles of experimental design yield definitive answers to questions
  • •  How to use regression to estimate outcomes and detect anomalies
  • •  Key classification techniques for predicting which categories a record belongs to
  • •  Statistical machine learning methods that "learn" from data
  • •  Unsupervised learning methods for extracting meaning from unlabeled data.

    This book is aimed at the data scientist with some familiarity with the R and/or Python programming languages, and with some prior (perhaps spotty or ephemeral) exposure to statistics. Two of the authors came to the world of data science from the world of statistics, and have some appreciation of the contribution that statistics can make to the art of data science. At the same time, we are well aware of the limitations of traditional statistics instruction: statistics as a discipline is a century and a half old, and most statistics textbooks and courses are laden with the momentum and inertia of an ocean liner. All the methods in this book have some connection—historical or methodological—to the discipline of statistics. Methods that evolved mainly out of computer science, such as neural nets, are not included.
     

In all cases, this book gives code examples first in R and then in Python. In order to avoid unnecessary repetition, we generally show only output and plots created by the R code. We also skip the code required to load the required packages and data sets. You can find the complete code as well as the data sets for download at GitHub.


Two goals underlie this book:

  • To lay out, in digestible, navigable, and easily referenced form, key concepts from statistics that are relevant to data science.
  • To explain which concepts are important and useful from a data science perspective, which are less so, and why.


About the Author

Peter Bruce is the Founder and Chief Academic Officer of the Institute for Statistics Education at Statistics.com, which offers about 80 courses in statistics and analytics, roughly half of which are aimed at data scientists. He has authored or co-authored several books in statistics and analytics, and he earned his Bachelor’s degree at Princeton, and Masters degrees at Harvard and the University of Maryland.

 

Andrew Bruce, Principal Research Scientist at Amazon, has over 30 years of experience in statistics and data science in academia, government and business. The co-author of Applied Wavelet Analysis with S-PLUS, he earned his bachelor’s degree at Princeton, and PhD in statistics at the University of Washington

 

Peter Gedeck, Senior Data Scientist at Collaborative Drug Discovery, specializes in the development of machine learning algorithms to predict biological and physicochemical properties of drug candidates. Co-author of Data Mining for Business Analytics, he earned PhD’s in Chemistry from the University of Erlangen-Nürnberg in Germany and Mathematics from Fernuniversität Hagen, Germany.

دیدگاه خود را بنویسید
نظرات کاربران (0 دیدگاه)
نظری وجود ندارد.
کتاب های مشابه
Data Science
1,009
Learning Data Science
977,000 تومان
Data Science
941
Econometrics and Data Science
426,000 تومان
Data Science
883
Beginning Data Science in R 4
901,000 تومان
آمار و احتمالات
959
Computer Age Statistical Inference
705,000 تومان
Data Science
553
Scaling Graph Learning for the Enterprise
566,000 تومان
Python
1,073
Intro to Python (Global Edition)
1,467,000 تومان
Data Science
974
Software Engineering for Data Scientists
444,000 تومان
Data Science
959
Data Science Solutions with Python
301,000 تومان
Data Science
948
How to Lead in Data Science
886,000 تومان
Software Engineering
6,092
Designing Data-Intensive Applications
995,000 تومان
قیمت
منصفانه
ارسال به
سراسر کشور
تضمین
کیفیت
پشتیبانی در
روزهای تعطیل
خرید امن
و آسان
آرشیو بزرگ
کتاب‌های تخصصی
هـر روز با بهتــرین و جــدیــدتـرین
کتاب های روز دنیا با ما همراه باشید
آدرس
پشتیبانی
مدیریت
ساعات پاسخگویی
درباره اسکای بوک
دسترسی های سریع
  • راهنمای خرید
  • راهنمای ارسال
  • سوالات متداول
  • قوانین و مقررات
  • وبلاگ
  • درباره ما
چاپ دیجیتال اسکای بوک. 2024-2022 ©