Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python

(12 customer reviews)

Original price was: $79.99.Current price is: $17.49.

Product details :
    PDF 14,72 MB • Pages: 361

    Statistical methods are a key part of data science, yet few data scientists have formal statistical training. Courses and books on basic statistics rarely cover the topic from a data science perspective. The second edition of this popular guide adds comprehensive examples in Python, provides practical guidance on applying statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what’s important and what’s not.

    Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you’re familiar with the R or Python programming languages and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.

    With this book, you’ll learn:

    • Why exploratory data analysis is a key preliminary step in data science
    • How random sampling can reduce bias and yield a higher-quality dataset, even with big data
    • How the principles of experimental design yield definitive answers to questions
    • How to use regression to estimate outcomes and detect anomalies
    • Key classification techniques for predicting which categories a record belongs to
    • Statistical machine learning methods that “learn” from data
    • Unsupervised learning methods for extracting meaning from unlabeled data.

    12 reviews for Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python

    1. WC (verified owner)

      Glad to get the python scripts for the content. I was expecting a color print pages but this is black and white.

    2. Marina (verified owner)

      The book is amazing and very useful, for beginners also. The most valuable from my point of view is presence of code both for R and Python, which helps understand the syntax better for one language if you know another.

    3. Stephen Martin (verified owner)

      This is a very good book to begin your DS stats journey with. I learned more from this book than I did in my DS grad school classes. It covers the basics you’ll need everyday in a practical way.

    4. M. W. Hefner (verified owner)

      I’ve taken many stats classes, most of them using R, at the undergraduate and graduate level, and I really wish I found this book before I did. I picked this book up as a refresher, and not only did it succinctly describe all and a bit more of what I learned in those courses, but it has excellent “further readings,” great clarifying synonym lists when it defines “key terms,” and is very readable. Literally blown away.

    5. Christopher M. Myers – IS (verified owner)

      No punches pulled in this book, great for getting right in and doing work.

    6. Jonathan (verified owner)

      I had purchased a new physical copy of the book, and realized there were several pages that were blank and missing. I contacted O’Reilly about the problem and they were extremely quick with a resolution! They were able to give me a different copy so I could read it without the missing pages. The content of the book itself is good, except in all black and white, which doesn’t bother me personally but may bother someone else when it comes to the graphs. I think the R and Python content are both great, and it keeps the code concise and quick to the point. Great for R beginners, but for python users I would recommend a little more experience. As for the math parts, its great for those who are new to statistics and gives easy to read explanations, and a great refresher for those who just want to review some of the concepts. I especially like the sections provided for further reading, which have been helpful.

    7. Carlos A. (verified owner)

      Lo compre hace un mes por menos del valor que tiene ahora incluyendo el “descuento de hot sale”.

      El libro es bueno pero recomiendo esperar a fechas de baja demanda.

    8. Farshad E. (verified owner)

      Good content/low quality print

    9. Rebecca (verified owner)

      No noticeable flaws or writings

    10. denverteach (verified owner)

      Very good book- covers more than just implementing same old tactics.

    11. H.P.J.M. (verified owner)

      This book explores traditional statistical concepts (median, correlation, distributions etc) before moving on to machine learning models (the traditional, statistical kind like logistic regression and trees, not neural networks). Both classification and regression tasks are explored.

      The book is broken down into fairly digestible sections, where each section states the idea, before exploring it with both R and Python snippets and some recurring data sets. Data output is in R.

      In general, the quality of writing is good and I particularly liked how the authors pointed out where something useful in classical statistics might not be particularly relevant for data science or machine learning (e.g. p-values). The book is very practical in that sense, and I appreciate the more nuanced details about some real-life problems you might face (like the “rare-class” problem).

      But I think some things could be improved. First, the authors seem keen to state a formal, mathematical explanation of a concept but don’t always bring it to life with an example. For someone not trained in stats, that can be a little daunting (they don’t state what background knowledge they expect).

      Second, I think they try to squeeze a bit too much in that isn’t really needed. For example, they talk about the F-statistic briefly but almost as a reference. I was left none the wiser as to how I can use it in my work.

      My suggestions to the authors would therefore be to: bring the concepts to life a bit more and connect more of the dots. Otherwise, a worthwhile book if you are into data science.

    12. Sa (verified owner)

      Arrived promptly in perfect condition – like new with zero marking!

    Add a review
    YOUR CART

    New item(s) have been added to your cart.

    Quantity: 1
    Total: $15.99
    Statistics for People Who (Think They) Hate Statistics Original price was: $156.00.Current price is: $19.95.
    R for Data Science: Import, Tidy, Transform, Visualize, and Model Data Original price was: $79.99.Current price is: $19.95.
    Building Thinking Classrooms in Mathematics: A Comprehensive Guide for Grades K-12 (3 book series) Original price was: $149.99.Current price is: $30.00.
    Everything You Need to Ace Pre-Algebra and Algebra I in One Big Fat Notebook (Big Fat Notebooks) Original price was: $29.00.Current price is: $11.49.
    Making Sense of Math: How to Help Every Student Become a Mathematical Thinker and Problem Solver (ASCD Arias) Original price was: $20.00.Current price is: $6.95.
    Starting Data Analytics with Generative AI and Python Original price was: $160.00.Current price is: $26.95.
    Complex Analysis and Special Functions: Cauchy Formula, Elliptic Functions and Laplace’s Method (De Gruyter Textbook) Original price was: $104.99.Current price is: $19.99.
    Mathematics for Electricity & Electronics Original price was: $250.95.Current price is: $19.99.
    Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python Original price was: $79.99.Current price is: $17.49.
    Sciencia: Mathematics, Physics, Chemistry, Biology, and Astronomy for All (Wooden Books) Original price was: $66.00.Current price is: $14.00.
    Problem Solving Approach to Mathematics for Elementary School Teachers, A Original price was: $246.65.Current price is: $19.95.
    Modeling Life: The Mathematics of Biological Systems Original price was: $80.09.Current price is: $17.99.
    Spring Boot in Practice Original price was: $57.95.Current price is: $24.95.
    Foundations of Applied Machine Learning for Engineering Professionals Original price was: $64.99.Current price is: $18.94.
    Physical Mathematics 2nd Edition Original price was: $99.99.Current price is: $19.99.
    Qualitative Inquiry and Research Design: Choosing Among Five Approaches Original price was: $85.00.Current price is: $20.00.
    The Art of Computer Programming (6 books) Original price was: $499.99.Current price is: $44.99.
    Blueprints: How Mathematics Shapes Creativity Original price was: $42.00.Current price is: $19.99.
    Calculus 8th Edition Original price was: $355.95.Current price is: $20.00.
    Algebra and Trigonometry 4th Edition Original price was: $375.95.Current price is: $19.99.
    Introduction to Probability, Statistics, and Random Processes Original price was: $69.95.Current price is: $19.99.
    Discrete Mathematics and Its Applications Original price was: $875.66.Current price is: $20.00.
    Mathematics for the Nonmathematician (Dover Books on Mathematics) Original price was: $69.95.Current price is: $9.00.
    Fractions Essentials Workbook with Answers Original price was: $13.99.Current price is: $4.99.
    The Math Book (DK Big Ideas) Original price was: $21.99.Current price is: $10.95.
    Schaum’s 3,000 Solved Problems in Calculus (Schaum’s Outlines) Original price was: $32.99.Current price is: $19.00.
    Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG Original price was: $89.99.Current price is: $9.99.
    Scalar, Vector, and Matrix Mathematics: Theory, Facts, and Formulas - Revised and Expanded Edition Original price was: $214.00.Current price is: $20.00.
    Trigonometry 011 Edition Original price was: $203.74.Current price is: $20.00.
    Machine Learning using Python Original price was: $16.99.Current price is: $7.99.
    Designing and Conducting Mixed Methods Research Original price was: $116.00.Current price is: $19.99.
    Vector: A Surprising Story of Space, Time, and Mathematical Transformation Original price was: $58.00.Current price is: $19.99.
    An Introduction to Systems Biology (Chapman & Hall/CRC Computational Biology Series) Original price was: $200.00.Current price is: $19.00.
    Basic Physics: A Self-Teaching Guide, 3rd Edition (Wiley Self-Teaching Guides) 3rd Edition Original price was: $58.00.Current price is: $16.95.
    RAG-Driven Generative AI: Build custom retrieval augmented generation pipelines with LlamaIndex, Deep Lake, and Pinecone Original price was: $45.99.Current price is: $19.95.
    Essential Prealgebra Skills Practice Workbook Original price was: $16.99.Current price is: $4.99.
    Mindset Mathematic (10 books) Original price was: $275.99.Current price is: $49.99.
    Objective Bayesian Inference Original price was: $118.00.Current price is: $20.00.
    Generative AI in Action Original price was: $59.99.Current price is: $24.00.
    The Humongous Book of Calculus Problems (Humongous Books) Original price was: $40.00.Current price is: $19.50.
    11