Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required.
Read instantly on your browser with Kindle for Web.
Using your mobile phone camera - scan the code below and download the Kindle app.
Follow the author
OK
Hands-On Exploratory Data Analysis with Python: Perform EDA techniques to understand, summarize, and investigate your data
Purchase options and add-ons
Discover techniques to summarize the characteristics of your data using PyPlot; NumPy; SciPy; and pandas
Key Features:
- Understand the fundamental concepts of exploratory data analysis using Python
- Find missing values in your data and identify the correlation between different variables
- Practice graphical exploratory analysis techniques using Matplotlib and the Seaborn Python package
Book Description:
Exploratory Data Analysis (EDA) is an approach to data analysis that involves the application of diverse techniques to gain insights into a dataset. This book will help you gain practical knowledge of the main pillars of EDA - data cleaning; data preparation; data exploration; and data visualization.
You'll start by performing EDA using open source datasets and perform simple to advanced analyses to turn data into meaningful insights. You'll then learn various descriptive statistical techniques to describe the basic characteristics of data and progress to performing EDA on time-series data. As you advance; you'll learn how to implement EDA techniques for model development and evaluation and build predictive models to visualize results. Using Python for data analysis; you'll work with real-world datasets; understand data; summarize its characteristics; and visualize it for business intelligence.
By the end of this EDA book; you'll have developed the skills required to carry out a preliminary investigation on any dataset; yield insights into data; present your results with visual aids; and build a model that correctly predicts future outcomes.
What You Will Learn:
- Import; clean; and explore data to perform preliminary analysis using powerful Python packages
- Identify and transform erroneous data using different data wrangling techniques
- Explore the use of multiple regression to describe non-linear relationships
- Discover hypothesis testing and explore techniques of time-series analysis
- Understand and interpret results obtained from graphical analysis
- Build; train; and optimize predictive models to estimate results
- Perform complex EDA techniques on open source datasets
Who this book is for:
This EDA book is for anyone interested in data analysis; especially students; statisticians; data analysts; and data scientists. The practical concepts presented in this book can be applied in various disciplines to enhance decision-making processes with data analysis and synthesis. Fundamental knowledge of Python programming and statistical concepts is all you need to get started with this book.
Table of Contents
- Exploratory Data Analysis Fundamentals
- Visual Aids for EDA
- EDA with Personal Email
- Data Transformation
- Descriptive Statistics
- Grouping Dataset
- Correlation
- Time Series Analysis
- Hypothesis Testing and Regression
- Model Development and Evaluation
- EDA on Wine Quality Data Analysis
- Appendix
- ISBN-101789537258
- ISBN-13978-1789537253
- PublisherPackt Publishing
- Publication dateMarch 27, 2020
- LanguageEnglish
- Dimensions7.5 x 0.8 x 9.25 inches
- Print length352 pages
Frequently bought together

Customers who viewed this item also viewed
Exploratory Data Analysis with Python Cookbook: Over 50 recipes to analyze, visualize, and extract insights from structured and unstructured dataPaperbackFREE Shipping by AmazonGet it as soon as Wednesday, Dec 3
Customers also bought or read
- Exploratory Data Analysis with Python Cookbook: Over 50 recipes to analyze, visualize, and extract insights from structured and unstructured data
Paperback$40.49$40.49FREE delivery Wednesday - Mathematics of Machine Learning: Master linear algebra, calculus, and probability for machine learning#1 Best SellerMathematical Analysis
Paperback$50.99$50.99FREE delivery Wednesday - Machine Learning with PyTorch and Scikit-Learn: Develop machine learning and deep learning models with Python
Paperback$39.95$39.95FREE delivery Wednesday - The Big Book of Dashboards: Visualizing Your Data Using Real-World Business Scenarios
Paperback$23.93$23.93Delivery Wednesday - Python for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyter
Paperback$43.99$43.99FREE delivery Wednesday
From the brand
Editorial Reviews
About the Author
Suresh Kumar Mukhiya is a PhD candidate, currently affiliated to the Western Norway University of Applied Sciences (HVL). He is a big data enthusiast, specializing in Information Systems, Model-Driven Software Engineering, Big Data Analysis, Artificial Intelligence and Frontend development. He has completed a Masters in Information Systems from the Norwegian University of Science and Technology (NTNU, Norway) along with a thesis in processing mining. He also holds a bachelor's degree in computer science and information technology (BSc.CSIT) from Tribhuvan University, Nepal, where he was decorated with the Vice-Chancellor's Award for obtaining the highest score. He is a passionate photographer and a resilient traveler.
Usman Ahmed is a data scientist and Ph.D. candidate at Western Norway University of Applied Science (HVL). He has rich experience in building and scaling high-performance systems based on data mining, natural language processing, and machine learning. Usman's research interests are sequential data mining, heterogeneous computing, natural language processing, a recommendation system, and machine learning. He has completed a Master's of Science in computer science from Capital University of Science and Technology, Islamabad, Pakistan. Usman Ahmed was awarded Gold Medal in Bachelor of Computer Science from Heavy Industries Taxila Education City.
Product details
- Publisher : Packt Publishing
- Publication date : March 27, 2020
- Language : English
- Print length : 352 pages
- ISBN-10 : 1789537258
- ISBN-13 : 978-1789537253
- Item Weight : 1.33 pounds
- Dimensions : 7.5 x 0.8 x 9.25 inches
- Best Sellers Rank: #2,716,959 in Books (See Top 100 in Books)
- #853 in Data Modeling & Design (Books)
- #1,253 in Data Processing
- #2,078 in Python Programming
- Customer Reviews:
About the author

Suresh Kumar Mukhiya is a Ph.D. candidate currently affiliated with the Western Norway University of Applied Sciences (HVL). He has completed his Master's degree in information systems at the Norwegian University of Science and Technology (NTNU, Norway), along with a thesis in processing mining. He also holds a Bachelor's degree in computer science and information technology (BSc.CSIT) from Tribhuvan University, Nepal, where he was decorated with the Vice-Chancellor's Award for obtaining the highest score. He is a passionate photographer and a resilient traveler.
Suresh has many years of experience with coding in Python and other programming languages. He has given several seminars on the practical applications of data science and machine learning and deep learning over the years. Suresh loves to write and talk about data science, machine learning, and Python. He is very motivated to help people developing data-driven solutions without necessarily requiring a machine learning background. He believes Artificial Intelligence is the driving fuel for computer science in the coming days.
Customer reviews
Customer Reviews, including Product Star Ratings help customers to learn more about the product and decide whether it is the right product for them.
To calculate the overall star rating and percentage breakdown by star, we don’t use a simple average. Instead, our system considers things like how recent a review is and if the reviewer bought the item on Amazon. It also analyzed reviews to verify trustworthiness.
Learn more how customers reviews work on AmazonTop reviews from the United States
There was a problem filtering reviews. Please reload the page.
- Reviewed in the United States on October 15, 2020Format: KindlePros:
1. Covers all the important concepts you need to know to get started with EDA.
2. Lots of sample codes which the reader can use to get their hands dirty.
3. Works excellently as a reference book to look up the basic concepts of EDA.
Cons:
This book only contains an brief overview of the concepts. So, if you want to get a very deep understanding of the subject matter, this is not the book for you.
Top reviews from other countries
samarth saxenaReviewed in India on August 28, 20201.0 out of 5 stars Waste of money 💰 , content of no use
Format: PaperbackVerified PurchaseWorst print , black - white copy , no colour print as shown in Google book preview and not any special content
Worst print , black - white copy , no colour print as shown in Google book preview and not any special content1.0 out of 5 stars
samarth saxenaWaste of money 💰 , content of no use
Reviewed in India on August 28, 2020
Images in this review
DavidReviewed in the United Kingdom on June 27, 20251.0 out of 5 stars Don't waste your time
Format: PaperbackVerified PurchaseNot worth your time, the tone of the author is needlessly condescending, a significant portion of the book is basically just scraped basic tutorials of numpy, pandas and seaborn. There are also a number of serious errors in the code, examples do not work and require modification to work as the author intended and some of the referenced datasets do not exist
Don't waste your time with this book, just set up the libraries and have a play around on the quick-start tutorials for those instead.







