BPB Online LLP
Data Science with Jupyter
Data Science with Jupyter
US$ 19.95
The publisher has enabled DRM protection, which means that you need to use the BookFusion iOS, Android or Web app to read this eBook. This eBook cannot be used outside of the BookFusion platform.
Description
Contents
Reviews

Step-by-step guide to practising data science techniques with Jupyter notebooks

Key Features
Acquire Python skills to do independent data science projects
Learn the basics of linear algebra and statistical science in Python way
Understand how and when they're used in data science
Build predictive models, tune their parameters and analyze performance in few steps
Cluster, transform, visualize, and extract insights from unlabelled datasets
Learn how to use matplotlib and seaborn for data visualization
Implement and save machine learning models for real-world business scenarios

Description
Modern businesses are awash with data, making data driven decision-making tasks increasingly complex. As a result, relevant technical expertise and analytical skills are required to do such tasks. This book aims to equip you with just enough knowledge of Python in conjunction with skills to use powerful tool such as Jupyter Notebook in order to succeed in the role of a data scientist.

The book starts with a brief introduction to the world of data science and the opportunities you may come across along with an overview of the key topics covered in the book. You will learn how to setup Anaconda installation which comes with Jupyter and preinstalled Python packages. Before diving in to several supervised, unsupervised and other machine learning techniques, you’ll learn how to use basic data structures, functions, libraries and packages required to import, clean, visualize and process data. Several machine learning techniques such as regression, classification, clustering, time-series etc have been explained with the use of practical examples and by comparing the performance of various models.

By the end of the book, you will come across few case studies to put your knowledge to practice and solve real-life business problems such as building a movie recommendation engine, classifying spam messages, predicting the ability of a borrower to repay loan on time and time series forecasting of housing prices. Remember to practice additional examples provided in the code bundle of the book to master these techniques.

Audience
The book is intended for anyone looking for a career in data science, all aspiring data scientists who want to learn the most powerful programming language in Machine Learning or working professionals who want to switch their career in Data Science. While no prior knowledge of Data Science or related technologies is assumed, it will be helpful to have some programming experience.

Table of Contents
Data Science Fundamentals
Installing Software and Setting up
Lists and Dictionaries
Function and Packages
NumPy Foundation
Pandas and Dataframe
Interacting with Databases
Thinking Statistically in Data Science
How to import data in Python?
Cleaning of imported data
Data Visualization
Data Pre-processing
Supervised Machine Learning
Unsupervised Machine Learning
Handling Time-Series Data
Time-Series Methods
Case Study – 1
Case Study – 2
Case Study – 3
Case Study – 4

About the Author
Prateek is a Data Enthusiast and loves the data driven technologies. Prateek has total 7 years of experience and currently he is working as a Data Scientist in an MNC. He has worked with finance and retail clients and has developed Machine Learning and Deep Learning solutions for their business. His keen area of interest is in natural language processing and in computer vision. In leisure he writes posts about Data Science with Python in his blog.

Language
English
ISBN
9789388511377
Cover
Data Science with Jupyter
Copyright
About the Author
Preface
Acknowledgements
Erratta
Contents
1.    Data Science Fundamentals
What is Data?
What is Data Science?
What a Data Scientist actually do?
Real world use cases of Data Science
Why Python for Data Science?
2. Installing Software and Setting up
System Requirements
Downloading the Anaconda
Installing the Anaconda in Windows
Installing the Anaconda in Linux
How to install a new Python library in Anaconda
Open your notebook- Jupyter
Know your notebook
3. Lists and Dictionaries
What is list?
How to create a list?
Different list Manipulation operations
Difference between lists and tuples
What is dictionary?
How to create a dictionary?
Some operations with dictionary
4. Function and packages
Help() function in Python
How to import a Python package?
How to create and call a function?
Passing parameter in a function
Default parameter in a function
How to use unknown parameters in a function?
Global and Local variable in a function
What is Lambda function?
Understanding main in Python
5. NumPy Foundation
Importing a NumPy package
Why NumPy array over List?
NumPy array Attributes
Creating NumPy arrays
Accessing element of a NumPy array
Slicing in NumPy array
Array Concatenation
6. Pandas and Dataframe
Importing Pandas
Pandas Data Structures
.loc[] and .iloc[]
Some Useful DataFrame Functions
Handling missing values in DataFrame
7. Interacting with Databases
What is SQLAlchemy?
Installing SQLAlchemy Package
How to use SQLAlchemy?
SQLAlchemy Engine Configuration
Creating A Table In Database
Inserting Data In Table
Update a record
How to join two tables
8. Thinking Statistically in Data Science
Statistics in Data Science
Types of Statistical data/variables?
Mean, Median and Mode
Basics of Probability
Statistical Distributions
Pearson Correlation Coefficient
Real World Example
Statistical Inference and Hypothesis Testing
9. How to import data in Python?
Importing txt data
Importing csv data
Importing Excel data
Importing JSON data
Importing pickled data
Importing a compressed data
10. Cleaning of imported data
Know your data
Analysing Missing Values
Dropping Missing Values
Automatically Fill Missing Values
How to scale and normalize data?
How to Parse Dates?
How to apply character encoding
Cleaning inconsistent data
11. Data Visualization
Bar Chart
Line Chart
Histograms
Scatter Plot
Stacked Plot
Box Plot
12. Data Pre-processing
About the case-study
Importing the dataset
Exploratory Data Analysis
Data Cleaning & Pre-processing
Feature Engineering
13. Supervised Machine Learning
Some common ML Terms
Introduction to Machine Learning (ML)
List of common ML Algorithms
Supervised ML Fundamentals
Solving a Classification ML Problem
Why train/test split and cross validation?
Solving a Regression ML Problem
How to Tune your ML Model?
How to handle categorical variable in Sklearn?
Advance technique to handle missing data
14. unsupervised Machine Learning
Why Unsupervised Learning?
Unsupervised Learning Techniques
Kmeans-Clustering
Hierarchical Clustering
Principal Component Analysis (PCA)
Case Study
Validation of Unsupervised Ml
15. Handling time-Series Data
Why Time-Series is important?
How to handle Date and Time?
Transforming a Time Series Data
Manipulating a Time Series Data
Comparing Time Series Growth Rates
How to change Time Series Frequency?
16. Time-Series Methods
What is Time-Series forecasting?
Basic Steps in Forecasting
Time Series Forecasting Techniques
Forecast future traffic to a Web page
17. Case Study-1
18. Case Study-2
19. case Study-3
20. Case Study-4
Index

Loading...