BPB Online LLP
Practical Full Stack Machine Learning
Alok Kumar
Practical Full Stack Machine Learning
US$ 19.95
The publisher has enabled DRM protection, which means that you need to use the BookFusion iOS, Android or Web app to read this eBook. This eBook cannot be used outside of the BookFusion platform.
Description
Contents
Reviews

Master the ML process, from pipeline development to model deployment in production.

Key Features
● Prime focus on feature-engineering, model-exploration & optimization, dataops, ML pipeline, and scaling ML API.
● A step-by-step approach to cover every data science task with utmost efficiency and highest performance.
● Access to advanced data engineering and ML tools like AirFlow, MLflow, and ensemble techniques.

Description
'Practical Full-Stack Machine Learning' introduces data professionals to a set of powerful, open-source tools and concepts required to build a complete data science project. This book is written in Python, and the ML solutions are language-neutral and can be applied to various software languages and concepts.

The book covers data pre-processing, feature management, selecting the best algorithm, model performance optimization, exposing ML models as API endpoints, and scaling ML API. It helps you learn how to use cookiecutter to create reusable project structures and templates. It explains DVC so that you can implement it and reap the same benefits in ML projects.It also covers DASK and how to use it to create scalable solutions for pre-processing data tasks. KerasTuner, an easy-to-use, scalable hyperparameter optimization framework that solves the pain points of hyperparameter search will be covered in this book. It explains ensemble techniques such as bagging, stacking, and boosting methods and the ML-ensemble framework to easily and effectively implement ensemble learning.
The book also covers how to use Airflow to automate your ETL tasks for data preparation. It explores MLflow, which allows you to train, reuse, and deploy models created with any library. It teaches how to use fastAPI to expose and scale ML models as API endpoints.

What you will learn
● Learn how to create reusable machine learning pipelines that are ready for production.
● Implement scalable solutions for pre-processing data tasks using DASK.
● Experiment with ensembling techniques like Bagging, Stacking, and Boosting methods.
● Learn how to use Airflow to automate your ETL tasks for data preparation.
● Learn MLflow for training, reprocessing, and deployment of models created with any library.
● Workaround cookiecutter, KerasTuner, DVC, fastAPI, and a lot more.

Who this book is for
This book is geared toward data scientists who want to become more proficient in the entire process of developing ML applications from start to finish. Knowing the fundamentals of machine learning and Keras programming would be an essential requirement.

Table of Contents
1. Organizing Your Data Science Project
2. Preparing Your Data Structure
3. Building Your ML Architecture
4. Bye-Bye Scheduler, Welcome Airflow
5. Organizing Your Data Science Project Structure
6. Feature Store for ML
7. Serving ML as API

Language
English
ISBN
9789391030421
Cover Page
Title Page
Copyright Page
About the Author
About the Reviewer
Acknowledgements
Preface
Errata
Table of Contents
1. Organizing Your Data Science Project
Structure
Objective
1.1 Project folder and code organization
1.2 GPU 101
1.3 On-premises vs cloud
1.4 Deciding your framework
1.5 Deciding your targets
1.6 Preparing baseline
1.7 Managing workflow
Observing an experiment
Output:
Conclusion
Questions
Points to remember
Further reading
2. Preparing Your Data
Structure
Objective
2.1 Data exploration with facets
2.2 Missing data conundrum: Imputation techniques
2.3 Scaling data
2.4 Outlier treatment
2.5 Feature engineering
2.6 Data collection
2.7 Large scale data processing with Dask
Dask-ML
2.8 Data distribution
Training and validation data distribution
Distribution mis-match
2.9 Training data management with Snorkel
Writing Labelling Functions (LFs)
Keyword matching
Custom heuristic
Third party model
Labels cleaning
Transformation for data augmentation
Slicing function
2.10 Data versioning
DVC
Conclusion
Points to remember
Questions
3. Building Your ML Architecture
Structure
Objective
3.1 Building intuition for algorithms
3.1.1 How algorithms learn?
3.1.2 What do algorithms learn?
3.1.3 Algorithm use cases
3.2 Hyperparameters selection and optimization using Keras Tuner
3.2.1 Grid Search
3.2.2 Random Search
3.2.3 Bayesian optimization
3.2.4 Keras Tuner
Hyperband
Bayesian optimization
3.3 Transfer learning for computer vision problems
Training the top layer
Fine-tuning
Generalizing transfer learning
Training the bottom layers
Model distillation
3.4 Transfer learning for NLP
Pre-trained language model
3.5 Model ensembling using ML-ens
Bagging
K-fold cross-validation
Boosting
Weight-based Boosting
Residual-based boosting
Stacking
Varying models
ML-Ensemble
Multi-layer ensembles
Ensemble model selection
The scoring function
Evaluator
Pre-processing
3.6 Conclusion
3.7 Points to remember
3.8 Questions
4. Bye-Bye Scheduler, Welcome Airflow
Structure
Objective
4.1 Introducing Airflow
4.2 Installing Airflow
4.3 Running Airflow
4.3.1 Running Airflow via UI
4.3.2 Running Airflow via command line
4.3.2.1 Airflow UI tour
4.3.2.2 Airflow command line tour
4.4 Executors in Airflow
Sequential
Local
Celery
4.5 Anatomy of a DAG
4.6 Understanding scheduling
4.6.1 Understanding scheduling preset
4.6.2 Understanding cron scheduling
4.6.3 Understanding frequency based scheduling
4.6.4 Understanding dynamic time
4.6.5 Understanding backfilling and catch up
4.7 Airflow operators
4.7.1 Python operator
4.7.2 Email operator
4.7.3 Setting up notifications at DAG
4.7.3.1 Setting up notification at DAG task
4.7.4 Understanding sensor operators
4.7.4.1 Understanding file sensor operators
4.7.4.2 Understanding custom pooling
4.7.5 Understanding TriggerDAGRunOperator
4.8 Managing task branching and dependencies
4.9 Sharing data between tasks using XCom
4.10 Understanding Airflow variables
4.10.1 Variable creation
4.10.1.1 UI
4.10.1.2 CLI
4.10.1.3 Code
4.10.1.4 Environment variables
4.10.2 Hiding variable values
4.11 Conclusion
4.12 Points to remember
4.13 Questions
5. Organizing Your Data Science Project Structure
Structure
Structure
Objective
5.1 Understanding ML pipeline challenges
5.2 MLflow components
5.2.1 MLflow tracking
Backend stores
5.2.2 MLflow projects
5.2.3 Multistep workflow
5.2.4 MLflow models
5.2.5 MLflow Registry
5.2.5.1 Registry components
5.2.5.2 Workflow
5.3 Conclusion
5.4 Points to remember
5.5 Questions
6. Feature Store for ML
Structure
Objective
6.1 Introduction to feature stores
6.1.1 Understanding features
6.1.2 Feature store motivation
6.1.2.1 Duplication
6.1.2.2 Inconsistency
6.1.2.3 Inconsistent access to data
6.1.2.4 Difficulty in production deployment
6.1.2.5 Incorrect training and production data
6.1.3 Feature store guiding principles
6.2 Feature store frameworks
6.3 Feast
6.3.1 Feast components
6.3.2 Installing Feast
6.3.3 Architecture
Storage
6.3.4 Feast concepts
6.3.4.1 Entity
6.3.4.2 Sources
6.3.4.3 Feature tables
6.4 Conclusion
6.5 Points to remember
6.6 Questions
7. Serving ML as API
Structure
Objective
7.1 Introducing fastAPI
7.1.1 Installation of fastAPI
7.1.2 fastAPI concepts
7.2 Deploying ML model as fastAPI
7.3 Scaling ML serving with fastAPI and Ray Serve
7.4 Conclusion
7.5 Points to remember
7.6 Questions
Index
The book hasn't received reviews yet.
You May Also Like
IT Interview Guide for Freshers
$19.95
Sameer S Paradkar
IT Interview Guide for Freshers
Up and Running Google AutoML and AI Platform
$19.95
Navin Sabharwal, Amit Agarwal
Up and Running Google AutoML and AI Platform
Handbook of Cloud Computing
$19.95
Dr. Anand Nayyar
Handbook of Cloud Computing
Cracking Microservices Interview
$19.95
Sameer S. Paradkar
Cracking Microservices Interview