BPB Online LLP
Fundamentals of Deep Learning and Computer Vision
Fundamentals of Deep Learning and Computer Vision
US$ 19.95
The publisher has enabled DRM protection, which means that you need to use the BookFusion iOS, Android or Web app to read this eBook. This eBook cannot be used outside of the BookFusion platform.
Description
Contents
Reviews

Master Computer Vision concepts using Deep Learning with easy-to-follow steps

Key Features
Setting up the Python and TensorFlow environment
Learn core Tensorflow concepts with the latest TF version 2.0
Learn Deep Learning for computer vision applications
Understand different computer vision concepts and use-cases
Understand different state-of-the-art CNN architectures
Build deep neural networks with transfer Learning using features from pre-trained CNN models
Apply computer vision concepts with easy-to-follow code in Jupyter Notebook

Description
This book starts with setting up a Python virtual environment with the deep learning framework TensorFlow and then introduces the fundamental concepts of TensorFlow. Before moving on to Computer Vision, you will learn about neural networks and related aspects such as loss functions, gradient descent optimization, activation functions and how backpropagation works for training multi-layer perceptrons.

To understand how the Convolutional Neural Network (CNN) is used for computer vision problems, you need to learn about the basic convolution operation. You will learn how CNN is different from a multi-layer perceptron along with a thorough discussion on the different building blocks of the CNN architecture such as kernel size, stride, padding, and pooling and finally learn how to build a small CNN model.

The book concludes with a chapter on sequential models where you will learn about RNN, GRU, and LSTMs and their architectures and understand their applications in machine translation, image/video captioning and video classification.

Language
English
ISBN
9789388511858
Cover Page
Title Page
Copyright Page
About the Authors
Acknowledgement
Preface
Errata
Table of Contents
1. Introduction to TensorFlow
Structure
Objective
Machine learning and deep learning
What is TensorFlow?
TensorFlow installation
Virtual environment
Dataflow graphs
Tensors
Graph
Dataflow graph in TensorFlow
TensorFlow operation
Static shape
Dynamic shape
Session in TensorFlow
Create TensorFlow graph
Fetches
Feedings in TensorFlow
Placeholders
Variables
Assign value
Name scopes
Distributed computing in TensorFlow
Conclusion
2. Introduction to Neural Networks
Structure
Objective
Introduction to ANN
Feedforward Neural Network
XOR Function Using a Linear Model
Learning Based on Gradients
Cost/Loss Functions
Least Square Function
Cross-Entropy Function
Softmax Function
Optimization
Gradient Descent
Stochastic Gradient Descent
Activation Function
The Sigmoid Function
The Tanh Function
Relu
Leaky relu
Backpropagation
Overfitting and Underfitting
Conclusion
3. Convolutional Neural Network
Structure
Objectives
Introduction to CNN
Convolution operation
Why CNN?
Spatial relation among pixels
Convolution in 2D
Translation equivariance
Stride
Padding
Convolution in 3D image
Notations used in CNN
Pooling layer
Architecture of CNN
Conclusion
4. CNN Architectures
Structure
Objective
AlexNet
VGG Net
GoogLeNet/Inception network
ResNets
Deep residual learning framework
Fast R-CNN
Faster R-CNN
Single Shot Detector (SSD)
You Only Look Once (YOLO)
Direct location prediction
Dimension cluster
Feature upsampling and concatenation
Conclusion
5. Sequential Models
Structure
Objective
RNN
Why RNN?
Forward pass
Variants of RNN
Bidirectional RNNs
Motivation for RNN
Language modeling
Machine translation
Image Captioning
Backward pass
Vanishing gradient problem
Long Short-Term Memory (LSTM)
Architecture of LSTM
Variants on LSTM
Application of RNNs in Image and Video Analytics
Object recognition and captioning in video
Video description
Video classification
Conclusion
Bibiliography

Loading...