
Data Analytics with Google Cloud Platform
US$ 19.95
The publisher has enabled DRM protection, which means that you need to use the BookFusion iOS, Android or Web app to read this eBook. This eBook cannot be used outside of the BookFusion platform.
Description
Contents
Reviews
Language
English
ISBN
9789389423631
Cover Page
Title Page
Copyright Page
Disclaimer
Dedication
About the Author
About the Reviewer
Acknowledgement
Preface
Errata
Table of Contents
1. GCP Overview and Architecture
Introduction
Structure
Objectives
Cloud computing history
On-premise versus cloud computing
Benefits of cloud computing
Infrastructure as a Service (IaaS)
Platform as a Service (PaaS)
Software as a Service (SaaS)
The cloud computing architecture
Google Cloud Platform
Why Google Cloud Platform?
Google Cloud Platform regions and zones
Google Cloud Platform Console
Billing
Resource hierarchy
Projects
Command-line interface
Roles and services in GCP
Primitive roles
Predefined roles
Custom roles
Application Engine
Services
Versions
Instances
Application requests
Limits
Compute Engines
Instances
Container Engines
Container technologies that run on Compute Engine
Container-optimized VM images
Cloud Functions
Connect and extend cloud services
Events and triggers
Serverless
Security via IAM
Cloud IAM and policy APIs
Policy hierarchy
Conclusion
Questions
2. Google Cloud Platform Storage
Cloud Storage
Overview of storage classes
Comparison of storage classes
Bucket
Cloud Datastore
Cloud Firestore in Datastore mode
Comparison of Cloud Datastore and Cloud Firestore with ancient databases
Cloud Firestore
Documents
Collections
Cloud SQL
Comparison between Cloud SQL and standard MySQL based on their functionality
Cloud SQL for PostgreSQL
Differences between Cloud SQL and the standard PostgreSQL functionality
Cloud Spanner
Cloud Bigtable
Cloud Bigtable storage model
Cloud Bigtable architecture
Cloud BigQuery
Conclusion
Questions
3. Data Processing and Message with Dataflow and Pub/Sub
Introduction
Structure
Objectives
Cloud Dataflow
Cloud Dataflow templates
Traditional versus templated job execution
Data transformation with Cloud Dataflow
Example of the WordCount Template
Apache Beam
Working of Apache Beam code
Cloud Pub/Sub
Publisher-subscriber relationships
Cloud Pub/Sub message flow
Cloud Pub/Sub integrations
Fundamentals of a Publish/Subscribe service
Judging performance of a messaging service
Cloud Pub/Sub basic architecture
Control plane
Data plane - The lifecycle of a message
Cloud Pub/Sub implementation
Conclusion
Questions
4. Data Processing with Dataproc and Dataprep
Introduction
Structure
Objectives
Cloud Dataproc
Cloud Dataproc usage
Cloud Dataproc parts
Removing/Terminating the Dataproc Cluster
Moving on-premises Hadoop infrastructure to GCP
Best practices of Cloud Dataproc
Cloud Dataprep
Setting up Cloud Dataprep service
Create a flow in Cloud Dataprep
Conclusion
Questions
5. BigQuery and Data Studio
Introduction
Structure
Objectives
BigQuery
Google BigQuery architecture
Storing data in BigQuery
BigQuery execution
Tree Architecture
BigQuery versus MapReduce
Comparing BigQuery and Redshift
Getting started with Google BigQuery
Introduction to partitioned tables
Ingestion-time based partitioned tables
Date or timestamp partitioned tables
Partitioning versus sharding
Partitioned table quotas and limits
Assign a project-level Cloud IAM role to data analysts
Introduction to external data sources
BigQuery best practices for controlling prices
Optimizing query computation
Google Data Studio
Building the Google Data Studio report
Connecting Google Analytics Account to Google Data Studio
Conclusion
Questions
6. Machine Learning with GCP
Objective
Structure
Machine learning and types of machine learning
Supervised learning
TensorFlow and machine learning
Linear regression with TensorFlow
Training in linear regression TensorFlow
The Estimator API
Unsupervised machine learning
Cloud ML API (vision, translate, speech)
AutoML
Vision AI
Video AI
AutoML Video Intelligence
Cloud Video Intelligence API
Speech to text API
Using client libraries
Using gccloud tool
Text-to-Speech API
Using client libraries
Dialogue Flow
Dialogue Flow Console introduction
Agents
Intents
Entities
Events
Fulfillments
Integration
Conclusion
Questions
7. Sample Use Cases and Example
Introduction
Structure
Objective
Setting up a mobile gaming analytics platform - a reference architecture
Real-time event processing using streaming pattern
Message source: mobile or game server
Cloud Dataflow streaming pipeline
Data visualization tools
Bulk process using batch pattern
Asynchronous transfer service pattern
Alternative batch pattern: Direct loading from Cloud Storage to BigQuery
Operational factors for the reference architectures
Managing and processing logs at scale by leveraging the Cloud Dataflow
Overview
Understanding the Cloud Dataflow pipeline
Receiving the data
Collecting the data into objects
Aggregating the data by days
Loading the data into BigQuery
Querying the data from BigQuery
Using a streaming pipeline
Monitoring the pipeline
Delete the project
Removing all the components
Product recommendations using machine learning on Compute Engine
Scenario
Solution overview
Choosing the components
Collecting the data
Storing the data
Generating the insights from data
Training the models
Finding the right model
Delivering the recommendations
Code walkthrough
Fetching the data from Cloud SQL
Translate the DataFrame to RDD and generate the various datasets
Train models based on various parameters
Calculating top predictions for the user
Saving the top predictions
Executing the solution
Conclusion
Questions
The book hasn't received reviews yet.