Название: Data Analytics with Google Cloud Platform: Build Real Time Data Analytics on Google Cloud Platform Автор: Murari Ramuka Издательство: BPB Publications Год: 2020 Формат: True PDF Страниц: 265 Размер: 10.6 Mb Язык: English
Step-by-step guide to different data movement and processing techniques, using Google Cloud Platform Services. Modern businesses are awash with data, making data-driven decision-making tasks increasingly complex. As a result, relevant technical expertise and analytical skills are required to do such tasks. This book aims to equip you with enough knowledge of Cloud Computing in conjunction with Google Cloud Data platform to succeed in the role of a Cloud data expert. The current market is trending towards the latest cloud technologies, which is the need of the hour. Google being the pioneer, is dominating this space with the right set of cloud services being offered as part of GCP (Google Cloud Platform). At this juncture, this book will be very vital and will be cover all the services that are being offered by GCP, putting emphasis on Data services. What will you learn By the end of the book, you will have come across different data services and platforms offered by Google Cloud, and how those services/features can be enabled to serve business needs. You will also see a few case studies to put your knowledge to practice and solve business problems such as building a real-time streaming pipeline engine, Scalable Datawarehouse on Cloud, fully managed Hadoop cluster on Cloud and enabling TensorFlow/Machine Learning API’s to support real-life business problems. Remember to practice additional examples to master these techniques. Key Features ? Learn the basic concept of Cloud Computing along with different Cloud service provides with their supported Models (IaaS/PaaS/SaaS) ? Learn the basics of Compute Engine, App Engine, Container Engine, Project and Billing setup in the Google Cloud Platform ? Learn how and when to use Cloud DataFlow, Cloud DataProc and Cloud DataPrep ? Build real-time data pipeline to support real-time analytics using Pub/Sub messaging service ? Setting up a fully managed GCP Big Data Cluster using Cloud DataProc for running Apache Spark and Apache Hadoop clusters in a simpler, more cost-efficient manner ? Learn how to use Cloud Data Studio for visualizing the data on top of Big Query ? Implement and understand real-world business scenarios for Machine Learning, Data Pipeline Engineering