Название: Big Data and Analytics: The key concepts and practical applications of Big Data analytics
Автор: Jugnesh Kumar, Anubhav Kumar, Rinku Kumar
Издательство: BPB Publications
Год: 2024
Страниц: 467
Язык: английский
Формат: pdf, epub, mobi
Размер: 10.1 MB
Unveiling insights, unleashing potential: Navigating the depths of Big Data and analytics for a data-driven tomorrow. Big Data and analytics is an indispensable guide that navigates the complex data management and analysis. This comprehensive book covers the core principles, processes, and tools, ensuring readers grasp the essentials and progress to advanced applications. It will help you understand the different analysis types like descriptive, predictive, and prescriptive. Learn about NoSQL databases and their benefits over SQL. The book centers on Hadoop, explaining its features, versions, and main components like HDFS (storage) and MapReduce (processing). Explore MapReduce and YARN for efficient data processing. Gain insights into MongoDB and Hive, popular tools in the Big Data landscape. Introduction to Hadoop - Hadoop's key characteristics, including distributed processing and storage, fault tolerance, and scalability, have made it the perfect solution for managing large and varied datasets. Hadoop has emerged as a top option for many data-driven applications due to benefits like affordability, compatibility with commodity hardware, and the capacity to process unstructured data. Its strong ecosystem, which includes different Hadoop distributions and Apache Pig for data processing, further strengthens its capabilities.
Автор: Jugnesh Kumar, Anubhav Kumar, Rinku Kumar
Издательство: BPB Publications
Год: 2024
Страниц: 467
Язык: английский
Формат: pdf, epub, mobi
Размер: 10.1 MB
Unveiling insights, unleashing potential: Navigating the depths of Big Data and analytics for a data-driven tomorrow. Big Data and analytics is an indispensable guide that navigates the complex data management and analysis. This comprehensive book covers the core principles, processes, and tools, ensuring readers grasp the essentials and progress to advanced applications. It will help you understand the different analysis types like descriptive, predictive, and prescriptive. Learn about NoSQL databases and their benefits over SQL. The book centers on Hadoop, explaining its features, versions, and main components like HDFS (storage) and MapReduce (processing). Explore MapReduce and YARN for efficient data processing. Gain insights into MongoDB and Hive, popular tools in the Big Data landscape. Introduction to Hadoop - Hadoop's key characteristics, including distributed processing and storage, fault tolerance, and scalability, have made it the perfect solution for managing large and varied datasets. Hadoop has emerged as a top option for many data-driven applications due to benefits like affordability, compatibility with commodity hardware, and the capacity to process unstructured data. Its strong ecosystem, which includes different Hadoop distributions and Apache Pig for data processing, further strengthens its capabilities.