Название: Practical Lakehouse Architecture: Designing and Implementing Modern Data Platforms at Scale (5th Early Release) Автор: Gaurav Ashok Thalpati Издательство: O’Reilly Media, Inc. Год: 2024-05-16 Страниц: 238 Язык: английский Формат: epub Размер: 14.3 MB
This concise yet comprehensive guide explains how to adopt a data lakehouse architecture to implement modern data platforms. It reviews the design considerations, challenges, and best practices for implementing a lakehouse and provides key insights into the ways that using a lakehouse can impact your data platform, from managing structured and unstructured data and supporting BI and AI/ML use cases to enabling more rigorous data governance and security measures.
Lakehouse architecture is one such modern architectural pattern that has evolved in the last few years. It has become a popular choice for data architects who are designing data platforms. In the Chapter 1, I’ll introduce you to fundamental concepts related to data architecture, data platform and its core components, and how data architecture helps build a data platform. Once you have understood these, I’ll explain why there is a need for new architectural patterns like lakehouse, lakehouse fundamentals, its characteristics, and the benefits of implementing a data platform using lakehouse architecture. I’ll conclude the chapter with key takeaways to summarize everything we discuss and help you remember the key points while reading the subsequent chapters in this book.
Practical Lakehouse Architecture shows you how to:
Understand key lakehouse concepts and features like transaction support, time travel, and schema evolution Understand the differences between traditional and lakehouse data architectures Differentiate between various file formats and table formats Design lakehouse architecture layers for storage, compute, metadata management, and data consumption Implement data governance and data security within the platform Evaluate technologies and decide on the best technology stack to implement the lakehouse for your use case Make critical design decisions and address practical challenges to build a future-ready data platform Start your lakehouse implementation journey and migrate data from existing systems to the lakehouse
Chapter 1: Introduction to Lakehouse Architecture (available) Chapter 2: Traditional Architectures and Modern Data Platforms (available) Chapter 3: Storage: Heart of the Lakehouse (available) Chapter 4: Data Catalogs (available) Chapter 5: Compute Engines for Lakehouse Architectures (available) Chapter 6: Lakehouse Data (and AI) Governance and Security (available) Chapter 7: The Big Picture: Designing and Implementing Your Lakehouse Platform (available) Chapter 8: Lakehouse in the Real World (available) Chapter 9: Conclusion (unavailable)
Скачать Practical Lakehouse Architecture (5th Early Release)