Big Data: Principles and best practices of scalable realtime data systems

big-data-scalable-realtime-systems

Author: James Warren, Nathan Marz
Pub Date: 2015
Publisher: Manning
ISBN: 978-1-617290-34-3
Pages: 328
Language: English
Format: PDF/EPUB
Size: 14 Mb

Big Data teaches you to build big data systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data. It describes a scalable, easy-to-understand approach to big data systems that can be built and run by a small team. Following a realistic example, this book guides readers through the theory of big data systems, how to implement them in practice, and how to deploy and operate them once they’re built. Web-scale applications like social networks, real-time analytics, or e-commerce sites deal with a lot of data, whose volume and velocity exceed the limits of traditional database systems. These applications require architectures built around clusters of machines to store and process data of any size, or speed. Fortunately, scale and simplicity are not mutually exclusive. Big Data teaches you to build big data systems using an architecture designed specifically to capture and analyze web-scale data. This book presents the Lambda Architecture, a scalable, easy-to-understand approach that can be built and run by a small team. You’ll explore the theory of big data systems and how to implement them in practice. In addition to discovering a general framework for processing big data, you’ll learn specific technologies like Hadoop, Storm, and NoSQL databases. This book requires no previous exposure to large-scale data analysis or NoSQL tools. Familiarity with traditional databases is helpful. What’s Inside

  • Introduction to big data systems
  • Real-time processing of web-scale data
  • Tools like Hadoop, Cassandra, and Storm
  • Extensions to traditional database skills

Table of Contents

1: A new paradigm for Big Data

Part 1: Batch layer
2: Data model for Big Data
3: Data model for Big Data: Illustration
4: Data storage on the batch layer
5: Data storage on the batch layer: Illustration
6: Batch layer
7: Batch layer: Illustration
8: An example batch layer: Architecture and algorithms
9: An example batch layer: Implementation

Part 2: Serving layer
10: Serving layer
11: Serving layer: Illustration

Part 3: Speed layer
12: Realtime views
13: Realtime views: Illustration
14: Queuing and stream processing
15: Queuing and stream processing: Illustration
16: Micro-batch stream processing
17: Micro-batch stream processing: Illustration
18: Lambda Architecture in depth

Download

Download