Getting Started with Storm

£11.50

Getting Started with Storm

Continuous Streaming Computation with Twitter's Cluster Technology

Data capture and analysis

Authors: Jonathan Leibiusky, Gabriel Eisbruch, Dario Simonassi

Dinosaur mascot

Language: English

Published by: O'Reilly Media

Published on: 31st August 2012

Format: LCP-protected ePub

Size: 2 Mb

ISBN: 9781449324049


Introduction to Real-Time Data Analysis with Storm

Even as big data is turning the world upside down, the next phase of the revolution is already taking shape: real-time data analysis. This hands-on guide introduces you to Storm, a distributed, JVM-based system for processing streaming data. Through simple tutorials, sample Java code, and a complete real-world scenario, you’ll learn how to build fast, fault-tolerant solutions that process results as soon as the data arrives.

Discover how easy it is to set up Storm clusters for solving various problems, including continuous data computation, distributed remote procedure calls, and data stream processing.

Key Concepts and Techniques

  • Learn how to program Storm components: spouts for data input and bolts for data transformation
  • Discover how data is exchanged between spouts and bolts in a Storm topology
  • Make spouts fault-tolerant with several commonly used design strategies
  • Explore bolts— their life cycle, strategies for design, and ways to implement them
  • Scale your solution by defining each component’s level of parallelism
  • Study a real-time web analytics system built with Node.js, a Redis server, and a Storm topology
  • Write spouts and bolts with non-JVM languages such as Python, Ruby, and Javascript

Show moreShow less