Apache Storm Training
Live Online & Classroom Enterprise Training Course
Become an expert on Apache Storm. Unlock the full potential of Storm and gain expertise in real-time data processing through our comprehensive course. Take your Storm expertise to new heights by exploring advanced topics that amplify your real-time data processing capabilities.
Introduction to Apache Storm: Features and Benefits
Course Overview:
Apache Storm is a distributed real-time processing system designed to handle high volumes of data with low latency. It processes streams of data as they arrive, which makes it suitable for use cases such as real-time analytics and online machine learning. Apache Storm Training teaches participants how to use Apache Storm to build and deploy distributed real-time processing systems. The training covers the architecture of Apache Storm, how to write and deploy Storm topologies, and how to use Storm with other big data technologies.
At the end of the training, participants will be able to:
- Recognize differences between batch and real-time data processing
- Define Storm elements including tuples, streams, spouts, topologies, worker processes, executors, and stream groupings
- Recognize/interpret Java code for a spout, bolt, or topology
- Identify how to develop and submit a topology to a local or remote distributed cluster
- Recognize and explain the differences between reliable and unreliable Storm operation
- Manage and monitor Storm using the command-line client or browser-based Storm User Interface (UI)
- Define Trident elements including tuples, streams, batches, partitions, topologies, Trident spouts, and operations
- Recognize the differences between the different types of Trident state
- Recognize the differences in fault tolerance between different types of Trident spouts
- Define Kafka topics, producers, consumers, and brokers
- Publish Kafka messages to Storm or Trident topologies
- Work on real-world projects using Storm
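As a small illustration of the stream grouping objective above, the following Java sketch mimics the idea behind a fields grouping: tuples that share a key are always routed to the same task. It is a simplified, standalone model that does not use the Storm API; the class `FieldsGroupingSketch` and its methods `taskFor` and `route` are hypothetical names introduced here, and the hash-modulo rule is an assumption chosen for clarity rather than Storm's actual implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, standalone model of fields grouping (not the Storm API).
public class FieldsGroupingSketch {

    // Pick a task index for a key; equal keys always map to the same task.
    // Assumption: a simple hash-modulo rule stands in for the real routing logic.
    static int taskFor(String key, int numTasks) {
        return Math.floorMod(key.hashCode(), numTasks);
    }

    // Route each word to a task and collect what every task receives.
    static Map<Integer, List<String>> route(List<String> words, int numTasks) {
        Map<Integer, List<String>> perTask = new TreeMap<>();
        for (String word : words) {
            int task = taskFor(word, numTasks);
            perTask.computeIfAbsent(task, t -> new ArrayList<>()).add(word);
        }
        return perTask;
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList(
                "storm", "kafka", "storm", "trident", "kafka", "storm");
        // Print which words each of the three simulated tasks receives.
        route(words, 3).forEach((task, received) ->
                System.out.println("task " + task + " <- " + received));
    }
}
```

Because every occurrence of a word lands on one task, a per-word counter kept inside that task would see all of that word's tuples, which is why this style of grouping suits aggregations such as word counts.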
Prerequisites
- Prior programming experience and familiarity with the basic concepts of Core Java
- Prior knowledge of object-oriented programming concepts
- A basic understanding of Hadoop
Duration
2 days
Course Outline
- Bayesian Law
- Hadoop Distributed Computing
- Legacy Architecture of Real-Time System
- Difference between Storm and Hadoop
- The fundamental concepts of Storm
- Storm Development Environment
- Real Life Storm Project
- Apache Storm Installation
- Storm Architecture
- Logical Dynamic and Components in Storm
- Topology in Storm
- Storm Execution Components
- Stream Grouping
- Tuple
- Spout
- Reliable versus Unreliable Messages
- Getting Data: Direct connection, Enqueued Messages and DRPC
- Bolt Lifecycle
- Bolt Structure
- Normalization Bolt
- Reliable versus Unreliable Bolts
- Multiple Streams
- Multiple Anchoring
- Using IBasicBolt to Ack Automatically
- Hands-On:
- Creating a Storm project in Eclipse
- Running Storm bolts and spouts
- Running a Twitter example using Storm
- Grouping and its different types
- Reliable and unreliable messaging
- How to get data: direct connection and enqueued messages
- Life cycle of a bolt
- Stream Grouping
- Fields Grouping
- All Grouping
- Custom Grouping
- Direct Grouping
- Global Grouping
- None Grouping
- Hands-On:
- Using different grouping techniques in Storm topologies
- What is Trident
- Trident Spouts
- Types of Trident Spouts
- Trident Spout components
- Trident Spout Interface
- Trident Filter, Function, and Aggregator
- Hands-On:
- Implementing Trident Spouts and Bolts
- Transactional Topologies
- Partitioned Transactional Spouts
- Opaque Transactional Topologies
- Hands-On:
- Implementing a transactional system using transactional topologies
- Basic Kafka Concepts
- Kafka vs Other Messaging Systems
- Intra-Cluster Replication
- An Inside Look at Kafka’s Components
- Log Administration, Retention, and Compaction
- Hardware and Runtime Configurations
- Monitoring and Alerting
- Cluster Administration
- Securing Kafka
- Using Kafka Connect to Move Data