Apache Kafka

Course Overview

Apache Kafka is a distributed, publish-subscribe messaging platform that is designed to be fast, scalable, and durable. Kafka is used for building real-time data pipelines and streaming applications. Training in Apache Kafka typically covers the fundamental concepts and architecture of Kafka, as well as how to develop and deploy Kafka-based applications. It may also cover topics such as Kafka clustering, Kafka security, and Kafka performance tuning.

At the end of the training, participants will be able to:

  1. Describe the architecture of Kafka and explain business use cases thereof 
  2. Install & configure Kafka cluster in either single-node or multi-node setup
  3. Implement Kafka Producer and Consumer using real-time streaming data
  4. Apply their learning to use powerful features like partitions & the messaging system
  5. Use Kafka interfaces as per their organization’s requirement.

Pre-requisite

Basic Java programming knowledge is required.

Duration

2 days

Course Outline

  1. What is Big Data?
  2. Three Vs Of Big data
  3. Data storage sizes
  4. Evolution of Data
  5. Features of Big Data
  6. Big Data Analysis
  7. Technology Comparison
  8. Apache Hadoop
  9. Apache Kafka
  10. Apache Storm
  11. Apache Spark
  12. Apache Cassandra
  13. Apache HBase
  1. Overview of Zookeeper
  2. Distributed Application
  3. Characteristics of Zookeeper
  4. Zookeeper data model
  5. Sequential Znode
  6. Vmware
  7. PuTTY and WinSCP
  8. Zookeeper CLI and APIs
  1. What is Apache Kafka?
  2. Characteristics of Kafka
  3. Kafka Use cases
  4. Kafka Data Model
  5. Kafka Architecture
  6. Core APIs of Kafka
  7. Kafka Connect
  1. Introduction to Kafka Interface
  2. Creating kafka Topic
  3. Modifying kafka topic
  4. Kafka console producer
  5. Kafka console consumer
  6. Java Interface to Kafka
  7. Producer and consumer side APIs
  8. Running java program in Kafka
Accordion Content
Accordion Content
Accordion Content
Accordion Content

Reviews