Apache Kafka
Course Overview
Apache Kafka is a distributed publish-subscribe messaging platform designed to be fast, scalable, and durable. It is used to build real-time data pipelines and streaming applications. Training in Apache Kafka typically covers Kafka's fundamental concepts and architecture, as well as how to develop and deploy Kafka-based applications. It may also cover topics such as Kafka clustering, security, and performance tuning.
At the end of the training, participants will be able to:
- Describe the architecture of Kafka and explain its business use cases
- Install and configure a Kafka cluster in a single-node or multi-node setup
- Implement a Kafka producer and consumer using real-time streaming data
- Apply what they have learned to use Kafka features such as partitions and the messaging system
- Use Kafka interfaces according to their organization’s requirements
Prerequisite
Basic Java programming knowledge is required.
Duration
2 days
Course Outline
- What is Big Data?
- Three Vs of Big Data
- Data storage sizes
- Evolution of Data
- Features of Big Data
- Big Data Analysis
- Technology Comparison
- Apache Hadoop
- Apache Kafka
- Apache Storm
- Apache Spark
- Apache Cassandra
- Apache HBase
- Overview of ZooKeeper
- Distributed Application
- Characteristics of ZooKeeper
- ZooKeeper data model
- Sequential Znode
- VMware
- PuTTY and WinSCP
- ZooKeeper CLI and APIs
- What is Apache Kafka?
- Characteristics of Kafka
- Kafka Use cases
- Kafka Data Model
- Kafka Architecture
- Core APIs of Kafka
- Kafka Connect
- Introduction to Kafka Interface
- Creating a Kafka topic
- Modifying a Kafka topic
- Kafka console producer
- Kafka console consumer
- Java Interface to Kafka
- Producer and consumer side APIs
- Running a Java program with Kafka