Apache Cassandra

Course Overview

Gain a practical working knowledge of Cassandra architecture, interfaces and data model. Master the deployment of Apache Cassandra – an open-source distributed NoSQL database.  Build scalable database solutions with high availability and performance. Deploy Cassandra to manage your big data with tunable consistency.

At the end of the training, participants will be able to:

  1. Explain Big data and NoSQL databases
  2. List the features of Cassandra
  3. Explain the architecture and data model of Cassandra
  4. Discuss Hadoop ecosystem of products around Cassandra
  5. Deploy NoSQL database solutions using Cassandra

Pre-requisite

Exposure to SQL databases and Java programming.

Duration

3 days

Course Outline

  1. What is Big data?
  2. Three V’s of Big data
  3. Data Volume
  4. Data Velocity
  5. Data Variety
  6. Evolution of Data
  7. Features of Big Data
  8. What is Apache Hadoop?
  9. Components of Hadoop: MapReduce, HDFS
  10. What is NoSQL?
  11. Difference between RDBMS and NoSQL Databases
  12. CAP Principle
  13. Types of NoSQL
  14. NoSQL Cassandra Database
  1. What is Cassandra?
  2. Cassandra: Use Cases
  3. Cassandra: Use in Industry
  4. Features of Cassandra
  5. Advantages of Cassandra
  6. Cassandra Commands for LINUX
  1. Cassandra Architecture
  2. Cassandra Architecture Components
  3. Data replication
  4. Simple strategy
  5. Network Topology
  6. Data Partition
  7. Snitches
  8. Gossip Protocol
  9. Seed Nodes
  10. What is Token?
  11. What is Virtual Node?
  12. Write Process
  13. Read Process
  1. Introduction to Data Model
  2. Features of Cassandra Data Model
  3. Cassandra Data Model Rules
  4. Cassandra Data Model Components
  5. UUIS and Time UUID
  6. Counter
  7. Features of Counter
  8. Compound Key
  9. Indexes
  10. Collections
  11. Types of Collections
  12. CQL
  13. DML Statements
  14. DDL Statements
  1. What is CQL?
  2. What is cqlsh?
  3. cqlsh Options
  4. cqlsh Shell Commands
  5. CQL Data Definition
  6. CQL Data Manipulation
  7. Java Interfaces
  8. Queries using java Interfaces
  9. ODBC Driver for Cassandra

 

  1. What is Partitioning?
  2. Features of Partitioners
  3. Types of Partitioners
  4. Replication of Data
  5. Replication strategy
  6. Types of Common replication Strategies
  7. Tunable Consistency
  8. Read Consistency
  9. Write Consistency
  10. Hinted handoff
  11. Time to Live
  12. Tombstones
  13. Monitoring the Cluster
  14. Monitoring the NodeTool
  15. Monitoring with OpsCenter
  1. CassandraStream
  2. Apache Storm
  3. Apache Kafka
  4. Real Time Data Analysis Platform
  5. Apache spark
  6. Spark and scala
Accordion Content

Reviews