Greenplum

CloudLabs

Projects

Assignment

24x7 Support

Lifetime Access

.

Course Overview

In this course, you will learn to design and implement the Greenplum environment and gain the information needed to install, configure, and manage the Greenplum database system. You will be introduced to the Greenplum environment, consisting of the Greenplum Database and supported systems. Greenplum was a big data analytics company headquartered in San Mateo, California. Greenplum’s products include its Unified Analytics Platform, Data Computing Appliance, Analytics Lab, Database, HD and Chorus.EMC Greenplum debuts its own Hadoop distribution, Pivotal HD, which marries Greenplum’s massively parallel processing database technology with the Apache Hadoop framework to create a technology called HAWQ.

At the end of the training, participants will be able to:

  • • Describe Greenplum database features, benefits, and architecture in terms of shared nothing, MPP design and how Greenplum database supports redundancy and high availability
  • • Install, configure, and administer a Greenplum Database
  • • Be proficient with DDL, DML, and DQL to access, manage, and query data
  • • Implement appropriate table storage models, compression, and tablespaces, data distributions, and table partitioning strategies to store data in a Greenplum database
  • • Understand best practices for data loading
  • • Be proficient in data modeling and physical design decisions
  • • Improve query performance by following a number of performance enhancement tips by understanding the Pivotal Query Optimizer, database tuning, query profiling, tuning and rewriting, and indexing strategies
  • • Implement security and access strategies
  • • Perform backup and restore

Pre-requisite

  1. Basic UNIX or Linux command-line navigation and administration skills
  2. Database query language basics, including but not limited to basic SQL knowledge for accessing database objects
  3. Fundamental relational database concepts

Duarion

5 days

Course Outline

  1. psql CLI utility
  2. Command Cente
  1. System preparation and verification
  2. Greenplum Database Initializatio
  1. DDL, DML, DQL
  2. Roles and Privileges
  3. Controlling Access
  4. Managing Resources
  5. GP Workload Manage
  1. Implementing table storage models, compression, and tablespaces
  2. Data loading
  3. Table partitioning
  1. Managing the Greenplum Database
  2. Backups and Restores
  1. Data modeling in Greenplum
  2. Physical design decisions
  1. Pivotal Query Optimizer
  2. DB Tuning
  3. Query profiling
  4. Query tuning and rewriting
  5. Statistics
  6. Indexing strategies
  1. Lab – 9

Reviews