Duration: 2 days

Overview

This Data Analytics on AWS training course gives attendees a comprehensive overview of AWS' core offerings to choose which best suits their data analysis needs.

Objectives

  • Understand how to implement data warehouses using AWS Lake Formation service
  • Use S3 through the management console
  • Understand the architecture of the Snowflake data platform
  • Use Snowflake web UI (a.k.a Web Portal, Snowflake Manager, and Snowflake Console)
  • Create databases, tables, and warehouses in the Snowflake Web UI
  • Understand how Amazon QuickSight builds visualizations, perform ad hoc analysis, and business insights
  • Explore the main capabilities of AWS Glue
  • Create a Glue crawler to work over a collection of CSV files using a customized classifier to infer their schemas
  • Create and run an AWS Glue ETL job

Audience

This AWS course best suits Data and business analysts, IT architects, developers, and technical managers.

Prerequisites

Participants must have a general knowledge of a modern programming language.

Lab Setup Guide

Outline for Data Analytics on AWS Training

  • Chapter 1 - The AWS Lake Formation Service
    • First, What is a Data Lake?
    • Data Lakes vs. Traditional Data Warehouses
    • Characteristics of Data Warehouses and Data Lakes
    • Now, What is AWS Lake Formation?
    • What are the Benefits of Using Lake Formation?
    • How Lake Formation Works
    • The Lake Formation Dashboard
    • AWS Lake Formation Pricing
    • Summary
  • Chapter 2 - AWS Simple Storage Service
    • What is AWS Simple Storage Service (S3)
    • AWS S3
    • Storage
    • Regions
    • S3 Regions
    • Getting started with S3
    • Using BitTorrent
    • More on Buckets
    • Bucket Configurable Properties
    • Advanced S3 Bucket Properties
    • The Bucket Creation Dialog in the Management Console
    • Bucket Permissions
    • Bucket-level Operations
    • Authorization of REST Requests
    • Adding Cross-Origin Resource Sharing Configuration
    • Event Notifications
    • The Requester Pays Option
    • The Object Key
    • Object Versioning
    • Example of Object Properties
    • Object Storage Class Levels
    • Object-level Operations
    • Object Lifecycle Configuration
    • Amazon S3 Data Consistency Model
    • Observable Data Consistency Behaviors
    • Eventually Consistent Reads vs Consistent Reads
    • Amazon S3 Security
    • S3 Use Case: Backup and Archiving
    • Another S3 Use Case: Static Web Hosting
    • More on Static Web Hosting
    • S3 Static Website Hosting Dialog in Management Console
    • S3 Use Case: Disaster Recovery
    • AWS S3 Pricing
    • Storage Pricing
    • Request Pricing
    • Data Transfer Pricing
    • Amazon S3 Transfer Acceleration
    • How to Enable Transfer Acceleration
    • Enabling Transfer Acceleration in the Management Console
    • Amazon S3 SLA Definitions
    • Amazon S3 SLA Service Commitment
    • S3 CLI
    • Summary
  • Chapter 3 - Introduction to the Snowflake Cloud Data Platform
    • What is Snowflake?
    • Certifications
    • Snowflake Conceptual Architecture
    • Core Underlying Design Considerations
    • Core Services and Tools
    • Snowflake Editions
    • The Standard Edition
    • The Enterprise Edition
    • The Business Critical Edition
    • Virtual Private Snowflake
    • Billing: The Cost Components
    • Data Storage Segments
    • Parts of Snowflake that Incur Compute-related Costs
    • Snowflake Quickstart
    • Summary
  • Chapter 4 - Snowflake's Web UI
    • Web UI (Web Portal)
    • The Landing Page
    • Snowflake Roles
    • The Roles UI
    • Databases
    • Shares
    • Data Marketplace
    • The Warehouses UI
    • Worksheets
    • History
    • A History Sample
    • Account
    • Operational Transparency: Controlling the Usage of Your Account
    • Create Network Policy Dialog (under Account > Policies)
    • Preview App
    • Summary
  • Chapter 5 - Visualization and Reporting
    • Amazon QuickSight
    • SPICE
    • Data Analyses
    • Visuals
    • Sheets
    • Dashboards
    • Typical Amazon QuickSight Workflow
    • Create a Data Set
    • Create an Analysis
    • Create a Visual Manually
    • Amazon Athena
    • Amazon Athena and AWS Data Catalog
    • Query Data Using Amazon Athena
    • What is Tableau
    • Create a Report Using Tableau
    • Tutorial: Get Started with Tableau Desktop
    • Summary
  • Chapter 6 - Introduction to AWS Glue
    • What is AWS Glue?
    • AWS Glue Components
    • Managing Notebooks
    • Putting it Together: The AWS Glue Environment Architecture
    • AWS Glue Main Activities
    • Additional Glue Services
    • AWS Glue Pricing
    • When To Use AWS Glue?
    • Integration with other AWS Services
    • Summary
  • Chapter 7 - AWS Glue PySpark Extensions
    • AWS Glue and Spark
    • The DynamicFrame Object
    • The DynamicFrame API
    • The GlueContext Object
    • Glue Transforms
    • A Sample Glue PySpark Script
    • Using PySpark
    • AWS Glue PySpark SDK
    • Summary

Lab Exercises

  • Lab 1. Using S3 Through Management Console
  • Lab 2. Signing Up for the Free Trial of Snowflake
  • Lab 3. The Snowflake Web UI
  • Lab 4. Creating and Working with Databases in Snowflake
  • Lab 5. AWS Glue Overview
  • Lab 6. AWS Glue Crawlers and Classifiers
  • Lab 7. Creating an S3 Bucket for AWS Glue ETL Script Output
  • Lab 8. Creating and Working with Glue Scripts
  • Lab 9. Using PySpark API Directly
  • Lab 10. Understanding AWS Glue ETL Jobs
10/02/2023 - 10/03/2023
10:00 AM - 06:00 PM
Eastern Standard Time
Online Virtual Class
USD $1,460.00
Enroll
11/13/2023 - 11/14/2023
10:00 AM - 06:00 PM
Eastern Standard Time
Online Virtual Class
USD $1,460.00
Enroll
12/11/2023 - 12/12/2023
10:00 AM - 06:00 PM
Eastern Standard Time
Online Virtual Class
USD $1,460.00
Enroll