Introduction

Presto Workshop - Getting Started with Presto

Welcome to the workshop! You’ll learn the basics of Presto, the open-source SQL query engine: you’ll get Presto running locally on your machine, connect data sources, and run some queries. This is a beginner-level workshop for software developers and engineers who are new to Presto. By the end, you will understand how to federate queries with Presto, that is, how to query data from multiple sources with a single SQL statement.

In this workshop, you will learn:

  • What is Presto and why you’d use it
  • How to write a Presto query
  • How to create and deploy a Presto cluster on your machine using Docker
  • How to add 2 data sources (MySQL and MongoDB) and query the data from them
  • How to create dashboards/visualizations of your data
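
As a rough preview of what a federated query looks like, the sketch below builds one in Python. Presto addresses tables as catalog.schema.table, so a single statement can join data from two catalogs. The catalog names (`mysql`, `mongodb`), the schema `shop`, the tables `orders` and `reviews`, and the column names are all hypothetical placeholders, not the names used in the labs.

```python
def qualified_name(catalog: str, schema: str, table: str) -> str:
    """Return a fully qualified Presto table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


# Hypothetical catalog, schema, and table names, for illustration only.
orders = qualified_name("mysql", "shop", "orders")
reviews = qualified_name("mongodb", "shop", "reviews")

# One SQL statement that joins a MySQL table with a MongoDB collection.
federated_query = f"""
SELECT o.order_id, r.rating
FROM {orders} AS o
JOIN {reviews} AS r
  ON o.order_id = r.order_id
"""

print(federated_query)
```

The point of the sketch is only the naming scheme: because each source is exposed as a catalog, the join reads like ordinary SQL even though the data lives in two different systems.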

About this workshop

The introductory page of the workshop is broken down into the following sections:

Agenda

Prerequisite | Prerequisites for the workshop
Introduction | Introduction to Presto
Lab 1: Set up Presto | Set up a Presto cluster with 1 coordinator and 3 workers
Lab 2: Set up Data Sources | Set up 2 data sources: MySQL and MongoDB
Lab 3: Connect to Data Sources | Set up 2 catalogs to connect to MySQL and MongoDB
Lab 4: Data Visualization | Visualize the data
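
To give a sense of what "setting up a catalog" involves in Lab 3, here is a minimal sketch that renders the contents of two Presto catalog properties files. The property keys (`connector.name`, `connection-url`, `connection-user`, `connection-password`, `mongodb.seeds`) come from the Presto MySQL and MongoDB connectors; the host names, ports, user, and password are placeholder assumptions and may differ from the values used in the labs.

```python
def render_properties(settings: dict[str, str]) -> str:
    """Render key/value settings in Java .properties style, one pair per line."""
    return "\n".join(f"{key}={value}" for key, value in settings.items()) + "\n"


# Placeholder connection details (assumptions, not the workshop's actual values).
mysql_catalog = {
    "connector.name": "mysql",
    "connection-url": "jdbc:mysql://mysql:3306",
    "connection-user": "root",
    "connection-password": "example-password",
}

mongodb_catalog = {
    "connector.name": "mongodb",
    "mongodb.seeds": "mongodb:27017",
}

mysql_properties = render_properties(mysql_catalog)
mongodb_properties = render_properties(mongodb_catalog)

print(mysql_properties)
print(mongodb_properties)
```

In a Presto deployment, each such file lives in the catalog configuration directory, and the file name (minus the `.properties` extension) becomes the catalog name used in queries.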

Compatibility

This workshop has been tested on the following platforms:

  • Linux: Ubuntu 22.04
  • macOS

Technology Used

  • Docker: A container engine to run several applications in self-contained containers.
  • Presto: A fast and reliable SQL query engine for data analytics and the open lakehouse
  • MySQL: A popular open-source relational database management system
  • MongoDB: A document-oriented database
  • Apache Zeppelin: Web-based notebook for interactive data analytics

Credits