Introduction¶
Presto Workshop - Getting Started with Presto¶
Welcome to our workshop! In this workshop, you’ll learn the basics of Presto, the open-source SQL query engine. You’ll get Presto running locally on your machine, connect data sources, and run some queries. This is a beginner-level workshop for software developers and engineers who are new to Presto. At the end of the workshop, you will understand how to federate queries using Presto.
The goals of this workshop are:
- What is Presto and why you’d use it
- How to write a Presto query
- How to create and deploy a Presto cluster on your machine using Docker
- How to add 2 data sources (MySQL and MongoDB) and query the data from them
- How to create dashboards/visualizations of your data
About this workshop¶
The introductory page of the workshop is broken down into the following sections:
Agenda¶
Prerequisite | Prerequisites for the workshop |
Introduction | Presto Introduction |
Lab 1: Set up Presto | Set up a Presto cluster with 1 coordinator and 3 workers |
Lab 2: Set up Data Sources | Set up 2 data source - MySQL and MongoDB |
Lab 3: Connect to Data Sources | Set up 2 catalogs to connect to MySQL and MongoDB |
Lab 4: Data Visualization | Visualize the data |
Compatibility¶
This workshop has been tested on the following platforms:
- Linux: Ubuntu 22.04
- MacOS
Technology Used¶
- Docker: A container engine to run several applications in self-contained containers.
- Presto: Fast and Reliable SQL Engine for Data Analytics and the Open Lakehouse
- MySQL: A popular open-source relational database management system
- MongoDB: A document-oriented database
- Apache Zeppelin: Web-based notebook for interactive data analytics