SQL at Any Scale, on Any Storage, in Any Environment
Matt Fuller, Manfred Moser, and Martin Traverso

#Trino
#Definitive_Guide
#software_engineers
#SQL
#database
Trino: The Definitive Guide is the first and foremost book about the Trino distributed query engine. The book is aimed at beginners and existing users of Trino alike. Ideally, you have some understanding of databases and SQL, but if not, you can divert from reading and look things up while working your way through this book. No matter your level of expertise, we are sure that you’ll learn something new from this book.
This second edition modernizes the content to keep up with the rapid innovation of Trino. We cover new aspects such as the Helm chart to deploy a Trino cluster to Kubernetes, the new Iceberg and Delta Lake connectors for modern lakehouse architectures, fault-tolerant execution for query processing, expanded SQL language features, and the latest Trino release, now running on Java 17.
The first part of the book introduces you to Trino and then helps you get up and running quickly so you can start learning how to use it. This includes installation and first use of the command-line interface as well as client- and web-based applications, such as SQL database management or dashboard and reporting tools, using the JDBC driver.
The second part of the book advances your knowledge and includes details about the Trino architecture, cluster deployment, many connectors to data sources, and a lot of information about the main power of Trino—querying any data source with SQL.
The third part of the book rounds out the content with further aspects you need to know when running and using a production Trino deployment. This includes Web UI usage, security configuration, and some discussion of real-world uses of Trino in other organizations.
Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. In the second edition of this practical guide, you'll learn how to conduct analytics on data where it lives, whether it's a data lake using Hive, a modern lakehouse with Iceberg or Delta Lake, a different system like Cassandra, Kafka, or SingleStore, or a relational database like PostgreSQL or Oracle.
Analysts, software engineers, and production engineers learn how to manage, use, and even develop with Trino and make it a critical part of their data platform. Authors Matt Fuller, Manfred Moser, and Martin Traverso show you how a single Trino query can combine data from multiple sources to allow for analytics across your entire organization.
Part I. Getting Started with Trino
Chapter 1. Introducing Trino
Chapter 2. Installing and Configuring Trino
Chapter 3. Using Trino
Part II. Diving Deeper into Trino
Chapter 4. Trino Architecture
Chapter 5. Production-Ready Deployment
Chapter 6. Connectors
Chapter 7. Advanced Connector Examples
Chapter 8. Using SQL in Trino
Chapter 9. Advanced SQL
Part III. Trino in Real-World Uses
Chapter 10. Security
Chapter 11. Integrating Trino with Other Tools
Chapter 12. Trino in Production
Chapter 13. Real-World Examples
Matt Fuller is a co-founder at Starburst Data, the Analytics Anywhere Company. Prior to founding Starburst, Matt was a director of engineering at Teradata, where he worked to build the new Center for Hadoop division within the company. As a major part of this, Matt worked to bring Trino to the enterprise market. Matt has managed a team contributing to the open source Trino project since 2015 and led the internal Trino product roadmap. Starburst was later formed from this team at Teradata.
Before Teradata, Matt was an early engineer at Vertica, where he co-built the query optimizer. Matt is also a published Very Large Data Bases (VLDB) author and holds US patents in the database management systems space.
Manfred Moser is a community advocate, writer, trainer and software engineer at Starburst. Manfred has a long history of developing and advocating open source software. He is an Apache Maven committer, wrote the Hudson book and others, and continues to be active in the open source community and his projects. He is a seasoned trainer and conference presenter for CI/CD, Cloud Native, Agile and other software development tools and processes, having trained well over 20,000 developers for companies including Walmart Labs, Sonatype, and Telus.
His database background includes designing databases and related applications in the RDBMS space and working as a business intelligence consultant, wrangling thousands of lines of SQL by hand. He is glad he can use Trino now, and is spreading the word about how great Trino is.
Martin Traverso is a co-founder of the Trino Software Foundation and CTO at Starburst Data. Prior to Starburst, Martin worked as a software engineer at Facebook, where he saw the need for fast interactive SQL analytics. Martin and three other engineers worked to create what became Trino. Martin led the Trino development team; Trino was rolled out into production in the spring of 2013 and made open source in the fall of 2013. Since then, Trino has gained wide adoption both inside and outside Facebook. Prior to Facebook, Martin was an architect at Proofpoint and Ning, where he led development and architecture design of numerous complex enterprise and social network applications.









