PostgreSQL Subquery

Summary: in this tutorial, you will learn how to use the PostgreSQL subquery that allows you to construct complex queries.

Introduction to PostgreSQL subquery

Let’s start with a simple example.

Suppose we want to find the films whose rental rate is higher than the average rental rate. We can do it in two steps:

  • Find the average rental rate by using the SELECT statement and average function ( AVG).
  • Use the result of the first query in the second SELECT statement to find the films that we want.

The following query gets the average rental rate:

SELECT AVG (rental_rate) FROM film;
Code language: SQL (Structured Query Language) (sql)
SELECT AVG

The average rental rate is 2.98

Now, we can get films whose rental rate is higher than the average rental rate:

SELECT film_id, title, rental_rate FROM film WHERE rental_rate > 2.98;
Code language: SQL (Structured Query Language) (sql)
SELECT film whose rental rate is higher than average

The code is not so elegant, which requires two steps. We want a way to pass the result of the first query to the second query in one query. The solution is to use a subquery.

A subquery is a query nested inside another query such as SELECT, INSERT, DELETE and UPDATE. In this tutorial, we are focusing on the SELECT statement only.

To construct a subquery, we put the second query in brackets and use it in the WHERE clause as an expression:

SELECT film_id, title, rental_rate FROM film WHERE rental_rate > ( SELECT AVG (rental_rate) FROM film );
Code language: SQL (Structured Query Language) (sql)

The query inside the brackets is called a subquery or an inner query. The query that contains the subquery is known as an outer query.

PostgreSQL executes the query that contains a subquery in the following sequence:

  • First, executes the subquery.
  • Second, gets the result and passes it to the outer query.
  • Third, executes the outer query.

PostgreSQL subquery with IN operator

A subquery can return zero or more rows. To use this subquery, you use the IN operator in the WHERE clause.

For example, to get films that have the returned date between 2005-05-29 and 2005-05-30, you use the following query:

SELECT inventory.film_id FROM rental INNER JOIN inventory ON inventory.inventory_id = rental.inventory_id WHERE return_date BETWEEN '2005-05-29' AND '2005-05-30';
Code language: SQL (Structured Query Language) (sql)
SELECT films between

It returns multiple rows so we can use this query as a subquery in the WHERE clause of a query as follows:

SELECT film_id, title FROM film WHERE film_id IN ( SELECT inventory.film_id FROM rental INNER JOIN inventory ON inventory.inventory_id = rental.inventory_id WHERE return_date BETWEEN '2005-05-29' AND '2005-05-30' );
Code language: SQL (Structured Query Language) (sql)
PostgreSQL Subquery with IN

PostgreSQL subquery with EXISTS operator

The following expression illustrates how to use a subquery with EXISTS operator:

EXISTS subquery
Code language: SQL (Structured Query Language) (sql)

A subquery can be an input of the EXISTS operator. If the subquery returns any row, the EXISTS operator returns true. If the subquery returns no row, the result of EXISTS operator is false.

The EXISTS operator only cares about the number of rows returned from the subquery, not the content of the rows, therefore, the common coding convention of EXISTS operator is as follows:

EXISTS (SELECT 1 FROM tbl WHERE condition);
Code language: SQL (Structured Query Language) (sql)

See the following query:

SELECT first_name, last_name FROM customer WHERE EXISTS ( SELECT 1 FROM payment WHERE payment.customer_id = customer.customer_id );
Code language: SQL (Structured Query Language) (sql)
PostgreSQL Subquery with EXISTS

The query works like an inner join on the customer_id column. However, it returns at most one row for each row in the customer table even though there are some corresponding rows in the payment table.

In this tutorial, you have learned how to use the PostgreSQL subquery to construct complex queries.

Was this tutorial helpful ?