min_by
Finds the row(s) containing the minimum value for a column and returns the value of another column in that row.
For example, if a table contains the columns employee_id
and salary
, MIN_BY(employee_id, salary)
returns the value of the employee_id
column for the row that has the lowest value in the salary
column.
If multiple rows contain the specified minimum value, the function is non-deterministic.
Syntax
min_by( <col_to_return>, <col_containing_minimum> )
Arguments
<col_to_return>
Column containing the value to return.
<col_containing_mininum>
Column containing the minimum value.
Returns
The function returns a value of the same type as col_to_return
.
Example
The following examples demonstrate how to use the MIN_BY function.
To run these examples, execute the following statements to set up the table and data for the examples:
CREATE TABLE employees(employee_id int, department_id int, salary int);
VALUES
(1001, 10, 10000),
(1020, 10, 9000),
(1030, 10, 8000),
(900, 20, 15000),
(2000, 20, NULL),
(2010, 20, 15000),
(2020, 20, 8000),
INSERT INTO employees;
The following example returns the ID of the employee with the lowest salary:
FROM employees aggregate min_by(employee_id, salary);
+-----------------------------+
| min_by(employee_id, salary) |
+-----------------------------+
| 2020 |
+-----------------------------+
Note the following:
- Because more than one row contains the minimum value for the
salary
column, the function is non-deterministic and might return the employee ID for a different row in subsequent executions. - The function ignores the NULL value in the salary column when determining the rows with the minimum values.