Functions as values, revisited

Functional Programming with OCaml

Functions as values, revisited

Module 6 · Lecture 1

KC Sivaramakrishnan
IIT Madras

Back in Module 3 we said, almost in passing, that functions are first-class values in OCaml: a function is a value the same way 42 is a value, with the same rights to be named, passed around, returned from another function, or dropped into a data structure. We even saw a couple of small consequences of that fact: let plus_one = fun x -> x + 1 looks like a perfectly ordinary let binding; make_adder 5 returns a brand new function value; apply_twice f x takes a function as an argument.

From Module 3, leaned on

Module 3: functions are first-class values (named, passed, returned, stored).
Module 6: we lean on that idea for the entire module.
A higher-order function is one that:
- takes a function as an argument, or
- returns a function as its result
- (most interesting ones do both).
Lets you extract the pattern of a computation from the details; reuse the same pattern across many concrete computations.

This module is where we take that idea and lean on it. The remaining five lectures in Module 6 are, in a sense, a single sustained exercise in higher-order programming: writing functions that take other functions, returning functions from other functions, and combining the two to express computations that would be much more verbose in a first-order language. We are going to meet the famous trio map, filter, and fold; the pipeline operator |>; and finally, in the tutorial, rebuild a slice of the standard library and lift fold to binary trees and rose trees.

A higher-order function is one that does either of two things: it takes a function as an argument, or it returns a function as its result. (Most of the interesting ones do both.) That is the entire definition. Once you have higher-order functions, you can extract the pattern of a computation from the details it operates on, parameterise the pattern by the details, and reuse the same pattern across dozens of concrete computations.

The plan for Module 6

L1 (today): higher-order functions; the anatomy.
L2 (map): apply a function to each element.
L3 (filter): keep elements that satisfy a predicate.
L4 (fold): collapse a collection to a value.
L5 (pipelines): the |> operator.
L6 (tutorial): rebuild parts of List, then fold across binary trees and rose trees.

The canonical paper on this style is John Hughes's Why Functional Programming Matters from 1990. Hughes argues that higher-order functions are not merely a notational convenience; they are the glue that lets a functional programmer combine small parts into large programs. If you read only one optional paper this course, read that one.

A function that takes a function

Let us start with the simplest example. Suppose we want a function that applies its argument twice:

let twice f x = f (f x) let _ = twice (fun n -> n + 3) 10 (* = 16 *) let _ = twice (fun s -> "(" ^ s ^ ")") "x" (* = "((x))" *)

A function that takes a function

let twice f x = f (f x) let _ = twice (fun n -> n + 3) 10 (* = 16 *) let _ = twice (fun s -> "(" ^ s ^ ")") "x" (* = "((x))" *)

Type: ('a -> 'a) -> 'a -> 'a.
Takes a function 'a -> 'a and a value 'a; returns an 'a.
Works for any type 'a.
Both calls above instantiate 'a differently (int, string).

The first call returns 16 (the increment-by-3 function applied to 10 gives 13, then to 13 gives 16). The second returns "((x))" (the parenthesise function wraps once, then wraps again). The same twice works for both because its type is polymorphic: ('a -> 'a) -> 'a -> 'a. As long as the function we pass maps a type to itself, twice will apply it twice to a value of that type and return a value of that same type.

The polymorphism here is doing real work. In Java or C++, you would write two separate twice functions (one for int, one for String), or you would reach for generics. In OCaml, polymorphism is the default and a single definition handles every type. Templates in C++ and generics in Java give you something similar, but the OCaml version is checked at compile time without code duplication, and it reads as plain OCaml rather than as a separately-defined template language.

A function that returns a function

We have already met make_adder in Module 3:

let make_adder n = fun x -> x + n let plus_five = make_adder 5 let plus_ten = make_adder 10 let _ = plus_five 1 (* = 6 *) let _ = plus_ten 1 (* = 11 *)

A function that returns a function

let make_adder n = fun x -> x + n let plus_five = make_adder 5 let plus_ten = make_adder 10 let _ = plus_five 1 (* = 6 *) let _ = plus_ten 1 (* = 11 *)

make_adder 5 produces a new function that adds 5.
The new function holds onto n from the enclosing scope.
This is the closure we saw in Module 3.

make_adder 5 returns the function that adds 5 to its argument. That returned function remembers, in its closure, that n was 5 at the moment of creation. Different calls to make_adder produce different functions, each with its own captured n. This is the mechanism that makes higher-order programming practical: a function returned from another function is not a stripped-down code pointer but a full value carrying whatever local state it needs.

We say a function is higher-order if it takes a function as a parameter, or returns one as a result, or both. twice qualifies on the first count; make_adder qualifies on the second. Most of the rest of this module is about higher-order functions that do both at once.

Why bother? The Abstraction Principle

The slogan is do not say the same thing twice. Higher-order functions are how we cash that slogan out for repeated patterns of computation.

Suppose we have three functions, all of the same shape:

let rec all_doubled = function | [] -> [] | x :: rest -> (x * 2) :: all_doubled rest let rec all_squared = function | [] -> [] | x :: rest -> (x * x) :: all_squared rest let rec all_plus_one = function | [] -> [] | x :: rest -> (x + 1) :: all_plus_one rest

Three functions; the same skeleton ("walk the list, do something to each element, build the result"). The only difference is the "something" applied to each element: x * 2, x * x, x + 1. Every other line of code is identical.

That repetition is begging to be factored out. The trick: pull the per-element computation up into a parameter. We get one function that captures the walk, and pass in three different parameters for the work:

let rec map f = function | [] -> [] | x :: rest -> f x :: map f rest let _ = map (fun x -> x * 2) [1; 2; 3] (* = [2; 4; 6] *) let _ = map (fun x -> x * x) [1; 2; 3] (* = [1; 4; 9] *) let _ = map (fun x -> x + 1) [1; 2; 3] (* = [2; 3; 4] *)

The same shape, three times

Three first-order functions:

Same shape; only the per-element work differs.

The same shape, three times: factor the work out

One function captures the walk.
Three per-element computations are passed in.

map is the protagonist of Lecture 2. We are previewing it here to make the larger point: factoring the walk out of the work gives you one general-purpose function plus a small data-like description of what to do at each step. Three concrete computations collapsed into three one-line definitions. This pattern, identifying repeated structure and parameterising over the difference, is at the heart of every higher-order function we will write.

The same idea drives why higher-order functions matter in practice. It is not just an aesthetic gain: when you later need to optimise the walk (say, make it tail-recursive), every client of the abstracted function benefits at once. When you write all_doubled, all_squared, all_plus_one separately, you have to find and fix three places. Module 6 is essentially one extended case study in this principle.

Functions are values: what that really means

Three things follow from "functions are values":

They have a type. fun x -> x + 1 has type int -> int. You can ask the compiler what type a function has, and it will tell you. We have seen this from the start of the course.

They can be passed around. Anywhere you can put a value (an argument, a list element, a record field, a tuple component), you can put a function. There is no need for a special "function type" marker; the type system already covers it.

They can be created on the fly. fun x -> x + n is an expression: writing it down produces a function value. You do not need a separate "compile" step. You can compute a function inside another function and return the result.

This is what people mean by first-class. In a language where functions are not first-class (C is the classic example; you can pass function pointers, but you cannot create new functions at runtime), each of these three points fails in some way. C function pointers cannot capture local variables; they are not values in the same sense as an int. OCaml functions can capture; they really are values.

Callbacks: the everyday higher-order pattern

Outside of strictly functional programming, the most common form of higher-order function in everyday code is the callback. A callback is a function you give to another piece of code so that it can call you back later. GUIs, network code, and event loops all work this way.

Callbacks: the GUI / event idiom

Higher-order functions are everywhere in GUI / network code:

let on_click handler = (* imagine a real GUI here *) handler "user clicked at (100, 200)" let _ = on_click (fun msg -> "got event: " ^ msg) (* = "got event: user clicked at (100, 200)" *)

on_click calls its argument handler with an event description.
You pass a function that decides what to do with the event.
That function is a callback.
Java needs an interface; JS / OCaml pass a lambda directly.

on_click does not know what should happen when the user clicks; that is the caller's problem. So it asks the caller for a function, calls that function with the relevant event data, and lets the caller decide. The whole arrangement is one function passed as an argument to another, used once and discarded.

In Java prior to Java 8, this idiom required defining an interface (OnClickListener, say), instantiating an anonymous class that implemented the interface, and passing an instance to on_click. Java 8 added lambdas precisely so that this would not be so painful. In JavaScript, callbacks have been the everyday building block since the language was born; one of the foundational books on the language, JavaScript: The Good Parts, devotes a whole chapter to them. In OCaml, no special machinery is needed: a function is a value, you just pass it.

Operators are functions too

A small but useful piece of syntax: any OCaml infix operator can be turned into a regular function by wrapping it in parentheses. The operator + is normally infix, but (+) is a normal two-argument function value.

let _ = (+) 3 4 (* = 7 *) let _ = ( * ) 3 4 (* = 12 *) let _ = (^) "hi " "there" (* = "hi there" *)

Operators are functions too

Wrap an infix operator in parens, get a function:

let _ = (+) 3 4 (* = 7 *) let _ = ( * ) 3 4 (* = 12 *) let _ = (^) "hi " "there" (* = "hi there" *)

This composes well with higher-order functions:

let _ = List.map ((+) 10) [1; 2; 3] (* = [11; 12; 13] *)

(+) 10 is the partial application "add 10".
Pass that function directly to map; no lambda needed.

Notice the small piece of whitespace inside ( * ). Without the spaces, OCaml parses (* as the start of a comment, which is rarely what you want. So the convention for the multiplication operator is to write it with spaces: ( * ).

Combining operators-as-functions with partial application gives a particularly compact higher-order idiom:

let _ = List.map ((+) 10) [1; 2; 3] (* = [11; 12; 13] *)

(+) 10 is the function "add 10": we have applied (+) to one of its two arguments, leaving a function from int to int. We then pass that function directly to List.map. The result, [11; 12; 13], comes out without us having to write a single anonymous function. You will see this pattern repeatedly in idiomatic OCaml.

Function composition

If you have two functions f : 'b -> 'c and g : 'a -> 'b, you can build a new function fun x -> f (g x) that runs g first and then f. This is function composition, and it is so common that it has its own name in mathematics:

(f ∘ g)(x) = f(g(x))

In OCaml, the standard library does not provide a composition operator by default (some projects define one as (>>) or (<<)), but you can write one yourself in a single line:

let compose f g = fun x -> f (g x) let square = fun x -> x * x let inc = fun x -> x + 1 let square_then_inc = compose inc square let _ = square_then_inc 4 (* = 17 *)

The result of square_then_inc 4 is 17: first square (16), then increment (17). We will revisit composition in Lecture 5 alongside the pipeline operator |>.

Function composition

let compose f g = fun x -> f (g x) let square = fun x -> x * x let inc = fun x -> x + 1 let square_then_inc = compose inc square let _ = square_then_inc 4 (* = 17 *)

Signature, one arrow per line:

val compose :
       ('b -> 'c)        (* f *)
    -> ('a -> 'b)        (* g *)
    -> 'a -> 'c          (* result *)

Takes two functions, returns a third.
Higher-order on both ends: in and out.
Not in the stdlib by default; one line to write yourself.
Revisited with |> in Lecture 5.

Read as: "given a function from 'b to 'c, and a function from 'a to 'b, produce a function from 'a to 'c." Composition is a function that takes two functions and returns a function. It is higher-order on both ends.

Closures revisited: capture happens once

We saw closures in Module 3. The one point worth repeating, because it surprises people, is that a closure captures values at the time the function is created, not names that get re-looked-up later.

let n = 10 let f = fun x -> x + n let n = 99 let _ = f 1 (* = 11 *)

What is f 1? It is 11, not 100. When f was defined, the binding for n in scope was n = 10; the closure captured that value. The later let n = 99 shadows the outer n for code written after it, but does not retroactively update what f saw. Closures freeze the environment at the moment of creation.

This is the same point we made in Module 2 (shadowing is not mutation), applied here to function values. Functions returned from make_adder work the same way: the captured n is whatever it was when that particular function was created. Each make_adder call creates an independent closure with its own captured n.

Functions can return functions, which can return functions, which can...

Once we accept that functions are values, there is no reason a function cannot return another function, which itself returns another function, and so on. In fact this is exactly what multi-argument functions in OCaml are: a chain of one-argument functions, each returning the next.

let curry3 f x y z = f (x, y, z) let dist3 (x, y, z) = sqrt (float_of_int (x*x + y*y + z*z)) let _ = curry3 dist3 3 4 12 (* = 13.0 *)

Functions can return functions, which can return functions, which can...

let curry3 f x y z = f (x, y, z) let dist3 (x, y, z) = sqrt (float_of_int (x*x + y*y + z*z)) let _ = curry3 dist3 3 4 12 (* = 13.0 *)

curry3 takes a function expecting a triple.
It turns that into a function of three arguments.
We pass dist3 (which wants a triple) plus three numbers; curry3 bundles.

curry3 dist3 is a value that, given three arguments one at a time, will eventually bundle them into a triple and apply dist3. "Currying" is the term for this technique: turning an n-argument function into a chain of n one-argument functions. We saw it in the currying lecture and will see it again throughout this module.

The point of the example above is not currying itself; it is the demonstration that functions are values you can shape and reshape at will. You can repackage them, partially apply them, pass them to other functions that then partially apply them, and so on. The language puts no special restrictions on what you can do.

A quick check

What is the type of twice, where let twice f x = f (f x)?

('a -> 'b) -> 'a -> 'b
('a -> 'a) -> 'a -> 'a
'a -> 'a -> 'a
('a -> 'b) -> 'b -> 'a

Why: for f (f x) to type-check, the output of the inner f x must be a valid input to the outer f. So f's input and output types must coincide. Calling that type 'a, f is 'a -> 'a, x is 'a, and the result is 'a. So twice : ('a -> 'a) -> 'a -> 'a.

Which of these is not a higher-order function?

let apply f x = f x
let make_adder n = fun x -> x + n
let add x y = x + y
let compose f g = fun x -> f (g x)

Why: a higher-order function either takes or returns a function. apply takes a function. make_adder returns a function (so does every two-argument function in OCaml, technically, by currying, but the explicit fun makes it especially clear). compose does both. add takes two ints and returns an int; no functions involved.

(Pedantic note: add is internally curried, so add 3 is a function int -> int. By that strict reading every multi-argument function is higher-order. The everyday usage of "higher-order" reserves the term for functions that take or return functions by intent. add is not such a function.)

A small code challenge:

Write apply_n : int -> ('a -> 'a) -> 'a -> 'a that applies f to x, n times. So apply_n 3 f x is f (f (f x)), and apply_n 0 f x is just x.

let rec apply_n n f x = failwith "not implemented"

Show reference solution

Reference solution:

let rec apply_n n f x =
  if n = 0 then x
  else apply_n (n - 1) f (f x)

The base case is n = 0: nothing left to do, return x. The recursive case: apply f once, then apply n - 1 more times.

Activity

Write flip : ('a -> 'b -> 'c) -> 'b -> 'a -> 'c that swaps the argument order of a two-argument function. Test with:

let _ = flip (-) 3 10 (* = 7 *) let _ = flip (fun x xs -> x :: xs) [1; 2; 3] 0 (* = [0; 1; 2; 3] *)

Show reference solution

Activity solution

let flip f x y = f y x let _ = flip (-) 3 10 (* = 7 *) let _ = flip (fun x xs -> x :: xs) [1; 2; 3] 0 (* = [0; 1; 2; 3] *)

flip is a tiny combinator: function in, function out.
Higher-order on both ends.
Polymorphic on three type variables ('a, 'b, 'c).
Useful when a stdlib function has its arguments "in the wrong order" for partial application.

flip is a tiny but characteristic higher-order combinator: input is a function, output is a function, and no concrete value of a named type passes through that the user has to think about. The three type variables 'a, 'b, 'c are doing real work: the input function maps 'a -> 'b -> 'c, and the output function maps 'b -> 'a -> 'c. We will meet more combinators in Lecture 5, where |> and @@ play the same shape-changing role.

What's next

We have set the stage. The next four lectures take the abstraction-of-a-walk idea and follow it to the three canonical higher-order list functions: map (transform each element), filter (keep some elements), and fold (combine all elements). Lecture 5 introduces the pipeline operator |>, which lets us chain these three together in left-to-right reading order. Lecture 6 is the tutorial: rebuild parts of List from this small toolkit, then lift fold to binary trees and rose trees.

What's next

Lecture 2: map in detail.

Canonical "do something to each element" operation.
The most common higher-order function in practice.
Prototype for everything in this module.

Reading

Cornell CS3110, Higher-order functions: https://cs3110.github.io/textbook/chapters/hop/higher_order.html
John Hughes, Why Functional Programming Matters (1990): the classic paper arguing that higher-order functions and lazy evaluation are the glue of modular functional programs. https://www.cs.kent.ac.uk/people/staff/dat/miranda/whyfp90.pdf

Sources

This lecture's prose, worked examples, and quizzes are original to this course. Materials referenced during preparation are listed in the Reading section above; Cornell CS3110 and Real World OCaml are CC BY-NC-ND-licensed and have not been derivatively reused. See LICENSES.md at the repository root for the full source posture.