Currying and partial application

Functional Programming with OCaml

Currying and partial application

Module 3 · Lecture 3

KC Sivaramakrishnan
IIT Madras

This lecture: currying

A multi-argument function is secretly a chain of one-argument functions.
The shape is called currying (after Haskell Curry).
It enables partial application: supply some arguments, get back a function wanting the rest.
This is the Module 3 idea that most surprises C / Java / Python arrivals.
Explains a lot of OCaml's API style:
- why List.map takes the function first;
- why operators have prefix forms like (+).
Four parts: unfolding, partial application, argument order, the tuple alternative.

In OCaml, a function with multiple arguments is secretly a chain of one-argument functions. This shape is called currying, after the logician Haskell Curry (the same person the Haskell language is named for), and it makes a useful trick available: partial application, supplying some of the arguments and getting back a function that wants the rest.

Curry was generous about the name himself: the technique was published in 1924 by Moses Schönfinkel, six years before Curry's own work, so by the usual rule of priority we should arguably call it schönfinkelisation. Curry built the systematic theory and the catchier name stuck. It is a small lesson in how mathematics, like software, often credits the person who packaged an idea rather than the one who first had it.

This is, in my experience, the idea from Module 3 that most surprises students arriving from C, Java, or Python. In those languages, a function with two arguments is two arguments, full stop; you cannot "call it with one and a half." Curried functions remove that restriction. Once you internalise the picture, a number of common patterns in OCaml code that previously looked mysterious become obvious: why List.map takes the function before the list, why so many standard library functions can be passed around fluently, why operators have prefix forms like (+). They are all corollaries of one design choice.

The lecture has four parts. First, the unfolding: what let f x y = ... really desugars to. Second, partial application in everyday use. Third, the consequences for API design (argument order matters). Fourth, a comparison with the tuple alternative, so you know when not to want currying.

Unfolding a two-argument function

You already know how to define a two-argument function:

let add x y = x + y

The type is int -> int -> int. There are two ways to read that type, and the design of OCaml only makes sense once you have both.

The first reading is the everyday one: "a function that takes two ints and returns an int." That is how you will read it ninety-nine percent of the time, and it is correct.

The second reading takes the type signature literally. We saw in the functions-as-values lecture that -> is right-associative, so int -> int -> int parses as int -> (int -> int). Under that parse, add is a function that takes one int and returns another function of type int -> int. The "returned function" is the one that does the actual addition.

Both readings are correct, and they describe the same function, because the desugaring of let add x y = x + y is literally let add = fun x -> fun y -> x + y. The shorthand on the left and the nested-fun form on the right are the same definition.

Two-argument functions, unfolded

let add x y = x + y

This is short for:

let add = fun x -> fun y -> x + y

add takes one argument x.
Result: a function of one argument y returning x + y.

Type confirms:

val add : int -> int -> int

Right-associative: int -> (int -> int).
Takes an int, returns a function from int to int.

A useful slogan, from Cornell's CS3110 textbook: every OCaml function takes exactly one argument. A "three-argument" function is a one-argument function that returns a "two-argument" function, which is itself a one-argument function that returns a "one-argument" function. The function-type arrow is right-associative to match this view; the application syntax is left-associative for the same reason. The names currying and uncurrying describe moving between this view and the "tuple of arguments" view we will return to at the end of the lecture.

Partial application: the payoff

Once you accept that add is "a one-argument function returning a function," you can apply it to one argument and stop. The result is a function value.

let add x y = x + y let add5 = add 5

add5 has type int -> int. It is the function that adds 5 to its argument. We did not write a fun anywhere; we just stopped applying add halfway through. The technical name for this trick is partial application, and the underlying mechanism is currying.

What does `add 5` do?

let add x y = x + y let add5 = add 5

add5 : int -> int. Adds 5 to whatever you give it.

let _ = add5 3 (* = 8 *)

We partially applied add to one argument.
Result: a function waiting for the rest.

The picture is exactly the closure picture from the functions-as-values lecture. When we apply add to 5, the body fun y -> x + y becomes a value with x captured to 5. That captured value lives inside the closure as long as the closure exists. Calls to add5 reuse that captured 5 forever; each call supplies a new y, computes 5 + y, and returns.

It is sometimes useful to read partial application as specialisation: add5 is a specialised version of add with one knob (x) permanently set to 5. You can specialise the same function in multiple ways, getting multiple specialised functions, all coexisting peacefully:

let add1 = add 1 let add100 = add 100

add1 always adds one; add100 always adds a hundred; they share no state. Each is a separate closure with its own captured x.

Why partial application matters in practice

The motivating use case is passing a function to another function. Higher-order functions like List.map expect a function as their first argument; partial application lets you build that function inline without writing fun x -> ... for every little operation.

Why is this useful?

let xs = [1; 2; 3; 4] let xs_plus_10 = List.map (add 10) xs (* = [11; 12; 13; 14] *)

List.map wants int -> int. add 10 already is that.
Currying eliminates small wrapper lambdas in higher-order code.

Without currying you would write:

let xs_plus_10 = List.map (fun x -> add 10 x) xs

which works but is noisier. The version List.map (add 10) xs reads as "map the add-10 function over xs." That form reads more directly.

You will see this idiom constantly in real OCaml code. Anywhere a higher-order function expects a unary function, partial application of a binary function is the shortest way to produce one. The Jane Street Base library, and the standard library itself, are designed with this in mind: the function argument almost always comes first, precisely so that callers can partial-apply.

Argument order matters

The first argument is the easiest to partial-apply. The second argument requires more work; the third even more. So when you design a function with multiple arguments, the order of the arguments matters for how it will feel to use.

Suppose you wanted a half function from a generic divide:

let divide x y = x / y let half x = divide x 2

You might be tempted to write let half = divide 2, by analogy to add5 = add 5. But divide 2 is not the halving function: it is "the function that divides 2 by its argument." divide 2 4 is 2 / 4 = 0, not 4 / 2 = 2. The first argument of divide is the numerator, which is not the part we want to fix.

Argument order matters for partial application

let divide x y = x / y let half x = divide x 2 (* not "divide 2 x" *)

divide 2: takes y, returns 2 / y. Not "half".
The first argument is the easiest to fix.

Stdlib places arguments accordingly.

List.map takes the function first, list second:

val List.map : ('a -> 'b) -> 'a list -> 'b list

So List.map (add 10) partial-applies meaningfully.

The lesson generalises. When you write a function f a b c, ask: which of these arguments is the most likely to be held fixed while the others vary? That argument should go first. The most "configuration-like" parameter goes leftmost; the most "data-like" parameter goes rightmost. List.map follows this exactly: the function (configuration, often a constant) comes first; the list (data, varying) comes last. So does List.filter, List.fold_left, List.iter, and essentially every higher-order combinator in the standard library.

The dual mistake is also worth flagging: do not panic about argument order on functions you will only ever fully apply. divide 10 2 is fine; the order only matters when you want to partial-apply.

Operators have prefix forms too

Every infix operator in OCaml has a prefix form, obtained by surrounding it in parentheses. (+) is the function corresponding to +; (*) to * (with a quirk we will see); (<) to <.

This means you can partial-apply operators too:

let increment = (+) 1 let double = ( * ) 2 let _ = increment 5 (* = 6 *) let _ = double 5 (* = 10 *)

Operators have prefix forms too

let increment = (+) 1 let double = ( * ) 2 let _ = increment 5 (* = 6 *) let _ = double 5 (* = 10 *)

(+) is prefix form of +: fun x y -> x + y.
Partial-apply to 1 to get increment.
Space inside ( * ) avoids being parsed as comment (*.

Most infix operators have this prefix form: (+), (*), (<), (^), (&&). Useful when you want to pass the operator as a function, for instance to List.fold_left (+) 0 xs to sum a list.

The one syntactic curiosity is the multiplication operator. (*) on its own would be parsed as the start of a block comment (*, so OCaml requires a space: ( * ). The same trick applies to any operator that would otherwise collide with comment syntax.

Currying composes: more than two arguments

The pattern scales to functions with any number of arguments. A three-argument function is just a chain of three nested one-argument functions; you can stop at any point and get back a partially-applied function.

let between lo hi x = x >= lo && x <= hi let in_human_range = between 0 150 let in_celsius_room = between 15.0 30.0 let _ = in_human_range 42 (* = true *) let _ = in_celsius_room 22.5 (* = true *)

Multi-argument functions, the same pattern

let between lo hi x = x >= lo && x <= hi let in_human_range = between 0 150 (* age in years *) let in_celsius_room = between 15.0 30.0 let _ = in_human_range 42 (* = true *) let _ = in_celsius_room 22.5 (* = true *)

between takes three arguments.
Partial-apply two; get a one-argument predicate.
Same idea as add 5, one more layer of nesting.

between has type 'a -> 'a -> 'a -> bool. Each partial application fixes one argument and returns a function of the rest. between 0 150 has type int -> bool; between 15.0 30.0 has type float -> bool. We have built two purpose-specific predicates out of one general one. Each predicate is a closure carrying its captured bounds.

This is a clean illustration of the trade-off between generality and specificity in API design. The general between is useful because it works on any ordered type. The specialised predicates are useful because they take fewer arguments at the call site and read more clearly. Currying lets you have both: one general definition, many specialised closures.

Eta-reduction: a small simplification

When you find yourself writing fun x -> g x for some function g, you can usually replace it with just g. The two are the same function: both take an x and call g on it.

This simplification is called eta-reduction (after the Greek letter "eta" used to label this rule in the lambda calculus). Lambda calculus is a minimal formal system of functions, introduced by Alonzo Church in the 1930s, with only three constructs: a variable (x), a function (fun x -> body, called an abstraction), and a function call (f x, called an application). Despite this minimal vocabulary it is Turing-complete, and it is the mathematical foundation of every functional language including OCaml. The fun keyword we have been writing is direct syntax for lambda-calculus abstraction; function application is just juxtaposition. Eta-reduction is one of three small rewriting rules that this calculus codifies (the others are alpha-conversion, renaming a bound variable, and beta-reduction, substituting an argument into a body). Beyond the name, the rule is just: wrapping a function in a lambda that does nothing but call it is busywork.

Eta-reduction: the rule

let f x = g x

is the same function as

let f = g

Wrapping g in fun x -> g x is busywork.
Named after the Greek letter η in lambda calculus.
Lambda calculus = minimal formal system of functions (Church, 1930s); fun x -> body is its native syntax.

Eta-reduction: an example

let incr x = x + 1 let f x = incr x let g = incr

f and g are the same function.
Both have type int -> int.
The right-hand form reads "g is incr."

Eta-reduction with `List.map`

Build a reusable transformer with List.map:

let add_ten xs = List.map ((+) 10) xs

eta-reduces to:

let add_ten = List.map ((+) 10) let _ = add_ten [1; 2; 3; 4] (* = [11; 12; 13; 14] *)

Reads cleanly: "add_ten is map with +10."
No surplus fun xs -> ....

The shorter form is generally preferable when it does not hurt readability. It also clarifies what the function is: not "a function that applies g to its input," but just g itself. The caveat is evaluation timing. let f = g evaluates g once, at the point of let; let f x = g x evaluates g at every call. For pure functions, this is invisible. For functions that have side effects when constructed (rare, but possible with lazy values or closures over mutable state), the two are not the same. In everyday code, eta-reduce freely.

When currying isn't what you want: tuples

Sometimes "two arguments" is the wrong picture. Sometimes you have one argument that happens to be a pair. The classic example is a 2D point: it makes no sense to specialise distance by fixing the x-coordinate while leaving the y-coordinate variable. The two coordinates are conceptually one thing.

For that case, OCaml has tuples. Take one argument that is a pair; pattern-match it inside the function:

let add_pair (x, y) = x + y let _ = add_pair (3, 4) (* = 7 *)

When currying isn't what you want

Sometimes you want "a pair of ints", not "an int then another int".
Use a tuple:

let add_pair (x, y) = x + y let _ = add_pair (3, 4) (* = 7 *)

add_pair : int * int -> int. One argument: a pair.
Can't partial-apply the tuple version.
Prefer curried; use tuple when values are conceptually one thing.

The type int * int -> int reads "takes a pair of ints and returns an int." There is only one argument. You cannot partial-apply add_pair: you have to pass the whole pair.

The default in OCaml is curried. Tuples are reserved for cases where the bundling is meaningful: a 2D point is (x, y), a record's old-and-new value is (before, after), a function returning multiple results uses a tuple. If you ever catch yourself writing a function of (int * int * int * int) for a four-parameter computation where the four are independent quantities, that is a smell: it should probably be curried, or, better, a record with named fields (Module 4).

The standard library has fst and snd for projecting the components of a pair, and (fun (a, b) -> ...) patterns to destructure them. For larger tuples there is no built-in projection; you destructure with patterns. We will see all of this in detail when we get to product types in Module 4.

A quick check

Given let between lo hi x = x >= lo && x <= hi, what is the type of between 0 150?

bool
int -> int -> bool
int -> bool
'a -> bool

Why: between has type 'a -> 'a -> 'a -> bool. The integer literals 0 and 150 pin 'a to int, and supplying the first two arguments peels two arrows off the chain. What remains is the one-argument predicate int -> bool, waiting for x. (It is not 'a -> bool: once the bounds are int, the third argument must be int too.)

Which of these is not an idiomatic use of partial application?

List.map ((+) 1) [1; 2; 3]
let add5 = (+) 5
let half = ( / ) 2
List.filter ((<) 0) xs

Why: ( / ) 2 is "the function that divides 2 by its argument," not "the halving function." The intended half would need fun x -> x / 2 or a custom divide whose argument order puts the divisor first. The other three are standard idioms: (+) 1 increments, (+) 5 adds five, and (<) 0 partial-applies the left-hand side of < to 0, giving the predicate "is greater than zero."

Define compose3 that takes three functions h, g, f and returns the composed function fun x -> h (g (f x)).

let compose3 h g f = failwith "not implemented"

Show reference solution

The body is fun x -> h (g (f x)), or equivalently let compose3 h g f x = h (g (f x)). Notice how the three functions chain right-to-left: f runs first, then g, then h. This is the standard mathematical convention for composition.

Activity

Given:

let add x y = x + y

Predict:

The type of add 5.
What add 5 evaluates to.
What (add 5) 3 evaluates to.

Before reading on, predict the three answers.

Activity discussion

let add x y = x + y

Type of add 5: int -> int. It's the function fun y -> 5 + y.
add 5 evaluates to a function value. Toplevel: int -> int = <fun>.
(add 5) 3 evaluates to 8. First compute add 5, then apply to 3.
- add 5 3 (no parens) gives the same 8.
Function application is left-associative: ((add) 5) 3.
- The parens version just makes partial application explicit.

add 5 has type int -> int. It is a closure: a function value that has captured x = 5 and is waiting for y. The toplevel reports int -> int = <fun>, where <fun> is the toplevel's placeholder for "a function value, can't print." There is no way to inspect the body of a closure from OCaml; you can only call it.

(add 5) 3 evaluates to 8. We first compute add 5 (a closure), then apply it to 3. Inside the closure, x is 5 and y is now 3, so x + y is 8.

add 5 3 without parentheses gives the same 8. Function application is left-associative: add 5 3 parses as (add 5) 3, exactly the explicit form. The parentheses make partial application visible but they do not change what is computed.

A harder check: n-fold application

We wrote apply_twice f x = f (f x) in the functions-as-values lecture. This check generalizes it, and needs recursion as well as currying.

Write apply_n : int -> ('a -> 'a) -> 'a -> 'a so that apply_n n f x computes f (f (... (f x))) with n applications of f. For n <= 0, return x unchanged. Note the payoff of the argument order: apply_n 5 ((+) 1) is itself a function, "add 5 the slow way."

let rec apply_n n f x = failwith "not implemented"

Show reference solution

let rec apply_n n f x = if n <= 0 then x else apply_n (n - 1) f (f x)

Each step peels one application off: apply f once, recurse with n - 1. The accumulator is x itself, so the definition is tail-recursive. apply_n 2 f x computes exactly apply_twice f x.

What's next

Lecture 4: tail recursion.

Write recursive functions without stack-overflow risk on large inputs.
Rewrite factorial and sum to be tail recursive.

We have unpacked currying and partial application. The next lecture, M03-L04, returns to the recursion from M03-L02 with a sharper tool: tail recursion, the technique that lets recursive functions run in constant stack space. The naive factorial and sum_up_to from the recursion lecture overflow the stack on large inputs; the tail-recursive versions do not. The mechanism is small (an extra parameter), but the payoff is that you can recurse on inputs of any size without worrying about the stack.

Reading

Cornell CS3110, Multiple-argument functions: https://cs3110.github.io/textbook/chapters/basics/functions.html
Real World OCaml, Variables and functions (multi-argument and currying): https://dev.realworldocaml.org/variables-and-functions.html

Sources

This lecture's prose, worked examples, and quizzes are original to this course. Materials referenced during preparation are listed in the Reading section above; Cornell CS3110 and Real World OCaml are CC BY-NC-ND-licensed and have not been derivatively reused. See LICENSES.md at the repository root for the full source posture.

Currying and partial application