Functions and Records
Chapter Goals
This chapter will introduce two building blocks of PureScript programs: functions and records. In addition, we'll see how to structure PureScript programs, and how to use types as an aid to program development.
We will build a simple address book application to manage a list of contacts. This code will introduce some new ideas from the syntax of PureScript.
The front-end of our application will be the interactive mode PSCi, but it would be possible to build on this code to write a front-end in JavaScript. In fact, we will do exactly that in later chapters, adding form validation and save/restore functionality.
Project Setup
The source code for this chapter is contained in the file src/Data/AddressBook.purs
. This file starts with a module declaration and its import list:
Here, we import several modules:
The
Control.Plus
module, which defines theempty
value.The
Data.List
module, which is provided by thelists
package which can be installed using Spago. It contains a few functions which we will need for working with linked lists.The
Data.Maybe
module, which defines data types and functions for working with optional values.
Notice that the imports for these modules are listed explicitly in parentheses. This is generally a good practice, as it helps to avoid conflicting imports.
Assuming you have cloned the book's source code repository, the project for this chapter can be built using Spago, with the following commands:
Simple Types
PureScript defines three built-in types which correspond to JavaScript's primitive types: numbers, strings and booleans. These are defined in the Prim
module, which is implicitly imported by every module. They are called Number
, String
, and Boolean
, respectively, and you can see them in PSCi by using the :type
command to print the types of some simple values:
PureScript defines some other built-in types: integers, characters, arrays, records, and functions.
Integers are differentiated from floating point values of type Number
by the lack of a decimal point:
Character literals are wrapped in single quotes, unlike string literals which use double quotes:
Arrays correspond to JavaScript arrays, but unlike in JavaScript, all elements of a PureScript array must have the same type:
The error in the last example is an error from the type checker, which unsuccessfully attempted to unify (i.e. make equal) the types of the two elements.
Records correspond to JavaScript's objects, and record literals have the same syntax as JavaScript's object literals:
This type indicates that the specified object has two fields, a name
field which has type String
, and an interests
field, which has type Array String
, i.e. an array of String
s.
Fields of records can be accessed using a dot, followed by the label of the field to access:
PureScript's functions correspond to JavaScript's functions. The PureScript standard libraries provide plenty of examples of functions, and we will see more in this chapter:
Functions can be defined at the top-level of a file by specifying arguments before the equals sign:
Alternatively, functions can be defined inline, by using a backslash character followed by a space-delimited list of argument names. To enter a multi-line declaration in PSCi, we can enter "paste mode" by using the :paste
command. In this mode, declarations are terminated using the Control-D key sequence:
Having defined this function in PSCi, we can apply it to its arguments by separating the two arguments from the function name by whitespace:
Quantified Types
In the previous section, we saw the types of some functions defined in the Prelude. For example, the flip
function had the following type:
The keyword forall
here indicates that flip
has a universally quantified type. It means that we can substitute any types for a
, b
and c
, and flip
will work with those types.
For example, we might choose the type a
to be Int
, b
to be String
and c
to be String
. In that case we could specialize the type of flip
to
We don't have to indicate in code that we want to specialize a quantified type - it happens automatically. For example, we can just use flip
as if it had this type already:
While we can choose any types for a
, b
and c
, we have to be consistent. The type of the function we passed to flip
had to be consistent with the types of the other arguments. That is why we passed the string "Ten"
as the second argument, and the number 10
as the third. It would not work if the arguments were reversed:
Notes On Indentation
PureScript code is indentation-sensitive, just like Haskell, but unlike JavaScript. This means that the whitespace in your code is not meaningless, but rather is used to group regions of code, just like curly braces in C-like languages.
If a declaration spans multiple lines, then any lines except the first must be indented past the indentation level of the first line.
Therefore, the following is valid PureScript code:
But this is not valid code:
In the second case, the PureScript compiler will try to parse two declarations, one for each line.
Generally, any declarations defined in the same block should be indented at the same level. For example, in PSCi, declarations in a let statement must be indented equally. This is valid:
but this is not:
Certain PureScript keywords (such as where
, of
and let
) introduce a new block of code, in which declarations must be further-indented:
Note how the declarations for foo
and bar
are indented past the declaration of example
.
The only exception to this rule is the where
keyword in the initial module
declaration at the top of a source file.
Defining Our Types
A good first step when tackling a new problem in PureScript is to write out type definitions for any values you will be working with. First, let's define a type for records in our address book:
This defines a type synonym called Entry
- the type Entry
is equivalent to the type on the right of the equals symbol: a record type with three fields - firstName
, lastName
and address
. The two name fields will have type String
, and the address
field will have type Address
, defined as follows:
Note that records can contain other records.
Now let's define a third type synonym, for our address book data structure, which will be represented simply as a linked list of entries:
Note that List Entry
is not the same as Array Entry
, which represents an array of entries.
Type Constructors and Kinds
List
is an example of a type constructor. Values do not have the type List
directly, but rather List a
for some type a
. That is, List
takes a type argument a
and constructs a new type List a
.
Note that just like function application, type constructors are applied to other types simply by juxtaposition: the type List Entry
is in fact the type constructor List
applied to the type Entry
- it represents a list of entries.
If we try to incorrectly define a value of type List
(by using the type annotation operator ::
), we will see a new type of error:
This is a kind error. Just like values are distinguished by their types, types are distinguished by their kinds, and just like ill-typed values result in type errors, ill-kinded types result in kind errors.
There is a special kind called Type
which represents the kind of all types which have values, like Number
and String
.
There are also kinds for type constructors. For example, the kind Type -> Type
represents a function from types to types, just like List
. So the error here occurred because values are expected to have types with kind Type
, but List
has kind Type -> Type
.
To find out the kind of a type, use the :kind
command in PSCi. For example:
PureScript's kind system supports other interesting kinds, which we will see later in the book.
Displaying Address Book Entries
Let's write our first function, which will render an address book entry as a string. We start by giving the function a type. This is optional, but good practice, since it acts as a form of documentation. In fact, the PureScript compiler will give a warning if a top-level declaration does not contain a type annotation. A type declaration separates the name of a function from its type with the ::
symbol:
This type signature says that showEntry
is a function, which takes an Entry
as an argument and returns a String
. Here is the code for showEntry
:
This function concatenates the three fields of the Entry
record into a single string, using the showAddress
function to turn the record inside the address
field into a String
. showAddress
is defined similarly:
A function definition begins with the name of the function, followed by a list of argument names. The result of the function is specified after the equals sign. Fields are accessed with a dot, followed by the field name. In PureScript, string concatenation uses the diamond operator (<>
), instead of the plus operator like in JavaScript.
Test Early, Test Often
The PSCi interactive mode allows for rapid prototyping with immediate feedback, so let's use it to verify that our first few functions behave as expected.
First, build the code you've written:
Next, load PSCi, and use the import
command to import your new module:
We can create an entry by using a record literal, which looks just like an anonymous object in JavaScript.
Now, try applying our function to the example:
Let's also test showEntry
by creating an address book entry record containing our example address:
Creating Address Books
Now let's write some utility functions for working with address books. We will need a value which represents an empty address book: an empty list.
We will also need a function for inserting a value into an existing address book. We will call this function insertEntry
. Start by giving its type:
This type signature says that insertEntry
takes an Entry
as its first argument, and an AddressBook
as a second argument, and returns a new AddressBook
.
We don't modify the existing AddressBook
directly. Instead, we return a new AddressBook
which contains the same data. As such, AddressBook
is an example of an immutable data structure. This is an important idea in PureScript - mutation is a side-effect of code, and inhibits our ability to reason effectively about its behavior, so we prefer pure functions and immutable data where possible.
To implement insertEntry
, we can use the Cons
function from Data.List
. To see its type, open PSCi and use the :type
command:
This type signature says that Cons
takes a value of some type a
, and a list of elements of type a
, and returns a new list with entries of the same type. Let's specialize this with a
as our Entry
type:
But List Entry
is the same as AddressBook
, so this is equivalent to
In our case, we already have the appropriate inputs: an Entry
, and a AddressBook
, so can apply Cons
and get a new AddressBook
, which is exactly what we wanted!
Here is our implementation of insertEntry
:
This brings the two arguments entry
and book
into scope, on the left hand side of the equals symbol, and then applies the Cons
function to create the result.
Curried Functions
Functions in PureScript take exactly one argument. While it looks like the insertEntry
function takes two arguments, it is in fact an example of a curried function.
The ->
operator in the type of insertEntry
associates to the right, which means that the compiler parses the type as
That is, insertEntry
is a function which returns a function! It takes a single argument, an Entry
, and returns a new function, which in turn takes a single AddressBook
argument and returns a new AddressBook
.
This means that we can partially apply insertEntry
by specifying only its first argument, for example. In PSCi, we can see the result type:
As expected, the return type was a function. We can apply the resulting function to a second argument:
Note though that the parentheses here are unnecessary - the following is equivalent:
This is because function application associates to the left, and this explains why we can just specify function arguments one after the other, separated by whitespace.
Note that in the rest of the book, I will talk about things like "functions of two arguments". However, it is to be understood that this means a curried function, taking a first argument and returning another function.
Now consider the definition of insertEntry
:
If we explicitly parenthesize the right-hand side, we get (Cons entry) book
. That is, insertEntry entry
is a function whose argument is just passed along to the (Cons entry)
function. But if two functions have the same result for every input, then they are the same function! So we can remove the argument book
from both sides:
But now, by the same argument, we can remove entry
from both sides:
This process is called eta conversion, and can be used (along with some other techniques) to rewrite functions in point-free form, which means functions defined without reference to their arguments.
In the case of insertEntry
, eta conversion has resulted in a very clear definition of our function - "insertEntry
is just cons on lists". However, it is arguable whether point-free form is better in general.
Querying the Address Book
The last function we need to implement for our minimal address book application will look up a person by name and return the correct Entry
. This will be a nice application of building programs by composing small functions - a key idea from functional programming.
We can first filter the address book, keeping only those entries with the correct first and last names. Then we can simply return the head (i.e. first) element of the resulting list.
With this high-level specification of our approach, we can calculate the type of our function. First open PSCi, and find the types of the filter
and head
functions:
Let's pick apart these two types to understand their meaning.
filter
is a curried function of two arguments. Its first argument is a function, which takes a list element and returns a Boolean
value as a result. Its second argument is a list of elements, and the return value is another list.
head
takes a list as its argument, and returns a type we haven't seen before: Maybe a
. Maybe a
represents an optional value of type a
, and provides a type-safe alternative to using null
to indicate a missing value in languages like JavaScript. We will see it again in more detail in later chapters.
The universally quantified types of filter
and head
can be specialized by the PureScript compiler, to the following types:
We know that we will need to pass the first and last names that we want to search for, as arguments to our function.
We also know that we will need a function to pass to filter
. Let's call this function filterEntry
. filterEntry
will have type Entry -> Boolean
. The application filter filterEntry
will then have type AddressBook -> AddressBook
. If we pass the result of this function to the head
function, we get our result of type Maybe Entry
.
Putting these facts together, a reasonable type signature for our function, which we will call findEntry
, is:
This type signature says that findEntry
takes two strings, the first and last names, and a AddressBook
, and returns an optional Entry
. The optional result will contain a value only if the name is found in the address book.
And here is the definition of findEntry
:
Let's go over this code step by step.
findEntry
brings three names into scope: firstName
, and lastName
, both representing strings, and book
, an AddressBook
.
The right hand side of the definition combines the filter
and head
functions: first, the list of entries is filtered, and the head
function is applied to the result.
The predicate function filterEntry
is defined as an auxiliary declaration inside a where
clause. This way, the filterEntry
function is available inside the definition of our function, but not outside it. Also, it can depend on the arguments to the enclosing function, which is essential here because filterEntry
uses the firstName
and lastName
arguments to filter the specified Entry
.
Note that, just like for top-level declarations, it was not necessary to specify a type signature for filterEntry
. However, doing so is recommended as a form of documentation.
Infix Function Application
In the code for findEntry
above, we used a different form of function application: the head
function was applied to the expression filter filterEntry book
by using the infix $
symbol.
This is equivalent to the usual application head (filter filterEntry book)
($)
is just an alias for a regular function called apply
, which is defined in the Prelude. It is defined as follows:
So apply
takes a function and a value and applies the function to the value. The infixr
keyword is used to define ($)
as an alias for apply
.
But why would we want to use $
instead of regular function application? The reason is that $
is a right-associative, low precedence operator. This means that $
allows us to remove sets of parentheses for deeply-nested applications.
For example, the following nested function application, which finds the street in the address of an employee's boss:
becomes (arguably) easier to read when expressed using $
:
Function Composition
Just like we were able to simplify the insertEntry
function by using eta conversion, we can simplify the definition of findEntry
by reasoning about its arguments.
Note that the book
argument is passed to the filter filterEntry
function, and the result of this application is passed to head
. In other words, book
is passed to the composition of the functions filter filterEntry
and head
.
In PureScript, the function composition operators are <<<
and >>>
. The first is "backwards composition", and the second is "forwards composition".
We can rewrite the right-hand side of findEntry
using either operator. Using backwards-composition, the right-hand side would be
In this form, we can apply the eta conversion trick from earlier, to arrive at the final form of findEntry
:
An equally valid right-hand side would be:
Either way, this gives a clear definition of the findEntry
function: "findEntry
is the composition of a filtering function and the head
function".
I will let you make your own decision which definition is easier to understand, but it is often useful to think of functions as building blocks in this way - each function executing a single task, and solutions assembled using function composition.
Exercises
(Easy) Test your understanding of the
findEntry
function by writing down the types of each of its major subexpressions. For example, the type of thehead
function as used is specialized toAddressBook -> Maybe Entry
.(Medium) Write a function which looks up an
Entry
given a street address, by reusing the existing code infindEntry
. Test your function in PSCi.(Medium) Write a function which tests whether a name appears in a
AddressBook
, returning a Boolean value. Hint: Use PSCi to find the type of theData.List.null
function, which test whether a list is empty or not.(Difficult) Write a function
removeDuplicates
which removes duplicate address book entries with the same first and last names. Hint: Use PSCi to find the type of theData.List.nubBy
function, which removes duplicate elements from a list based on an equality predicate.
Conclusion
In this chapter, we covered several new functional programming concepts:
How to use the interactive mode PSCi to experiment with functions and test ideas.
The role of types as both a correctness tool, and an implementation tool.
The use of curried functions to represent functions of multiple arguments.
Creating programs from smaller components by composition.
Structuring code neatly using
where
expressions.How to avoid null values by using the
Maybe
type.Using techniques like eta conversion and function composition to refactor code into a clear specification.
In the following chapters, we'll build on these ideas.
Last updated
Was this helpful?