2012-10-07 07:47:26 +02:00
|
|
|
|
% The Rust Language Tutorial
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
# Introduction
|
|
|
|
|
|
2012-09-23 02:57:30 +02:00
|
|
|
|
Rust is a programming language with a focus on type safety, memory
|
|
|
|
|
safety, concurrency and performance. It is intended for writing
|
2012-10-11 04:05:13 +02:00
|
|
|
|
large-scale, high-performance software that is free from several
|
2012-09-23 02:57:30 +02:00
|
|
|
|
classes of common errors. Rust has a sophisticated memory model that
|
|
|
|
|
encourages efficient data structures and safe concurrency patterns,
|
|
|
|
|
forbidding invalid memory accesses that would otherwise cause
|
|
|
|
|
segmentation faults. It is statically typed and compiled ahead of
|
|
|
|
|
time.
|
2012-07-06 21:14:32 +02:00
|
|
|
|
|
2012-07-23 04:12:51 +02:00
|
|
|
|
As a multi-paradigm language, Rust supports writing code in
|
2012-09-23 02:57:30 +02:00
|
|
|
|
procedural, functional and object-oriented styles. Some of its
|
|
|
|
|
pleasant high-level features include:
|
2012-07-06 21:14:32 +02:00
|
|
|
|
|
2012-10-07 05:23:38 +02:00
|
|
|
|
* **Type inference.** Type annotations on local variable declarations
|
|
|
|
|
are optional.
|
|
|
|
|
* **Safe task-based concurrency.** Rust's lightweight tasks do not share
|
2012-10-11 04:05:13 +02:00
|
|
|
|
memory, instead communicating through messages.
|
2012-10-07 05:23:38 +02:00
|
|
|
|
* **Higher-order functions.** Efficient and flexible closures provide
|
|
|
|
|
iteration and other control structures
|
|
|
|
|
* **Pattern matching and algebraic data types.** Pattern matching on
|
2012-10-11 04:05:13 +02:00
|
|
|
|
Rust's enumeration types (a more powerful version of C's enums,
|
|
|
|
|
similar to algebraic data types in functional languages) is a
|
|
|
|
|
compact and expressive way to encode program logic.
|
|
|
|
|
* **Polymorphism.** Rust has type-parametric functions and
|
2012-10-07 05:23:38 +02:00
|
|
|
|
types, type classes and OO-style interfaces.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 02:57:30 +02:00
|
|
|
|
## Scope
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 02:57:30 +02:00
|
|
|
|
This is an introductory tutorial for the Rust programming language. It
|
|
|
|
|
covers the fundamentals of the language, including the syntax, the
|
2012-10-07 07:23:16 +02:00
|
|
|
|
type system and memory model, generics, and modules. [Additional
|
2012-09-23 02:57:30 +02:00
|
|
|
|
tutorials](#what-next) cover specific language features in greater
|
|
|
|
|
depth.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-12-31 22:46:52 +01:00
|
|
|
|
This tutorial assumes that the reader is already familiar with one or
|
2012-12-20 22:16:27 +01:00
|
|
|
|
more languages in the C family. Understanding of pointers and general
|
|
|
|
|
memory management techniques will help.
|
2012-07-04 02:48:08 +02:00
|
|
|
|
|
2012-01-19 12:51:20 +01:00
|
|
|
|
## Conventions
|
|
|
|
|
|
2012-10-11 04:05:13 +02:00
|
|
|
|
Throughout the tutorial, language keywords and identifiers defined in
|
|
|
|
|
example code are displayed in `code font`.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-01-20 06:22:05 +01:00
|
|
|
|
Code snippets are indented, and also shown in a monospaced font. Not
|
2012-01-19 12:51:20 +01:00
|
|
|
|
all snippets constitute whole programs. For brevity, we'll often show
|
|
|
|
|
fragments of programs that don't compile on their own. To try them
|
|
|
|
|
out, you might have to wrap them in `fn main() { ... }`, and make sure
|
2012-10-11 04:05:13 +02:00
|
|
|
|
they don't contain references to names that aren't actually defined.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-11 04:05:13 +02:00
|
|
|
|
> ***Warning:*** Rust is a language under ongoing development. Notes
|
2012-07-08 08:50:30 +02:00
|
|
|
|
> about potential changes to the language, implementation
|
|
|
|
|
> deficiencies, and other caveats appear offset in blockquotes.
|
|
|
|
|
|
2012-01-19 12:51:20 +01:00
|
|
|
|
# Getting started
|
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
The Rust compiler currently must be built from a [tarball], unless you
|
|
|
|
|
are on Windows, in which case using the [installer][win-exe] is
|
|
|
|
|
recommended.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
Since the Rust compiler is written in Rust, it must be built by
|
|
|
|
|
a precompiled "snapshot" version of itself (made in an earlier state
|
|
|
|
|
of development). As such, source builds require a connection to
|
|
|
|
|
the Internet, to fetch snapshots, and an OS that can execute the
|
|
|
|
|
available snapshot binaries.
|
2012-01-19 23:48:38 +01:00
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
Snapshot binaries are currently built and tested on several platforms:
|
2012-01-19 23:48:38 +01:00
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
* Windows (7, Server 2008 R2), x86 only
|
|
|
|
|
* Linux (various distributions), x86 and x86-64
|
2012-10-11 02:56:23 +02:00
|
|
|
|
* OSX 10.6 ("Snow Leopard") or greater, x86 and x86-64
|
2012-01-19 23:48:38 +01:00
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
You may find that other platforms work, but these are our "tier 1"
|
|
|
|
|
supported build environments that are most likely to work.
|
|
|
|
|
|
|
|
|
|
> ***Note:*** Windows users should read the detailed
|
2012-10-11 04:05:13 +02:00
|
|
|
|
> "[getting started][wiki-start]" notes on the wiki. Even when using
|
|
|
|
|
> the binary installer, the Windows build requires a MinGW installation,
|
|
|
|
|
> the precise details of which are not discussed here. Finally, `rustc` may
|
2012-12-30 22:09:34 +01:00
|
|
|
|
> need to be [referred to as `rustc.exe`][bug-3319]. It's a bummer, we
|
2012-10-11 04:05:13 +02:00
|
|
|
|
> know.
|
|
|
|
|
|
|
|
|
|
[bug-3319]: https://github.com/mozilla/rust/issues/3319
|
|
|
|
|
[wiki-start]: https://github.com/mozilla/rust/wiki/Note-getting-started-developing-Rust
|
2012-01-19 23:48:38 +01:00
|
|
|
|
|
|
|
|
|
To build from source you will also need the following prerequisite
|
|
|
|
|
packages:
|
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
* g++ 4.4 or clang++ 3.x
|
|
|
|
|
* python 2.6 or later (but not 3.x)
|
|
|
|
|
* perl 5.0 or later
|
|
|
|
|
* gnu make 3.81 or later
|
|
|
|
|
* curl
|
2012-01-19 23:48:38 +01:00
|
|
|
|
|
2012-10-11 04:05:13 +02:00
|
|
|
|
If you've fulfilled those prerequisites, something along these lines
|
|
|
|
|
should work.
|
2012-01-19 23:48:38 +01:00
|
|
|
|
|
2012-03-21 00:01:32 +01:00
|
|
|
|
~~~~ {.notrust}
|
2013-04-02 01:06:23 +02:00
|
|
|
|
$ curl -O http://static.rust-lang.org/dist/rust-0.6.tar.gz
|
|
|
|
|
$ tar -xzf rust-0.6.tar.gz
|
|
|
|
|
$ cd rust-0.6
|
2012-01-19 23:48:38 +01:00
|
|
|
|
$ ./configure
|
|
|
|
|
$ make && make install
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-01-25 22:37:14 +01:00
|
|
|
|
You may need to use `sudo make install` if you do not normally have
|
2012-07-04 04:08:13 +02:00
|
|
|
|
permission to modify the destination directory. The install locations
|
|
|
|
|
can be adjusted by passing a `--prefix` argument to
|
2012-10-11 04:05:13 +02:00
|
|
|
|
`configure`. Various other options are also supported: pass `--help`
|
2012-07-04 04:08:13 +02:00
|
|
|
|
for more information on them.
|
2012-01-23 23:08:26 +01:00
|
|
|
|
|
2012-09-27 04:34:48 +02:00
|
|
|
|
When complete, `make install` will place several programs into
|
|
|
|
|
`/usr/local/bin`: `rustc`, the Rust compiler; `rustdoc`, the
|
2013-02-19 18:10:31 +01:00
|
|
|
|
API-documentation tool; `rustpkg`, the Rust package manager;
|
2013-02-19 20:57:23 +01:00
|
|
|
|
`rusti`, the Rust REPL; and `rust`, a tool which acts both as a unified
|
|
|
|
|
interface for them, and for a few common command line scenarios.
|
2012-01-19 23:48:38 +01:00
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
[wiki-start]: https://github.com/mozilla/rust/wiki/Note-getting-started-developing-Rust
|
2013-04-02 01:06:23 +02:00
|
|
|
|
[tarball]: http://static.rust-lang.org/dist/rust-0.6.tar.gz
|
|
|
|
|
[win-exe]: http://static.rust-lang.org/dist/rust-0.6-install.exe
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
## Compiling your first program
|
|
|
|
|
|
|
|
|
|
Rust program files are, by convention, given the extension `.rs`. Say
|
|
|
|
|
we have a file `hello.rs` containing this program:
|
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-01 01:57:37 +02:00
|
|
|
|
fn main() {
|
2012-12-20 22:55:52 +01:00
|
|
|
|
io::println("hello?");
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
If the Rust compiler was installed successfully, running `rustc
|
2012-09-23 03:59:34 +02:00
|
|
|
|
hello.rs` will produce an executable called `hello` (or `hello.exe` on
|
2012-10-07 07:57:40 +02:00
|
|
|
|
Windows) which, upon running, will likely do exactly what you expect.
|
2012-09-23 03:59:34 +02:00
|
|
|
|
|
2012-10-11 04:05:13 +02:00
|
|
|
|
The Rust compiler tries to provide useful information when it encounters an
|
|
|
|
|
error. If you introduce an error into the program (for example, by changing
|
|
|
|
|
`io::println` to some nonexistent function), and then compile it, you'll see
|
|
|
|
|
an error message like this:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-03-21 00:01:32 +01:00
|
|
|
|
~~~~ {.notrust}
|
2012-12-20 22:55:52 +01:00
|
|
|
|
hello.rs:2:4: 2:16 error: unresolved name: io::print_with_unicorns
|
|
|
|
|
hello.rs:2 io::print_with_unicorns("hello?");
|
2012-10-04 02:52:04 +02:00
|
|
|
|
^~~~~~~~~~~~~~~~~~~~~~~
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
In its simplest form, a Rust program is a `.rs` file with some types
|
|
|
|
|
and functions defined in it. If it has a `main` function, it can be
|
|
|
|
|
compiled to an executable. Rust does not allow code that's not a
|
2012-10-11 04:05:13 +02:00
|
|
|
|
declaration to appear at the top level of the file: all statements must
|
2012-09-23 03:59:34 +02:00
|
|
|
|
live inside a function. Rust programs can also be compiled as
|
2012-10-06 04:51:36 +02:00
|
|
|
|
libraries, and included in other programs.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2013-02-19 20:57:23 +01:00
|
|
|
|
## Using the rust tool
|
|
|
|
|
|
|
|
|
|
While using `rustc` directly to generate your executables, and then
|
|
|
|
|
running them manually is a perfectly valid way to test your code,
|
|
|
|
|
for smaller projects, prototypes, or if you're a beginner, it might be
|
|
|
|
|
more convenient to use the `rust` tool.
|
|
|
|
|
|
2013-02-19 22:19:19 +01:00
|
|
|
|
The `rust` tool provides central access to the other rust tools,
|
|
|
|
|
as well as handy shortcuts for directly running source files.
|
|
|
|
|
For example, if you have a file `foo.rs` in your current directory,
|
|
|
|
|
`rust run foo.rs` would attempt to compile it and, if successful,
|
|
|
|
|
directly run the resulting binary.
|
|
|
|
|
|
|
|
|
|
To get a list of all available commands, simply call `rust` without any
|
|
|
|
|
argument.
|
2013-02-19 20:57:23 +01:00
|
|
|
|
|
2012-01-19 12:51:20 +01:00
|
|
|
|
## Editing Rust code
|
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
There are vim highlighting and indentation scripts in the Rust source
|
|
|
|
|
distribution under `src/etc/vim/`. There is an emacs mode under
|
|
|
|
|
`src/etc/emacs/` called `rust-mode`, but do read the instructions
|
|
|
|
|
included in that directory. In particular, if you are running emacs
|
|
|
|
|
24, then using emacs's internal package manager to install `rust-mode`
|
|
|
|
|
is the easiest way to keep it up to date. There is also a package for
|
|
|
|
|
Sublime Text 2, available both [standalone][sublime] and through
|
2012-10-04 02:52:04 +02:00
|
|
|
|
[Sublime Package Control][sublime-pkg], and support for Kate
|
|
|
|
|
under `src/etc/kate`.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-04 02:52:04 +02:00
|
|
|
|
There is ctags support via `src/etc/ctags.rust`, but many other
|
2012-10-11 04:05:13 +02:00
|
|
|
|
tools and editors are not yet supported. If you end up writing a Rust
|
2012-01-19 12:51:20 +01:00
|
|
|
|
mode for your favorite editor, let us know so that we can link to it.
|
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
[sublime]: http://github.com/dbp/sublime-rust
|
|
|
|
|
[sublime-pkg]: http://wbond.net/sublime_packages/package_control
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-07 07:40:12 +02:00
|
|
|
|
# Syntax basics
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
Assuming you've programmed in any C-family language (C++, Java,
|
2012-09-23 03:59:34 +02:00
|
|
|
|
JavaScript, C#, or PHP), Rust will feel familiar. Code is arranged
|
|
|
|
|
in blocks delineated by curly braces; there are control structures
|
2012-10-04 02:52:04 +02:00
|
|
|
|
for branching and looping, like the familiar `if` and `while`; function
|
2012-09-23 03:59:34 +02:00
|
|
|
|
calls are written `myfunc(arg1, arg2)`; operators are written the same
|
2012-12-20 22:42:13 +01:00
|
|
|
|
and mostly have the same precedence as in C; comments are again like C;
|
2012-12-30 22:09:34 +01:00
|
|
|
|
module names are separated with double-colon (`::`) as with C++.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 03:59:34 +02:00
|
|
|
|
The main surface difference to be aware of is that the condition at
|
2012-12-30 22:09:34 +01:00
|
|
|
|
the head of control structures like `if` and `while` does not require
|
2012-10-11 04:32:11 +02:00
|
|
|
|
parentheses, while their bodies *must* be wrapped in
|
|
|
|
|
braces. Single-statement, unbraced bodies are not allowed.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-12-21 01:32:28 +01:00
|
|
|
|
# mod universe { pub fn recalibrate() -> bool { true } }
|
2012-01-19 12:51:20 +01:00
|
|
|
|
fn main() {
|
2012-09-23 08:11:24 +02:00
|
|
|
|
/* A simple loop */
|
|
|
|
|
loop {
|
|
|
|
|
// A tricky calculation
|
2012-12-20 22:42:13 +01:00
|
|
|
|
if universe::recalibrate() {
|
2012-09-23 08:11:24 +02:00
|
|
|
|
return;
|
2012-09-01 01:57:37 +02:00
|
|
|
|
}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 04:32:11 +02:00
|
|
|
|
The `let` keyword introduces a local variable. Variables are immutable by
|
|
|
|
|
default. To introduce a local variable that you can re-assign later, use `let
|
|
|
|
|
mut` instead.
|
2012-09-23 03:59:34 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
let hi = "hi";
|
|
|
|
|
let mut count = 0;
|
|
|
|
|
|
|
|
|
|
while count < 10 {
|
2012-12-21 01:32:28 +01:00
|
|
|
|
io::println(fmt!("count: %?", count));
|
2012-09-23 03:59:34 +02:00
|
|
|
|
count += 1;
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-04 02:52:04 +02:00
|
|
|
|
Although Rust can almost always infer the types of local variables, you
|
|
|
|
|
can specify a variable's type by following it with a colon, then the type
|
2013-03-23 00:45:54 +01:00
|
|
|
|
name. Static items, on the other hand, always require a type annotation.
|
2012-09-23 03:59:34 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-23 01:24:26 +01:00
|
|
|
|
static monster_factor: float = 57.8;
|
2012-12-20 22:18:37 +01:00
|
|
|
|
let monster_size = monster_factor * 10.0;
|
2012-10-04 02:52:04 +02:00
|
|
|
|
let monster_size: int = 50;
|
2012-09-23 03:59:34 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 04:32:11 +02:00
|
|
|
|
Local variables may shadow earlier declarations, as in the previous example:
|
2012-12-30 22:09:34 +01:00
|
|
|
|
`monster_size` was first declared as a `float`, and then a second
|
|
|
|
|
`monster_size` was declared as an `int`. If you were to actually compile this
|
2013-01-05 02:31:12 +01:00
|
|
|
|
example, though, the compiler would determine that the first `monster_size` is
|
2012-10-11 04:32:11 +02:00
|
|
|
|
unused and issue a warning (because this situation is likely to indicate a
|
|
|
|
|
programmer error). For occasions where unused variables are intentional, their
|
2012-12-30 22:09:34 +01:00
|
|
|
|
names may be prefixed with an underscore to silence the warning, like `let
|
2012-10-11 04:32:11 +02:00
|
|
|
|
_monster_size = 50;`.
|
2012-10-03 05:28:53 +02:00
|
|
|
|
|
2012-12-20 12:21:04 +01:00
|
|
|
|
Rust identifiers start with an alphabetic
|
2012-09-23 03:59:34 +02:00
|
|
|
|
character or an underscore, and after that may contain any sequence of
|
|
|
|
|
alphabetic characters, numbers, or underscores. The preferred style is to
|
2012-12-20 22:20:02 +01:00
|
|
|
|
write function, variable, and module names with lowercase letters, using
|
2012-09-23 03:59:34 +02:00
|
|
|
|
underscores where they help readability, while writing types in camel case.
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
let my_variable = 100;
|
2012-12-20 22:20:02 +01:00
|
|
|
|
type MyType = int; // primitive types are _not_ camel case
|
2012-09-23 03:59:34 +02:00
|
|
|
|
~~~
|
|
|
|
|
|
2012-12-20 12:43:20 +01:00
|
|
|
|
## Expressions and semicolons
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
Though it isn't apparent in all code, there is a fundamental
|
2012-10-04 02:52:04 +02:00
|
|
|
|
difference between Rust's syntax and predecessors like C.
|
|
|
|
|
Many constructs that are statements in C are expressions
|
2012-09-23 03:59:34 +02:00
|
|
|
|
in Rust, allowing code to be more concise. For example, you might
|
2012-09-01 01:57:37 +02:00
|
|
|
|
write a piece of code like this:
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
# let item = "salad";
|
|
|
|
|
let price;
|
|
|
|
|
if item == "salad" {
|
|
|
|
|
price = 3.50;
|
|
|
|
|
} else if item == "muffin" {
|
|
|
|
|
price = 2.25;
|
|
|
|
|
} else {
|
|
|
|
|
price = 2.00;
|
|
|
|
|
}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-01 01:57:37 +02:00
|
|
|
|
But, in Rust, you don't have to repeat the name `price`:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-01 01:57:37 +02:00
|
|
|
|
# let item = "salad";
|
2012-10-04 02:52:04 +02:00
|
|
|
|
let price =
|
|
|
|
|
if item == "salad" {
|
|
|
|
|
3.50
|
|
|
|
|
} else if item == "muffin" {
|
|
|
|
|
2.25
|
|
|
|
|
} else {
|
|
|
|
|
2.00
|
|
|
|
|
};
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 04:32:11 +02:00
|
|
|
|
Both pieces of code are exactly equivalent: they assign a value to
|
2012-10-04 02:52:04 +02:00
|
|
|
|
`price` depending on the condition that holds. Note that there
|
2012-10-11 04:32:11 +02:00
|
|
|
|
are no semicolons in the blocks of the second snippet. This is
|
|
|
|
|
important: the lack of a semicolon after the last statement in a
|
2012-09-23 03:59:34 +02:00
|
|
|
|
braced block gives the whole block the value of that last expression.
|
2012-09-01 01:57:37 +02:00
|
|
|
|
|
2012-10-04 07:18:46 +02:00
|
|
|
|
Put another way, the semicolon in Rust *ignores the value of an expression*.
|
|
|
|
|
Thus, if the branches of the `if` had looked like `{ 4; }`, the above example
|
|
|
|
|
would simply assign `()` (nil or void) to `price`. But without the semicolon, each
|
|
|
|
|
branch has a different value, and `price` gets the value of the branch that
|
|
|
|
|
was taken.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-11 04:32:11 +02:00
|
|
|
|
In short, everything that's not a declaration (declarations are `let` for
|
2012-12-30 22:09:34 +01:00
|
|
|
|
variables; `fn` for functions; and any top-level named items such as
|
2012-10-11 04:32:11 +02:00
|
|
|
|
[traits](#traits), [enum types](#enums), and [constants](#constants)) is an
|
|
|
|
|
expression, including function bodies.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-23 03:59:34 +02:00
|
|
|
|
fn is_four(x: int) -> bool {
|
|
|
|
|
// No need for a return statement. The result of the expression
|
|
|
|
|
// is used as the return value.
|
|
|
|
|
x == 4
|
|
|
|
|
}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-20 12:43:20 +01:00
|
|
|
|
## Primitive types and literals
|
2012-06-21 02:09:30 +02:00
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
There are general signed and unsigned integer types, `int` and `uint`,
|
|
|
|
|
as well as 8-, 16-, 32-, and 64-bit variants, `i8`, `u16`, etc.
|
2012-10-11 04:32:11 +02:00
|
|
|
|
Integers can be written in decimal (`144`), hexadecimal (`0x90`), or
|
2012-09-23 07:22:49 +02:00
|
|
|
|
binary (`0b10010000`) base. Each integral type has a corresponding literal
|
|
|
|
|
suffix that can be used to indicate the type of a literal: `i` for `int`,
|
2012-12-20 12:43:20 +01:00
|
|
|
|
`u` for `uint`, `i8` for the `i8` type.
|
2012-06-21 02:09:30 +02:00
|
|
|
|
|
2012-10-11 04:32:11 +02:00
|
|
|
|
In the absence of an integer literal suffix, Rust will infer the
|
2012-09-23 07:22:49 +02:00
|
|
|
|
integer type based on type annotations and function signatures in the
|
|
|
|
|
surrounding program. In the absence of any type information at all,
|
|
|
|
|
Rust will assume that an unsuffixed integer literal has type
|
|
|
|
|
`int`.
|
2012-06-21 02:09:30 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-23 07:22:49 +02:00
|
|
|
|
let a = 1; // a is an int
|
|
|
|
|
let b = 10i; // b is an int, due to the 'i' suffix
|
2012-09-27 01:19:07 +02:00
|
|
|
|
let c = 100u; // c is a uint
|
2012-09-23 07:22:49 +02:00
|
|
|
|
let d = 1000i32; // d is an i32
|
2012-06-21 02:09:30 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
There are three floating-point types: `float`, `f32`, and `f64`.
|
|
|
|
|
Floating-point numbers are written `0.0`, `1e6`, or `2.1e-4`.
|
|
|
|
|
Like integers, floating-point literals are inferred to the correct type.
|
|
|
|
|
Suffixes `f`, `f32`, and `f64` can be used to create literals of a specific type.
|
2012-06-21 02:09:30 +02:00
|
|
|
|
|
2012-12-20 12:43:20 +01:00
|
|
|
|
The keywords `true` and `false` produce literals of type `bool`.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
Characters, the `char` type, are four-byte Unicode codepoints,
|
2012-12-20 12:43:20 +01:00
|
|
|
|
whose literals are written between single quotes, as in `'x'`.
|
|
|
|
|
Just like C, Rust understands a number of character escapes, using the backslash
|
2012-10-04 02:52:04 +02:00
|
|
|
|
character, such as `\n`, `\r`, and `\t`. String literals,
|
2012-12-20 12:43:20 +01:00
|
|
|
|
written between double quotes, allow the same escape sequences.
|
|
|
|
|
More on strings [later](#vectors-and-strings).
|
|
|
|
|
|
|
|
|
|
The nil type, written `()`, has a single value, also written `()`.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
## Operators
|
|
|
|
|
|
2012-09-01 02:20:36 +02:00
|
|
|
|
Rust's set of operators contains very few surprises. Arithmetic is done with
|
2012-12-30 22:09:34 +01:00
|
|
|
|
`*`, `/`, `%`, `+`, and `-` (multiply, divide, take remainder, add, and subtract). `-` is
|
|
|
|
|
also a unary prefix operator that negates numbers. As in C, the bitwise operators
|
2012-09-01 02:20:36 +02:00
|
|
|
|
`>>`, `<<`, `&`, `|`, and `^` are also supported.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-01 02:20:36 +02:00
|
|
|
|
Note that, if applied to an integer value, `!` flips all the bits (like `~` in
|
|
|
|
|
C).
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
The comparison operators are the traditional `==`, `!=`, `<`, `>`,
|
|
|
|
|
`<=`, and `>=`. Short-circuiting (lazy) boolean operators are written
|
|
|
|
|
`&&` (and) and `||` (or).
|
|
|
|
|
|
2012-09-01 02:20:36 +02:00
|
|
|
|
For type casting, Rust uses the binary `as` operator. It takes an
|
|
|
|
|
expression on the left side and a type on the right side and will,
|
2012-05-17 05:22:32 +02:00
|
|
|
|
if a meaningful conversion exists, convert the result of the
|
|
|
|
|
expression to the given type.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
let x: float = 4.0;
|
|
|
|
|
let y: uint = x as uint;
|
2013-03-29 02:39:09 +01:00
|
|
|
|
assert!(y == 4u);
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
## Syntax extensions
|
|
|
|
|
|
2012-09-01 02:20:36 +02:00
|
|
|
|
*Syntax extensions* are special forms that are not built into the language,
|
|
|
|
|
but are instead provided by the libraries. To make it clear to the reader when
|
2012-10-11 04:32:11 +02:00
|
|
|
|
a name refers to a syntax extension, the names of all syntax extensions end
|
|
|
|
|
with `!`. The standard library defines a few syntax extensions, the most
|
2012-12-20 22:45:54 +01:00
|
|
|
|
useful of which is `fmt!`, a `sprintf`-style text formatter that you will
|
|
|
|
|
often see in examples.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-11 04:32:11 +02:00
|
|
|
|
`fmt!` supports most of the directives that [printf][pf] supports, but unlike
|
|
|
|
|
printf, will give you a compile-time error when the types of the directives
|
2012-01-19 12:51:20 +01:00
|
|
|
|
don't match the types of the arguments.
|
|
|
|
|
|
2012-09-23 08:11:24 +02:00
|
|
|
|
~~~~
|
|
|
|
|
# let mystery_object = ();
|
|
|
|
|
|
2012-10-04 05:03:37 +02:00
|
|
|
|
io::println(fmt!("%s is %d", "the answer", 43));
|
2012-09-23 08:11:24 +02:00
|
|
|
|
|
|
|
|
|
// %? will conveniently print any type
|
|
|
|
|
io::println(fmt!("what is this thing: %?", mystery_object));
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-01-19 12:51:20 +01:00
|
|
|
|
[pf]: http://en.cppreference.com/w/cpp/io/c/fprintf
|
|
|
|
|
|
2012-10-11 04:32:11 +02:00
|
|
|
|
You can define your own syntax extensions with the macro system. For details, see the [macro tutorial][macros].
|
|
|
|
|
|
|
|
|
|
[macros]: tutorial-macros.html
|
2012-08-23 04:07:46 +02:00
|
|
|
|
|
2012-01-19 12:51:20 +01:00
|
|
|
|
# Control structures
|
|
|
|
|
|
|
|
|
|
## Conditionals
|
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
We've seen `if` expressions a few times already. To recap, braces are
|
|
|
|
|
compulsory, an `if` can have an optional `else` clause, and multiple
|
2012-01-19 12:51:20 +01:00
|
|
|
|
`if`/`else` constructs can be chained together:
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
if false {
|
2012-09-23 08:11:24 +02:00
|
|
|
|
io::println("that's odd");
|
2012-01-19 12:51:20 +01:00
|
|
|
|
} else if true {
|
2012-09-23 08:11:24 +02:00
|
|
|
|
io::println("right");
|
2012-01-19 12:51:20 +01:00
|
|
|
|
} else {
|
2012-09-23 08:11:24 +02:00
|
|
|
|
io::println("neither true nor false");
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
The condition given to an `if` construct *must* be of type `bool` (no
|
|
|
|
|
implicit conversion happens). If the arms are blocks that have a
|
|
|
|
|
value, this value must be of the same type for every arm in which
|
|
|
|
|
control reaches the end of the block:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
fn signum(x: int) -> int {
|
|
|
|
|
if x < 0 { -1 }
|
|
|
|
|
else if x > 0 { 1 }
|
2012-09-01 01:57:37 +02:00
|
|
|
|
else { return 0 }
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
## Pattern matching
|
|
|
|
|
|
2012-08-06 21:34:08 +02:00
|
|
|
|
Rust's `match` construct is a generalized, cleaned-up version of C's
|
2012-10-11 05:08:08 +02:00
|
|
|
|
`switch` construct. You provide it with a value and a number of
|
|
|
|
|
*arms*, each labelled with a pattern, and the code compares the value
|
|
|
|
|
against each pattern in order until one matches. The matching pattern
|
|
|
|
|
executes its corresponding arm.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
# let my_number = 1;
|
2012-08-06 21:34:08 +02:00
|
|
|
|
match my_number {
|
2012-09-01 01:57:37 +02:00
|
|
|
|
0 => io::println("zero"),
|
|
|
|
|
1 | 2 => io::println("one or two"),
|
|
|
|
|
3..10 => io::println("three to ten"),
|
|
|
|
|
_ => io::println("something else")
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
Unlike in C, there is no "falling through" between arms: only one arm
|
2012-10-11 05:08:08 +02:00
|
|
|
|
executes, and it doesn't have to explicitly `break` out of the
|
2012-01-19 12:51:20 +01:00
|
|
|
|
construct when it is finished.
|
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
A `match` arm consists of a *pattern*, then an arrow `=>`, followed by
|
|
|
|
|
an *action* (expression). Literals are valid patterns and match only
|
|
|
|
|
their own value. A single arm may match multiple different patterns by
|
|
|
|
|
combining them with the pipe operator (`|`), so long as every pattern
|
|
|
|
|
binds the same set of variables. Ranges of numeric literal patterns
|
|
|
|
|
can be expressed with two dots, as in `M..N`. The underscore (`_`) is
|
|
|
|
|
a wildcard pattern that matches any single value. The asterisk (`*`)
|
|
|
|
|
is a different wildcard that can match one or more fields in an `enum`
|
|
|
|
|
variant.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-11-11 17:19:40 +01:00
|
|
|
|
The patterns in a match arm are followed by a fat arrow, `=>`, then an
|
2012-08-06 07:07:22 +02:00
|
|
|
|
expression to evaluate. Each case is separated by commas. It's often
|
2012-10-04 02:52:04 +02:00
|
|
|
|
convenient to use a block expression for each case, in which case the
|
2012-08-06 07:07:22 +02:00
|
|
|
|
commas are optional.
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# let my_number = 1;
|
2012-08-06 21:34:08 +02:00
|
|
|
|
match my_number {
|
2012-10-04 02:52:04 +02:00
|
|
|
|
0 => { io::println("zero") }
|
|
|
|
|
_ => { io::println("something else") }
|
2012-08-06 07:07:22 +02:00
|
|
|
|
}
|
|
|
|
|
~~~
|
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
`match` constructs must be *exhaustive*: they must have an arm
|
|
|
|
|
covering every possible case. For example, the typechecker would
|
|
|
|
|
reject the previous example if the arm with the wildcard pattern was
|
|
|
|
|
omitted.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
A powerful application of pattern matching is *destructuring*:
|
|
|
|
|
matching in order to bind names to the contents of data
|
2013-04-03 23:52:28 +02:00
|
|
|
|
types. Assuming that `(float, float)` is a tuple of two floats:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-01 01:57:37 +02:00
|
|
|
|
fn angle(vector: (float, float)) -> float {
|
2012-09-25 04:51:03 +02:00
|
|
|
|
let pi = float::consts::pi;
|
2012-09-01 01:57:37 +02:00
|
|
|
|
match vector {
|
|
|
|
|
(0f, y) if y < 0f => 1.5 * pi,
|
|
|
|
|
(0f, y) => 0.5 * pi,
|
2012-08-06 07:07:22 +02:00
|
|
|
|
(x, y) => float::atan(y / x)
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
A variable name in a pattern matches any value, *and* binds that name
|
|
|
|
|
to the value of the matched value inside of the arm's action. Thus, `(0f,
|
2012-01-19 12:51:20 +01:00
|
|
|
|
y)` matches any tuple whose first element is zero, and binds `y` to
|
2012-12-30 22:09:34 +01:00
|
|
|
|
the second element. `(x, y)` matches any two-element tuple, and binds both
|
2012-10-11 05:08:08 +02:00
|
|
|
|
elements to variables.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
Any `match` arm can have a guard clause (written `if EXPR`), called a
|
|
|
|
|
*pattern guard*, which is an expression of type `bool` that
|
|
|
|
|
determines, after the pattern is found to match, whether the arm is
|
|
|
|
|
taken or not. The variables bound by the pattern are in scope in this
|
|
|
|
|
guard expression. The first arm in the `angle` example shows an
|
|
|
|
|
example of a pattern guard.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-25 04:51:03 +02:00
|
|
|
|
You've already seen simple `let` bindings, but `let` is a little
|
2012-10-11 05:08:08 +02:00
|
|
|
|
fancier than you've been led to believe. It, too, supports destructuring
|
|
|
|
|
patterns. For example, you can write this to extract the fields from a
|
|
|
|
|
tuple, introducing two variables at once: `a` and `b`.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
# fn get_tuple_of_two_ints() -> (int, int) { (1, 1) }
|
|
|
|
|
let (a, b) = get_tuple_of_two_ints();
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
Let bindings only work with _irrefutable_ patterns: that is, patterns
|
2012-09-25 04:51:03 +02:00
|
|
|
|
that can never fail to match. This excludes `let` from matching
|
2012-10-11 05:08:08 +02:00
|
|
|
|
literals and most `enum` variants.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
## Loops
|
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
`while` denotes a loop that iterates as long as its given condition
|
|
|
|
|
(which must have type `bool`) evaluates to `true`. Inside a loop, the
|
|
|
|
|
keyword `break` aborts the loop, and `loop` aborts the current
|
|
|
|
|
iteration and continues with the next.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-05-10 23:06:19 +02:00
|
|
|
|
~~~~
|
|
|
|
|
let mut cake_amount = 8;
|
|
|
|
|
while cake_amount > 0 {
|
|
|
|
|
cake_amount -= 1;
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
`loop` denotes an infinite loop, and is the preferred way of writing `while true`:
|
2012-05-10 23:06:19 +02:00
|
|
|
|
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
2012-03-22 16:39:41 +01:00
|
|
|
|
let mut x = 5;
|
2012-07-02 04:20:43 +02:00
|
|
|
|
loop {
|
2012-01-19 12:51:20 +01:00
|
|
|
|
x += x - 3;
|
|
|
|
|
if x % 5 == 0 { break; }
|
2013-02-26 16:36:59 +01:00
|
|
|
|
io::println(int::to_str(x));
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
This code prints out a weird sequence of numbers and stops as soon as
|
|
|
|
|
it finds one that can be divided by five.
|
|
|
|
|
|
2012-10-11 05:08:08 +02:00
|
|
|
|
For more involved iteration, such as enumerating the elements of a
|
|
|
|
|
collection, Rust uses [higher-order functions](#closures).
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-07 07:45:48 +02:00
|
|
|
|
# Data structures
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-09-01 01:57:37 +02:00
|
|
|
|
## Structs
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-09-01 01:57:37 +02:00
|
|
|
|
Rust struct types must be declared before they are used using the `struct`
|
|
|
|
|
syntax: `struct Name { field1: T1, field2: T2 [, ...] }`, where `T1`, `T2`,
|
|
|
|
|
... denote types. To construct a struct, use the same syntax, but leave off
|
2012-10-11 05:35:33 +02:00
|
|
|
|
the `struct`: for example: `Point { x: 1.0, y: 2.0 }`.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-09-01 01:57:37 +02:00
|
|
|
|
Structs are quite similar to C structs and are even laid out the same way in
|
2012-10-11 05:35:33 +02:00
|
|
|
|
memory (so you can read from a Rust struct in C, and vice-versa). Use the dot
|
|
|
|
|
operator to access struct fields, as in `mypoint.x`.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-20 00:15:57 +01:00
|
|
|
|
struct Point {
|
|
|
|
|
x: float,
|
|
|
|
|
y: float
|
2012-09-01 01:57:37 +02:00
|
|
|
|
}
|
2012-07-08 01:23:10 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2013-03-20 00:15:57 +01:00
|
|
|
|
Inherited mutability means that any field of a struct may be mutable, if the
|
|
|
|
|
struct is in a mutable slot (or a field of a struct in a mutable slot, and
|
|
|
|
|
so forth).
|
|
|
|
|
|
|
|
|
|
With a value (say, `mypoint`) of such a type in a mutable location, you can do
|
|
|
|
|
`mypoint.y += 1.0`. But in an immutable location, such an assignment to a
|
2013-02-26 19:02:36 +01:00
|
|
|
|
struct without inherited mutability would result in a type error.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2013-03-20 00:15:57 +01:00
|
|
|
|
~~~~ {.xfail-test}
|
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
let mut mypoint = Point { x: 1.0, y: 1.0 };
|
|
|
|
|
let origin = Point { x: 0.0, y: 0.0 };
|
|
|
|
|
|
|
|
|
|
mypoint.y += 1.0; // mypoint is mutable, and its fields as well
|
|
|
|
|
origin.y += 1.0; // ERROR: assigning to immutable field
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 05:35:33 +02:00
|
|
|
|
`match` patterns destructure structs. The basic syntax is
|
2012-12-30 22:09:34 +01:00
|
|
|
|
`Name { fieldname: pattern, ... }`:
|
2012-10-05 05:03:40 +02:00
|
|
|
|
|
2012-07-08 01:23:10 +02:00
|
|
|
|
~~~~
|
2012-09-01 01:57:37 +02:00
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
# let mypoint = Point { x: 0.0, y: 0.0 };
|
2012-08-06 21:34:08 +02:00
|
|
|
|
match mypoint {
|
2012-10-04 21:41:45 +02:00
|
|
|
|
Point { x: 0.0, y: yy } => { io::println(yy.to_str()); }
|
|
|
|
|
Point { x: xx, y: yy } => { io::println(xx.to_str() + " " + yy.to_str()); }
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-01 01:57:37 +02:00
|
|
|
|
In general, the field names of a struct do not have to appear in the same
|
|
|
|
|
order they appear in the type. When you are not interested in all
|
|
|
|
|
the fields of a struct, a struct pattern may end with `, _` (as in
|
2012-12-30 22:09:34 +01:00
|
|
|
|
`Name { field1, _ }`) to indicate that you're ignoring all other fields.
|
2012-10-05 05:03:40 +02:00
|
|
|
|
Additionally, struct fields have a shorthand matching form that simply
|
|
|
|
|
reuses the field name as the binding name.
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
# let mypoint = Point { x: 0.0, y: 0.0 };
|
|
|
|
|
match mypoint {
|
|
|
|
|
Point { x, _ } => { io::println(x.to_str()) }
|
|
|
|
|
}
|
|
|
|
|
~~~
|
|
|
|
|
|
2012-07-08 01:23:10 +02:00
|
|
|
|
## Enums
|
|
|
|
|
|
|
|
|
|
Enums are datatypes that have several alternate representations. For
|
|
|
|
|
example, consider the type shown earlier:
|
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-01 01:57:37 +02:00
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
enum Shape {
|
|
|
|
|
Circle(Point, float),
|
|
|
|
|
Rectangle(Point, Point)
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-27 01:41:14 +02:00
|
|
|
|
A value of this type is either a `Circle`, in which case it contains a
|
|
|
|
|
`Point` struct and a float, or a `Rectangle`, in which case it contains
|
|
|
|
|
two `Point` structs. The run-time representation of such a value
|
2012-07-08 01:23:10 +02:00
|
|
|
|
includes an identifier of the actual form that it holds, much like the
|
2012-12-30 22:09:34 +01:00
|
|
|
|
"tagged union" pattern in C, but with better static guarantees.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-10-11 05:35:33 +02:00
|
|
|
|
The above declaration will define a type `Shape` that can refer to
|
|
|
|
|
such shapes, and two functions, `Circle` and `Rectangle`, which can be
|
|
|
|
|
used to construct values of the type (taking arguments of the
|
2012-12-30 22:09:34 +01:00
|
|
|
|
specified types). So `Circle(Point { x: 0f, y: 0f }, 10f)` is the way to
|
2012-07-08 01:23:10 +02:00
|
|
|
|
create a new circle.
|
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
Enum variants need not have parameters. This `enum` declaration,
|
2012-10-11 05:35:33 +02:00
|
|
|
|
for example, is equivalent to a C enum:
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-16 03:44:44 +02:00
|
|
|
|
enum Direction {
|
|
|
|
|
North,
|
|
|
|
|
East,
|
|
|
|
|
South,
|
|
|
|
|
West
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 05:35:33 +02:00
|
|
|
|
This declaration defines `North`, `East`, `South`, and `West` as constants,
|
2012-09-16 03:44:44 +02:00
|
|
|
|
all of which have type `Direction`.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-10-11 05:35:33 +02:00
|
|
|
|
When an enum is C-like (that is, when none of the variants have
|
|
|
|
|
parameters), it is possible to explicitly set the discriminator values
|
|
|
|
|
to a constant value:
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-16 03:44:44 +02:00
|
|
|
|
enum Color {
|
|
|
|
|
Red = 0xff0000,
|
|
|
|
|
Green = 0x00ff00,
|
|
|
|
|
Blue = 0x0000ff
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
If an explicit discriminator is not specified for a variant, the value
|
|
|
|
|
defaults to the value of the previous variant plus one. If the first
|
|
|
|
|
variant does not have a discriminator, it defaults to 0. For example,
|
2012-10-11 05:35:33 +02:00
|
|
|
|
the value of `North` is 0, `East` is 1, `South` is 2, and `West` is 3.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-10-11 05:35:33 +02:00
|
|
|
|
When an enum is C-like, you can apply the `as` cast operator to
|
2012-12-30 22:09:34 +01:00
|
|
|
|
convert it to its discriminator value as an `int`.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
For enum types with multiple variants, destructuring is the only way to
|
|
|
|
|
get at their contents. All variant constructors can be used as
|
|
|
|
|
patterns, as in this definition of `area`:
|
|
|
|
|
|
|
|
|
|
~~~~
|
2012-10-07 07:35:08 +02:00
|
|
|
|
# struct Point {x: float, y: float}
|
2012-09-16 03:44:44 +02:00
|
|
|
|
# enum Shape { Circle(Point, float), Rectangle(Point, Point) }
|
|
|
|
|
fn area(sh: Shape) -> float {
|
2012-08-06 21:34:08 +02:00
|
|
|
|
match sh {
|
2012-09-16 03:44:44 +02:00
|
|
|
|
Circle(_, size) => float::consts::pi * size * size,
|
2012-12-30 22:09:34 +01:00
|
|
|
|
Rectangle(Point { x, y }, Point { x: x2, y: y2 }) => (x2 - x) * (y2 - y)
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-11-11 17:19:40 +01:00
|
|
|
|
You can write a lone `_` to ignore an individual field, and can
|
2012-10-11 05:35:33 +02:00
|
|
|
|
ignore all fields of a variant like: `Circle(*)`. As in their
|
|
|
|
|
introduction form, nullary enum patterns are written without
|
2012-09-24 03:02:28 +02:00
|
|
|
|
parentheses.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-12-30 22:09:34 +01:00
|
|
|
|
# struct Point { x: float, y: float }
|
2012-09-16 03:44:44 +02:00
|
|
|
|
# enum Direction { North, East, South, West }
|
|
|
|
|
fn point_from_direction(dir: Direction) -> Point {
|
2012-08-06 21:34:08 +02:00
|
|
|
|
match dir {
|
2012-12-30 22:09:34 +01:00
|
|
|
|
North => Point { x: 0f, y: 1f },
|
|
|
|
|
East => Point { x: 1f, y: 0f },
|
|
|
|
|
South => Point { x: 0f, y: -1f },
|
|
|
|
|
West => Point { x: -1f, y: 0f }
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-30 16:09:14 +01:00
|
|
|
|
Enum variants may also be structs. For example:
|
2012-12-20 08:05:21 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-12-30 06:52:51 +01:00
|
|
|
|
# use core::float;
|
2012-12-30 22:09:34 +01:00
|
|
|
|
# struct Point { x: float, y: float }
|
2012-12-20 08:05:21 +01:00
|
|
|
|
# fn square(x: float) -> float { x * x }
|
|
|
|
|
enum Shape {
|
|
|
|
|
Circle { center: Point, radius: float },
|
2012-12-30 16:09:14 +01:00
|
|
|
|
Rectangle { top_left: Point, bottom_right: Point }
|
2012-12-20 08:05:21 +01:00
|
|
|
|
}
|
|
|
|
|
fn area(sh: Shape) -> float {
|
2012-12-30 22:09:34 +01:00
|
|
|
|
match sh {
|
|
|
|
|
Circle { radius: radius, _ } => float::consts::pi * square(radius),
|
|
|
|
|
Rectangle { top_left: top_left, bottom_right: bottom_right } => {
|
|
|
|
|
(bottom_right.x - top_left.x) * (bottom_right.y - top_left.y)
|
|
|
|
|
}
|
|
|
|
|
}
|
2012-12-20 08:05:21 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
2012-12-30 22:09:34 +01:00
|
|
|
|
|
2012-07-08 01:23:10 +02:00
|
|
|
|
## Tuples
|
|
|
|
|
|
2012-09-27 01:41:14 +02:00
|
|
|
|
Tuples in Rust behave exactly like structs, except that their fields
|
2012-10-11 05:35:33 +02:00
|
|
|
|
do not have names. Thus, you cannot access their fields with dot notation.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
Tuples can have any arity except for 0 or 1 (though you may consider
|
2012-10-11 05:35:33 +02:00
|
|
|
|
unit, `()`, as the empty tuple if you like).
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
let mytup: (int, int, float) = (10, 20, 30.0);
|
2012-08-06 21:34:08 +02:00
|
|
|
|
match mytup {
|
2013-03-08 21:39:42 +01:00
|
|
|
|
(a, b, c) => info!(a + b + (c as int))
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
2012-07-08 00:37:58 +02:00
|
|
|
|
|
2012-12-20 05:52:03 +01:00
|
|
|
|
## Tuple structs
|
|
|
|
|
|
2013-03-08 03:15:07 +01:00
|
|
|
|
Rust also has _tuple structs_, which behave like both structs and tuples,
|
|
|
|
|
except that, unlike tuples, tuple structs have names (so `Foo(1, 2)` has a
|
|
|
|
|
different type from `Bar(1, 2)`), and tuple structs' _fields_ do not have
|
|
|
|
|
names.
|
2012-12-20 05:52:03 +01:00
|
|
|
|
|
|
|
|
|
For example:
|
|
|
|
|
~~~~
|
|
|
|
|
struct MyTup(int, int, float);
|
|
|
|
|
let mytup: MyTup = MyTup(10, 20, 30.0);
|
|
|
|
|
match mytup {
|
2013-03-08 21:39:42 +01:00
|
|
|
|
MyTup(a, b, c) => info!(a + b + (c as int))
|
2012-12-20 05:52:03 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2013-03-08 03:15:07 +01:00
|
|
|
|
<a name="newtype"></a>
|
|
|
|
|
|
|
|
|
|
There is a special case for tuple structs with a single field, which are
|
|
|
|
|
sometimes called "newtypes" (after Haskell's "newtype" feature). These are
|
|
|
|
|
used to define new types in such a way that the new name is not just a
|
|
|
|
|
synonym for an existing type but is rather its own distinct type.
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
struct GizmoId(int);
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
For convenience, you can extract the contents of such a struct with the
|
|
|
|
|
dereference (`*`) unary operator:
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
# struct GizmoId(int);
|
|
|
|
|
let my_gizmo_id: GizmoId = GizmoId(10);
|
|
|
|
|
let id_int: int = *my_gizmo_id;
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
Types like this can be useful to differentiate between data that have
|
|
|
|
|
the same type but must be used in different ways.
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
struct Inches(int);
|
|
|
|
|
struct Centimeters(int);
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
The above definitions allow for a simple way for programs to avoid
|
|
|
|
|
confusing numbers that correspond to different units.
|
|
|
|
|
|
2012-12-20 10:34:15 +01:00
|
|
|
|
# Functions
|
2012-09-25 03:25:57 +02:00
|
|
|
|
|
|
|
|
|
We've already seen several function definitions. Like all other static
|
|
|
|
|
declarations, such as `type`, functions can be declared both at the
|
2012-10-11 05:35:33 +02:00
|
|
|
|
top level and inside other functions (or in modules, which we'll come
|
|
|
|
|
back to [later](#modules-and-crates)). The `fn` keyword introduces a
|
|
|
|
|
function. A function has an argument list, which is a parenthesized
|
|
|
|
|
list of `expr: type` pairs separated by commas. An arrow `->`
|
|
|
|
|
separates the argument list and the function's return type.
|
2012-09-25 03:25:57 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-10-04 21:41:45 +02:00
|
|
|
|
fn line(a: int, b: int, x: int) -> int {
|
2012-10-05 02:33:06 +02:00
|
|
|
|
return a * x + b;
|
2012-09-25 03:25:57 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
The `return` keyword immediately returns from the body of a function. It
|
|
|
|
|
is optionally followed by an expression to return. A function can
|
2012-12-30 22:09:34 +01:00
|
|
|
|
also return a value by having its top-level block produce an
|
2012-09-25 03:25:57 +02:00
|
|
|
|
expression.
|
|
|
|
|
|
|
|
|
|
~~~~
|
2012-10-04 21:41:45 +02:00
|
|
|
|
fn line(a: int, b: int, x: int) -> int {
|
2012-10-05 02:33:06 +02:00
|
|
|
|
a * x + b
|
2012-09-25 03:25:57 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 05:35:33 +02:00
|
|
|
|
It's better Rust style to write a return value this way instead of
|
|
|
|
|
writing an explicit `return`. The utility of `return` comes in when
|
|
|
|
|
returning early from a function. Functions that do not return a value
|
|
|
|
|
are said to return nil, `()`, and both the return type and the return
|
|
|
|
|
value may be omitted from the definition. The following two functions
|
|
|
|
|
are equivalent.
|
2012-09-25 03:25:57 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
fn do_nothing_the_hard_way() -> () { return (); }
|
|
|
|
|
|
|
|
|
|
fn do_nothing_the_easy_way() { }
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-04 21:41:45 +02:00
|
|
|
|
Ending the function with a semicolon like so is equivalent to returning `()`.
|
|
|
|
|
|
|
|
|
|
~~~~
|
2012-10-05 02:33:06 +02:00
|
|
|
|
fn line(a: int, b: int, x: int) -> int { a * x + b }
|
|
|
|
|
fn oops(a: int, b: int, x: int) -> () { a * x + b; }
|
2012-10-04 21:41:45 +02:00
|
|
|
|
|
2013-03-29 02:39:09 +01:00
|
|
|
|
assert!(8 == line(5, 3, 1));
|
|
|
|
|
assert!(() == oops(5, 3, 1));
|
2012-10-04 21:41:45 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-21 01:51:37 +01:00
|
|
|
|
As with `match` expressions and `let` bindings, function arguments support
|
|
|
|
|
pattern destructuring. Like `let`, argument patterns must be irrefutable,
|
2012-12-30 22:09:34 +01:00
|
|
|
|
as in this example that unpacks the first value from a tuple and returns it.
|
2012-12-21 01:51:37 +01:00
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
fn first((value, _): (int, float)) -> int { value }
|
|
|
|
|
~~~
|
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
# Destructors
|
2012-07-02 04:20:43 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
C-style resource management requires the programmer to match every allocation
|
|
|
|
|
with a free, which means manually tracking the responsibility for cleaning up
|
|
|
|
|
(the owner). Correctness is left to the programmer, and it's easy to get wrong.
|
2012-09-24 03:45:42 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
The following code demonstrates manual memory management, in order to contrast
|
|
|
|
|
it with Rust's resource management. Rust enforces safety, so the `unsafe`
|
|
|
|
|
keyword is used to explicitly wrap the unsafe code. The keyword is a promise to
|
|
|
|
|
the compiler that unsafety does not leak outside of the unsafe block, and is
|
|
|
|
|
used to create safe concepts on top of low-level code.
|
2012-07-02 04:20:43 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-28 22:53:29 +01:00
|
|
|
|
use core::libc::{calloc, free, size_t};
|
2012-09-25 04:11:48 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
fn main() {
|
|
|
|
|
unsafe {
|
|
|
|
|
let a = calloc(1, int::bytes as size_t);
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
let d;
|
2012-09-25 04:37:41 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
{
|
|
|
|
|
let b = calloc(1, int::bytes as size_t);
|
|
|
|
|
|
|
|
|
|
let c = calloc(1, int::bytes as size_t);
|
|
|
|
|
d = c; // move ownership to d
|
2012-09-25 04:37:41 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
free(b);
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
free(d);
|
|
|
|
|
free(a);
|
|
|
|
|
}
|
2012-09-25 04:37:41 +02:00
|
|
|
|
}
|
2013-03-19 00:24:48 +01:00
|
|
|
|
~~~~
|
2012-09-25 04:37:41 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
Rust uses destructors to handle the release of resources like memory
|
|
|
|
|
allocations, files and sockets. An object will only be destroyed when there is
|
|
|
|
|
no longer any way to access it, which prevents dynamic failures from an attempt
|
|
|
|
|
to use a freed resource. When a task fails, the stack unwinds and the
|
|
|
|
|
destructors of all objects owned by that task are called.
|
2012-09-25 04:37:41 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
The unsafe code from above can be contained behind a safe API that prevents
|
|
|
|
|
memory leaks or use-after-free:
|
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-28 22:53:29 +01:00
|
|
|
|
use core::libc::{calloc, free, c_void, size_t};
|
2012-09-25 04:37:41 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
struct Blob { priv ptr: *c_void }
|
2013-01-11 04:08:07 +01:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
impl Blob {
|
2013-03-23 01:24:26 +01:00
|
|
|
|
fn new() -> Blob {
|
2013-03-19 00:24:48 +01:00
|
|
|
|
unsafe { Blob{ptr: calloc(1, int::bytes as size_t)} }
|
|
|
|
|
}
|
|
|
|
|
}
|
2013-01-11 04:08:07 +01:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
impl Drop for Blob {
|
|
|
|
|
fn finalize(&self) {
|
|
|
|
|
unsafe { free(self.ptr); }
|
|
|
|
|
}
|
|
|
|
|
}
|
2013-01-11 04:08:07 +01:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
fn main() {
|
|
|
|
|
let a = Blob::new();
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
let d;
|
2012-07-10 06:02:36 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
{
|
|
|
|
|
let b = Blob::new();
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
let c = Blob::new();
|
|
|
|
|
d = c; // move ownership to d
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
// b is destroyed here
|
|
|
|
|
}
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
// d is destroyed here
|
|
|
|
|
// a is destroyed here
|
|
|
|
|
}
|
2012-07-08 01:23:10 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
This pattern is common enough that Rust includes dynamically allocated memory
|
|
|
|
|
as first-class types (`~` and `@`). Non-memory resources like files are cleaned
|
|
|
|
|
up with custom destructors.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-19 00:24:48 +01:00
|
|
|
|
fn main() {
|
|
|
|
|
let a = ~0;
|
|
|
|
|
|
|
|
|
|
let d;
|
|
|
|
|
|
|
|
|
|
{
|
|
|
|
|
let b = ~0;
|
|
|
|
|
|
|
|
|
|
let c = ~0;
|
|
|
|
|
d = c; // move ownership to d
|
2012-10-04 21:41:45 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
// b is destroyed here
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
// d is destroyed here
|
|
|
|
|
// a is destroyed here
|
|
|
|
|
}
|
2012-07-08 01:23:10 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
# Ownership
|
|
|
|
|
|
|
|
|
|
Rust formalizes the concept of object ownership to delegate management of an
|
|
|
|
|
object's lifetime to either a variable or a task-local garbage collector. An
|
|
|
|
|
object's owner is responsible for managing the lifetime of the object by
|
|
|
|
|
calling the destructor, and the owner determines whether the object is mutable.
|
|
|
|
|
|
|
|
|
|
Ownership is recursive, so mutability is inherited recursively and a destructor
|
|
|
|
|
destroys the contained tree of owned objects. Variables are top-level owners
|
|
|
|
|
and destroy the contained object when they go out of scope. A box managed by
|
|
|
|
|
the garbage collector starts a new ownership tree, and the destructor is called
|
|
|
|
|
when it is collected.
|
|
|
|
|
|
|
|
|
|
If an object doesn't contain garbage-collected boxes, it consists of a single
|
|
|
|
|
ownership tree and is given the `Owned` trait which allows it to be sent
|
2013-03-28 23:40:44 +01:00
|
|
|
|
between tasks. Custom destructors can only be implemented directly on types
|
|
|
|
|
that are `Owned`, but garbage-collected boxes can still *contain* types with
|
|
|
|
|
custom destructors.
|
2013-03-19 00:24:48 +01:00
|
|
|
|
|
|
|
|
|
# Boxes
|
2013-01-11 04:08:07 +01:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
Many modern languages represent values as as pointers to heap memory by
|
|
|
|
|
default. In contrast, Rust, like C and C++, represents such types directly.
|
|
|
|
|
Another way to say this is that aggregate data in Rust are *unboxed*. This
|
|
|
|
|
means that if you `let x = Point { x: 1f, y: 1f };`, you are creating a struct
|
|
|
|
|
on the stack. If you then copy it into a data structure, you copy the entire
|
|
|
|
|
struct, not just a pointer.
|
|
|
|
|
|
|
|
|
|
For small structs like `Point`, this is usually more efficient than allocating
|
|
|
|
|
memory and indirecting through a pointer. But for big structs, or mutable
|
|
|
|
|
state, it can be useful to have a single copy on the stack or on the heap, and
|
|
|
|
|
refer to that through a pointer.
|
|
|
|
|
|
|
|
|
|
## Owned boxes
|
|
|
|
|
|
2013-03-29 19:51:52 +01:00
|
|
|
|
An owned box (`~`) is a uniquely owned allocation on the heap. It inherits the
|
|
|
|
|
mutability and lifetime of the owner as it would if there was no box.
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
let x = 5; // immutable
|
|
|
|
|
let mut y = 5; // mutable
|
|
|
|
|
y += 2;
|
|
|
|
|
|
|
|
|
|
let x = ~5; // immutable
|
|
|
|
|
let mut y = ~5; // mutable
|
|
|
|
|
*y += 2; // the * operator is needed to access the contained value
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
The purpose of an owned box is to add a layer of indirection in order to create
|
|
|
|
|
recursive data structures or cheaply pass around an object larger than a
|
|
|
|
|
pointer. Since an owned box has a unique owner, it can be used to represent any
|
|
|
|
|
tree data structure.
|
|
|
|
|
|
|
|
|
|
The following struct won't compile, because the lack of indirection would mean
|
|
|
|
|
it has an infinite size:
|
|
|
|
|
|
|
|
|
|
~~~~ {.xfail-test}
|
|
|
|
|
struct Foo {
|
|
|
|
|
child: Option<Foo>
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
> ***Note:*** The `Option` type is an enum that represents an *optional* value.
|
|
|
|
|
> It's comparable to a nullable pointer in many other languages, but stores the
|
|
|
|
|
> contained value unboxed.
|
|
|
|
|
|
|
|
|
|
Adding indirection with an owned pointer allocates the child outside of the
|
|
|
|
|
struct on the heap, which makes it a finite size and won't result in a
|
|
|
|
|
compile-time error:
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
struct Foo {
|
|
|
|
|
child: Option<~Foo>
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
2013-03-19 00:24:48 +01:00
|
|
|
|
|
|
|
|
|
## Managed boxes
|
2013-01-11 04:08:07 +01:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
A managed box (`@`) is a heap allocation with the lifetime managed by a
|
|
|
|
|
task-local garbage collector. It will be destroyed at some point after there
|
|
|
|
|
are no references left to the box, no later than the end of the task. Managed
|
|
|
|
|
boxes lack an owner, so they start a new ownership tree and don't inherit
|
|
|
|
|
mutability. They do own the contained object, and mutability is defined by the
|
|
|
|
|
type of the shared box (`@` or `@mut`). An object containing a managed box is
|
|
|
|
|
not `Owned`, and can't be sent between tasks.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2013-03-29 20:07:36 +01:00
|
|
|
|
~~~~
|
|
|
|
|
let a = @5; // immutable
|
|
|
|
|
|
|
|
|
|
let mut b = @5; // mutable variable, immutable box
|
|
|
|
|
b = @10;
|
|
|
|
|
|
|
|
|
|
let c = @mut 5; // immutable variable, mutable box
|
|
|
|
|
*c = 10;
|
|
|
|
|
|
|
|
|
|
let mut d = @mut 5; // mutable variable, mutable box
|
|
|
|
|
*d += 5;
|
|
|
|
|
d = @mut 15;
|
|
|
|
|
~~~~
|
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
# Move semantics
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
Rust uses a shallow copy for parameter passing, assignment and returning values
|
|
|
|
|
from functions. A shallow copy is considered a move of ownership if the
|
|
|
|
|
ownership tree of the copied value includes an owned box or a type with a
|
|
|
|
|
custom destructor. After a value has been moved, it can no longer be used from
|
|
|
|
|
the source location and will not be destroyed there.
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
let x = ~5;
|
|
|
|
|
let y = x.clone(); // y is a newly allocated box
|
|
|
|
|
let z = x; // no new memory allocated, x can no longer be used
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
# Borrowed pointers
|
|
|
|
|
|
|
|
|
|
Rust's borrowed pointers are a general purpose reference type. In contrast with
|
2013-03-29 19:16:52 +01:00
|
|
|
|
owned boxes, where the holder of an owned box is the owner of the pointed-to
|
|
|
|
|
memory, borrowed pointers never imply ownership. A pointer can be borrowed to
|
|
|
|
|
any object, and the compiler verifies that it cannot outlive the lifetime of
|
|
|
|
|
the object.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-09-25 04:11:48 +02:00
|
|
|
|
As an example, consider a simple struct type, `Point`:
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-09-25 04:11:48 +02:00
|
|
|
|
~~~
|
|
|
|
|
struct Point {
|
2012-12-30 22:09:34 +01:00
|
|
|
|
x: float,
|
|
|
|
|
y: float
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 05:52:20 +02:00
|
|
|
|
We can use this simple definition to allocate points in many different
|
|
|
|
|
ways. For example, in this code, each of these three local variables
|
|
|
|
|
contains a point, but allocated in a different location:
|
2012-07-08 01:27:59 +02:00
|
|
|
|
|
2012-09-25 04:11:48 +02:00
|
|
|
|
~~~
|
|
|
|
|
# struct Point { x: float, y: float }
|
2012-12-30 22:09:34 +01:00
|
|
|
|
let on_the_stack : Point = Point { x: 3.0, y: 4.0 };
|
|
|
|
|
let managed_box : @Point = @Point { x: 5.0, y: 1.0 };
|
|
|
|
|
let owned_box : ~Point = ~Point { x: 7.0, y: 9.0 };
|
2012-09-25 04:11:48 +02:00
|
|
|
|
~~~
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-09-25 04:11:48 +02:00
|
|
|
|
Suppose we wanted to write a procedure that computed the distance
|
|
|
|
|
between any two points, no matter where they were stored. For example,
|
|
|
|
|
we might like to compute the distance between `on_the_stack` and
|
2012-11-11 17:19:40 +01:00
|
|
|
|
`managed_box`, or between `managed_box` and `owned_box`. One option is
|
2012-09-25 04:11:48 +02:00
|
|
|
|
to define a function that takes two arguments of type point—that is,
|
|
|
|
|
it takes the points by value. But this will cause the points to be
|
|
|
|
|
copied when we call the function. For points, this is probably not so
|
|
|
|
|
bad, but often copies are expensive or, worse, if there are mutable
|
|
|
|
|
fields, they can change the semantics of your program. So we’d like to
|
|
|
|
|
define a function that takes the points by pointer. We can use
|
|
|
|
|
borrowed pointers to do this:
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-09-25 04:11:48 +02:00
|
|
|
|
~~~
|
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
# fn sqrt(f: float) -> float { 0f }
|
|
|
|
|
fn compute_distance(p1: &Point, p2: &Point) -> float {
|
|
|
|
|
let x_d = p1.x - p2.x;
|
|
|
|
|
let y_d = p1.y - p2.y;
|
|
|
|
|
sqrt(x_d * x_d + y_d * y_d)
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
2012-09-25 04:11:48 +02:00
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
Now we can call `compute_distance()` in various ways:
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# struct Point{ x: float, y: float };
|
2012-12-30 22:09:34 +01:00
|
|
|
|
# let on_the_stack : Point = Point { x: 3.0, y: 4.0 };
|
|
|
|
|
# let managed_box : @Point = @Point { x: 5.0, y: 1.0 };
|
|
|
|
|
# let owned_box : ~Point = ~Point { x: 7.0, y: 9.0 };
|
2012-09-25 04:11:48 +02:00
|
|
|
|
# fn compute_distance(p1: &Point, p2: &Point) -> float { 0f }
|
2012-11-11 17:19:40 +01:00
|
|
|
|
compute_distance(&on_the_stack, managed_box);
|
|
|
|
|
compute_distance(managed_box, owned_box);
|
2012-09-25 04:11:48 +02:00
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
Here the `&` operator is used to take the address of the variable
|
2012-09-27 01:41:14 +02:00
|
|
|
|
`on_the_stack`; this is because `on_the_stack` has the type `Point`
|
|
|
|
|
(that is, a struct value) and we have to take its address to get a
|
2012-09-25 04:11:48 +02:00
|
|
|
|
value. We also call this _borrowing_ the local variable
|
2012-12-30 22:09:34 +01:00
|
|
|
|
`on_the_stack`, because we are creating an alias: that is, another
|
2012-09-25 04:11:48 +02:00
|
|
|
|
route to the same data.
|
|
|
|
|
|
2012-11-11 17:19:40 +01:00
|
|
|
|
In the case of the boxes `managed_box` and `owned_box`, however, no
|
2012-09-25 04:11:48 +02:00
|
|
|
|
explicit action is necessary. The compiler will automatically convert
|
|
|
|
|
a box like `@point` or `~point` to a borrowed pointer like
|
|
|
|
|
`&point`. This is another form of borrowing; in this case, the
|
2012-12-30 22:09:34 +01:00
|
|
|
|
contents of the managed/owned box are being lent out.
|
2012-09-25 04:11:48 +02:00
|
|
|
|
|
|
|
|
|
Whenever a value is borrowed, there are some limitations on what you
|
|
|
|
|
can do with the original. For example, if the contents of a variable
|
|
|
|
|
have been lent out, you cannot send that variable to another task, nor
|
|
|
|
|
will you be permitted to take actions that might cause the borrowed
|
|
|
|
|
value to be freed or to change its type. This rule should make
|
|
|
|
|
intuitive sense: you must wait for a borrowed value to be returned
|
|
|
|
|
(that is, for the borrowed pointer to go out of scope) before you can
|
|
|
|
|
make full use of it again.
|
|
|
|
|
|
|
|
|
|
For a more in-depth explanation of borrowed pointers, read the
|
|
|
|
|
[borrowed pointer tutorial][borrowtut].
|
|
|
|
|
|
|
|
|
|
[borrowtut]: tutorial-borrowed-ptr.html
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
## Freezing
|
|
|
|
|
|
|
|
|
|
Borrowing an immutable pointer to an object freezes it and prevents mutation.
|
2013-03-29 20:34:14 +01:00
|
|
|
|
`Owned` objects have freezing enforced statically at compile-time.
|
2013-03-19 00:24:48 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
let mut x = 5;
|
|
|
|
|
{
|
|
|
|
|
let y = &x; // x is now frozen, it cannot be modified
|
|
|
|
|
}
|
|
|
|
|
// x is now unfrozen again
|
|
|
|
|
~~~~
|
|
|
|
|
|
2013-03-29 20:34:14 +01:00
|
|
|
|
Mutable managed boxes handle freezing dynamically when any of their contents
|
|
|
|
|
are borrowed, and the task will fail if an attempt to modify them is made while
|
|
|
|
|
they are frozen:
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
let x = @mut 5;
|
|
|
|
|
let y = x;
|
|
|
|
|
{
|
|
|
|
|
let y = &*y; // the managed box is now frozen
|
|
|
|
|
// modifying it through x or y will cause a task failure
|
|
|
|
|
}
|
|
|
|
|
// the box is now unfrozen again
|
|
|
|
|
~~~~
|
|
|
|
|
|
2013-03-19 00:24:48 +01:00
|
|
|
|
# Dereferencing pointers
|
2012-10-06 03:39:09 +02:00
|
|
|
|
|
|
|
|
|
Rust uses the unary star operator (`*`) to access the contents of a
|
|
|
|
|
box or pointer, similarly to C.
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
let managed = @10;
|
|
|
|
|
let owned = ~20;
|
|
|
|
|
let borrowed = &30;
|
|
|
|
|
|
|
|
|
|
let sum = *managed + *owned + *borrowed;
|
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
Dereferenced mutable pointers may appear on the left hand side of
|
2012-10-11 05:52:20 +02:00
|
|
|
|
assignments. Such an assignment modifies the value that the pointer
|
|
|
|
|
points to.
|
2012-10-06 03:39:09 +02:00
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
let managed = @mut 10;
|
2013-02-26 20:32:00 +01:00
|
|
|
|
let mut owned = ~20;
|
2012-10-06 03:39:09 +02:00
|
|
|
|
|
|
|
|
|
let mut value = 30;
|
|
|
|
|
let borrowed = &mut value;
|
|
|
|
|
|
|
|
|
|
*managed = *owned + 10;
|
|
|
|
|
*owned = *borrowed + 100;
|
|
|
|
|
*borrowed = *managed + 1000;
|
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
Pointers have high operator precedence, but lower precedence than the
|
2012-10-11 05:52:20 +02:00
|
|
|
|
dot operator used for field and method access. This precedence order
|
|
|
|
|
can sometimes make code awkward and parenthesis-filled.
|
2012-10-06 03:39:09 +02:00
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
# enum Shape { Rectangle(Point, Point) }
|
2013-03-13 03:32:14 +01:00
|
|
|
|
# impl Shape { fn area(&self) -> int { 0 } }
|
2012-10-06 03:39:09 +02:00
|
|
|
|
let start = @Point { x: 10f, y: 20f };
|
|
|
|
|
let end = ~Point { x: (*start).x + 100f, y: (*start).y + 100f };
|
|
|
|
|
let rect = &Rectangle(*start, *end);
|
|
|
|
|
let area = (*rect).area();
|
|
|
|
|
~~~
|
|
|
|
|
|
2012-10-11 05:52:20 +02:00
|
|
|
|
To combat this ugliness the dot operator applies _automatic pointer
|
2012-12-30 22:09:34 +01:00
|
|
|
|
dereferencing_ to the receiver (the value on the left-hand side of the
|
2012-10-11 05:52:20 +02:00
|
|
|
|
dot), so in most cases, explicitly dereferencing the receiver is not necessary.
|
2012-10-06 03:39:09 +02:00
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
# enum Shape { Rectangle(Point, Point) }
|
2013-03-13 03:32:14 +01:00
|
|
|
|
# impl Shape { fn area(&self) -> int { 0 } }
|
2012-10-06 03:39:09 +02:00
|
|
|
|
let start = @Point { x: 10f, y: 20f };
|
|
|
|
|
let end = ~Point { x: start.x + 100f, y: start.y + 100f };
|
|
|
|
|
let rect = &Rectangle(*start, *end);
|
|
|
|
|
let area = rect.area();
|
|
|
|
|
~~~
|
|
|
|
|
|
2012-10-11 05:52:20 +02:00
|
|
|
|
You can write an expression that dereferences any number of pointers
|
|
|
|
|
automatically. For example, if you felt inclined, you could write
|
|
|
|
|
something silly like
|
2012-10-06 03:39:09 +02:00
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
let point = &@~Point { x: 10f, y: 20f };
|
|
|
|
|
io::println(fmt!("%f", point.x));
|
|
|
|
|
~~~
|
|
|
|
|
|
2012-10-11 05:52:20 +02:00
|
|
|
|
The indexing operator (`[]`) also auto-dereferences.
|
2012-10-06 03:39:09 +02:00
|
|
|
|
|
2012-09-24 03:45:42 +02:00
|
|
|
|
# Vectors and strings
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-10-11 06:06:22 +02:00
|
|
|
|
A vector is a contiguous section of memory containing zero or more
|
2012-07-10 06:06:22 +02:00
|
|
|
|
values of the same type. Like other types in Rust, vectors can be
|
2012-09-24 03:45:42 +02:00
|
|
|
|
stored on the stack, the local heap, or the exchange heap. Borrowed
|
|
|
|
|
pointers to vectors are also called 'slices'.
|
2012-07-08 02:31:39 +02:00
|
|
|
|
|
|
|
|
|
~~~
|
2012-10-07 10:06:07 +02:00
|
|
|
|
# enum Crayon {
|
|
|
|
|
# Almond, AntiqueBrass, Apricot,
|
|
|
|
|
# Aquamarine, Asparagus, AtomicTangerine,
|
|
|
|
|
# BananaMania, Beaver, Bittersweet,
|
|
|
|
|
# Black, BlizzardBlue, Blue
|
|
|
|
|
# }
|
2012-09-24 03:45:42 +02:00
|
|
|
|
// A fixed-size stack vector
|
2013-03-23 01:24:26 +01:00
|
|
|
|
let stack_crayons: [Crayon, ..3] = [Almond, AntiqueBrass, Apricot];
|
2012-09-24 03:45:42 +02:00
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
// A borrowed pointer to stack-allocated vector
|
2012-10-05 02:58:13 +02:00
|
|
|
|
let stack_crayons: &[Crayon] = &[Aquamarine, Asparagus, AtomicTangerine];
|
2012-09-24 03:45:42 +02:00
|
|
|
|
|
|
|
|
|
// A local heap (managed) vector of crayons
|
2012-10-05 02:58:13 +02:00
|
|
|
|
let local_crayons: @[Crayon] = @[BananaMania, Beaver, Bittersweet];
|
2012-09-24 03:45:42 +02:00
|
|
|
|
|
|
|
|
|
// An exchange heap (owned) vector of crayons
|
2012-10-05 02:58:13 +02:00
|
|
|
|
let exchange_crayons: ~[Crayon] = ~[Black, BlizzardBlue, Blue];
|
2012-07-08 02:31:39 +02:00
|
|
|
|
~~~
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
The `+` operator means concatenation when applied to vector types.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-16 03:44:44 +02:00
|
|
|
|
# enum Crayon { Almond, AntiqueBrass, Apricot,
|
|
|
|
|
# Aquamarine, Asparagus, AtomicTangerine,
|
|
|
|
|
# BananaMania, Beaver, Bittersweet };
|
2012-07-08 02:31:39 +02:00
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
let my_crayons = ~[Almond, AntiqueBrass, Apricot];
|
|
|
|
|
let your_crayons = ~[BananaMania, Beaver, Bittersweet];
|
|
|
|
|
|
|
|
|
|
// Add two vectors to create a new one
|
|
|
|
|
let our_crayons = my_crayons + your_crayons;
|
|
|
|
|
|
2012-10-07 10:06:07 +02:00
|
|
|
|
// += will append to a vector, provided it lives in a mutable slot
|
2012-12-20 11:37:21 +01:00
|
|
|
|
let mut my_crayons = my_crayons;
|
2012-10-05 02:58:13 +02:00
|
|
|
|
my_crayons += your_crayons;
|
2012-07-08 01:23:10 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
> ***Note:*** The above examples of vector addition use owned
|
|
|
|
|
> vectors. Some operations on slices and stack vectors are
|
2012-10-11 06:06:22 +02:00
|
|
|
|
> not yet well-supported. Owned vectors are often the most
|
2012-10-05 02:58:13 +02:00
|
|
|
|
> usable.
|
|
|
|
|
|
2012-10-11 06:06:22 +02:00
|
|
|
|
Square brackets denote indexing into a vector:
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-07-08 02:31:39 +02:00
|
|
|
|
~~~~
|
2012-09-16 03:44:44 +02:00
|
|
|
|
# enum Crayon { Almond, AntiqueBrass, Apricot,
|
|
|
|
|
# Aquamarine, Asparagus, AtomicTangerine,
|
|
|
|
|
# BananaMania, Beaver, Bittersweet };
|
2012-10-05 02:58:13 +02:00
|
|
|
|
# fn draw_scene(c: Crayon) { }
|
2013-03-23 01:24:26 +01:00
|
|
|
|
let crayons: [Crayon, ..3] = [BananaMania, Beaver, Bittersweet];
|
2012-10-05 02:58:13 +02:00
|
|
|
|
match crayons[0] {
|
|
|
|
|
Bittersweet => draw_scene(crayons[0]),
|
|
|
|
|
_ => ()
|
|
|
|
|
}
|
2012-07-08 02:31:39 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2013-01-23 14:26:47 +01:00
|
|
|
|
A vector can be destructured using pattern matching:
|
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-23 01:24:26 +01:00
|
|
|
|
let numbers: [int, ..3] = [1, 2, 3];
|
2013-01-23 14:26:47 +01:00
|
|
|
|
let score = match numbers {
|
|
|
|
|
[] => 0,
|
|
|
|
|
[a] => a * 10,
|
|
|
|
|
[a, b] => a * 6 + b * 4,
|
|
|
|
|
[a, b, c, ..rest] => a * 5 + b * 3 + c * 2 + rest.len() as int
|
|
|
|
|
};
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
The elements of a vector _inherit the mutability of the vector_,
|
2012-10-11 06:06:22 +02:00
|
|
|
|
and as such, individual elements may not be reassigned when the
|
2012-10-05 02:58:13 +02:00
|
|
|
|
vector lives in an immutable slot.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
~~~ {.xfail-test}
|
2012-09-16 03:44:44 +02:00
|
|
|
|
# enum Crayon { Almond, AntiqueBrass, Apricot,
|
|
|
|
|
# Aquamarine, Asparagus, AtomicTangerine,
|
|
|
|
|
# BananaMania, Beaver, Bittersweet };
|
2012-10-05 02:58:13 +02:00
|
|
|
|
let crayons: ~[Crayon] = ~[BananaMania, Beaver, Bittersweet];
|
2012-07-08 02:31:39 +02:00
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
crayons[0] = Apricot; // ERROR: Can't assign to immutable vector
|
|
|
|
|
~~~
|
2012-07-08 02:31:39 +02:00
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
Moving it into a mutable slot makes the elements assignable.
|
2012-07-08 02:31:39 +02:00
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
~~~
|
2012-09-16 03:44:44 +02:00
|
|
|
|
# enum Crayon { Almond, AntiqueBrass, Apricot,
|
|
|
|
|
# Aquamarine, Asparagus, AtomicTangerine,
|
|
|
|
|
# BananaMania, Beaver, Bittersweet };
|
2012-10-05 02:58:13 +02:00
|
|
|
|
let crayons: ~[Crayon] = ~[BananaMania, Beaver, Bittersweet];
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
// Put the vector into a mutable slot
|
2013-02-15 11:44:18 +01:00
|
|
|
|
let mut mutable_crayons = crayons;
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
// Now it's mutable to the bone
|
|
|
|
|
mutable_crayons[0] = Apricot;
|
|
|
|
|
~~~
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
2012-10-05 02:58:13 +02:00
|
|
|
|
This is a simple example of Rust's _dual-mode data structures_, also
|
|
|
|
|
referred to as _freezing and thawing_.
|
2012-09-24 03:45:42 +02:00
|
|
|
|
|
2012-10-11 06:06:22 +02:00
|
|
|
|
Strings are implemented with vectors of `u8`, though they have a
|
|
|
|
|
distinct type. They support most of the same allocation options as
|
|
|
|
|
vectors, though the string literal without a storage sigil (for
|
|
|
|
|
example, `"foo"`) is treated differently than a comparable vector
|
|
|
|
|
(`[foo]`). Whereas plain vectors are stack-allocated fixed-length
|
2012-12-31 22:46:52 +01:00
|
|
|
|
vectors, plain strings are borrowed pointers to read-only (static)
|
2012-10-11 06:06:22 +02:00
|
|
|
|
memory. All strings are immutable.
|
2012-09-24 03:45:42 +02:00
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
// A plain string is a slice to read-only (static) memory
|
|
|
|
|
let stack_crayons: &str = "Almond, AntiqueBrass, Apricot";
|
|
|
|
|
|
2012-10-04 21:41:45 +02:00
|
|
|
|
// The same thing, but with the `&`
|
2012-10-05 02:58:13 +02:00
|
|
|
|
let stack_crayons: &str = &"Aquamarine, Asparagus, AtomicTangerine";
|
2012-09-24 03:45:42 +02:00
|
|
|
|
|
|
|
|
|
// A local heap (managed) string
|
2012-12-10 21:58:18 +01:00
|
|
|
|
let local_crayons: @str = @"BananaMania, Beaver, Bittersweet";
|
2012-09-24 03:45:42 +02:00
|
|
|
|
|
|
|
|
|
// An exchange heap (owned) string
|
2012-10-05 02:58:13 +02:00
|
|
|
|
let exchange_crayons: ~str = ~"Black, BlizzardBlue, Blue";
|
2012-09-24 03:45:42 +02:00
|
|
|
|
~~~
|
2012-07-08 02:54:13 +02:00
|
|
|
|
|
|
|
|
|
Both vectors and strings support a number of useful
|
2012-10-05 02:58:13 +02:00
|
|
|
|
[methods](#functions-and-methods), defined in [`core::vec`]
|
|
|
|
|
and [`core::str`]. Here are some examples.
|
|
|
|
|
|
|
|
|
|
[`core::vec`]: core/vec.html
|
|
|
|
|
[`core::str`]: core/str.html
|
2012-07-08 02:54:13 +02:00
|
|
|
|
|
|
|
|
|
~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use core::io::println;
|
2012-09-16 03:44:44 +02:00
|
|
|
|
# enum Crayon {
|
|
|
|
|
# Almond, AntiqueBrass, Apricot,
|
|
|
|
|
# Aquamarine, Asparagus, AtomicTangerine,
|
|
|
|
|
# BananaMania, Beaver, Bittersweet
|
2012-07-08 02:54:13 +02:00
|
|
|
|
# }
|
2012-09-16 03:44:44 +02:00
|
|
|
|
# fn unwrap_crayon(c: Crayon) -> int { 0 }
|
2012-07-08 02:54:13 +02:00
|
|
|
|
# fn eat_crayon_wax(i: int) { }
|
2012-09-16 03:44:44 +02:00
|
|
|
|
# fn store_crayon_in_nasal_cavity(i: uint, c: Crayon) { }
|
2012-10-13 01:41:16 +02:00
|
|
|
|
# fn crayon_to_str(c: Crayon) -> &str { "" }
|
2012-07-08 02:54:13 +02:00
|
|
|
|
|
2012-10-13 03:47:46 +02:00
|
|
|
|
let crayons = [Almond, AntiqueBrass, Apricot];
|
2012-07-08 02:54:13 +02:00
|
|
|
|
|
|
|
|
|
// Check the length of the vector
|
2013-03-29 02:39:09 +01:00
|
|
|
|
assert!(crayons.len() == 3);
|
|
|
|
|
assert!(!crayons.is_empty());
|
2012-07-08 02:54:13 +02:00
|
|
|
|
|
2012-09-19 06:41:37 +02:00
|
|
|
|
// Iterate over a vector, obtaining a pointer to each element
|
2012-07-08 02:54:13 +02:00
|
|
|
|
for crayons.each |crayon| {
|
2012-09-20 03:59:48 +02:00
|
|
|
|
let delicious_crayon_wax = unwrap_crayon(*crayon);
|
2012-07-08 02:54:13 +02:00
|
|
|
|
eat_crayon_wax(delicious_crayon_wax);
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
// Map vector elements
|
2012-09-22 03:43:30 +02:00
|
|
|
|
let crayon_names = crayons.map(|v| crayon_to_str(*v));
|
2012-07-08 02:54:13 +02:00
|
|
|
|
let favorite_crayon_name = crayon_names[0];
|
|
|
|
|
|
|
|
|
|
// Remove whitespace from before and after the string
|
|
|
|
|
let new_favorite_crayon_name = favorite_crayon_name.trim();
|
|
|
|
|
|
|
|
|
|
if favorite_crayon_name.len() > 5 {
|
|
|
|
|
// Create a substring
|
|
|
|
|
println(favorite_crayon_name.substr(0, 5));
|
|
|
|
|
}
|
|
|
|
|
~~~
|
|
|
|
|
|
2012-07-08 01:23:10 +02:00
|
|
|
|
# Closures
|
|
|
|
|
|
2012-07-08 03:15:59 +02:00
|
|
|
|
Named functions, like those we've seen so far, may not refer to local
|
2012-10-11 06:06:22 +02:00
|
|
|
|
variables declared outside the function: they do not close over their
|
|
|
|
|
environment (sometimes referred to as "capturing" variables in their
|
|
|
|
|
environment). For example, you couldn't write the following:
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
~~~~ {.ignore}
|
|
|
|
|
let foo = 10;
|
|
|
|
|
|
|
|
|
|
fn bar() -> int {
|
2012-08-02 02:30:05 +02:00
|
|
|
|
return foo; // `bar` cannot refer to `foo`
|
2012-07-08 01:23:10 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
Rust also supports _closures_, functions that can access variables in
|
|
|
|
|
the enclosing scope.
|
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use println = core::io::println;
|
2013-03-09 01:21:58 +01:00
|
|
|
|
fn call_closure_with_ten(b: &fn(int)) { b(10); }
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
let captured_var = 20;
|
2012-08-23 02:44:14 +02:00
|
|
|
|
let closure = |arg| println(fmt!("captured_var=%d, arg=%d", captured_var, arg));
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
call_closure_with_ten(closure);
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 06:06:22 +02:00
|
|
|
|
Closures begin with the argument list between vertical bars and are followed by
|
2012-07-08 03:15:59 +02:00
|
|
|
|
a single expression. The types of the arguments are generally omitted,
|
|
|
|
|
as is the return type, because the compiler can almost always infer
|
2012-10-11 06:06:22 +02:00
|
|
|
|
them. In the rare case where the compiler needs assistance, though, the
|
2012-07-08 03:15:59 +02:00
|
|
|
|
arguments and return types may be annotated.
|
2012-07-08 01:23:10 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-11-24 18:28:34 +01:00
|
|
|
|
let square = |x: int| -> uint { x * x as uint };
|
2012-07-02 04:20:43 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
There are several forms of closure, each with its own role. The most
|
2012-10-13 02:48:45 +02:00
|
|
|
|
common, called a _stack closure_, has type `&fn` and can directly
|
2012-07-03 01:27:53 +02:00
|
|
|
|
access local variables in the enclosing scope.
|
2012-07-02 04:20:43 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
let mut max = 0;
|
2012-10-06 04:10:41 +02:00
|
|
|
|
[1, 2, 3].map(|x| if *x > max { max = *x });
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-07-02 04:20:43 +02:00
|
|
|
|
Stack closures are very efficient because their environment is
|
|
|
|
|
allocated on the call stack and refers by pointer to captured
|
|
|
|
|
locals. To ensure that stack closures never outlive the local
|
2012-10-11 06:06:22 +02:00
|
|
|
|
variables to which they refer, stack closures are not
|
|
|
|
|
first-class. That is, they can only be used in argument position; they
|
|
|
|
|
cannot be stored in data structures or returned from
|
|
|
|
|
functions. Despite these limitations, stack closures are used
|
2012-07-02 04:20:43 +02:00
|
|
|
|
pervasively in Rust code.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-24 03:45:42 +02:00
|
|
|
|
## Managed closures
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-01-20 18:14:30 +01:00
|
|
|
|
When you need to store a closure in a data structure, a stack closure
|
|
|
|
|
will not do, since the compiler will refuse to let you store it. For
|
|
|
|
|
this purpose, Rust provides a type of closure that has an arbitrary
|
2012-10-13 02:48:45 +02:00
|
|
|
|
lifetime, written `@fn` (boxed closure, analogous to the `@` pointer
|
2012-10-11 06:06:22 +02:00
|
|
|
|
type described earlier). This type of closure *is* first-class.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-24 03:45:42 +02:00
|
|
|
|
A managed closure does not directly access its environment, but merely
|
2012-01-19 12:51:20 +01:00
|
|
|
|
copies out the values that it closes over into a private data
|
|
|
|
|
structure. This means that it can not assign to these variables, and
|
2012-10-11 06:06:22 +02:00
|
|
|
|
cannot observe updates to them.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
This code creates a closure that adds a given string to its argument,
|
|
|
|
|
returns it from a function, and then calls it:
|
|
|
|
|
|
|
|
|
|
~~~~
|
2012-10-06 04:51:36 +02:00
|
|
|
|
# extern mod std;
|
2012-10-13 02:48:45 +02:00
|
|
|
|
fn mk_appender(suffix: ~str) -> @fn(~str) -> ~str {
|
|
|
|
|
// The compiler knows that we intend this closure to be of type @fn
|
|
|
|
|
return |s| s + suffix;
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
fn main() {
|
2012-07-14 07:57:48 +02:00
|
|
|
|
let shout = mk_appender(~"!");
|
|
|
|
|
io::println(shout(~"hey ho, let's go"));
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-24 03:45:42 +02:00
|
|
|
|
## Owned closures
|
2012-07-03 08:04:55 +02:00
|
|
|
|
|
2012-10-13 02:48:45 +02:00
|
|
|
|
Owned closures, written `~fn` in analogy to the `~` pointer type,
|
2012-07-10 09:35:14 +02:00
|
|
|
|
hold on to things that can safely be sent between
|
2012-09-24 03:45:42 +02:00
|
|
|
|
processes. They copy the values they close over, much like managed
|
2012-10-11 06:06:22 +02:00
|
|
|
|
closures, but they also own them: that is, no other code can access
|
2012-09-24 03:45:42 +02:00
|
|
|
|
them. Owned closures are used in concurrent code, particularly
|
2012-10-11 23:10:05 +02:00
|
|
|
|
for spawning [tasks][tasks].
|
|
|
|
|
|
|
|
|
|
[tasks]: tutorial-tasks.html
|
2012-07-03 08:04:55 +02:00
|
|
|
|
|
2012-07-08 00:08:38 +02:00
|
|
|
|
## Closure compatibility
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-11 06:06:22 +02:00
|
|
|
|
Rust closures have a convenient subtyping property: you can pass any kind of
|
2012-01-19 12:51:20 +01:00
|
|
|
|
closure (as long as the arguments and return types match) to functions
|
2013-04-01 03:34:51 +02:00
|
|
|
|
that expect a `&fn()`. Thus, when writing a higher-order function that
|
2012-10-11 06:06:22 +02:00
|
|
|
|
only calls its function argument, and does nothing else with it, you
|
2013-04-01 03:34:51 +02:00
|
|
|
|
should almost always declare the type of that argument as `&fn()`. That way,
|
2012-10-11 06:06:22 +02:00
|
|
|
|
callers may pass any kind of closure.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-09 01:21:58 +01:00
|
|
|
|
fn call_twice(f: &fn()) { f(); f(); }
|
2012-10-13 02:48:45 +02:00
|
|
|
|
let closure = || { "I'm a closure, and it doesn't matter what type I am"; };
|
|
|
|
|
fn function() { "I'm a normal function"; }
|
|
|
|
|
call_twice(closure);
|
|
|
|
|
call_twice(function);
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-24 03:45:42 +02:00
|
|
|
|
> ***Note:*** Both the syntax and the semantics will be changing
|
2012-10-11 06:06:22 +02:00
|
|
|
|
> in small ways. At the moment they can be unsound in some
|
2012-09-24 03:45:42 +02:00
|
|
|
|
> scenarios, particularly with non-copyable types.
|
|
|
|
|
|
2012-07-07 00:10:12 +02:00
|
|
|
|
## Do syntax
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-07 10:52:06 +02:00
|
|
|
|
The `do` expression provides a way to treat higher-order functions
|
|
|
|
|
(functions that take closures as arguments) as control structures.
|
2012-10-04 21:41:45 +02:00
|
|
|
|
|
2012-10-11 06:06:22 +02:00
|
|
|
|
Consider this function that iterates over a vector of
|
2012-09-19 06:41:37 +02:00
|
|
|
|
integers, passing in a pointer to each integer in the vector:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-09 01:21:58 +01:00
|
|
|
|
fn each(v: &[int], op: &fn(v: &int)) {
|
2012-07-04 10:50:51 +02:00
|
|
|
|
let mut n = 0;
|
|
|
|
|
while n < v.len() {
|
2012-09-19 06:41:37 +02:00
|
|
|
|
op(&v[n]);
|
2012-07-04 10:50:51 +02:00
|
|
|
|
n += 1;
|
|
|
|
|
}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-07 10:52:06 +02:00
|
|
|
|
As an aside, the reason we pass in a *pointer* to an integer rather
|
|
|
|
|
than the integer itself is that this is how the actual `each()`
|
|
|
|
|
function for vectors works. `vec::each` though is a
|
|
|
|
|
[generic](#generics) function, so must be efficient to use for all
|
|
|
|
|
types. Passing the elements by pointer avoids copying potentially
|
|
|
|
|
large objects.
|
|
|
|
|
|
|
|
|
|
As a caller, if we use a closure to provide the final operator
|
|
|
|
|
argument, we can write it in a way that has a pleasant, block-like
|
|
|
|
|
structure.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-09 01:21:58 +01:00
|
|
|
|
# fn each(v: &[int], op: &fn(v: &int)) { }
|
2012-10-07 10:52:06 +02:00
|
|
|
|
# fn do_some_work(i: &int) { }
|
2012-10-13 03:47:46 +02:00
|
|
|
|
each([1, 2, 3], |n| {
|
2012-10-07 10:52:06 +02:00
|
|
|
|
do_some_work(n);
|
2012-07-03 08:04:55 +02:00
|
|
|
|
});
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-07-04 10:50:51 +02:00
|
|
|
|
This is such a useful pattern that Rust has a special form of function
|
|
|
|
|
call that can be written more like a built-in control structure:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-09 01:21:58 +01:00
|
|
|
|
# fn each(v: &[int], op: &fn(v: &int)) { }
|
2012-10-07 10:52:06 +02:00
|
|
|
|
# fn do_some_work(i: &int) { }
|
2012-10-13 03:47:46 +02:00
|
|
|
|
do each([1, 2, 3]) |n| {
|
2012-10-07 10:52:06 +02:00
|
|
|
|
do_some_work(n);
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-07-04 10:50:51 +02:00
|
|
|
|
The call is prefixed with the keyword `do` and, instead of writing the
|
2012-10-11 06:06:22 +02:00
|
|
|
|
final closure inside the argument list, it appears outside of the
|
|
|
|
|
parentheses, where it looks more like a typical block of
|
2012-10-04 21:41:45 +02:00
|
|
|
|
code.
|
2012-07-04 10:50:51 +02:00
|
|
|
|
|
2012-10-11 06:06:22 +02:00
|
|
|
|
`do` is a convenient way to create tasks with the `task::spawn`
|
2012-10-13 02:48:45 +02:00
|
|
|
|
function. `spawn` has the signature `spawn(fn: ~fn())`. In other
|
2012-10-11 06:06:22 +02:00
|
|
|
|
words, it is a function that takes an owned closure that takes no
|
|
|
|
|
arguments.
|
2012-07-04 10:50:51 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
use core::task::spawn;
|
2012-07-04 10:50:51 +02:00
|
|
|
|
|
|
|
|
|
do spawn() || {
|
2012-08-23 02:44:14 +02:00
|
|
|
|
debug!("I'm a task, whatever");
|
2012-07-04 10:50:51 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
Look at all those bars and parentheses -- that's two empty argument
|
2012-10-07 10:52:06 +02:00
|
|
|
|
lists back to back. Since that is so unsightly, empty argument lists
|
|
|
|
|
may be omitted from `do` expressions.
|
2012-07-04 10:50:51 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use core::task::spawn;
|
2012-07-04 10:50:51 +02:00
|
|
|
|
do spawn {
|
2012-08-23 02:44:14 +02:00
|
|
|
|
debug!("Kablam!");
|
2012-07-04 10:50:51 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-07-07 00:10:12 +02:00
|
|
|
|
## For loops
|
2012-04-18 15:41:33 +02:00
|
|
|
|
|
2012-10-11 06:06:22 +02:00
|
|
|
|
The most common way to express iteration in Rust is with a `for`
|
|
|
|
|
loop. Like `do`, `for` is a nice syntax for describing control flow
|
|
|
|
|
with closures. Additionally, within a `for` loop, `break`, `loop`,
|
|
|
|
|
and `return` work just as they do with `while` and `loop`.
|
2012-04-18 15:41:33 +02:00
|
|
|
|
|
2012-07-04 10:50:51 +02:00
|
|
|
|
Consider again our `each` function, this time improved to
|
|
|
|
|
break early when the iteratee returns `false`:
|
2012-07-03 01:27:53 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-09 01:21:58 +01:00
|
|
|
|
fn each(v: &[int], op: &fn(v: &int) -> bool) {
|
2012-07-03 01:27:53 +02:00
|
|
|
|
let mut n = 0;
|
|
|
|
|
while n < v.len() {
|
2012-09-19 06:41:37 +02:00
|
|
|
|
if !op(&v[n]) {
|
2012-07-03 01:27:53 +02:00
|
|
|
|
break;
|
|
|
|
|
}
|
|
|
|
|
n += 1;
|
|
|
|
|
}
|
|
|
|
|
}
|
2012-04-18 15:41:33 +02:00
|
|
|
|
~~~~
|
2012-07-03 01:27:53 +02:00
|
|
|
|
|
|
|
|
|
And using this function to iterate over a vector:
|
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use each = core::vec::each;
|
|
|
|
|
# use println = core::io::println;
|
2012-10-13 03:47:46 +02:00
|
|
|
|
each([2, 4, 8, 5, 16], |n| {
|
2012-09-19 06:41:37 +02:00
|
|
|
|
if *n % 2 != 0 {
|
2012-10-13 01:41:16 +02:00
|
|
|
|
println("found odd number!");
|
2012-04-18 15:41:33 +02:00
|
|
|
|
false
|
|
|
|
|
} else { true }
|
2012-06-26 22:55:56 +02:00
|
|
|
|
});
|
2012-04-18 15:41:33 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-07-04 10:50:51 +02:00
|
|
|
|
With `for`, functions like `each` can be treated more
|
2012-10-11 06:06:22 +02:00
|
|
|
|
like built-in looping structures. When calling `each`
|
2012-07-04 10:50:51 +02:00
|
|
|
|
in a `for` loop, instead of returning `false` to break
|
2012-07-07 00:46:31 +02:00
|
|
|
|
out of the loop, you just write `break`. To skip ahead
|
2012-09-27 05:28:39 +02:00
|
|
|
|
to the next iteration, write `loop`.
|
2012-04-18 15:41:33 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use each = core::vec::each;
|
|
|
|
|
# use println = core::io::println;
|
2012-10-13 03:47:46 +02:00
|
|
|
|
for each([2, 4, 8, 5, 16]) |n| {
|
2012-09-19 06:41:37 +02:00
|
|
|
|
if *n % 2 != 0 {
|
2012-10-13 01:41:16 +02:00
|
|
|
|
println("found odd number!");
|
2012-04-18 15:41:33 +02:00
|
|
|
|
break;
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-08-02 02:30:05 +02:00
|
|
|
|
As an added bonus, you can use the `return` keyword, which is not
|
2012-07-04 10:50:51 +02:00
|
|
|
|
normally allowed in closures, in a block that appears as the body of a
|
2012-10-11 06:06:22 +02:00
|
|
|
|
`for` loop: the meaning of `return` in such a block is to return from
|
|
|
|
|
the enclosing function, not just the loop body.
|
2012-04-18 15:41:33 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use each = core::vec::each;
|
2012-09-24 03:45:42 +02:00
|
|
|
|
fn contains(v: &[int], elt: int) -> bool {
|
2012-07-03 01:27:53 +02:00
|
|
|
|
for each(v) |x| {
|
2012-09-19 06:41:37 +02:00
|
|
|
|
if (*x == elt) { return true; }
|
2012-04-18 15:41:33 +02:00
|
|
|
|
}
|
|
|
|
|
false
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-21 01:51:37 +01:00
|
|
|
|
Notice that, because `each` passes each value by borrowed pointer,
|
2012-12-30 22:09:34 +01:00
|
|
|
|
the iteratee needs to dereference it before using it.
|
2012-12-21 01:51:37 +01:00
|
|
|
|
In these situations it can be convenient to lean on Rust's
|
|
|
|
|
argument patterns to bind `x` to the actual value, not the pointer.
|
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use each = core::vec::each;
|
2012-12-21 01:51:37 +01:00
|
|
|
|
# fn contains(v: &[int], elt: int) -> bool {
|
|
|
|
|
for each(v) |&x| {
|
|
|
|
|
if (x == elt) { return true; }
|
|
|
|
|
}
|
|
|
|
|
# false
|
|
|
|
|
# }
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-07-04 10:50:51 +02:00
|
|
|
|
`for` syntax only works with stack closures.
|
|
|
|
|
|
2012-10-04 21:41:45 +02:00
|
|
|
|
> ***Note:*** This is, essentially, a special loop protocol:
|
|
|
|
|
> the keywords `break`, `loop`, and `return` work, in varying degree,
|
|
|
|
|
> with `while`, `loop`, `do`, and `for` constructs.
|
|
|
|
|
|
2012-12-20 11:22:05 +01:00
|
|
|
|
# Methods
|
|
|
|
|
|
|
|
|
|
Methods are like functions except that they always begin with a special argument,
|
|
|
|
|
called `self`,
|
|
|
|
|
which has the type of the method's receiver. The
|
|
|
|
|
`self` argument is like `this` in C++ and many other languages.
|
|
|
|
|
Methods are called with dot notation, as in `my_vec.len()`.
|
|
|
|
|
|
|
|
|
|
_Implementations_, written with the `impl` keyword, can define
|
|
|
|
|
methods on most Rust types, including structs and enums.
|
|
|
|
|
As an example, let's define a `draw` method on our `Shape` enum.
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# fn draw_circle(p: Point, f: float) { }
|
|
|
|
|
# fn draw_rectangle(p: Point, p: Point) { }
|
|
|
|
|
struct Point {
|
|
|
|
|
x: float,
|
|
|
|
|
y: float
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
enum Shape {
|
|
|
|
|
Circle(Point, float),
|
|
|
|
|
Rectangle(Point, Point)
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
impl Shape {
|
|
|
|
|
fn draw(&self) {
|
|
|
|
|
match *self {
|
|
|
|
|
Circle(p, f) => draw_circle(p, f),
|
|
|
|
|
Rectangle(p1, p2) => draw_rectangle(p1, p2)
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
let s = Circle(Point { x: 1f, y: 2f }, 3f);
|
|
|
|
|
s.draw();
|
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
This defines an _implementation_ for `Shape` containing a single
|
|
|
|
|
method, `draw`. In most respects the `draw` method is defined
|
|
|
|
|
like any other function, except for the name `self`.
|
|
|
|
|
|
|
|
|
|
The type of `self` is the type on which the method is implemented,
|
|
|
|
|
or a pointer thereof. As an argument it is written either `self`,
|
|
|
|
|
`&self`, `@self`, or `~self`.
|
|
|
|
|
A caller must in turn have a compatible pointer type to call the method.
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# fn draw_circle(p: Point, f: float) { }
|
|
|
|
|
# fn draw_rectangle(p: Point, p: Point) { }
|
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
# enum Shape {
|
|
|
|
|
# Circle(Point, float),
|
|
|
|
|
# Rectangle(Point, Point)
|
|
|
|
|
# }
|
|
|
|
|
impl Shape {
|
|
|
|
|
fn draw_borrowed(&self) { ... }
|
|
|
|
|
fn draw_managed(@self) { ... }
|
|
|
|
|
fn draw_owned(~self) { ... }
|
|
|
|
|
fn draw_value(self) { ... }
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
let s = Circle(Point { x: 1f, y: 2f }, 3f);
|
|
|
|
|
|
|
|
|
|
(@s).draw_managed();
|
|
|
|
|
(~s).draw_owned();
|
|
|
|
|
(&s).draw_borrowed();
|
|
|
|
|
s.draw_value();
|
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
Methods typically take a borrowed pointer self type,
|
|
|
|
|
so the compiler will go to great lengths to convert a callee
|
|
|
|
|
to a borrowed pointer.
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# fn draw_circle(p: Point, f: float) { }
|
|
|
|
|
# fn draw_rectangle(p: Point, p: Point) { }
|
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
# enum Shape {
|
|
|
|
|
# Circle(Point, float),
|
|
|
|
|
# Rectangle(Point, Point)
|
|
|
|
|
# }
|
|
|
|
|
# impl Shape {
|
|
|
|
|
# fn draw_borrowed(&self) { ... }
|
|
|
|
|
# fn draw_managed(@self) { ... }
|
|
|
|
|
# fn draw_owned(~self) { ... }
|
|
|
|
|
# fn draw_value(self) { ... }
|
|
|
|
|
# }
|
|
|
|
|
# let s = Circle(Point { x: 1f, y: 2f }, 3f);
|
|
|
|
|
// As with typical function arguments, managed and unique pointers
|
|
|
|
|
// are automatically converted to borrowed pointers
|
|
|
|
|
|
|
|
|
|
(@s).draw_borrowed();
|
|
|
|
|
(~s).draw_borrowed();
|
|
|
|
|
|
|
|
|
|
// Unlike typical function arguments, the self value will
|
|
|
|
|
// automatically be referenced ...
|
|
|
|
|
s.draw_borrowed();
|
|
|
|
|
|
|
|
|
|
// ... and dereferenced
|
|
|
|
|
(& &s).draw_borrowed();
|
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
// ... and dereferenced and borrowed
|
2012-12-20 11:22:05 +01:00
|
|
|
|
(&@~s).draw_borrowed();
|
|
|
|
|
~~~
|
|
|
|
|
|
2013-03-23 01:24:26 +01:00
|
|
|
|
Implementations may also define standalone (sometimes called "static")
|
|
|
|
|
methods. The absence of a `self` paramater distinguishes such methods.
|
|
|
|
|
These methods are the preferred way to define constructor functions.
|
2012-12-20 11:22:05 +01:00
|
|
|
|
|
|
|
|
|
~~~~ {.xfail-test}
|
|
|
|
|
impl Circle {
|
|
|
|
|
fn area(&self) -> float { ... }
|
2013-03-23 01:24:26 +01:00
|
|
|
|
fn new(area: float) -> Circle { ... }
|
2012-12-20 11:22:05 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2013-03-23 01:24:26 +01:00
|
|
|
|
To call such a method, just prefix it with the type name and a double colon:
|
2012-12-20 11:22:05 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use core::float::consts::pi;
|
|
|
|
|
# use core::float::sqrt;
|
2012-12-20 11:22:05 +01:00
|
|
|
|
struct Circle { radius: float }
|
|
|
|
|
impl Circle {
|
2013-03-23 01:24:26 +01:00
|
|
|
|
fn new(area: float) -> Circle { Circle { radius: sqrt(area / pi) } }
|
2012-12-20 11:22:05 +01:00
|
|
|
|
}
|
|
|
|
|
let c = Circle::new(42.5);
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-01-19 12:51:20 +01:00
|
|
|
|
# Generics
|
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
Throughout this tutorial, we've been defining functions that act only
|
|
|
|
|
on specific data types. With type parameters we can also define
|
|
|
|
|
functions whose arguments have generic types, and which can be invoked
|
|
|
|
|
with a variety of types. Consider a generic `map` function, which
|
|
|
|
|
takes a function `function` and a vector `vector` and returns a new
|
|
|
|
|
vector consisting of the result of applying `function` to each element
|
|
|
|
|
of `vector`:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-09 01:21:58 +01:00
|
|
|
|
fn map<T, U>(vector: &[T], function: &fn(v: &T) -> U) -> ~[U] {
|
2012-07-08 22:58:37 +02:00
|
|
|
|
let mut accumulator = ~[];
|
2012-09-19 06:41:37 +02:00
|
|
|
|
for vec::each(vector) |element| {
|
2012-09-27 02:33:34 +02:00
|
|
|
|
accumulator.push(function(element));
|
2012-07-10 00:28:08 +02:00
|
|
|
|
}
|
2012-12-20 11:37:21 +01:00
|
|
|
|
return accumulator;
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-24 06:07:01 +02:00
|
|
|
|
When defined with type parameters, as denoted by `<T, U>`, this
|
|
|
|
|
function can be applied to any type of vector, as long as the type of
|
2012-10-11 06:29:25 +02:00
|
|
|
|
`function`'s argument and the type of the vector's contents agree with
|
2012-09-24 06:07:01 +02:00
|
|
|
|
each other.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-07-10 00:28:08 +02:00
|
|
|
|
Inside a generic function, the names of the type parameters
|
2012-10-11 06:29:25 +02:00
|
|
|
|
(capitalized by convention) stand for opaque types. All you can do
|
|
|
|
|
with instances of these types is pass them around: you can't apply any
|
|
|
|
|
operations to them or pattern-match on them. Note that instances of
|
2012-12-30 22:09:34 +01:00
|
|
|
|
generic types are often passed by pointer. For example, the parameter
|
2012-10-11 06:29:25 +02:00
|
|
|
|
`function()` is supplied with a pointer to a value of type `T` and not
|
2012-12-30 22:09:34 +01:00
|
|
|
|
a value of type `T` itself. This ensures that the function works with
|
2012-10-11 06:29:25 +02:00
|
|
|
|
the broadest set of types possible, since some types are expensive or
|
|
|
|
|
illegal to copy and pass by value.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-01 02:36:40 +02:00
|
|
|
|
Generic `type`, `struct`, and `enum` declarations follow the same pattern:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-21 20:57:33 +01:00
|
|
|
|
# use core::hashmap::linear::LinearMap;
|
|
|
|
|
type Set<T> = LinearMap<T, ()>;
|
2012-09-24 06:07:01 +02:00
|
|
|
|
|
2012-09-01 02:36:40 +02:00
|
|
|
|
struct Stack<T> {
|
2013-01-30 04:34:16 +01:00
|
|
|
|
elements: ~[T]
|
2012-09-01 02:36:40 +02:00
|
|
|
|
}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-12-11 05:56:25 +01:00
|
|
|
|
enum Option<T> {
|
|
|
|
|
Some(T),
|
|
|
|
|
None
|
2012-09-01 02:36:40 +02:00
|
|
|
|
}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
These declarations can be instantiated to valid types like `Set<int>`,
|
2012-12-30 22:09:34 +01:00
|
|
|
|
`Stack<int>`, and `Option<int>`.
|
2012-12-11 05:56:25 +01:00
|
|
|
|
|
|
|
|
|
The last type in that example, `Option`, appears frequently in Rust code.
|
|
|
|
|
Because Rust does not have null pointers (except in unsafe code), we need
|
|
|
|
|
another way to write a function whose result isn't defined on every possible
|
|
|
|
|
combination of arguments of the appropriate types. The usual way is to write
|
|
|
|
|
a function that returns `Option<T>` instead of `T`.
|
|
|
|
|
|
|
|
|
|
~~~~
|
2012-12-30 22:09:34 +01:00
|
|
|
|
# struct Point { x: float, y: float }
|
2012-12-11 22:43:14 +01:00
|
|
|
|
# enum Shape { Circle(Point, float), Rectangle(Point, Point) }
|
2012-12-11 05:56:25 +01:00
|
|
|
|
fn radius(shape: Shape) -> Option<float> {
|
2012-12-30 22:09:34 +01:00
|
|
|
|
match shape {
|
|
|
|
|
Circle(_, radius) => Some(radius),
|
|
|
|
|
Rectangle(*) => None
|
|
|
|
|
}
|
2012-12-11 05:56:25 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
The Rust compiler compiles generic functions very efficiently by
|
|
|
|
|
*monomorphizing* them. *Monomorphization* is a fancy name for a simple
|
2012-12-20 11:22:56 +01:00
|
|
|
|
idea: generate a separate copy of each generic function at each call site,
|
|
|
|
|
a copy that is specialized to the argument
|
2012-10-11 06:29:25 +02:00
|
|
|
|
types and can thus be optimized specifically for them. In this
|
|
|
|
|
respect, Rust's generics have similar performance characteristics to
|
|
|
|
|
C++ templates.
|
2012-09-24 06:07:01 +02:00
|
|
|
|
|
2012-09-25 02:37:33 +02:00
|
|
|
|
## Traits
|
2012-09-24 06:07:01 +02:00
|
|
|
|
|
2012-09-25 02:37:33 +02:00
|
|
|
|
Within a generic function the operations available on generic types
|
|
|
|
|
are very limited. After all, since the function doesn't know what
|
|
|
|
|
types it is operating on, it can't safely modify or query their
|
|
|
|
|
values. This is where _traits_ come into play. Traits are Rust's most
|
|
|
|
|
powerful tool for writing polymorphic code. Java developers will see
|
2012-10-11 06:29:25 +02:00
|
|
|
|
them as similar to Java interfaces, and Haskellers will notice their
|
|
|
|
|
similarities to type classes. Rust's traits are a form of *bounded
|
|
|
|
|
polymorphism*: a trait is a way of limiting the set of possible types
|
|
|
|
|
that a type parameter could refer to.
|
|
|
|
|
|
|
|
|
|
As motivation, let us consider copying in Rust. The `copy` operation
|
|
|
|
|
is not defined for all Rust types. One reason is user-defined
|
|
|
|
|
destructors: copying a type that has a destructor could result in the
|
|
|
|
|
destructor running multiple times. Therefore, types with user-defined
|
|
|
|
|
destructors cannot be copied, either implicitly or explicitly, and
|
2012-12-19 01:27:26 +01:00
|
|
|
|
neither can types that own other types containing destructors.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
This complicates handling of generic functions. If you have a type
|
|
|
|
|
parameter `T`, can you copy values of that type? In Rust, you can't,
|
2012-09-25 02:37:33 +02:00
|
|
|
|
and if you try to run the following code the compiler will complain.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-25 02:37:33 +02:00
|
|
|
|
~~~~ {.xfail-test}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
// This does not compile
|
2012-09-25 02:37:33 +02:00
|
|
|
|
fn head_bad<T>(v: &[T]) -> T {
|
|
|
|
|
v[0] // error: copying a non-copyable value
|
2012-09-24 06:07:01 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
However, we can tell the compiler that the `head` function is only for
|
|
|
|
|
copyable types: that is, those that have the `Copy` trait.
|
2012-09-25 02:37:33 +02:00
|
|
|
|
|
2012-09-24 06:07:01 +02:00
|
|
|
|
~~~~
|
2012-01-19 12:51:20 +01:00
|
|
|
|
// This does
|
2012-09-25 02:37:33 +02:00
|
|
|
|
fn head<T: Copy>(v: &[T]) -> T {
|
|
|
|
|
v[0]
|
2012-09-24 06:07:01 +02:00
|
|
|
|
}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-25 02:37:33 +02:00
|
|
|
|
This says that we can call `head` on any type `T` as long as that type
|
|
|
|
|
implements the `Copy` trait. When instantiating a generic function,
|
|
|
|
|
you can only instantiate it with types that implement the correct
|
2012-10-11 06:29:25 +02:00
|
|
|
|
trait, so you could not apply `head` to a type with a
|
|
|
|
|
destructor. (`Copy` is a special trait that is built in to the
|
|
|
|
|
compiler, making it possible for the compiler to enforce this
|
|
|
|
|
restriction.)
|
2012-09-24 06:07:01 +02:00
|
|
|
|
|
|
|
|
|
While most traits can be defined and implemented by user code, three
|
2012-09-25 02:37:33 +02:00
|
|
|
|
traits are automatically derived and implemented for all applicable
|
|
|
|
|
types by the compiler, and may not be overridden:
|
2012-09-24 06:07:01 +02:00
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
* `Copy` - Types that can be copied, either implicitly, or explicitly with the
|
2012-11-06 02:50:01 +01:00
|
|
|
|
`copy` operator. All types are copyable unless they have destructors or
|
|
|
|
|
contain types with destructors.
|
2012-09-24 06:07:01 +02:00
|
|
|
|
|
2012-12-12 01:51:37 +01:00
|
|
|
|
* `Owned` - Owned types. Types are owned unless they contain managed
|
2012-12-30 22:09:34 +01:00
|
|
|
|
boxes, managed closures, or borrowed pointers. Owned types may or
|
2012-12-12 01:51:37 +01:00
|
|
|
|
may not be copyable.
|
2012-09-24 06:07:01 +02:00
|
|
|
|
|
|
|
|
|
* `Const` - Constant (immutable) types. These are types that do not contain
|
|
|
|
|
mutable fields.
|
|
|
|
|
|
|
|
|
|
> ***Note:*** These three traits were referred to as 'kinds' in earlier
|
|
|
|
|
> iterations of the language, and often still are.
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
2012-12-19 01:27:26 +01:00
|
|
|
|
Additionally, the `Drop` trait is used to define destructors. This
|
|
|
|
|
trait defines one method called `finalize`, which is automatically
|
2012-12-19 19:46:24 +01:00
|
|
|
|
called when a value of the type that implements this trait is
|
2012-12-19 01:27:26 +01:00
|
|
|
|
destroyed, either because the value went out of scope or because the
|
|
|
|
|
garbage collector reclaimed it.
|
2012-11-06 02:50:01 +01:00
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
struct TimeBomb {
|
2012-12-30 22:09:34 +01:00
|
|
|
|
explosivity: uint
|
2012-11-06 02:50:01 +01:00
|
|
|
|
}
|
|
|
|
|
|
2013-02-14 08:19:27 +01:00
|
|
|
|
impl Drop for TimeBomb {
|
2012-11-30 01:23:24 +01:00
|
|
|
|
fn finalize(&self) {
|
2012-11-06 02:50:01 +01:00
|
|
|
|
for iter::repeat(self.explosivity) {
|
|
|
|
|
io::println("blam!");
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
It is illegal to call `finalize` directly. Only code inserted by the compiler
|
|
|
|
|
may call it.
|
|
|
|
|
|
2012-09-24 06:07:01 +02:00
|
|
|
|
## Declaring and implementing traits
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
2012-10-03 05:28:53 +02:00
|
|
|
|
A trait consists of a set of methods, without bodies, or may be empty,
|
2012-12-12 01:51:37 +01:00
|
|
|
|
as is the case with `Copy`, `Owned`, and `Const`. For example, we could
|
2012-10-03 05:28:53 +02:00
|
|
|
|
declare the trait `Printable` for things that can be printed to the
|
|
|
|
|
console, with a single method:
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-25 02:37:33 +02:00
|
|
|
|
trait Printable {
|
2012-12-20 10:51:28 +01:00
|
|
|
|
fn print(&self);
|
2012-09-08 02:12:16 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-03 05:28:53 +02:00
|
|
|
|
Traits may be implemented for specific types with [impls]. An impl
|
2012-10-12 05:13:04 +02:00
|
|
|
|
that implements a trait includes the name of the trait at the start of
|
2012-10-03 05:28:53 +02:00
|
|
|
|
the definition, as in the following impls of `Printable` for `int`
|
2013-02-26 20:34:00 +01:00
|
|
|
|
and `~str`.
|
2012-10-03 05:28:53 +02:00
|
|
|
|
|
|
|
|
|
[impls]: #functions-and-methods
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-12-20 10:51:28 +01:00
|
|
|
|
# trait Printable { fn print(&self); }
|
2013-02-14 08:19:27 +01:00
|
|
|
|
impl Printable for int {
|
2012-12-20 10:51:28 +01:00
|
|
|
|
fn print(&self) { io::println(fmt!("%d", *self)) }
|
2012-09-08 02:12:16 +02:00
|
|
|
|
}
|
2012-09-25 02:37:33 +02:00
|
|
|
|
|
2013-02-26 20:34:00 +01:00
|
|
|
|
impl Printable for ~str {
|
2012-12-20 10:51:28 +01:00
|
|
|
|
fn print(&self) { io::println(*self) }
|
2012-09-08 02:12:16 +02:00
|
|
|
|
}
|
2012-09-24 06:07:01 +02:00
|
|
|
|
|
2012-09-25 02:37:33 +02:00
|
|
|
|
# 1.print();
|
2013-02-26 20:34:00 +01:00
|
|
|
|
# (~"foo").print();
|
2012-09-08 02:12:16 +02:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
Methods defined in an implementation of a trait may be called just like
|
2012-10-03 05:28:53 +02:00
|
|
|
|
any other method, using dot notation, as in `1.print()`. Traits may
|
|
|
|
|
themselves contain type parameters. A trait for generalized sequence
|
|
|
|
|
types might look like the following:
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-09-16 03:44:44 +02:00
|
|
|
|
trait Seq<T> {
|
2012-12-20 10:51:28 +01:00
|
|
|
|
fn len(&self) -> uint;
|
2013-03-09 01:21:58 +01:00
|
|
|
|
fn iter(&self, b: &fn(v: &T));
|
2012-09-08 02:12:16 +02:00
|
|
|
|
}
|
2012-09-25 02:37:33 +02:00
|
|
|
|
|
2013-02-14 08:19:27 +01:00
|
|
|
|
impl<T> Seq<T> for ~[T] {
|
2012-12-20 10:51:28 +01:00
|
|
|
|
fn len(&self) -> uint { vec::len(*self) }
|
2013-03-09 01:21:58 +01:00
|
|
|
|
fn iter(&self, b: &fn(v: &T)) {
|
2012-12-20 10:51:28 +01:00
|
|
|
|
for vec::each(*self) |elt| { b(elt); }
|
2012-09-08 02:12:16 +02:00
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-25 02:37:33 +02:00
|
|
|
|
The implementation has to explicitly declare the type parameter that
|
|
|
|
|
it binds, `T`, before using it to specify its trait type. Rust
|
|
|
|
|
requires this declaration because the `impl` could also, for example,
|
2012-10-11 06:29:25 +02:00
|
|
|
|
specify an implementation of `Seq<int>`. The trait type (appearing
|
2013-02-14 08:19:27 +01:00
|
|
|
|
between `impl` and `for`) *refers* to a type, rather than
|
2012-09-25 02:37:33 +02:00
|
|
|
|
defining one.
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
|
|
|
|
The type parameters bound by a trait are in scope in each of the
|
|
|
|
|
method declarations. So, re-declaring the type parameter
|
2012-10-11 06:29:25 +02:00
|
|
|
|
`T` as an explicit type parameter for `len`, in either the trait or
|
|
|
|
|
the impl, would be a compile-time error.
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
2012-09-25 02:49:04 +02:00
|
|
|
|
Within a trait definition, `self` is a special type that you can think
|
|
|
|
|
of as a type parameter. An implementation of the trait for any given
|
|
|
|
|
type `T` replaces the `self` type parameter with `T`. Simply, in a
|
|
|
|
|
trait, `self` is a type, and in an impl, `self` is a value. The
|
|
|
|
|
following trait describes types that support an equality operation:
|
|
|
|
|
|
|
|
|
|
~~~~
|
2013-01-31 04:42:06 +01:00
|
|
|
|
// In a trait, `self` refers to the self argument.
|
|
|
|
|
// `Self` refers to the type implementing the trait.
|
2012-09-25 02:49:04 +02:00
|
|
|
|
trait Eq {
|
2013-01-31 04:42:06 +01:00
|
|
|
|
fn equals(&self, other: &Self) -> bool;
|
2012-09-25 02:49:04 +02:00
|
|
|
|
}
|
|
|
|
|
|
2012-12-20 10:51:28 +01:00
|
|
|
|
// In an impl, `self` refers just to the value of the receiver
|
2013-02-14 08:19:27 +01:00
|
|
|
|
impl Eq for int {
|
2012-12-30 22:09:34 +01:00
|
|
|
|
fn equals(&self, other: &int) -> bool { *other == *self }
|
2012-09-25 02:49:04 +02:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-20 10:51:28 +01:00
|
|
|
|
Notice that in the trait definition, `equals` takes a
|
|
|
|
|
second parameter of type `self`.
|
|
|
|
|
In contrast, in the `impl`, `equals` takes a second parameter of
|
|
|
|
|
type `int`, only using `self` as the name of the receiver.
|
2012-09-25 02:49:04 +02:00
|
|
|
|
|
2013-03-23 01:24:26 +01:00
|
|
|
|
Just as in type implementations, traits can define standalone (static)
|
|
|
|
|
methods. These methods are called by prefixing the method name with the trait
|
|
|
|
|
name and a double colon. The compiler uses type inference to decide which
|
|
|
|
|
implementation to use.
|
2012-12-20 11:15:59 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use core::float::consts::pi;
|
|
|
|
|
# use core::float::sqrt;
|
2013-03-27 05:17:29 +01:00
|
|
|
|
trait Shape { fn new(area: float) -> Self; }
|
2012-12-20 11:15:59 +01:00
|
|
|
|
struct Circle { radius: float }
|
|
|
|
|
struct Square { length: float }
|
|
|
|
|
|
2013-02-14 08:19:27 +01:00
|
|
|
|
impl Shape for Circle {
|
2013-03-23 01:24:26 +01:00
|
|
|
|
fn new(area: float) -> Circle { Circle { radius: sqrt(area / pi) } }
|
2012-12-20 11:15:59 +01:00
|
|
|
|
}
|
2013-02-14 08:19:27 +01:00
|
|
|
|
impl Shape for Square {
|
2013-03-23 01:24:26 +01:00
|
|
|
|
fn new(area: float) -> Square { Square { length: sqrt(area) } }
|
2012-12-20 11:15:59 +01:00
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
let area = 42.5;
|
|
|
|
|
let c: Circle = Shape::new(area);
|
|
|
|
|
let s: Square = Shape::new(area);
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-09-25 02:37:33 +02:00
|
|
|
|
## Bounded type parameters and static method dispatch
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
Traits give us a language for defining predicates on types, or
|
|
|
|
|
abstract properties that types can have. We can use this language to
|
|
|
|
|
define _bounds_ on type parameters, so that we can then operate on
|
|
|
|
|
generic types.
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-12-20 10:51:28 +01:00
|
|
|
|
# trait Printable { fn print(&self); }
|
2012-09-25 02:37:33 +02:00
|
|
|
|
fn print_all<T: Printable>(printable_things: ~[T]) {
|
|
|
|
|
for printable_things.each |thing| {
|
|
|
|
|
thing.print();
|
|
|
|
|
}
|
2012-09-08 02:12:16 +02:00
|
|
|
|
}
|
2012-09-25 02:37:33 +02:00
|
|
|
|
~~~~
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
Declaring `T` as conforming to the `Printable` trait (as we earlier
|
|
|
|
|
did with `Copy`) makes it possible to call methods from that trait
|
|
|
|
|
on values of type `T` inside the function. It will also cause a
|
2012-09-25 02:37:33 +02:00
|
|
|
|
compile-time error when anyone tries to call `print_all` on an array
|
|
|
|
|
whose element type does not have a `Printable` implementation.
|
|
|
|
|
|
2013-02-21 03:04:57 +01:00
|
|
|
|
Type parameters can have multiple bounds by separating them with `+`,
|
2012-10-11 06:29:25 +02:00
|
|
|
|
as in this version of `print_all` that copies elements.
|
2012-09-25 02:37:33 +02:00
|
|
|
|
|
|
|
|
|
~~~
|
2012-12-20 10:51:28 +01:00
|
|
|
|
# trait Printable { fn print(&self); }
|
2013-02-21 03:04:57 +01:00
|
|
|
|
fn print_all<T: Printable + Copy>(printable_things: ~[T]) {
|
2012-09-25 02:37:33 +02:00
|
|
|
|
let mut i = 0;
|
|
|
|
|
while i < printable_things.len() {
|
2012-11-26 19:28:17 +01:00
|
|
|
|
let copy_of_thing = printable_things[i];
|
2012-09-25 02:37:33 +02:00
|
|
|
|
copy_of_thing.print();
|
2012-11-26 19:28:17 +01:00
|
|
|
|
i += 1;
|
2012-09-25 02:37:33 +02:00
|
|
|
|
}
|
2012-09-08 02:12:16 +02:00
|
|
|
|
}
|
2012-09-25 02:37:33 +02:00
|
|
|
|
~~~
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
2012-09-25 02:37:33 +02:00
|
|
|
|
Method calls to bounded type parameters are _statically dispatched_,
|
|
|
|
|
imposing no more overhead than normal function invocation, so are
|
|
|
|
|
the preferred way to use traits polymorphically.
|
2012-09-08 02:12:16 +02:00
|
|
|
|
|
2012-09-25 02:37:33 +02:00
|
|
|
|
This usage of traits is similar to Haskell type classes.
|
|
|
|
|
|
2012-12-20 09:33:15 +01:00
|
|
|
|
## Trait objects and dynamic method dispatch
|
|
|
|
|
|
|
|
|
|
The above allows us to define functions that polymorphically act on
|
|
|
|
|
values of a single unknown type that conforms to a given trait.
|
|
|
|
|
However, consider this function:
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
# type Circle = int; type Rectangle = int;
|
2013-02-14 08:19:27 +01:00
|
|
|
|
# impl Drawable for int { fn draw(&self) {} }
|
2012-12-20 09:33:15 +01:00
|
|
|
|
# fn new_circle() -> int { 1 }
|
2012-12-20 10:51:28 +01:00
|
|
|
|
trait Drawable { fn draw(&self); }
|
2012-12-20 09:33:15 +01:00
|
|
|
|
|
|
|
|
|
fn draw_all<T: Drawable>(shapes: ~[T]) {
|
|
|
|
|
for shapes.each |shape| { shape.draw(); }
|
|
|
|
|
}
|
|
|
|
|
# let c: Circle = new_circle();
|
|
|
|
|
# draw_all(~[c]);
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-30 22:09:34 +01:00
|
|
|
|
You can call that on an array of circles, or an array of rectangles
|
2012-12-20 09:33:15 +01:00
|
|
|
|
(assuming those have suitable `Drawable` traits defined), but not on
|
2012-12-30 22:09:34 +01:00
|
|
|
|
an array containing both circles and rectangles. When such behavior is
|
2012-12-20 09:33:15 +01:00
|
|
|
|
needed, a trait name can alternately be used as a type, called
|
|
|
|
|
an _object_.
|
|
|
|
|
|
|
|
|
|
~~~~
|
2012-12-20 10:51:28 +01:00
|
|
|
|
# trait Drawable { fn draw(&self); }
|
2012-12-20 09:33:15 +01:00
|
|
|
|
fn draw_all(shapes: &[@Drawable]) {
|
|
|
|
|
for shapes.each |shape| { shape.draw(); }
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
In this example, there is no type parameter. Instead, the `@Drawable`
|
|
|
|
|
type denotes any managed box value that implements the `Drawable`
|
|
|
|
|
trait. To construct such a value, you use the `as` operator to cast a
|
|
|
|
|
value to an object:
|
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
# type Circle = int; type Rectangle = bool;
|
2012-12-20 10:51:28 +01:00
|
|
|
|
# trait Drawable { fn draw(&self); }
|
2012-12-20 09:33:15 +01:00
|
|
|
|
# fn new_circle() -> Circle { 1 }
|
|
|
|
|
# fn new_rectangle() -> Rectangle { true }
|
|
|
|
|
# fn draw_all(shapes: &[@Drawable]) {}
|
|
|
|
|
|
2013-02-14 08:19:27 +01:00
|
|
|
|
impl Drawable for Circle { fn draw(&self) { ... } }
|
|
|
|
|
impl Drawable for Rectangle { fn draw(&self) { ... } }
|
2012-12-20 09:33:15 +01:00
|
|
|
|
|
|
|
|
|
let c: @Circle = @new_circle();
|
|
|
|
|
let r: @Rectangle = @new_rectangle();
|
|
|
|
|
draw_all([c as @Drawable, r as @Drawable]);
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
We omit the code for `new_circle` and `new_rectangle`; imagine that
|
|
|
|
|
these just return `Circle`s and `Rectangle`s with a default size. Note
|
|
|
|
|
that, like strings and vectors, objects have dynamic size and may
|
|
|
|
|
only be referred to via one of the pointer types.
|
|
|
|
|
Other pointer types work as well.
|
|
|
|
|
Casts to traits may only be done with compatible pointers so,
|
|
|
|
|
for example, an `@Circle` may not be cast to an `~Drawable`.
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# type Circle = int; type Rectangle = int;
|
2012-12-20 10:51:28 +01:00
|
|
|
|
# trait Drawable { fn draw(&self); }
|
2013-02-14 08:19:27 +01:00
|
|
|
|
# impl Drawable for int { fn draw(&self) {} }
|
2012-12-20 09:33:15 +01:00
|
|
|
|
# fn new_circle() -> int { 1 }
|
|
|
|
|
# fn new_rectangle() -> int { 2 }
|
|
|
|
|
// A managed object
|
|
|
|
|
let boxy: @Drawable = @new_circle() as @Drawable;
|
|
|
|
|
// An owned object
|
|
|
|
|
let owny: ~Drawable = ~new_circle() as ~Drawable;
|
|
|
|
|
// A borrowed object
|
|
|
|
|
let stacky: &Drawable = &new_circle() as &Drawable;
|
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
Method calls to trait types are _dynamically dispatched_. Since the
|
|
|
|
|
compiler doesn't know specifically which functions to call at compile
|
|
|
|
|
time, it uses a lookup table (also known as a vtable or dictionary) to
|
|
|
|
|
select the method to call at runtime.
|
|
|
|
|
|
|
|
|
|
This usage of traits is similar to Java interfaces.
|
|
|
|
|
|
2012-12-20 09:30:04 +01:00
|
|
|
|
## Trait inheritance
|
2012-12-20 04:38:28 +01:00
|
|
|
|
|
2012-12-20 09:30:04 +01:00
|
|
|
|
We can write a trait declaration that _inherits_ from other traits, called _supertraits_.
|
|
|
|
|
Types that implement a trait must also implement its supertraits.
|
2012-12-21 01:36:13 +01:00
|
|
|
|
For example,
|
|
|
|
|
we can define a `Circle` trait that inherits from `Shape`.
|
2012-12-20 04:38:28 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2012-12-20 10:51:28 +01:00
|
|
|
|
trait Shape { fn area(&self) -> float; }
|
|
|
|
|
trait Circle : Shape { fn radius(&self) -> float; }
|
2012-12-20 04:38:28 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-21 01:36:13 +01:00
|
|
|
|
Now, we can implement `Circle` on a type only if we also implement `Shape`.
|
2012-12-20 04:38:28 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
2013-03-27 05:17:29 +01:00
|
|
|
|
# use core::float::consts::pi;
|
|
|
|
|
# use core::float::sqrt;
|
2012-12-20 10:51:28 +01:00
|
|
|
|
# trait Shape { fn area(&self) -> float; }
|
|
|
|
|
# trait Circle : Shape { fn radius(&self) -> float; }
|
2012-12-20 04:38:28 +01:00
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
# fn square(x: float) -> float { x * x }
|
|
|
|
|
struct CircleStruct { center: Point, radius: float }
|
2013-02-14 08:19:27 +01:00
|
|
|
|
impl Circle for CircleStruct {
|
2012-12-30 22:09:34 +01:00
|
|
|
|
fn radius(&self) -> float { sqrt(self.area() / pi) }
|
2012-12-20 04:38:28 +01:00
|
|
|
|
}
|
2013-02-14 08:19:27 +01:00
|
|
|
|
impl Shape for CircleStruct {
|
2012-12-30 22:09:34 +01:00
|
|
|
|
fn area(&self) -> float { pi * square(self.radius) }
|
2013-02-19 18:10:31 +01:00
|
|
|
|
}
|
2012-12-20 04:38:28 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-21 01:36:13 +01:00
|
|
|
|
Notice that methods of `Circle` can call methods on `Shape`, as our
|
|
|
|
|
`radius` implementation calls the `area` method.
|
2012-12-20 04:38:28 +01:00
|
|
|
|
This is a silly way to compute the radius of a circle
|
|
|
|
|
(since we could just return the `circle` field), but you get the idea.
|
|
|
|
|
|
2012-12-20 09:30:04 +01:00
|
|
|
|
In type-parameterized functions,
|
|
|
|
|
methods of the supertrait may be called on values of subtrait-bound type parameters.
|
|
|
|
|
Refering to the previous example of `trait Circle : Shape`:
|
|
|
|
|
|
|
|
|
|
~~~
|
2012-12-20 10:51:28 +01:00
|
|
|
|
# trait Shape { fn area(&self) -> float; }
|
|
|
|
|
# trait Circle : Shape { fn radius(&self) -> float; }
|
2012-12-20 09:30:04 +01:00
|
|
|
|
fn radius_times_area<T: Circle>(c: T) -> float {
|
|
|
|
|
// `c` is both a Circle and a Shape
|
|
|
|
|
c.radius() * c.area()
|
|
|
|
|
}
|
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
Likewise, supertrait methods may also be called on trait objects.
|
|
|
|
|
|
|
|
|
|
~~~ {.xfail-test}
|
2013-03-01 19:44:43 +01:00
|
|
|
|
# use core::float::consts::pi;
|
|
|
|
|
# use core::float::sqrt;
|
2013-03-27 05:17:29 +01:00
|
|
|
|
# trait Shape { fn area(&self) -> float; }
|
|
|
|
|
# trait Circle : Shape { fn radius(&self) -> float; }
|
2013-01-29 10:14:05 +01:00
|
|
|
|
# struct Point { x: float, y: float }
|
|
|
|
|
# struct CircleStruct { center: Point, radius: float }
|
2013-02-14 08:19:27 +01:00
|
|
|
|
# impl Circle for CircleStruct { fn radius(&self) -> float { sqrt(self.area() / pi) } }
|
|
|
|
|
# impl Shape for CircleStruct { fn area(&self) -> float { pi * square(self.radius) } }
|
2013-01-29 10:14:05 +01:00
|
|
|
|
|
|
|
|
|
let concrete = @CircleStruct{center:Point{x:3f,y:4f},radius:5f};
|
|
|
|
|
let mycircle: Circle = concrete as @Circle;
|
2012-12-20 09:30:04 +01:00
|
|
|
|
let nonsense = mycircle.radius() * mycircle.area();
|
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
> ***Note:*** Trait inheritance does not actually work with objects yet
|
|
|
|
|
|
2012-01-19 12:51:20 +01:00
|
|
|
|
# Modules and crates
|
|
|
|
|
|
2012-10-07 07:23:16 +02:00
|
|
|
|
The Rust namespace is arranged in a hierarchy of modules. Each source
|
|
|
|
|
(.rs) file represents a single module and may in turn contain
|
|
|
|
|
additional modules.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
~~~~
|
|
|
|
|
mod farm {
|
2012-10-13 01:41:16 +02:00
|
|
|
|
pub fn chicken() -> &str { "cluck cluck" }
|
|
|
|
|
pub fn cow() -> &str { "mooo" }
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
2012-10-07 07:23:16 +02:00
|
|
|
|
|
2012-01-19 12:51:20 +01:00
|
|
|
|
fn main() {
|
2012-03-13 04:04:27 +01:00
|
|
|
|
io::println(farm::chicken());
|
2012-01-19 12:51:20 +01:00
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-07 07:23:16 +02:00
|
|
|
|
The contents of modules can be imported into the current scope
|
|
|
|
|
with the `use` keyword, optionally giving it an alias. `use`
|
|
|
|
|
may appear at the beginning of crates, `mod`s, `fn`s, and other
|
|
|
|
|
blocks.
|
|
|
|
|
|
|
|
|
|
~~~
|
|
|
|
|
# mod farm { pub fn chicken() { } }
|
|
|
|
|
# fn main() {
|
|
|
|
|
// Bring `chicken` into scope
|
|
|
|
|
use farm::chicken;
|
|
|
|
|
|
|
|
|
|
fn chicken_farmer() {
|
|
|
|
|
// The same, but name it `my_chicken`
|
|
|
|
|
use my_chicken = farm::chicken;
|
|
|
|
|
...
|
2013-02-26 07:23:36 +01:00
|
|
|
|
# my_chicken();
|
2012-10-07 07:23:16 +02:00
|
|
|
|
}
|
2013-02-26 07:23:36 +01:00
|
|
|
|
# chicken();
|
2012-10-07 07:23:16 +02:00
|
|
|
|
# }
|
|
|
|
|
~~~
|
|
|
|
|
|
|
|
|
|
These farm animal functions have a new keyword, `pub`, attached to
|
2012-12-30 22:09:34 +01:00
|
|
|
|
them. The `pub` keyword modifies an item's visibility, making it
|
2012-10-11 06:29:25 +02:00
|
|
|
|
visible outside its containing module. An expression with `::`, like
|
|
|
|
|
`farm::chicken`, can name an item outside of its containing
|
|
|
|
|
module. Items, such as those declared with `fn`, `struct`, `enum`,
|
2013-03-23 01:24:26 +01:00
|
|
|
|
`type`, or `static`, are module-private by default.
|
2012-10-07 07:23:16 +02:00
|
|
|
|
|
|
|
|
|
Visibility restrictions in Rust exist only at module boundaries. This
|
2012-10-11 06:29:25 +02:00
|
|
|
|
is quite different from most object-oriented languages that also
|
|
|
|
|
enforce restrictions on objects themselves. That's not to say that
|
|
|
|
|
Rust doesn't support encapsulation: both struct fields and methods can
|
2012-12-30 22:09:34 +01:00
|
|
|
|
be private. But this encapsulation is at the module level, not the
|
2012-10-11 06:29:25 +02:00
|
|
|
|
struct level. Note that fields and methods are _public_ by default.
|
2012-10-07 07:23:16 +02:00
|
|
|
|
|
|
|
|
|
~~~
|
2013-03-02 00:55:31 +01:00
|
|
|
|
pub mod farm {
|
2013-01-09 04:37:25 +01:00
|
|
|
|
# pub type Chicken = int;
|
|
|
|
|
# type Cow = int;
|
2013-03-08 03:15:07 +01:00
|
|
|
|
# struct Human(int);
|
2013-01-09 04:37:25 +01:00
|
|
|
|
# impl Human { fn rest(&self) { } }
|
2013-03-02 00:55:31 +01:00
|
|
|
|
# pub fn make_me_a_farm() -> Farm { Farm { chickens: ~[], cows: ~[], farmer: Human(0) } }
|
2012-10-07 07:23:16 +02:00
|
|
|
|
pub struct Farm {
|
2013-02-26 19:02:36 +01:00
|
|
|
|
priv chickens: ~[Chicken],
|
|
|
|
|
priv cows: ~[Cow],
|
2012-10-07 07:23:16 +02:00
|
|
|
|
farmer: Human
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
impl Farm {
|
2012-12-20 10:51:28 +01:00
|
|
|
|
priv fn feed_chickens(&self) { ... }
|
|
|
|
|
priv fn feed_cows(&self) { ... }
|
2013-02-27 22:45:37 +01:00
|
|
|
|
pub fn add_chicken(&self, c: Chicken) { ... }
|
2012-10-07 07:23:16 +02:00
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
pub fn feed_animals(farm: &Farm) {
|
|
|
|
|
farm.feed_chickens();
|
|
|
|
|
farm.feed_cows();
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
fn main() {
|
|
|
|
|
let f = make_me_a_farm();
|
|
|
|
|
f.add_chicken(make_me_a_chicken());
|
|
|
|
|
farm::feed_animals(&f);
|
|
|
|
|
f.farmer.rest();
|
|
|
|
|
}
|
|
|
|
|
# fn make_me_a_farm() -> farm::Farm { farm::make_me_a_farm() }
|
2013-01-09 04:37:25 +01:00
|
|
|
|
# fn make_me_a_chicken() -> farm::Chicken { 0 }
|
2012-10-07 07:23:16 +02:00
|
|
|
|
~~~
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
## Crates
|
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
The unit of independent compilation in Rust is the crate: rustc
|
2012-10-07 07:23:16 +02:00
|
|
|
|
compiles a single crate at a time, from which it produces either a
|
2012-12-30 22:09:34 +01:00
|
|
|
|
library or an executable.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-12-15 04:12:51 +01:00
|
|
|
|
When compiling a single `.rs` source file, the file acts as the whole crate.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
You can compile it with the `--lib` compiler switch to create a shared
|
|
|
|
|
library, or without, provided that your file contains a `fn main`
|
|
|
|
|
somewhere, to create an executable.
|
|
|
|
|
|
2012-12-15 04:12:51 +01:00
|
|
|
|
Larger crates typically span multiple files and are, by convention,
|
|
|
|
|
compiled from a source file with the `.rc` extension, called a *crate file*.
|
|
|
|
|
The crate file extension distinguishes source files that represent
|
|
|
|
|
crates from those that do not, but otherwise source files and crate files are identical.
|
|
|
|
|
|
|
|
|
|
A typical crate file declares attributes associated with the crate that
|
|
|
|
|
may affect how the compiler processes the source.
|
|
|
|
|
Crate attributes specify metadata used for locating and linking crates,
|
|
|
|
|
the type of crate (library or executable),
|
|
|
|
|
and control warning and error behavior,
|
|
|
|
|
among other things.
|
|
|
|
|
Crate files additionally declare the external crates they depend on
|
|
|
|
|
as well as any modules loaded from other files.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-07 07:23:16 +02:00
|
|
|
|
~~~~ { .xfail-test }
|
2012-12-15 04:12:51 +01:00
|
|
|
|
// Crate linkage metadata
|
2012-01-19 12:51:20 +01:00
|
|
|
|
#[link(name = "farm", vers = "2.5", author = "mjh")];
|
2012-12-15 04:12:51 +01:00
|
|
|
|
|
|
|
|
|
// Make a library ("bin" is the default)
|
2012-04-06 01:26:36 +02:00
|
|
|
|
#[crate_type = "lib"];
|
2012-10-07 07:23:16 +02:00
|
|
|
|
|
2012-12-15 04:12:51 +01:00
|
|
|
|
// Turn on a warning
|
|
|
|
|
#[warn(non_camel_case_types)]
|
|
|
|
|
|
|
|
|
|
// Link to the standard library
|
|
|
|
|
extern mod std;
|
|
|
|
|
|
|
|
|
|
// Load some modules from other files
|
2012-01-19 12:51:20 +01:00
|
|
|
|
mod cow;
|
|
|
|
|
mod chicken;
|
|
|
|
|
mod horse;
|
2012-12-15 04:12:51 +01:00
|
|
|
|
|
|
|
|
|
fn main() {
|
|
|
|
|
...
|
|
|
|
|
}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
Compiling this file will cause `rustc` to look for files named
|
2012-10-07 07:23:16 +02:00
|
|
|
|
`cow.rs`, `chicken.rs`, and `horse.rs` in the same directory as the
|
|
|
|
|
`.rc` file, compile them all together, and, based on the presence of
|
|
|
|
|
the `crate_type = "lib"` attribute, output a shared library or an
|
2012-12-30 22:09:34 +01:00
|
|
|
|
executable. (If the line `#[crate_type = "lib"];` was omitted,
|
2012-10-07 07:23:16 +02:00
|
|
|
|
`rustc` would create an executable.)
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-07 07:23:16 +02:00
|
|
|
|
The `#[link(...)]` attribute provides meta information about the
|
|
|
|
|
module, which other crates can use to load the right module. More
|
|
|
|
|
about that later.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
To have a nested directory structure for your source files, you can
|
2012-12-15 04:12:51 +01:00
|
|
|
|
nest mods:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-03-21 00:01:32 +01:00
|
|
|
|
~~~~ {.ignore}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
mod poultry {
|
|
|
|
|
mod chicken;
|
|
|
|
|
mod turkey;
|
|
|
|
|
}
|
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
The compiler will now look for `poultry/chicken.rs` and
|
|
|
|
|
`poultry/turkey.rs`, and export their content in `poultry::chicken`
|
|
|
|
|
and `poultry::turkey`. You can also provide a `poultry.rs` to add
|
|
|
|
|
content to the `poultry` module itself.
|
|
|
|
|
|
2012-10-07 07:23:16 +02:00
|
|
|
|
## Using other crates
|
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
The `extern mod` directive lets you use a crate (once it's been
|
|
|
|
|
compiled into a library) from inside another crate. `extern mod` can
|
|
|
|
|
appear at the top of a crate file or at the top of modules. It will
|
|
|
|
|
cause the compiler to look in the library search path (which you can
|
|
|
|
|
extend with the `-L` switch) for a compiled Rust library with the
|
|
|
|
|
right name, then add a module with that crate's name into the local
|
|
|
|
|
scope.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-07 07:23:16 +02:00
|
|
|
|
For example, `extern mod std` links the [standard library].
|
|
|
|
|
|
|
|
|
|
[standard library]: std/index.html
|
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
When a comma-separated list of name/value pairs appears after `extern
|
|
|
|
|
mod`, the compiler front-end matches these pairs against the
|
|
|
|
|
attributes provided in the `link` attribute of the crate file. The
|
|
|
|
|
front-end will only select this crate for use if the actual pairs
|
|
|
|
|
match the declared attributes. You can provide a `name` value to
|
|
|
|
|
override the name used to search for the crate.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
Our example crate declared this set of `link` attributes:
|
|
|
|
|
|
2013-01-21 07:48:47 +01:00
|
|
|
|
~~~~
|
2012-01-19 12:51:20 +01:00
|
|
|
|
#[link(name = "farm", vers = "2.5", author = "mjh")];
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
Which you can then link with any (or all) of the following:
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-07 07:23:16 +02:00
|
|
|
|
~~~~ {.xfail-test}
|
|
|
|
|
extern mod farm;
|
|
|
|
|
extern mod my_farm (name = "farm", vers = "2.5");
|
|
|
|
|
extern mod my_auxiliary_farm (name = "farm", author = "mjh");
|
|
|
|
|
~~~~
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-11 06:29:25 +02:00
|
|
|
|
If any of the requested metadata do not match, then the crate
|
2012-10-07 07:23:16 +02:00
|
|
|
|
will not be compiled successfully.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
|
|
|
|
## A minimal example
|
|
|
|
|
|
|
|
|
|
Now for something that you can actually compile yourself. We have
|
|
|
|
|
these two files:
|
|
|
|
|
|
|
|
|
|
~~~~
|
2012-10-04 21:41:45 +02:00
|
|
|
|
// world.rs
|
|
|
|
|
#[link(name = "world", vers = "1.0")];
|
2012-10-13 01:41:16 +02:00
|
|
|
|
pub fn explore() -> &str { "world" }
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
2012-12-24 23:07:37 +01:00
|
|
|
|
~~~~ {.xfail-test}
|
2012-01-19 12:51:20 +01:00
|
|
|
|
// main.rs
|
2012-10-04 21:41:45 +02:00
|
|
|
|
extern mod world;
|
2012-12-23 08:51:49 +01:00
|
|
|
|
fn main() { io::println(~"hello " + world::explore()); }
|
2012-01-19 12:51:20 +01:00
|
|
|
|
~~~~
|
|
|
|
|
|
|
|
|
|
Now compile and run like this (adjust to your platform if necessary):
|
|
|
|
|
|
2012-03-21 00:01:32 +01:00
|
|
|
|
~~~~ {.notrust}
|
2012-10-04 21:41:45 +02:00
|
|
|
|
> rustc --lib world.rs # compiles libworld-94839cbfe144198-1.0.so
|
|
|
|
|
> rustc main.rs -L . # compiles main
|
2012-01-19 12:51:20 +01:00
|
|
|
|
> ./main
|
|
|
|
|
"hello world"
|
|
|
|
|
~~~~
|
|
|
|
|
|
2012-10-07 07:23:16 +02:00
|
|
|
|
Notice that the library produced contains the version in the filename
|
|
|
|
|
as well as an inscrutable string of alphanumerics. These are both
|
|
|
|
|
part of Rust's library versioning scheme. The alphanumerics are
|
|
|
|
|
a hash representing the crate metadata.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-10-07 07:23:16 +02:00
|
|
|
|
## The core library
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2013-04-01 21:11:07 +02:00
|
|
|
|
The Rust core library provides runtime features required by the language,
|
|
|
|
|
including the task scheduler and memory allocators, as well as library
|
|
|
|
|
support for Rust built-in types, platform abstractions, and other commonly
|
|
|
|
|
used features.
|
|
|
|
|
|
|
|
|
|
[`core`] includes modules corresponding to each of the integer types, each of
|
|
|
|
|
the floating point types, the [`bool`] type, [tuples], [characters], [strings],
|
|
|
|
|
[vectors], [managed boxes], [owned boxes],
|
|
|
|
|
and unsafe and borrowed [pointers]. Additionally, `core` provides
|
|
|
|
|
some pervasive types ([`option`] and [`result`]),
|
|
|
|
|
[task] creation and [communication] primitives,
|
|
|
|
|
platform abstractions ([`os`] and [`path`]), basic
|
|
|
|
|
I/O abstractions ([`io`]), [containers] like [`hashmap`],
|
|
|
|
|
common traits ([`kinds`], [`ops`], [`cmp`], [`num`],
|
|
|
|
|
[`to_str`], [`clone`]), and complete bindings to the C standard library ([`libc`]).
|
|
|
|
|
|
|
|
|
|
### Core injection and the Rust prelude
|
|
|
|
|
|
|
|
|
|
`core` is imported at the topmost level of every crate by default, as
|
|
|
|
|
if the first line of each crate was
|
|
|
|
|
|
|
|
|
|
extern mod core;
|
|
|
|
|
|
|
|
|
|
This means that the contents of core can be accessed from from any context
|
|
|
|
|
with the `core::` path prefix, as in `use core::vec`, `use core::task::spawn`,
|
|
|
|
|
etc.
|
|
|
|
|
|
|
|
|
|
Additionally, `core` contains a `prelude` module that reexports many of the
|
|
|
|
|
most common core modules, types and traits. The contents of the prelude are
|
|
|
|
|
imported into every *module* by default. Implicitly, all modules behave as if
|
|
|
|
|
they contained the following prologue:
|
|
|
|
|
|
|
|
|
|
use core::prelude::*;
|
|
|
|
|
|
|
|
|
|
[`core`]: core/index.html
|
|
|
|
|
[`bool`]: core/bool.html
|
|
|
|
|
[tuples]: core/tuple.html
|
|
|
|
|
[characters]: core/char.html
|
2012-10-07 07:23:16 +02:00
|
|
|
|
[strings]: core/str.html
|
2013-04-01 21:11:07 +02:00
|
|
|
|
[vectors]: core/vec.html
|
|
|
|
|
[managed boxes]: core/managed.html
|
|
|
|
|
[owned boxes]: core/owned.html
|
|
|
|
|
[pointers]: core/ptr.html
|
|
|
|
|
[`option`]: core/option.html
|
|
|
|
|
[`result`]: core/result.html
|
|
|
|
|
[task]: core/task.html
|
|
|
|
|
[communication]: core/comm.html
|
|
|
|
|
[`os`]: core/os.html
|
|
|
|
|
[`path`]: core/path.html
|
|
|
|
|
[`io`]: core/io.html
|
|
|
|
|
[containers]: core/container.html
|
|
|
|
|
[`hashmap`]: core/hashmap.html
|
|
|
|
|
[`kinds`]: core/kinds.html
|
|
|
|
|
[`ops`]: core/ops.html
|
|
|
|
|
[`cmp`]: core/cmp.html
|
|
|
|
|
[`num`]: core/num.html
|
|
|
|
|
[`to_str`]: core/to_str.html
|
|
|
|
|
[`clone`]: core/clone.html
|
|
|
|
|
[`libc`]: core/libc.html
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 00:33:50 +02:00
|
|
|
|
# What next?
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 00:33:50 +02:00
|
|
|
|
Now that you know the essentials, check out any of the additional
|
|
|
|
|
tutorials on individual topics.
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 00:33:50 +02:00
|
|
|
|
* [Borrowed pointers][borrow]
|
|
|
|
|
* [Tasks and communication][tasks]
|
|
|
|
|
* [Macros][macros]
|
|
|
|
|
* [The foreign function interface][ffi]
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2013-04-01 21:11:07 +02:00
|
|
|
|
There is further documentation on the [wiki].
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 00:33:50 +02:00
|
|
|
|
[borrow]: tutorial-borrowed-ptr.html
|
|
|
|
|
[tasks]: tutorial-tasks.html
|
|
|
|
|
[macros]: tutorial-macros.html
|
|
|
|
|
[ffi]: tutorial-ffi.html
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 00:33:50 +02:00
|
|
|
|
[wiki]: https://github.com/mozilla/rust/wiki/Docs
|
|
|
|
|
[unit testing]: https://github.com/mozilla/rust/wiki/Doc-unit-testing
|
|
|
|
|
[rustdoc]: https://github.com/mozilla/rust/wiki/Doc-using-rustdoc
|
|
|
|
|
[cargo]: https://github.com/mozilla/rust/wiki/Doc-using-cargo-to-manage-packages
|
|
|
|
|
[attributes]: https://github.com/mozilla/rust/wiki/Doc-attributes
|
2012-01-19 12:51:20 +01:00
|
|
|
|
|
2012-09-23 00:33:50 +02:00
|
|
|
|
[pound-rust]: http://chat.mibbit.com/?server=irc.mozilla.org&channel=%23rust
|