From 289ba105ae613291f0a3a16ef295875130abdc49 Mon Sep 17 00:00:00 2001 From: Alex Crichton Date: Fri, 10 Jan 2014 16:34:55 -0800 Subject: [PATCH] dox: Write a guide to the rust runtime --- configure | 1 + doc/guide-runtime.md | 263 +++++++++++++++++++++++++++++++++++++++++++ doc/index.md | 1 + doc/tutorial.md | 4 +- mk/docs.mk | 8 +- mk/tests.mk | 2 +- 6 files changed, 276 insertions(+), 3 deletions(-) create mode 100644 doc/guide-runtime.md diff --git a/configure b/configure index a2468f64a75..1b498bff6b2 100755 --- a/configure +++ b/configure @@ -795,6 +795,7 @@ do make_dir $h/test/codegen make_dir $h/test/doc-tutorial make_dir $h/test/doc-guide-ffi + make_dir $h/test/doc-guide-runtime make_dir $h/test/doc-guide-macros make_dir $h/test/doc-guide-lifetimes make_dir $h/test/doc-guide-pointers diff --git a/doc/guide-runtime.md b/doc/guide-runtime.md new file mode 100644 index 00000000000..86a8c23d947 --- /dev/null +++ b/doc/guide-runtime.md @@ -0,0 +1,263 @@ +% A Guide to the Rust Runtime + +Rust includes two runtime libraries in the standard distribution, which provide +a unified interface to primitives such as I/O, but the language itself does not +require a runtime. The compiler is capable of generating code that works in all +environments, even kernel environments. Neither does the Rust language need a +runtime to provide memory safety; the type system itself is sufficient to write +safe code, verified statically at compile time. The runtime merely uses the +safety features of the language to build a number of convenient and safe +high-level abstractions. + +That being said, code without a runtime is often very limited in what it can do. +As a result, Rust's standard libraries supply a set of functionality that is +normally considered the Rust runtime. This guide will discuss Rust's user-space +runtime, how to use it, and what it can do. + +# What is the runtime? + +The Rust runtime can be viewed as a collection of code which enables services +like I/O, task spawning, TLS, etc. It's essentially an ephemeral collection of +objects which enable programs to perform common tasks more easily. The actual +implementation of the runtime itself is mostly a sparse set of opt-in primitives +that are all self-contained and avoid leaking their abstractions into libraries. + +The current runtime is the engine behind these features (not a comprehensive +list): + +* I/O +* Task spawning +* Message passing +* Task synchronization +* Task-local storage +* Logging +* Local heaps (GC heaps) +* Task unwinding + +## What is the runtime accomplishing? + +The runtime is designed with a few goals in mind: + +* Rust libraries should work in a number of environments without having to worry + about the exact details of the environment itself. Two commonly referred to + environments are the M:N and 1:1 environments. Since the Rust runtime was + first designed, it has supported M:N threading, and it has since gained 1:1 + support as well. + +* The runtime should not enforce separate "modes of compilation" in order to + work in multiple circumstances. Is it an explicit goal that you compile a Rust + library once and use it forever (in all environments). + +* The runtime should be fast. There should be no architectural design barrier + which is preventing programs from running at optimal speeds. It is not a goal + for the runtime to be written "as fast as can be" at every moment in time. For + example, no claims will be made that the current implementation of the runtime + is the fastest it will ever be. This goal is simply to prevent any + architectural roadblock from hindering performance. + +* The runtime should be nearly invisible. The design of the runtime should not + encourage direct interaction with it, and using the runtime should be + essentially transparent to libraries. This does not mean it should be + impossible to query the runtime, but rather it should be unconventional. + +# Architecture of the runtime + +This section explains the current architecture of the Rust runtime. It has +evolved over the development of Rust through many iterations, and this is simply +the documentation of the current iteration. + +## A local task + +The core abstraction of the Rust runtime is the task. A task represents a +"thread" of execution of Rust code, but it does not necessarily correspond to an +OS thread. Most runtime services are accessed through the local task, allowing +for runtime policy decisions to be made on a per-task basis. + +A consequence of this decision is to require all Rust code using the standard +library to have a local `Task` structure available to them. This `Task` is +stored in the OS's thread local storage (OS TLS) to allow for efficient access +to it. + +It has also been decided that the presence or non-presence of a local `Task` is +essentially the *only* assumption that the runtime can make. Almost all runtime +services are routed through this local structure. + +This requirement of a local task is a core assumption on behalf of *all* code +using the standard library, hence it is defined in the standard library itself. + +## I/O + +When dealing with I/O in general, there are a few flavors by which it can be +dealt with, and not all flavors are right for all situations. I/O is also a +tricky topic that is nearly impossible to get consistent across all +environments. As a result, a Rust task is not guaranteed to have access to I/O, +and it is not even guaranteed what the implementation of the I/O will be. + +This conclusion implies that I/O *cannot* be defined in the standard library. +The standard library does, however, provide the interface to I/O that all Rust +tasks are able to consume. + +This interface is implemented differently for various flavors of tasks, and is +designed with a focus around synchronous I/O calls. This architecture does not +fundamentally prevent other forms of I/O from being defined, but it is not done +at this time. + +The I/O interface that the runtime must provide can be found in the +[std::rt::rtio](std/rt/rtio/trait.IoFactory.html) module. Note that this +interface is *unstable*, and likely always will be. + +## Task Spawning + +A frequent operation performed by tasks is to spawn a child task to perform some +work. This is the means by which parallelism is enabled in Rust. This decision +of how to spawn a task is not a general decision, and is hence a local decision +to the task (not defined in the standard library). + +Task spawning is interpreted as "spawning a sibling" and is enabled through the +high level interface in `std::task`. The child task can be configured +accordingly, and runtime implementations must respect these options when +spawning a new task. + +Another local task operation is dealing with the runnable state of the task +itself. This frequently comes up when the question is "how do I block a task?" +or "how do I wake up a task?". These decisions are inherently local to the task +itself, yet again implying that they are not defined in the standard library. + +## The `Runtime` trait and the `Task` structure + +The full complement of runtime features is defined by the [`Runtime` +trait](std/rt/trait.Runtime.html) and the [`Task` +struct](std/rt/task/struct.Task.html). A `Task` is constant among all runtime +implementations, but each runtime implements has its own implementation of the +`Runtime` trait. + +The local `Task` stores the runtime value inside of itself, and then ownership +dances ensue to invoke methods on the runtime. + +# Implementations of the runtime + +The Rust distribution provides two implementations of the runtime. These two +implementations are generally known as 1:1 threading and M:N threading. + +As with many problems in computer science, there is no right answer in this +question of which implementation of the runtime to choose. Each implementation +has its benefits and each has its drawbacks. The descriptions below are meant to +inform programmers about what the implementation provides and what it doesn't +provide in order to make an informed decision about which to choose. + +## 1:1 - using `libnative` + +The library `libnative` is an implementation of the runtime built upon native OS +threads plus libc blocking I/O calls. This is called 1:1 threading because each +user-space thread corresponds to exactly one kernel thread. + +In this model, each Rust task corresponds to one OS thread, and each I/O object +essentially corresponds to a file descriptor (or the equivalent of the platform +you're running on). + +Some benefits to using libnative are: + +* Guaranteed interop with FFI bindings. If a C library you are using blocks the + thread to do I/O (such as a database driver), then this will not interfere + with other Rust tasks (because only the OS thread will be blocked). +* Less I/O overhead as opposed to M:N in some cases. Not all M:N I/O is + guaranteed to be "as fast as can be", and some things (like filesystem APIs) + are not truly asynchronous on all platforms, meaning that the M:N + implementation may incur more overhead than a 1:1 implementation. + +## M:N - using `libgreen` + +The library `libgreen` implements the runtime with "green threads" on top of the +asynchronous I/O framework [libuv][libuv]. The M in M:N threading is the number +of OS threads that a process has, and the N is the number of Rust tasks. In this +model, N Rust tasks are multiplexed among M OS threads, and context switching is +implemented in user-space. + +The primary concern of an M:N runtime is that a Rust task cannot block itself in +a syscall. If this happens, then the entire OS thread is frozen and unavailable +for running more Rust tasks, making this a (M-1):N runtime (and you can see how +this can reach 0/deadlock. By using asynchronous I/O under the hood (all I/O +still looks synchronous in terms of code), OS threads are never blocked until +the appropriate time comes. + +Upon reading `libgreen`, you may notice that there is no I/O implementation +inside of the library, but rather just the infrastructure for maintaining a set +of green schedulers which switch among Rust tasks. The actual I/O implementation +is found in `librustuv` which are the Rust bindings to libuv. This distinction +is made to allow for other I/O implementations not built on libuv (but none +exist at this time). + +Some benefits of using libgreen are: + +* Fast task spawning. When using M:N threading, spawning a new task can avoid + executing a syscall entirely, which can lead to more efficient task spawning + times. +* Fast task switching. Because context switching is implemented in user-space, + all task contention operations (mutexes, channels, etc) never execute + syscalls, leading to much faster implementations and runtimes. An efficient + context switch also leads to higher throughput servers than 1:1 threading + because tasks can be switched out much more efficiently. + +### Pools of Schedulers + +M:N threading is built upon the concept of a pool of M OS threads (which +libgreen refers to as schedulers), able to run N Rust tasks. This abstraction is +encompassed in libgreen's [`SchedPool`][schedpool] type. This type allows for +fine-grained control over the pool of schedulers which will be used to run Rust +tasks. + +In addition the `SchedPool` type is the *only* way through which a new M:N task +can be spawned. Sibling tasks to Rust tasks themselves (created through +`std::task::spawn`) will be spawned into the same pool of schedulers that the +original task was home to. New tasks must previously have some form of handle +into the pool of schedulers in order to spawn a new task. + +## Which to choose? + +With two implementations of the runtime available, a choice obviously needs to +be made to see which will be used. The compiler itself will always by-default +link to one of these runtimes. At the time of this writing, the default runtime +is `libgreen` but in the future this will become `libnative`. + +Having a default decision made in the compiler is done out of necessity and +convenience. The compiler's decision of runtime to link to is *not* an +endorsement of one over the other. As always, this decision can be overridden. + +For example, this program will be linked to "the default runtime" + +~~~{.rust} +fn main() {} +~~~ + +Whereas this program explicitly opts into using a particular runtime + +~~~{.rust} +extern mod green; + +#[start] +fn start(argc: int, argv: **u8) -> int { + do green::start(argc, argv) { + main(); + } +} + +fn main() {} +~~~ + +Both libgreen/libnative provide a top-level `start` function which is used to +boot an initial Rust task in that specified runtime. + +# Finding the runtime + +The actual code for the runtime is spread out among a few locations: + +* [std::rt][stdrt] +* [libnative][libnative] +* [libgreen][libgreen] +* [librustuv][librustuv] + +[libuv]: https://github.com/joyent/libuv/ +[stdrt]: std/rt/index.html +[libnative]: native/index.html +[libgreen]: green/index.html +[librustuv]: rustuv/index.html diff --git a/doc/index.md b/doc/index.md index 456fedf2d47..cf68310d353 100644 --- a/doc/index.md +++ b/doc/index.md @@ -20,6 +20,7 @@ [Packaging](guide-rustpkg.html) [Testing](guide-testing.html) [Conditions](guide-conditions.html) +[Rust's Runtime](guide-runtime.html) # Libraries diff --git a/doc/tutorial.md b/doc/tutorial.md index bb5e1bdd7bc..fd92a4cc080 100644 --- a/doc/tutorial.md +++ b/doc/tutorial.md @@ -2067,7 +2067,7 @@ fn head_bad(v: &[T]) -> T { However, we can tell the compiler that the `head` function is only for copyable types. -In Rust, copyable types are those that _implement the `Clone` trait_. +In Rust, copyable types are those that _implement the `Clone` trait_. We can then explicitly create a second copy of the value we are returning by calling the `clone` method: @@ -3282,6 +3282,7 @@ guides on individual topics. * [Packaging up Rust code][rustpkg] * [Documenting Rust code][rustdoc] * [Testing Rust code][testing] +* [The Rust Runtime][runtime] There is further documentation on the [wiki], however those tend to be even more out of date as this document. @@ -3294,6 +3295,7 @@ There is further documentation on the [wiki], however those tend to be even more [conditions]: guide-conditions.html [rustpkg]: guide-rustpkg.html [testing]: guide-testing.html +[runtime]: guide-runtime.html [rustdoc]: rustdoc.html [wiki]: https://github.com/mozilla/rust/wiki/Docs diff --git a/mk/docs.mk b/mk/docs.mk index 1982ab0050c..71ea4ba44d1 100644 --- a/mk/docs.mk +++ b/mk/docs.mk @@ -72,7 +72,7 @@ endif ifneq ($(NO_DOCS),1) DOCS += doc/rust.html -doc/rust.html: rust.md doc/full-toc.inc $(HTML_DEPS) +doc/rust.html: rust.md doc/full-toc.inc $(HTML_DEPS) @$(call E, pandoc: $@) $(Q)$(CFG_NODE) $(S)doc/prep.js --highlight $< | \ $(CFG_PANDOC) $(HTML_OPTS) --include-in-header=doc/full-toc.inc --output=$@ @@ -224,6 +224,12 @@ doc/guide-pointers.html: $(S)doc/guide-pointers.md $(HTML_DEPS) $(Q)$(CFG_NODE) $(S)doc/prep.js --highlight $< | \ $(CFG_PANDOC) $(HTML_OPTS) --output=$@ +DOCS += doc/guide-runtime.html +doc/guide-runtime.html: $(S)doc/guide-runtime.md $(HTML_DEPS) + @$(call E, pandoc: $@) + $(Q)$(CFG_NODE) $(S)doc/prep.js --highlight $< | \ + $(CFG_PANDOC) $(HTML_OPTS) --output=$@ + ifeq ($(CFG_PDFLATEX),) $(info cfg: no pdflatex found, omitting doc/rust.pdf) else diff --git a/mk/tests.mk b/mk/tests.mk index d50f9878bd1..85794cacc96 100644 --- a/mk/tests.mk +++ b/mk/tests.mk @@ -22,7 +22,7 @@ TEST_CRATES = $(TEST_TARGET_CRATES) $(TEST_HOST_CRATES) # Markdown files under doc/ that should have their code extracted and run DOC_TEST_NAMES = tutorial guide-ffi guide-macros guide-lifetimes \ guide-tasks guide-conditions guide-container guide-pointers \ - complement-cheatsheet \ + complement-cheatsheet guide-runtime \ rust ######################################################################