Dart Builds - Where are we, how did we get here, and where are we going?
Where we were: pub build (barback)
When barback was written it was intended to satisfy a small need within web
development: things like sass compilation, or compacting sprites to a single
image. It was a very flexible system and it turned out to be useful for more
intensive build steps, like the Angular compiler. Flexibility comes with a cost
though. If any file can be rewritten (or deleted) at any step in the build, then
care needs to be taken not to let other build steps read it until it is
‘stable’. Barback adds protection with what turns out to be a significant
limitation - a Transformer running in package Foo can’t read any file from
package Bar until all transformers on Bar have finished. This is
problematic in the context of Dart source files because of features like type
inference. In order to understand the “meaning” of a Dart source file it is
necessary to also understand details from imported libraries. If two
Transformers running in different packages want to understand the resolved
meaning of a Dart library they must read transitive imports - and when those
imports cycle across package boundaries it will cause deadlock in Barback.
Neither Transformer is allowed to read imported Dart libraries until the other
finishes, and neither will finish until it can read the files.
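To make the deadlock concrete, here is a rough sketch of what a transformer that tries to resolve its imports looks like in the barback API (the package and file names are invented for illustration):

```dart
import 'dart:async';

import 'package:barback/barback.dart';

/// A sketch of a barback Transformer. If package foo and package bar each run
/// a transformer like this on their own sources, and their Dart files import
/// each other, the cross-package read below blocks until the *other* package's
/// transformers finish - which is the deadlock described above.
class ResolvingTransformer extends Transformer {
  ResolvingTransformer.asPlugin();

  @override
  String get allowedExtensions => '.dart';

  @override
  Future apply(Transform transform) async {
    // Reading the primary input is always fine.
    var source = await transform.primaryInput.readAsString();

    // Reading a file from another package is only allowed once *all* of that
    // package's transformers have finished running.
    var imported =
        await transform.readInputAsString(new AssetId('bar', 'lib/bar.dart'));

    transform.addOutput(new Asset.fromString(
        transform.primaryInput.id.addExtension('.info'),
        '${source.length} ${imported.length}'));
  }
}
```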
This fundamental limitation meant that the Angular compiler before version 4 had to cheat the system. Barback wouldn’t let it read the source Dart files - but the Angular transformer was also running on those source files and would (hopefully) have already read them before the point where the information was needed. Angular started storing what it needed in (effectively) global variables that were shared between the runs across different packages. This introduced a race condition - it relied on the transformers across all packages starting together, and not reaching a certain point until the global data had been populated.
When we rewrote the Angular compiler to use the analyzer and perform deeper type resolution we hit a roadblock: in order to resolve types we needed to give the analyzer transitive Dart imports, but we couldn’t do that if there were package cycles. We were forced to disallow package cycles for packages using the Angular transformer, and discovered that performance was dramatically impacted. We could only do work up until the point where we needed to read an asset from a dependency - which in the case of using the analyzer meant immediately. We were serializing the work of the Angular transformer across packages, because we couldn’t start doing anything on a package until Barback let us read in our transitive imports.
Pub’s intended use case also doesn’t cover compilation steps that take a long time. There is limited caching and no support for incremental compiles across runs. The fact that there is not a consistent view of what a single file looks like makes these hard or impossible to add in.
How we got here: bridging the gap with package:build
As teams inside Google were building ever bigger and bigger projects we were
also running into other difficulties integrating pub’s “write whatever,
whenever” approach with bazel’s much stricter statically analyzable
build graph. We could not take pub’s model and make it incremental or modular -
it has to be monolithic. Bazel does not allow you to rewrite a file. Source
files can’t be changed, and anything that generates an output needs to happen in
a single build step. We wrote package:build with a more restrictive model
which could let us run with a bazel-like set of restrictions while easily
shimming to the pub interface when we want to run in that build environment.
Over time we gradually adopted, in addition to the bazel restrictions, a
declarative approach that added bazel’s static analyzability. Instead of
executing Dart code to determine the files that will be written, we shifted to
configuration metadata which says which output extensions can be produced for a
given input extension. The Builder concept was more restrictive for the author,
but gave us a lot more flexibility to integrate it with build systems. We could
take a single Builder implementation and run it in three ways - as a Transformer
in pub, as a build rule in a Bazel build, or by writing to the source tree
directly using build_runner for smaller local-only builders.
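For illustration, a minimal Builder in the package:build API might look something like the sketch below (the file extensions and behavior are made up); the buildExtensions map is the static metadata each build environment can read without executing any Dart code:

```dart
import 'package:build/build.dart';

/// A sketch of a Builder that copies ".txt" files (a made-up example).
class CopyBuilder implements Builder {
  /// Static metadata: every ".txt" input may produce a ".txt.copy" output.
  /// The build system knows this without running the build method.
  @override
  final buildExtensions = const {
    '.txt': ['.txt.copy']
  };

  @override
  Future<void> build(BuildStep buildStep) async {
    // Reads are tracked by the build system, so it knows this output depends
    // only on the primary input.
    var contents = await buildStep.readAsString(buildStep.inputId);
    await buildStep.writeAsString(
        buildStep.inputId.addExtension('.copy'), contents);
  }
}
```

Because the declared outputs are known up front, the same class can be wrapped as a pub Transformer, translated into a Bazel rule, or run directly by build_runner.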
Our long term goal for the build package was to enable external users to use the
full power of the bazel build system like our internal users. We built a
prototype for dazel - a tool that could take builder configuration
metadata and generate the skylark build rules and BUILD files necessary for a
Dart project. We hit a few roadblocks with this approach:
- Internally we only need to build Dart on unix hosts but some Dart users develop on Windows. It would take a lot of effort to migrate our bazel build rules and tools to be cross platform.
- The bazel build system, despite attempts to hide its complexity, still feels
heavyweight for smaller projects which worked with the simpler pub build.
We were in a tough position. Our dev compiler is tricky to use without being tightly integrated into a build system. External Angular builds were too slow to be acceptable for most users, and the cost to generate Angular code was paid every time pub serve was started. We could give a good experience with bazel, but only for a fraction of our users on a fraction of their projects.
Where we are: build_runner as a full build system
Although it was originally written to satisfy a small part of the use case for
builders - generating code in a single package intended to be published with the
package - we did have a proof of concept build system which was pure Dart and
followed a more restrictive (read “easier to optimize”) model. We’ve put a lot
of effort into making build_runner a fully capable build system for Dart
projects. There are two major areas we’ve improved:
- We’ve expanded the power of build_runner to run builds across all packages,
and for files that are not intended to be published.
- We’ve improved the usability of build_runner so that it can be used more like
pub build, with automatic discovery of Builders rather than expressing all the
requirements in a manual build script.
With Dart 2 we’re completely transitioning builds for web projects to
build_runner (we’re calling the CLI webdev) and we’ve dropped support for the
build and serve commands in pub.
Where we are going: better, faster builds
Easier
We’re pushing more of the configuration and complexity onto the authors of
Builders so that end users don’t need to do manual work. Most Builders can be
enabled automatically based on dependencies, and the Builder configuration is
expressive enough to automatically determine things like the order of work.
Builder authors can also decide default options for dev and release mode, so all
it takes to enable dart2js compilation is --release!
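As a rough sketch of what that configuration might look like (the package, builder name, and minify option here are hypothetical, and the exact keys come from package:build_config, so treat this as approximate):

```yaml
# build.yaml shipped by a hypothetical package "my_package".
builders:
  my_builder:
    import: "package:my_package/builder.dart"
    builder_factories: ["myBuilder"]
    build_extensions: {".dart": [".my.dart"]}
    # Apply automatically to any package that depends on my_package.
    auto_apply: dependents
    build_to: cache
    defaults:
      # End users can still override these, but get sensible defaults per mode.
      dev_options:
        minify: false
      release_options:
        minify: true
```

With something like this in place an end user only needs a dependency on the package; running the build with --release then picks up the release defaults, which is how dart2js compilation gets enabled.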
Better
We’ve proven that build_runner works for many projects using a small set of
Builders. We’ll be working on expanding the happy path and blunting the sharp
edges which surround it. As we work to finish migrating builds over to
build_runner from pub we’ll be exploring patterns for working within the
restrictions of the package:build paradigm.
We’ll also be working to improve integration with other tools, like the analysis
server. Unlike with pub build, generated assets live on disk and not in memory.
Other tools can see these files, which makes them easier to inspect, debug, and
understand.
Faster
We wanted to move quickly to a working end-to-end system - when we could reuse code we did. Some of the APIs we use were written with pub compatibility in mind. Even though a Builder wouldn’t overwrite a file, we couldn’t be sure that some other Transformer wouldn’t, and so we needed to be cautious. We have a lot of room for improvement now that we can focus on a build system that has a statically analyzable model.
Stronger
Despite its restrictions we’re focusing on making sure the new build system has
the right generalizations for Dart. In fact, it’s capable of running our web
compilers without any hardcoded knowledge - they appear to the system like any
other Builder - something that was not possible with Barback. The benefit is
that we can swap out implementations, say for compilers tuned for node.js,
without changes to the build system itself.