Files
librsvg/devel-docs/architecture.rst

446 lines
18 KiB
ReStructuredText
Raw Permalink Blame History

This file contains invisible Unicode characters
This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
Architecture
============
This document roughly describes the architecture of librsvg, and future
plans for it. The code is continually evolving, so dont consider this
as the ground truth, but rather like a cheap map you buy at a street
corner.
The librarys internals are documented as Rust documentation comments;
you can look at the rendered version at
https://gnome.pages.gitlab.gnome.org/librsvg/internals/rsvg/index.html
You may also want to see the section below on
:ref:`some_interesting_parts_of_the_code`.
A bit of history
----------------
Librsvg is an old library. It started around 2001, when Eazel (the
original makers of GNOMEs file manager, Nautilus) needed a library to
render SVG images. At that time the SVG format was being standardized,
so librsvg grew along with the SVG specification. This is why you will
sometimes see references to deprecated SVG features in the source code.
Librsvg started as an experiment to use libxml2s new SAX parser, so
that SVG could be streamed in and rendered on the fly, instead of first
creating a DOM tree. Originally it used
`libart <https://levien.com/libart/>`__ as a rendering library; this was
GNOMEs first antialiased renderer with alpha compositing. Later, the
renderer was replaced with `Cairo <https://www.cairographics.org/>`__.
Librsvg is currently striving to support other rendering backends.
These days librsvg indeed builds a DOM tree by itself; it needs the
tree to run the CSS cascade, do selector matching, and to support
cross-element references like in SVG filters.
Librsvg started as a C library with an ad-hoc API. At some point it
got turned into a GObject library, so that the main
:internals:struct:`librsvg_c::handle::RsvgHandle`
class defines most of the entry points into the library. Through
`GObject Introspection <https://gi.readthedocs.io/en/latest/>`__, this
allows librsvg to be used from other programming languages.
In 2016, librsvg started getting ported to Rust. As of early 2021, the
whole library is implemented in Rust, and exports an intact C API/ABI.
It also exports a more idiomatic Rust API as well.
The C and Rust APIs
-------------------
Librsvg exports two public APIs, one for C and one for Rust.
The C API has hard requirements for API/ABI stability, because it is
used all over the GNOME project and API/ABI breaks would be highly
disruptive. Also, the C API is what allows librsvg to be called from
other programming languages, through `GObject
Introspection <https://gi.readthedocs.io/en/latest/>`__.
The Rust API is a bit more lax in its API stability, but we try to stick
to `semantic versioning <https://semver.org/>`__ as is common in Rust.
The public Rust API is implemented in :source:`rsvg/src/api.rs`. This
has all the primitives needed to load and render SVG documents or
individual elements, and to configure loading/rendering options.
The public C API is implemented in :source:`librsvg-c/src/`, and
it is implemented in terms of the public Rust API. Note that as of
2021/Feb the corresponding C header files are hand-written in
:source:`include/librsvg/`; maybe in the future they will be generated
automatically with
`cbindgen <https://github.com/mozilla/cbindgen/blob/master/docs.md>`__.
We consider it good practice to provide simple and clean primitives in
the Rust API, and have ``librsvg-c`` deal with all the idiosyncrasies and
historical considerations for the C API.
In short: the public C API calls the public Rust API, and the public
Rust API calls into the library's internals.
::
+----------------+
| Public C API |
| librsvg-c/src |
+----------------+
|
calls
|
v
+-------------------+
| Public Rust API |
| rsvg/src/api.rs |
+-------------------+
|
calls
|
v
+-------------------+
| library internals |
| rsvg/src/*.rs |
+-------------------+
The test suite
--------------
The test suite is documented in :source:`rsvg/tests/README.md`.
Code flow
---------
The caller of librsvg loads a document into a handle, and later may ask
to render the document or one of its elements, or measure their
geometries.
Loading an SVG document
~~~~~~~~~~~~~~~~~~~~~~~
The Rust API starts by constructing an :internals:struct:`rsvg::api::SvgHandle`
from a :internals:struct:`rsvg::api::Loader`; both of those are public types.
Internally, the ``SvgHandle`` is just a wrapper around a
:internals:struct:`rsvg::document::Document`, which is a private type that
stores an SVG document loaded in memory.
``SvgHandle`` and its companion :internals:struct:`rsvg::api::CairoRenderer`
provide the basic primitive operations like “render the whole
document” or “compute the geometry of an element” that are needed to
implement the public APIs.
A ``Document`` gets created by loading XML from a stream, into a tree of
:internals:type:`rsvg::node::Node` structures. This is similar to a web
browsers DOM tree. ``Node`` is just a type alias for
``rctree::Node<NodeData>``: an ``rctree`` is an N-ary tree of reference-counted
nodes, and :internals:enum:`rsvg::node::NodeData` is the enum that librsvg uses
to represent either XML element nodes, or text nodes in the XML.
Each XML element causes a new ``Node`` to get created with a
``NodeData::Element(e)``. The ``e`` is an
:internals:struct:`rsvg::element::Element`, which is a struct that holds an XML
element's name and its attributes. It also contains an
:internals:struct-field:`~rsvg::element::Element::element_data` field, which is
an :internals:enum:`rsvg::element::ElementData`: an enum that can represent all
the SVG element types. For example, a ``<path>`` element from XML gets turned
into a ``NodeData::Element(e)`` that has its ``element_data`` set to
:internals:enum-variant:`rsvg::element::ElementData::Path`.
When an ``Element`` is created from its corresponding XML, its
:internals:struct:`rsvg::xml::attributes::Attributes` get parsed. On one hand,
attributes that are specific to a particular element type, like the ``d`` in
``<path d="...">`` get parsed by the
:internals:trait-method:`~rsvg::element::ElementTrait::set_attributes` method
of each particular element type (in that case,
:internals:struct-method:`rsvg::shapes::Path::set_attributes`).
On the other hand, attributes that refer to styles, and which may
appear for any kind of element, get all parsed into a
:internals:struct:`rsvg::properties::SpecifiedValues` struct.
This is a memory-efficient representation of the CSS style properties that
an element has.
When the XML document is fully parsed, a ``Document`` contains a tree of
``Node`` structs and their inner ``Element`` structs. The tree has also
been validated to ensure that the root is an ``<svg>`` element.
After that, the CSS cascade step gets run.
The CSS cascade
~~~~~~~~~~~~~~~
Each ``Element`` has a :internals:struct:`rsvg::properties::SpecifiedValues`,
which has the CSS style properties that the XML specified for that
element. However, ``SpecifiedValues`` is sparse, as not all the
possible style properties may have been filled in. Cascading means
following the CSS/SVG rules for each property type to inherit missing
properties from parent elements. For example, in this document
fragment:
::
<g stroke-width="2" stroke="black">
<path d="M0,0 L10,0" fill="blue"/>
<path d="M20,0 L30,0" fill="green"/>
</g>
Each ``<path>`` element has a different fill color, but they both
*inherit* the ``stroke-width`` and ``stroke`` values from their parent
group. This is because both the ``stroke-width`` and ``stroke``
properties are defined in the CSS/SVG specifications to inherit
automatically. Some other properties, like ``opacity``, do not inherit
and are thus not copied to child elements.
In librsvg, the individual types for CSS properties are defined with
the ``make_property`` macro.
The cascading step takes each elements ``SpecifiedValues`` and
composes it by CSS inheritance onto a
:internals:struct:`rsvg::properties::ComputedValues`,
which has the result of the cascade for each element's properties.
When cascading is done, each ``Element`` has a fully resolved
``ComputedValues`` struct, which is what gets used during rendering to
look up things like the elements stroke width or fill color.
Parsing XML into a tree of Nodes / Elements
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Librsvg uses an XML parser (`libxml2
<https://gitlab.gnome.org/GNOME/libxml2/-/wikis/home>`_ at the time of
this writing) to do the first-stage parsing of the SVG
document. :internals:struct:`rsvg::xml::XmlState`
contains the XML parsing state, which is a stack of contexts depending
on the XML nesting structure. ``XmlState`` has public methods, called
from the XML parser as it goes. The most important one is
:internals:struct-method:`~rsvg::xml::XmlState::start_element`;
this is responsible for creating new ``Node`` structures in the tree,
within the :internals:struct:`rsvg::document::DocumentBuilder`
being built.
Nodes are either SVG elements (the :internals:struct:`rsvg::element::Element`
struct), or text data inside elements (the
:internals:struct:`rsvg::text::Chars` struct); this last one will not concern
us here, and we will only talk about ``Element``.
Each supported kind of ``Element`` parses its attributes in a
:internals:trait-method:`~rsvg::element::ElementTrait::set_attributes`
method. Each attribute is just a key/value pair; for example, the
``<rect width="5px">`` element has a ``width`` attribute whose value
is ``5px``.
While parsing its attributes, an element may encounter an invalid value,
for example, a negative width where only nonnegative ones are allowed.
In this case, the elements ``set_attributes`` method may return a
``Result::Err``. The caller will then do ``set_error`` to mark that
element as being in an error state. If an element is in error, its
children will get parsed as usual, but the element and its children will
be ignored during the rendering stage.
The SVG spec says that SVG rendering should stop on the first element
that is “in error”. However, most implementations simply seem to ignore
erroneous elements instead of completely stopping rendering, and we do
the same in librsvg.
CSS and styles
~~~~~~~~~~~~~~
Librsvg uses Servos `cssparser <https://crates.io/crates/cssparser>`_
crate as a CSS tokenizer, and `selectors
<https://crates.io/crates/selectors>`_ as a high-level parser for CSS
style data.
With the ``cssparser`` crate, the caller is responsible for providing
an implementation of the `DeclarationParser
<https://docs.rs/cssparser/0.29.6/cssparser/trait.DeclarationParser.html>`_
trait. Its `parse_value
<https://docs.rs/cssparser/0.29.6/cssparser/trait.DeclarationParser.html#tymethod.parse_value>`_
method takes the name of a CSS property like ``fill``, plus a
value like ``rgb(255, 0, 0)``, and it must return a value that
represents a parsed declaration. Librsvg uses the
:internals:struct:`rsvg::css::Declaration` struct for this.
The core of parsing CSS is the ``parse_value`` function, which returns
a :internals:enum:`rsvg::properties::ParsedProperty`:
.. code:: rust
pub enum ParsedProperty {
BaselineShift(SpecifiedValue<BaselineShift>),
ClipPath(SpecifiedValue<ClipPath>),
Color(SpecifiedValue<Color>),
// etc.
}
What is :internals:enum:`rsvg::properties::SpecifiedValue`?
It is the parsed value for a CSS property directly as it comes out of
the SVG document:
.. code:: rust
pub enum SpecifiedValue<T>
where
T: Property + Clone + Default,
{
Unspecified,
Inherit,
Specified(T),
}
A property declaration can look like ``opacity: inherit;`` - this would
create a ``ParsedProperty::Opacity(SpecifiedValue::Inherit)``.
Or it can look like ``opacity: 0.5;`` - this would create a
``ParsedProperty::Opacity(SpecifiedValue::Specified(Opacity(UnitInterval(0.5))))``.
Lets break this down:
- :internals:enum-variant:`rsvg::properties::ParsedProperty::Opacity` -
which property did we parse?
- :internals:enum-variant:`rsvg::properties::SpecifiedValue::Specified` -
it actually was specified by the document with a value; the other
interesting alternative is
:internals:enum-variant:`~rsvg::properties::SpecifiedValue::Inherit`,
which corresponds to the value ``inherit`` that all CSS property
declarations can have.
- ``Opacity(UnitInterval(0.5))`` - This is the type
:internals:struct:`rsvg::property_defs::Opacity`
property, which is a newtype around an internal
:internals:struct:`rsvg::unit_interval::UnitInterval` type, which in
turn guarantees that we have a float in the range ``[0.0, 1.0]``.
There is a Rust type for every CSS property that librsvg supports; many
of these types are newtypes around primitive types like ``f64``.
Eventually an entire CSS stylesheet, like the contents of a ``<style>``
element, gets parsed into a :internals:struct:`rsvg::css::Stylesheet`
struct. A stylesheet has a list of rules, where each rule is the CSS
selectors defined for it, and the style declarations that should be
applied for the ``Node``\ s that match the selectors. For example, in
a little stylesheet like this:
.. code:: xml
<style type="text/css">
rect, #some_id {
fill: blue;
stroke-width: 5px;
}
</style>
This stylesheet has a single rule. The rule has a selector list with two
selectors (``rect`` and ``#some_id``) and two style declarations
(``fill: blue`` and ``stroke-width: 5px``).
After parsing is done, there is a **cascading stage** where librsvg
walks the tree of nodes, and for each node it finds the CSS rules that
should be applied to it.
Rendering
---------
The rendering process starts at the
:internals:fn:`rsvg::drawing_ctx::draw_tree` function. This sets up a
:internals:struct:`rsvg::drawing_ctx::DrawingCtx`,
which carries around all the mutable state during rendering.
Rendering is a recursive process, which goes back and forth between
the utility functions in ``DrawingCtx`` and the
:internals:trait-method:`~rsvg::element::ElementTrait::draw`
method in elements.
The main job of ``DrawingCtx`` is to deal with the SVG drawing model.
Each element renders itself independently, and its result gets modified
before getting composited onto the final image:
1. Render an element to a temporary surface (example: stroke and fill a
path).
2. Apply filter effects (blur, color mapping, etc.).
3. Apply clipping paths.
4. Apply masks.
5. Composite the result onto the final image.
The temporary result from the last step also gets put in a stack; this
is because filter effects sometimes need to look at the currently-drawn
background to apply further filtering to it.
Youll see that most of the rendering-related functions return a
``Result<BoundingBox, RenderingError>``. Some SVG features require
knowing the bounding box of the object that is being rendered; for
historical reasons this bounding box is computed as part of the
rendering process in librsvg. When computing a subtrees bounding box,
the bounding boxes from the leaves get aggregated up to the root of
the subtree. Each node in the tree has its own coordinate system;
:internals:struct:`rsvg::bbox::BoundingBox`
is able to transform coordinate systems to get a bounding box that is
meaningful with respect to the roots transform.
Comparing floating-point numbers
--------------------------------
Librsvg sometimes needs to compute things like “are these points equal?”
or “did this computed result equal this test reference number?”.
We use ``f64`` numbers in Rust for all computations on real numbers.
Floating-point numbers cannot be compared with ``==`` effectively, since
it doesnt work when the numbers are slightly different due to numerical
inaccuracies.
Similarly, we dont ``assert_eq!(a, b)`` for floating-point numbers.
Most of the time we are dealing with coordinates which will get passed
to Cairo. In turn, Cairo converts them from doubles to a fixed-point
representation (as of March 2018, Cairo uses 24.8 fixnums with 24 bits
of integral part and 8 bits of fractional part).
So, we can consider two numbers to be “equal” if they would be
represented as the same fixed-point value by Cairo. Librsvg implements
this in the :internals:trait:`rsvg::float_eq_cairo::ApproxEqCairo`
trait. You can use it like this:
.. code:: rust
use float_eq_cairo::ApproxEqCairo; // bring the trait into scope
let a: f64 = ...;
let b: f64 = ...;
if a.approx_eq_cairo(&b) { // not a == b
... // equal!
}
assert!(1.0_f64.approx_eq_cairo(&1.001953125_f64)); // 1 + 1/512 - cairo rounds to 1
.. _some_interesting_parts_of_the_code:
Some interesting parts of the code
----------------------------------
- Are you adding support for a CSS property? Look at the
:doc:`adding_a_property` tutorial; look in the
:internals:module:`rsvg::property_defs`
and :internals:module:`rsvg::properties`
modules. ``property_defs`` defines most of the CSS properties that
librsvg supports, and ``properties`` actually puts all those
properties in the :internals:struct:`rsvg::properties::SpecifiedValues`
and :internals:struct:`rsvg::properties::ComputedValues` structs.
- The :internals:struct:`rsvg::drawing_ctx::DrawingCtx`
struct is active while an SVG handle is being drawn. It has all the
mutable state related to the drawing process, such as the stack of
temporary rendered surfaces, and the viewport stack.
- The :internals:struct:`rsvg::document::Document`
struct represents a loaded SVG document. It holds the tree of
:internals:type:`rsvg::node::Node` structs, some of which contain
:internals:struct:`rsvg::element::Element` and some others contain
:internals:struct:`rsvg::text::Chars` for text data in the XML.
A ``Document`` also contains a mapping of ``id`` attributes to the
corresponding element nodes.
- The :internals:module:`rsvg::xml` module receives events from an XML
parser, and builds a ``Document`` tree.
- The :internals:module:`rsvg::css`
module has the high-level machinery for parsing CSS and representing
parsed stylesheets. The low-level parsers for individual properties
are in :internals:module:`rsvg::property_defs` and
:internals:module:`rsvg::font_props`.