日志

Dora provides a structured logging system for real-time robotics and AI dataflows. Logs are captured per-node as structured JSONL files, forwarded to the coordinator for live streaming, and optionally routed through the dataflow graph as data messages.

Which Logging Approach Should I Use?

Start here if you’re unsure which approach fits your use case.

I want to…	Approach	配置
Log from Python	Use Python’s `logging` module (auto-bridged)	Nothing – just `import logging`
Log from Rust	Use `node.log_info()` / `node.log_error()` etc.	Nothing – works out of the box
Log from C/C++	Use `dora_log()` / `log_message()`	Nothing – works out of the box
Filter noisy nodes	Set `min_log_level` in YAML	Per-node YAML field
Watch all logs in one place	Subscribe to `dora/logs` virtual input	`inputs: logs: dora/logs`
Process one node’s logs as data	Use `send_logs_as` on that node	Per-node YAML + wire the output
Rotate log files	Set `max_log_size` in YAML	Per-node YAML field
Build a custom log sink	Use `dora-log-utils` crate	Rust dependency
Filter CLI display	Use `--log-level` / `--log-filter` flags	CLI flags or env vars

Language-Specific Quick Start

Python – the simplest path is Python’s built-in logging module:

import logging
from dora import Node

node = Node()  # Automatically bridges Python logging -> dora

logging.info("Sensor started")       # Captured as structured "info" log
logging.warning("High temp: 42C")    # Captured as structured "warn" log
print("raw debug output")            # Captured as "stdout" level

When Node() is created, it installs a handler that routes all Python logging calls through Rust’s tracing system. The daemon parses these as structured log entries with level, message, file, and line number. No extra configuration needed.

You can also use the explicit API for structured fields:

node.log_info("Reading acquired")
node.log("info", "Reading acquired", fields={"sensor_id": "temp-01"})

Rust – use the node API convenience methods:

#![allow(unused)]
fn main() {
let (node, mut events) = DoraNode::init_from_env()?;

// Convenience methods (recommended for most cases)
node.log_info("Sensor started");
node.log_warn("High temperature");

// With structured fields
let mut fields = BTreeMap::new();
fields.insert("sensor_id".into(), "temp-01".into());
node.log_with_fields("info", "Reading acquired", None, Some(&fields));
}

Alternatively, Rust nodes can use the tracing crate. When dora’s tracing subscriber is initialized (via init_tracing()), tracing::info!() etc. output structured JSON to stdout, which the daemon parses automatically:

#![allow(unused)]
fn main() {
// Also works -- parsed as structured logs by the daemon
tracing::info!("Sensor started");
tracing::warn!(sensor_id = "temp-01", "High temperature");
}

Use node.log_*() when you want explicit control over the log format. Use tracing::*!() when you want ecosystem integration (spans, instrumentation, OpenTelemetry). Both produce identical structured log entries in the daemon.

C – use the dora_log() function:

dora_log(ctx, "info", 4, "Sensor started", 14);

C++ – use the log_message() function:

log_message(node.send_output, "info", "Sensor started");

功能一览

特性	范围	配置
日志级别过滤	CLI display	`--log-level`, `DORA_LOG_LEVEL`
Output formats	CLI display	`--log-format`, `DORA_LOG_FORMAT`
Per-node level overrides	CLI display	`--log-filter`, `DORA_LOG_FILTER`
Source-level filtering	Per-node YAML	`min_log_level`
Stdout-as-data routing	Per-node YAML	`send_stdout_as`
Structured log routing	Per-node YAML	`send_logs_as`
Log file rotation	Per-node YAML	`max_log_size`
Rotation file limit	Per-node YAML	`max_rotated_files`
Node log API	Rust/Python/C/C++ node	`node.log()`, `dora_log()`, etc.
Log utilities library	Rust crate	`dora-log-utils`
Log aggregation	Dataflow input	`dora/logs` virtual input
Time-range filtering	`dora logs`	`--since`, `--until`
Live log streaming	`dora logs`	`--follow`
Text search	`dora logs`	`--grep`
Local log reading	`dora logs`	`--local`, `--all-nodes`

Log File Format

Each node produces a JSONL file (one JSON object per line) at:

<working_dir>/out/<dataflow_uuid>/log_<node_id>.jsonl

Each line has this structure:

{
  "timestamp": "2024-01-15T10:30:00.123Z",
  "level": "info",
  "node_id": "sensor",
  "message": "Starting sensor...",
  "target": "sensor::module",
  "fields": { "key": "value" }
}

Field	类型	描述
`timestamp`	string	RFC3339 timestamp with millisecond precision
`level`	string	`"error"`, `"warn"`, `"info"`, `"debug"`, `"trace"`, or `"stdout"`
`node_id`	string	Node ID
`message`	string	The log message text
`target`	string?	Rust module target (e.g. `"sensor::module"`), null if absent
`fields`	object?	Structured key-value fields from the logging framework. Trust model: fields originate from node stdout and are passed through without sanitization. In mixed-trust environments, log consumers should validate field contents before acting on them

How Node Output Becomes Log Entries

The daemon captures each line of stdout/stderr from a node process and attempts to parse it as a structured log message (JSON with level, message, timestamp, and optional fields). If parsing succeeds, the structured fields are preserved. If parsing fails, the raw line becomes a "stdout"-level entry.

This means nodes using Rust’s tracing or log crate with JSON output get full structured logging automatically. Nodes that simply println! produce "stdout"-level entries.

Viewing Logs: `dora run`

When running a dataflow with dora run, logs from all nodes are displayed in real-time on the terminal.

Flags

dora run dataflow.yml [OPTIONS]

标志	默认	Env Var	描述
`--log-level LEVEL`	`stdout`	`DORA_LOG_LEVEL`	Minimum level to display
`--log-format FORMAT`	`pretty`	`DORA_LOG_FORMAT`	Output format: `pretty`, `json`, `compact`
`--log-filter FILTER`	none	`DORA_LOG_FILTER`	Per-node level overrides

日志级别

From most to least verbose:

Level	描述
`stdout`	Everything including raw stdout from nodes (default)
`trace`	Fine-grained diagnostic messages
`debug`	Developer-level diagnostic messages
`info`	General informational messages
`warn`	Warning conditions
`error`	Error conditions only

Setting --log-level info hides stdout, trace, and debug messages. The stdout level is a special catch-all that passes everything.

Level Filtering Logic

The level filter uses LogLevelOrStdout::passes():

Message level    Filter level    Displayed?
─────────────    ────────────    ──────────
stdout           stdout          yes
stdout           info            no       (stdout only passes stdout filter)
info             stdout          yes      (any log level passes stdout filter)
debug            info            no       (debug is more verbose than info)
error            info            yes      (error is less verbose than info)

Per-Node Overrides

The --log-filter flag lets you set different levels for different nodes:

dora run dataflow.yml --log-level info --log-filter "sensor=debug,planner=warn"

This shows info and above for all nodes, except sensor (shows debug and above) and planner (shows warn and above).

Format: "node1=level,node2=level" (comma-separated name=level pairs).

Output Formats

Pretty (default) – colored, human-readable:

10:30:00 INFO   sensor: Starting sensor...

10:30:01 INFO   [dora]: spawning node processor

10:30:01 stdout sensor: raw output line

Timestamp in local timezone (HH:MM:SS)
Level colored: ERROR (red), WARN (yellow), INFO (green), DEBUG (blue), TRACE (dimmed), stdout (italic dimmed blue)
Node name in bold with a unique color based on the name
System messages prefixed with [dora]
Lifecycle messages (spawning, node finished, stopping) get visual separation with blank lines

Json – full LogMessage struct as JSON, one per line:

{"build_id":null,"dataflow_id":"abc-123","node_id":"sensor","level":"INFO","message":"Starting...","timestamp":"2024-01-15T10:30:00Z",...}

Useful for piping to jq or ingesting into log aggregation systems.

Compact – minimal, no color:

10:30:00 INFO sensor: Starting sensor...

Useful for CI/CD environments and log files.

Viewing Logs: `dora logs`

Read historical logs or stream live logs from a running dataflow.

Basic Usage

# Read logs for a specific node (via coordinator)
dora logs <dataflow_uuid> <node_name>

# Read local log files directly
dora logs --local <node_name>
dora logs --local --all-nodes

# Stream live logs
dora logs <dataflow_uuid> <node_name> --follow
dora logs --local <node_name> --follow

Flags

标志	Short	默认	描述
`--local`		false	Read from local `out/` directory instead of coordinator
`--all-nodes`		false	Merge logs from all nodes, sorted by timestamp
`--tail N`	`-n`	all	Show only the last N lines
`--follow`	`-f`	false	Stream new log entries as they arrive
`--since DURATION`		none	Only show logs newer than this duration ago
`--until DURATION`		none	Only show logs older than this duration ago
`--level LEVEL`		`stdout`	Minimum log level (env: `DORA_LOG_LEVEL`)
`--grep PATTERN`		none	Case-insensitive text search
`--coordinator-addr IP`		`127.0.0.1`	Coordinator address
`--coordinator-port PORT`		default	Coordinator control port

Time Filters

--since and --until accept duration strings relative to now:

# Logs from the last 5 minutes
dora logs --local sensor --since 5m

# Logs from 1 hour ago to 30 minutes ago
dora logs --local sensor --since 1h --until 30m

# Last 10 errors from the past hour
dora logs --local sensor --since 1h --level error --tail 10

Supported duration formats: 30 (seconds), 30s, 5m, 1h, 2d.

Text Search

--grep performs case-insensitive substring matching against:

The log message text
The node ID
The module target

# Find all timeout-related messages
dora logs --local --all-nodes --grep "timeout"

# Find errors from a specific module
dora logs --local sensor --grep "camera::driver" --level error

Filter Pipeline

All filters are applied in this order:

Read/Parse -> Time Filters -> Grep -> Tail -> Display

When --since, --until, or --grep are used in coordinator mode, the CLI fetches all logs from the server (ignoring --tail server-side) and applies all filters client-side. This ensures correct results when combining filters.

Local vs Coordinator Mode

Local mode (--local) reads JSONL files directly from the out/ directory in the current working directory. No coordinator or daemon needs to be running. If --all-nodes is used or no node name is given, all log files are merged and sorted by timestamp.

Coordinator mode (default) connects to a running coordinator via WebSocket. The coordinator reads log files from the daemon’s working directory and streams them back. This works for both local and distributed deployments.

Follow Mode

Local follow (--local --follow): Polls log files every 200ms for new content. New lines are parsed, filtered by --grep, and printed. Time/tail filters only apply to the initial historical output.

Coordinator follow (--follow): Opens a WebSocket subscription to the coordinator. The coordinator forwards log messages from the daemon in real-time. Level filtering is applied server-side for efficiency. --grep and --since are applied client-side on the stream.

环境变量

All environment variables serve as fallbacks – CLI flags always take precedence.

变量	Used By	Values	描述
`DORA_LOG_LEVEL`	`dora run`, `dora logs`	`error`, `warn`, `info`, `debug`, `trace`, `stdout`	Default minimum log level
`DORA_LOG_FORMAT`	`dora run`	`pretty`, `json`, `compact`	Default output format
`DORA_LOG_FILTER`	`dora run`	`"node1=level,node2=level"`	Default per-node overrides
`DORA_QUIET`	daemon	any value	Suppress log forwarding to display (file writing continues)

Example:

# Set defaults for a development session
export DORA_LOG_LEVEL=info
export DORA_LOG_FORMAT=pretty
export DORA_LOG_FILTER="sensor=debug"

# These are equivalent:
dora run dataflow.yml
dora run dataflow.yml --log-level info --log-format pretty --log-filter "sensor=debug"

# CLI flag overrides env var:
dora run dataflow.yml --log-level debug   # overrides DORA_LOG_LEVEL=info

YAML Configuration

`min_log_level`

Filter logs at the source (daemon-side) before they reach log files, the coordinator, or send_logs_as routing.

nodes:
  - id: noisy-sensor
    path: ./target/debug/sensor
    min_log_level: info    # suppress debug/trace/stdout from this node

Valid values: error, warn, info, debug, trace, stdout.

When set, the daemon drops log messages below this level immediately after parsing. This reduces disk I/O, network traffic, and log file size. The filtering uses the same passes() logic as the CLI display filter.

`send_stdout_as`

Route raw stdout/stderr lines as dataflow output messages.

nodes:
  - id: legacy-node
    path: ./legacy-script.py
    send_stdout_as: raw_output
    outputs:
      - raw_output
      - data

  - id: log-consumer
    inputs:
      logs: legacy-node/raw_output

Each stdout/stderr line is sent as an Arrow-encoded string. This is useful for integrating legacy nodes that output data on stdout (e.g., Python scripts using print()).

Both send_stdout_as and normal log file writing happen – stdout routing does not suppress log files.

`send_logs_as`

Route parsed structured log entries as dataflow output messages.

nodes:
  - id: sensor
    path: ./target/debug/sensor
    send_logs_as: log_entries
    outputs:
      - data
      - log_entries

  - id: log-aggregator
    inputs:
      sensor_logs: sensor/log_entries

Unlike send_stdout_as, this only sends lines that were successfully parsed as structured logs (not raw stdout). Each entry is serialized as a full JSON LogMessage string. The min_log_level filter applies before routing – suppressed messages are not sent.

Use this to build log aggregation, alerting, or monitoring nodes within the dataflow itself.

`dora/logs` – Automatic Log Aggregation

Subscribe to logs from all nodes with a single input line – no manual wiring needed:

nodes:
  - id: sensor
    path: sensor.py
    inputs:
      tick: dora/timer/millis/200
    outputs:
      - reading

  - id: processor
    path: processor.py
    inputs:
      reading: sensor/reading
    outputs:
      - result

  - id: log-viewer
    path: log_viewer.py
    inputs:
      logs: dora/logs              # all nodes, all levels
      errors: dora/logs/error      # only error+ from all nodes
      sensor: dora/logs/info/sensor  # info+ from one node

The dora/logs virtual input works like dora/timer – the daemon handles subscription internally. Each log message arrives as a JSON-encoded LogMessage string in an Arrow array. To prevent infinite loops, a node never receives its own log messages.

Syntax:

Input	描述
`dora/logs`	All logs from all nodes
`dora/logs/<level>`	Logs at `<level>` or above from all nodes
`dora/logs/<level>/<node-id>`	Logs at `<level>` or above from a specific node

Levels: stdout, error, warn, info, debug, trace.

When to use dora/logs vs send_logs_as:

	`dora/logs`	`send_logs_as`
范围	All nodes at once	One node at a time
YAML changes	Only the consumer	Each source node
Adding a node	Zero wiring changes	Must update consumer
用例	Dashboard, monitoring	Per-node log processing

See examples/log-aggregator/ for a complete working example.

`max_log_size`

Enable size-based log file rotation.

nodes:
  - id: sensor
    path: ./target/debug/sensor
    max_log_size: "50MB"

值	Bytes
`"1KB"` or `"1K"`	1,024
`"50MB"` or `"50M"`	52,428,800
`"1GB"` or `"1G"`	1,073,741,824
`"1000"`	1,000 (plain number = bytes)

When the active log file exceeds the configured size, the daemon:

Flushes and closes the current file
Renames existing rotated files: .4.jsonl -> .5.jsonl, .3.jsonl -> .4.jsonl, etc.
Renames the current file: log_sensor.jsonl -> log_sensor.1.jsonl
Creates a fresh log_sensor.jsonl
Deletes any file beyond the rotation limit (default 5, configurable via max_rotated_files)

Naming convention:

log_sensor.jsonl       # current (active)
log_sensor.1.jsonl     # previous
log_sensor.2.jsonl     # older
log_sensor.3.jsonl
log_sensor.4.jsonl
log_sensor.5.jsonl     # oldest (deleted on next rotation)

Maximum disk usage per node: max_log_size * (1 + max_rotated_files) (1 active + N rotated).

Without max_log_size, log files grow unbounded. For long-running dataflows, always set this.

The dora logs --local command automatically reads all rotated files for a node and merges them in chronological order (oldest rotated file first, current file last).

`max_rotated_files`

Control how many rotated log files to keep (default: 5, range: 1-100).

nodes:
  - id: sensor
    path: ./target/debug/sensor
    max_log_size: "50MB"
    max_rotated_files: 10    # keep 10 rotated files instead of 5

With max_rotated_files: 10 and max_log_size: "50MB", maximum disk usage is 50MB * 11 = 550MB per node. Lower values save disk space; higher values preserve more history.

Runtime Node Restrictions

For runtime nodes (operators), only one of each logging field is allowed per runtime:

# OK -- single operator
nodes:
  - id: runtime-node
    operator:
      python: process.py
      send_logs_as: logs
      min_log_level: info
      max_log_size: "100MB"

# ERROR -- multiple operators with conflicting configs
nodes:
  - id: runtime-node
    operators:
      - id: op1
        python: a.py
        send_logs_as: logs1
      - id: op2
        python: b.py
        send_logs_as: logs2    # Error: multiple send_logs_as

When a single operator in a runtime sets these fields, the output name is prefixed with the operator ID (e.g., op1/logs).

Node Log API

Nodes can emit structured log messages programmatically using the node API. These are equivalent to writing JSON-formatted log lines to stdout – the daemon parses them identically.

Rust

#![allow(unused)]
fn main() {
use dora_node_api::DoraNode;
use std::collections::BTreeMap;

let (node, mut events) = DoraNode::init_from_env()?;

// General log with level string and optional target
node.log("info", "sensor initialized", Some("sensor::init"));

// Convenience methods (no target parameter)
node.log_error("connection failed");
node.log_warn("temperature elevated");
node.log_info("reading acquired");
node.log_debug("raw bytes received");
node.log_trace("entering loop iteration");

// Structured fields (key-value context preserved through send_logs_as)
let mut fields = BTreeMap::new();
fields.insert("sensor_id".to_string(), "temp-01".to_string());
fields.insert("reading".to_string(), "42.5".to_string());
node.log_with_fields("info", "reading acquired", None, Some(&fields));
}

The level parameter accepts "error", "warn" (or "warning"), "info", "debug", "trace". Unknown levels default to "info". Fields are capped at 60 KB total to match the downstream 64 KB parse limit.

Python

Python nodes have three ways to log, all producing structured log entries:

from dora import Node
import logging

node = Node()

# Option 1: Python's logging module (recommended -- auto-bridged by Node())
logging.info("sensor initialized")
logging.warning("temperature elevated")
logging.debug("raw bytes: %s", data)

# Option 2: Explicit dora API with level string
node.log("info", "sensor initialized", target="sensor.init")
node.log("info", "reading acquired", fields={"sensor_id": "temp-01", "reading": "42.5"})

# Option 3: Convenience methods
node.log_error("connection failed")
node.log_warn("temperature elevated")
node.log_info("reading acquired")
node.log_debug("raw bytes received")
node.log_trace("entering loop iteration")

# This also works but produces "stdout"-level entries (no structure):
print("raw output")

How the Python logging bridge works: When Node() is created, it installs a custom logging.Handler that routes all Python logging calls through Rust’s tracing system. The daemon parses these as structured log entries with level, message, file path, and line number. This happens automatically – no configuration needed.

方法	Structured?	Fields support?	When to use
`logging.info()`	是	No (use `extra=` for custom formatters)	General-purpose logging
`node.log("info", msg, fields={...})`	是	是	When you need structured key-value context
`node.log_info(msg)`	是	否	Quick one-liner, same as `node.log("info", msg)`
`print()`	No (`stdout` level)	否	Legacy code, quick debugging

Common pitfall: Do not call logging.basicConfig() before creating Node(). The node constructor sets up the logging bridge; calling basicConfig() first may install a conflicting handler. If you need custom formatters, configure them after Node() creation.

C

#include "node_api.h"

void *ctx = init_dora_context_from_env();
const char *level = "info";
const char *msg = "sensor initialized";
dora_log(ctx, level, strlen(level), msg, strlen(msg));

C++

// Via the cxx bridge
auto node = init_dora_node();
log_message(node.send_output, "info", "sensor initialized");

Log Utilities Library (`dora-log-utils`)

The dora-log-utils crate provides parsing, merging, filtering, and formatting utilities for working with LogMessage entries in custom sink nodes. Use it when building nodes that consume log data via send_logs_as.

API

#![allow(unused)]
fn main() {
use dora_log_utils;

// Parse a LogMessage from JSON (as received from send_logs_as)
let log = dora_log_utils::parse_log(json_str)?;

// Parse directly from Arrow input data (convenience for event handlers)
let log = dora_log_utils::parse_log_from_arrow(&data)?;

// Merge multiple log streams into a single timeline
let merged = dora_log_utils::merge_by_timestamp(vec![stream_a, stream_b]);

// Filter by minimum level
let errors = dora_log_utils::filter_by_level(&logs, &min_level);

// Format as JSON (one line, no trailing newline)
let json = dora_log_utils::format_json(&log);

// Format as compact single-line: "<timestamp> <node> <LEVEL>: <message>"
let compact = dora_log_utils::format_compact(&log);

// Format as pretty: "[<timestamp>][<LEVEL>][<node>] <message>"
let pretty = dora_log_utils::format_pretty(&log);
}

Dependency

Add to your sink node’s Cargo.toml:

[dependencies]
dora-log-utils = { workspace = true }

Log Sink Examples

Three example sink nodes demonstrate how to consume logs routed via send_logs_as and forward them to external destinations.

File Sink (`examples/log-sink-file/`)

Merges log streams from multiple nodes into a single JSONL file. Useful for unified log collection.

nodes:
  - id: sensor
    path: sensor.py
    send_logs_as: log_entries
    inputs:
      tick: dora/timer/millis/200
    outputs:
      - reading
      - log_entries

  - id: processor
    path: processor.py
    send_logs_as: log_entries
    inputs:
      reading: sensor/reading
    outputs:
      - result
      - log_entries

  - id: file_sink
    path: log-sink-file
    inputs:
      sensor_logs: sensor/log_entries
      processor_logs: processor/log_entries
    env:
      LOG_FILE: "./combined.jsonl"

The file sink reads LOG_FILE from the environment (default ./combined.jsonl), parses each incoming Arrow message with dora_log_utils::parse_log_from_arrow(), formats it as JSON, and appends it to the file.

TCP Sink (`examples/log-sink-tcp/`)

Forwards log entries over a TCP socket to a remote log collector. Useful for embedded systems that lack local filesystems and need to stream logs off-device.

nodes:
  - id: source
    path: source.py
    send_logs_as: log_entries
    inputs:
      tick: dora/timer/millis/500
    outputs:
      - data
      - log_entries

  - id: tcp_sink
    path: log-sink-tcp
    inputs:
      logs: source/log_entries
    env:
      SINK_ADDR: "127.0.0.1:9876"

The TCP sink reads SINK_ADDR from the environment (default 127.0.0.1:9876), connects to the server on startup, and sends each log entry as a JSON line. It reconnects automatically on write failure.

Alert Router (`examples/log-sink-alert/`)

Splits incoming log entries by severity. All logs are forwarded to the all_logs output; only error and warn logs are forwarded to the alerts output. This enables downstream nodes to handle alerts differently (e.g., trigger notifications, write to a dedicated file).

nodes:
  - id: source
    path: my_node.py
    send_stdout_as: log_entries
    inputs:
      tick: dora/timer/millis/200
    outputs:
      - log_entries

  - id: alert_router
    path: log-sink-alert
    inputs:
      logs: source/log_entries
    outputs:
      - all_logs
      - alerts

The source node uses send_stdout_as to route its stdout lines as Arrow string data. The router parses each log entry with dora_log_utils::parse_log_from_arrow(), checks the level, and uses node.send_output() to forward data to the appropriate outputs. Nodes using the node API can alternatively use send_logs_as to route structured logs from node.log().

Building a Custom Sink

To build your own sink node, follow this pattern:

use dora_node_api::{DoraNode, Event};

fn main() -> eyre::Result<()> {
    let (_node, mut events) = DoraNode::init_from_env()?;

    while let Some(event) = events.recv() {
        match event {
            Event::Input { data, .. } => {
                let log = dora_log_utils::parse_log_from_arrow(&data)?;
                // Process the log entry: write to file, send over network, etc.
                let json = dora_log_utils::format_json(&log);
                println!("{json}");
            }
            Event::Stop(_) => break,
            _ => {}
        }
    }
    Ok(())
}

How the Daemon Processes Logs

Understanding the internal pipeline helps with debugging and tuning. For each node, the daemon runs a dedicated async task that processes log lines in order:

Node Process (stdout/stderr)
    |
    v
[1] Capture: lines buffered in mpsc channel (capacity 100)
    |
    v
[2] send_stdout_as: raw line -> Arrow data -> dataflow output
    |
    v
[3] Parse: try JSON structured log, fall back to Stdout-level
    |
    v
[4] min_log_level filter: drop messages below threshold
    |
    v
[5] send_logs_as: LogMessage -> JSON -> Arrow data -> dataflow output
    |
    v
[6] Write JSONL: compact format to log file, track bytes written
    |
    v
[7] Rotation check: if bytes_written >= max_log_size, rotate files
    |
    v
[8] Forward: send LogMessage to display channel (unless DORA_QUIET)
    |
    v
[9] Sync: fsync log file to disk

Key details:

Step 2 happens before parsing, so send_stdout_as captures every line including non-structured output
Step 4 happens before Steps 5-8, so min_log_level suppresses messages from all downstream processing
Step 5 only fires for successfully parsed structured logs (Step 3 success path)
Step 8 sends to either a flume channel (dora run direct mode) or the coordinator (distributed mode)
Step 9 calls sync_all() after every write, ensuring durability at the cost of some I/O overhead

Structured Log Parsing

When a node emits JSON-formatted log output (e.g., from tracing-subscriber with JSON formatting), the daemon extracts:

level: log severity
message: the log text
target: module path
timestamp: when the log was emitted
fields: arbitrary key-value pairs
build_id, dataflow_id, node_id, daemon_id: extracted from fields as fallback

The daemon also sets dataflow_id, node_id, and daemon_id on all messages to ensure they are always present in the log file.

Coordinator Log Streaming Protocol

When a daemon runs under a coordinator (distributed mode), log forwarding works via WebSocket:

Daemon -> Coordinator: Each LogMessage is wrapped in DaemonEvent::Log(message) and sent over the daemon’s WebSocket connection
Coordinator storage: The coordinator stores/forwards logs
CLI subscription: The CLI sends ControlRequest::LogSubscribe { dataflow_id, level } over its WebSocket connection
Server-side filtering: The coordinator only forwards messages where msg_level <= subscription_level. This reduces network traffic for filtered subscriptions
CLI receive: Messages arrive as serialized LogMessage structs

The --level flag maps to log::LevelFilter:

stdout -> LevelFilter::Trace (most permissive, receives everything)
info -> LevelFilter::Info (receives Error, Warn, Info)
etc.

Complete YAML Reference

nodes:
  - id: sensor
    path: ./target/debug/sensor
    outputs:
      - data
      - raw_output       # for send_stdout_as
      - log_entries       # for send_logs_as

    # Source-level log filtering (daemon-side)
    min_log_level: info          # suppress debug/trace/stdout

    # Route stdout to dataflow
    send_stdout_as: raw_output   # every stdout line becomes a data message

    # Route structured logs to dataflow
    send_logs_as: log_entries    # parsed log entries become data messages

    # Log file rotation
    max_log_size: "50MB"         # rotate when file exceeds 50MB
    max_rotated_files: 5         # keep 5 rotated files (default, range 1-100)

    inputs:
      tick: dora/timer/millis/100

Complete Example

The examples/python-logging/ directory contains a runnable three-node pipeline that exercises every logging feature:

sensor (noisy, high-volume) --> processor (structured logs) --> monitor (log aggregator)

Dataflow configuration highlights:

nodes:
  - id: sensor
    path: sensor.py
    min_log_level: info       # suppress debug noise at source
    max_log_size: "1KB"       # small for demo (triggers rotation quickly)
    inputs:
      tick: dora/timer/millis/50
    outputs:
      - reading

  - id: processor
    path: processor.py
    send_logs_as: log_entries  # route structured logs as data
    inputs:
      reading: sensor/reading
    outputs:
      - result
      - log_entries

  - id: monitor
    path: monitor.py
    inputs:
      logs: processor/log_entries
      reading: sensor/reading

What each node demonstrates:

sensor – Mixes print() (raw stdout), logging.info(), logging.debug(), and logging.warning(). With min_log_level: info, debug messages are dropped by the daemon before reaching log files. With max_log_size: "1KB", log rotation kicks in after a few seconds.
processor – Uses send_logs_as: log_entries to route its structured log entries as dataflow data. Raw print() output is not routed (only parsed structured entries are).
monitor – Subscribes to processor/log_entries and counts warnings/errors, demonstrating in-dataflow log aggregation.

Direct mode (dora run – single process, good for quick testing):

# Basic run
dora run examples/python-logging/dataflow.yml --stop-after 5s

# Only warnings and above
dora run examples/python-logging/dataflow.yml --log-level warn --stop-after 5s

# Per-node overrides
dora run examples/python-logging/dataflow.yml --log-filter "monitor=debug,sensor=warn" --stop-after 5s

# JSON output for machine parsing
dora run examples/python-logging/dataflow.yml --log-format json --stop-after 3s

# Environment variable control
DORA_LOG_LEVEL=warn dora run examples/python-logging/dataflow.yml --stop-after 5s

Distributed mode (dora up + dora start – coordinator/daemon architecture, required for multi-machine deployments):

# Start infrastructure
dora up

# Start attached (live log stream)
dora start examples/python-logging/dataflow.yml --attach

# Or start detached and query logs separately
dora start examples/python-logging/dataflow.yml
dora logs <dataflow-id> sensor --follow                    # stream one node
dora logs <dataflow-id> sensor --follow --level warn       # only warnings
dora logs <dataflow-id> --all-nodes --tail 20              # last 20 lines
dora logs <dataflow-id> processor --grep "error" --since 5m  # targeted search

In distributed mode, logs flow Node -> Daemon -> Coordinator -> CLI over WebSocket. The coordinator buffers log messages until a subscriber connects, so you won’t miss logs even if you attach late. YAML-level settings (min_log_level, send_logs_as, max_log_size) work identically since they are applied at the daemon.

	`dora run`	`dora start`
Display filtering	`--log-level`, `--log-format`, `--log-filter`	`--level` on `dora logs`
Per-node overrides	`--log-filter "sensor=debug"`	Separate `dora logs` per node
Remote nodes	否	是
Live streaming	Always attached	`--attach` or `dora logs --follow`

Post-run log analysis (works the same for both modes):

# Read all local logs
dora logs --local --all-nodes --tail 20

# Search for warnings in sensor logs
dora logs --local sensor --grep "high temp"

# Check that rotation created multiple files
ls -la out/*/log_sensor*.jsonl

Use Case Scenarios

1. Debugging a Noisy Sensor Pipeline

A camera sensor node floods the logs with debug messages, making it hard to see errors from other nodes.

nodes:
  - id: camera
    path: ./target/debug/camera
    min_log_level: warn          # suppress info/debug/trace at the source
    max_log_size: "10MB"         # limit disk usage

  - id: detector
    path: ./target/debug/detector

  - id: planner
    path: ./target/debug/planner

# During development: see everything from detector, only warnings from camera
dora run dataflow.yml --log-level debug --log-filter "camera=warn,detector=debug"

# In production: only errors
export DORA_LOG_LEVEL=error
dora run dataflow.yml

What happens:

Camera node’s debug/info messages are dropped by the daemon before reaching the log file (min_log_level: warn)
The CLI further filters display based on --log-filter
Log files rotate at 10MB, keeping at most 60MB on disk for the camera node

2. Log Aggregation Within the Dataflow

Build an in-dataflow log monitoring node that watches for errors across multiple nodes and sends alerts.

nodes:
  - id: camera
    path: ./target/debug/camera
    send_logs_as: logs
    outputs:
      - frames
      - logs

  - id: detector
    path: ./target/debug/detector
    send_logs_as: logs
    outputs:
      - detections
      - logs

  - id: log-monitor
    path: ./target/debug/log-monitor
    inputs:
      camera_logs: camera/logs
      detector_logs: detector/logs
    outputs:
      - alerts

Node-side handling in the log monitor (using dora-log-utils):

#![allow(unused)]
fn main() {
use dora_node_api::{DoraNode, Event};
use dora_message::common::{LogLevel, LogLevelOrStdout};

let (mut node, mut events) = DoraNode::init_from_env()?;
while let Some(event) = events.recv() {
    match event {
        Event::Input { data, .. } => {
            let log = dora_log_utils::parse_log_from_arrow(&data)?;

            let is_error = matches!(log.level,
                LogLevelOrStdout::LogLevel(LogLevel::Error));

            if is_error || log.message.contains("timeout") {
                // Send alert downstream
                node.send_output("alerts", /* ... */)?;
            }
        }
        Event::Stop(_) => break,
        _ => {}
    }
}
}

See also the Log Sink Examples section for complete runnable examples.

3. Post-Mortem Debugging of a Crash

After a dataflow crashes, investigate what happened in the last few minutes.

# Find available dataflows
ls out/

# Read the last 50 lines from all nodes around the crash
dora logs --local --all-nodes --tail 50

# Focus on errors in the last 5 minutes
dora logs --local --all-nodes --since 5m --level error

# Search for a specific error pattern
dora logs --local --all-nodes --grep "out of memory"

# Drill into a specific node
dora logs --local detector --since 2m

# Export as JSON for external analysis
dora run dataflow.yml --log-format json 2>logs.json

4. Long-Running Production Dataflow

A dataflow runs for days or weeks. Without log rotation, disk space fills up.

nodes:
  - id: ingest
    path: ./target/debug/ingest
    min_log_level: info        # no debug noise in production
    max_log_size: "100MB"      # ~600MB max per node (100MB * 6)
    restart_policy: always
    inputs:
      tick: dora/timer/millis/1000
    outputs:
      - data

  - id: processor
    path: ./target/debug/processor
    min_log_level: warn        # only warnings and errors
    max_log_size: "50MB"
    restart_policy: on-failure
    inputs:
      data: ingest/data
    outputs:
      - results

  - id: writer
    path: ./target/debug/writer
    min_log_level: error       # minimal logging
    max_log_size: "20MB"
    inputs:
      results: processor/results

Disk budget:

ingest: up to 600MB (100MB x 6 files)
processor: up to 300MB (50MB x 6 files)
writer: up to 120MB (20MB x 6 files)
Total: ~1GB maximum disk usage for all logs

5. Live Monitoring of a Distributed Deployment

Multiple daemons running on different machines, monitored from a central workstation.

# Start infrastructure (coordinator + local daemon)
dora up

# On remote machines, start a daemon pointing to the coordinator:
#   dora daemon --coordinator-addr 192.168.1.10

# Start the dataflow (detached)
dora start dataflow.yml

# Open targeted log streams in separate terminals:

# Terminal 1: all sensor warnings
dora logs <dataflow-id> sensor --follow --level warn

# Terminal 2: processor errors with text search
dora logs <dataflow-id> processor --follow --level error --grep "timeout"

# Terminal 3: all nodes merged
dora logs <dataflow-id> --all-nodes --follow

# Terminal 4: historical + live (errors from the last hour, then stream)
dora logs <dataflow-id> processor --since 1h --level error --follow

# Monitor a remote coordinator from another machine:
dora logs <dataflow-id> sensor --follow --coordinator-addr 192.168.1.10

How it works internally:

CLI connects to the coordinator (default localhost:6013, or --coordinator-addr)
For historical logs: request-reply with filters applied client-side (--since, --grep, --tail)
For --follow: opens a WebSocket subscription to the coordinator
Coordinator filters by --level server-side before forwarding (reduces network traffic)
CLI applies --grep and --since client-side on the live stream
Coordinator buffers log messages until a subscriber connects, so late-joining subscribers see recent history

6. CI/CD Pipeline with Structured Logging

In CI, use JSON format for machine-parseable output and compact format for readable logs.

# Machine-parseable logs for CI tooling
dora run dataflow.yml --log-format json --stop-after 30s 2>test-logs.json

# Compact logs for CI console output
dora run dataflow.yml --log-format compact --log-level info --stop-after 30s

# Post-run analysis: count errors per node
dora logs --local --all-nodes --level error | wc -l

With JSON format, each line is a complete LogMessage that can be processed by jq, log aggregators, or custom scripts:

# Extract error messages with jq
cat test-logs.json | jq -r 'select(.level == "ERROR") | "\(.node_id): \(.message)"'

Performance Considerations

Logging adds I/O overhead proportional to log volume. Here’s how to tune it:

min_log_level is the most impactful setting. It filters at the daemon before any I/O: no log file write, no coordinator forwarding, no send_logs_as routing. A node emitting 1000 debug lines/sec at min_log_level: info generates zero overhead for those lines.

send_logs_as adds a dataflow message per log line. Each parsed log entry is serialized to JSON, converted to Arrow, and sent through the dataflow. For high-volume nodes, this can consume significant bandwidth. Use min_log_level to limit what gets routed.

dora/logs subscribers share a single serialization. The daemon converts each log line to Arrow once and clones the result for each subscriber. The cost scales linearly with subscriber count, not log volume x subscriber count. For most dataflows (1-3 log subscribers), this is negligible.

Log line size is capped at 1 MB. Lines longer than 1 MB from node stdout/stderr are truncated to prevent heap exhaustion. This protects against buggy nodes that dump large binary data to stdout.

Log file rotation is recommended for long-running dataflows. Without max_log_size, log files grow unbounded. A node emitting 100 lines/sec at ~200 bytes/line fills 1 GB in ~14 hours.

Recommended production settings:

nodes:
  - id: my-node
    path: ./my-node
    min_log_level: info        # drop debug/trace at source
    max_log_size: "50MB"       # rotate at 50MB
    max_rotated_files: 5       # keep 5 rotated files (300MB max)

最佳实践

Set min_log_level in production. Source-level filtering at the daemon prevents debug noise from reaching log files and the network. This is the most effective way to reduce log volume since it filters before any I/O.

Always set max_log_size for long-running dataflows. Without rotation, a single noisy node can fill the disk. Start with "50MB" (300MB total per node with rotation) and adjust based on your storage budget. Use max_rotated_files to tune how much history to keep (default 5, range 1-100).

Use environment variables for team defaults. Set DORA_LOG_LEVEL and DORA_LOG_FORMAT in your shell profile or CI configuration. Individual developers can override with CLI flags.

Use --log-filter during development. Instead of changing YAML config, use per-node display overrides to focus on the node you’re debugging: --log-filter "my-node=debug".

Use send_logs_as for operational monitoring. Build monitoring nodes that watch for error patterns, compute error rates, or forward alerts. This keeps monitoring logic within the dataflow graph. Use dora-log-utils to parse and format log entries in custom sink nodes (see examples/log-sink-file/ and examples/log-sink-tcp/).

Prefer send_logs_as over send_stdout_as for structured data. send_stdout_as captures every stdout line (including raw prints), while send_logs_as only captures parsed structured log entries with full metadata.

Use --local for post-mortem debugging. After a crash, dora logs --local --all-nodes works without a running coordinator and merges all node logs chronologically.

Combine --since with --grep for targeted debugging. Instead of scrolling through thousands of lines, narrow the window: dora logs --local sensor --since 5m --grep "error".

Use JSON format for log pipelines. When feeding logs to external systems (ELK, Grafana Loki, Datadog), use --log-format json for structured ingestion.

Keyboard shortcuts

Dora User Guide