perf: pre-cached indent arrays for bulk newline+spaces #676
Open
He-Pin wants to merge 1 commit into databricks:master
Conversation
stephenamar-db requested changes on Apr 8, 2026
Force-pushed f2e7618 to 9db668d
He-Pin (Contributor, Author) replied:
Good catch — extracted the magic number into a MaxCachedDepth constant. Note that the indent cache content is instance-specific (it depends on the indent setting).
Force-pushed 9db668d to 7ec85ce
Extract MaxCachedDepth=16 to Renderer companion object constant per review. Pre-compute indentCache arrays for depths 0..15 to replace per-character space emission with a single bulk appendAll in flushBuffer.
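A minimal, self-contained sketch of how such a cache can be built (the MaxCachedDepth value and indent width here are illustrative stand-ins, not necessarily the PR's final values):

```scala
object IndentCacheSketch {
  // Illustrative values; in the PR these live on Renderer / its companion object.
  final val MaxCachedDepth = 16
  final val indent = 2 // assumed spaces-per-level setting

  // indentCache(d) holds '\n' followed by indent * d spaces, so emitting an
  // indent at depth d becomes one bulk append instead of a per-character loop.
  val indentCache: Array[Array[Char]] = {
    val cache = new Array[Array[Char]](MaxCachedDepth)
    var d = 0
    while (d < MaxCachedDepth) {
      val spaces = indent * d
      val buf = new Array[Char](spaces + 1)
      buf(0) = '\n'
      java.util.Arrays.fill(buf, 1, spaces + 1, ' ')
      cache(d) = buf
      d += 1
    }
    cache
  }
}
```

Each entry is computed once at construction time, so the per-render cost is a single array lookup.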
Force-pushed 7ec85ce to abfe59a
```scala
var d = 0
while (d < MaxCachedDepth) {
  val spaces = indent * d
  val buf = new Array[Char](spaces + 1)
```
Collaborator
Wouldn't

```scala
val buf = Array.fill(spaces + 1) { ' ' }
buf(0) = '\n'
```

be more terse?
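For what it's worth, the two formulations produce identical arrays; a quick equivalence check (helper names here are hypothetical, just for the comparison):

```scala
object FillCheck {
  // The hand-rolled version from the diff: allocate, set '\n', loop spaces in.
  def manual(spaces: Int): Array[Char] = {
    val buf = new Array[Char](spaces + 1)
    buf(0) = '\n'
    var i = 1
    while (i <= spaces) { buf(i) = ' '; i += 1 }
    buf
  }

  // The terser version suggested above: fill with spaces, then overwrite slot 0.
  def terse(spaces: Int): Array[Char] = {
    val buf = Array.fill(spaces + 1)(' ')
    buf(0) = '\n'
    buf
  }
}
```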
Motivation
The JSON renderer generates indentation strings (newline + spaces) on every nested element. For deeply nested Jsonnet output, Renderer.visitKey and visitEnd repeatedly construct identical indent strings. The current implementation calls elemBuilder.append('\n') followed by a while loop appending spaces — this is O(depth) per indent operation.
Key Design Decision
Pre-cache indent strings (newline + spaces) in a companion object array up to depth 64 (MaxCachedDepth). For depths ≤ 64, indent operations become a single array lookup + bulk write. For depths > 64 (rare in practice), fall through to the original loop.
Modification
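A sketch of the fast path with its fall-through, using a plain StringBuilder as a stand-in for elemBuilder (the cache bound and indent width are illustrative, not the PR's exact values):

```scala
object IndentFastPath {
  final val MaxCachedDepth = 16
  final val indent = 2 // assumed spaces-per-level setting

  // Pre-computed "\n" + spaces for each cached depth.
  private val indentCache: Array[Array[Char]] =
    Array.tabulate(MaxCachedDepth)(d => ("\n" + " " * (indent * d)).toCharArray)

  // Emits '\n' plus depth * indent spaces into sb.
  def writeIndent(sb: StringBuilder, depth: Int): Unit =
    if (depth < MaxCachedDepth) {
      sb.appendAll(indentCache(depth)) // fast path: single bulk write
    } else {
      sb.append('\n') // rare deep nesting: original per-character loop
      var i = 0
      while (i < depth * indent) { sb.append(' '); i += 1 }
    }
}
```

Both branches produce identical output; only the cached branch avoids the O(depth) per-character appends.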
sjsonnet/src/sjsonnet/Renderer.scala:
- Add MaxCachedDepth = 64 constant and indentCache: Array[Array[Char]]
- Pre-compute "\n" + " " * (depth * indent) as char arrays for depths 0–64
- flushBuffer() fast path: when depth ≤ MaxCachedDepth, uses elemBuilder.appendAll(cachedArray, len) instead of the character-by-character loop
Benchmark Results
JMH — Full Suite (35 benchmarks, 1+1 warmup)
No regressions detected. All benchmarks within noise margin.
Note
The indentation cache optimization primarily benefits:
- std.manifestJsonEx — uses indentation for pretty-printing
- Scala Native, where bulk System.arraycopy-style writes replace per-character appends
Analysis
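One way to see the mechanism (illustrative, not the PR's benchmark): both paths below produce identical output, but the bulk append delegates to an arraycopy-style block copy instead of one method call per character:

```scala
object BulkVsPerChar {
  // Pre-computed indent for one depth (8 spaces, illustrative).
  val cached: Array[Char] = ("\n" + " " * 8).toCharArray

  // Per-character emission: one append call per space.
  def perChar(): String = {
    val sb = new StringBuilder
    sb.append('\n')
    var i = 0
    while (i < 8) { sb.append(' '); i += 1 }
    sb.toString
  }

  // Bulk emission: a single appendAll over the pre-computed array.
  def bulk(): String = {
    val sb = new StringBuilder
    sb.appendAll(cached)
    sb.toString
  }
}
```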
References
- upickle.core.CharBuilder.appendAll(char[], int) for bulk writes
- Renderer.flushBuffer
Result
Pre-cached indent arrays eliminate per-character overhead for nested JSON rendering. No regressions. Benefits deeply nested output and Scala Native.