Export hotblocks storage size metrics NET-910 by kalabukdima · Pull Request #78 · subsquid/data

kalabukdima · 2026-06-25T09:55:29Z

Run a background task to roughly estimate how much disk space each dataset is using. Expose it with the metrics. These numbers may be used to quickly understand which datasets started growing, but not as a precise disk usage profiler.

Also move all the DB reads out of the scraping path to avoid locking the DB on calls to /metrics.

Run a background task to roughly estimate how much disk space each dataset is using. Expose it with the metrics. These numbers may be used to quickly understand which datasets started growing, but not as precise disk usage profiler. Also move all the DB reads out of the scraping path to avoid locking the DB on calls to /metrics.

define-null · 2026-06-30T14:26:31Z

Have you run the benchmarks? I'm not familiar with this code, but by the quick look - PR introduces a basically fetch_all loop across all datasets, which is triggered every 60 seconds. Given that it reads from disk, de-serializes the data - it would be great to validate how it affects the overall performance.

Additionally - have you considered keeping counters per dataset, with recording how much is added/removed at the places where it is done instead? If yes - why that idea was discarded?

define-null · 2026-06-30T14:27:24Z


+#[instrument(name = "dataset_stats", skip_all)]
+async fn dataset_stats_loop(db: DBRef, dataset_id: DatasetId, sender: tokio::sync::watch::Sender<DatasetStats>) {
+    const REFRESH: Duration = Duration::from_secs(60);


better move this to config parameters to better control the overhead, that this and similar loop causes

define-null · 2026-06-30T14:34:04Z

    }

+    /// Approximate on-disk size of each column family as `(cf_name, bytes)`. Counts
+    /// SST files only, excluding WAL and memtables. Constant-time property reads.


It's not constant-time property reads
https://github.com/facebook/rocksdb/blob/08809f5e6cd9cc4bc3958dd4d59457ae78c76660/include/rocksdb/db.h#L579

// "rocksdb.total-sst-files-size" - returns total size (bytes) of all SST // files. // WARNING: may slow down online queries if there are too many files.

Thanks for noticing! Moving to the background task as well

define-null · 2026-06-30T14:36:33Z

+    loop {
+        let db = db.clone();
+        let span = tracing::Span::current();
+        let result = tokio::task::spawn_blocking(move || {


One more blocking task per dataset, per minute doesn't look right to me.

I've measured the time of running, and it's <40 ms per dataset at worst, without blocking anything.
The time of running the global metadata estimator is <10 ms and holds the global locks for writes, but doesn't affect reads.

Sounds good, would you mind sharing some details related to the overall load, size of the db, etc?

define-null · 2026-06-30T14:42:42Z

There exist write_chunk/delete_chunk that could be instrumented. I suggest we go that route instead. You may get the current numbers from the DB at the startup of the system.

kalabukdima · 2026-07-01T12:33:13Z

have you considered keeping counters per dataset, with recording how much is added/removed at the places where it is done instead? If yes - why that idea was discarded?

Even if we knew how much data is added/removed by every operation, we would need to read some initial value at the startup. But I don't think that storing N bytes of data in the KV storage corresponds to storing N bytes on disk. So I don't see a way to combine the values read from disk at startup and the incremental changes.

My plan was to build an image and test it in a live environment. But let me do it in advance for safety.

- Compute rocksdb_column_family_size_bytes on a background loop (storage_metrics_loop), published via a watch channel, instead of hitting RocksDB on every Prometheus scrape - Share one configurable --storage-stats-interval-secs between this loop and the existing per-dataset stats loop - Log measurement timing (elapsed_us) for both loops - Fix doc comments that understated column_family_sizes' cost: it holds RocksDB's DB mutex and scales with live SST files, not O(1) Part of NET-442

kalabukdima requested a review from define-null June 25, 2026 09:55

kalabukdima marked this pull request as ready for review June 25, 2026 09:56

style: apply cargo +nightly fmt

13af542

kalabukdima changed the title ~~Export hotblocks storage size metrics NET-442~~ Export hotblocks storage size metrics NET-910 Jun 30, 2026

define-null reviewed Jun 30, 2026

View reviewed changes

define-null requested changes Jun 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Export hotblocks storage size metrics NET-910#78

Export hotblocks storage size metrics NET-910#78
kalabukdima wants to merge 3 commits into
masterfrom
dataset-sizes

kalabukdima commented Jun 25, 2026

Uh oh!

define-null commented Jun 30, 2026

Uh oh!

define-null Jun 30, 2026 •

edited

Loading

Uh oh!

define-null Jun 30, 2026

Uh oh!

kalabukdima Jul 2, 2026

Uh oh!

define-null Jun 30, 2026

Uh oh!

kalabukdima Jul 2, 2026

Uh oh!

define-null Jul 3, 2026

Uh oh!

define-null commented Jun 30, 2026

Uh oh!

kalabukdima commented Jul 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

kalabukdima commented Jun 25, 2026

Uh oh!

define-null commented Jun 30, 2026

Uh oh!

define-null Jun 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

define-null Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

kalabukdima Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

define-null Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

kalabukdima Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

define-null Jul 3, 2026

Choose a reason for hiding this comment

Uh oh!

define-null commented Jun 30, 2026

Uh oh!

kalabukdima commented Jul 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

define-null Jun 30, 2026 •

edited

Loading