Parallelize node preloads #454
Conversation
Codecov Report
Attention: Patch coverage is
Additional details and impacted files:

@@            Coverage Diff             @@
##             main     #454      +/-   ##
==========================================
- Coverage   88.61%   88.04%   -0.58%
==========================================
  Files          39       38       -1
  Lines        9109     8288     -821
==========================================
- Hits         8072     7297     -775
+ Misses       1037      991      -46

☔ View full report in Codecov by Sentry.
Commits updated from 9badf3d to 56c9ba8.
slawlor left a comment:
Nifty! But I think we could do with making the bool arg an enum for safer usage (as I'm guessing @dillonrg would say 😛).
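(For illustration only, a minimal sketch of the kind of enum the reviewer is suggesting; `PreloadParallelism` and the call sites shown are hypothetical, not the crate's actual API.)

```rust
/// Hypothetical enum replacing a bare `bool` argument so that call sites are
/// self-describing; the name and variants here are illustrative only.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum PreloadParallelism {
    Enabled,
    Disabled,
}

// Before: azks.preload_nodes(&storage, true).await?;   // what does `true` mean here?
// After:  azks.preload_nodes(&storage, PreloadParallelism::Enabled).await?;
```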
kevinlewi left a comment:
LGTM!
Overview
This PR adds parallelization for node preloading. This primarily benefits publishing, which performs node preloading prior to node insertion and audit proof generation.
During node preloading, we perform a breadth-first search on the tree starting from the root, fetching each level of the tree in sequence. This operation's latency has a fixed lower bound, since a minimum number of back-to-back round trips to the DB (one per tree level) is always required. However, the amount of work performed between DB reads is not trivial, and it can contribute significantly to the overall preload latency when publishing hundreds of thousands of keys.
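As a rough illustration of the level-by-level preload described above (a sketch only; `NodeId`, `NodeStore`, and `batch_get_children` are stand-ins for the crate's real node and storage abstractions):

```rust
use std::collections::HashSet;

// Stand-in types for illustration; the real crate uses its own node and
// storage abstractions.
type NodeId = u64;

trait NodeStore {
    // One batched DB read: fetch the given nodes and return their child ids.
    fn batch_get_children(&self, ids: &[NodeId]) -> Vec<NodeId>;
}

// Level-by-level (BFS) preload starting from the root. Each loop iteration
// issues exactly one batched DB read for the current level, so the number of
// sequential round trips is bounded below by the depth of the tree.
fn preload_bfs<S: NodeStore>(store: &S, root: NodeId) -> HashSet<NodeId> {
    let mut visited: HashSet<NodeId> = HashSet::new();
    let mut frontier = vec![root];
    while !frontier.is_empty() {
        visited.extend(frontier.iter().copied());
        // The work done between DB reads (deduplication, bookkeeping) happens here.
        frontier = store
            .batch_get_children(&frontier)
            .into_iter()
            .filter(|id| !visited.contains(id))
            .collect();
    }
    visited
}
```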
We can improve preload latency and bring it closer to that theoretical lower bound by parallelizing the work performed between DB fetches. In this PR, I do so by having each step in the BFS split the work for the next level into two chunks, which are then processed concurrently. The storage layer might also benefit from receiving DB operations in batches that are not overly large.
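A minimal sketch of that idea, assuming an async runtime and a hypothetical `fetch_and_expand` helper standing in for the batched storage read (names and signatures are assumptions, not this PR's actual code):

```rust
use futures::join;

// Process one BFS level by splitting it into two halves and running both
// halves concurrently; each half performs its own batched DB fetch plus the
// CPU work needed to derive the next level's node ids.
async fn process_level(frontier: Vec<u64>) -> Vec<u64> {
    let (left, right) = frontier.split_at(frontier.len() / 2);

    // The two halves overlap their DB latency and between-fetch CPU work.
    let (mut next, rest) = join!(
        fetch_and_expand(left.to_vec()),
        fetch_and_expand(right.to_vec())
    );
    next.extend(rest);
    next // frontier for the next BFS level
}

// Placeholder for the real storage call: fetch a chunk of nodes and return
// the ids of their children.
async fn fetch_and_expand(chunk: Vec<u64>) -> Vec<u64> {
    chunk
}
```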
Note that node preloading is used in both publishing and lookups, but parallelism is always disabled for the lookup path. This is because lookups are likely performed on a machine that services client requests, and we do not want a single lookup to consume all of the machine's resources and starve other client requests.
Benchmark
Ran the azks benchmark on trunk (fcd665a) and on this PR:
Batch Insertion (1000 Initial Leaves, 1000 Inserted Leaves)
Trunk: [benchmark output screenshot]
This PR: [benchmark output screenshot]

Batch Insertion (200,000 Initial Leaves, 200,000 Inserted Leaves)
Trunk: [benchmark output screenshot]
This PR: [benchmark output screenshot]
On my 10-core MacBook Pro, this yields a decent improvement (~27%). The improvement should be larger on machines with higher core counts and at higher load levels.