Remove queued duration minimum threshold #2184

pkoenig10 · 2025-05-29T16:41:50Z

Before this PR

In our internal authentication service, we have a single threaded executor where, at any given time, there is at most one executing task and one queued task. The code looks something like:

private final ExecutorService updateExecutor = Executors.newSingleThreadExecutor();
private final AtomicReference<SettableFuture<Void>> pendingUpdate = new AtomicReference<>();

Future<Void> updateCache() {
    SettableFuture<Void> future = pendingUpdate.get();
    if (future != null) {
        return future;
    }

    future = SettableFuture.create();

    SettableFuture<Void> witness = pendingUpdate.compareAndExchange(null, future);
    if (witness != null) {
        return witness;
    }

    future.setFuture(updateExecutor.submit(this::doUpdateCache, null));

    return future;
}

void doUpdateCache() {
    pendingUpdate.set(null);

    ...
}

Metrics seem to indicate that the queued duration p99 is longer than the duration p99. Here are metrics from our internal test environment.

This should be impossible, given how this executor is used.

But it happens because TaggedMetricsExecutorService is simply dropping any samples below the threshold. This causes the value of the queued duration metrics to be artificially inflated - especially for executors that typically have short queue duration.

It's confusing for measurements to simply be dropped in this way and causes the resulting metrics to be misleading.

After this PR

TaggedMetricsExecutorService no longer excludes measurements from the queued duration metric. This metric now accurately captures the time between submission and execution for all submitted tasks.

changelog-app · 2025-05-29T16:41:54Z

Generate changelog in `changelog/@unreleased`

What do the change types mean?

feature: A new feature of the service.
improvement: An incremental improvement in the functionality or operation of the service.
fix: Remedies the incorrect behaviour of a component of the service in a backwards-compatible way.
break: Has the potential to break consumers of this service's API, inclusive of both Palantir services
and external consumers of the service's API (e.g. customer-written software or integrations).
deprecation: Advertises the intention to remove service functionality without any change to the
operation of the service itself.
manualTask: Requires the possibility of manual intervention (running a script, eyeballing configuration,
performing database surgery, ...) at the time of upgrade for it to succeed.
migration: A fully automatic upgrade migration task with no engineer input required.

Note: only one type should be chosen.

How are new versions calculated?

❗The break and manual task changelog types will result in a major release!
🐛 The fix changelog type will result in a minor release in most cases, and a patch release version for patch branches. This behaviour is configurable in autorelease.
✨ All others will result in a minor version release.

Type

Description

The executor queued duration metric no longer excludes small measurements. This ensures that the metrics accurately measure the time between submission and execution.

Check the box to generate changelog(s)

Generate changelog entry

schlosna · 2025-06-02T15:52:07Z

tritium-metrics/src/main/java/com/palantir/tritium/metrics/TaggedMetricsExecutorService.java

-    // it doesn't necessarily mean there's a queue at all. We assume anything longer than
-    // this threshold, which should be longer than pauses in most cases, is the result
-    // of queueing.
-    private static final long QUEUED_DURATION_MINIMUM_THRESHOLD_NANOS = 250_000_000L;


#1230 was what originally added this threshold. Per discussions with @carterkozak & @pkoenig10 , we explicitly do not add queue metrics for cached executors in tritium clients (support for this was added in #1012).

autorelease3 · 2025-06-02T15:52:43Z

Released 0.100.0

Remove queued duration minimum threshold

5485e7f

pkoenig10 requested a review from schlosna May 29, 2025 16:41

pkoenig10 added the autorelease label May 29, 2025

Add generated changelog entries

c4af251

pkoenig10 requested a review from carterkozak May 30, 2025 08:39

pkoenig10 added the merge when ready label Jun 2, 2025

schlosna reviewed Jun 2, 2025

View reviewed changes

schlosna approved these changes Jun 2, 2025

View reviewed changes

bulldozer-bot bot merged commit fd19631 into develop Jun 2, 2025
5 checks passed

bulldozer-bot bot deleted the pkoenig/queuedDuration branch June 2, 2025 15:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove queued duration minimum threshold #2184

Remove queued duration minimum threshold #2184

Uh oh!

pkoenig10 commented May 29, 2025 •

edited

Loading

Uh oh!

changelog-app bot commented May 29, 2025 •

edited by pkoenig10

Loading

Uh oh!

schlosna Jun 2, 2025 •

edited by pkoenig10

Loading

Uh oh!

Uh oh!

autorelease3 bot commented Jun 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Remove queued duration minimum threshold #2184

Remove queued duration minimum threshold #2184

Uh oh!

Conversation

pkoenig10 commented May 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Before this PR

After this PR

Uh oh!

changelog-app bot commented May 29, 2025 • edited by pkoenig10 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Generate changelog in changelog/@unreleased

Uh oh!

schlosna Jun 2, 2025 • edited by pkoenig10 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

autorelease3 bot commented Jun 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pkoenig10 commented May 29, 2025 •

edited

Loading

changelog-app bot commented May 29, 2025 •

edited by pkoenig10

Loading

Generate changelog in `changelog/@unreleased`

schlosna Jun 2, 2025 •

edited by pkoenig10

Loading