SOLR-18290: Support configurable fusion candidate pool in Combined Query by ercsonusharma · Pull Request #4546 · apache/solr

ercsonusharma · 2026-06-23T08:11:13Z

https://issues.apache.org/jira/browse/SOLR-18290

Description

Add a combiner.queryDepth request parameter to the combined-query / RRF flow. It controls how many candidate documents each subquery fetches from each shard for fusion, decoupled from start + rows.
Holding queryDepth constant while paging keeps the underlying candidate pool, and therefore the fused ranking, stable across pages.

Solution

The combined-query coordinator already issues a single shard request per shard carrying every combiner.query=... key. Each shard runs all subqueries locally with the request's rows value. So per-subquery depth is governed by what the outer ResponseBuilder.shards_rows carries to createMainQuery.
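As a rough illustration of the decoupling described above, the following sketch shows how a per-shard fetch depth might be chosen. The class and method names (QueryDepthSketch, shardRows) are hypothetical and not taken from the patch. The fallback to start + rows when the parameter is absent and the two rejection cases follow the description and tests in this PR. How the patch treats a depth smaller than start + rows is not stated here, so the sketch simply returns the requested depth.

```java
// Hypothetical sketch; not the code in this PR.
final class QueryDepthSketch {

  /**
   * Returns how many candidates each subquery should fetch from a shard.
   *
   * @param queryDepth value of combiner.queryDepth, or null when the param is absent
   * @param maxQueryDepth configured upper bound (assumed to be a positive integer)
   */
  static int shardRows(int start, int rows, Integer queryDepth, int maxQueryDepth) {
    if (queryDepth == null) {
      // Param absent: keep the existing behavior, where depth grows with the page offset.
      return start + rows;
    }
    if (queryDepth <= 0) {
      throw new IllegalArgumentException("combiner.queryDepth must be positive: " + queryDepth);
    }
    if (queryDepth > maxQueryDepth) {
      throw new IllegalArgumentException(
          "combiner.queryDepth=" + queryDepth + " exceeds configured maxQueryDepth=" + maxQueryDepth);
    }
    // Param present: the candidate pool no longer depends on start or rows.
    return queryDepth;
  }

  public static void main(String[] args) {
    System.out.println(shardRows(0, 10, null, 1000));  // page 1, no depth param
    System.out.println(shardRows(10, 10, null, 1000)); // page 2, no depth param
    System.out.println(shardRows(0, 10, 50, 1000));    // page 1, fixed depth
    System.out.println(shardRows(10, 10, 50, 1000));   // page 2, fixed depth
  }
}
```

With a fixed depth, the last two calls return the same value regardless of the page requested, which is the property the description relies on.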

Tests

Updated DistributedCombinedQueryComponentTest#testHybridQueryWithPagination to exercise the new param: same multi-subquery JSON request issued with and without combiner.queryDepth, asserting (a) returned doc count matches limit, (b) ordering matches RRF expectations for the configured depth.
Validation paths (combiner.queryDepth=0, combiner.queryDepth > maxQueryDepth) covered by negative-path assertions.
Existing CombinedQueryComponent and RRF tests run green; there is no behavior change when combiner.queryDepth is absent.

Checklist

Please review the following and check all that apply:

I have reviewed the guidelines for How to Contribute and my code conforms to the standards described there to the best of my ability.
I have created a Jira issue and added the issue ID to my pull request title.
I have given Solr maintainers access to contribute to my PR branch. (optional but recommended, not available for branches on forks living under an organisation)
I have developed this patch against the main branch.
I have run ./gradlew check.
I have added tests for my changes.
I have added documentation for the Reference Guide
I have added a changelog entry for my change

ercsonusharma · 2026-06-23T08:21:01Z

  public void testMultipleLexicalQuery() throws Exception {
    prepareIndexDocs();
    String jsonQuery =
-        "{\"queries\":"


Converted many of the test queries in this class to JSON strings.

ercsonusharma · 2026-06-23T11:08:11Z

Hi David, when you get a chance, could you please take a look at this PR? Thank you, @dsmiley

dsmiley

Interesting conundrum.

Can't we use the existing org.apache.solr.common.params.ShardParams#SHARDS_ROWS for this use-case?

I like to seek re-use/expansion of existing params instead of adding yet another bespoke param.

I can't tell but would a hypothetical queryResultWindowSize of say 20 mean that a page (rows) of 10 on first & second pages (start=0 & start=10) should get consistent results, even without this PR?

dsmiley · 2026-06-25T02:03:29Z

+    if (depthParam > maxQueryDepth) {
+      throw new SolrException(
+          SolrException.ErrorCode.BAD_REQUEST,
+          CombinerParams.COMBINER_QUERY_DEPTH
+              + "="
+              + depthParam
+              + " exceeds configured maxQueryDepth="
+              + maxQueryDepth);
+    }


Why bother enforcing a max query depth? Solr doesn't stop a user from requesting a bajillion rows, after all. And the value will come from a ~colleague programmer... not some random internet user.

ercsonusharma · 2026-06-25T02:55:57Z

Can't we use the existing org.apache.solr.common.params.ShardParams#SHARDS_ROWS for this use-case?

I thought about this, but since the parameter was specific to RRF and I wanted to put some limit on it, I chose a dedicated param. Since we don't need the limit, we can reuse that param with a minor fix at mergeIds, which is the actual bottleneck.

I can't tell but would a hypothetical queryResultWindowSize of say 20 mean that a page (rows) of 10 on first & second pages (start=0 & start=10) should get consistent results, even without this PR?

The inconsistency in combined query paging happens at the coordinator's RRF step, not the shard query step. For the example above, queryResultWindowSize=20 would just mean each shard's queryResultCache happens to already hold 20 docs, so the page-2 shard request is a cache hit instead of a re-execution. But the coordinator still receives 10 candidates on page 1 and 20 on page 2, and RRF over a 10-doc pool for page 1 produces different scores/ordering than RRF over a 20-doc pool.
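To make the pool-size effect concrete, here is a small self-contained sketch of reciprocal rank fusion over two ranked lists, fused at different depths. It is not Solr code: the class RrfPoolDemo, the document ids, and the tie-break by id are assumptions for illustration, and k = 60 is the constant commonly used with RRF rather than a value taken from this PR.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only; names and data are hypothetical.
final class RrfPoolDemo {
  static final int K = 60; // commonly used RRF constant (assumed here)

  /** Fuses the top `depth` entries of each ranked list using score = sum of 1 / (K + rank). */
  static List<String> fuse(List<List<String>> rankedLists, int depth) {
    Map<String, Double> scores = new LinkedHashMap<>();
    for (List<String> ranked : rankedLists) {
      int limit = Math.min(depth, ranked.size());
      for (int i = 0; i < limit; i++) {
        // Ranks are 1-based, so position i contributes 1 / (K + i + 1).
        scores.merge(ranked.get(i), 1.0 / (K + i + 1), Double::sum);
      }
    }
    List<String> fused = new ArrayList<>(scores.keySet());
    // Highest score first; ties broken by id so the order is deterministic.
    fused.sort(
        Comparator.comparingDouble((String id) -> -scores.get(id))
            .thenComparing(Comparator.naturalOrder()));
    return fused;
  }

  /** Returns the slice [start, start + rows) of the fused list. */
  static List<String> page(List<String> fused, int start, int rows) {
    int from = Math.min(start, fused.size());
    int to = Math.min(start + rows, fused.size());
    return new ArrayList<>(fused.subList(from, to));
  }

  public static void main(String[] args) {
    List<List<String>> lists =
        List.of(List.of("A", "B", "X", "D"), List.of("C", "E", "X", "F"));

    // Pool grows with the page: depth = start + rows.
    System.out.println(page(fuse(lists, 2), 0, 2)); // [A, C]
    System.out.println(page(fuse(lists, 4), 2, 2)); // [C, B]

    // Pool held constant at depth 4 for both pages.
    System.out.println(page(fuse(lists, 4), 0, 2)); // [X, A]
    System.out.println(page(fuse(lists, 4), 2, 2)); // [C, B]
  }
}
```

With the growing pool, C is returned on both pages and X, the top fused document at depth 4, is never shown. Holding the depth constant gives non-overlapping pages drawn from a single fused ranking.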

Support configurable fusion candidate pool in Combined Query

e26648a

github-actions Bot added documentation Improvements or additions to documentation client:solrj tests cat:search labels Jun 23, 2026


review comment changes

8201603

github-actions Bot removed documentation Improvements or additions to documentation client:solrj labels Jun 25, 2026

ercsonusharma wants to merge 2 commits into apache:main from ercsonusharma:feat_combined_query_depth
