Fix Broken Lucene Query Tracking and Cancellation for OOM Protection#17884
anuragrai16 wants to merge 3 commits into apache:master
Conversation
Codecov Report

❌ Patch coverage is

Additional details and impacted files:

```diff
@@             Coverage Diff              @@
##             master   #17884      +/-   ##
============================================
+ Coverage     63.25%   63.26%    +0.01%
  Complexity     1481     1481
============================================
  Files          3190     3190
  Lines        192285   192347       +62
  Branches      29470    29470
============================================
+ Hits         121630   121689       +59
- Misses        61118    61125        +7
+ Partials       9537     9533        -4
```
Jackie-Jiang left a comment
Thanks for the fix!
```java
        "Failed while releasing the searcher manager for realtime text index for columns {}, exception {}",
        _columns, e.getMessage());
    // Propagate context to register searcher thread for CPU/memory tracking
    if (parentContext != null) {
```
`QueryThreadContext.contextAwareExecutorService()` provides a wrapper over the executor to maintain the context. You may modify `RealtimeLuceneTextIndexSearcherPool` to initialize a wrapped executor service instead.
The problem with `contextAwareExecutorService()` is that it also registers the future with `executionContext.addTask(future)` so that it can be terminated with `task.cancel(true)`. For Lucene index searchers on consuming segments (which also have an IndexWriter), there is a known Lucene issue (see this) where Lucene does not react well to thread interruption and can corrupt the underlying FSDirectory that is also used for index creation.
So, instead, we only propagate the `QueryThreadContext` explicitly to make sure tracking works, and do not register the future for cancellation. Cancellation is done manually, in a cooperative way, via `_shouldCancel`. This is only relevant for realtime-segment Lucene searches, which share the same FSDirectory object between the IndexWriter and the IndexSearcher.
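For illustration, here is a minimal, self-contained sketch of this cooperative-cancellation pattern. All names except `_shouldCancel` are hypothetical stand-ins, not Pinot's actual classes; the point is that the query killer flips a volatile flag instead of interrupting the searcher thread, so the shared FSDirectory is never hit by an interrupt.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

public class CooperativeSearchSketch {
  // Set by the query-killer thread; polled by the searcher thread.
  private volatile boolean _shouldCancel = false;

  public void cancel() {
    _shouldCancel = true; // no Thread.interrupt(): unsafe with a shared FSDirectory
  }

  // Simulated per-document collect loop; a real Lucene collector would poll
  // the flag inside LeafCollector.collect(int).
  public int search(int numDocs) {
    int collected = 0;
    for (int doc = 0; doc < numDocs; doc++) {
      if (_shouldCancel) {
        break; // terminate collection cleanly, leaving index files intact
      }
      collected++;
    }
    return collected;
  }

  public static void main(String[] args) throws Exception {
    CooperativeSearchSketch search = new CooperativeSearchSketch();
    ExecutorService pool = Executors.newSingleThreadExecutor();
    // Note: the future is NOT registered anywhere for future.cancel(true).
    Future<Integer> result = pool.submit(() -> search.search(1_000_000_000));
    Thread.sleep(50);
    search.cancel(); // cooperative cancellation
    System.out.println("terminated early: " + (result.get() < 1_000_000_000));
    pool.shutdown();
    pool.awaitTermination(5, TimeUnit.SECONDS);
  }
}
```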
Good call. In that case, I'd suggest adding a boolean flag to `QueryThreadContext.contextAwareExecutorService()` to control whether to skip registering the task for auto-termination. Wrapping the logic in the same place makes it easier to track and manage.
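As a rough sketch of the suggested shape (all names here are hypothetical stand-ins for `QueryThreadContext` and the executor wrapper, not the real Pinot API): the wrapper always propagates the caller's query context to the worker thread for CPU/memory tracking, but only registers the future for auto-termination when interruption is safe.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ContextAwareExecutorSketch {
  // Stand-in for the query thread context: whatever per-query state
  // (query id, resource accountant) must follow the task across threads.
  static final ThreadLocal<String> QUERY_CONTEXT = new ThreadLocal<>();

  private final ExecutorService _delegate = Executors.newFixedThreadPool(2);

  /**
   * @param registerForTermination pass false for realtime Lucene searchers,
   *        whose threads must not be interrupted (shared FSDirectory);
   *        those tasks cancel cooperatively instead.
   */
  public <T> Future<T> submit(Callable<T> task, boolean registerForTermination) {
    String parentContext = QUERY_CONTEXT.get(); // capture on the caller thread
    Future<T> future = _delegate.submit(() -> {
      QUERY_CONTEXT.set(parentContext); // restore on worker for CPU/mem tracking
      try {
        return task.call();
      } finally {
        QUERY_CONTEXT.remove();
      }
    });
    if (registerForTermination) {
      // In Pinot this would be executionContext.addTask(future), so the query
      // killer can call future.cancel(true); skipped for Lucene searchers.
    }
    return future;
  }

  public void shutdown() {
    _delegate.shutdown();
  }
}
```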
```java
import static org.testng.Assert.assertNotNull;
import static org.testng.Assert.assertTrue;

public class TextMatchOomKillingIntegrationTest extends BaseClusterIntegrationTest {
```
How long does this test run and how robust is it?
It runs for 2-4 minutes on my machine. OOM killing is inherently nondeterministic and depends on the Docker JVM used to run the test, so I've made the test accept either outcome for now:

```java
if (exceptionsNode.isEmpty()) {
  // Query completed successfully - verify resource tracking worked
  assertTrue(memAllocated > 0 || cpuTime > 0, ...);
} else {
  // Query was killed - verify it was OOM killed
  verifyOomKill(query, response);
}
```

(from `verifyOomKillOrResourceTracking()`)

So the test itself will not fail; it demonstrates either that tracking worked and the query was not killed, or, in most cases, that the query was killed, since the heap threshold for query killing is set very low (15%), with 0% for ALARMING tracking.
Adding a 2-4 minute test for this bug fix would be too much overhead. Do you see a way to make it lightweight? E.g. a unit test to ensure the memory usage is tracked.
```java
          _numDocsCollected++, "RealtimeLuceneDocIdCollector");
    } catch (RuntimeException e) {
      // Convert to CollectionTerminatedException for clean Lucene handling
      throw new CollectionTerminatedException();
```
Why do we need a separate exception? We have special handling for `TerminationException`, so it would be good to preserve it for a correct error message in the query response.
This exception is needed for the Lucene IndexSearcher to correctly close all open objects. See the documentation of the exception here:

> Throw this exception in LeafCollector.collect(int) to prematurely terminate collection of the current leaf.
> Note: IndexSearcher swallows this exception and never re-throws it.

Note that the Lucene collector is run via the IndexSearcher, and the original `TerminationException` from the interruption is preserved by `RealtimeLuceneTextIndex`/`MultiColumnRealtimeLuceneTextIndex`, which catch the interruption and retain the correct error message that the TEXT_MATCH was interrupted.
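For illustration, a self-contained sketch of this swallow-and-record pattern, using stand-in classes (the real code uses Lucene's `CollectionTerminatedException` and Pinot's collector and index classes): the collector records the real termination cause before throwing the termination exception, because the searcher swallows that exception; the index wrapper then surfaces the recorded cause with the right error message.

```java
public class TerminationPropagationSketch {
  // Stand-in for org.apache.lucene.search.CollectionTerminatedException.
  static class CollectionTerminatedException extends RuntimeException { }

  static class DocIdCollector {
    private RuntimeException _terminationCause; // remembered for the caller

    void collect(int docId) {
      try {
        checkTermination(docId); // e.g. checkTerminationAndSampleUsagePeriodically()
      } catch (RuntimeException e) {
        _terminationCause = e;
        // Lets the searcher close all open per-leaf objects cleanly.
        throw new CollectionTerminatedException();
      }
    }

    void checkTermination(int docId) {
      if (docId >= 2) { // pretend the OOM killer fired after 2 docs
        throw new RuntimeException("Query terminated: heap usage exceeded");
      }
    }

    RuntimeException getTerminationCause() {
      return _terminationCause;
    }
  }

  // Stand-in for the index search path: the searcher swallows the termination
  // exception, so the wrapper re-surfaces the recorded cause afterwards.
  static String search(DocIdCollector collector, int numDocs) {
    try {
      for (int doc = 0; doc < numDocs; doc++) {
        collector.collect(doc);
      }
    } catch (CollectionTerminatedException swallowed) {
      // IndexSearcher would swallow this internally and never re-throw it.
    }
    RuntimeException cause = collector.getTerminationCause();
    return cause == null ? "completed" : cause.getMessage();
  }
}
```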
I see. Could you please add some more comments explaining this?
Prospective fix for #17877
Summary
- Fix missing CPU/memory tracking for realtime Lucene and HNSW index searcher threads
- Enable the OOM killer to properly terminate TEXT_MATCH and VECTOR_SIMILARITY queries
Changes
- `RealtimeLuceneTextIndex`: Propagate `QueryThreadContext` to async searcher threads for resource tracking
- `MultiColumnRealtimeLuceneTextIndex`: Same fix for the multi-column text index variant
- `MutableVectorIndex`: Convert synchronous Lucene search to an async pattern using `RealtimeLuceneTextIndexSearcherPool`, preventing FSDirectory corruption on thread interrupt while enabling proper resource tracking
- `LuceneDocIdCollector`/`HnswDocIdCollector`: Add periodic `checkTerminationAndSampleUsagePeriodically()` calls in collectors to detect OOM termination during document collection

Tests
- `RealtimeLuceneTextIndexResourceTrackingTest`: Unit tests verifying context propagation, thread registration, and termination handling
- `TextMatchOomKillingIntegrationTest`: Integration test validating OOM killing for TEXT_MATCH queries