Core: Close FileIO on cache eviction to prevent thread leaks#15910
utafrali wants to merge 3 commits into apache:main
Conversation
When tables get evicted from the cache, the FileIO wasn't being closed. This leaves S3FileIO with open thread pools that never get cleaned up, which is a problem in long-running applications. Added a try-catch around the close call since FileIO implementations can throw exceptions, and added a test to verify it gets called. Fixes apache#15898
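The close-with-try-catch pattern described above can be sketched in isolation. This is a minimal, hedged illustration: `QuietCloser` and `lastWarning` are hypothetical names for this sketch, not Iceberg code; the actual change wraps `table.io().close()` inside `CachingCatalog`'s cache removal listener.

```java
import java.io.Closeable;
import java.io.IOException;

public class QuietCloser {
  // Stand-in for LOG.warn so the sketch stays self-contained.
  static String lastWarning;

  // Close a FileIO-like resource without letting exceptions escape the
  // cache removal listener; FileIO implementations are allowed to throw.
  static void closeQuietly(Closeable io, String tableName) {
    try {
      if (io != null) {
        io.close();
      }
    } catch (Exception e) {
      lastWarning = "Failed to close FileIO for evicted table " + tableName;
    }
  }

  public static void main(String[] args) {
    // A resource whose close() throws, like a misbehaving FileIO.
    Closeable failing = () -> {
      throw new IOException("boom");
    };
    closeQuietly(failing, "db.tbl");
    System.out.println(lastWarning);
    closeQuietly(null, "db.other"); // null-safe: evicted entry may be null
    System.out.println("no exception escaped");
  }
}
```

The key property is that a throwing `close()` is logged and swallowed, so one bad table cannot break eviction handling for the rest of the cache.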
Pull request overview
This PR addresses resource/thread leaks in CachingCatalog by ensuring a table’s FileIO is closed when the table is evicted from the cache, preventing long-lived background threads (notably from S3FileIO) from accumulating in long-running applications.
Changes:
- Close `table.io()` when a cached table entry is removed due to expiration.
- Add a unit test verifying `FileIO.close()` is invoked on cache eviction.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| core/src/main/java/org/apache/iceberg/CachingCatalog.java | Adds FileIO.close() in the cache removal listener for expired base tables. |
| core/src/test/java/org/apache/iceberg/hadoop/TestCachingCatalog.java | Adds a Mockito-based test asserting FileIO.close() is called after TTL eviction. |
if (RemovalCause.EXPIRED.equals(cause)) {
  if (!MetadataTableUtils.hasMetadataTableName(tableIdentifier)) {
    tableCache.invalidateAll(metadataTableIdentifiers(tableIdentifier));
    if (table != null) {
      try {
        table.io().close();
      } catch (Exception e) {
        LOG.warn("Failed to close FileIO for evicted table {}", tableIdentifier, e);
      }
The close logic only runs when the removal cause is EXPIRED. Caffeine can also remove entries due to COLLECTED (softValues GC) and potentially SIZE (if a max-size/weight policy is added); in those cases the FileIO would still not be closed and the thread leak described in #15898 can persist. Consider running the same invalidation/close path for all eviction causes (e.g., cause.wasEvicted()), while still skipping metadata table identifiers.
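The reviewer's suggestion can be sketched with a plain-Java stand-in. This is an illustration only: the enum below mimics Caffeine's `RemovalCause` (whose real `wasEvicted()` returns true for `COLLECTED`, `EXPIRED`, and `SIZE`); production code would use `com.github.benmanes.caffeine.cache.RemovalCause` directly, and `shouldClose` is a hypothetical helper name.

```java
// Minimal stand-in for Caffeine's RemovalCause, used so this sketch runs
// without the Caffeine dependency. Evicted causes: COLLECTED, EXPIRED, SIZE.
enum RemovalCause {
  EXPLICIT(false),
  REPLACED(false),
  COLLECTED(true),
  EXPIRED(true),
  SIZE(true);

  private final boolean evicted;

  RemovalCause(boolean evicted) {
    this.evicted = evicted;
  }

  boolean wasEvicted() {
    return evicted;
  }
}

public class EvictionCloseSketch {
  // Decide whether the removal listener should run the invalidate/close
  // path: any eviction cause qualifies, not just EXPIRED, so COLLECTED
  // (softValues GC) and SIZE removals also release the FileIO.
  static boolean shouldClose(RemovalCause cause) {
    return cause.wasEvicted();
  }

  public static void main(String[] args) {
    System.out.println("EXPIRED   -> close? " + shouldClose(RemovalCause.EXPIRED));
    System.out.println("COLLECTED -> close? " + shouldClose(RemovalCause.COLLECTED));
    System.out.println("EXPLICIT  -> close? " + shouldClose(RemovalCause.EXPLICIT));
  }
}
```

Keying on `wasEvicted()` rather than a single cause means the listener stays correct even if a max-size or weight policy is added to the cache later.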
Done, pushed the fix.
tableCache.invalidateAll(metadataTableIdentifiers(tableIdentifier));
if (table != null) {
  try {
    table.io().close();
This may not be safe for all catalogs. For example, HadoopCatalog creates a common FileIO at the catalog level rather than per table, and that same FileIO reference is shared with the table object, so calling close() here would break other tables in the catalog that are still actively used.
Can this be just handled as part of the catalog's close() instead?
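The ownership model the reviewer describes can be sketched as follows. These are illustrative names (`SharedIO`, `DemoCatalog`), not Iceberg classes: the point is that when the catalog owns one shared FileIO, the single safe shutdown hook is the catalog's own `close()`, not per-table eviction.

```java
import java.io.Closeable;
import java.util.concurrent.atomic.AtomicBoolean;

// Stand-in for a catalog-level FileIO shared by every table the catalog
// loads (as with HadoopCatalog). Closing it tears down shared resources,
// e.g. the thread pools S3FileIO would own.
class SharedIO implements Closeable {
  final AtomicBoolean closed = new AtomicBoolean(false);

  @Override
  public void close() {
    closed.set(true);
  }
}

class DemoCatalog implements Closeable {
  private final SharedIO io = new SharedIO(); // one instance for the whole catalog

  SharedIO io() {
    return io; // every table handed out references this same instance
  }

  @Override
  public void close() {
    io.close(); // the only place the shared FileIO is shut down
  }
}

public class CatalogCloseSketch {
  public static void main(String[] args) {
    DemoCatalog catalog = new DemoCatalog();
    SharedIO tableIo = catalog.io();
    // Evicting one table from a cache must NOT close the shared IO,
    // since other live tables still hold the same reference.
    System.out.println("open after eviction: " + !tableIo.closed.get());
    catalog.close();
    System.out.println("closed after catalog.close(): " + tableIo.closed.get());
  }
}
```

Under this model, a per-table `close()` in the removal listener would shut down resources still in use by every other cached table.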
Done, pushed the fix.