feat: enable native_datafusion scan in auto mode #3781

Draft
andygrove wants to merge 14 commits into apache:main from
andygrove:auto-native-df

@andygrove
Member

@andygrove andygrove commented Mar 24, 2026

Which issue does this PR close?

Part of #3321

Rationale for this change

Improve performance by having auto mode use the native_datafusion scan where it is supported.

What changes are included in this PR?

  • Update "auto" mode to try native_datafusion first, falling back to native_iceberg_compat or Spark when the scan is not supported.
  • Update some test assumptions.
  • Review ignored tests across all Spark diffs and make sure they are up to date.
  • Ensure documentation is up to date.
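
The fallback order in the first bullet can be sketched as below. This is only an illustration of the ordering described above, not Comet's actual implementation: the `ScanImpl` enum, the `ScanSupport` struct, and `choose_auto_scan` are hypothetical names, and the two booleans stand in for whatever checks Comet really performs to decide whether a scan can be converted.

```rust
/// Scan implementations named in this PR description.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ScanImpl {
    NativeDataFusion,
    NativeIcebergCompat,
    Spark,
}

/// Hypothetical summary of which native scans can handle a given plan.
/// The real support checks are not shown in this PR description.
#[derive(Debug, Clone, Copy)]
struct ScanSupport {
    native_datafusion: bool,
    native_iceberg_compat: bool,
}

/// Auto mode as described above: prefer native_datafusion, then
/// native_iceberg_compat, and finally leave the scan to Spark.
fn choose_auto_scan(support: ScanSupport) -> ScanImpl {
    if support.native_datafusion {
        ScanImpl::NativeDataFusion
    } else if support.native_iceberg_compat {
        ScanImpl::NativeIcebergCompat
    } else {
        ScanImpl::Spark
    }
}
```

Under these assumptions, a plan that only native_iceberg_compat can handle yields `ScanImpl::NativeIcebergCompat`, and a plan that neither native scan supports yields `ScanImpl::Spark`.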

How are these changes tested?

Existing tests.

@andygrove
Member Author

Some "row index generation - vectorized reader" tests are currently failing.

The auto scan mode now uses the native_datafusion scan, which does not support row index generation, so these tests should be skipped for auto mode just as they are for native_datafusion mode.
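
The skip rule proposed in this comment amounts to a small predicate over the configured scan mode. The sketch below is hypothetical: `ScanMode` and `skip_row_index_tests` are illustrative names, not part of Comet or the Spark test diffs, and treating native_iceberg_compat as still running these tests is an assumption.

```rust
/// Scan modes mentioned in this conversation (hypothetical enum).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ScanMode {
    Auto,
    NativeDataFusion,
    NativeIcebergCompat,
}

/// Returns true when the row index generation tests should be skipped.
/// native_datafusion does not support row index generation, and auto mode
/// may now pick native_datafusion, so both modes skip.
/// Assumption: native_iceberg_compat keeps running these tests.
fn skip_row_index_tests(mode: ScanMode) -> bool {
    matches!(mode, ScanMode::Auto | ScanMode::NativeDataFusion)
}
```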

@andygrove changed the title from "feat: enable native_datafusion scan in auto mode" to "[WIP] feat: enable native_datafusion scan in auto mode" on Mar 24, 2026

…usion mode
Remove IgnoreCometNativeDataFusion annotations from 3.5.8 Spark SQL test diff for issues that have been closed: apache#3311, apache#3313, apache#3314, apache#3315, apache#3320, apache#3401.

…_datafusion
The auto scan mode now tries native_datafusion first and falls back to native_iceberg_compat if the scan cannot be converted, rather than always using native_iceberg_compat.

Remove IgnoreCometNativeDataFusion tags for issues that have been resolved and closed: apache#3312, apache#3313, apache#3314, apache#3315.