feat: enable native_datafusion scan in auto mode by andygrove · Pull Request #3781 · apache/datafusion-comet

andygrove · 2026-03-24T13:38:45Z

Which issue does this PR close?

Part of #3321

Rationale for this change

Improve scan performance by preferring the native_datafusion scan in auto mode.

What changes are included in this PR?

Update "auto" mode to try native_datafusion first, falling back to native_iceberg_compat or Spark when the scan is not supported.
Update some test assumptions.
Review ignored tests across all Spark diffs and make sure they are up to date.
Ensure documentation is up to date.
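
The fallback order in the first item can be illustrated with a minimal sketch. This is not Comet's implementation: `resolve_auto_scan`, the `SUPPORTED` table, and the feature names are hypothetical, and only the three scan names and their order come from the description above.

```python
from typing import Dict, List, Set

# Preference order described in the PR: native_datafusion first,
# then native_iceberg_compat, then plain Spark.
AUTO_PREFERENCE: List[str] = ["native_datafusion", "native_iceberg_compat"]
SPARK_FALLBACK = "spark"

# Hypothetical capability table; the feature names are placeholders,
# not Comet's real support matrix.
SUPPORTED: Dict[str, Set[str]] = {
    "native_datafusion": {"plain_columns"},
    "native_iceberg_compat": {"plain_columns", "hypothetical_feature"},
}


def resolve_auto_scan(required: Set[str],
                      supported: Dict[str, Set[str]] = SUPPORTED) -> str:
    """Return the first preferred implementation covering every required feature."""
    for impl in AUTO_PREFERENCE:
        if required <= supported.get(impl, set()):
            return impl
    # No native scan can handle the request, so leave it to Spark.
    return SPARK_FALLBACK
```

A scan that needs only `plain_columns` resolves to `native_datafusion`, one that also needs `hypothetical_feature` resolves to `native_iceberg_compat`, and anything else falls through to `spark`.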

How are these changes tested?

Existing tests.

andygrove · 2026-03-24T15:58:12Z

Some "row index generation - vectorized reader" tests are currently failing.

The auto scan mode now uses the native_datafusion scan, which does not support row index generation, so these tests should be skipped for auto mode just as they are for native_datafusion mode.
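
The skip rule described here could be sketched as follows. The helper name and the set constant are hypothetical and do not reflect how the Spark test diffs actually express the skip. The assumption that native_iceberg_compat still runs these tests is inferred from the fact that they ran in auto mode before this change.

```python
# Scan modes for which row index tests are skipped, per the comment above.
# "auto" is included because it now tries native_datafusion first.
MODES_WITHOUT_ROW_INDEX = frozenset({"native_datafusion", "auto"})


def should_skip_row_index_test(scan_impl: str) -> bool:
    """Return True when the configured scan mode cannot generate row indexes."""
    return scan_impl in MODES_WITHOUT_ROW_INDEX
```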


andygrove added 6 commits March 24, 2026 06:32

enable native_datafusion scan in auto mode

9ea95ab

scalastyle

5d6c4ff

add CometDateTimeUtilsSuite to CI workflow

44f9673

update schema evolution test

ea28ffd

add link to issue

c7a402f

Merge branch 'add-suite' into auto-native-df

03c691f

andygrove added 2 commits March 24, 2026 09:03

skip row index tests for auto scan mode

d38ffc4

auto scan mode now uses native_datafusion scan, which does not support row index generation, so these tests should be skipped for auto mode just as they are for native_datafusion mode.

Merge remote-tracking branch 'apache/main' into auto-native-df

d094726

andygrove changed the title ~~feat: enable native_datafusion scan in auto mode [WIP]~~ feat: enable native_datafusion scan in auto mode Mar 24, 2026

andygrove added 6 commits March 24, 2026 13:38

chore: Enable spark SQL tests for issues now resolved in native_dataf…

d7fd22e

…usion mode Remove IgnoreCometNativeDataFusion annotations from 3.5.8 Spark SQL test diff for issues that have been closed: apache#3311, apache#3313, apache#3314, apache#3315, apache#3320, apache#3401.

Revert "chore: Enable spark SQL tests for issues now resolved in nati…

1b35dbc

…ve_datafusion mode" This reverts commit d7fd22e.

docs: update parquet_scans.md to reflect auto mode now prefers native…

dc2506f

…_datafusion The auto scan mode now tries native_datafusion first and falls back to native_iceberg_compat if the scan cannot be converted, rather than always using native_iceberg_compat.

chore: re-enable Spark SQL tests for fixed issues in 3.5.8 diff

96622cf

Remove IgnoreCometNativeDataFusion tags for issues that have been resolved and closed: apache#3312, apache#3313, apache#3314, apache#3315.

Revert "chore: re-enable Spark SQL tests for fixed issues in 3.5.8 diff"

7e4ee86

This reverts commit 96622cf.

Merge remote-tracking branch 'apache/main' into auto-native-df

da052f8

andygrove wants to merge 14 commits into apache:main from andygrove:auto-native-df
