Skip to content

[GLUTEN-11550][UT] Enable 15 disabled test suites for Spark 4.x#11816

Open
baibaichen wants to merge 7 commits intoapache:mainfrom
baibaichen:fix/11550-enable-all
Open

[GLUTEN-11550][UT] Enable 15 disabled test suites for Spark 4.x#11816
baibaichen wants to merge 7 commits intoapache:mainfrom
baibaichen:fix/11550-enable-all

Conversation

@baibaichen
Copy link
Copy Markdown
Contributor

@baibaichen baibaichen commented Mar 24, 2026

What changes were proposed in this pull request?

Enable 15 previously disabled test suites for Spark 4.0/4.1, organized by root cause:

Commit 1: VTS-only changes (8 suites)
Direct enable with no code changes needed:

  • GlutenTPCDSQueryWithStatsSuite / GlutenTPCDSQueryANSISuite — fix wrong trait (GlutenTestsCommonTrait -> GlutenSQLTestsTrait)
  • GlutenToPrettyStringSuite, GlutenCollationRegexpExpressionsSuite (spark41), GlutenSimpleSQLViewSuite, GlutenDataSourceScanExecRedactionSuite, GlutenDataSourceV2ScanExecRedactionSuite — ANSI fallback, all tests pass
  • Disable GlutenStreamingQueryHashPartitionVerifySuite — uses wrong trait, runs as vanilla Spark (no GlutenPlugin loaded)

Commit 2: Exception wrapper fix (1 suite)

  • GlutenCsvExpressionsSuite — modify GlutenTestsTrait.glutenCheckExpression to unwrap SparkException before fail(), matching Spark's checkEvaluationWithoutCodegen behavior

Commit 3: Test resource path fix (2 suites)

  • GlutenWholeTextFileV1Suite / GlutenWholeTextFileV2Suite — override testFile() to use getWorkspaceFilePath() instead of default jar-based path (same pattern as GlutenCSVSuite/GlutenJsonSuite/GlutenParquetIOSuite)

Commits 4-7: testGluten rewrites (4 suites)

  • GlutenSparkPlanSuite — match ColumnarToRowExecBase instead of ColumnarToRowExec
  • GlutenInsertSortForLimitAndOffsetSuite — 6 testGluten matching TakeOrderedAndProjectExecTransformer + LimitExecTransformer
  • GlutenJoinHintSuite — match both CartesianProductExec and CartesianProductExecTransformer
  • GlutenExplainSuite — fix warehouse path pattern + DPP subquery naming (dynamicpruning vs subquery)

How was this patch tested?

Each suite verified on spark41 and spark40 via run-scala-test.sh. All excluded tests have corresponding testGluten replacements.

Related issue: #11550

@github-actions github-actions bot added the CORE works for Gluten Core label Mar 24, 2026
@baibaichen baibaichen changed the title [GLUTEN-11550][UT] Enable 13 disabled test suites for Spark 4.0/4.1 [GLUTEN-11550][UT] Enable 30 disabled test suites for Spark 4.0/4.1 Mar 24, 2026
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@baibaichen baibaichen force-pushed the fix/11550-enable-all branch from 5782b90 to 451760d Compare March 26, 2026 07:30
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

1 similar comment
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@baibaichen baibaichen force-pushed the fix/11550-enable-all branch from 431721f to bb137ad Compare April 10, 2026 04:50
@baibaichen baibaichen changed the title [GLUTEN-11550][UT] Enable 30 disabled test suites for Spark 4.0/4.1 [GLUTEN-11550][UT] Enable 18 disabled test suites for Spark 4.x Apr 10, 2026
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@baibaichen baibaichen force-pushed the fix/11550-enable-all branch 2 times, most recently from 8ec0857 to 0c2ca3d Compare April 10, 2026 05:10
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

1 similar comment
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@baibaichen baibaichen force-pushed the fix/11550-enable-all branch from 0c2ca3d to d9b6122 Compare April 10, 2026 05:22
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

baibaichen and others added 7 commits April 10, 2026 05:43
Direct enable (no code changes needed, ANSI fallback or trait fix):
- GlutenTPCDSQueryWithStatsSuite: fix trait GlutenTestsCommonTrait -> GlutenSQLTestsTrait
- GlutenTPCDSQueryANSISuite: fix trait
- GlutenToPrettyStringSuite: ANSI fallback
- GlutenCollationRegexpExpressionsSuite: ANSI fallback (spark41-only)
- GlutenSimpleSQLViewSuite: ANSI fallback
- GlutenDataSourceScanExecRedactionSuite: ANSI fallback
- GlutenDataSourceV2ScanExecRedactionSuite: ANSI fallback
- Disable GlutenStreamingQueryHashPartitionVerifySuite (wrong trait, runs as vanilla Spark)

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Wrap exception in glutenCheckExpression with fail() to match Spark's
checkEvaluationWithoutCodegen behavior. No testGluten override needed.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
…extFileV2Suite

Override testFile() to use getWorkspaceFilePath() instead of default
jar-based path. The default testFile() returns jar:file: URI which
Hadoop Path does not support. Same pattern used by GlutenCSVSuite,
GlutenJsonSuite, GlutenParquetIOSuite.
V1 3/3 passed, V2 3/3 passed.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Override test to find ColumnarToRowExecBase instead of ColumnarToRowExec.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
6 tests rewritten with testGluten (match TakeOrderedAndProjectExecTransformer,
LimitExecTransformer, ColumnarCollectLimitBaseExec). Original tests excluded.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
testGluten rewrite for 'join strategy hint - shuffle-replicate-nl':
Gluten replaces CartesianProductExec with CartesianProductExecTransformer,
so assertShuffleReplicateNLJoin needs to match both types.
Original test excluded, testGluten 18/18 passed.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
testGluten rewrite for 'explain formatted - check presence of subquery
in case of DPP': Gluten uses different warehouse path (no suite class
name) and DPP subquery is named 'dynamicpruning' instead of 'subquery'.
Original test excluded, testGluten 24/24 passed.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@baibaichen baibaichen force-pushed the fix/11550-enable-all branch from d9b6122 to 666ce08 Compare April 10, 2026 05:52
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@baibaichen baibaichen changed the title [GLUTEN-11550][UT] Enable 18 disabled test suites for Spark 4.x [GLUTEN-11550][UT] Enable 15 disabled test suites for Spark 4.x Apr 10, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CORE works for Gluten Core

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant