Skip to content

Upgrade to Spark 3.5 / Scala 2.13, add dependabot and CI#4

Merged
oluies merged 3 commits into
masterfrom
task/upgrade-components-and-add-dependabot-a7e221
May 9, 2026
Merged

Upgrade to Spark 3.5 / Scala 2.13, add dependabot and CI#4
oluies merged 3 commits into
masterfrom
task/upgrade-components-and-add-dependabot-a7e221

Conversation

@oluies
Copy link
Copy Markdown
Owner

@oluies oluies commented May 9, 2026

Summary

  • Modernize build from Spark 1.6 / Scala 2.10 / sbt 0.13 → Spark 3.5.1 / Scala 2.13.14 / sbt 1.10.5; scopt 4.x; sbt-assembly with merge strategy. Drop sbt-spark-package, sbt-idea, spark-csv (now built-in).
  • Migrate sources to Spark 3.5 APIs: SparkSession (with Hive support where used), built-in CSV reader/writer, .rdd.map for Row patterns, df.as("a") strings, MulticlassMetrics.accuracy (no-arg precision/recall/fMeasure removed in Spark 3).
  • Add .github/dependabot.yml (sbt + github-actions, weekly).
  • Add .github/workflows/ci.yml (JDK 17, sbt cache, compile + test on push/PR to master).

Test plan

  • sbt compile — clean, 0 warnings, 0 errors locally
  • CI workflow green on the PR
  • Spot-check one job end-to-end on a Spark 3.5 cluster (spark-submit --class com.combient.sparkjob.tedsds.PrepareTrainData ...)

oluies added 3 commits May 9, 2026 09:54
Modernize build from Spark 1.6 / Scala 2.10 / sbt 0.13:
- Scala 2.13.14, Spark 3.5.1, scopt 4.1.0, sbt 1.10.5
- Drop sbt-spark-package, sbt-idea, spark-csv (built-in now)
- Add sbt-assembly with merge strategy

Source migrations for Spark 3.5 API:
- SparkConf+SparkContext+SQLContext/HiveContext -> SparkSession
- com.databricks.spark.csv -> built-in .csv(...)
- DataFrame.map(Row) -> .rdd.map for RDD output
- Symbol literals df.as('a) -> df.as("a")
- MulticlassMetrics.precision (removed) -> accuracy

Add .github/dependabot.yml for sbt and github-actions.
Compile + test on push/PR to master with JDK 17 and sbt cache.
ubuntu-latest no longer ships sbt preinstalled.
@oluies oluies merged commit a5dabc4 into master May 9, 2026
2 checks passed
@oluies oluies deleted the task/upgrade-components-and-add-dependabot-a7e221 branch May 9, 2026 08:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant