
Conversation

@dongjoon-hyun (Member) commented Sep 18, 2024:

What changes were proposed in this pull request?

This PR aims to use a JDK for the Spark 3.5+ Docker images. The Apache Spark Dockerfiles have already been updated accordingly.

Why are the changes needed?

Since Apache Spark 3.5.0, SPARK-44153 has used `jmap`, as shown below.

https://github.com/apache/spark/blob/c832e2ac1d04668c77493577662c639785808657/core/src/main/scala/org/apache/spark/util/Utils.scala#L2030
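`jmap` ships only with a JDK (under `$JAVA_HOME/bin`) and is absent from JRE-only runtimes, which is why the image needs a full JDK. As a hedged illustration (not the exact Spark source; `<pid>` is a placeholder), the code path reduces to running something like:

```sh
# Illustration only: print the top of a live-object heap histogram for a
# running JVM. The :live option counts only reachable objects (it triggers
# a full GC first). <pid> is a placeholder for the target JVM's process id.
jmap -histo:live <pid> | head -n 20
```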

Does this PR introduce any user-facing change?

Yes, users can now use the Heap Histogram feature (surfaced on the Executors tab of the Spark UI).

How was this patch tested?

Pass the CIs.

@dongjoon-hyun (Member, Author) commented:

Thank you, @viirya.

@dongjoon-hyun (Member, Author) commented:

Merged to master.

@dongjoon-hyun deleted the SPARK-49701 branch on September 18, 2024 at 23:08.
@dongjoon-hyun (Member, Author) commented:

For the record, the fixed images have been released.

```
$ docker run -it --rm apache/spark:4.0.0-preview1 jmap | head -n3
Usage:
    jmap -clstats <pid>
        to connect to running process and print class loader statistics
```
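The usage banner above confirms `jmap` is on the image's `PATH`. A stricter spot check, assuming `JAVA_HOME` is set inside the image as in the Temurin-based builds, is to look for the binary directly:

```sh
# Assumption: the image sets JAVA_HOME. A JDK-based image ships jmap under
# $JAVA_HOME/bin; a JRE-only image does not.
docker run --rm apache/spark:4.0.0-preview1 sh -c 'ls "$JAVA_HOME/bin" | grep jmap'
```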
dongjoon-hyun added a commit to apache/spark-kubernetes-operator that referenced this pull request Sep 19, 2024
### What changes were proposed in this pull request?

This PR aims to propose to use `apache/spark` images instead of `spark` because `apache/spark` images are published first. For example, the following are only available in `apache/spark` as of now.

- apache/spark-docker#66
- apache/spark-docker#67
- apache/spark-docker#68

### Why are the changes needed?

To apply the latest bits earlier.

### Does this PR introduce _any_ user-facing change?

There is no change from `Apache Spark K8s Operator`. Only the underlying images are changed.

### How was this patch tested?

Pass the CIs.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #128 from dongjoon-hyun/SPARK-49706.

Authored-by: Dongjoon Hyun <dongjoon@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
@anish97IND commented:

Hi Team, I think a similar change is required for Java 11 as well. At the moment we are using `spark:3.5.2-scala2.12-java11-python3-ubuntu`. We wanted to take a heap dump but ran into an issue; it shows something like the below:
[screenshot of the error output]

We went ahead and checked the Java installation within the Docker image for `jmap` by doing a `kubectl exec -ti <executor-pod> sh`; however, even within the installed Java we did not find the heap dump tooling. Will the change in Java version work for 3.5.2?
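For anyone repeating that check, a cleaned-up sketch (with `<executor-pod>` as a placeholder, and assuming a POSIX shell is available in the image) would be:

```sh
# Hedged sketch: report whether the executor image's Java installation ships jmap.
kubectl exec -ti <executor-pod> -- sh -c \
  'command -v jmap || echo "jmap not found (likely a JRE-only image)"'
```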

@dongjoon-hyun (Member, Author) commented:

To @anish97IND, please use the latest Spark 3.5.3 images instead of 3.5.2.

FYI, there are correctness fixes in Apache Spark 3.5.3.
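A quick pre-deployment check, assuming the 3.5.3 tag follows the same naming scheme as the 3.5.2 tag quoted above (verify the exact tag on Docker Hub):

```sh
# Hedged check: on a JDK-based 3.5.3 image, jmap prints its usage banner
# instead of a "command not found" error.
docker run -it --rm spark:3.5.3-scala2.12-java11-python3-ubuntu jmap | head -n3
```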
