Spark History Server support for Azure Data Lake Storage Gen2 (ADLS) #464

Closed
4 tasks done
siegfriedweber opened this issue Sep 17, 2024 · 0 comments · Fixed by #479

siegfriedweber commented Sep 17, 2024

Spark application logs should be storable in Azure Data Lake Storage Gen2 (ADLS).

The hadoop-azure module must be added to the Spark image, and the spec.logFileDirectory structure could be extended with an adls variant; currently, only s3 is supported. Alternatively, it should be possible to specify a custom log directory.

The SAS token should be read from a Secret.
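
To illustrate what such a setup would have to produce, here is a minimal sketch of the Spark properties involved. It is not the operator's implementation. The helper name `abfs_event_log_conf`, the environment variable `ADLS_SAS_TOKEN` (standing in for a value mounted from a Secret), and the container, account and path values are all hypothetical. The `fs.azure.sas.fixed.token` key is assumed to be available in the hadoop-azure version shipped in the image; older versions may require a SAS token provider class instead.

```python
import os


def abfs_event_log_conf(container: str, account: str, path: str, sas_token: str) -> dict[str, str]:
    """Build Spark properties for storing event logs in ADLS Gen2 (hypothetical helper)."""
    # ABFS URIs have the form abfs[s]://<container>@<account>.dfs.core.windows.net/<path>
    host = f"{account}.dfs.core.windows.net"
    log_dir = f"abfss://{container}@{host}/{path.strip('/')}/"
    return {
        # Written by the Spark applications
        "spark.eventLog.enabled": "true",
        "spark.eventLog.dir": log_dir,
        # Read by the Spark History Server
        "spark.history.fs.logDirectory": log_dir,
        # Hadoop options are passed through Spark with the "spark.hadoop." prefix.
        f"spark.hadoop.fs.azure.account.auth.type.{host}": "SAS",
        # Assumption: fixed SAS token support exists in the bundled hadoop-azure.
        f"spark.hadoop.fs.azure.sas.fixed.token.{host}": sas_token,
    }


if __name__ == "__main__":
    # The token would come from a Secret, exposed here as an environment variable.
    token = os.environ.get("ADLS_SAS_TOKEN", "")
    conf = abfs_event_log_conf("logs", "myaccount", "eventlogs", token)
    for key in conf:
        # Print only the keys so that the token is not leaked into logs.
        print(key)
```

The same log directory is used for `spark.eventLog.dir` and `spark.history.fs.logDirectory` so that the history server reads exactly where the applications write.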

Documentation

https://docs.stackable.tech/home/nightly/spark-k8s/usage-guide/history-server#_custom_log_directory

Release Notes

New / extended platform features

Other product features

  • Apache Spark: A custom log directory can be specified for the event logs, so that a location other than an S3 bucket can be used, e.g. HDFS or ABFS.