Skip to content

Investigate setting up a history server #84

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
razvan opened this issue Jun 15, 2022 · 1 comment · Fixed by stackabletech/documentation#253
Closed

Investigate setting up a history server #84

razvan opened this issue Jun 15, 2022 · 1 comment · Fixed by stackabletech/documentation#253
Assignees

Comments

@razvan
Copy link
Member

razvan commented Jun 15, 2022

Description

Spark monitoring with the history server: https://spark.apache.org/docs/latest/monitoring.html

An example using the Google operator and a shared NFS volume : https://stackoverflow.com/a/58593909

Multiple buckets: https://hadoop.apache.org/docs/stable/hadoop-aws/tools/hadoop-aws/index.html#Configuring_different_S3_buckets_with_Per-Bucket_Configuration

Refinement questions

CRD and instances

  • Create a CRD specifically for the history server ?

It should be possible to create multiple instances of history servers. The history server CR can reference an S3Bucket.
Spark applications can reference one of these.
The operator resolves the necessary spark configuration for the job and adds it to the spark-submit command.

  1. The job references a history server -> the operator reads the endpoint, bucket, path from there but uses the same credentials as the data access.
  2. There are separate s3 connection definitions for reading and writing. The job uses the write connection while the history server uses the read connection.

History event storage

  • Support for ReadWriteMany volumes ? Keep as issue for the future.
  • Integration with Hadoop ? Keep as issue for the future. Depends on a future HDFSConnection object.

UI usage and security

  • Kerberos support ? Keep as issue for the future.
  • Will internal links in the web UI use the published k8s service name ? Eventually it's only a problem with the ui of the running applications, not the UI of the history server.

Acceptance criteria:

  • we have a plan! An ADR.
@lfrancke lfrancke moved this to Development: In Progress in Stackable Engineering Aug 23, 2022
@adwk67 adwk67 moved this from Development: In Progress to Development: In Review in Stackable Engineering Aug 24, 2022
@sbernauer sbernauer moved this from Development: In Review to Development: Done in Stackable Engineering Sep 2, 2022
@lfrancke lfrancke moved this from Development: Done to Acceptance: Waiting for in Stackable Engineering Sep 2, 2022
@lfrancke lfrancke moved this from Acceptance: Waiting for to Acceptance: In Progress in Stackable Engineering Sep 5, 2022
@lfrancke
Copy link
Member

lfrancke commented Sep 6, 2022

I'm looking at this

@lfrancke lfrancke moved this from Acceptance: In Progress to Done in Stackable Engineering Sep 8, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging a pull request may close this issue.

2 participants