Serverless for Apache Spark
About
The Serverless for Apache Spark source allows Toolbox to interact with Spark batches hosted on Google Cloud Serverless for Apache Spark.
Available Tools
serverless-spark-list-batchesList and filter Serverless Spark batches.serverless-spark-get-batchGet a Serverless Spark batch.serverless-spark-cancel-batchCancel a running Serverless Spark batch operation.
Requirements
IAM Permissions
Serverless for Apache Spark uses Identity and Access Management (IAM) to control user and group access to serverless Spark resources like batches and sessions.
Toolbox will use your Application Default Credentials (ADC) to authorize and authenticate when interacting with Google Cloud Serverless for Apache Spark. When using this method, you need to ensure the IAM identity associated with your ADC has the correct permissions for the actions you intend to perform. Common roles include roles/dataproc.serverlessEditor (which includes permissions to run batches) or roles/dataproc.serverlessViewer. Follow this guide to set up your ADC.
Example
sources: my-serverless-spark-source: kind: serverless-spark project: my-project-id location: us-central1 Reference
| field | type | required | description |
|---|---|---|---|
| kind | string | true | Must be “serverless-spark”. |
| project | string | true | ID of the GCP project with Serverless for Apache Spark resources. |
| location | string | true | Location containing Serverless for Apache Spark resources. |