> For the complete documentation index, see [llms.txt](https://docs.akamas.io/akamas-docs/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.akamas.io/akamas-docs/3.5.0/reference/workflow-operators/sparksubmit-operator.md).

# SparkSubmit Operator

The **SparkSubmit** operator connects to a Spark instance and invokes a local *spark-submit* to schedule a job.

## Operator arguments <a href="#operator-arguments" id="operator-arguments"></a>

| Name              | Type                                 | Value Restrictions                                                                                                                                                                                                                 | Required | Default                                | Description                                                                                  |
| ----------------- | ------------------------------------ | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | -------- | -------------------------------------- | -------------------------------------------------------------------------------------------- |
| `file`            | String                               | It should be a path to a valid java or python spark application file                                                                                                                                                               | Yes      |                                        | Spark application to submit (jar or python file)                                             |
| `args`            | List of Strings, Numbers or Booleans |                                                                                                                                                                                                                                    | Yes      |                                        | Additional application arguments                                                             |
| `master`          | String                               | <p>It should be a valid supported Master URL:</p><ul><li>local</li><li>local\[K]</li><li>local\[K,F]</li><li>local\[]</li><li>local\[,F]</li><li>spark://HOST:PORT</li><li>spark://HOST1:PORT1, HOST2:PORT2</li><li>yarn</li></ul> | Yes      |                                        | The master URL for the Spark cluster                                                         |
| `deployMode`      | `client` `cluster`                   |                                                                                                                                                                                                                                    | No       | `cluster`                              | Whether to launch the driver locally (`client`) or in the cluster (`cluster`)                |
| `className`       | String                               |                                                                                                                                                                                                                                    | No       |                                        | The entry point of the java application. Required for java applications.                     |
| `name`            | String                               |                                                                                                                                                                                                                                    | No       |                                        | Name of the task. When submitted the id of the study, experiment and trial will be appended. |
| `jars`            | List of Strings                      | Each item of the list should be a path that matches an existing jar file                                                                                                                                                           | No       |                                        | A list of jars to be added in the classpath.                                                 |
| `pyFiles`         | List of Strings                      | Each item of the list should be a path that matches an existing python file                                                                                                                                                        | No       |                                        | A list of python scripts to be added to the PYTHONPATH                                       |
| `files`           | List of Strings                      | Each item of the list should be a path that matches an existing file                                                                                                                                                               | No       |                                        | A list of files to be added to the context of the spark-submit                               |
| `conf`            | Object (key-value pairs)             |                                                                                                                                                                                                                                    | No       |                                        | Mapping containing additional Spark configurations. See Spark documentation.                 |
| `envVars`         | Object (key-value pairs)             |                                                                                                                                                                                                                                    | No       |                                        | Env variables when running the *spark-submit* command                                        |
| `sparkSubmitExec` | String                               | It should be a path that matches an existing executable                                                                                                                                                                            | No       | The default for the Spark installation | The path of the *spark-submit* executable command                                            |
| `sparkHome`       | String                               | It should be a path that matches an existing directory                                                                                                                                                                             | No       | The default for the Spark installation | The path of the SPARK\_HOME                                                                  |
| `proxyUser`       | String                               |                                                                                                                                                                                                                                    | No       |                                        | The user to be used to execute Spark applications                                            |
| `verbose`         | Boolean                              |                                                                                                                                                                                                                                    | No       | true                                   | If additional debugging output should be displayed                                           |
| `component`       | String                               | It should match the name of an existing Component of the System under test                                                                                                                                                         | Yes      |                                        | The name of the component whose properties can be used as arguments of the operator          |


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.akamas.io/akamas-docs/3.5.0/reference/workflow-operators/sparksubmit-operator.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.