The Spark integration uses OpenLineage to collect and send metadata to Grai. Complete documentation for integrating OpenLineage with Spark is available in the OpenLineage docs.
Web App
 
Fields
| Field | Value | Example | 
|---|---|---|
| source | The name of the source, see sources | Spark | 
| Name | Name for connection | production | 
| Namespace | Namespace for the connection, see namespaces | default | 
| namespaces | Optional | |
Webhook
In order to authenticate your webhook, you will need to create an API key in the Grai web app.
Grai will open an endpoint at /api/v1/openlineage/<connection_id>/ which can be used to receive OpenLineage events.
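To make the endpoint concrete, the sketch below builds the full webhook URL and a minimal OpenLineage run event of the kind a client would POST to it. The base URL `https://api.grai.io` is the one listed in the SparkConf table on this page. The helpers `webhook_url` and `minimal_run_event`, the job name, and the producer URI are hypothetical names chosen for illustration. The event carries only a few core fields and omits facets and `schemaURL`, so treat it as an outline of the payload shape rather than a complete event; real events emitted by the Spark integration carry much more.

```python
import json
import uuid
from datetime import datetime, timezone

# Base URL as listed in the SparkConf table; adjust for a self-hosted instance.
GRAI_BASE_URL = "https://api.grai.io"


def webhook_url(connection_id: str, base_url: str = GRAI_BASE_URL) -> str:
    """Join the base URL and the per-connection OpenLineage path."""
    return f"{base_url.rstrip('/')}/api/v1/openlineage/{connection_id}/"


def minimal_run_event(job_name: str, namespace: str = "default") -> dict:
    """Build a bare-bones OpenLineage run event (illustrative fields only)."""
    return {
        "eventType": "START",
        "eventTime": datetime.now(timezone.utc).isoformat(),
        "run": {"runId": str(uuid.uuid4())},
        "job": {"namespace": namespace, "name": job_name},
        # Hypothetical producer URI; a real client sets its own.
        "producer": "https://example.com/hypothetical-producer",
    }


url = webhook_url("<connection_id>")
payload = json.dumps(minimal_run_event("example_spark_job"))
print(url)
```

In practice the OpenLineage Spark listener builds and sends these events itself; the sketch only shows where they end up and roughly what they contain.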
See the OpenLineage docs for how to configure your client to send OpenLineage events. As part of that configuration, you'll need to provide the following settings via SparkConf parameters.
| Field | Value | 
|---|---|
| spark.openlineage.transport.type | http | 
| spark.openlineage.transport.endpoint | api/v1/openlineage/<connection_id> | 
| spark.openlineage.transport.url | https://api.grai.io | 
| spark.openlineage.transport.auth.type | api_key | 
| spark.openlineage.transport.auth.apiKey | <your_grai_api_key> |
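As a minimal sketch, the settings above can be collected in one place and rendered as `spark-submit` flags. This assumes OpenLineage's HTTP transport conventions, where `spark.openlineage.transport.auth.type` is set to `api_key` and the key itself goes in `spark.openlineage.transport.auth.apiKey`. The `spark.extraListeners` entry comes from the OpenLineage Spark documentation, not from this page. The helper names `openlineage_spark_conf` and `as_spark_submit_args` are hypothetical.

```python
def openlineage_spark_conf(connection_id: str, api_key: str,
                           base_url: str = "https://api.grai.io") -> dict:
    """Return the SparkConf entries that point OpenLineage at a Grai connection."""
    return {
        # Listener class from the OpenLineage Spark docs (not listed on this page).
        "spark.extraListeners": "io.openlineage.spark.agent.OpenLineageSparkListener",
        "spark.openlineage.transport.type": "http",
        "spark.openlineage.transport.url": base_url,
        "spark.openlineage.transport.endpoint": f"api/v1/openlineage/{connection_id}",
        # Assumption: OpenLineage's api_key auth type, with the key in auth.apiKey.
        "spark.openlineage.transport.auth.type": "api_key",
        "spark.openlineage.transport.auth.apiKey": api_key,
    }


def as_spark_submit_args(conf: dict) -> list:
    """Render the settings as repeated `--conf key=value` arguments."""
    args = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


conf = openlineage_spark_conf("<connection_id>", "<your_grai_api_key>")
print(" ".join(as_spark_submit_args(conf)))
```

With PySpark installed, the same dictionary can be applied by calling `config(key, value)` on `SparkSession.builder` for each entry. The OpenLineage Spark jar also needs to be on the classpath for the listener to load.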