Support OpenLineage in spark-3.x-bigquery connectors #1212
codelixir commented Apr 15, 2024
- Add openlineage properties to Spark31BigQueryTable class
- Add BigQueryRelationProvider as an abstract class in the v2 module, to be extended by BaseBigQuerySource (the parent class of all the Spark BigQuery Table Provider classes).
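For readers unfamiliar with how a DataSource v2 table can expose lineage information, here is a minimal sketch of the first bullet. It is not the code from this PR. It assumes the OpenLineage Spark integration reads the dataset identity from the table property keys `openlineage.dataset.namespace` and `openlineage.dataset.name`, and that a BigQuery dataset is named `project.dataset.table` under the `bigquery` namespace. The class and method names are hypothetical.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of the connector: builds the property map a
// DataSource v2 Table could return from properties().
class OpenLineagePropertiesSketch {
    // Assumed property keys; check the OpenLineage Spark documentation before relying on them.
    static final String DATASET_NAMESPACE_KEY = "openlineage.dataset.namespace";
    static final String DATASET_NAME_KEY = "openlineage.dataset.name";

    // Copies the existing table properties and adds the lineage identity on top.
    static Map<String, String> withLineageProperties(
            Map<String, String> base, String project, String dataset, String table) {
        Map<String, String> props = new LinkedHashMap<>(base);
        props.put(DATASET_NAMESPACE_KEY, "bigquery"); // assumed namespace for BigQuery datasets
        props.put(DATASET_NAME_KEY, project + "." + dataset + "." + table);
        return Collections.unmodifiableMap(props);
    }

    public static void main(String[] args) {
        Map<String, String> props = withLineageProperties(
                new LinkedHashMap<>(), "my-project", "my_dataset", "my_table");
        System.out.println(props);
    }
}
```

A table class such as Spark31BigQueryTable would return a map like this from its `properties()` method, which is part of Spark's `Table` interface.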
| /gcbrun |
davidrabinowitz left a comment
Please add an integration test testing that the lineage events are created.
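One possible shape for such a check, as a sketch only: it assumes the test job is configured to write OpenLineage events to a local file with one JSON event per line, and that a finished run is marked with `"eventType": "COMPLETE"`. The class name, method names, and the sample table name are hypothetical, and the matching is plain string matching rather than real JSON parsing.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;

// Hypothetical test helper: scans emitted lineage events for a completed run
// that mentions the expected BigQuery table.
class LineageEventCheckSketch {
    // Returns true if any event line is a COMPLETE event that references tableName.
    static boolean hasCompletedEventFor(List<String> eventLines, String tableName) {
        for (String line : eventLines) {
            // Drop whitespace so "eventType": "COMPLETE" and "eventType":"COMPLETE" both match.
            String compact = line.replaceAll("\\s+", "");
            if (compact.contains("\"eventType\":\"COMPLETE\"") && compact.contains(tableName)) {
                return true;
            }
        }
        return false;
    }

    // Convenience overload: reads the events from a file, one JSON event per line (assumed layout).
    static boolean hasCompletedEventFor(Path eventFile, String tableName) throws IOException {
        return hasCompletedEventFor(
                Files.readAllLines(eventFile, StandardCharsets.UTF_8), tableName);
    }

    public static void main(String[] args) throws IOException {
        Path file = Files.createTempFile("lineage-events", ".jsonl");
        Files.write(file, Arrays.asList(
                "{\"eventType\": \"START\", \"inputs\": [{\"name\": \"p.d.t\"}]}",
                "{\"eventType\": \"COMPLETE\", \"inputs\": [{\"name\": \"p.d.t\"}]}"),
                StandardCharsets.UTF_8);
        System.out.println(hasCompletedEventFor(file, "p.d.t"));
        Files.delete(file);
    }
}
```

An integration test would run a read or write through the connector first, then assert on the result of a check like this.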
...uery-dsv2-common/src/main/java/com/google/cloud/spark/bigquery/BigQueryRelationProvider.java (outdated)
...igquery-dsv2-common/src/main/java/com/google/cloud/spark/bigquery/v2/BaseBigQuerySource.java
...uery-dsv2-common/src/main/java/com/google/cloud/spark/bigquery/BigQueryRelationProvider.java
| /gcbrun |
…gquery-connector-common
| I have moved the logic to the common module, as discussed, so that both dsv1 and dsv2 connectors call the same method internally. |
| /gcbrun |
| Hey @codelixir, thank you for your contribution! We appreciate your effort. Have you thought about leveraging the |
| Hi @ddebowczyk92, thanks for the input! We try to keep the DataSource v2 connectors Scala-agnostic to simplify usage for customers, given the incompatibility between Scala 2.12 and 2.13. Once this PR is done, we can think about how to incorporate the interface into the connector. |
...rc/test/java/com/google/cloud/spark/bigquery/integration/OpenLineageIntegrationTestBase.java
...rc/test/java/com/google/cloud/spark/bigquery/integration/OpenLineageIntegrationTestBase.java (outdated)
Signed-off-by: Pahulpreet Singh <pahulpreets@google.com>
| /gcbrun |
...rc/test/java/com/google/cloud/spark/bigquery/integration/OpenLineageIntegrationTestBase.java (outdated)
Signed-off-by: Pahulpreet Singh <pahulpreets@google.com>
| /gcbrun |
| /gcbrun |
Signed-off-by: Pahulpreet Singh <pahulpreets@google.com> (cherry picked from commit 558f18f)