Skip to content

Commit 614dcfc

Browse files
Add support for Salesforce sink (#18)
* init * Update index.md * add schema option, batch write to salesforce * update doc: salesforce sink * update option: schema * add tests * rm tests * rm sys * fix poetry.lock
1 parent 7d48f9a commit 614dcfc

File tree

10 files changed

+1345
-5
lines changed

10 files changed

+1345
-5
lines changed

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -46,6 +46,7 @@ spark.readStream.format("fake").load().writeStream.format("console").start()
4646
| [KaggleDataSource](pyspark_datasources/kaggle.py) | `kaggle` | Read datasets from Kaggle | `kagglehub`, `pandas` |
4747
| [SimpleJsonDataSource](pyspark_datasources/simplejson.py) | `simplejson` | Write JSON data to Databricks DBFS | `databricks-sdk` |
4848
| [OpenSkyDataSource](pyspark_datasources/opensky.py) | `opensky` | Read from OpenSky Network. | None |
49+
| [SalesforceDataSource](pyspark_datasources/salesforce.py) | `salesforce` | Streaming sink for writing data to Salesforce | `simple-salesforce` |
4950

5051
See more here: https://allisonwang-db.github.io/pyspark-data-sources/.
5152

docs/datasources/salesforce.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
# SalesforceDataSource
2+
3+
> Requires the [`simple-salesforce`](https://github.com/simple-salesforce/simple-salesforce) library. You can install it manually: `pip install simple-salesforce`
4+
> or use `pip install pyspark-data-sources[salesforce]`.
5+
6+
::: pyspark_datasources.salesforce.SalesforceDataSource

docs/index.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -38,5 +38,6 @@ spark.readStream.format("fake").load().writeStream.format("console").start()
3838
| [HuggingFaceDatasets](./datasources/huggingface.md) | `huggingface` | Read datasets from the HuggingFace Hub | `datasets` |
3939
| [StockDataSource](./datasources/stock.md) | `stock` | Read stock data from Alpha Vantage | None |
4040
| [SimpleJsonDataSource](./datasources/simplejson.md) | `simplejson` | Write JSON data to Databricks DBFS | `databricks-sdk` |
41+
| [SalesforceDataSource](./datasources/salesforce.md) | `salesforce` | Write streaming data to Salesforce objects |`simple-salesforce` |
4142
| [GoogleSheetsDataSource](./datasources/googlesheets.md) | `googlesheets` | Read table from public Google Sheets document | None |
4243
| [KaggleDataSource](./datasources/kaggle.md) | `kaggle` | Read datasets from Kaggle | `kagglehub`, `pandas` |

0 commit comments

Comments
 (0)