Skip to content

Pull requests: huggingface/pyspark_huggingface

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

v2
#14 by lhoestq was merged Aug 12, 2025 Loading…
Enable content defined chunking
#13 by lhoestq was merged Aug 12, 2025 Loading…
Use uv project manager and add CI tests
#12 by wengh was merged Jun 21, 2025 Loading…
Backport data source to pyspark 3
#11 by lhoestq was merged May 29, 2025 Loading…
5 of 6 tasks
Update demo.ipynb
#10 by CharlesCNorton was merged Aug 12, 2025 Loading…
Fix import for compatibility with older huggingface_hub
#9 by wengh was merged Feb 19, 2025 Loading…
Use same data source name for reader and writer
#7 by wengh was merged Jan 31, 2025 Loading…
Support custom split name by renaming files
#6 by wengh was merged Jan 31, 2025 Loading…
Add HuggingFaceSink data source
#5 by wengh was merged Jan 30, 2025 Loading…
Enable predicate pushdown
#4 by lhoestq was merged Dec 16, 2024 Loading…
Add more features to huggingface reader
#3 by allisonwang-db was merged Dec 4, 2024 Loading…
initial pyproject.toml
#2 by lhoestq was merged Nov 26, 2024 Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.