Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure. Learn more →
Iceberg-python Alternatives
Similar projects and alternatives to iceberg-python
-
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
PostgreSQL
Mirror of the official PostgreSQL GIT repository. Note that this is just a *mirror* - we don't work with pull requests on github. To contribute, please see https://wiki.postgresql.org/wiki/Submitting_a_Patch
-
Ansible
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.
-
-
-
-
LocalStack
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
docker
FreeBSD port of docker, take a look at PORTING-FREEBSD.md in freebsd-compat branch (by kvasdopil)
-
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs (by delta-io)
-
-
-
langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
-
lance
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
-
Daft
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
-
-
-
-
incubator-airflow
Discontinued Apache Airflow - A platform to programmatically author, schedule, and monitor workflows [Moved to: https://github.com/apache/airflow]
-
-
polaris-local-forge
A comprehensive development environment for Apache Polaris featuring LocalStack integration on k3s. This kit automates the setup of a complete Polaris environment with S3-compatible storage, authentication, and role-based access control.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
iceberg-python discussion
iceberg-python reviews and mentions
- DuckLake is an integrated data lake and catalog format
Have you tried out PyIceberg yet? It's a pure Python implementation and it works pretty well. It supports a SQL Catalog as well as an In-Memory Catalog via a baked in SQLite SQL Catalog.
https://py.iceberg.apache.org/
- AWS open source newsletter, #207
Access data in Amazon S3 Tables using PyIceberg through the AWS Glue Iceberg REST endpoint - demonstrates how to access Iceberg tables stored in S3 Tables using PyIceberg, a Python library for programmatic access to Iceberg table metadata as well as to table data in Iceberg format [hands on]
- Let's Build Together: A Local Playground for Apache Polaris
PyIceberg - Python library for Apache Iceberg
- Ultimate Directory of Apache Iceberg Resources
PyIceberg Docs
- Quick tip: Using SingleStore with PyIceberg
In a previous article, we implemented an Iceberg catalog using SingleStore and JDBC. Another way that we can create the catalog is using PyIceberg. In this article, we'll see how.
- Lessons Learned from Scaling to Multi-Terabyte Datasets
Iceberg is working hard to support pure python[0] / rust[1] workflows without Spark. Following Tabular's acquisition [2], I hope it still moves in this direction at the same clip.
We're using iceberg + duckdb to power analytics in our app[3] and I'm really happy with the combo.
0 - https://github.com/apache/iceberg-python
1 - https://github.com/apache/iceberg-rust
2 - https://x.com/thisritchie/status/1800522255426072647
3 - https://www.definite.app/
- Understanding Parquet, Iceberg and Data Lakehouses
You don't need a Spark deployment. The first reference implementations for reading and writing were in Spark.
Now, with PyIceberg, there is read support in Python. Write support should be merged very soon - https://github.com/apache/iceberg-python/pull/41
- A note from our sponsor - Stream getstream.io | 23 Dec 2025
Stats
apache/iceberg-python is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of iceberg-python is Python.