Python Datalake

Open-source Python projects categorized as Datalake

Python Datalake Projects

  1. pandas-ai

    Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

    Project mention: Pandas AI | news.ycombinator.com | 2025-07-18
  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. datarepo

    Project mention: Show HN: Datarepo – a data catalog that doesn't need a service or database | news.ycombinator.com | 2025-07-08
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Datalake discussion

Python Datalake related posts

  • Show HN: Datarepo – a data catalog that doesn't need a service or database

    1 project | news.ycombinator.com | 8 Jul 2025
  • Neuralink Open Sources Data Catalog for Multimodal Data

    1 project | news.ycombinator.com | 24 Jun 2025

Index

# Project Stars
1 pandas-ai 22,882
2 datarepo 151

Sponsored
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?