Turn Any Website into an API

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
  1. crawler-buddy

    Crawling framework, RSS reader and parser

    For YouTube yet dlp is also used to augment results.

    I can crawl using requests, selenium, Httpx and others. Response is via json so it easy to process.

    The downside is that it may not be the fastest solution, and I have not tested it against proxies.

    https://github.com/rumca-js/crawler-buddy

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. cinescrapers

    Web scrapers for London indie cinemas

    This is relevant to my interests[0]

    Based on the website I was quite skeptical. It looks too much like an "indiehacker", minimum-almost-viable-product, fake-it-till-you-make-it, trolling-for-email-addresses kind of website.

    But after a quick search on twitter, it seems like people are actually using it and reporting good results. Maybe I'll take a proper look at it at some point.

    I'd still like to know more about pricing, how it deals with cloudflare challenges, non-semantic markup and other awkwardnesses.

    [0] https://github.com/Joeboy/cinescrapers

  4. pydoll

    Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.

    It's not a browser extension, but controlling the actual browser without using webdriver is already a thing.

    https://github.com/autoscrape-labs/pydoll

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

Did you know that Python is
the 2nd most popular programming language
based on number of references?