Best way to programmatically extract data from a set of .pdf files?

This page summarizes the projects mentioned and recommended in the original post on /r/artificial

InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
  1. haystack

    AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

    But if you want an API that you can use to develop your own flow, Haystack from Deepset could be worth a look.

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Generative AI Frameworks and Tools Every Developer Should Know!

    1 project | dev.to | 13 Dec 2023
  • Llama2 and Haystack on Colab

    2 projects | news.ycombinator.com | 21 Jul 2023
  • Build with LLMs for production with Haystack – has 10k stars on GitHub

    2 projects | news.ycombinator.com | 17 Jul 2023
  • Show HN: Haystack – Production-Ready LLM Framework

    1 project | news.ycombinator.com | 11 Jul 2023
  • Show HN: "banks" Using Jinja as the basis of LLM prompt templating

    2 projects | news.ycombinator.com | 15 Jun 2023

Did you know that Python is
the 2nd most popular programming language
based on number of references?