InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 23 Java Database Projects
- Project mention: CLI to manage your SQL database schemas and migrations | news.ycombinator.com | 2025-11-01
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
Chat2DB
🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
Project mention: Understanding MySQL Composite Indexes: Structure, Search Behavior, and Optimization Principles | dev.to | 2025-05-06Go to Chat2DB website 🙋 Join the Chat2DB Community 🐦 Follow us on X 📝 Find us on Discord
- Project mention: 100000 TPS over a billion rows: the unreasonable effectiveness of SQLite | news.ycombinator.com | 2025-12-02
A larger pool actually makes the number worse because it adds more contention. I tested it with 64 and all the results were worse. The last example which was 348 TPS drops to 164 TPS!
Tangentially I also highly recommend this article on pool sizing.
https://github.com/brettwooldridge/HikariCP/wiki/About-Pool-...
-
shardingsphere
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
Project mention: Apache ShardingSphere: Effortless Database Scaling and Beyond! | dev.to | 2025-02-21View the Project on GitHub
- Project mention: 15 AWS EMR Cost Optimization Tips to Slash Your EMR Spending (2025) | dev.to | 2025-12-16
AWS EMR (Elastic MapReduce) is a fully managed big data platform. It manages the setup, configuration, and tuning of open source frameworks like Apache Hadoop, Apache Spark, Apache Hive, Presto, and more at scale on AWS infrastructure. EMR handles cluster scaling, resource allocation, and lifecycle management. This allows you to work with large datasets for various use cases, from ETL pipelines to ML workloads. EMR uses a pay-as-you-go pricing model. Costs for compute, storage, and other AWS services can add up quickly as your data grows, clusters get bigger, and jobs become more complex. If you're not careful, costs can skyrocket due to inefficient resource use, poor instance choices, and misconfigured storage. That's why AWS EMR Cost Optimization is key. It helps you get the best performance per dollar while maintaining data processing speed, reliability, and scalability.
-
- Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26
Neo4j - neo4j.com/
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
- Project mention: Apache Doris 4.0: One Engine for Analytics, Full-Text Search, and Vector Search | dev.to | 2025-10-24
The latest version of Apache Doris is now available for download. Visit doris.apache.org for detailed release notes and upgrade guides, and join the Doris community to explore, test, and share your feedback.
- Project mention: Reverse Proxy Deep Dive: The Hidden Complexity of Service Discovery | dev.to | 2025-07-30
ZooKeeper (classic choice for strongly consistent registries). Examples: Pinterest, Linkedin
-
We can now create a schema on Iceberg to hold our table. We also create a new table called employee to store employee details. The Trino processing engine will be used to perform these actions.
-
As we mentioned earlier, we conducted a prototype test that revealed the limit of 30k msg/s throughput when using PostgreSQL for persistence message storage. At the moment we migrated to Redis, we already used the Jedis library for Redis interactions, primarily for cache management, and extended it to handle message persistence for persistent MQTT clients. However, the initial results of the Redis implementation with Jedis were unexpected. While we anticipated Redis would significantly outperform PostgreSQL, the performance improvement was modest — reaching only 40k msg/s throughput compared to the 30k msg/s limit with PostgreSQL.
-
debezium
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
Project mention: Listen to Database Changes Through the Postgres WAL | news.ycombinator.com | 2025-11-16 -
-
starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
-
-
Flyway is a Java-based migration tool that’s widely used in enterprise settings. While not Go-native, you can integrate it into Go projects using its CLI or by calling its Java library. Flyway is great for teams needing robust versioning and audit-ready migration history.
-
Android Debug Database
A library for debugging android databases and shared preferences - Make Debugging Great Again
-
- Project mention: Apache HertzBeat is a real-time monitoring system | news.ycombinator.com | 2025-05-31
-
-
> How? Stored procedures?
Not neccesarily. jOOQ[1] and sqlc[2] are great options if you don't like stored procedures, but for a small app or a prototype, you could also just have plain SQL strings in your app.
My point isn't that the code has to be stored in the database, but rather that the processing happens in one place where your data is stored and your middle tier just gets the results. Pure, stateless data. This means you don't have to synchronise shared mutable state between your app and your DB server, cutting out all the headaches of ORMs, such as having to specify your data model in two separate places, n+1 queries, locking, caching, and optimistic concurrency control.
SQL also provides a declarative approach to defining your business logic. You define the 'what', not the 'how'. In addition to greater productivity, the programming model is much simpler because you aren't complecting control flow with data flow.
> How do you version that code
You put it into your VCS. SQL is part of your code base, you can and should version control it just like any Python, Ruby or Java code. When using stored procedures, I recommend putting them in a separate schema, so that the schema can be dropped and recreated in a single transaction during deployment.
> how do you reason with the business logic split all over the DB and code
There is no split. The core business logic is in SQL, with your middle tier doing the plumbing, orchestration of external services and presentation. You separate your concerns instead of mixing them.
[1] https://www.jooq.org/
-
Zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Project mention: 📊 Visualise Presto Queries with Apache Zeppelin: A Hands-On Guide | dev.to | 2025-05-12In the previous article, we explored the installation of Presto. Building on that foundation, it's time to take your data exploration one step further by integrating Presto with Apache Zeppelin, a powerful web-based notebook that allows interactive data analytics.
-
For years, Hibernate ORM has been one of the most popular frameworks in the Java ecosystem. It was built to simplify data persistence by letting developers work with Java objects instead of SQL statements, a technique known as object-relational mapping (ORM).
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Java Database discussion
Java Database related posts
-
More databases should be single-threaded
-
15 AWS EMR Cost Optimization Tips to Slash Your EMR Spending (2025)
-
Big data technology that is orders of magnitude faster than SQL
-
What kind of ORM engine does a low-code platform need? (2)
-
Apache Doris 4.0: One Engine for Analytics, Full-Text Search, and Vector Search
-
Apache Doris 4.0: One Engine for Analytics, Full-Text Search, and Vector Search
-
Liquibase continues to advertise itself as "open source" despite license switch
- A note from our sponsor - InfluxDB www.influxdata.com | 23 Dec 2025
Index
What are some of the best open-source Database projects in Java? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | dbeaver | 47,863 |
| 2 | Chat2DB | 24,844 |
| 3 | HikariCP | 20,909 |
| 4 | shardingsphere | 20,596 |
| 5 | Presto | 16,593 |
| 6 | QuestDB | 16,477 |
| 7 | Neo4j | 15,570 |
| 8 | doris | 14,765 |
| 9 | Apache ZooKeeper | 12,691 |
| 10 | Trino | 12,318 |
| 11 | Jedis | 12,239 |
| 12 | debezium | 12,210 |
| 13 | Realm | 11,484 |
| 14 | starrocks | 11,109 |
| 15 | Apache Cassandra | 9,541 |
| 16 | Flyway | 9,388 |
| 17 | Android Debug Database | 8,607 |
| 18 | shardingsphere-elasticjob | 8,215 |
| 19 | hertzbeat | 6,900 |
| 20 | Leaf | 6,682 |
| 21 | jOOQ | 6,607 |
| 22 | Zeppelin | 6,591 |
| 23 | Hibernate | 6,373 |