⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Updated Oct 7, 2025 - Python
re_data - fix data issues before your users and CEO discover them 😊
A Python library for efficient feature ranking and selection on sparse data sets.
Load dbt artifacts uploaded to GCS to BigQuery in order to track historical dbt results
⚡ Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.
DEPRECATED! Please move to bunnyxt/tdd-spider. The crawler for https://tdd.bunnyxt.com, written in Python.
A set of simple tools for monitoring data on the Câmara dos Deputados (Brazilian Chamber of Deputies) website
Trending code, platforms, tools, and processes.
Real-time network data monitoring system using RabbitMQ, InfluxDB, and Chronograf
An Apache Airflow pipeline that extracts JSON files from an AWS S3 bucket and inserts them into an AWS Redshift cluster.
Automate password review reminders using GitHub Actions, Excel data, and email alerts. Stay secure by getting notified when your passwords are due for review or haven’t been updated in a long time.
A Python-based web scraping project that automates fetching SEC Form 8-K cybersecurity incident filings from the EDGAR database and sends automated email alerts with detailed reports on material breaches or incidents reported by U.S. public companies.
End-to-end real-time data pipeline combining embedded systems and Python for ingesting, transforming, and visualizing sensor data. Demonstrates hands-on skills in data acquisition, serial communication, real-time analytics, and dashboard development. Ideal showcase of edge-to-GUI telemetry architecture for data-centric roles.
Airflow + Postgres DQ platform: observability, rule-based monitoring (SQL+JSON), pluggable alerting, Metabase dashboards.
Version 0: A Python-based network monitoring tool that logs traffic, detects spikes or high usage, stores data in MySQL, and sends email alerts when thresholds are exceeded.
RepoRadar tracks GitHub repo transfers in real time and turns them into early M&A signals. Startups see what tech and teams are getting scooped up; investors get alerts on stealth acquisitions and acqui-hires before the press release. One-command deploy, Slack pings, clean dashboard.
An online monitor for acquired Schottky data during storage-ring experiments
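Several of the projects listed above describe rule-based monitoring with threshold alerts. As a rough illustration of that general idea only, and not the API of any listed project, here is a minimal Python sketch. The rule format, the metric names (`row_count`, `null_fraction`), and the function `evaluate_rules` are all hypothetical and chosen for this example.

```python
import json

# Hypothetical rule definitions: each rule names a metric and an allowed range.
RULES_JSON = """
[
  {"metric": "row_count", "min": 100},
  {"metric": "null_fraction", "max": 0.05}
]
"""


def evaluate_rules(metrics, rules):
    """Return a list of violation messages; an empty list means every rule passed."""
    violations = []
    for rule in rules:
        name = rule["metric"]
        if name not in metrics:
            # A metric that was never computed is itself worth flagging.
            violations.append(f"{name}: metric missing")
            continue
        value = metrics[name]
        if "min" in rule and value < rule["min"]:
            violations.append(f"{name}: {value} is below minimum {rule['min']}")
        if "max" in rule and value > rule["max"]:
            violations.append(f"{name}: {value} is above maximum {rule['max']}")
    return violations


rules = json.loads(RULES_JSON)

# Example metrics, as they might be computed from a table by a scheduled job.
observed = {"row_count": 42, "null_fraction": 0.01}
for message in evaluate_rules(observed, rules):
    print("ALERT:", message)
```

In a real deployment the metrics would typically come from SQL queries, and the alert would go to email or chat rather than standard output. Keeping the rules as data rather than code makes them easy to review and change without redeploying.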