A tool I built to automatically monitor darkweb forums and deliver threat intelligence alerts to Discord
Tired of manually checking darkweb forums for threat intelligence? I created this automation system that uses AI agents to scrape forum posts, detect keyword alerts, and deliver professional threat intel straight to your Discord. It's a personal project for my portfolio, but I hope it helps other cybersecurity folks stay informed about underground activities without the manual work.
After spending way too much time manually monitoring darkweb forums for threat intelligence, I realized we all face the same frustrations:
- β° Manual monitoring takes forever - Checking multiple forums every day eats up your time
- π Easy to miss important stuff - Critical posts happen while you're sleeping or busy
- π Information overload - Hundreds of posts with no good way to prioritize what matters
- π Same routine every day - Manually checking the same forums over and over
- π± Hard to share with the team - Screenshot and copy-pasting isn't scalable
- π‘οΈ Stealth requirements - Forums detect and block automated scrapers
So I built this DarkWeb Forums Tracker to automate the tedious monitoring routine:
β¨ AI Does the Monitoring - Google Gemini powered agents scrape forums with human-like behavior
π€ Workflows Handle Everything - n8n orchestrates the entire forum monitoring pipeline automatically
π± Discord Delivers Alerts - Real-time notifications with screenshots for keyword matches
π₯οΈ Human-in-the-Loop - VNC interface allows manual intervention for CAPTCHAs and login challenges
π³ Easy Setup - Just run docker compose up -d and you're monitoring forums
π Every 4 Hours - Set-and-forget automation that runs around the clock
If you're dealing with threat intelligence monitoring, this might help:
- π‘οΈ SOC Teams - Early warning system for emerging threats
- π΅οΈ Threat Hunters - Monitor threat actor communications and TTPs
- π‘ Threat Intel Analysts - Automate darkweb data collection
- ποΈ Security Managers - Executive summaries of underground activities
- π Security Consultants - Threat intelligence as a service for clients
- π’ MSP Teams - Monitor threats targeting your client industries
Prerequisites: Docker, Supabase Cloud account, Discord webhook, Google Gemini API key
# Clone and setup git clone https://github.com/brunosergi/darkweb-forums-tracker.git cd darkweb-forums-tracker cp .env.example .env # Configure your .env with Supabase and API credentials # Launch the platform docker compose up -d # Configure N8N credentials at http://localhost:5678 # Activate both workflows and start monitoring!π Complete Setup Guide: See SETUP.md for detailed step-by-step configuration
Services: N8N (5678) β’ VNC (6080) β’ Discord alerts every 4 hours
- n8n - Visual workflows that connect everything together
- Playwright MCP - AI-powered browser automation with stealth capabilities
- Google Gemini - The LLM that reads and analyzes forum content
- Supabase - Cloud PostgreSQL database with file storage
- Discord Webhooks - Where your team gets the real-time alerts
- Docker - Everything runs in containers
- π΅πΏ DarkForums.st - Track database leaks and breach discussions
- π Scheduled Monitoring - System checks configured forums every 4 hours
- π€ AI Agent Scraping - Playwright MCP with browser automation extracts forum posts and timestamps
- π Smart Deduplication - Only processes new posts (no duplicates from database)
- π― Entity Detection - Advanced keyword matching with canonical names, variations, and text normalization
- π Retry Logic - 2-attempt retry system with intelligent backoff for failed operations
- πΈ Screenshot & Analysis - For alerts: captures screenshots and generates AI summaries
- π± Discord Delivery - Comprehensive logging with color-coded alerts and detailed status updates
- πΎ Database Storage - Enhanced schema with timestampz format and entity tracking
- CAPTCHA Solving: AI gets stuck on DDoS-Guard or forum CAPTCHAs
- Manual Login: First-time authentication for login-protected forums
- Bot Detection: Bypass anti-bot measures that require human interaction
- Session Recovery: Re-authenticate when login sessions expire
- AI Agent Running: Playwright MCP browser automation in progress
- Challenge Detected: Agent encounters CAPTCHA or login requirement
- Manual Intervention: Connect to VNC and solve the challenge
- AI Continues: Agent resumes automated scraping after manual help
The VNC interface runs a full Chrome browser where you can see exactly what the AI agent sees and interact with any elements that require human input.
When you need to manually control the browser for troubleshooting, authentication, or CAPTCHA solving:
VNC Web Interface: Access http://localhost:6080
- Press Alt+F2 and enter:
chromium - Or right-click desktop β Applications β run terminal and type:
chromium &
Container Terminal:
docker exec -it darkweb-forums-tracker-playwright bash chromium &Perfect for solving CAPTCHAs, setting up authentication cookies, debugging failed scrapes, or manually navigating complex login flows that the AI agent couldn't handle automatically.
β
Schedule trigger - Starts each 4 hours
β
Discord notifications - Send scan started info (helps human attention if something goes wrong)
β
Forum URL loop - Given URLs for forums to iterate through
β
AI Agent scanning - Scan current URL target in loop
β
Error handling - IF loop success β continue workflow, IF loop fail β failure branch (bot_captcha, login_needed, etc)
β
Deduplication - Remove duplicate posts
β
Keyword branch separation - Add keywords to "Keywords" node in array format (user customizable)
β
Alert workflows - IF has any keyword β alert branch (screenshot + AI summary), IF no keywords β normal branch
β
Human-in-the-Loop VNC - Web-based browser GUI for manual CAPTCHA solving and login assistance
β
Shared browser sessions - VNC and Playwright MCP use same Chromium executable with shared user data directory for persistent login sessions
β
AI Agent retry logic - 2-attempt retry system with intelligent backoff and Discord notifications
β
Enhanced entity detection - Smart keyword matching with canonical names, variations, and text normalization
β
Timestamp format standardization - All dates stored in timestampz format for proper temporal analysis
β
Advanced Discord logging - Comprehensive status tracking with color-coded alerts and detailed error reporting
β
Database storage - Saves everything for historical analysis and tracking β
Discord results - Send to Discord with distinction between normal/alert posts using entity-based detection
- Multiple forums support - Track breach forums, lockbit, xss.is, etc.
- Scroll down a little bit before taking screenshot for better view
- Modularize workflows into multiple workflows to separate concerns
- Solve captcha and login sub-workflow tools - Call AI Agents to handle automatically
- Discord notifications asking for human presence when captcha/forum blocks occur
- Commercial/residential proxies support for anti-bot problems (IP rotation, captcha solver)
- AI Agent auto-login - Provide forum credentials in .env for AI Agent usage
- AI agent N8N model selector - Define which LLM to use and AI Agent fallback
- 2nd AI Agent retry logic - Apply same 2-attempt retry system to alert post analysis workflow
- pg_vector - Vectorize Supabase data and turn forum posts into RAG for AI chat
- Better schedule trigger - Configure minutes/hours for each forum URL path
- Tor proxy to reach .onion forums
- More notification apps - Slack, Telegram, etc.
These features will transform it from a simple forum monitor into a comprehensive darkweb threat intelligence platform. The goal is to make it reliable enough that security teams actually depend on it for underground threat detection.
β Star this repo if you find it useful!
π Setup Guide β’ βοΈ Workflows
Just a guy building tools for the cybersecurity community π‘οΈ