Tired of 403s and blank pages when scraping JavaScript-heavy websites?
You're not alone — and that's exactly why I built ScrapeSome.
🚀 What Is ScrapeSome?
ScrapeSome is a developer-friendly Python library that makes scraping modern websites simple — even the ones loaded with dynamic JavaScript or tough anti-bot protections.
It combines:
- ✅ Sync and async support
- 🎭 Automatic Playwright fallback for headless browser rendering
- 💻 CLI support: scrape straight from your terminal
- 🛡️ Built-in error handling, timeouts, and retries
- 📄 Output formats: HTML, Markdown, text, or JSON
It’s fast, lightweight, and requires zero boilerplate.
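To make the "automatic fallback" and "retries" ideas above concrete, here is a minimal sketch of the general pattern. It is not ScrapeSome's actual implementation: `FetchResult`, `looks_blocked_or_empty`, `fetch_with_fallback`, and the 200-character threshold are all hypothetical names and values chosen for illustration. The two fetchers are passed in as plain callables, so the sketch runs without any network access or browser installed.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class FetchResult:
    """Hypothetical container for a plain HTTP response."""
    status: int
    body: str


def looks_blocked_or_empty(result: FetchResult, min_chars: int = 200) -> bool:
    """Heuristic (an assumption): a 403/429 or a near-empty body suggests
    the page needs a real browser to render."""
    return result.status in (403, 429) or len(result.body.strip()) < min_chars


def fetch_with_fallback(
    url: str,
    fetch_plain: Callable[[str], FetchResult],
    fetch_rendered: Callable[[str], str],
    retries: int = 2,
) -> str:
    """Try the cheap fetcher first; fall back to the rendering fetcher."""
    for _ in range(retries + 1):
        try:
            result = fetch_plain(url)
        except OSError:
            # Network-level failure: retry the cheap path.
            continue
        if not looks_blocked_or_empty(result):
            return result.body
        # Blocked or blank page: retrying the same request rarely helps.
        break
    return fetch_rendered(url)


if __name__ == "__main__":
    # Stub fetchers standing in for an HTTP client and a headless browser.
    def fake_plain(url: str) -> FetchResult:
        return FetchResult(status=403, body="")

    def fake_rendered(url: str) -> str:
        return "<html>rendered</html>"

    print(fetch_with_fallback("https://example.com", fake_plain, fake_rendered))
```

In a real setup the plain fetcher would wrap an HTTP client and the rendering fetcher would wrap a headless browser. Keeping them as injected callables makes the fallback decision easy to test on its own.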
🔧 Why I Built It
I kept hitting walls on scraping projects:
- Pages rendered everything with JavaScript
- APIs were locked down or undocumented
- requests and Scrapy failed or came back with 403 errors
- Setting up full browser automation felt too heavy for small jobs
So I built ScrapeSome to fill the gap between requests and full-on headless scraping frameworks.
⚙️ Quick Example
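    from scrapesome import sync_scraper

    # Basic usage: fetch a page with the default settings
    html = sync_scraper("https://example.com")

    # Force Playwright rendering and ask for Markdown output
    markdown = sync_scraper(
        "https://example.com",
        force_playwright=True,
        output_format="markdown",
        user_agents=["Mozilla/5.0"],
    )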
💻 CLI usage
scrapesome scrape --url https://example.com --output-format json
You can even configure behavior with environment variables — great for scripting.
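For scripting, the CLI call above can also be driven from Python. The sketch below uses only the `scrape` subcommand and the `--url` and `--output-format` flags shown in this post. The helper names `build_scrape_command` and `run_scrape` are hypothetical, and the sketch assumes the CLI writes its result to standard output. Because `subprocess.run` inherits the parent process's environment by default, any environment-based configuration set in the calling shell carries through.

```python
import subprocess
from typing import List


def build_scrape_command(url: str, output_format: str = "json") -> List[str]:
    """Build the argv list for the CLI call shown above (hypothetical helper)."""
    return ["scrapesome", "scrape", "--url", url, "--output-format", output_format]


def run_scrape(url: str, output_format: str = "json", timeout: float = 60.0) -> str:
    """Run the CLI and return its stdout.

    Assumes the result is printed to stdout; raises CalledProcessError
    on a non-zero exit code and TimeoutExpired if the call hangs.
    """
    completed = subprocess.run(
        build_scrape_command(url, output_format),
        capture_output=True,
        text=True,
        check=True,
        timeout=timeout,
    )
    return completed.stdout
```

Passing the command as a list rather than a shell string avoids quoting problems when URLs contain characters such as `&` or `?`.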
📦 Install It
pip install scrapesome
🧪 Try it out on PyPI:
👉 https://pypi.org/project/scrapesome/
🔗 Links
- 🔧 GitHub: github.com/scrapesome/scrapesome
- 📚 Docs: scrapesome.onrender.com
- 📄 Full blog post: Medium
🙌 Feedback Welcome
This is an early release, and I’d love to hear your thoughts.
Try it, break it, file issues, suggest features — or just ⭐ the repo if you like the idea!
Happy scraping! 🕷️
— Vishnu Vardhan Reddy