Automated web scraper that collects real-time transport schedule data
A headless worker service that continuously scrapes transport schedule data from third-party websites using Puppeteer. Runs in configurable cycles with proxy rotation, posts collected data to a central API, and supports seat availability tracking and capacity change detection. Deployed as a Docker container with automatic restarts.
8,612 total lines of code
March 2025 — Present
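
A minimal sketch of two ideas from the description above, proxy rotation and capacity change detection, written as pure functions so they can run without a browser. None of these names come from the actual codebase: `SeatSnapshot`, `CapacityChange`, `createProxyRotator`, and `detectCapacityChanges` are hypothetical, and the round-robin strategy is an assumption, since the source does not say how proxies are selected.

```typescript
// Hypothetical shapes; the real service's data model is not shown in the source.
interface SeatSnapshot {
  tripId: string;
  availableSeats: number;
}

interface CapacityChange {
  tripId: string;
  previous: number | null; // null when the trip is seen for the first time
  current: number;
}

// Assumed strategy: round-robin. Each call returns the next proxy in the list
// and wraps around to the first one after the last.
function createProxyRotator(proxies: string[]): () => string {
  if (proxies.length === 0) {
    throw new Error("at least one proxy is required");
  }
  let index = 0;
  return () => {
    const proxy = proxies[index];
    index = (index + 1) % proxies.length;
    return proxy;
  };
}

// Compares the latest scrape with the seat counts remembered from earlier
// cycles, returns every trip whose count is new or different, and updates
// the remembered counts in place.
function detectCapacityChanges(
  previous: Map<string, number>,
  latest: SeatSnapshot[],
): CapacityChange[] {
  const changes: CapacityChange[] = [];
  for (const { tripId, availableSeats } of latest) {
    const before = previous.get(tripId);
    if (before === undefined || before !== availableSeats) {
      changes.push({
        tripId,
        previous: before ?? null,
        current: availableSeats,
      });
    }
    previous.set(tripId, availableSeats);
  }
  return changes;
}
```

In a scrape cycle, the value returned by the rotator could be passed to Chromium through Puppeteer's launch arguments (for example as a `--proxy-server=...` entry in `args`), and the list returned by `detectCapacityChanges` could be what gets posted to the central API. Both wiring choices are assumptions rather than a description of the deployed service.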