| Component | Technology | Purpose |
| --- | --- | --- |
| Scheduler Engine | Cron (Linux) / AWS EventBridge / Azure Scheduler | Automates job triggering at fixed intervals |
| Job Executor | Python CLI / Container Entrypoint | Executes the scraping pipeline when triggered |
| Lock Mechanism | File lock / Redis / Cloud-native mutex | Prevents overlapping executions |
| Trigger API | Flask / FastAPI (optional) | Allows manual triggering from a web panel or CLI |
| Metadata Logger | Python Logging / CloudWatch / Log Analytics | Records timestamps, durations, and execution results |
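
To make the interaction between the Job Executor, the Lock Mechanism, and the Metadata Logger concrete, here is a minimal sketch using only the Python standard library. It assumes the simplest option from each row: a lock file created atomically on local disk, and Python's `logging` module for the run record. The names `file_lock`, `run_job`, `JobAlreadyRunning`, and the default path `scraper.lock` are hypothetical and chosen for illustration; they are not taken from the original design.

```python
import json
import logging
import os
import time
from contextlib import contextmanager
from datetime import datetime, timezone

logging.basicConfig(level=logging.INFO, format="%(asctime)s %(levelname)s %(message)s")
log = logging.getLogger("scrape_job")  # hypothetical logger name


class JobAlreadyRunning(RuntimeError):
    """Raised when another run already holds the lock file."""


@contextmanager
def file_lock(path):
    """Hold an exclusive lock by atomically creating `path`.

    O_CREAT | O_EXCL makes os.open fail if the file already exists,
    so only one process can create it at a time.
    """
    try:
        fd = os.open(path, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
    except FileExistsError:
        raise JobAlreadyRunning(path) from None
    try:
        os.write(fd, str(os.getpid()).encode())  # record the owner for debugging
        yield
    finally:
        os.close(fd)
        os.remove(path)  # release the lock even if the pipeline raised


def run_job(pipeline, lock_path="scraper.lock"):
    """Run `pipeline` under the lock and log start time, duration, and result."""
    record = {
        "started_at": datetime.now(timezone.utc).isoformat(),
        "status": "success",
        "error": None,
    }
    t0 = time.monotonic()
    try:
        with file_lock(lock_path):
            pipeline()
    except JobAlreadyRunning:
        record["status"] = "skipped"  # an overlapping run is in progress
    except Exception as exc:  # record the failure instead of crashing the scheduler
        record["status"] = "failed"
        record["error"] = repr(exc)
    record["duration_s"] = round(time.monotonic() - t0, 3)
    log.info("job finished: %s", json.dumps(record))
    return record
```

A scheduler entry (a cron line or a container entrypoint) would call `run_job` with the real scraping function. One limitation of this sketch is that a process killed before the `finally` block runs leaves a stale lock file behind; a Redis key with an expiry or a cloud-native mutex, as listed in the table, is one way to avoid that at the cost of an extra dependency.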