browsertrix/backend/btrixcloud/operator
Ilya Kreymer 8ea16393c5
Optimize single-page crawl workflows (#2656)
For single-page crawls:
- Always force a single browser, ignoring the browser windows/scale
setting
- Don't use custom PVC volumes for the crawler / redis, just use emptyDir:
for a single page there is no chance of the crawler being interrupted and
restarted on a different machine.
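
A minimal sketch of the two rules above, not the operator's actual code. The function name `plan_crawler_resources`, the returned dict layout, and the claim name `crawl-data` are hypothetical; only `emptyDir` and `persistentVolumeClaim` are real Kubernetes volume source keys.

```python
from typing import Any, Dict


def plan_crawler_resources(single_page: bool, requested_windows: int) -> Dict[str, Any]:
    """Pick browser count and volume source for a crawl (illustrative only)."""
    if single_page:
        # Single-page crawl: force one browser and use ephemeral storage,
        # regardless of the browser windows/scale setting.
        return {"browser_windows": 1, "volume": {"emptyDir": {}}}

    # Otherwise keep the requested setting and a persistent volume claim.
    # "crawl-data" is a hypothetical claim name.
    return {
        "browser_windows": requested_windows,
        "volume": {"persistentVolumeClaim": {"claimName": "crawl-data"}},
    }


single = plan_crawler_resources(single_page=True, requested_windows=4)
multi = plan_crawler_resources(single_page=False, requested_windows=4)
```

Here `single` ignores the requested four windows, while `multi` keeps them and a persistent volume.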

Adds an 'is_single_page' check to CrawlConfig, based on either the limit
or the scopeType with no extra hops.
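
One way such a check could look, as a sketch under stated assumptions rather than the code in models.py. The `CrawlSettings` class and its field names are hypothetical stand-ins, and treating `limit == 1`, a `"page"` scope type, and exactly one seed as the conditions is an assumption not spelled out in this message.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CrawlSettings:
    """Hypothetical stand-in for the relevant crawl config fields."""

    limit: Optional[int] = None       # max pages to crawl, if set
    scope_type: Optional[str] = None  # e.g. "page", "prefix"
    extra_hops: int = 0               # links to follow beyond the scope
    seed_count: int = 1               # number of seed URLs (assumed field)


def is_single_page(cfg: CrawlSettings) -> bool:
    """Return True if the config can only ever visit one page."""
    if cfg.seed_count != 1:
        # Assumption: more than one seed means more than one page.
        return False
    if cfg.limit == 1:
        # A page limit of 1 caps the crawl regardless of scope.
        return True
    # Otherwise require page scope with no extra hops.
    return cfg.scope_type == "page" and cfg.extra_hops == 0
```

For example, `is_single_page(CrawlSettings(scope_type="page"))` is true, while adding `extra_hops=1` makes it false.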

Fixes #2655
2025-06-10 12:13:57 -07:00
__init__.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
baseoperator.py Allow users to run crawls with 1 or 2 browser windows (#2627) 2025-06-03 13:37:30 -07:00
bgjobs.py Consolidate list page endpoints + better QA sorting + optimize pages fix (#2417) 2025-02-21 13:47:20 -08:00
crawls.py Optimize single-page crawl workflows (#2656) 2025-06-10 12:13:57 -07:00
cronjobs.py Optimize single-page crawl workflows (#2656) 2025-06-10 12:13:57 -07:00
models.py Optimize single-page crawl workflows (#2656) 2025-06-10 12:13:57 -07:00
profiles.py support overriding crawler image pull policy per channel (#2523) 2025-03-31 14:11:41 -07:00