browsertrix/backend/btrixcloud/operator
Ilya Kreymer 8a507f0473
Consolidate list page endpoints + better QA sorting + optimize pages fix (#2417)
- consolidate list_pages() and list_replay_query_pages() into
list_pages()
- to keep backwards compatibility, add <crawl>/pagesSearch that does not
include page totals, keep <crawl>/pages with page total (slower)
- qa frontend: add default 'Crawl Order' sort order, to better show
pages in QA view
- bgjob: account for parallelism in bgjobs, add logging if succeeded
mismatches parallelism
- QA sorting: default to 'crawl order' by default to get better results.
- Optimize pages job: also cover crawls that may not have any pages but have pages listed in done stats
- Bgjobs: give custom op jobs more memory
2025-02-21 13:47:20 -08:00
..
__init__.py
baseoperator.py
bgjobs.py Consolidate list page endpoints + better QA sorting + optimize pages fix (#2417) 2025-02-21 13:47:20 -08:00
crawls.py Adjust crawler pvc on exit code 3 (out of storage) (#2375) 2025-02-20 11:03:28 -08:00
cronjobs.py Fix max pages quota setting and display (#2370) 2025-02-10 16:15:21 -08:00
models.py ensure running crawl configmap is updated when exclusions are added/removed (#2409) 2025-02-19 11:42:19 -08:00
profiles.py