browsertrix/backend
Ilya Kreymer 793611e5bb
add exclusion api, fixes #311 (#349)
* add exclusion api, fixes #311
add new apis: `POST crawls/{crawl_id}/exclusion?regex=...` and `DELETE crawls/{crawl_id}/exclusion?regex=...` which will:
- create new config with add 'regex' as exclusion (deleting or making inactive previous config) OR remove as exclusion.
- update crawl to point to new config
- update statefulset to point to new config, causing crawler pods to restart
- filter out urls matching 'regex' from both queue and seen list (currently a bit slow) (when adding only)
- return 400 if exclusion already existing when adding, or doesn't exist when removing
- api reads redis list in reverse to match how exclusion queue is used
2022-11-12 17:24:30 -08:00
..
btrixcloud add exclusion api, fixes #311 (#349) 2022-11-12 17:24:30 -08:00
test affinity: add affinity for k8s crawl deployments: 2022-06-07 21:52:04 -07:00
Dockerfile Single config and env vars (#267) 2022-06-16 21:50:03 -07:00
requirements.txt improvements / bug fixes for stop/cancel handling: (#279) 2022-06-29 17:47:25 -07:00