browsertrix/backend
Ilya Kreymer f7836c345d
Crawl Queue API (#342)
* crawl queue api work: (#329)
- add api to /crawls/{crawl_id}/queue api to get crawl queue, with offset, count, and optional regex. returns results and regex matches within the results, along with total urls in queue.
- add api to match entire crawl queue, /crawls/{crawl_id}/queueMatch with query 'regex' arg, which processes entire crawl queue on backend and returns a list of matches (more experimental)
- if crawl not yet started / redis not available, return empty queue
- only supported for k8s deployment at the moment
2022-10-12 19:56:13 -07:00
..
btrixcloud Crawl Queue API (#342) 2022-10-12 19:56:13 -07:00
test affinity: add affinity for k8s crawl deployments: 2022-06-07 21:52:04 -07:00
Dockerfile Single config and env vars (#267) 2022-06-16 21:50:03 -07:00
requirements.txt improvements / bug fixes for stop/cancel handling: (#279) 2022-06-29 17:47:25 -07:00