browsertrix

History

Ilya Kreymer 57a4b6b46f add collections api: - collections defined by name per archive - can update collections with additional metadata (currently just description) - crawl config api accepts a list of collections by name, resolved to collection uids and stored in config - finished crawls also associated with collection list - /archives/{aid}/collections/{name} can list all crawl artifacts (wacz files) from a named collection (in frictionless data package-ish format) - /archives/{aid}/collections/$all lists all crawled artifacts for the archive readiness check: add /healthz endpoints for app and nginx ingress: add /data/ route to local bucket storage improvements: - for default storages, store path only, and prepend default storage access endpoint - collections api returns the paths using the storage access endpoint - define default storages as secrets in k8s (can support multiple), hard-coded in docker (only one for now)		2021-10-27 09:39:14 -07:00
..
archives.py	add collections api:	2021-10-27 09:39:14 -07:00
colls.py	add collections api:	2021-10-27 09:39:14 -07:00
crawlconfigs.py	add collections api:	2021-10-27 09:39:14 -07:00
crawls.py	add collections api:	2021-10-27 09:39:14 -07:00
db.py	use redis based queue instead of url for crawl done webhook	2021-10-10 12:18:28 -07:00
Dockerfile	initial commit!	2021-06-28 15:48:59 -07:00
dockerman.py	add collections api:	2021-10-27 09:39:14 -07:00
emailsender.py	new features:	2021-08-20 11:02:29 -07:00
k8sman.py	add collections api:	2021-10-27 09:39:14 -07:00
main.py	add collections api:	2021-10-27 09:39:14 -07:00
requirements.txt	Storage + Data Model Refactor (fixes #3 ):	2021-10-09 18:58:40 -07:00
scheduler.py	add missing scheduler!	2021-08-25 16:18:53 -07:00
storages.py	Storage + Data Model Refactor (fixes #3 ):	2021-10-09 18:58:40 -07:00
users.py	add ingress + nginx container for better routing	2021-10-09 23:47:29 -07:00