browsertrix/backend
Ilya Kreymer 57a4b6b46f add collections api:
- collections defined by name per archive
- can update collections with additional metadata (currently just description)
- crawl config api accepts a list of collection names, which are resolved to collection uids and stored in the config
- finished crawls are also associated with the collection list
- /archives/{aid}/collections/{name} lists all crawl artifacts (wacz files) from a named collection (in a format loosely based on frictionless data package)
- /archives/{aid}/collections/$all lists all crawled artifacts for the archive
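
The name-to-uid resolution described in the list above could look roughly like the following. This is only a sketch with an in-memory store: `CollectionStore`, `add_collection`, and `resolve_names` are hypothetical names chosen for illustration and are not taken from colls.py.

```python
import uuid
from typing import Dict, List, Tuple


class CollectionStore:
    """Hypothetical in-memory stand-in for per-archive collections."""

    def __init__(self) -> None:
        # (archive id, collection name) -> collection uid
        self._uids: Dict[Tuple[str, str], str] = {}
        # collection uid -> description (the only extra metadata for now)
        self._descriptions: Dict[str, str] = {}

    def add_collection(self, aid: str, name: str, description: str = "") -> str:
        """Create the collection if it is new, set its description, return its uid."""
        key = (aid, name)
        if key not in self._uids:
            self._uids[key] = str(uuid.uuid4())
        self._descriptions[self._uids[key]] = description
        return self._uids[key]

    def resolve_names(self, aid: str, names: List[str]) -> List[str]:
        """Resolve collection names to uids within one archive.

        Raises KeyError if a name is not defined for that archive.
        """
        return [self._uids[(aid, name)] for name in names]
```

Under these assumptions, a crawl config would store the uids returned by `resolve_names` rather than the names themselves.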

readiness check: add /healthz endpoints for app and nginx
ingress: add /data/ route to local bucket

storage improvements:
- for default storages, store only the path, and prepend the default storage access endpoint
- collections api returns the paths using the storage access endpoint
- default storages are defined as secrets in k8s (multiple supported) and hard-coded in docker (only one for now)
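
A minimal sketch of the path handling described in the storage notes above, assuming a simple name-to-endpoint mapping. `DEFAULT_STORAGES`, the example endpoint URL, and `resolve_file_url` are hypothetical and not taken from storages.py.

```python
from typing import Dict, Optional

# Assumption: default storages map a storage name to its access endpoint.
# In k8s these would come from secrets; in docker there is a single entry.
DEFAULT_STORAGES: Dict[str, str] = {
    "default": "http://localhost:9000/data/",
}


def resolve_file_url(path_or_url: str, storage_name: Optional[str]) -> str:
    """Return the URL for a stored crawl file.

    For a default storage only the path is stored, so the storage's
    access endpoint is prepended. With no default storage name, the
    stored value is assumed to already be a full URL.
    """
    if storage_name is None:
        return path_or_url
    endpoint = DEFAULT_STORAGES[storage_name]
    # Join with exactly one slash between endpoint and path.
    return endpoint.rstrip("/") + "/" + path_or_url.lstrip("/")
```

Storing only the path keeps records valid if the access endpoint of a default storage changes later.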
2021-10-27 09:39:14 -07:00
archives.py       add collections api:                                         2021-10-27 09:39:14 -07:00
colls.py          add collections api:                                         2021-10-27 09:39:14 -07:00
crawlconfigs.py   add collections api:                                         2021-10-27 09:39:14 -07:00
crawls.py         add collections api:                                         2021-10-27 09:39:14 -07:00
db.py             use redis based queue instead of url for crawl done webhook  2021-10-10 12:18:28 -07:00
Dockerfile        initial commit!                                              2021-06-28 15:48:59 -07:00
dockerman.py      add collections api:                                         2021-10-27 09:39:14 -07:00
emailsender.py    new features:                                                2021-08-20 11:02:29 -07:00
k8sman.py         add collections api:                                         2021-10-27 09:39:14 -07:00
main.py           add collections api:                                         2021-10-27 09:39:14 -07:00
requirements.txt  Storage + Data Model Refactor (fixes #3):                    2021-10-09 18:58:40 -07:00
scheduler.py      add missing scheduler!                                       2021-08-25 16:18:53 -07:00
storages.py       Storage + Data Model Refactor (fixes #3):                    2021-10-09 18:58:40 -07:00
users.py          add ingress + nginx container for better routing             2021-10-09 23:47:29 -07:00