browsertrix/backend/btrixcloud
Ilya Kreymer 00fb8ac048
Concurrent Crawl Limit (#874)
concurrent crawl limits: (addresses #866)
- support limits on concurrent crawls that can be run within a single org
- change 'waiting' state to 'waiting_org_limit' for concurrent crawl limit and 'waiting_capacity' for capacity-based
limits

orgs:
- add 'maxConcurrentCrawl' to new 'quotas' object on orgs
- add /quotas endpoint for updating quotas object

operator:
- add all crawljobs as related, appear to be returned in creation order
- operator: if concurrent crawl limit set, ensures current job is in the first N set of crawljobs (as provided via 'related' list of crawljob objects) before it can proceed to 'starting', otherwise set to 'waiting_org_limit'
- api: add org /quotas endpoint for configuring quotas
- remove 'new' state, always start with 'starting'
- crawljob: add 'oid' to crawljob spec and label for easier querying
- more stringent state transitions: add allowed_from to set_state()
- ensure state transitions only happened from allowed states, while failed/canceled can happen from any state
- ensure finished and state synched from db if transition not allowed
- add crawl indices by oid and cid

frontend: 
- show different waiting states on frontend: 'Waiting (Crawl Limit) and 'Waiting (At Capacity)'
- add gear icon on orgs admin page
- and initial popup for setting org quotas, showing all properties from org 'quotas' object

tests:
- add concurrent crawl limit nightly tests
- fix state waiting -> waiting_capacity
- ci: add logging of operator output on test failure
2023-05-30 15:38:03 -07:00
..
migrations Optimize workflow statistics updates (#892) 2023-05-26 22:57:08 -07:00
templates Concurrent Crawl Limit (#874) 2023-05-30 15:38:03 -07:00
__init__.py
colls.py Sort collection by last updated (modified) (#897) 2023-05-30 14:09:10 -04:00
crawlconfigs.py Optimize workflow statistics updates (#892) 2023-05-26 22:57:08 -07:00
crawlmanager.py Concurrent Crawl Limit (#874) 2023-05-30 15:38:03 -07:00
crawls.py Concurrent Crawl Limit (#874) 2023-05-30 15:38:03 -07:00
db.py Rework collections to track collections in Crawl (#878) 2023-05-25 15:41:50 -04:00
emailsender.py
invites.py
k8sapi.py Concurrent Crawl Limit (#874) 2023-05-30 15:38:03 -07:00
main_op.py
main_scheduled_job.py Concurrent Crawl Limit (#874) 2023-05-30 15:38:03 -07:00
main.py Wait for DB init for healthcheck + settings (#885) 2023-05-25 09:58:30 -07:00
operator.py Concurrent Crawl Limit (#874) 2023-05-30 15:38:03 -07:00
orgs.py Concurrent Crawl Limit (#874) 2023-05-30 15:38:03 -07:00
pagination.py
profiles.py
storages.py
users.py
utils.py stopping fix: backend fixes for #836 + prep for additional status fields (#837) 2023-05-08 14:02:20 -07:00
version.py version: bump to 1.6.0-beta.0 2023-05-19 11:29:31 -07:00
zip.py