browsertrix/backend/test
Tessa Walsh e667fe2e97
Add max crawl size option to backend and frontend (#1045)
Backend:
- add 'maxCrawlSize' to models and crawljob spec
- add 'MAX_CRAWL_SIZE' to configmap
- add maxCrawlSize to new crawlconfig + update APIs
- operator: gracefully stop crawl if current size (from stats) exceeds maxCrawlSize
- tests: add max crawl size tests

Frontend:
- Add Max Crawl Size text box Limits tab
- Users enter max crawl size in GB, convert to bytes
- Add BYTES_PER_GB as constant for converting to bytes
- docs: Crawl Size Limit to user guide workflow setup section

Operator Refactor:
- use 'status.stopping' instead of 'crawl.stopping' to indicate crawl is being stopped, as changing later has no effect in operator
- add is_crawl_stopping() to return if crawl is being stopped, based on crawl.stopping or size or time limit being reached
- crawlerjob status: store byte size under 'size', human readable size under 'sizeHuman' for clarity
- size stat always exists so remove unneeded conditional (defaults to 0)
- store raw byte size in 'size', human readable size in 'sizeHuman'

Charts:
- subchart: update crawlerjob crd in btrix-crds to show status.stopping instead of spec.stopping
- subchart: show 'sizeHuman' property instead of 'size'
- bump subchart version to 0.1.1

---------
Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>
2023-08-26 22:00:37 -07:00
..
data Uploads API: BaseCrawl refactor + Initial support for /uploads endpoint (#937) 2023-07-07 09:13:26 -07:00
__init__.py
conftest.py Backend: standardize search values, filters, and sorting for archived items (#1039) 2023-08-04 09:56:52 -07:00
test_collections.py feat: implement 'collections' array with {name, id} for archived item details (#1098) 2023-08-25 00:26:46 -07:00
test_crawl_config_search_values.py Backend API consistency pass (#921) 2023-06-16 18:52:46 -07:00
test_crawl_config_tags.py Backend API consistency pass (#921) 2023-06-16 18:52:46 -07:00
test_crawlconfigs.py Add max crawl size option to backend and frontend (#1045) 2023-08-26 22:00:37 -07:00
test_filter_sort_results.py feat: implement 'collections' array with {name, id} for archived item details (#1098) 2023-08-25 00:26:46 -07:00
test_invites.py Paginate API list endpoints (#659) 2023-03-06 14:41:25 -05:00
test_login.py
test_org.py Paginate API list endpoints (#659) 2023-03-06 14:41:25 -05:00
test_permissions.py Paginate API list endpoints (#659) 2023-03-06 14:41:25 -05:00
test_run_crawl.py Backend: standardize search values, filters, and sorting for archived items (#1039) 2023-08-04 09:56:52 -07:00
test_settings.py tests: fixes for crawl cancel + crawl stopped (#864) 2023-05-22 20:17:29 -07:00
test_stop_cancel_crawl.py Backend: standardize search values, filters, and sorting for archived items (#1039) 2023-08-04 09:56:52 -07:00
test_uploads.py feat: implement 'collections' array with {name, id} for archived item details (#1098) 2023-08-25 00:26:46 -07:00
test_users.py
test_workflow_auto_add_to_collection.py feat: implement 'collections' array with {name, id} for archived item details (#1098) 2023-08-25 00:26:46 -07:00
utils.py Uploads API: BaseCrawl refactor + Initial support for /uploads endpoint (#937) 2023-07-07 09:13:26 -07:00