browsertrix

Author	SHA1	Message	Date
Emma Segal-Grossman	d64def00c2	Move execution time formatting into its own util (#1386 ) Refactors and rewrites the humanize time functions used on the dashboard, and swaps out these new functions in a couple of places. Examples of these functions' behaviours can be found in the tests for them. <img width="375" alt="Screenshot 2023-11-16 at 8 07 14 PM" src="https://github.com/webrecorder/browsertrix-cloud/assets/5727389/775b3a49-1061-4002-8c34-961777423542"> <img width="267" alt="Screenshot 2023-11-16 at 8 07 45 PM" src="https://github.com/webrecorder/browsertrix-cloud/assets/5727389/1d22aec0-4b88-4a9a-b1d7-f6612d287769"> <img width="224" alt="Screenshot 2023-11-16 at 8 21 13 PM" src="https://github.com/webrecorder/browsertrix-cloud/assets/5727389/7d895938-ea02-4ffa-9f82-8526725f36c5"> Also fixes inconsistent tooltip text alignment on the dashboard :)	2023-11-21 16:51:08 -05:00
Ilya Kreymer	dfba4b3940	Replace partial_complete -> stopped_by_user or stopped_quota_reached + operator edge cases (#1368 ) - Adds two new crawl finished state, stopped_by_user and stopped_quota_reached - Tracking other possible 'stop reasons' in operator, though not making them distinct states for now. - Updated frontend with 'Stopped by User' and 'Stopped: Time Quota Reached', shown with same icon as current partial_complete - Added migration of partial_complete to either stopped_by_user or complete (no historical quota data available) - Addresses edge case in scaling: if crawl never scaled (no redis entry, no pod), automatically scale down - Edge case in status: if crawl is somehow 'canceled' but not deleted, immediately delete crawl object and begin finalizing. --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2023-11-14 11:17:16 -08:00
Tessa Walsh	38f32f11ea	Enforce quota and hard cap for monthly execution minutes (#1284 ) Fixes #1261 Closes #1092 The quota for monthly execution minutes is treated as a hard cap. Once it is exceeded, an alert indicating that an org has exceeded its monthly execution minutes will display and the user will be unable to start new crawls. Any running crawls will be stopped once the quota is exceeded. An execution minutes meter bar is also added in the Org Dashboard and displayed if a quota is set. More detail in #1305 which was merged into this branch. ## Changes - Enable setting 'maxExecMinutesPerMonth' in orgs list quotas by superadmin - Enforce quota by stopping crawls in operator once quota is reached - Show alert banner once execution time quota is hit: - Once quota is hit, disable Run Crawl buttons in frontend, return 403 message with `exec_minutes_quota_reached` detail in backend from crawl config `/run` endpoint, and don't run new workflows on creation (similar to storage quota) - Display execution time for crawls in the crawl details overview, immediately below - Show execution minutes meter on dashboard (from #1305) --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics> Co-authored-by: Ilya Kreymer <ikreymer@gmail.com> Co-authored-by: sua yoo <sua@webrecorder.org>	2023-10-26 15:38:51 -07:00
sua yoo	4610d95cd7	Use org slug in place of UUIDs in app URLs (#1277 ) - Replaces org UUID in URL/browser location bar with org slug. - Refactor: Adds shared app state utility using https://sijakret.github.io/lit-shared-state/ to access org data from deep descendants. - Backwards compatible: org UUID URLs should auto-redirect to org slug URLs. - Show the org UUID in org settings general tab for use with APIs (Resolves #1258, Follows #1279)	2023-10-18 09:28:30 -07:00
sua yoo	630c00c5b0	Enforce strong passwords in UI (#1266 )	2023-10-12 19:36:59 -07:00
sua yoo	f2261bcb34	Fix frontend not redirecting on 401 (#1244 ) - Ensures need-login event bubbles until handled - Redirects on 401 from /refresh endpoint - Go to previous URL upon login, rather than always to home page - Shows accurate login notification (rather than less precise "couldn't retrieve org" or similar message)	2023-10-04 00:17:22 -07:00
sua yoo	730a160f75	New org home page dashboard (#1201 )	2023-09-21 19:20:08 -07:00
Ilya Kreymer	c9c39d47b7	Scheduled Crawl Refactor: Handle via Operator + Add Skipped Crawls on Quota Reached (#1162 ) * use metacontroller's decoratorcontroller to create CrawlJob from Job * scheduled job work: - use existing job name for scheduled crawljob - use suspended job, set startTime, completionTime and succeeded status on job when crawljob is done - simplify cronjob template: remove job_image, cron_namespace, using same namespace as crawls, placeholder job image for cronjobs * move storage quota check to crawljob handler: - add 'skipped_quota_reached' as new failed status type - check for storage quota before checking if crawljob can be started, fail if not (check before any pods/pvcs created) * frontend: - show all crawls in crawl workflow, no need to filter by status - add 'skipped_quota_reached' status, show as 'Skipped (Quota Reached)', render same as failed * migration: make release namespace available as DEFAULT_NAMESPACE, delete old cronjobs in DEFAULT_NAMESPACE and recreate in crawlers namespace with new template	2023-09-12 13:05:43 -07:00
Tessa Walsh	9377a6f456	Issue all non-upload storage-quota-update events from LiteElement (#1151 ) - More specific toast notification error messages to the action being attempted - Single dismissable global banner shown when org storage is reached - Removed check for storage quota reached in `runNow`, since buttons are disabled in UI, and errors handled if request fails. - Allow creating new workflow when storage quota reached - More responsive storage quota updates: add storageQuotaReached to archived item replay.json, updates w/o reload when crawl pushes quota over limit - Modify LiteElement to check for storageQuotaReached on GET requests --------- Co-authored-by: sua yoo <sua@suayoo.com>	2023-09-11 18:17:48 -07:00
Tessa Walsh	d2ededc895	Add and enforce org storage quota (#1106 ) * Implement in backend - Track bytesStored in org - Add migration to pre-calculate based on size of crawlfiles and profilefiles - Add methods to increase or decrease org storage when crawl or profile files are added or deleted - Include storageQuotaReached boolean in API responses that alter storage - Don't start new crawls and fail uploads if storage quota reached * Implement in frontend - Add to orgs-list quotas - Update org's storageQuotaReached based on backend endpoint responses - Disable buttons when storage quota is met - Show toast notification when attempting to run a crawl when org storage quota is met	2023-09-07 12:45:43 -04:00
sua yoo	54cf4f23e4	Paginate Workflows and refactor to use server-side queries (#1078 ) - Paginates Crawl Workflows when there are more than 10 workflows - Refactors workflow search and crawl search to use the same component - Adds sort by first seed, workflow creation date, and workflow modified date - Separates "last run" date from "modified" date - Update column layout into Name & Schedule (or Manual Ru'ri=), Latest Crawl (<finish time> in <duration>), total size, and last modified (modified by and modified time)	2023-08-22 16:29:17 -07:00
sua yoo	89983542f9	Update archived item URLs (#1064 ) - Changes to URLs in "Crawling", "All Archived Items", and "Collections": - Rename Artifacts -> Items - Unifies view crawl view as loaded from All Archived Items and from Workflows - Includes redirect for /artifacts/uploads -> /items/uploads to support archiveweb.page usage	2023-08-14 18:28:37 -07:00
Ilya Kreymer	06cf9c7cc3	add crawl ending states: 'generate-wacz', 'uploading-wacz', 'pending-wait' that occur after a crawl is finished or is being stopped (#1022 ) operator: ensure transitions from each of these states is supported, including to 'waiting_capacity' add extra check on stopping to avoid transitioning back to a running state after crawl is finished ui: add states to UI display, localization, add as active states fixes #263	2023-08-01 00:15:59 -07:00
sua yoo	75b011f951	Upload WACZ via UI (#992 ) - Users can now upload .WACZ archives from the "Archived Data" page. - Can specify name, description, tags and collection(s) to add upload to - Show progress of upload - Support canceling upload	2023-07-21 16:45:52 +02:00
sua yoo	66b3befef9	Frontend collections beta UI (#886 ) - Support for creating new collections and editing existing collections - Can select crawling workflows which adds entire workflow, and then deselect individual crawls - Can edit existing collections and add more crawls - Can view, create and delete collections via new Collections top-level nav entry	2023-06-06 17:52:01 -07:00
Ilya Kreymer	00fb8ac048	Concurrent Crawl Limit (#874 ) concurrent crawl limits: (addresses #866) - support limits on concurrent crawls that can be run within a single org - change 'waiting' state to 'waiting_org_limit' for concurrent crawl limit and 'waiting_capacity' for capacity-based limits orgs: - add 'maxConcurrentCrawl' to new 'quotas' object on orgs - add /quotas endpoint for updating quotas object operator: - add all crawljobs as related, appear to be returned in creation order - operator: if concurrent crawl limit set, ensures current job is in the first N set of crawljobs (as provided via 'related' list of crawljob objects) before it can proceed to 'starting', otherwise set to 'waiting_org_limit' - api: add org /quotas endpoint for configuring quotas - remove 'new' state, always start with 'starting' - crawljob: add 'oid' to crawljob spec and label for easier querying - more stringent state transitions: add allowed_from to set_state() - ensure state transitions only happened from allowed states, while failed/canceled can happen from any state - ensure finished and state synched from db if transition not allowed - add crawl indices by oid and cid frontend: - show different waiting states on frontend: 'Waiting (Crawl Limit) and 'Waiting (At Capacity)' - add gear icon on orgs admin page - and initial popup for setting org quotas, showing all properties from org 'quotas' object tests: - add concurrent crawl limit nightly tests - fix state waiting -> waiting_capacity - ci: add logging of operator output on test failure	2023-05-30 15:38:03 -07:00
Ilya Kreymer	2cae065c46	Add Waiting state on the backend and frontend (#839 ) * operator: add waiting state - add pods as related objects - inspect pod status, set crawl status to 'waiting' if no pods are running frontend: - frontend support for 'waiting' state - show waiting icon from mocks --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-05-08 17:05:01 -07:00
sua yoo	85c96de883	Show critical errors in Crawl detail logs (#811 )	2023-05-05 11:30:38 -07:00
sua yoo	7888c4fde3	Frontend crawl workflows rework (#775 )	2023-04-25 14:16:07 -07:00
sua yoo	c60dc5d086	Crawls list backend pagination (#735 )	2023-04-05 10:55:42 -07:00
sua yoo	bca67c74e2	chore: format frontend files with prettier	2023-03-27 11:05:19 -07:00
sua yoo	f2b7946960	Improve crawl list rendering (#645 ) * add load more button * adjust height * refactor to improve performance * remove unused observable component * contain status * update dropdown animation	2023-02-28 18:36:23 -08:00
sua yoo	23795ec5fd	Compute name from seed URLs in UI (#644 )	2023-02-28 15:51:43 -08:00
sua yoo	1dea7ecdf9	Update crawls list styles (#630 ) - Improves crawls list UI for UX and visual consistency - Enables editing crawl metadata from the crawls list - Upgraded Tailwind CSS	2023-02-24 17:36:34 -08:00
sua yoo	9532f48515	Fix app not rendering with bad auth storage states (#597 ) * render even if session store throws * handle after timeout * remove localstorage key * update tests	2023-02-14 18:35:21 -08:00
sua yoo	d128525e4e	Run unit tests in frontend PR check (#569 )	2023-02-06 17:47:15 -08:00
sua yoo	17e1628d2d	Allow superadmins to create org from UI (#563 )	2023-02-06 14:58:28 -08:00
sua yoo	4875d7727d	Fix invite accept in UI (#560 )	2023-02-06 12:18:24 -08:00
sua yoo	10c96ed2ae	Update tab access by user role (#549 ) * update types * update user org type * update tabs	2023-02-02 22:26:22 -08:00
sua yoo	8957eda966	Improve org routing & performance (#520 )	2023-01-26 15:02:27 -08:00
Tessa Walsh	0fa60ebc45	Rename archives/teams -> orgs in codebase + add db migration (#486 ) * Rename archives to orgs and aid to oid on backend * Rename archive to org and aid to oid in frontend * Remove translation artifact * Rename team -> organization * Add database migrations and run once on startup * This commit also applies the new by_one_worker decorator to other asyncio tasks to prevent heavy tasks from being run in each worker. * Run black, pylint, and husky via pre-commit * Set db version and use in migrations * Update and prepare database in single task * Migrate k8s configmaps	2023-01-18 14:51:04 -08:00
sua yoo	4a23dd12cb	Crawl config detail view & edit workflow UI updates (#415 )	2022-12-22 09:37:43 -08:00
sua yoo	28346e0a54	New create crawl config user workflow (#391 )	2022-12-12 13:50:33 -08:00
sua yoo	e7f1a00411	Fix authentication getting out of sync between tabs (#380 ) Fixes regression to #361 found after increasing the token timeout by preventing app load until the authentication service is initialized (and finishing check if another tab is logged in.)	2022-11-23 23:36:36 -08:00
sua yoo	321f78b861	Upgrade Shoelace 2.0.0-beta.61 -> 2.0.0-beta.83 (#358 )	2022-11-21 08:16:51 -08:00
sua yoo	4d4ce40443	Refactor & sync user session across tab/windows (#370 )	2022-11-15 19:49:18 -08:00
sua yoo	1ef9f7df6d	Fix auth not persisting on reload (#360 )	2022-11-15 13:17:29 -08:00
sua yoo	97eb17784d	Display exclusions & list of URLs in crawl queue (#337 ) - including pagination of queue results (30 results per page currently) - show numbering on paginated results - allow user navigation to each result page	2022-10-12 20:19:13 -07:00
sua yoo	9606d59c3d	Improve format of crawl template config error from server (#281 ) * better display of api errors, such as fields missing or invalid urls, addresses #280	2022-06-29 17:57:03 -07:00
sua yoo	d144591dbf	Display & edit crawl schedule in user local time (#271 ) closes #255	2022-06-27 13:01:20 -07:00
sua yoo	f90ef071de	enable opening crawl in new tab	2022-04-11 13:03:10 -07:00
sua yoo	c577e36b74	add debug for access token	2022-02-08 17:52:27 -08:00
sua yoo	02f46f108b	Crawl & crawl config UX improvements (#136 )	2022-02-01 14:28:07 -08:00
sua yoo	d7f58c964c	Fix in-app link UX (#132 ) closes #130, closes #113	2022-01-31 17:36:50 -08:00
sua yoo	2666b6f6aa	Duplicate crawl config from list (#99 )	2022-01-25 17:07:54 -08:00
sua yoo	3a461d86d4	Crawl config detail views (#97 )	2022-01-25 11:56:34 -08:00
sua yoo	cb5cf55c69	Add helper for dispatching notify events (#92 )	2022-01-19 21:01:47 -08:00
sua yoo	c3edb4bba4	Allow user to configure crawls with JSON (#86 )	2022-01-18 19:58:55 -08:00
sua yoo	ff77a92108	Schedule time of day when creating config (#85 )	2022-01-18 13:58:28 -08:00
sua yoo	b2088f5634	Add initial crawl template form (#80 )	2022-01-16 14:43:33 -08:00

1 2

64 Commits