browsertrix

Author	SHA1	Message	Date
Henry Wilkinson	21905205dc	Adds <btrix-details> to org dashboard table (#1311 ) - Updates text with "Elapsed Time" label in the table - Makes the table collapsible and collapsed by default.	2023-10-26 19:46:35 -07:00
Tessa Walsh	38f32f11ea	Enforce quota and hard cap for monthly execution minutes (#1284 ) Fixes #1261 Closes #1092 The quota for monthly execution minutes is treated as a hard cap. Once it is exceeded, an alert indicating that an org has exceeded its monthly execution minutes will display and the user will be unable to start new crawls. Any running crawls will be stopped once the quota is exceeded. An execution minutes meter bar is also added in the Org Dashboard and displayed if a quota is set. More detail in #1305 which was merged into this branch. ## Changes - Enable setting 'maxExecMinutesPerMonth' in orgs list quotas by superadmin - Enforce quota by stopping crawls in operator once quota is reached - Show alert banner once execution time quota is hit: - Once quota is hit, disable Run Crawl buttons in frontend, return 403 message with `exec_minutes_quota_reached` detail in backend from crawl config `/run` endpoint, and don't run new workflows on creation (similar to storage quota) - Display execution time for crawls in the crawl details overview, immediately below - Show execution minutes meter on dashboard (from #1305) --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics> Co-authored-by: Ilya Kreymer <ikreymer@gmail.com> Co-authored-by: sua yoo <sua@webrecorder.org>	2023-10-26 15:38:51 -07:00
Ilya Kreymer	4b9ca44adb	Frontend typo fixes (#1315 ) - fix missing org slug instead of org id change - fix login validation to check for 429 response code	2023-10-25 13:28:41 -04:00
sua yoo	2e5952a444	Display crawl time usage history table (#1304 ) Partially resolves #1223, fixes #1298 - Adds crawl usage table in dashboard under metrics - Shows skeleton loading indicator when metrics are loading (@Shrinks99 feel free to adjust how this looks) - Shows max number of concurrent crawls running if any are running ("`running` / `max` Crawls Running")	2023-10-23 16:25:16 -07:00
Henry Wilkinson	e274462ba0	Update tag spacing and styling for remove button (#1283 ) ### Context - Adds custom padding to each side based on if the tag is removable or not - Improves hover state for the remove button when the tag is focused - Adds padding to the remove button	2023-10-20 16:02:32 -07:00
Tessa Walsh	5c5ef68a8a	Prevent user from logging in after 5 consecutive failed login attempts until pw is reset (#1281 ) Fixes #1270 After 5 consecutive failed logins from the same user, we now prevent the user from logging in even with the correct password until they reset it via their email, or wait an hour. - After failure threshold is reached, all further login attempts are rejected - Attempts for invalid email addresses are also tracked - On 6th try, a reset password email is automatically sent, only once - Failed login counter resets after an hour of no further logins after last attempted login. --------- Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>	2023-10-20 14:10:56 -07:00
Henry Wilkinson	40da1f8541	Make URLs in the settings viewer clickable, removes deeplinked titles (#1247 ) ### Changes - URLs on the config review pages are now links that open in a new tab - Does not do anything with the `Extra URLs in Scope` field (which we currently render as a regex so left that alone) - Hides / removes the previously deep-linked but now broken config section rendering.	2023-10-18 16:20:22 -07:00
Ilya Kreymer	9a2787f9c4	User refactor + remove fastapi_users dependency + update fastapi (#1290 ) Fixes #1050 Major refactor of the user/auth system to remove fastapi_users dependency. Refactors users.py to be standalone and adds new auth.py module for handling auth. UserManager now works similarly to other ops classes. The auth should be fully backwards compatible with fastapi_users auth, including accepting previous JWT tokens w/o having to re-login. The User data model in mongodb is also unchanged. Additional fixes: - allows updating fastapi to latest - add webhook docs to openapi (follow up to #1041) API changes: - Removes the `GET, PATCH, DELETE /users/<id>` endpoints, which were not in use before, as users are scoped to orgs. For deletion, probably auto-delete when user is removed from last org (to be implemented). - `/users/me-with-orgs` is renamed to just `/users/me/` - New `PUT /users/me/change-password` endpoint with password required to update password, fixes #1269, supersedes #1272 Frontend changes: - Fixes from #1272 to support new change password endpoint. --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net> Co-authored-by: sua yoo <sua@suayoo.com>	2023-10-18 10:49:23 -07:00
sua yoo	4610d95cd7	Use org slug in place of UUIDs in app URLs (#1277 ) - Replaces org UUID in URL/browser location bar with org slug. - Refactor: Adds shared app state utility using https://sijakret.github.io/lit-shared-state/ to access org data from deep descendants. - Backwards compatible: org UUID URLs should auto-redirect to org slug URLs. - Show the org UUID in org settings general tab for use with APIs (Resolves #1258, Follows #1279)	2023-10-18 09:28:30 -07:00
Ilya Kreymer	36bd228115	version: update to 1.8.0-beta.0	2023-10-17 18:06:55 -07:00
sua yoo	6b897e281c	hotfix: display workflow list date as utc	2023-10-17 15:51:24 -07:00
Ilya Kreymer	b3f530f8e6	version: bump to 1.7.0	2023-10-16 18:39:20 -07:00
sua yoo	ab8e82cd28	Update org custom URL label (#1292 ) Fast follow to https://github.com/webrecorder/browsertrix-cloud/pull/1276 Updates label, info text, and preview text for org slug field to be more user-friendly: uses 'Custom URL Identifier' and 'Custom your organization's web address for accessing Browsertrix Cloud' --------- Co-authored-by: Ilya Kreymer <ikreymer@users.noreply.github.com> Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-10-16 15:08:43 -07:00
Henry Wilkinson	6d6fa03ade	Disable collection share button actions for viewer users (#1282 ) Closes #1273 - Viewers can see the share button and the dialogue's sharing info if the collection is sharable - Viewers can't see or change the share toggle - Viewers can't see the share button if the collection is not sharable	2023-10-16 10:50:33 -07:00
Ilya Kreymer	a295f5d05d	version: bump to 1.7.0-beta.3	2023-10-15 18:31:03 -07:00
sua yoo	8466caf1d9	Allow org admins to update slug (#1276 ) - Allows editing of org slugs (actual URL updates will be handled in https://github.com/webrecorder/browsertrix-cloud/issues/1258.) - Converts user input to slug using slugify - Adds help text to org name and slug - Renames tab from "information" to "general" settings	2023-10-13 17:00:43 -07:00
Henry Wilkinson	0bd8748e68	Minor Workflow Creator UX Changes (#1267 ) - Adds `position: sticky` to the workflow creator / editor controls to affix them to the bottom of the screen, they are now always visible! - Renames "Extra URLs in Scope" to "Extra URL Prefixes in Scope" - Updates documentation accordingly - Adjusts casing for checkboxes - Adds the multiplication sign to the crawler instances settings to better communicate that they are increases in scale and not arbitrary numbers.	2023-10-13 16:55:54 -07:00
sua yoo	22fbf92ed6	Show storage values for each item type when no quota (#1260 ) Hides chart and shows size values for each Storage line when org has no quota. No changes to orgs with quota. (Follow-up to #1188)	2023-10-13 14:31:33 -07:00
sua yoo	630c00c5b0	Enforce strong passwords in UI (#1266 )	2023-10-12 19:36:59 -07:00
sua yoo	f1dcc7e48a	Allow users to change display name and email (#1265 )	2023-10-11 13:42:41 -07:00
Ilya Kreymer	20560abb81	version: bump to 1.7.0-beta.2	2023-10-05 20:33:38 -07:00
sua yoo	f2261bcb34	Fix frontend not redirecting on 401 (#1244 ) - Ensures need-login event bubbles until handled - Redirects on 401 from /refresh endpoint - Go to previous URL upon login, rather than always to home page - Shows accurate login notification (rather than less precise "couldn't retrieve org" or similar message)	2023-10-04 00:17:22 -07:00
sua yoo	38efeccc25	Limit URL list entry to maximum URLs (#1242 ) - Limits URL list entry to 1,000 URLs - Limits additional URL list entry to 100 URLs - Shows first invalid URL in list in error message - Quick and dirty fix for long URLs wrapping: Show URLs in list on one line, with entire container scrolling --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-10-03 21:02:32 -07:00
Henry Wilkinson	99ccdf2de8	Browser Profile Warning & Dialog Style Updates (#1243 ) * Give protocol selection box smaller max-width * Add warning and docs link to browser profile creation - Updates dialog styling to btrix dialog - Updates button sizes - Updates button placement in dialog - Updates button labels for consistency with other buttons in app - Updates docs page with new button labels * Update browser profile edit metadata dialog. Matches updated dialog shown on profile creation * Open docs page in new tab	2023-10-03 18:59:19 -07:00
Tessa Walsh	bbdb7f8ce5	Require that all passwords are between 8 and 64 characters (#1239 ) - Require that all passwords are between 8 and 64 characters - Fixes account settings password reset form to only trigger logged-in event after successful password change. - Password validation can be extended within the UserManager's validate_password method to add or modify requirements. - Add tests for password validation	2023-10-03 18:57:46 -07:00
Tessa Walsh	b1ead614ee	Add --failOnFailedSeed checkbox to URL list workflows (#1236 ) - If set, and any of the seeds fails, the entire crawl is marked as a failure. - Add checkbox which sets --failOnFailedSeed on URL list workflows - Add 'Fail Crawl On Failed URL' to crawl workflow setup docs	2023-10-03 18:46:09 -07:00
Tessa Walsh	e9bac4c088	API delete endpoint improvements (#1232 ) - Applies user permissions check before deleting anything in all /delete endpoints - Shuts down running crawls before deleting anything in /all-crawls/delete as well as /crawls/delete - Splits delete_list.crawl_ids into crawls and upload lists at same time as checks in /all-crawls/delete - Updates frontend notification message to "Only org owners can delete other users' archived items." when a crawler user attempts to delete another user's archived items	2023-10-03 13:05:00 -07:00
sua yoo	df190e12b9	Show running workflow error logs (#1224 ) - Adds "Logs" tab to workflow detail - Shows error logs in expandable section in "Watch" tab - Show corresponding message (no logs yet or logs temporarily unavailable) when `/errors` returns 503 based on crawl state - text tweaks: use error logs instead of logs, change 'crawl start' -> 'crawl continue' in log message --------- Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>	2023-10-03 00:03:21 -07:00
sua yoo	3fea4cabe2	Show storage meter even with no quota (#1240 ) - Displays how much storage items and browser profiles take up even when quota is not specified	2023-10-02 20:01:39 -07:00
sua yoo	941a75ef12	Separate seeds into new endpoints (#1217 ) - Remove config.seeds from workflow and crawl detail endpoints - Add new paginated GET /crawls/{crawl_id}/seeds and /crawlconfigs/{cid}/seeds endpoints to retrieve seeds for a crawl or workflow - Include firstSeed in GET /crawlconfigs/{cid} endpoint (was missing before) - Modify frontend to fetch seeds from new /seeds endpoints with loading indicator --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2023-10-02 10:56:12 -07:00
Anish Lakhwara	1bf531e1ec	Fix: Make Collections Public on Creation (#1213 ) - Add isPublic to Add Collection endpoint, send isPublic from frontend - Fixes #1212	2023-09-29 12:08:10 -07:00
sua yoo	90e3a300cc	"Add new" dialog for all resources (#1202 ) - Replaces individual "New" buttons in home page with dropdown button in header (includes Crawl Workflow, Upload Collection, Browser Profile) - Refactors required step of new workflow and new collection into dialog	2023-09-29 09:11:24 -07:00
Henry Wilkinson	e93f195d59	fix: Right Align Copy Buttons & `<btrix-desc-list>` vertical `width: 100%` (#1177 ) * Reorders actions, adds tooltip - All copy buttons on the collection share dialog are now on the right side - Adds a tooltip to tell the user the button opens the link in a new tab * Make vertical `desc-list` items fill 100% width of their parent container - Allows for better placement of items within the container - Adds horizontal padding to info bars * Right align copy button in item details page	2023-09-28 12:08:27 -07:00
Vinzenz Sinapius	9b125bc2c6	Passthrough X-Forwarded-Proto header in frontend nginx (#1226 ) If X-Forwarded-Proto header is already set, pass that through instead of setting to current scheme.	2023-09-28 10:58:57 -07:00
sua yoo	e5cc70754e	Show org storage quotas in dashboard (#1210 ) - Displays storage quota in subdivided meter - Updates icon colors - Adds new <btrix-meter> component --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-09-27 10:38:59 -07:00
Tessa Walsh	304ea6d52f	Always display Download Logs button in Error Logs tab (#1209 )	2023-09-22 12:08:03 -04:00
sua yoo	730a160f75	New org home page dashboard (#1201 )	2023-09-21 19:20:08 -07:00
sua yoo	d05a27e8a4	Separate "run now" switch from scheduling options (#1175 )	2023-09-21 19:18:57 -07:00
sua yoo	f4d9c0e3d5	build: fix webpack dev server recompiling without changes See https://stackoverflow.com/questions/70990356/ionic-serve-keeps-recompiling-without-changes#comment133341886_70990356	2023-09-19 12:15:58 -07:00
Tessa Walsh	9224f52f51	Remove config from list endpoints to speed up responses (#1193 ) * Remove config from list endpoints - Remove config field from workflow and crawl list endpoints - Add seedCount to CrawlConfigOut on backend and Workflow on frontend - Refactor CrawlConfig and CrawlConfigOut to extend CrawlConfigCore + CrawlConfigAdditional - Refactor workflow list in frontend to use firstSeed and seedCount - Frontend uses ListWorkflow type which is Omit<Workflow, "config">	2023-09-19 11:05:48 -05:00
sua yoo	58ff64dfbb	build: disable webpack polling for hot reload potential fix for dev server recompiling--currently not using hot reload anyway	2023-09-18 15:14:34 -07:00
Ilya Kreymer	65b7c10ba1	bump version to 1.7.0-beta.1	2023-09-18 14:33:03 -07:00
sua yoo	6ddba105f4	Enable saving individual collection form sections (#1166 ) - Moves metadata tab to first position - Adds save button to each section, stays in edit view on saving - Validates name exists before moving to next section or saving - Changes save button text to "Create Collection without Items" if crawl/uploads aren't selected in new collection - Fix server error not showing in UI	2023-09-14 15:21:01 -07:00
sua yoo	6234346d84	Fix crawl scope help text (#1169 ) * update text * remove trailing slash removal * make scope help text responsive as user types --------- Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>	2023-09-13 11:46:58 -07:00
Ilya Kreymer	9159c7c914	ensure max crawl size and max crawl timeout values are set to 0 when unused, instead of null (#1167 ) - convert None->0 when creating CrawlJob - ensure frontend sends 0 not null - make input model require 'int = 0' instead of 'Optional[int] = 0'	2023-09-13 09:51:26 -07:00
Ilya Kreymer	c9c39d47b7	Scheduled Crawl Refactor: Handle via Operator + Add Skipped Crawls on Quota Reached (#1162 ) * use metacontroller's decoratorcontroller to create CrawlJob from Job * scheduled job work: - use existing job name for scheduled crawljob - use suspended job, set startTime, completionTime and succeeded status on job when crawljob is done - simplify cronjob template: remove job_image, cron_namespace, using same namespace as crawls, placeholder job image for cronjobs * move storage quota check to crawljob handler: - add 'skipped_quota_reached' as new failed status type - check for storage quota before checking if crawljob can be started, fail if not (check before any pods/pvcs created) * frontend: - show all crawls in crawl workflow, no need to filter by status - add 'skipped_quota_reached' status, show as 'Skipped (Quota Reached)', render same as failed * migration: make release namespace available as DEFAULT_NAMESPACE, delete old cronjobs in DEFAULT_NAMESPACE and recreate in crawlers namespace with new template	2023-09-12 13:05:43 -07:00
Tessa Walsh	9377a6f456	Issue all non-upload storage-quota-update events from LiteElement (#1151 ) - Toast notification error messages are more specific to the action being attempted - Single dismissible global banner shown when org storage quota is reached - Removed check for storage quota reached in `runNow`, since buttons are disabled in UI, and errors handled if request fails. - Allow creating new workflow when storage quota reached - More responsive storage quota updates: add storageQuotaReached to archived item replay.json, updates w/o reload when crawl pushes quota over limit - Modify LiteElement to check for storageQuotaReached on GET requests --------- Co-authored-by: sua yoo <sua@suayoo.com>	2023-09-11 18:17:48 -07:00
Ilya Kreymer	ad9bca2e92	Operator refactor to control pods + pvcs directly instead of statefulsets (#1149 ) - Ability for pod to be Completed, unlike in Statefulset - e.g. if 3 pods are running and first one finishes, all 3 must be running until all 3 are done. With this setup, the first finished pod can remain in Completed state. - Fixed shutdown order - crawler pods now correctly shut down first before redis pods, by switching to background deletion. - Pod priority decreases with scale: 1st instance of a new crawl can preempt 3rd or 2nd instance of another crawl - Create priority classes up to 'max_crawl_scale', configured in values.yaml - Improved scale change reconciliation: if increasing scale, immediately scale up. If decreasing scale, gracefully stop scaled-down instances via redis 'stopone' key, wait until they exit with Completed state before adjusting status.scale / removing scaled down pods. Ensures unaccepted interrupts don't cause scaled down data to be deleted. - Redis pod remains inactive until crawler is first active, or after no crawl pods are active for 60 seconds - Configurable Redis storage with 'redis_storage' value, set to 3Gi by default - CrawlJob deletion starts as soon as post-finish crawl operations are run - Post-crawl operations get their own redis instance, since one during response is being cleaned up in finalizer - Finalizer ignores request with incorrect state (returns 400 if reported as not finished while crawl is finished) - Current resource usage added to status - Profile browser: also manage single pod directly without statefulset for consistency. - Restart pods via restartTime value: if spec.restartTime != status.restartTime, clear out pods and update status.restartTime (using OnDelete policy to avoid recreate loops in edge cases). - Update to latest metacontroller (v4.11.0) - Add --restartOnError flag for crawler (for browsertrix-crawler 0.11.0) - Failed crawl logging: add 'fail_crawl()' to be used for failing a crawl, which prints logs for default container (if enabled) as well as pod status - tests: check other finished states to avoid stuck in infinite loop if crawl fails - tests: disable disk utilization check, which adds unpredictability to crawl testing! fixes #1147 --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2023-09-11 10:38:04 -07:00
Tessa Walsh	d2ededc895	Add and enforce org storage quota (#1106 ) * Implement in backend - Track bytesStored in org - Add migration to pre-calculate based on size of crawlfiles and profilefiles - Add methods to increase or decrease org storage when crawl or profile files are added or deleted - Include storageQuotaReached boolean in API responses that alter storage - Don't start new crawls and fail uploads if storage quota reached * Implement in frontend - Add to orgs-list quotas - Update org's storageQuotaReached based on backend endpoint responses - Disable buttons when storage quota is met - Show toast notification when attempting to run a crawl when org storage quota is met	2023-09-07 12:45:43 -04:00
Henry Wilkinson	8850e35f7a	Changes "Crawls" → "Items" (#1145 )	2023-09-05 23:58:12 -04:00

478 Commits