browsertrix

Author	SHA1	Message	Date
sua yoo	98d82184e6	Fix superadmin running crawls views (#846 ) - Updates superadmin "Running Crawls" to show active crawls (starting, waiting, running, stopping) and sort by start by default - Navigates to crawl workflow watch view on clicking crawl item - Adds "Copy Crawl ID" to crawl actions for easy paste into "Jump to crawl" - Navigates to crawl workflow watch when jumping to crawl	2023-05-11 08:15:52 +02:00
Ilya Kreymer	82b21b6813	frontend crawl stopping improvements (#836 ) (#838 ) * frontend crawl stopping improvements (#836) - support new backend 'stopping' property - for now, keep 'stopping' indicator state when crawl is running but stopping set to true	2023-05-08 23:52:49 -07:00
Ilya Kreymer	2cae065c46	Add Waiting state on the backend and frontend (#839 ) * operator: add waiting state - add pods as related objects - inspect pod status, set crawl status to 'waiting' if no pods are running frontend: - frontend support for 'waiting' state - show waiting icon from mocks --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-05-08 17:05:01 -07:00
sua yoo	9fcbc3f87e	Allow users to set max depth/hop out within scope (#816 ) - Adds an input to the Workflow creation and edit form for specifying crawl depth. This input is conditionally shown for seeded crawls when the scope is set to "Pages on this domain", "Pages on this domain & subdomains" or "Custom page prefix". The "any" scope is also supported for backwards compatibility but is not shown by default or in new configs. - API implementation: The depth value is set in the primary seed config, i.e. the first seed in seeds: [], not in the outer .config.depth property.	2023-05-05 14:26:48 -07:00
sua yoo	0d23b45dac	Crawl workflow detail page improvements (#823 ) Resolves #817 - Adds relevant action buttons to each Workflow detail tab header - Adds "Delete" action menu item to crawls in Crawls tab - Prevent automatically switching to "Watch" tab after running crawl from detail page - Removes "Stop" confirmation prompt and only shows "Cancel" confirmation prompt if there are one or more pages crawled - Replaces "Cancel" confirmation prompt with web component dialog (partially addresses Switch to in-page dialogue boxes #619) - Fixes hash routing to fix going back with browser back button	2023-05-05 13:50:45 -07:00
sua yoo	85c96de883	Show critical errors in Crawl detail logs (#811 )	2023-05-05 11:30:38 -07:00
sua yoo	60581411eb	Refactor screencast IDs (#800 ) Fixes #713, mapping watch windows to exact column/row by id	2023-05-03 10:33:04 -07:00
sua yoo	9a1c2ba871	Fix workflow limit empty values being set to `0` (#795 ) * default to null * pass undefined for removing values * handle 0 default	2023-05-03 09:25:22 -07:00
Henry Wilkinson	a13964c4c4	Merge pull request #809 from webrecorder/frontend-icon-button-aria-label-fixes	2023-05-01 15:38:49 -04:00
Henry Wilkinson	ee92eb6646	Merge pull request #810 from webrecorder/frontend-minor-visual-updates	2023-05-01 15:38:37 -04:00
Henry Wilkinson	624e7083cf	Merge pull request #806 from webrecorder/frontend-update-copy-button	2023-05-01 15:38:22 -04:00
Henry Wilkinson	bddbe35315	Runs yarn format	2023-05-01 15:33:17 -04:00
Henry Wilkinson	6e921cc065	Add margin to crawls list Mirrors workflow list	2023-05-01 03:26:57 -04:00
Henry Wilkinson	e04a6a7825	Improves icon button aria labels - Adds some labels to missing icon buttons - Fixes metadata `aria-label` usage → `label` so it actually gets added to the rendered `button` - Changes the "More" label to a (hopefully) more descriptive "Actions" label for dropdown actions menus	2023-05-01 02:57:32 -04:00
Henry Wilkinson	1d7518af07	Ensure that button returns to its default state uses the .blur() method to set the icon button back to its unfocused state after the set time	2023-04-29 17:17:49 -04:00
Henry Wilkinson	228e2187e3	Copy button text → icon - Converts to icon button - Adds accessibility label field	2023-04-28 14:53:11 -04:00
sua yoo	7888c4fde3	Frontend crawl workflows rework (#775 )	2023-04-25 14:16:07 -07:00
sua yoo	80bc4a3eb9	Fix additional URLs (#752 )	2023-04-05 20:11:09 -07:00
sua yoo	91c2c1ad62	Allow users to set additional page time limits (#744 )	2023-04-05 20:06:46 -07:00
sua yoo	c60dc5d086	Crawls list backend pagination (#735 )	2023-04-05 10:55:42 -07:00
sua yoo	370b8cbd4d	Set max pages to API default (#739 )	2023-04-04 08:47:37 -07:00
Ilya Kreymer	2b0d5ff8b3	misc frontend build fixes: playwright version + chunking (#740 ) * misc frontend build fixes: - fix playwright version to be consistent to fix playwright test - chunking: set max number of chunks generated * lock playwright version * remove intl polyfill --------- Co-authored-by: sua yoo <sua@suayoo.com>	2023-04-03 21:27:44 -07:00
Tessa Walsh	4724754efc	Filter and sort crawl and workflow list API endpoints in backend (#724 ) * Re-implement pagination and paginate crawlconfig revs First step toward simplifying pagination to set us up for sorting and filtering of list endpoints. This commit removes fastapi-pagination as a dependency. * Migrate all HttpUrl seeds to Seeds This commit also updates the frontend to always use Seeds and to fix display issues resulting from the change. * Filter and sort crawls and workflows Crawls: - Filter by createdBy (via userid param) - Filter by state (comma-separated string for multiple values) - Filter by first_seed, name, description - Sort by started, finished, fileSize, firstSeed - Sort descending by default to match frontend Workflows: - Filter by createdBy (formerly userid) and modifiedBy - Filter by first_seed, name, description - Sort by created, modified, firstSeed, lastCrawlTime * Add crawlconfigs search-values API endpoint and test	2023-03-28 17:55:40 -04:00
sua yoo	bca67c74e2	chore: format frontend files with prettier	2023-03-27 11:05:19 -07:00
sua yoo	5f5bb5ea6e	Allow users to set workflow description (#708 )	2023-03-21 13:40:23 -07:00
sua yoo	b9a24fa5e2	Combine watch crawl with crawl queue (#710 ) - crawl queue and watch page are now part of single view - exclusions can be edited via 'Edit Exclusions' popup	2023-03-17 21:04:08 -07:00
Ilya Kreymer	de9212eec7	exclusions editor fix: (#692 ) - backend: fix updating model after exclusions change - frontend: don't check for new_cid, just success - fixes #691	2023-03-10 22:36:10 -08:00
sua yoo	8ca4276c57	Migrate crawl config frontend -> workflow (#686 )	2023-03-10 11:39:42 -08:00
sua yoo	fecdc6229d	Improve crawl queue pagination UX (#680 ) * switches to infinite scroll for crawl queue	2023-03-09 12:18:26 -08:00
sua yoo	666c28f420	Limit organization name length (#671 )	2023-03-08 09:21:48 -08:00
Ilya Kreymer	544346d1d4	backend: make crawlconfigs mutable! (#656 ) (#662 ) * backend: make crawlconfigs mutable! (#656) - crawlconfig PATCH /{id} can now receive a new JSON config to replace the old one (in addition to scale, schedule, tags) - exclusions: add / remove APIs mutate the current crawlconfig, do not result in a new crawlconfig created - exclusions: ensure crawl job 'config' is updated when exclusions are added/removed, unify add/remove exclusions on crawl - k8s: crawlconfig json is updated along with scale - k8s: stateful set is restarted by updating annotation, instead of changing template - crawl object: now has 'config', as well as 'profileid', 'schedule', 'crawlTimeout', 'jobType' properties to ensure anything that is changeable is stored on the crawl - crawlconfigcore: store share properties between crawl and crawlconfig in new crawlconfigcore (includes 'schedule', 'jobType', 'config', 'profileid', 'schedule', 'crawlTimeout', 'tags', 'oid') - crawlconfig object: remove 'oldId', 'newId', disallow deactivating/deleting while crawl is running - rename 'userid' -> 'createdBy' - remove unused 'completions' field - add missing return to fix /run response - crawlout: ensure 'profileName' is resolved on CrawlOut from profileid - crawlout: return 'name' instead of 'configName' for consistent response - update: 'modified', 'modifiedBy' fields to set modification date and user modifying config - update: ensure PROFILE_FILENAME is updated in configmap is profileid provided, clear if profileid=="" - update: return 'settings_changed' and 'metadata_changed' if either crawl settings or metadata changed - tests: update tests to check settings_changed/metadata_changed return values add revision tracking to crawlconfig: - store each revision separate mongo db collection - revisions accessible via /crawlconfigs/{cid}/revs - store 'rev' int in crawlconfig and in crawljob - only add revision history if crawl config changed migration: - update to db v3 - copy fields from crawlconfig -> crawl - rename userid -> createdBy - copy userid -> modifiedBy, created -> modified - skip invalid crawls (missing config), make createdBy optional (just in case) frontend: Update crawl config keys with new API (#681), update frontend to use new PATCH endpoint, load config from crawl object in details view --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net> Co-authored-by: sua yoo <sua@webrecorder.org> Co-authored-by: sua yoo <sua@suayoo.com>	2023-03-07 20:36:50 -08:00
sua yoo	ebce2ec384	fix: show crawl start date in local time	2023-03-07 16:05:00 -08:00
sua yoo	91e415fac2	Hide file size when crawl is running (#648 )	2023-03-07 16:02:19 -08:00
Tessa Walsh	e98c7172a9	Paginate API list endpoints (#659 ) * Paginate API list endpoints fastapi-pagination is pinned to 0.9.3, the latest release that plays nicely with pinned versions of fastapi and fastapi-users. * Increase page size via overriden Params and Page classes * update api resource list keys --------- Co-authored-by: sua yoo <sua@suayoo.com>	2023-03-06 14:41:25 -05:00
sua yoo	f2b7946960	Improve crawl list rendering (#645 ) * add load more button * adjust height * refactor to improve performance * remove unused observable component * contain status * update dropdown animation	2023-02-28 18:36:23 -08:00
sua yoo	a1f939ad29	Improve tag input keyboard navigation (#650 )	2023-02-28 15:52:31 -08:00
sua yoo	d0182a3e13	Hide file size when crawl is running (#648 )	2023-02-28 15:52:06 -08:00
sua yoo	23795ec5fd	Compute name from seed URLs in UI (#644 )	2023-02-28 15:51:43 -08:00
sua yoo	de8a5f1c00	fix: tag input target in chrome	2023-02-25 19:54:58 -08:00
Ilya Kreymer	0d2a2de66e	rename Information -> Metadata, rebuild localization strings list (#642 )	2023-02-24 18:01:33 -08:00
sua yoo	1dea7ecdf9	Update crawls list styles (#630 ) - Improves crawls list UI for UX and visual consistency - Enables editing crawl metadata from the crawls list - Upgraded Tailwind CSS	2023-02-24 17:36:34 -08:00
sua yoo	c309b809da	Edit crawl notes from crawl detail view (#595 )	2023-02-21 12:26:38 -06:00
sua yoo	a180b92f4a	Improve superadmin invite UI (#581 )	2023-02-12 10:12:53 -08:00
sua yoo	7463becdff	Manage org member roles and invites (#558 ) - View and delete pending invites - Update user roles for members - Remove members	2023-02-08 18:32:40 -08:00
sua yoo	17e1628d2d	Allow superadmins to create org from UI (#563 )	2023-02-06 14:58:28 -08:00
Henry Wilkinson	a2a8d283ff	Fixes url word breaking Would probably ideally be break-word for all the non URL related things in the form but I don't think it will have any effect on anything that's not URLs in practice?	2023-02-03 03:10:28 -05:00
sua yoo	16ca8ecefd	Support additional seed URLs and custom scope type (#543 )	2023-02-02 21:39:29 -08:00
sua yoo	c1a612d73f	Update crawl tags from detail view (#539 )	2023-02-02 20:42:18 -08:00
sua yoo	f94be79364	Fix browser profile origins sidebar overlap (#530 )	2023-01-31 13:44:17 -08:00
sua yoo	05ce32d898	Update org settings & org invite UI (#528 )	2023-01-29 11:38:22 -08:00
Sara Tavares	bc69d6f0d4	chore(typos): fix typos (#524 ) Co-authored-by: stavares843 <stavares843@users.noreply.github.com>	2023-01-26 16:34:47 -08:00
sua yoo	8957eda966	Improve org routing & performance (#520 )	2023-01-26 15:02:27 -08:00
sua yoo	c0fdf738e4	Autocomplete tag options (#505 )	2023-01-18 20:15:42 -08:00
sua yoo	be10ea5239	Improve Page Time Limit UX (#503 )	2023-01-18 20:13:27 -08:00
Tessa Walsh	0fa60ebc45	Rename archives/teams -> orgs in codebase + add db migration (#486 ) * Rename archives to orgs and aid to oid on backend * Rename archive to org and aid to oid in frontend * Remove translation artifact * Rename team -> organization * Add database migrations and run once on startup * This commit also applies the new by_one_worker decorator to other asyncio tasks to prevent heavy tasks from being run in each worker. * Run black, pylint, and husky via pre-commit * Set db version and use in migrations * Update and prepare database in single task * Migrate k8s configmaps	2023-01-18 14:51:04 -08:00
sua yoo	ebb634a20d	Crawl config form panel UX enhancement & fix (#489 )	2023-01-17 16:20:35 -08:00
sua yoo	d32428134a	Crawl config frontend fixes (#482 ) * fix current time limit (#480) * fix browser profile name not shown (#474) * fix finish setup button label (#473) * fix paginated exclusion, border and page size (#475)	2023-01-16 19:19:42 -08:00
sua yoo	6a30660728	Apply suggestions from code review	2023-01-16 10:08:45 -08:00
Henry Wilkinson	7e93ec57e0	Updates name (again) to actually reflect what this is Because they _aren't_ additional seed URLs!	2023-01-13 17:13:42 -05:00
Henry Wilkinson	77a46aa2d9	Allowed URL Prefixes → Extra Start URLs Changes name, hopefully people have an easier time understanding this now! Will close #476	2023-01-13 16:50:00 -05:00
sua yoo	e3d34ff08f	Crawl config tag editor UI (#422 ) Allow users to set tags on a crawl config. Resolves #362	2023-01-12 13:59:42 -08:00
Ilya Kreymer	30bda8c75d	VNC-Based Profile Browser (#433 ) * profile browser vnc support + fixes: - switch profile browser rendering to use VNC - frontend: add @novnc/novnc as dependency, create separate bundle novnc.js to load into vnc browser (to avoid loading from each container) - frontend: update proxy paths to proxy websocket, index page to crawler - frontend: allow browser profiles in all browsers, remove browser compatibility check - frontend: update webpack dev config, apply prettier - frontend: node version fix - backend: get vncpassword, build new URL for proxying to crawler iframe - backend: fix profile / crawl job pull policy from 'Always' -> 'Never', should use existing image for job - backend: fix kill signal to use bash -c to work with latest backend image - backend/chart: add 'profile_browser_timeout_seconds' to chart values to control how long profile browser to remain when idle (default to 60) - backend: remove utils.py, now using secret.token_hex() for random suffix Co-authored-by: sua yoo <sua@suayoo.com>	2023-01-10 14:42:42 -08:00
sua yoo	303df2869c	Sticky the crawl config progress indicator position (#445 ) * fix tabs on scroll * adjust for smaller creen size * scroll to top of section on change * only scroll if needed	2023-01-09 19:58:33 -08:00
sua yoo	5daf550cb8	Frontend archives -> teams migration (#429 )	2023-01-03 15:37:32 -08:00
sua yoo	4a23dd12cb	Crawl config detail view & edit workflow UI updates (#415 )	2022-12-22 09:37:43 -08:00
sua yoo	28346e0a54	New create crawl config user workflow (#391 )	2022-12-12 13:50:33 -08:00
sua yoo	5d18ffa938	Fix language configuration UI (#388 )	2022-12-01 10:02:13 -08:00
sua yoo	da8260a028	Make exclusion table cells editable (#379 )	2022-11-23 09:43:52 -08:00
sua yoo	003b3c7a78	Allow users to set crawl config language (#377 )	2022-11-22 15:15:35 -08:00
sua yoo	321f78b861	Upgrade Shoelace 2.0.0-beta.61 -> 2.0.0-beta.83 (#358 )	2022-11-21 08:16:51 -08:00
sua yoo	13d5ab7d2b	Fix exclusion dropdown & removal in crawl config edit (#372 ) * fix dialog closing on regex * fix unable to delete regex with same text	2022-11-18 14:47:40 -08:00
sua yoo	8c4348b9f8	Show exclusion editor when creating & editing crawl templates (#353 )	2022-11-14 19:34:15 -08:00
sua yoo	d41b582ef6	Remove exclusion from running crawl (#352 )	2022-11-14 10:58:33 -08:00
sua yoo	95ec1599ef	Add exclusion to running crawl (#347 )	2022-11-08 18:09:11 -06:00
sua yoo	baacbbdc52	Highlight regular expression syntax in Exclusions Table (#341 )	2022-11-01 15:31:01 -07:00
sua yoo	97eb17784d	Display exclusions & list of URLs in crawl queue (#337 ) - including pagination of queue results (30 results per page currently) - show numbering on paginated results - allow user navigation to each result page	2022-10-12 20:19:13 -07:00
sua yoo	8708c24a74	Improve crawl elapsed time UX (#323 ) Smoother elapsed crawl timer: - Crawls list: show seconds increment up to 2 minutes, then show minutes only - Crawls detail: show seconds increment up to one day	2022-10-05 21:12:31 -04:00
sua yoo	0bbb7905bd	Add crawl queue editor UI components (#331 ) WIP #304	2022-10-04 13:13:40 -07:00
sua yoo	63ada3e5b3	Update base fonts and text sizes (#327 )	2022-09-27 14:32:57 -07:00
sua yoo	319a8a3c07	make clearer that profile selection is optional and that a default profile is used by default (#290 ) - Rename 'Select Profile' -> 'Default Profile' - Rename 'No Profiles' -> 'No Additional Profiles'	2022-08-10 15:54:39 -07:00
sua yoo	ee6161ad43	Frontend browser profile editor enhancements (#288 ) - add button to duplicate profile from main view - add save / cancel button when editing - change location of 'full screen' button	2022-08-10 15:51:34 -07:00
sua yoo	301b05ff4e	Refactor screencast websocket connection and retry (#276 ) * replace ip with index and retry connection, fixes #252	2022-06-29 17:55:32 -07:00
sua yoo	d144591dbf	Display & edit crawl schedule in user local time (#271 ) closes #255	2022-06-27 13:01:20 -07:00
sua yoo	c2aa4e6319	Fix AM/PM toggle (#272 )	2022-06-23 16:35:47 -07:00
sua yoo	a8757e2e50	Screencast UX enhancements (#251 ) * animate starting state * consistent fixed-size slots for each browser (url + screencast) * add tooltip for expected number of browsers (workers x scale)	2022-06-15 18:50:14 -07:00
sua yoo	502d687620	Enable duplicating and editing browser profile (#237 ) * ensure editing other config options does not lose profile * support adding/editing/removing profile of existing config * when duplicating config, ensure profile setting is also copied in the duplicate	2022-06-04 08:26:19 -07:00
sua yoo	2355de3067	docs: remove extra comment	2022-05-31 14:13:17 -07:00
sua yoo	bda817dadd	View and edit browser profile (#218 )	2022-04-23 20:12:16 -07:00
sua yoo	29b586b03f	Edit crawl config as YAML (#207 )	2022-04-06 17:40:25 -07:00
Ilya Kreymer	9a6483630e	Support for Admin interface for viewing web archives (#198 ) * backend api - superadmin has admin access to all archives - new superadmin endpoints: /archives/all/crawls and /archives/all/crawls/<crawl_id>.json for list all running crawls and loading crawl data by id - frontend superadmin view (fixes #201) * show all archives on superadmin home page * show jump to crawl for super admin (#200) * navbar links for: all archives, all running crawls and jump to crawl Co-authored-by: sua yoo <sua@suayoo.com>	2022-04-06 12:42:04 -07:00
sua yoo	ec3a77b71e	Mobile layout fixes (#206 ) closes #202	2022-03-30 15:54:25 -07:00
sua yoo	9e2274f612	remove temp file	2022-03-30 13:51:02 -07:00
sua yoo	8863776c54	Define websocket host in common webpack config (#195 ) * move websocket host var to common config, better fix for #193	2022-03-15 18:34:49 -07:00
Ilya Kreymer	912004751d	quickfix: partial mitigation for #193 , use current host for websock address	2022-03-14 15:29:35 -07:00
sua yoo	6fabea3e7a	Frontend build fixes (#191 ) * copy specific files * replace api host env var * remove unused dotenv * Update frontend/webpack.dev.js Co-authored-by: Ilya Kreymer <ikreymer@users.noreply.github.com>	2022-03-10 23:26:21 -08:00
sua yoo	c18418ff09	Show invite message to super admin & layout fixes (#181 )	2022-03-02 18:09:26 -08:00
sua yoo	fe31f551b2	Add "crawler" role to members (#174 ) closes #139	2022-03-02 18:09:10 -08:00
sua yoo	373c489b00	Watch crawl from crawl detail page (#156 ) closes #164 closes #134 Co-authored-by: Ilya Kreymer <ikreymer@users.noreply.github.com>	2022-03-02 18:08:08 -08:00
sua yoo	3fe3691e74	Update crawl run duration at intervals (#155 ) fixes #138	2022-02-23 16:14:01 -08:00
sua yoo	b5874c3f8c	call super disconnected callback after custom callback	2022-02-22 15:59:55 -08:00

1 2 3 4

170 Commits