browsertrix

Author	SHA1	Message	Date
Ilya Kreymer	41655ef829	version: bump to 1.10.0-beta.2	2024-04-23 23:19:16 +02:00
Ilya Kreymer	b94070160b	allow configuring designated registration org to which new users can register (#1735 ) if 'registration_enabled' is set, check 'registration_org_id' for org id of an existing org that new users should be added to when they register. if omitted, default to the default org Fixes #1729	2024-04-23 17:11:37 -04:00
sua yoo	7ac06608b2	Fix archived item status icon size + minor review UI updates (#1734 ) Related to https://github.com/webrecorder/browsertrix/issues/1477, minor UI tweaks as fast follow: - Makes archived item status icons all the same size - Increases hover hit target of archived item cells - Reduces top padding of QA review page - Adds "Back" button to QA review page to match all other pages - Updates percentage formatting of tab labels to match pages	2024-04-23 17:08:24 -04:00
sua yoo	1915274e26	Fix QA review comments (#1723 ) Fixes https://github.com/webrecorder/browsertrix/issues/1710 Fixes date and deletion for newly added comments.	2024-04-23 16:31:52 -04:00
sua yoo	9836e750c8	Hide QA UI from users without crawler role (#1733 ) Resolves https://github.com/webrecorder/browsertrix/issues/1732 - Hides QA tab and QA review page from users with viewer role - Fixes empty menu button rendering for viewers on archived item detail page	2024-04-23 16:01:23 -04:00
sua yoo	1f4440db64	Poll archived items list (#1730 ) - Refreshes archived items page every 5 seconds to get in progress QA runs. - Refactors archived items to use `TailwindComponent` and lit tasks - Adds back all columns to upload list to prevent layout shift (see follow-ups) - Fixes list not updating after search value is selected.	2024-04-23 12:53:55 -07:00
sua yoo	cb9012a6df	Fix QA navigation (#1731 ) - Resolves https://github.com/webrecorder/browsertrix/issues/1705 - Resolves https://github.com/webrecorder/browsertrix/issues/1477 Changes: - Takes user back to archived item QA tab after submitting review. - Prevents clicking on page in QA tab when there's no analysis runs. - Disables analysis-related sort options in QA tab when there's no analysis runs. - Handles pages without a title in QA tab. - Adds initial loading state to QA review page list.	2024-04-23 15:45:06 -04:00
sua yoo	dde1175bd0	Add QA page analysis chart (#1725 )	2024-04-23 10:37:22 -04:00
Emma Segal-Grossman	1f59c0b452	add best text & screenshot match sorting options (#1726 )	2024-04-22 20:53:17 -04:00
Emma Segal-Grossman	7871d30ba1	QA UI: Copy updates & beta tag (#1722 ) Closes #1718 - Adds beta badge & icon components - Changes the capitalization of "analysis run" to lowercase in sentences - Changes instances of "in Replay" and "From Replay" to "During Analysis" \| Screenshots \| \| --- \| \| <img width="400" alt="Screenshot 2024-04-22 at 6 09 28 PM" src="https://github.com/webrecorder/browsertrix/assets/5727389/4822e361-c82c-4349-957a-a7a9f15dbc7a"> \| \| <img width="254" alt="Screenshot 2024-04-22 at 6 09 44 PM" src="https://github.com/webrecorder/browsertrix/assets/5727389/01a1457f-e00c-46ba-8af5-66b702344b7a"> \| \| <img width="313" alt="Screenshot 2024-04-22 at 6 10 04 PM" src="https://github.com/webrecorder/browsertrix/assets/5727389/bd38e2f7-28ea-49e9-b856-cc7ade6ae618"> \| \| <img width="597" alt="image" src="https://github.com/webrecorder/browsertrix/assets/5727389/ac95c2a3-ba67-42b3-a33d-c0b61e2ba6f7"> \|	2024-04-22 20:03:55 -04:00
Emma Segal-Grossman	7710be0f6e	More Frontend QA Polish Changes (#1709 ) Sort of relevant to #1696 - Improves a number of layout elements at smaller viewport sizes (specifically, letting button groups wrap onto next lines & ensure titles can't shrink to 0 width) - Adds "No page title" to places where there'd normally be a page title but isn't - Applies grid styling to page list area to fix overflow issues - When selecting or loading with a page selected that's farther down on the page list, the top of the page header could be scrolled away from, and there'd be no way to scroll back because the area had `overflow: hidden` applied - Adds default width and height to image comparer element, so that it displays correctly when images haven't loaded and doesn't change layout when images load --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2024-04-22 20:02:07 -04:00
Tessa Walsh	b8caeb88e9	Ensure QA run WACZs are deleted (#1715 ) - When qa run is deleted - When crawl is deleted And adds tests for WACZ deletion. Fixes #1713	2024-04-22 18:04:09 -04:00
Emma Segal-Grossman	127315189f	Frontend QA: show currently sorted heuristic percentages in page list (#1720 ) Closes #1690 Swaps out the "+n" number for a percentage when sorting by percentage values	2024-04-22 16:44:59 -04:00
sua yoo	e56c412737	Add QA columns to archived item list (#1683 ) - Shows QA review status, latest QA state, and QA run count in archived item list. "Created by" column replaced for space and moved to "Date Created" tooltip - Enables sort archived items by QA columns - Adds tooltip to all columns except name - Hides non-applicable columns in when archived items are filtered to uploads only - Fixes issue where tooltips weren't activated on hover when using `btrix-table` - Minor refactor to reuse status labels/colors, renames page review -> page approval to clarify	2024-04-22 13:19:07 -07:00
sua yoo	db6091b09c	Fix QA run deletion (#1721 ) Fixes unable to delete QA run from the QA tab "Analysis Runs" list.	2024-04-22 13:15:42 -07:00
Ilya Kreymer	1844e761dc	Support sorting by last QA started time (#1712 ) To support #1683, it would be useful to be able to sort by 'last QA start time' in addition to/instead of last QA state. - make sorting consistent with workflow sorting - sortBy fields renamed to lastQAState and lastQAStarted - Current QA runs are now included in the lastQAState/lastQAStarted fields, rather than being separated out to different values --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2024-04-22 13:00:52 -07:00
Ilya Kreymer	b574f00d2b	Add Repository Index + Chart Rename + Docs Rename (#1708 ) Repository Index: Generate an index.yaml in ./docx/helm-repo/index.yaml to allow for browsertrix to be a helm repository. docs: rename docs.browsertrix.cloud -> docs.browsertrix.com docs: update deployment doc to mention helm repo as preferred way to install docs build action: generate repository index in GH action publish action: update auto-generated message to mention installing from the repo. --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2024-04-21 09:42:25 -07:00
Tessa Walsh	d31bdd2a21	Change resource table to be based on mime type (#1698 ) Follow-up to #1689 Instead of just using resource types, using a combination of mime type, resource type and extension analysis. Current logic is: - if resourceType is script, stylesheet, image, font, use that definitely - if resourceType is fetch/xhr, also check for extension, specifically for images - categorize favicon.ico separately as 'favicon' - also check for extension for json - also check for pdf by type + extension - categorize 'ping' as 'other' - by default, use the first part of mime as the type	2024-04-20 19:02:15 -07:00
Ilya Kreymer	4360e0c1b5	Update tests with latest crawler (#1711 ) tests: use 'latest' crawler release for testing, now that 1.1.x is released.	2024-04-20 15:56:26 -07:00
Emma Segal-Grossman	c5b808ba40	Ensure dates are formatted with the current app locale (and not browser default) (#1697 ) ### Motivation While using the browser default locale is often good enough, we probably want more full control over locales, especially when we allow users to choose their own locale — it's a poor experience to not have dates and numbers formatted consistently with your chosen locale. ### Changes Ensures (almost) all instances of `<sl-format-date>` as well as `Intl.*` have the correct locale passed into them from our Lit localization system. The one exception here is in `frontend/src/utils/number.ts` where ordinal suffixes aren't localized, so the locale is hardcoded to `en` — I'll revisit this in the future.	2024-04-20 16:30:33 -04:00
Henry Wilkinson	a60f56b87a	Frontend QA Polish Changes (#1703 ) Fixes #1696 - Inverts the toggle button state for screenshot comparison, is now untoggled by default - Changes the Resources tab to a puzzle piece - Parity with the metaphor used in ReplayWeb.page. Would like to leave databases to represent actual databases... Which given the app could very well make their way in here! - Fixes the dashed boarder radius being applied to the text comparison tab - Sets the data table header row to `position: sticky;` so it's always visible if the table scrolls! - This applies to all data tables, tested the one in QA review, the overview page, and the org settings members list - Casing changes for parity with other dropdown sorting controls Co-authored-by: Emma Segal-Grossman <hi@emma.cafe>	2024-04-19 22:26:04 -04:00
Emma Segal-Grossman	ecaa851688	Fix workflow language setting showing HTML code in select element (#1702 ) Closes #1655 ### Changes Removes the separate element inside the `<sl-select>` and instead doesn't show localized language unless it's different from the language in the current locale	2024-04-19 08:39:04 -07:00
Tessa Walsh	80008a2853	Add post load delay to Browsertrix (#1700 ) Fixes #1699 Adds post load delay to: - Backend `RawCrawlConfig` model - Frontend (workflow editor and config details component) - Workflow setup docs	2024-04-18 20:03:47 -07:00
Ilya Kreymer	9609ff4194	Add 'activeQAStats' field (#1694 ) As additional support for #1683, include the active QA stats in the crawl response, along with active QA state. This will allow showing progress of QA run in the archived items list.	2024-04-18 10:05:39 -04:00
Henry Wilkinson	3e4b0e491a	Frontend: App Branding! (#1592 ) Closes #424 ### Changes - Adds Browsertrix logo to the nav bar!! - Will license the typeface (Konsole) as soon as this gets merged. - Changes the theme colour to Webrecorder's turquoise brand colour - Adjusts the usage of `primary` vs `blue` for elements where it shouldn't be the same - Changes the _Cancel & Discard_ button of the cancel crawl dialogue to `danger` button variant - Replaces the Replay icon with the new ReplayWebpage icon! - Fixes the favicon SVG linking (wrong name, oops!) - Fills the microscope lens in the microscope icon - Removes the old btrix-cloud.svg logo Co-authored-by: sua yoo <sua@webrecorder.org>	2024-04-17 23:40:02 -04:00
sua yoo	f0921feb70	Display QA resources as table (#1692 ) - Renders QA resource data as a basic table - Updates screenshot and resource tab icons	2024-04-17 19:43:34 -07:00
Tessa Walsh	b87860c68a	Ensure /all-crawls?sortBy=qaState always sorts crawls above uploads (#1691 ) Follow-up to #1686	2024-04-17 19:14:29 -07:00
Emma Segal-Grossman	9f0a1fcc95	QA page details (#1656 ) ### Changes - improves overflow issues at smaller screen sizes - adds icons to buttons - updates text & layout to match mocks - changes primary button & button options depending on if there's a qa run available - adds a loading state for qa run status & buttons - updates `<btrix-crawl-status>` with a `type` param allowing for crawls, uploads, and QA runs - Updates `<btrix-alert>` to match `<sl-tag>` styling - Improves overflow issues at smaller viewport sizes by making tab lists overflow when necessary ### Features - Ability to start/stop/cancel QA runs https://github.com/webrecorder/browsertrix-cloud/pull/1666 @SuaYoo - Ability to see progress of current QA run @emma-sg - Ability to delete QA runs @emma-sg - Ability to download QA run files https://github.com/webrecorder/browsertrix-cloud/pull/1666 @SuaYoo - Only able to start review if a QA Run is finished (for now, initial pass). @SuaYoo - Only most recent running or successful QA run is displayed in header --------- Co-authored-by: sua yoo <sua@webrecorder.org> Co-authored-by: sua yoo <sua@suayoo.com> Co-authored-by: Ilya Kreymer <ikreymer@gmail.com> Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2024-04-17 15:17:13 -07:00
Henry Wilkinson	6cabb9c875	Changes download icons to cloud-download (#1688 ) And now they're all the same!	2024-04-17 15:27:38 -04:00
Vinzenz Sinapius	a8336925b6	Run crawler and profilebrowser with non-root user (#1625 ) With these changes, crawler and profilebrowser jobs run as a non-root user.	2024-04-17 12:03:33 -07:00
Tessa Walsh	30ab139ff2	Add QA run aggregate stats API endpoint (#1682 ) Fixes #1659 Takes an arbitrary set of thresholds for text and screenshot matches as a comma-separated list of floats. Returns a list of groupings for each that include the lower boundary and count for all thresholds passed in.	2024-04-17 13:24:18 -04:00
Ilya Kreymer	835014d829	restrict qa runs to a 'min_qa_crawler_image' if set in the chart (#1685 ) - fixes #1684 - can be used to optionally restrict QA to only some crawls (eg. with browsertrix-crawler>=1.0.0) - enforce error on backend (return 400) and handle special error on the frontend	2024-04-17 08:48:33 -07:00
Tessa Walsh	c800da1732	Add reviewStatus, qaState, and qaRunCount sort options to crawls/all-crawls list endpoints (#1686 ) Backend work for #1672 Adds new sort options to /crawls and /all-crawls GET list endpoints: - `reviewStatus` - `qaRunCount`: number of completed QA runs for crawl (also added to CrawlOut) - `qaState` (sorts by `activeQAState` first, then `lastQAState`, both of which are added to CrawlOut)	2024-04-16 23:54:09 -07:00
Tessa Walsh	87e0873f1a	Add mime field to Page model (#1678 )	2024-04-17 00:57:49 -04:00
Vinzenz Sinapius	1b034957ff	Improve reliability of backend tests (#1675 ) - Remove globals from profile, uploads, and qa test modules in favor of fixtures - Add retries to fix intermittent test failures due to timing --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net> Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>	2024-04-16 14:22:41 -04:00
Ilya Kreymer	95f5605af7	renumber crawl priority classes: (#1673 ) - priority classes <-10 are ignored by cluster-autoscaler so QA jobs with too low priorities never run - start crawl priorities at 0 going down (same as before) - start qa run priorities at -2 going down (instead of -100) - this means a crawl of with scale of 3 can be preempted by 1st qa pod, but otherwise crawls have higher priority - rename priority classes as they are otherwise immutable and error on helm upgrade This allows for more room in lower pri classes for other type of objects, while keeping in mind the -10 and below threshold: (see: https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/FAQ.md)	2024-04-13 12:24:43 -07:00
Ilya Kreymer	f243d34395	Remove pages from QA Configmap (#1671 ) Fixes #1670 No longer need to pass pages to the ConfigMap. The ConfigMap has a size limit and will fail if there are too many pages. With this change, the page list for QA will be read directly from the WACZ files pages.jsonl / extraPages.jsonl entries.	2024-04-12 16:04:33 -07:00
emma	ed08b734ba	Fix failing Webpack build - Includes eslint-related files in Dockerfile build step - Switches to using a cache mount in Dockerfile build step for yarn cache, rather than deleting yarn cache, which should speed up most builds	2024-04-10 14:40:44 -04:00
Tessa Walsh	172a9bf0cd	Change crawl.reviewStatus to 1-5 scale int (#1664 )	2024-04-09 17:51:06 -07:00
Emma Segal-Grossman	9aba24a90e	Emit eslint errors during webpack builds & during dev (#1667 ) Following [discussion on Discord](https://discord.com/channels/895426029194207262/910966759165657161/1227299497546223707), this enables ESLint errors showing up during dev within Webpack, and also enforces that all issues (both warnings and errors) will cause builds to fail during CI. Other changes: - Updates some dependencies (primarily to versions that are type-checked)	2024-04-09 17:42:31 -04:00
Tessa Walsh	435ca08f26	Remove URL prefix dropdown from new browser profile screen (#1660 ) Switch to a text field and prepend `https://` as prefix if no valid prefix is provided in the provided URL.	2024-04-08 15:36:21 -04:00
Ilya Kreymer	17f49a52de	email templates update + customization + doc update (fixes #1652 ) (#1653 ) - modify invite email template to answer common questions - email templates: make each email template overridable with --set-file - docs: update customization doc to document how to customize email templates --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2024-04-08 12:27:47 -07:00
Ilya Kreymer	a7cda3b11b	version: bump to 1.10.0-beta.1	2024-04-05 18:24:14 -07:00
Tessa Walsh	4229b94736	Track failed QA runs and include in list endpoint (#1650 ) Fixes #1648 - Tracks failed QA runs in database, not only successful ones - Includes failed QA runs in list endpoint by default - Adds `skipFailed` param to list endpoint to return only successful runs --------- Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>	2024-04-04 18:51:06 -07:00
Ilya Kreymer	5c08c9679c	fix issue with incorrect number of total pages if any of the seeds is a redirect (#1649 ) Following changes in webrecorder/browsertrix-crawler#475, webrecorder/browsertrix-crawler#509, the crawler adds a redirected seed to the seen list. To account for this, it needs to be subtracted to get the total page count.	2024-04-04 15:55:44 -07:00
sua yoo	83c9203a11	Initial QA Review UI! (#1624 ) QA Details page: - Enables QA tab with ability to start automated analysis QA Run + view a and manual review status - Pages listed with review status + overall crawl review status shown on QA details (relates to #1508) - Initial placeholder for QA run analytics (part of #1589) - Addresses a good deal of #1477 Automated Analysis QA in Review Mode: - Ability to select from multiple analysis QA runs / view QA runs in QA details - Shows analysis screenshot, text and resources compare and replay tabs (fixes #1496) - Sorting by worst screenshot / worst text score for each QA run - Includes pages sidebar with screenshot/text/resource compare results (fixes #1497) Manual Review QA in Review Mode: - Per-page replay available as separate tab (fixes #1499) - Supports thumbs up, thumbs down, notes for each page - Supports entering review status approval (good/acceptable/bad can be entered when finishing review --------- Co-authored-by: Emma Segal-Grossman <hi@emma.cafe> Co-authored-by: Ilya Kreymer <ikreymer@gmail.com> Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2024-04-04 15:09:52 -07:00
Henry Wilkinson	432a727218	Frontend: Fixes the "Replay Latest Crawl" button path (#1636 ) - Fixes the path for the "Replay Latest Crawl" button on the watch crawl page when the crawl workflow isn't running	2024-04-03 18:19:46 -04:00
Henry Wilkinson	6e8867c550	Adds documentation for exporting files (#1643 ) Closes #1642 ### Changes - Adds section to the collections page on downloading collections - Changes the Files section on the archived items page to be more explicit about downloading files because that's the only action you can do there! --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2024-04-03 17:33:57 -04:00
Ilya Kreymer	ffc4b5b58f	operator state fixes (follow up fomr #1639 ) (#1640 ) - increase time for going to waiting_capacity from starting to 150 seconds - relax requirement for state transitions, allow complete from waiting - additional type safety for different states, ensure mark_finished() only called with non-running states, add `Literal` types for all the state types.	2024-03-29 15:12:16 -07:00
Ilya Kreymer	c1817cbe04	add horizontal pod autoscaler for backend and frontend via helm charts (#1633 ) Supports horizontal pod autoscaling (hpa) for backend and frontend pods: - use cpu and memory averages - adjust base memory + cpu for backend - threshold set to 80% cpu and 95% memory utilization by default (configurable in values.yaml) - instead of backend and frontend replicas, set max replicas in values.yaml - only enable hpa if backend_max_replicas or frontend_max_replicas is >1, default to 1 for now	2024-03-28 16:39:27 -07:00

1 2 3 4 5 ...

1121 Commits