browsertrix

Author	SHA1	Message	Date
Henry Wilkinson	92fdcfd986	Docs: Adds example section on basic auth (#2021 )	2024-08-15 06:13:28 -04:00
Henry Wilkinson	251aef3ac1	Docs: Elaborates on using user agents (#1841 ) - Provides a link to Mozilla's page explaining what they are (good for folks new to the concept) - Provides a link to useragents.me, the same site we link to in the app - Provides two examples of situations where they may be helpful to get around content restrictions	2024-05-30 14:50:10 -04:00
sua yoo	aa6429049e	Display name of user who last updated browser profile (#1834 ) - Shows browser profile last modified or created by name, if available - Moves backed-up status to browser profile subsection header - Moves "Last Updated" column to last and displays user name on hover, to match archived items list view - Updates browser profile docs	2024-05-29 13:40:56 -07:00
Henry Wilkinson	1a668fe82f	Adds QA features to user docs (#1784 ) Fixes #1695 ### Changes - Adds Crawl Review user docs - Adds Quality Assurance section to the Archived Items page - Adds note in the user roles list on crawl review not being available for viewers Co-authored-by: Emma Segal-Grossman <hi@emma.cafe> Co-authored-by: sua yoo <sua@webrecorder.org> Co-authored-by: Ilya Kreymer <ikreymer@users.noreply.github.com>	2024-05-15 15:01:44 -04:00
Henry Wilkinson	93c35ee2ee	Update dash and slash icons (#1783 ) Fixes #1782 - Dash icons are now used to convey status exclusively - Slash icons are now used to convey no data states - Updates status icons to filled in the docs (also required for QA docs!)	2024-05-03 12:52:07 -04:00
Henry Wilkinson	9c7fdb4fac	Add note on checking browser profiles for scheduled crawls (#1763 )	2024-04-30 06:25:42 +02:00
Tessa Walsh	80008a2853	Add post load delay to Browsertrix (#1700 ) Fixes #1699 Adds post load delay to: - Backend `RawCrawlConfig` model - Frontend (workflow editor and config details component) - Workflow setup docs	2024-04-18 20:03:47 -07:00
Henry Wilkinson	6e8867c550	Adds documentation for exporting files (#1643 ) Closes #1642 ### Changes - Adds section to the collections page on downloading collections - Changes the Files section on the archived items page to be more explicit about downloading files because that's the only action you can do there! --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2024-04-03 17:33:57 -04:00
Henry Wilkinson	652856e74c	docs: Adds more details about browser profile capabilities (#1523 ) Fixes #1522 ## Changes - Adds further security recommendations to change the password to accounts you care about after crawling Adds more details about the capabilities afforded with browser profiles. This is now split into the following sections: - Logging into Websites - Accepting Popups - Changing Browser Settings - More in the future??? Extensions??? --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2024-02-09 16:16:47 -08:00
Henry Wilkinson	45c9a91c9e	Docs: Improve relative links (#1476 ) ### Changes - Fixes one broken link (["Ansible Playbooks" here](https://docs.browsertrix.cloud/deploy/remote/)) - Formats relative links better to conform with [mkdocs 1.5 link validation improvements](https://www.mkdocs.org/about/release-notes/#expanded-validation-of-links)	2024-02-07 11:33:57 -08:00
Henry Wilkinson	b2d526f09a	docs: Explains execution time (#1475 ) Fixes #1463 ### Changes - Explains execution time - Adds style guide section about adding a badge for paid features - Updates config for mkdocs-material 9.5, materialx emoji support is being removed. - Adds better tooltips, a cool feature that also got released with mkdocs-material 9.5 - Adds search suggestions ### Caveats - [mkdocs 1.5 has improved the way they handle link validation](https://www.mkdocs.org/about/release-notes/#expanded-validation-of-links). Looks like way I've gone about linking things could be improved, and it will give a bunch of warnings as a result. The site still builds fine, but I'm going to fix this in a different PR so this one doesn't take as much effort to review :) EDIT: Here's that PR https://github.com/webrecorder/browsertrix-cloud/pull/1476 ### Testing - Make sure you are up to date with `pip install --upgrade mkdocs-material` ### Screenshot Badge! <img width="884" alt="Screenshot 2024-01-17 at 11 59 00 PM" src="https://github.com/webrecorder/browsertrix-cloud/assets/5672810/62a51cf6-24bd-49f1-a6d0-d335f730bfbe"> ### Future - Should mkdocs-material be versioned in our deployment script? We risk things breaking if I don't get to them fast enough! 🙃 --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2024-01-31 15:12:39 -05:00
Tessa Walsh	07fa46d9aa	Add custom user agent to workflows (#1465 ) Fixes #1341 Adds "User Agent" field to workflow editor under the Browser Settings tab. If not set, the crawler will use the browser's default user agent. Also added to docs and to the workflow details page (if set). --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics> Co-authored-by: Ilya Kreymer <ikreymer@users.noreply.github.com>	2024-01-17 17:33:50 -05:00
Tessa Walsh	032859f361	Support multiple crawler versions (#1420 ) Fixes #1385 ## Changes Supports multiple crawler 'channels' which can be configured to different browsertrix-crawler versions - Replaces `crawler_image` in helm chart with `crawler_channels` array similar to how storages are handled - The `default` crawler channel must always be provided and specifies the default crawler image - Adds backend `/orgs/{oid}/crawlconfigs/crawler-channels` API endpoint to fetch information about available crawler versions (name, image, and label) and test - Adds crawler channel select to workflow creation/edit screens and profile creation dialog, and updates related API endpoints and configmaps accordingly. The select dropdown is shown only if more than one channel is configured. - Adds `crawlerChannel` to workflow and crawl details. - Add `image` to crawler image, used to display actual image used as part of the crawl. - Modifies `crawler_crawl_id` backend test fixture to use `test` crawler version to ensure crawler versions other than latest work - Adds migration to add `crawlerChannel` set to `default` to existing workflow and profile objects and workflow configmaps --------- Co-authored-by: Ilya Kreymer <ikreymer@gmail.com> Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2024-01-16 15:32:12 -08:00
Henry Wilkinson	05c5e09d25	Adds status information to user documentation (#1459 ) Closes #1434 ### Changes #### Developer - Adds the K3S playbook guide to the navigation - Adds note about restarting MKDocs when adding new icons - Adds note about concise language to the styleguide ([see previous discussion](https://github.com/webrecorder/browsertrix-cloud/pull/1394#discussion_r1402666872)) - Adds a note about noun usage to the styleguide #### User guide - Adds tables for archived item and workflow statuses - Adds custom styles for displaying statuses with their icons like we do in the app - Fixes capitalization issues --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net> Co-authored-by: sua yoo <sua@webrecorder.org>	2024-01-14 16:44:51 -08:00
sua yoo	dbd48cf8e3	Improvements to collection creation and editing flow (#1424 ) Resolves https://github.com/webrecorder/browsertrix-cloud/issues/1333 - Moves "Select Crawls" / "Select Uploads" steps into a single "Select Archived Items" dialog - Refactors new collection metadata dialog to accept editing existing collection - Prevents RWP component from rendering if there are no archived items (@Shrinks99 made a comment about this figma, but this prevents unnecessary requests when there isn't an archive to replay) - Shows collection description at bottom of detail page at all times (@Shrinks99 seems useful to see even on archived items view?) - Switches collection detail primary action to "Add Archived Items" if none are included (cc @Shrinks99) - Displays friendlier "name taken" error - Removes unused Collection edit route - Upgrades markdown dependencies for fixes/improvements to description editing --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-12-19 18:12:43 -08:00
Henry Wilkinson	ae8804d87f	Improves user documentation intro (#1376 ) Closes #1369 ### Changes - Adds improved getting started steps and intro contact information to the User Guide homepage - Adds a small section about the execution minutes graph for orgs with a quota set - Moves existing signup content to a dedicated signup page - Changes admonitions from using em dashes to using colons. - Em dashes are great and I love em.... But sometimes I love them a little _too_ much and they were a bad fit here. - Fixes user guide homepage link - Fixes `ReplayWeb.page` and `ArchiveWeb.page` names - Fixes broken links (would be good to have a CI system for this I think) --------- Co-authored-by: Emma Segal-Grossman <hi@emma.cafe> Co-authored-by: Tessa Walsh <tessa@bitarchivist.net> Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>	2023-11-15 17:55:47 -08:00
Tessa Walsh	38f32f11ea	Enforce quota and hard cap for monthly execution minutes (#1284 ) Fixes #1261 Closes #1092 The quota for monthly execution minutes is treated as a hard cap. Once it is exceeded, an alert indicating that an org has exceeded its monthly execution minutes will display and the user will be unable to start new crawls. Any running crawls will be stopped once the quota is exceeded. An execution minutes meter bar is also added in the Org Dashboard and displayed if a quota is set. More detail in #1305 which was merged into this branch. ## Changes - Enable setting 'maxExecMinutesPerMonth' in orgs list quotas by superadmin - Enforce quota by stopping crawls in operator once quota is reached - Show alert banner once execution time quota is hit: - Once quota is hit, disable Run Crawl buttons in frontend, return 403 message with `exec_minutes_quota_reached` detail in backend from crawl config `/run` endpoint, and don't run new workflows on creation (similar to storage quota) - Display execution time for crawls in the crawl details overview, immediately below - Show execution minutes meter on dashboard (from #1305) --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics> Co-authored-by: Ilya Kreymer <ikreymer@gmail.com> Co-authored-by: sua yoo <sua@webrecorder.org>	2023-10-26 15:38:51 -07:00
Henry Wilkinson	adf71f132e	Adds missing user documentation for launch! (#1286 ) Closes #1215 - Adds account settings page - Adds overview page - Adds archived items page - Adds note about browser profile metadata editing - Adds note on editing the crawler instances scale while crawling - Adds details on permission levels for the org settings - Removes note about not being able to change your display name (follows #1265)	2023-10-16 19:16:38 -07:00
Henry Wilkinson	0bd8748e68	Minor Workflow Creator UX Changes (#1267 ) - Adds `position: sticky` to the workflow creator / editor controls to affix them to the bottom of the screen, they are now always visible! - Renames "Extra URLs in Scope" to "Extra URL Prefixes in Scope" - Updates documentation accordingly - Adjusts casing for checkboxes - Adds the multiplication sign to the crawler instances settings to better communicate that they are increases in scale and not arbitrary numbers.	2023-10-13 16:55:54 -07:00
Henry Wilkinson	99ccdf2de8	Browser Profile Warning & Dialog Style Updates (#1243 ) * Give protocol selection box smaller max-width * Add warning and docs link to browser profile creation - Updates dialog styling to btrix dialog - Updates button sizes - Updates button placement in dialog - Updates button labels for consistency with other buttons in app - Updates docs page with new button labels * Update browser profile edit metadata dialog. Matches updated dialog shown on profile creation * Open docs page in new tab	2023-10-03 18:59:19 -07:00
Tessa Walsh	b1ead614ee	Add --failOnFailedSeed checkbox to URL list workflows (#1236 ) - If set, and any of the seeds fails, the entire crawl is marked as a failure. - Add checkbox which adds --failOnFailedSeed checkbox to URL list workflows - Add 'Fail Crawl On Failed URL' to crawl workflow setup docs	2023-10-03 18:46:09 -07:00
Tessa Walsh	e667fe2e97	Add max crawl size option to backend and frontend (#1045 ) Backend: - add 'maxCrawlSize' to models and crawljob spec - add 'MAX_CRAWL_SIZE' to configmap - add maxCrawlSize to new crawlconfig + update APIs - operator: gracefully stop crawl if current size (from stats) exceeds maxCrawlSize - tests: add max crawl size tests Frontend: - Add Max Crawl Size text box Limits tab - Users enter max crawl size in GB, convert to bytes - Add BYTES_PER_GB as constant for converting to bytes - docs: Crawl Size Limit to user guide workflow setup section Operator Refactor: - use 'status.stopping' instead of 'crawl.stopping' to indicate crawl is being stopped, as changing later has no effect in operator - add is_crawl_stopping() to return if crawl is being stopped, based on crawl.stopping or size or time limit being reached - crawlerjob status: store byte size under 'size', human readable size under 'sizeHuman' for clarity - size stat always exists so remove unneeded conditional (defaults to 0) - store raw byte size in 'size', human readable size in 'sizeHuman' Charts: - subchart: update crawlerjob crd in btrix-crds to show status.stopping instead of spec.stopping - subchart: show 'sizeHuman' property instead of 'size' - bump subchart version to 0.1.1 --------- Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>	2023-08-26 22:00:37 -07:00
Henry Wilkinson	2952988864	docs: formatting fixes & minor content updates (#1091 ) Additional tweaks on Browser Profiles pages + general consistency pass	2023-08-21 13:26:43 -07:00
Henry Wilkinson	02a01e7abb	docs: Adds information about 1.6 features to documentation (#1086 ) * 1.6 docs update ### Changes - Adds note in style guide about referencing actions in the app - Adds page for Browser Profiles - Adds callout for uploads in the context of combining items from multiple sources - Adds page for Collections - Adds page for Crawl Workflows - Updates index to link to new dedicated Crawl Workflow page in addition to the Crawl Workflow Setup page - Updates Org Settings page action styling in accordance with new rules - Updates Crawl Workflow Setup page with links to the new pages and a hierarchy fix for the first item - Updates user guide navigation with a new section for crawling related items --------- Co-authored-by: sua yoo <sua@webrecorder.org> Co-authored-by: Ilya Kreymer <ikreymer@users.noreply.github.com>	2023-08-18 21:55:20 -07:00
Tessa Walsh	d5c3a8519f	Add crawler Use Sitemap option to Browsertrix Cloud (#978 ) * Add user-guide docs for Use Sitemap option --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-07-19 13:57:52 -04:00
Henry Wilkinson	d9e73fcbc3	Reorder Limits section (#966 ) * Reorder Limits section - Minor text change to section names - "Limit Per Page" → "Per-Page Limits" - "Limit Per Crawl" → "Per-Crawl Limits" * Reorder limits section in documentation	2023-07-08 08:54:30 -07:00
Henry Wilkinson	ac4716614e	Minor gramatical changes to documentation (#919 )	2023-07-04 17:14:49 -04:00
Tessa Walsh	bd6dc79449	Add frontend support for auto-adding collections to workflows (#916 ) - Adds collections search and list to workflow editor - Adds collections to workflow details component - Adds namePrefix filter to backend GET /orgs/{oid}/collections endpoint to support case-insensitive searching of collections - Adds documentation for new setting --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-06-12 18:18:05 -07:00
Henry Wilkinson	79703baa69	Org Settings documetation & Getting Started docs page updates	2023-06-11 17:39:16 -04:00
Henry Wilkinson	8477919989	Adds all workflow settings to the user docs with descriptions (#894 ) Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2023-06-08 14:28:58 -04:00
Ilya Kreymer	6e81b44ff8	docs: fix typos in production, missing TODO in user-guide section	2022-12-06 15:24:10 -08:00
Henry Wilkinson	a74d88dcda	mkdocs setup (deploy, dev, user-guide) (#375 ) * Initial docs move * Setup mkdocs * Adds instructions for building docs * add new deployment docs, local and prod * set up three sections: deployment, dev and user guide * remove old deployment docs * ci: mkdocs gh-pages publish Co-authored-by: sua yoo <sua@suayoo.com> Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>	2022-12-05 16:41:37 -08:00

32 Commits