browsertrix

Author	SHA1	Message	Date
sua yoo	54e2b2c703	List web captures in Collection (#1024 ) - Adds tab for "Web Captures" in Collection detail view - Move Collection description under Replay section - Fixes app reloading when clicking into a Collection - Standardizes Web Capture list headers from "Finished -> "Created Date"	2023-08-01 09:14:27 -07:00
Ilya Kreymer	06cf9c7cc3	add crawl ending states: 'generate-wacz', 'uploading-wacz', 'pending-wait' that occur after a crawl is finished or is being stopped (#1022 ) operator: ensure transitions from each of these states is supported, including to 'waiting_capacity' add extra check on stopping to avoid transitioning back to a running state after crawl is finished ui: add states to UI display, localization, add as active states fixes #263	2023-08-01 00:15:59 -07:00
Anish Lakhwara	d8502da885	fix(build): use `/usr/bin/env bash` instead of `/bin/bash` (#1020 ) * fix: add to various other shell scripts	2023-07-28 21:50:04 -07:00
sua yoo	7069b33646	Show only running crawls in superadmin view (#1015 ) - Show separate crawls list for admin view, fixes #1010	2023-07-26 15:48:20 -07:00
Ilya Kreymer	6506965d98	Streaming Download for Collections (#1012 ) * support streaming download of collections (part of #927) - WACZ zip created on the fly using stream-zip - add 'Download Collection' option to collection detail and list - after editing collection, return to collection view - tests: add test for streaming download, ensure WACZ files + datapackage present, STORE compression used --------- Co-authored-by: sua yoo <sua@suayoo.com>	2023-07-26 15:42:17 -07:00
Tessa Walsh	c21153255a	Rename notes to description in frontend and backend (#1011 ) - Rename crawl notes to description - Add migration renaming notes -> description - Stop inheriting workflow description in crawl - Update frontend to replace crawl/upload notes with description - Remove setting of config description from crawl list - Adjust tests for changes	2023-07-26 13:00:04 -07:00
sua yoo	75b011f951	Upload WACZ via UI (#992 ) - Users can now upload .WACZ archives from the "Archived Data" page. - Can specify name, description, tags and collection(s) to add upload to - Show progress of upload - Support canceling upload	2023-07-21 16:45:52 +02:00
sua yoo	85913112a2	Upgrade lit + shoelace to reduce build size (#938 ) * upgrade lit * upgrade shoelace * upgrade testing libraries * add webpack bundle analyzer * revert shoelace changes * remove bundle analyzer * remove console log	2023-07-20 11:50:05 +02:00
Tessa Walsh	d5c3a8519f	Add crawler Use Sitemap option to Browsertrix Cloud (#978 ) * Add user-guide docs for Use Sitemap option --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-07-19 13:57:52 -04:00
sua yoo	c5b3be0680	Fix frontend formatting pre-commit (#991 ) * update lint staged config * remove prettier defaults	2023-07-18 17:51:13 +02:00
Ilya Kreymer	2372f43c2c	frontend: fix to collection editor with crawls and uploads (#971 ) * frontend: - follow up to #969, fixes crawl workflows by using crawl-specific endpoint and merging results * get crawls and uploads concurrently --------- Co-authored-by: sua yoo <sua@suayoo.com>	2023-07-10 19:29:19 +02:00
sua yoo	f3660839bf	Allow users to add uploads to collections (#968 ) * show uploads in 'Select Uploads' section	2023-07-09 22:21:50 -07:00
Henry Wilkinson	d9e73fcbc3	Reorder Limits section (#966 ) * Reorder Limits section - Minor text change to section names - "Limit Per Page" → "Per-Page Limits" - "Limit Per Crawl" → "Per-Crawl Limits" * Reorder limits section in documentation	2023-07-08 08:54:30 -07:00
Ilya Kreymer	8eeb66e11f	Frontend more upload path fixes (#961 ) * additional fixes for #935: - don't use artifactType for detail pages, ensure correct artifact selected based on path * naming tweaks: - from uploads detail, return to 'All Uploads' with filter - from crawls detail, return to 'All Crawls' with filter - rename general to 'All Archived Data'	2023-07-07 15:41:03 -07:00
Ilya Kreymer	d3a757e20b	partial fix for: #935 : (#960 ) - add route for /artifacts/upload/<id> to be used for uploads - link uploads to /artifacts/upload/<id> instead of /artifacts/crawl/<id>	2023-07-07 14:23:26 -07:00
sua yoo	de4b18aa67	List crawls, uploads, and all objects in UI (#941 ) - Adds top-level "Archived Data" view, replacing "Finished Crawls" and moving it as "Crawls" into view - Adds list for viewing all artifacts/data - Adds list for viewing all uploaded crawls - Updates crawl detail view to show upload details - Edit upload metadata, including 'name' - Delete uploads --------- Co-authored-by: Ilya Kreymer <ikreymer@gmail.com> Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-07-07 13:20:28 -07:00
Ilya Kreymer	00eb62214d	Uploads API: BaseCrawl refactor + Initial support for /uploads endpoint (#937 ) * basecrawl refactor: make crawls db more generic, supporting different types of 'base crawls': crawls, uploads, manual archives - move shared functionality to basecrawl.py - create a base BaseCrawl object, which contains start / finish time, metadata and files array - create BaseCrawlOps, base class for CrawlOps, which supports base crawl deletion, querying and collection add/remove * uploads api: (part of #929) - new UploadCrawl object which extends BaseCrawl, has name and description - support multipart form data data upload to /uploads/formdata - support streaming upload of a single file via /uploads/stream, using botocore multipart upload to upload to s3-endpoint in parts - require 'filename' param to set upload filename for streaming uploads (otherwise use form data names) - sanitize filename, place uploads in /uploads/<uuid>/<sanitized-filename>-<random>.wacz - uploads have internal id 'upload-<uuid>' - create UploadedCrawl object with CrawlFiles pointing to the newly uploaded files, set state to 'complete' - handle upload failures, abort multipart upload - ensure uploads added within org bucket path - return id / added when adding new UploadedCrawl - support listing, deleting, and patch /uploads - support upload details via /replay.json to support for replay - add support for 'replaceId=<id>', which would remove all previous files in upload after new upload succeeds. if replaceId doesn't exist, create new upload. (only for stream endpoint so far). - support patching upload metadata: notes, tags and name on uploads (UpdateUpload extends UpdateCrawl and adds 'name') * base crawls api: Add /all-crawls list and delete endpoints for all crawl types (without resources) - support all-crawls/<id>/replay.json with resources - Use ListCrawlOut model for /all-crawls list endpoint - Extend BaseCrawlOut from ListCrawlOut, add type - use 'type: crawl' for crawls and 'type: upload' for uploads - migration: ensure all previous crawl objects / missing type are set to 'type: crawl' - indexes: add db indices on 'type' field and with 'type' field and oid, cid, finished, state * tests: add test for multipart and streaming upload, listing uploads, deleting upload - add sample WACZ for upload testing: 'example.wacz' and 'example-2.wacz' * collections: support adding and remove both crawls and uploads via base crawl - include collection_ids in /all-crawls list - collections replay.json can include both crawls and uploads bump version to 1.6.0-beta.2 --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>	2023-07-07 09:13:26 -07:00
Tessa Walsh	29a6f0f6bc	Fix links in watch crawl after workflow crawl completes (#943 )	2023-07-06 15:04:26 -07:00
Henry Wilkinson	8a240ad044	Fixes z-index (#939 )	2023-07-04 23:05:09 -04:00
Ilya Kreymer	e37f220d6c	version: bump to 1.6.0-beta.1	2023-06-16 18:53:32 -07:00
Tessa Walsh	c7051d5fbf	Backend API consistency pass (#921 ) * Make API add and update method returns consistent - Updates return {"updated": True} - Adds return {"added": True} - Both can additionally have other fields as needed, e.g. id or name - remove Profile response model, as returning added / id only - reformat --------- Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>	2023-06-16 18:52:46 -07:00
Ilya Kreymer	d9ad8c11d2	frontend: fix RWP_BASE_URL not being set correctly for nginx image	2023-06-13 00:04:46 -07:00
Tessa Walsh	bd6dc79449	Add frontend support for auto-adding collections to workflows (#916 ) - Adds collections search and list to workflow editor - Adds collections to workflow details component - Adds namePrefix filter to backend GET /orgs/{oid}/collections endpoint to support case-insensitive searching of collections - Adds documentation for new setting --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-06-12 18:18:05 -07:00
Henry Wilkinson	71e9984e65	Adds documentation link and version copy button to footer (#920 ) * Updates footer - Adds documentation link - Adds label to GitHub link, moves outside of the version code - Adds copy button to version code for quick access when filing bug reports :) * Comments out invisible div * Improves responsiveness on mobile	2023-06-12 17:51:21 -07:00
Ilya Kreymer	ec3404c798	Fix Extra URLs in Scope (#913 ) * scope fix: when using 'Custom Page Prefix scope (fixes #873) - don't include primary seed URL in include list - don't always add trailing slash to extra in scope URLs - set seed scope to 'prefix' (supported via webrecorder/browsertrix-crawler#318) instead of re-including seed URL - add comments on using 'custom' to indicate 'Custom Prefix Scope' semantics on frontend, setting actual scope to 'prefix' on backend - remove unneeded conditional for additional urls, main scopeType overridden per seed anyway	2023-06-12 17:29:41 -07:00
Henry Wilkinson	2364433932	Admin Panel Minor Frontend Style Updates (#915 ) - Unifies trash icons on all pages to use trash3 (there were a few stragglers!) - Brings styling of org quotas dialogue in-line with the rest of our dialogues - Adds missing localization strings - Swaps button with icon button to match table row action styling elsewhere	2023-06-10 19:21:34 -07:00
Ilya Kreymer	9707fb55e4	fix finished workflows incorrectly being displayed as running (#909 )	2023-06-08 11:26:42 -07:00
Ilya Kreymer	4428184aea	frontend: configure running with a fixed 'replay.json', auth headers passed via separate config (#899 ) wabac.js will reload the replay.json on 403 with new token (will be in next version of wabac.js) presign urls: make presign timeout configurable (in minutes), defaults to 60 mins dockerfile: fix configuring RWP_BASE_URL	2023-06-08 11:26:26 -07:00
Henry Wilkinson	a718043fa8	Adds icon `name` and tooltip `content` fields to `btrix-copy-button` (#879 ) - Adds two new properties, name to pick the icon's name and content to pick a custom tooltip message. These are in-line with what Shoelace uses but are perhaps not the best descriptors... - Swaps the existing anchor links on the Workflow Details' Settings tab for these and relocates them to after the heading. (Navigation to the links is broken right now... but the copying part works nicely!) - Updates btrix-section-heading to better handle multiple elements with flexbox and an 8px gap between elements	2023-06-06 17:54:17 -07:00
sua yoo	66b3befef9	Frontend collections beta UI (#886 ) - Support for creating new collections and editing existing collections - Can select crawling workflows which adds entire workflow, and then deselect individual crawls - Can edit existing collections and add more crawls - Can view, create and delete collections via new Collections top-level nav entry	2023-06-06 17:52:01 -07:00
Ilya Kreymer	00fb8ac048	Concurrent Crawl Limit (#874 ) concurrent crawl limits: (addresses #866) - support limits on concurrent crawls that can be run within a single org - change 'waiting' state to 'waiting_org_limit' for concurrent crawl limit and 'waiting_capacity' for capacity-based limits orgs: - add 'maxConcurrentCrawl' to new 'quotas' object on orgs - add /quotas endpoint for updating quotas object operator: - add all crawljobs as related, appear to be returned in creation order - operator: if concurrent crawl limit set, ensures current job is in the first N set of crawljobs (as provided via 'related' list of crawljob objects) before it can proceed to 'starting', otherwise set to 'waiting_org_limit' - api: add org /quotas endpoint for configuring quotas - remove 'new' state, always start with 'starting' - crawljob: add 'oid' to crawljob spec and label for easier querying - more stringent state transitions: add allowed_from to set_state() - ensure state transitions only happened from allowed states, while failed/canceled can happen from any state - ensure finished and state synched from db if transition not allowed - add crawl indices by oid and cid frontend: - show different waiting states on frontend: 'Waiting (Crawl Limit) and 'Waiting (At Capacity)' - add gear icon on orgs admin page - and initial popup for setting org quotas, showing all properties from org 'quotas' object tests: - add concurrent crawl limit nightly tests - fix state waiting -> waiting_capacity - ci: add logging of operator output on test failure	2023-05-30 15:38:03 -07:00
sua yoo	ab518f51fb	Fix ResizeObserver loop error (#902 )	2023-05-30 14:59:34 -07:00
sua yoo	4852532866	Show org creation form if there are no orgs (#883 )	2023-05-24 13:10:12 -07:00
Henry Wilkinson	f788934ef5	Fix copy tags button disabling when no tags on Crawl Details page (#877 )	2023-05-24 12:30:31 -04:00
Tessa Walsh	bd8b306fbd	Improve sorting workflows by lastUpdated (#826 ) * Precompute config crawl stats Includes a database migration to move preciously dynamically computed crawl stats for workflows into the CrawlConfig model. * Add lastRun sorting option and enable it by default * Add modified as final sort key to order non-run workflows * Remove currCrawl* fields and update frontend accordingly * Add isCrawlRunning field to backend and use in frontend	2023-05-22 18:42:30 -04:00
sua yoo	821fbc12d8	Upgrade Shoelace to stable version (v2) (#856 )	2023-05-22 10:01:48 -07:00
Ilya Kreymer	826c2e8298	version: bump to 1.6.0-beta.0	2023-05-19 11:29:31 -07:00
Ilya Kreymer	d07204e59d	version: bump to 1.5.1	2023-05-18 17:28:42 -07:00
sua yoo	b5781c8869	Fix workflow edit back button (#857 )	2023-05-17 12:07:12 -07:00
Henry Wilkinson	da33231be9	Removes webkit `<summary>` element triangle (#852 )	2023-05-16 18:13:59 -04:00
Ilya Kreymer	a1ef93a46a	version: bump to 1.5.0 for release!	2023-05-16 17:36:58 +02:00
Ilya Kreymer	ebee5e1788	version: bump to 1.5.0-beta.4	2023-05-12 07:34:50 +02:00
sua yoo	f250293794	Fix workflow edit page not loading (#848 ) * fix workflow not loading * don't add hash if editing * remove controller	2023-05-12 07:33:35 +02:00
sua yoo	98d82184e6	Fix superadmin running crawls views (#846 ) - Updates superadmin "Running Crawls" to show active crawls (starting, waiting, running, stopping) and sort by start by default - Navigates to crawl workflow watch view on clicking crawl item - Adds "Copy Crawl ID" to crawl actions for easy paste into "Jump to crawl" - Navigates to crawl workflow watch when jumping to crawl	2023-05-11 08:15:52 +02:00
Ilya Kreymer	d8b36c0ae2	version: bump to 1.5.0-beta.3	2023-05-11 03:05:46 +02:00
sua yoo	a6435ae3d0	Improve Workflow Detail tab and button UX (#840 ) - Adds primary action button next to "Actions" dropdown - Switches "Edit Workflow Settings" button to icon button - Redirects user to "Watch Crawl" tab when starting crawl - Now uses crawl ID from `data.started` in API `/run` response for more responsive UI - Keeps "Watch Crawl" tab navigation button in list but disable when crawl is not running - Also handles watch view when workflow is not running to cover navigational edge cases - Adds banner in "Crawls" list to direct users to the Watch Crawl when workflow is running - Shows notification when crawl is done to make redirect to Crawls tab smoother - Uses workflow scale when updating crawl scale - Removes "All" from "View: All Finished Crawls" on Finished Crawl page for wording consistency	2023-05-11 02:57:38 +02:00
Ilya Kreymer	d1e5b0a021	version: bump to 1.5.0-beta.2	2023-05-10 14:55:35 +02:00
sua yoo	42794cad46	Add stop crawl confirmation dialog (#841 ) * switch dialog control * wait for workflow update to complete before showing dialog * add stop dialog * close scale after save * update crawl text	2023-05-10 07:21:16 +02:00
Ilya Kreymer	82b21b6813	frontend crawl stopping improvements (#836 ) (#838 ) * frontend crawl stopping improvements (#836) - support new backend 'stopping' property - for now, keep 'stopping' indicator state when crawl is running but stopping set to true	2023-05-08 23:52:49 -07:00
Ilya Kreymer	2cae065c46	Add Waiting state on the backend and frontend (#839 ) * operator: add waiting state - add pods as related objects - inspect pod status, set crawl status to 'waiting' if no pods are running frontend: - frontend support for 'waiting' state - show waiting icon from mocks --------- Co-authored-by: Henry Wilkinson <henry@wilkinson.graphics>	2023-05-08 17:05:01 -07:00

1 2 3 4 5 ...

385 Commits