browsertrix

Author	SHA1	Message	Date
sua yoo	370b8cbd4d	Set max pages to API default (#739 )	2023-04-04 08:47:37 -07:00
Ilya Kreymer	2b0d5ff8b3	misc frontend build fixes: playwright version + chunking (#740 ) * misc frontend build fixes: - fix playwright version to be consistent to fix playwright test - chunking: set max number of chunks generated * lock playwright version * remove intl polyfill --------- Co-authored-by: sua yoo <sua@suayoo.com>	2023-04-03 21:27:44 -07:00
Ilya Kreymer	1c47a648a9	Max page limit override (#737 ) * more page limit: update to #717, instead of setting --limit in each crawlconfig, apply override --maxPageLimit setting, implemented in crawler, to override individually configured page limit * update tests, no longer returning 'crawl_page_limit_exceeds_allowed'	2023-04-03 14:01:32 -07:00
Tessa Walsh	3b99bdf26a	Update nightly test fixtures to use Seed objects (#734 )	2023-04-03 16:21:25 -04:00
Tessa Walsh	e9b61c632d	Add pageSize to pagination format (#736 )	2023-04-03 15:57:47 -04:00
Ilya Kreymer	887cb16146	Allow configurable max pages per crawl in deployment settings (#717 ) * backend: max pages per crawl limit, part of fix for #716: - set 'max_pages_crawl_limit' in values.yaml, default to 100,000 - if set/non-0, automatically set limit if none provided - if set/non-0, return 400 if adding config with limit exceeding max limit - return limit as 'maxPagesPerCrawl' in /api/settings - api: /all/crawls - add runningOnly=0 to show all crawls, default to 1/true (for more reliable testing) tests: add test for 'max_pages_per_crawl' setting - ensure 'limit' can not be set higher than max_pages_per_crawl - ensure pages crawled is at the limit - set test limit to max 2 pages - add settings test - check for pages.jsonl and extraPages.jsonl when crawling 2 pages	2023-03-28 16:26:29 -07:00
Sara Tavares	948cce3d30	Add README.md related to run playwright tests locally (#722 )	2023-03-28 16:08:28 -07:00
Tessa Walsh	4724754efc	Filter and sort crawl and workflow list API endpoints in backend (#724 ) * Re-implement pagination and paginate crawlconfig revs First step toward simplifying pagination to set us up for sorting and filtering of list endpoints. This commit removes fastapi-pagination as a dependency. * Migrate all HttpUrl seeds to Seeds This commit also updates the frontend to always use Seeds and to fix display issues resulting from the change. * Filter and sort crawls and workflows Crawls: - Filter by createdBy (via userid param) - Filter by state (comma-separated string for multiple values) - Filter by first_seed, name, description - Sort by started, finished, fileSize, firstSeed - Sort descending by default to match frontend Workflows: - Filter by createdBy (formerly userid) and modifiedBy - Filter by first_seed, name, description - Sort by created, modified, firstSeed, lastCrawlTime * Add crawlconfigs search-values API endpoint and test	2023-03-28 17:55:40 -04:00
Sara Tavares	36cfb2591f	ci: fix version related to @playwright/test (#729 ) * fix version, add resolutions to have fixed playwright version	2023-03-28 14:30:36 -07:00
sua yoo	25e4da2522	fix: enable semibold variable	2023-03-28 12:17:34 -07:00
sua yoo	8033061540	Leave trailing slash in seed URLs (#731 )	2023-03-27 14:46:59 -07:00
Tessa Walsh	e293e98ac3	Fix migration to avoid jobType KeyError (#727 ) * Fix migration to avoid KeyError * Use .get() for other optional fields	2023-03-27 13:52:05 -07:00
sua yoo	bca67c74e2	chore: format frontend files with prettier	2023-03-27 11:05:19 -07:00
Sara Tavares	48163db5d3	ci: fix version playwright version for tests (#725 )	2023-03-26 21:57:06 -07:00
Sara Tavares	b61592b5ed	CI: Add Playwright UI e2e tests + CI (#614 ) Adds Playwright for UI tests. Basic Playwright test to login. Playwright Github Action. --------- Co-authored-by: sua yoo <sua@suayoo.com>	2023-03-22 16:23:22 -07:00
sua yoo	e8f88a797b	Remove new issue project automation config (#718 )	2023-03-21 13:49:34 -07:00
sua yoo	5f5bb5ea6e	Allow users to set workflow description (#708 )	2023-03-21 13:40:23 -07:00
Tessa Walsh	4136bdad2e	Add optional description to crawl configs and return in crawl endpoints (#707 )	2023-03-21 15:39:09 -04:00
sua yoo	0b0bae00c8	chore: add PR template for UI changes	2023-03-21 11:32:36 -07:00
Sara Tavares	3fa93b01b8	ci: Create proofread-action.yaml (#714 )	2023-03-20 21:08:56 -07:00
Ilya Kreymer	ba70d3227e	version: update to 1.4.0-beta.1	2023-03-17 21:14:42 -07:00
Ilya Kreymer	07e9f51292	backend: update queue apis to work with new sorted queue apis (also b… (#712 ) * backend: update queue apis to work with new sorted queue apis (also backwards compatible to existing apis) designed for browsertrix-crawler 0.9.0-beta.1 but also backwards compatible with older list-based queue as well	2023-03-17 21:11:17 -07:00
sua yoo	b9a24fa5e2	Combine watch crawl with crawl queue (#710 ) - crawl queue and watch page are now part of single view - exclusions can be edited via 'Edit Exclusions' popup	2023-03-17 21:04:08 -07:00
sua yoo	03e9b2aba5	Disable copy tags menu item if no tags (#709 )	2023-03-16 19:45:04 -07:00
sua yoo	0009ce8bf6	fix limit fields (#704 )	2023-03-14 18:28:13 -07:00
Ilya Kreymer	de9212eec7	exclusions editor fix: (#692 ) - backend: fix updating model after exclusions change - frontend: don't check for new_cid, just success - fixes #691	2023-03-10 22:36:10 -08:00
D. Lee	7528f2ec6d	Add lightweight logging mode (#668 ) Enabled with `logging.fileMode`: true - disables elasticsearch, kibana and ingress - only enables fluentd to write logs in the node's volume - lightweight logging into files (in JSON format and compressed in gzip) - log file rotation (default: rotating files every 4 hours, retention 3 days)	2023-03-10 14:34:37 -08:00
Ilya Kreymer	86ca9c4bac	backend: Fix for total crawl time limit. (#665 ) * backend: fix for total crawl timelimit: - time limit is computed for total job run time - when limit is exceeded, job starts to stop crawls gracefully, equivalent to 'stop crawl' operation - fix for #664 * rename crawl-timeout -> crawl_expire_time * fix lint	2023-03-10 11:43:16 -08:00
sua yoo	8ca4276c57	Migrate crawl config frontend -> workflow (#686 )	2023-03-10 11:39:42 -08:00
sua yoo	fecdc6229d	Improve crawl queue pagination UX (#680 ) * switches to infinite scroll for crawl queue	2023-03-09 12:18:26 -08:00
sua yoo	934ee18044	chore: switch actions for issue assign automation addresses #658	2023-03-08 10:01:00 -08:00
Ilya Kreymer	c2fa78859b	permissions: allow user with 'viewer' permissions to access read-only crawlconfig apis (#687 ) addresses issue in #653, fixes #685	2023-03-08 09:29:25 -08:00
sua yoo	666c28f420	Limit organization name length (#671 )	2023-03-08 09:21:48 -08:00
Ilya Kreymer	544346d1d4	backend: make crawlconfigs mutable! (#656 ) (#662 ) * backend: make crawlconfigs mutable! (#656) - crawlconfig PATCH /{id} can now receive a new JSON config to replace the old one (in addition to scale, schedule, tags) - exclusions: add / remove APIs mutate the current crawlconfig, do not result in a new crawlconfig created - exclusions: ensure crawl job 'config' is updated when exclusions are added/removed, unify add/remove exclusions on crawl - k8s: crawlconfig json is updated along with scale - k8s: stateful set is restarted by updating annotation, instead of changing template - crawl object: now has 'config', as well as 'profileid', 'schedule', 'crawlTimeout', 'jobType' properties to ensure anything that is changeable is stored on the crawl - crawlconfigcore: store share properties between crawl and crawlconfig in new crawlconfigcore (includes 'schedule', 'jobType', 'config', 'profileid', 'schedule', 'crawlTimeout', 'tags', 'oid') - crawlconfig object: remove 'oldId', 'newId', disallow deactivating/deleting while crawl is running - rename 'userid' -> 'createdBy' - remove unused 'completions' field - add missing return to fix /run response - crawlout: ensure 'profileName' is resolved on CrawlOut from profileid - crawlout: return 'name' instead of 'configName' for consistent response - update: 'modified', 'modifiedBy' fields to set modification date and user modifying config - update: ensure PROFILE_FILENAME is updated in configmap is profileid provided, clear if profileid=="" - update: return 'settings_changed' and 'metadata_changed' if either crawl settings or metadata changed - tests: update tests to check settings_changed/metadata_changed return values add revision tracking to crawlconfig: - store each revision separate mongo db collection - revisions accessible via /crawlconfigs/{cid}/revs - store 'rev' int in crawlconfig and in crawljob - only add revision history if crawl config changed migration: - update to db v3 - copy fields from crawlconfig -> crawl - rename userid -> createdBy - copy userid -> modifiedBy, created -> modified - skip invalid crawls (missing config), make createdBy optional (just in case) frontend: Update crawl config keys with new API (#681), update frontend to use new PATCH endpoint, load config from crawl object in details view --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net> Co-authored-by: sua yoo <sua@webrecorder.org> Co-authored-by: sua yoo <sua@suayoo.com>	2023-03-07 20:36:50 -08:00
sua yoo	d3bb524971	Fix missing crawl config name (#683 )	2023-03-07 19:13:56 -08:00
sua yoo	ebce2ec384	fix: show crawl start date in local time	2023-03-07 16:05:00 -08:00
sua yoo	91e415fac2	Hide file size when crawl is running (#648 )	2023-03-07 16:02:19 -08:00
sua yoo	85416e2ca2	Fix crawl config name in "run now" alert (#673 )	2023-03-06 15:11:04 -08:00
sua yoo	3b61266eed	chore: switch to issue node ID proposed fix for update-project-column	2023-03-06 12:32:08 -08:00
sua yoo	ba2d8db413	chore: fix update-project-column org	2023-03-06 12:27:05 -08:00
sua yoo	0007e9bf0b	chore: remove operation from gh action see: https://github.com/github/update-project-action/pull/50	2023-03-06 12:24:45 -08:00
sua yoo	1e3b384e31	chore: update assign issue automation action	2023-03-06 12:18:28 -08:00
Tessa Walsh	e98c7172a9	Paginate API list endpoints (#659 ) * Paginate API list endpoints fastapi-pagination is pinned to 0.9.3, the latest release that plays nicely with pinned versions of fastapi and fastapi-users. * Increase page size via overriden Params and Page classes * update api resource list keys --------- Co-authored-by: sua yoo <sua@suayoo.com>	2023-03-06 14:41:25 -05:00
sua yoo	31dc5c56c9	chore: update add-to-project action version	2023-03-06 11:40:28 -08:00
sua yoo	18abc84484	chore: update project automation action	2023-03-06 11:38:10 -08:00
Ilya Kreymer	ace4e79e3f	version: bump version to 1.4.0-beta.0	2023-03-06 10:20:56 -08:00
Henry Wilkinson	52106b1339	Merge pull request #666 from webrecorder/frontend-detail-nav-button-update	2023-03-06 13:11:08 -05:00
sua yoo	a112f467b3	Update frontend/src/pages/org/crawl-detail.ts	2023-03-06 08:37:18 -08:00
Henry Wilkinson	7e1276fd0d	Remove duplicate `gap` value	2023-03-03 16:27:16 -05:00
Henry Wilkinson	e4a178ff74	Updates crawl details navigation - Adds icons to details nav items - Adds replay glyph icon - Hides "Replay" & "Files" pages if the crawl is running - Updates border radius 3px → 4px - Updates colour values, aligns with mockups - Replaces `margin` from menu items with `gap` values - Removes animation Prettier made some spacing adjustments, I also moved some lines around so they're all in the same spot now. 😬	2023-03-02 16:23:37 -05:00

505 Commits