browsertrix/backend/btrixcloud
Tessa Walsh 39d99e7c5d
Add support for custom link selectors to backend (#2346)
Related to #2152 

This PR adds backend support for custom link selectors via `selectLinks`
on the crawl workflow config. Tests have been updated as well.

It also adds `selectLinks` to the frontend in a minimal and for now
hardcoded way that we can use as a basis for proper frontend support
moving forward.

---------

Co-authored-by: Ilya Kreymer <ikreymer@users.noreply.github.com>
2025-02-13 22:22:27 -08:00
..
migrations Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352) 2025-02-05 15:50:04 -05:00
operator Fix max pages quota setting and display (#2370) 2025-02-10 16:15:21 -08:00
__init__.py
auth.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
background_jobs.py Add missing os import 2025-01-13 15:15:48 -08:00
basecrawls.py Add initial pages + pagesQuery endpoint to /replay.json APIs (#2380) 2025-02-13 16:53:47 -08:00
colls.py Add initial pages + pagesQuery endpoint to /replay.json APIs (#2380) 2025-02-13 16:53:47 -08:00
crawlconfigs.py Fix max pages quota setting and display (#2370) 2025-02-10 16:15:21 -08:00
crawlmanager.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
crawls.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
db.py Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352) 2025-02-05 15:50:04 -05:00
emailsender.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
invites.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
k8sapi.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
main_bg.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
main_op.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
main.py Add collection page list/search endpoint (#2354) 2025-02-10 16:44:37 -08:00
models.py Add support for custom link selectors to backend (#2346) 2025-02-13 22:22:27 -08:00
ops.py Add collection page list/search endpoint (#2354) 2025-02-10 16:44:37 -08:00
orgs.py feat: Update collection sorting, metadata, stats (#2327) 2025-01-23 13:32:23 -05:00
pages.py misc fixes: cors + disable buffering for uploads (#2395) 2025-02-13 19:38:20 -08:00
pagination.py
profiles.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
storages.py Mark all pages from pages.jsonl as seeds (#2390) 2025-02-13 16:54:30 -08:00
subs.py
uploads.py quickfix: fix typo (missing self) that did not make it into #2351 2025-01-30 13:11:42 -08:00
users.py
utils.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
version.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
webhooks.py