browsertrix/backend/btrixcloud
Tessa Walsh 0e9e70f3a3
Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352)
Adds `filename` to pages, pointed to the WACZ file those files come
from, as well as depth, favIconUrl, and isSeed. Also adds an idempotent
migration to backfill this information for existing pages, and increases
the backend container's startupProbe time to 24 hours to give it sufficient
time to finish the migration.
---------

Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>
2025-02-05 15:50:04 -05:00
..
migrations Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352) 2025-02-05 15:50:04 -05:00
operator Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
__init__.py
auth.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
background_jobs.py Add missing os import 2025-01-13 15:15:48 -08:00
basecrawls.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
colls.py Ensure collection stats are updated when WACZ is added on upload (#2351) 2025-01-30 13:05:56 -08:00
crawlconfigs.py Validate exclusion regexes on backend (#2316) 2025-01-23 13:32:54 -05:00
crawlmanager.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
crawls.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
db.py Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352) 2025-02-05 15:50:04 -05:00
emailsender.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
invites.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
k8sapi.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
main_bg.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
main_op.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
main.py feat: Update collection sorting, metadata, stats (#2327) 2025-01-23 13:32:23 -05:00
models.py Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352) 2025-02-05 15:50:04 -05:00
ops.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
orgs.py feat: Update collection sorting, metadata, stats (#2327) 2025-01-23 13:32:23 -05:00
pages.py Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352) 2025-02-05 15:50:04 -05:00
pagination.py
profiles.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
storages.py Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352) 2025-02-05 15:50:04 -05:00
subs.py Send subscription cancelation email (#2234) 2024-12-12 11:52:38 -08:00
uploads.py quickfix: fix typo (missing self) that did not make it into #2351 2025-01-30 13:11:42 -08:00
users.py Add superuser endpoint to get user emails with org info (#2211) 2024-12-09 16:38:01 -08:00
utils.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
version.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
webhooks.py Add webhooks for qaAnalysisStarted, qaAnalysisFinished, and crawlReviewed (#1974) 2024-07-25 16:53:49 -07:00