browsertrix/backend/btrixcloud/migrations
Tessa Walsh 0e9e70f3a3
Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352)
Adds `filename` to pages, pointed to the WACZ file those files come
from, as well as depth, favIconUrl, and isSeed. Also adds an idempotent
migration to backfill this information for existing pages, and increases
the backend container's startupProbe time to 24 hours to give it sufficient
time to finish the migration.
---------

Co-authored-by: Ilya Kreymer <ikreymer@gmail.com>
2025-02-05 15:50:04 -05:00
..
__init__.py Fix migration to avoid duplicate collection slugs and names (#2318) 2025-01-21 14:23:32 -08:00
migration_0001_archives_to_orgs.py
migration_0002_crawlconfig_crawlstats.py
migration_0003_mutable_crawl_configs.py Add created date to Organization and fix datetimes across backend (#1921) 2024-07-15 19:46:32 -07:00
migration_0004_config_seeds.py Pydantic 2.x update + type fixes + python 3.12 (#1947) 2024-07-22 17:23:03 -07:00
migration_0005_operator_scheduled_jobs.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0006_precompute_crawl_stats.py
migration_0007_colls_and_config_update.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
migration_0008_precompute_crawl_file_stats.py
migration_0009_crawl_types.py
migration_0010_collection_total_size.py
migration_0011_crawl_timeout_configmap.py
migration_0012_notes_to_description.py
migration_0013_crawl_name.py
migration_0014_to_collection_ids.py
migration_0015_org_storage_usage.py
migration_0016_operator_scheduled_jobs_v2.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0017_storage_by_type.py
migration_0018_usernames.py
migration_0019_org_slug.py
migration_0020_org_storage_refs.py
migration_0021_profile_filenames.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0022_partial_complete.py
migration_0023_available_extra_exec_mins.py
migration_0024_crawlerchannel.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0025_workflow_db_configmap_fixes.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0026_crawl_review_status.py
migration_0027_profile_modified.py
migration_0028_page_files_errors.py Backend: Move page file and error counts to crawl replay.json endpoint (#1868) 2024-06-20 19:02:57 -07:00
migration_0029_remove_workflow_configmaps.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0030_user_invites_flatten.py Refactor Invites and Registration, Flatten Per-User Invites (#1902) 2024-07-02 15:13:27 -07:00
migration_0031_org_created.py Add created date to Organization and fix datetimes across backend (#1921) 2024-07-15 19:46:32 -07:00
migration_0032_dupe_org_names.py Ensure org name and slug uniqueness is case-insensitive (#1929) 2024-07-18 15:30:12 -07:00
migration_0033_crawl_quota_states.py Standardize handling of storage and execution time quotas (#1969) 2024-07-25 12:49:11 -07:00
migration_0034_drop_invalid_crc.py remove crc32 from CrawlFile (#1980) 2024-07-30 11:23:15 -07:00
migration_0035_fix_failed_logins.py fix resetting of invalid logins: (#2002) 2024-08-07 12:36:06 -07:00
migration_0036_coll_visibility.py Make changes to collections to support publicly listed collections (#2164) 2025-01-13 15:15:47 -08:00
migration_0037_upload_pages.py Backend work for public collections (#2198) 2025-01-13 15:15:48 -08:00
migration_0038_org_last_crawl_finished.py Add last crawl and subscription status indicators to org list (#2273) 2025-01-14 10:57:06 -05:00
migration_0039_coll_slugs.py Fix migration to avoid duplicate collection slugs and names (#2318) 2025-01-21 14:23:32 -08:00
migration_0040_archived_item_page_count.py feat: Update collection sorting, metadata, stats (#2327) 2025-01-23 13:32:23 -05:00
migration_0041_pages_snapshots.py feat: Update collection sorting, metadata, stats (#2327) 2025-01-23 13:32:23 -05:00
migration_0042_page_filenames.py Add WACZ filename, depth, favIconUrl, isSeed to pages (#2352) 2025-02-05 15:50:04 -05:00