browsertrix/backend/btrixcloud/migrations
Ilya Kreymer 1570011ec7
compute top page origins for each collection (#2483)
A quick PR to fix #2482:
- compute topPageHosts as part of existing collection stats compute
- store top 10 results in collection for now.
- display in collection About sidebar
- fixes #2482 

Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>
2025-05-08 14:22:40 -07:00
..
__init__.py Fix migration to avoid duplicate collection slugs and names (#2318) 2025-01-21 14:23:32 -08:00
migration_0001_archives_to_orgs.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0002_crawlconfig_crawlstats.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0003_mutable_crawl_configs.py Add created date to Organization and fix datetimes across backend (#1921) 2024-07-15 19:46:32 -07:00
migration_0004_config_seeds.py Pydantic 2.x update + type fixes + python 3.12 (#1947) 2024-07-22 17:23:03 -07:00
migration_0005_operator_scheduled_jobs.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0006_precompute_crawl_stats.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0007_colls_and_config_update.py Reformat with Black for 2025 ruleset (#2349) 2025-01-29 16:57:06 -05:00
migration_0008_precompute_crawl_file_stats.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0009_crawl_types.py Change crawl.reviewStatus to 1-5 scale int (#1664) 2024-04-09 17:51:06 -07:00
migration_0010_collection_total_size.py
migration_0011_crawl_timeout_configmap.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0012_notes_to_description.py
migration_0013_crawl_name.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0014_to_collection_ids.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0015_org_storage_usage.py
migration_0016_operator_scheduled_jobs_v2.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0017_storage_by_type.py
migration_0018_usernames.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0019_org_slug.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0020_org_storage_refs.py
migration_0021_profile_filenames.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0022_partial_complete.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0023_available_extra_exec_mins.py Add crawl pages and related API endpoints (#1516) 2024-02-28 12:11:35 -05:00
migration_0024_crawlerchannel.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0025_workflow_db_configmap_fixes.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0026_crawl_review_status.py Change crawl.reviewStatus to 1-5 scale int (#1664) 2024-04-09 17:51:06 -07:00
migration_0027_profile_modified.py Add migration to set profile modified date (#1832) 2024-05-29 15:56:27 -04:00
migration_0028_page_files_errors.py Backend: Move page file and error counts to crawl replay.json endpoint (#1868) 2024-06-20 19:02:57 -07:00
migration_0029_remove_workflow_configmaps.py Remove Crawl Workflow Configmaps (#1894) 2024-06-28 15:25:23 -07:00
migration_0030_user_invites_flatten.py Refactor Invites and Registration, Flatten Per-User Invites (#1902) 2024-07-02 15:13:27 -07:00
migration_0031_org_created.py Add created date to Organization and fix datetimes across backend (#1921) 2024-07-15 19:46:32 -07:00
migration_0032_dupe_org_names.py Ensure org name and slug uniqueness is case-insensitive (#1929) 2024-07-18 15:30:12 -07:00
migration_0033_crawl_quota_states.py Standardize handling of storage and execution time quotas (#1969) 2024-07-25 12:49:11 -07:00
migration_0034_drop_invalid_crc.py remove crc32 from CrawlFile (#1980) 2024-07-30 11:23:15 -07:00
migration_0035_fix_failed_logins.py fix resetting of invalid logins: (#2002) 2024-08-07 12:36:06 -07:00
migration_0036_coll_visibility.py Make changes to collections to support publicly listed collections (#2164) 2025-01-13 15:15:47 -08:00
migration_0037_upload_pages.py Rework crawl page migration + MongoDB Query Optimizations (#2412) 2025-02-20 15:26:11 -08:00
migration_0038_org_last_crawl_finished.py Add last crawl and subscription status indicators to org list (#2273) 2025-01-14 10:57:06 -05:00
migration_0039_coll_slugs.py Fix migration to avoid duplicate collection slugs and names (#2318) 2025-01-21 14:23:32 -08:00
migration_0040_archived_item_page_count.py feat: Update collection sorting, metadata, stats (#2327) 2025-01-23 13:32:23 -05:00
migration_0041_pages_snapshots.py feat: Update collection sorting, metadata, stats (#2327) 2025-01-23 13:32:23 -05:00
migration_0042_page_filenames.py Rework crawl page migration + MongoDB Query Optimizations (#2412) 2025-02-20 15:26:11 -08:00
migration_0043_unset_file_expireat.py Better cacheing of presigned URLs + support for thumbnails (#2446) 2025-03-03 12:05:23 -08:00
migration_0044_coll_stats.py compute top page origins for each collection (#2483) 2025-05-08 14:22:40 -07:00