{"id":610074,"date":"2024-02-23T18:00:00","date_gmt":"2024-02-23T15:00:00","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/"},"modified":"2024-02-23T18:00:00","modified_gmt":"2024-02-23T15:00:00","slug":"how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/","title":{"rendered":"#How to speed up site migrations with AI-powered redirect mapping"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_84 counter-hierarchy ez-toc-counter ez-toc-custom ez-toc-container-direction\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-6a26bb446db9e\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #dd3333;color:#dd3333\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #dd3333;color:#dd3333\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-6a26bb446db9e\" checked aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 
ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Website_migrations_often_involve_painstaking_redirect_mapping_but_AI_can_help_Heres_a_walkthrough_of_the_process_to_help_you_save_time\" >Website migrations often involve painstaking redirect mapping, but AI can help. Here&#8217;s a walkthrough of the process to help you save time.<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Should_you_use_AI_for_redirect_mapping\" >Should you use AI for redirect mapping?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Advantages_of_using_AI_for_redirect_mapping\" >Advantages of using AI for redirect mapping<\/a><ul class='ez-toc-list-level-3' ><li class='ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Speed\" >Speed<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Scalability\" >Scalability<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Efficiency\" >Efficiency<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Accuracy\" 
>Accuracy<\/a><\/li><\/ul><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-8\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Disadvantages_of_using_AI_for_redirect_mapping\" >Disadvantages of using AI for redirect mapping<\/a><ul class='ez-toc-list-level-3' ><li class='ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-9\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Over-reliance\" >Over-reliance<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-10\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Training\" >Training<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-11\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Output_variance\" >Output variance\u00a0<\/a><\/li><\/ul><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-12\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#A_step-by-step_guide_for_URL_mapping_with_AI\" >A step-by-step guide for URL mapping with AI<\/a><ul class='ez-toc-list-level-3' ><li class='ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-13\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Tools_well_be_using\" >Tools we\u2019ll be using<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-14\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_1_Crawl_your_live_website_with_Screaming_Frog\" >Step 1: Crawl your live website with Screaming Frog<\/a><\/li><li 
class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-15\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_2_Export_HTML_pages_with_200_Status_Code\" >Step 2: Export HTML pages with 200 Status Code<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-16\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_3_Repeat_steps_1_and_2_for_your_staging_website\" >Step 3: Repeat steps 1 and 2 for your staging website<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-17\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Optional_Find_and_replace_your_staging_site_domain_or_subdomain_to_match_your_live_site\" >Optional: Find and replace your staging site domain or subdomain to match your live site<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-18\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_4_Run_the_Google_Colab_Python_script\" >Step 4: Run the Google Colab Python script<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-19\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_5_Upload_origincsv_and_destinationcsv\" >Step 5: Upload origin.csv and destination.csv<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-20\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_6_Select_fields_to_use_for_similarity_matching\" >Step 6: Select fields to use for similarity matching<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a 
class=\"ez-toc-link ez-toc-heading-21\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_7_Watch_the_magic\" >Step 7: Watch the magic<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-22\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_7_Download_outputcsv_and_sort_by_similarity_score\" >Step 7: Download output.csv and sort by similarity_score<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-23\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_8_Human-validate_your_results\" >Step 8: Human-validate your results<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-24\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#Step_9_Tweak_and_repeat\" >Step 9: Tweak and repeat<\/a><\/li><\/ul><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-25\" href=\"https:\/\/buradabiliyorum.com\/en\/how-to-speed-up-site-migrations-with-ai-powered-redirect-mapping\/#More_AI_to_come\" >More AI to come<\/a><\/li><\/ul><\/nav><\/div>\n<h2 class=\"subhead\" itemprop=\"alternativeHeadline\"><span class=\"ez-toc-section\" id=\"Website_migrations_often_involve_painstaking_redirect_mapping_but_AI_can_help_Heres_a_walkthrough_of_the_process_to_help_you_save_time\"><\/span>Website migrations often involve painstaking redirect mapping, but AI can help. Here&#8217;s a walkthrough of the process to help you save time.<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><\/p>\n<div class=\"bialty-container\">\nMigrating a large website is always daunting. 
Significant traffic is at stake, alongside many moving parts, technical challenges and stakeholder management.<\/p>\n<p>Historically, one of the most onerous tasks in a migration plan has been redirect mapping: the painstaking process of matching URLs on your current site to the equivalent version on the new website.<\/p>\n<p>Fortunately, this task, which previously could involve teams of people combing through thousands of URLs, can be drastically sped up with modern AI models.<\/p>\n<p><!-- \/1038259\/SEL_Post-text --><\/p>\n<div id=\"div-gpt-ad-1693000027709-0\"><\/div>\n<div id=\"post-break\"><\/div>\n<h2 class=\"wp-block-heading\" id=\"h-should-you-use-ai-for-redirect-mapping\"><span class=\"ez-toc-section\" id=\"Should_you_use_AI_for_redirect_mapping\"><\/span>Should you use AI for redirect mapping?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The term \u201cAI\u201d has become somewhat conflated with \u201cChatGPT\u201d over the last year, so to be very clear from the outset, we are not talking about using generative AI\/LLM-based systems to do your redirect mapping.<\/p>\n<p>While there are some tasks that tools like ChatGPT can assist you with, such as writing that tricky regex for the redirect logic, the generative element, which can cause hallucinations, could create accuracy issues for us.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-advantages-of-using-ai-for-redirect-mapping\"><span class=\"ez-toc-section\" id=\"Advantages_of_using_AI_for_redirect_mapping\"><\/span>Advantages of using AI for redirect mapping<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<h3 class=\"wp-block-heading\" id=\"h-speed\"><span class=\"ez-toc-section\" id=\"Speed\"><\/span>Speed<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>The primary 
advantage of using AI for redirect mapping is the sheer speed at which it can be done. An initial map of 10,000 URLs could be produced within a few minutes and human-reviewed within a few hours. Done manually by a single person, the same process would usually take days of work.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-scalability\"><span class=\"ez-toc-section\" id=\"Scalability\"><\/span>Scalability<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Using AI to help map redirects is a method you can use on a site with 100 URLs or over 1,000,000. Large sites also tend to be more programmatic or templated, making similarity matching more accurate with these tools.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-efficiency\"><span class=\"ez-toc-section\" id=\"Efficiency\"><\/span>Efficiency<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>For larger sites, a multi-person job can easily be handled by a single person with the correct knowledge, freeing up colleagues to assist with other parts of the migration.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-accuracy\"><span class=\"ez-toc-section\" id=\"Accuracy\"><\/span>Accuracy<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>While the automated method will get some redirects \u201cwrong,\u201d in my experience the overall accuracy of redirects has been higher, because the output specifies the similarity of each match, giving manual reviewers a guide to where their attention is most needed.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-disadvantages-of-using-ai-for-redirect-mapping\"><span class=\"ez-toc-section\" id=\"Disadvantages_of_using_AI_for_redirect_mapping\"><\/span>Disadvantages of using AI for redirect mapping<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<h3 class=\"wp-block-heading\" id=\"h-over-reliance\"><span class=\"ez-toc-section\" id=\"Over-reliance\"><\/span>Over-reliance<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Using automation tools can make people complacent and over-reliant on the 
output. With such an important task, a human review is always required.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-training\"><span class=\"ez-toc-section\" id=\"Training\"><\/span>Training<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>The script is pre-written and the process is straightforward. However, it will be new to many people and environments such as Google Colab can be intimidating.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-output-variance-nbsp\"><span class=\"ez-toc-section\" id=\"Output_variance\"><\/span>Output variance\u00a0<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>While the output is deterministic, the models will perform better on certain sites than others. Sometimes, the output can contain \u201csilly\u201d errors, which are obvious for a human to spot but harder for a machine.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-a-step-by-step-guide-for-url-mapping-with-ai\"><span class=\"ez-toc-section\" id=\"A_step-by-step_guide_for_URL_mapping_with_AI\"><\/span>A step-by-step guide for URL mapping with AI<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>By the end of this process, we are aiming to produce a spreadsheet that lists \u201cfrom\u201d and \u201cto\u201d URLs by mapping the origin URLs on our live website to the destination URLs on our staging (new) website.<\/p>\n<p>For this example, to keep things simple, we will just be mapping our HTML pages, not additional assets such as CSS or images, although this is also possible.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-tools-we-ll-be-using\"><span class=\"ez-toc-section\" id=\"Tools_well_be_using\"><\/span>Tools we\u2019ll be using<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<ul>\n<li><strong>Screaming Frog Website Crawler<\/strong>: A powerful and flexible website crawler, Screaming Frog is how we collect the URLs and associated metadata we need for the matching.<\/li>\n<li><strong>Google Colab: <\/strong>A free cloud service that uses a Jupyter notebook 
environment, allowing you to run a range of languages directly from your browser without having to install anything locally. Google Colab is how we are going to run our Python scripts to perform the URL matching.<\/li>\n<li><a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/colab.research.google.com\/drive\/1Y4msGtQf44IRzCotz8KMy0oawwZ2yIbT?usp=sharing\"><strong>Automated Redirect Matchmaker for Site Migrations<\/strong><\/a><strong>: <\/strong>The Python script by <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.linkedin.com\/in\/dpe1\/\">Daniel Emery<\/a> that we\u2019ll be running in Colab.<\/li>\n<\/ul>\n<h3 class=\"wp-block-heading\" id=\"h-step-1-crawl-your-live-website-with-screaming-frog\"><span class=\"ez-toc-section\" id=\"Step_1_Crawl_your_live_website_with_Screaming_Frog\"><\/span>Step 1: Crawl your live website with Screaming Frog<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>You\u2019ll need to perform a standard crawl on your website. Depending on how your website is built, this may or may not require a <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.screamingfrog.co.uk\/seo-spider\/tutorials\/crawl-javascript-seo\/#:~:text=If%20you'd%20prefer%20to,Rendering'%20and%20save%20your%20configuration.\">JavaScript crawl<\/a>. 
The goal is to produce a list of as many accessible pages on your site as possible.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1264\" height=\"586\" alt=\"Crawl your live website with Screaming Frog\" class=\"wp-image-437796\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog.png.webp 1264w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog-600x278.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog-800x371.png.webp 800w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog-200x93.png.webp 200w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog-768x356.png.webp 768w\" data-lazy-sizes=\"(max-width: 1264px) 100vw, 1264px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog.png.webp\"><noscript><img fetchpriority=\"high\" decoding=\"async\" width=\"1264\" height=\"586\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog.png.webp\" alt=\"Crawl your live website with Screaming Frog\" class=\"wp-image-437796\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog.png.webp 1264w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog-600x278.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog-800x371.png.webp 800w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog-200x93.png.webp 
200w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Crawl-your-live-website-with-Screaming-Frog-768x356.png.webp 768w\" sizes=\"(max-width: 1264px) 100vw, 1264px\"><\/noscript><\/figure>\n<\/div>\n<h3 class=\"wp-block-heading\" id=\"h-step-2-export-html-pages-with-200-status-code\"><span class=\"ez-toc-section\" id=\"Step_2_Export_HTML_pages_with_200_Status_Code\"><\/span>Step 2: Export HTML pages with 200 Status Code<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Once the crawl has been completed, we want to export all of the found HTML URLs with a 200 Status Code.<\/p>\n<p>Firstly, in the top left-hand corner, we need to select \u201cHTML\u201d from the drop-down menu.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"580\" height=\"282\" alt=\"Screaming Frog - Highlighted- HTML filter\" class=\"wp-image-437797\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Screaming-Frog-Highlighted-HTML-filter.png.webp 580w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Screaming-Frog-Highlighted-HTML-filter-200x97.png.webp 200w\" data-lazy-sizes=\"(max-width: 580px) 100vw, 580px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Screaming-Frog-Highlighted-HTML-filter.png.webp\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"580\" height=\"282\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Screaming-Frog-Highlighted-HTML-filter.png.webp\" alt=\"Screaming Frog - Highlighted- HTML filter\" class=\"wp-image-437797\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Screaming-Frog-Highlighted-HTML-filter.png.webp 580w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Screaming-Frog-Highlighted-HTML-filter-200x97.png.webp 200w\" sizes=\"auto, (max-width: 580px) 100vw, 580px\"><\/noscript><\/figure>\n<\/div>\n<p>Next, click the sliders filter icon in the 
top right and create a filter for <em>Status Codes<\/em> containing 200.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1002\" height=\"366\" alt=\"Highlighted: Custom filter options\" class=\"wp-image-437798\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options.png.webp 1002w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options-600x219.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options-800x292.png.webp 800w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options-200x73.png.webp 200w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options-768x281.png.webp 768w\" data-lazy-sizes=\"(max-width: 1002px) 100vw, 1002px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options.png.webp\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"1002\" height=\"366\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options.png.webp\" alt=\"Highlighted: Custom filter options\" class=\"wp-image-437798\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options.png.webp 1002w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options-600x219.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options-800x292.png.webp 800w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options-200x73.png.webp 200w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Custom-filter-options-768x281.png.webp 768w\" sizes=\"auto, (max-width: 1002px) 100vw, 
1002px\"><\/noscript><\/figure>\n<\/div>\n<p>Finally, click on <em>Export<\/em> to save this data as a CSV.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"521\" height=\"263\" alt=\"Highlighted: Export button\" class=\"wp-image-437799\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Export-button.png.webp 521w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Export-button-200x101.png.webp 200w\" data-lazy-sizes=\"(max-width: 521px) 100vw, 521px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Export-button.png.webp\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"521\" height=\"263\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Export-button.png.webp\" alt=\"Highlighted: Export button\" class=\"wp-image-437799\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Export-button.png.webp 521w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-Export-button-200x101.png.webp 200w\" sizes=\"auto, (max-width: 521px) 100vw, 521px\"><\/noscript><\/figure>\n<\/div>\n<p>This will provide you with a list of our current live URLs and all of the default metadata Screaming Frog collects about them, such as Titles and Header Tags. Save this file as <em>origin.csv<\/em>.<\/p>\n<p><strong>Important note: <\/strong>Your full migration plan needs to account for things such as existing 301 redirects and URLs that may get traffic on your site that are not accessible from an initial crawl. 
This guide is intended only to demonstrate part of this URL mapping process; it is not an exhaustive guide.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-step-3-repeat-steps-1-and-2-for-your-staging-website\"><span class=\"ez-toc-section\" id=\"Step_3_Repeat_steps_1_and_2_for_your_staging_website\"><\/span>Step 3: Repeat steps 1 and 2 for your staging website<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>We now need to gather the same data from our staging website, so we have something to compare against.<\/p>\n<p>If your staging site is password protected, you may need to use features such as <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.screamingfrog.co.uk\/seo-spider\/tutorials\/crawling-password-protected-websites\/\">Screaming Frog\u2019s forms authentication<\/a>, depending on how the site is secured.<\/p>\n<p>Once the crawl has completed, you should export the data and save this file as <em>destination.csv<\/em>.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-optional-find-and-replace-your-staging-site-domain-or-subdomain-to-match-your-live-site\"><span class=\"ez-toc-section\" id=\"Optional_Find_and_replace_your_staging_site_domain_or_subdomain_to_match_your_live_site\"><\/span>Optional: Find and replace your staging site domain or subdomain to match your live site<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>It\u2019s likely your staging website is on a different subdomain, TLD or even domain, so its URLs won\u2019t match our actual destination URLs. 
For this reason, I will use a Find and Replace function on my destination.csv to change the URLs to match the final live site subdomain, domain or TLD.<\/p>\n<p>For example:<\/p>\n<ul>\n<li>My live website is <code>https:\/\/withcandour.co.uk\/<\/code> (origin.csv)<\/li>\n<li>My staging website is <code>https:\/\/testing.withcandour.dev\/<\/code> (destination.csv)<\/li>\n<li>The site is staying on the same domain; it\u2019s just a redesign with different URLs, so I would open destination.csv and find any instance of <code>https:\/\/testing.withcandour.dev<\/code> and replace it with <code>https:\/\/withcandour.co.uk<\/code>.<\/li>\n<\/ul>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"439\" height=\"188\" alt=\"Find and Replace in Excel\" class=\"wp-image-437800\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Find-and-Replace-in-Excel.png 439w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Find-and-Replace-in-Excel-200x86.png.webp 200w\" data-lazy-sizes=\"(max-width: 439px) 100vw, 439px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Find-and-Replace-in-Excel.png\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"439\" height=\"188\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Find-and-Replace-in-Excel.png\" alt=\"Find and Replace in Excel\" class=\"wp-image-437800\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Find-and-Replace-in-Excel.png 439w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Find-and-Replace-in-Excel-200x86.png.webp 200w\" sizes=\"auto, (max-width: 439px) 100vw, 439px\"><\/noscript><\/figure>\n<\/div>\n<p>This means that when the redirect map is produced, the output already contains the correct destination URLs and only the final redirect logic needs to be written.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-step-4-run-the-google-colab-python-script\"><span 
class=\"ez-toc-section\" id=\"Step_4_Run_the_Google_Colab_Python_script\"><\/span>Step 4: Run the Google Colab Python script<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>When you <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/colab.research.google.com\/drive\/1Y4msGtQf44IRzCotz8KMy0oawwZ2yIbT?usp=sharing#scrollTo=jmu3C5OA0g0d\">navigate to the script<\/a> in your browser, you will see it is broken up into several code blocks and hovering over each one will give you a \u201cplay\u201d icon. Use this if you wish to execute one block of code at a time.<\/p>\n<p>However, the script will work perfectly if you simply execute all of the code blocks, which you can do by going to the <em>Runtime<\/em> menu and selecting <em>Run all<\/em>.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"699\" height=\"499\" alt=\"Google Colab Runtime\" class=\"wp-image-437801\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Google-Colab-Runtime.png.webp 699w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Google-Colab-Runtime-473x338.png.webp 473w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Google-Colab-Runtime-158x113.png.webp 158w\" data-lazy-sizes=\"(max-width: 699px) 100vw, 699px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Google-Colab-Runtime.png.webp\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"699\" height=\"499\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Google-Colab-Runtime.png.webp\" alt=\"Google Colab Runtime\" class=\"wp-image-437801\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Google-Colab-Runtime.png.webp 699w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Google-Colab-Runtime-473x338.png.webp 473w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Google-Colab-Runtime-158x113.png.webp 158w\" 
sizes=\"auto, (max-width: 699px) 100vw, 699px\"><\/noscript><\/figure>\n<\/div>\n<p>There are no prerequisites to run the script; it will create a cloud environment and on the first execution in your instance, it will take around one minute to install the required modules.<\/p>\n<p>Each code block will have a small green tick next to it once it is complete, but the third code block will require your input to continue and it\u2019s easy to miss as you\u2019ll likely need to scroll down to see the prompt.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-step-5-upload-origin-csv-and-destination-csv\"><span class=\"ez-toc-section\" id=\"Step_5_Upload_origincsv_and_destinationcsv\"><\/span>Step 5: Upload origin.csv and destination.csv<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"680\" height=\"344\" alt=\"Highlighted: File upload prompt\" class=\"wp-image-437802\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-File-upload-prompt.png 680w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-File-upload-prompt-600x304.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-File-upload-prompt-200x101.png.webp 200w\" data-lazy-sizes=\"(max-width: 680px) 100vw, 680px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-File-upload-prompt.png\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"680\" height=\"344\" 
src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-File-upload-prompt.png\" alt=\"Highlighted: File upload prompt\" class=\"wp-image-437802\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-File-upload-prompt.png 680w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-File-upload-prompt-600x304.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Highlighted-File-upload-prompt-200x101.png.webp 200w\" sizes=\"auto, (max-width: 680px) 100vw, 680px\"><\/noscript><\/figure>\n<\/div>\n<p>When prompted, click <em>Choose files<\/em> and navigate to where you saved your origin.csv file. Once you have selected this file, it will upload and you will be prompted to do the same for your destination.csv.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-step-6-select-fields-to-use-for-similarity-matching\"><span class=\"ez-toc-section\" id=\"Step_6_Select_fields_to_use_for_similarity_matching\"><\/span>Step 6: Select fields to use for similarity matching<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>What makes this script particularly powerful is the ability to use multiple sets of metadata for your comparison.<\/p>\n<p>This means that if you\u2019re changing site architecture and your URL addresses are not comparable, you can run the similarity algorithm on other factors under your control, such as page titles or headings.<\/p>\n<p>Have a look at both sites and try to judge which elements remain fairly consistent between them. 
Generally, I would advise starting simple and adding more fields if you are not getting the results you want.<\/p>\n<p>In my example, we have kept a similar, although not identical, URL naming convention, and our page titles remain consistent as we are copying the content over.<\/p>\n<p>Select the elements you want to use and click <em>Let\u2019s Go!<\/em><\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"587\" height=\"357\" alt=\"Similarity matching fields\" class=\"wp-image-437804\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Similarity-matching-fields.png 587w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Similarity-matching-fields-556x338.png.webp 556w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Similarity-matching-fields-186x113.png.webp 186w\" data-lazy-sizes=\"(max-width: 587px) 100vw, 587px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Similarity-matching-fields.png\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"587\" height=\"357\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Similarity-matching-fields.png\" alt=\"Similarity matching fields\" class=\"wp-image-437804\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Similarity-matching-fields.png 587w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Similarity-matching-fields-556x338.png.webp 556w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Similarity-matching-fields-186x113.png.webp 186w\" sizes=\"auto, (max-width: 587px) 100vw, 587px\"><\/noscript><\/figure>\n<\/div>\n<h3 class=\"wp-block-heading\" id=\"h-step-7-watch-the-magic\"><span class=\"ez-toc-section\" 
id=\"Step_7_Watch_the_magic\"><\/span>Step 7: Watch the magic<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>The script\u2019s main components are all-MiniLM-L6-v2 and FAISS, but what are they and what are they doing?<\/p>\n<p>all-MiniLM-L6-v2 is a small and efficient model within the Microsoft series of MiniLM models, which are designed for natural language processing (NLP) tasks. MiniLM converts the text data we\u2019ve given it into numerical vectors that capture its meaning.<\/p>\n<p>These vectors then enable the similarity search, performed by Facebook AI Similarity Search (FAISS), a library developed by Facebook AI Research for efficient similarity search and clustering of dense vectors. This will quickly find our most similar content pairs across the dataset.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-step-7-download-output-csv-and-sort-by-similarity-score\"><span class=\"ez-toc-section\" id=\"Step_7_Download_outputcsv_and_sort_by_similarity_score\"><\/span>Step 7: Download output.csv and sort by similarity_score<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>The output.csv should automatically download from your browser. 
If you open it, you should have three columns: origin_url, matched_url and similarity_score.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"838\" height=\"363\" alt=\"Output csv example\" class=\"wp-image-437805\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example.png 838w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example-600x260.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example-800x347.png.webp 800w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example-200x87.png.webp 200w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example-768x333.png.webp 768w\" data-lazy-sizes=\"(max-width: 838px) 100vw, 838px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example.png\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"838\" height=\"363\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example.png\" alt=\"Output csv example\" class=\"wp-image-437805\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example.png 838w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example-600x260.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example-800x347.png.webp 800w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example-200x87.png.webp 200w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-example-768x333.png.webp 768w\" sizes=\"auto, (max-width: 838px) 100vw, 838px\"><\/noscript><\/figure>\n<\/div>\n<p>In your favorite spreadsheet software, I would recommend sorting by <strong>similarity_score<\/strong>.\u00a0<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" 
decoding=\"async\" width=\"584\" height=\"266\" alt=\"Excel Sort by similarity score\" class=\"wp-image-437806\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Excel-Sort-by-similarity-score.png 584w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Excel-Sort-by-similarity-score-200x91.png.webp 200w\" data-lazy-sizes=\"(max-width: 584px) 100vw, 584px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Excel-Sort-by-similarity-score.png\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"584\" height=\"266\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Excel-Sort-by-similarity-score.png\" alt=\"Excel Sort by similarity score\" class=\"wp-image-437806\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Excel-Sort-by-similarity-score.png 584w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Excel-Sort-by-similarity-score-200x91.png.webp 200w\" sizes=\"auto, (max-width: 584px) 100vw, 584px\"><\/noscript><\/figure>\n<\/div>\n<p>The similarity score gives you an idea of how good the match is. A similarity score of 1 suggests an exact match. 
<\/p>\n<p>By checking my output file, I immediately saw that approximately 95% of my URLs have a similarity score of more than 0.98, so there is a good chance I\u2019ve saved myself a lot of time.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-step-8-human-validate-your-results\"><span class=\"ez-toc-section\" id=\"Step_8_Human-validate_your_results\"><\/span>Step 8: Human-validate your results<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Pay special attention to the lowest similarity scores on your sheet; this is likely where no good matches can be found.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"846\" height=\"407\" alt=\"Output.csv: Lower-scored similarities\" class=\"wp-image-437807\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities.png 846w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities-600x289.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities-800x385.png.webp 800w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities-200x96.png.webp 200w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities-768x369.png.webp 768w\" data-lazy-sizes=\"(max-width: 846px) 100vw, 846px\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities.png\"><noscript><img loading=\"lazy\" decoding=\"async\" width=\"846\" height=\"407\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities.png\" alt=\"Output.csv: Lower-scored similarities\" class=\"wp-image-437807\" srcset=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities.png 
846w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities-600x289.png.webp 600w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities-800x385.png.webp 800w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities-200x96.png.webp 200w,https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Output-csv-Lower-scored-similarities-768x369.png.webp 768w\" sizes=\"auto, (max-width: 846px) 100vw, 846px\"><\/noscript><\/figure>\n<\/div>\n<p>In my example, there were some poor matches on the team page, which led me to discover that not all of the team profiles had been created on the staging site yet \u2013 a really helpful find.<\/p>\n<p>The script has also helpfully given us redirect recommendations for old blog content we decided to axe and not include on the new website. We now have a suggested redirect should we want to pass that traffic to something related \u2013 that\u2019s ultimately your call.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-step-9-tweak-and-repeat\"><span class=\"ez-toc-section\" id=\"Step_9_Tweak_and_repeat\"><\/span>Step 9: Tweak and repeat<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>If you didn\u2019t get the desired results, I would double-check that the fields you use for matching stay as consistent as possible between sites. 
If not, try a different field or group of fields and rerun.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-more-ai-to-come\"><span class=\"ez-toc-section\" id=\"More_AI_to_come\"><\/span>More AI to come<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>In general, I have been slow to adopt any AI (especially generative AI) into the redirect mapping process, as the cost of mistakes can be high, and AI errors can sometimes be tricky to spot.<\/p>\n<p>However, from my testing, I\u2019ve found these specific AI models to be robust for this particular task and it has fundamentally changed how I approach site migrations.\u00a0<\/p>\n<p>Human checking and oversight are still required, but the amount of time saved with the bulk of the work means you can do a more thorough and thoughtful human intervention and finish the task many hours ahead of where you would usually be.<\/p>\n<p>In the not-too-distant future, I expect we\u2019ll see more specific models that will allow us to take additional steps, including improving the speed and efficiency of the next step, the redirect logic.<\/p>\n<\/div>\n<p><\/p>\n<div class=\"about-author\">\n    About the author<\/p>\n<div class=\"information\">\n<div class=\"author-module\">\n<div class=\"row\">\n<div class=\"col-12 col-lg-3 text-center\">\n<div class=\"avatar\">\n                        <img loading=\"lazy\" decoding=\"async\" class=\"img-fluid rounded-circle avatar-border\" alt=\"Mark Williams-Cook\" width=\"140\" height=\"140\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2023\/08\/Mark-Williams-Cook.jpeg.webp\"><noscript><img loading=\"lazy\" decoding=\"async\" class=\"img-fluid rounded-circle avatar-border\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2023\/08\/Mark-Williams-Cook.jpeg.webp\" alt=\"Mark Williams-Cook\" width=\"140\" height=\"140\"><\/noscript>\n                                            <\/div>\n<\/p><\/div>\n<div class=\"col-12 col-lg-9\">\n<div class=\"about\">\n<div 
class=\"name\">\n                            <strong>Mark Williams-Cook<\/strong>\n                        <\/div>\n<div class=\"row g-2 pt-2\">\n<div class=\"col-auto twitter\">\n                                    <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/twitter.com\/intent\/follow?original_referer=https%3A%2F%2Fsearchengineland.com%2F&amp;region=follow_link&amp;screen_name=thetafferboy&amp;tw_p=followbutton&amp;variant=2.0\" aria-label=\"opens in a new tab\"><i class=\"fab fa-x-twitter\"><\/i><\/a>\n                            <\/div>\n<div class=\"col-auto\">\n                                    <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.linkedin.com\/in\/markseo\/\" aria-label=\"opens in a new tab\"><i class=\"fab fa-linkedin\"><\/i><\/a>\n                                <\/div>\n<\/p><\/div>\n<p>                        Mark Williams-Cook has over 20 years of SEO experience and is co-owner of search agency <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/withcandour.co.uk\">Candour<\/a>, the founder of AlsoAsked, and runs a pet category eCommerce business. Outside of speaking at conferences, Mark has trained over 3,000 SEOs with his Udemy course.                 <\/div>\n<\/p><\/div>\n<\/p><\/div>\n<\/p><\/div>\n<\/p><\/div>\n<\/div>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/searchengineland.com\/site-migrations-ai-powered-redirect-mapping-437793\" target=\"_blank\" rel=\"noopener\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Website migrations often involve painstaking redirect mapping, but AI can help. Here&#8217;s a walkthrough of the process to help you save time. Migrating a large website is always daunting. Big traffic is at stake among many moving parts, technical challenges and stakeholder management. 
Historically, one of the most onerous tasks in a migration plan has&#8230;<\/p>\n","protected":false},"author":1,"featured_media":610075,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/02\/Speeding-up-site-migrations-with-AI-powered-redirect-mapping-800x450.png","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[78070,148084],"class_list":["post-610074","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-seo","tag-technical-optimization"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/610074","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=610074"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/610074\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/610075"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=610074"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=610074"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=610074"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}