{"id":297727,"date":"2021-07-13T15:00:04","date_gmt":"2021-07-13T12:00:04","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/what-is-data-scraping-and-why-is-it-a-threat-cloudsavvy-it\/"},"modified":"2021-07-13T15:00:04","modified_gmt":"2021-07-13T12:00:04","slug":"what-is-data-scraping-and-why-is-it-a-threat-cloudsavvy-it","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/what-is-data-scraping-and-why-is-it-a-threat-cloudsavvy-it\/","title":{"rendered":"#What is Data Scraping, And Why Is It a Threat? \u2013 CloudSavvy IT"},"content":{"rendered":"<p><strong>&#8220;#What is Data Scraping, And Why Is It a Threat? 
\u2013 CloudSavvy IT&#8221;<\/strong><\/p>\n<div id=\"article-content-area\">\n<figure style=\"width: 3000px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"type:primaryImage wp-image-12663 size-full\" data-pagespeed-lazy-src=\"https:\/\/www.cloudsavvyit.com\/p\/uploads\/2021\/07\/1bcdb125.jpg?width=1198&amp;trim=1,1&amp;bg-color=000&amp;pad=1,1\" alt=\"mosaic of faces\" width=\"3000\" height=\"1504\" src=\"https:\/\/www.shutterstock.com\/image-photo\/hundreds-multiracial-people-crowd-portraits-headshots-1734128516\" data-credittext=\"fizkes\/Shutterstock.com\" onload=\"pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\" onerror=\"this.onerror=null;pagespeed.lazyLoadImages.loadIfVisibleAndMaybeBeacon(this);\"\/><figcaption class=\"wp-caption-text\"><span class=\"type:primaryImage imagecredit\"><a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.shutterstock.com\/image-photo\/hundreds-multiracial-people-crowd-portraits-headshots-1734128516\">fizkes\/Shutterstock.com<\/a><\/span><\/figcaption><\/figure>\n<p>Data scraping is yet another way data can be extracted from your website, portal, or platform. Surprisingly, the legality of data scraping is a gray area. Here\u2019s how to defend against it.<\/p>\n<h2 id=\"what-are-data-scraping-and-web-scraping\"><span class=\"ez-toc-section\" id=\"What_Are_Data_Scraping_and_Web_Scraping\"><\/span>What Are Data Scraping and Web Scraping?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Data scraping and web scraping are two different automated techniques that achieve the same end. They harvest data from systems owned by third parties. They extract the data, collate it, and store it in ways that facilitate its reuse. 
Typically this means putting it into a database or into a portable format like CSV.<\/p>\n<p><em>Data scraping<\/em> makes use of APIs provided by the platform that is being scraped, even though the terms of use of the API almost certainly prohibit the gathering of data\u00a0<em>en masse<\/em>.<\/p>\n<p><em>Web scraping<\/em> works by making requests for web pages just like a web browser does. But instead of displaying the webpage, the software extracts the data it is interested in, saves it, and requests another page. The terms and conditions of most websites and certainly all <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/social-mediaa\/\" data-internallinksmanager029f6b8e52c=\"1\" title=\"Social Media\" target=\"_blank\" rel=\"noopener\">social media<\/a> platforms prohibit data and web scraping. Despite this, the user numbers associated with social media platforms make them attractive targets for scrapers.<\/p>\n<p>Scraping can be performed by cybercriminals who want to collect login credentials, payment details, or personally identifiable information. It can also be used for legitimate reasons such as aggregating <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/news\/\" data-internallinksmanager029f6b8e52c=\"2\" title=\"News\" target=\"_blank\" rel=\"noopener\">news<\/a> stories, monitoring your resellers to see that they don\u2019t break pricing agreements, or for market analysis. It\u2019s also used for collecting business intelligence, locating sales leads, and underpinning marketing and advertising.<\/p>\n<p><strong>RELATED:<\/strong> <strong><em>How To Defend Yourself Against API Attacks<\/em><\/strong><\/p>\n<h2 id=\"big-numbers---scraping-and-cybercrime\"><span class=\"ez-toc-section\" id=\"Big_Numbers_%E2%80%93_Scraping_and_Cybercrime\"><\/span>Big Numbers \u2013 Scraping and Cybercrime<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>In 2020, the number of personal records scraped from YouTube was 4 million. 
The figure for TikTok was over ten times higher, at 42 million. That same year, 191 million personal records were scraped from Instagram. All of these platforms prohibit the scraping of data.<\/p>\n<p>In April 2021, LinkedIn hit the headlines when a database of\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/redirect.viglink.com\/?key=204a528a336ede4177fff0d84a044482&amp;u=https%3A%2F%2Fnews.linkedin.com%2F2021%2Fapril%2Fan-update-from-linkedin\">500 million personal records<\/a>\u00a0was put up for sale on the dark web. Microsoft, which owns LinkedIn, said there had been no security breach. The database was the result of data scraping.<\/p>\n<p>The database contained each affected member\u2019s:<\/p>\n<ul>\n<li>Real name<\/li>\n<li>Gender<\/li>\n<li>LinkedIn profile URLs<\/li>\n<li>Registered email addresses<\/li>\n<li>Landline and smartphone numbers<\/li>\n<li>Physical addresses<\/li>\n<li>Geolocation details<\/li>\n<li>Usernames for other social media accounts<\/li>\n<\/ul>\n<p>In June 2021, a database of\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/redirect.viglink.com\/?key=204a528a336ede4177fff0d84a044482&amp;u=https%3A%2F%2Fnews.linkedin.com%2F2021%2Fjune%2Fan-update-from-linkedin\">700 million personal records<\/a>\u00a0appeared. That\u2019s over 90 percent of LinkedIn\u2019s membership. As well as holding an extra 200 million records, the second database is cross-referenced with data scraped from other sources, providing a more detailed picture of the affected individuals.<\/p>\n<p>Created by cybercriminals for cybercriminals, the database can be bought, for $5,000 at the time of writing, on dark web marketplaces and forums. 
The information it contains will be used for crimes such as phishing attacks, spear-phishing attacks, social engineering attacks, and other financial frauds.<\/p>\n<p><strong>RELATED:<\/strong> <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.howtogeek.com\/209396\/how-to-prevent-identity-thieves-from-opening-accounts-in-your-name\/\"><strong><em>How to Stop Identity Thieves from Opening Accounts in Your Name<\/em><\/strong><\/a><\/p>\n<h2 id=\"commercial-scraping-is-problematic-too\"><span class=\"ez-toc-section\" id=\"Commercial_Scraping_is_Problematic_Too\"><\/span>Commercial Scraping is Problematic Too<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>What about the commercial web and data scraping that takes place? There are companies you can engage to scrape data for you. Alternatively, you can use data parsing toolkits such as the freely available\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.crummy.com\/software\/BeautifulSoup\/#Download\">Beautiful Soup<\/a>\u00a0Python library to create your own web scraping applications.<\/p>\n<p>The problem is, you\u2019re still almost certainly violating the rules of the platform you\u2019re scraping. And the platforms will try to defend themselves. If they don\u2019t, their members, customers, or other users are liable to leave their platform.<\/p>\n<p>When you choose to provide personal data to an online service, you\u2019re entrusting that organization with your data. You\u2019re not giving permission for anyone else to come and hoover up that data and use it as they see fit. When organizations scrape your data, you don\u2019t know who they are, what they\u2019re going to do with the data, how they\u2019re going to safeguard and protect it, or who they are going to share it with.<\/p>\n<p>LinkedIn took hiQ Labs Inc.\u00a0to court over hiQ\u2019s data and web scraping. 
In their defense, hiQ claimed that the data they were scraping from LinkedIn was in the public domain and that meant it was up for grabs. In 2019, the U.S. Court of Appeals for the Ninth Circuit ruled in hiQ\u2019s favor. But on June 14, 2021, the <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.supremecourt.gov\/orders\/courtorders\/061421zor_6j36.pdf\">Supreme Court vacated the Ninth Circuit\u2019s decision<\/a>. As of July 2021, data scraping and web scraping for non-criminal purposes are in a legal gray area.<\/p>\n<p>And things get more complicated when you take into account the data legislation that applies to the members of the platform. For example, whether an EU citizen\u2019s data is in the public domain or not, you can\u2019t harvest it, store it, and process it digitally without a lawful basis, as defined by the GDPR, for doing so. Also, there\u2019s a difference between \u201cpublicly visible\u201d and \u201cin the public domain.\u201d<\/p>\n<p>Under the GDPR there are only two lawful bases that could conceivably apply to scraping data. One is \u201cconsent\u201d and the other is \u201clegitimate interest.\u201d Plainly, consent has not been given by the individuals, so that\u2019s off the table. And it would be extremely difficult to argue that you had a legitimate interest in scraping the data that didn\u2019t trample on the legitimate interests of the data subjects, and their data privacy rights and freedoms. The GDPR demands that you uphold those rights and freedoms and not ride roughshod over them.<\/p>\n<p>The GDPR protects the data privacy rights of EU citizens regardless of where the processing is taking place. An organization in the U.S. that is scraping data from another U.S.-based organization must still comply with the GDPR if personally identifiable information of EU citizens is in the data being scraped.<\/p>\n<p>Data protection legislation from other regions adopts the same stance, with some small variances. 
The legality of scraping is tenuous, to say the least. We\u2019re likely to see more formal challenges.<\/p>\n<p><strong>RELATED:<\/strong> <strong><em>How Data Breaches and Leaks Can Affect Your Employees<\/em><\/strong><\/p>\n<h2 id=\"how-to-protect-your-organization\"><span class=\"ez-toc-section\" id=\"How_To_Protect_Your_Organization\"><\/span>How To Protect Your Organization<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>There are steps and measures that you can put in place to make life more difficult for the data scrapers.<\/p>\n<h3 id=\"terms-of-use-and-conditions\"><span class=\"ez-toc-section\" id=\"Terms_of_Use_and_Conditions\"><\/span>Terms of Use and Conditions<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Although Terms and Conditions and Terms of Use won\u2019t do anything to stop cybercriminals and might not even stop \u201clegitimate\u201d scraping, it still makes sense to explicitly prohibit the gathering, processing, storing, or sharing of any data, including but not limited to personally identifiable data.<\/p>\n<p>It might stop some people from scraping. If it does, that was an easy win. Even if it doesn\u2019t, it\u2019ll give you a legal advantage if matters need to be resolved in court.<\/p>\n<h3 id=\"disable-hotlinking\"><span class=\"ez-toc-section\" id=\"Disable_Hotlinking\"><\/span>Disable Hotlinking<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Displaying images and other media on one website by linking back to the original website is called hotlinking. It uses the original website\u2019s bandwidth and other resources to serve the media.<\/p>\n<p>Web scrapers usually retrieve images directly, so disabling hotlinking won\u2019t affect their scraping activities. But, if any scraping takes place that relies on hotlinking, it at least prevents insult from being added to injury. 
They won\u2019t be pinching even more bandwidth when your stolen data is being viewed.<\/p>\n<h3 id=\"use-csrf-tokens\"><span class=\"ez-toc-section\" id=\"Use_CSRF_Tokens\"><\/span>Use CSRF Tokens<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>The automated systems that do the scraping make successive HTTPS requests to your website. They crawl from page to page, following links. They also create URLs to try. If they spot a pattern, such as URLs that differ by a single digit, the software works its way through the predictable combinations until the sequence fails.<\/p>\n<p>Introducing Cross-Site Request Forgery tokens to your website can fox all but the smartest of scraping software. A CSRF token is a unique identifier sent from the web server to the client making the request. Under normal circumstances, the client is a browser.<\/p>\n<p>The client must send the CSRF token back to the server when it makes its next request. The server will not respond to any requests that don\u2019t include the correct CSRF token. Much web scraping software cannot handle CSRF tokens, so this can be an effective measure to limit your exposure.<\/p>\n<h3 id=\"rate-limit-page-requests\"><span class=\"ez-toc-section\" id=\"Rate_Limit_Page_Requests\"><\/span>Rate Limit Page Requests<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Rate limiting sets thresholds on the number of requests that can be made from a client within a given period of time. Typically this is done by IP address, with restrictions on how many page requests or downloads can be made per second.<\/p>\n<h3 id=\"use-dedicated-anti-scraping-software\"><span class=\"ez-toc-section\" id=\"Use_Dedicated_Anti-Scraping_Software\"><\/span>Use Dedicated Anti-Scraping Software<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Commercial packages are available that will detect scraping activity and block it. They use techniques that far surpass simply identifying a client by its IP address. 
They use machine learning techniques to identify bot activity by measuring actions such as the speed at which the client fills in fields and forms, the way the mouse moves across the page, and the way the client moves through the website. Activity that looks non-human is blocked.<\/p>\n<h3 id=\"require-human-interaction\"><span class=\"ez-toc-section\" id=\"Require_Human_Interaction\"><\/span>Require Human Interaction<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Forcing clients to create an account and using CAPTCHA or other challenge-response tests can help in rejecting automated scrapers.<\/p>\n<h3 id=\"use-tight-lipped-apis\"><span class=\"ez-toc-section\" id=\"Make_Your_APIs_Tight-Lipped\"><\/span>Make Your APIs Tight-Lipped<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Secure your APIs, and limit their capabilities so that they return the minimum amount of data to satisfy the API call they\u2019re servicing.<\/p>\n<p>It\u2019s appealing to developers to provide data-rich APIs, and to over-provide rather than under-provide. This places the responsibility on the client to parse out the information they want and to reject the rest. It reduces the chance of rework being required because the API didn\u2019t provide a particular piece of information. But that verbosity plays into the scrapers\u2019 hands.<\/p>\n<p>Instead, make your APIs lean and mean. Provide what was asked for, and no more. You can rate limit API clients, too.<\/p>\n<h3 id=\"use-decoy-links\"><span class=\"ez-toc-section\" id=\"Use_Decoy_Links\"><\/span>Use Decoy Links<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Hidden links on a webpage will be invisible to genuine users, but web scraping software will find and follow all links. If a client follows a hidden link, it is likely an automated process. 
You can then block them.<\/p>\n<h2 role=\"heading\" aria-level=\"2\"><span class=\"ez-toc-section\" id=\"Time_Will_Tell\"><\/span>Time Will Tell<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Cybercriminals, by definition, don\u2019t care about the law. Commercial operations don\u2019t have a choice. If the hiQ v. LinkedIn case establishes a legal precedent that scraping violates the Computer Fraud and Abuse Act,\u00a0it\u2019ll only affect \u201ccommercial\u201d scraping.\u00a0Data scraping by cybercriminals will continue.<\/p>\n<p>So whatever the outcome, you\u2019ll still need to protect your organization.<\/p>\n<\/div>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/www.cloudsavvyit.com\/12576\/what-is-data-scraping-and-why-is-it-a-threat\/\" target=\"_blank\" 
rel=\"noopener\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&#8220;#What is Data Scraping, And Why Is It a Threat? \u2013 CloudSavvy IT&#8221; fizkes\/Shutterstock.com Data scraping is yet another way data can be extracted from your website, portal, or platform. Surprisingly, the legality of data scraping is a gray area. Here\u2019s how to defend against it. What Are Data Scraping and Web Scraping? Data scraping&#8230;<\/p>\n","protected":false},"author":1,"featured_media":297728,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/www.cloudsavvyit.com\/p\/uploads\/2021\/07\/1bcdb125.jpg","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[],"class_list":["post-297727","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/297727","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=297727"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/297727\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/297728"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=297727"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=297727"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=297
727"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}