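Google Search is a powerful tool for finding useful information on the open web. Unfortunately, not every web page is created with good intent. Many are built specifically to deceive people, and we fight that kind of content every day. To keep you safe and to protect your search experience from disruptive content and malicious behavior, Search invested in many innovations in 2020.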
Fighting spam smarter
We have been fighting spam since the early days of Search, and recent advances in artificial intelligence (AI) offer great potential to transform our approach.
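By combining our deep knowledge of spam with AI, last year we were able to build our own spam-fighting AI, which is very effective at catching both known and new spam trends. For example, we have reduced sites with auto-generated and scraped content by more than 80% compared to a couple of years ago.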
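Thanks to our AI-aided automated systems, very little spam makes it into the top results for a search. We estimate that these systems keep more than 99% of visits from Search completely free of spam. For the tiny share that remains, our teams take manual action and use what they learn to further improve the automated systems.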
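To make the "more than 99%" figure concrete, here is a minimal sketch of how a spam-free share could be computed from visit records, and how the remaining visits could be set aside for manual review. This post does not describe how Google actually measures it. The `Visit` record, its `spam_results_seen` field, both function names, and the sample data are all hypothetical and purely illustrative.

```python
from dataclasses import dataclass


@dataclass
class Visit:
    """One search visit. `spam_results_seen` is a hypothetical per-visit count."""
    query: str
    spam_results_seen: int


def spam_free_share(visits: list[Visit]) -> float:
    """Return the fraction of visits in which no spam result was shown."""
    if not visits:
        raise ValueError("need at least one visit")
    clean = sum(1 for v in visits if v.spam_results_seen == 0)
    return clean / len(visits)


def needs_manual_review(visits: list[Visit]) -> list[Visit]:
    """Return the visits where spam slipped through (candidates for manual action)."""
    return [v for v in visits if v.spam_results_seen > 0]


# Made-up sample: 1,000 visits, 4 of which surfaced a spam result.
sample = [Visit(f"query {i}", 0) for i in range(996)]
sample += [Visit(f"spammy query {i}", 1) for i in range(4)]

share = spam_free_share(sample)
print(f"{share:.1%} of visits were spam-free")
print(f"{len(needs_manual_review(sample))} visits queued for manual review")
```

In this toy sample, 996 of 1,000 visits are clean, so the share is 99.6%, and the four remaining visits are the ones a review team would look at.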
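Advances in AI also helped tremendously with understanding the content of sites. One example is how we improved the way we rank product review, informational, and shopping sites. Google Search is a great way to research and find products before you buy, so we reward content that shows more in-depth research and offers more useful information, to make sure you get the most useful information for your next purchase.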
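Despite the significant progress in our spam-fighting efforts, spammers remain highly motivated to develop new techniques that evade our detection. We are always working to improve our technology and protect people from new types of abuse, and user reports help us a great deal. Have you recently felt misled, scammed, or spammed while using Google Search? Do you think we could do a better job of preventing those experiences? If so, please share feedback through the spam report, along with the query and any other information that might be useful.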
Posted by Cody Kwok, Principal Engineer, on Thursday, April 29, 2021