URL removal explained, Part IV: Tracking your requests and what not to remove
Monday, May 03, 2010
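In this final installment of our URL removal series, let's talk about following up on your removal requests, as well as when not to use Google's URL removal tool. If you haven't already, I recommend reading the previous posts in this series:
- Part I: Removing URLs and directories
- Part II: Removing and updating cached content
- Part III: Removing content you don't own
You might also be interested in reading about managing what information is available about you online.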
Understanding the status of your requests
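Once you've submitted a removal request, it will appear in your list of requests. You can check the status of your requests at any time to see whether the content has been removed, or whether the request is still pending or was denied.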
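If a request was denied, you should see a "Learn more" link next to it explaining why that particular request was denied. Since different types of removals have different requirements, the reason for a denial can vary. The "Learn more" link should help you figure out what you need to change to make your request successful. For example, you may need to change the URL in question so that it meets the requirements for the type of removal you requested; or, if you can't do that, you may need to request a different type of removal (one whose requirements your URL currently meets).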
If a request has been marked "Removed" but you still see that content in search results, check the following:
-
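Is the URL that's appearing in search results the exact same URL that you submitted for removal? It's fairly common for the same, or similar, content to appear on multiple URLs on a site. You may have successfully removed one URL, but still see others containing that same content.
Solution: Request removal of the other URL(s) in question. See our help center article about which URL you should use for removal/block requests for help.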
-
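Keep in mind that URLs are case sensitive, so requesting removal of https://www.example.com/embarrassingstuff.html is not the same as requesting removal of https://www.example.com/EmbarrassingStuff.html.
Solution: Request removal of the other URL(s) in question. See our help center article about which URL you should use for removal/block requests for help.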
-
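When a request is marked "Removed," that can mean different things depending on what type of request you submitted. If you requested removal of an entire URL, then "Removed" should mean that the entire URL no longer appears in our search results. If you requested removal of the cached copy of a URL, "Removed" means that the cached copy has been removed and will no longer appear in search results; but the URL itself may still appear.
Solution: Double-check what type of removal you requested by looking at the "Removal Type" column. If you requested a cache removal but you want the entire URL gone, make sure the URL meets the requirements for complete removal and then file a new request for complete removal of the URL.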
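The first two checks above come down to an exact, case-sensitive comparison between the URLs you submitted and the URLs still showing in search results. Here is a minimal Python sketch of that comparison, assuming you keep your own lists of both; the function names and sample lists are hypothetical and not part of any Google tool. Lowercasing the whole URL to spot near misses is a rough heuristic used only for illustration.

```python
def find_unremoved_variants(submitted_urls, urls_in_results):
    """Return URLs still in results that were never submitted for removal.

    The comparison is exact and case-sensitive: URLs that differ only
    in letter case are treated as different URLs.
    """
    submitted = set(submitted_urls)
    return [url for url in urls_in_results if url not in submitted]


def near_misses(submitted_urls, urls_in_results):
    """Pair each unremoved URL with a submitted URL that differs only by case."""
    # Hypothetical heuristic: index submitted URLs by their lowercased form.
    by_lowercase = {url.lower(): url for url in submitted_urls}
    pairs = []
    for url in find_unremoved_variants(submitted_urls, urls_in_results):
        match = by_lowercase.get(url.lower())
        if match is not None:
            pairs.append((url, match))
    return pairs


submitted = ["https://www.example.com/embarrassingstuff.html"]
in_results = [
    "https://www.example.com/embarrassingstuff.html",
    "https://www.example.com/EmbarrassingStuff.html",
]

print(find_unremoved_variants(submitted, in_results))
print(near_misses(submitted, in_results))
```

Any URL reported by `near_misses` is a likely candidate for a separate removal request, since the earlier request did not cover it.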
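When not to use the URL removal tool
-
To clean up cruft, like old pages that return 404. The tool is intended for URLs that urgently need to be removed, such as confidential data that was accidentally exposed. If you recently made changes to your site and just have some outdated URLs in the index, Google's crawlers will see this as we recrawl your URLs, and those pages will naturally drop out of our search results over time. There's no need to request an urgent removal through this tool.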
-
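To remove crawl errors from your Webmaster Tools account. The removal tool removes URLs from Google's search results, not from your Webmaster Tools account. There's currently no way for you to manually remove URLs from this report; they will drop out naturally over time as we stop crawling URLs that repeatedly return 404.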
-
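To "start from scratch" with your site. If you're worried that your site may have a penalty, or you want to "start from scratch" after purchasing a domain from someone else, we don't recommend trying to use the URL removal tool to remove your entire site and then "start over." Search engines gather a lot of information from other sites (such as who links to you, or what words they use to describe your site) and use this to help understand your site. Even if we could remove everything we currently know about your site, a lot of it would come back exactly the same once we'd recrawled all the other sites that help us understand your site and put it in context. If you're worried that your domain has some bad history, we recommend filing a reconsideration request letting us know what you're worried about and what has changed (such as that you've acquired the domain from someone else, or that you've changed certain aspects of your site).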
-
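To take your site "offline" after hacking. If your site was hacked and you want to get rid of bad URLs that got indexed, you can use the URL removal tool to remove any new URLs that the hacker created, for example, https://www.example.com/buy-cheap-cialis-skq3w598.html. But we don't recommend removing your entire site, or removing URLs that you'll eventually want indexed; instead, simply clean up the hacked content and let us recrawl your site so that we can reindex the new, cleaned-up content as soon as possible. This article contains more details on how to deal with hacking.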
-
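To get the right "version" of your site indexed. When a request to remove https://www.example.com/tattoo.html is accepted, http://www.example.com/tattoo.html is also removed. The same is true of the www and non-www versions of your URL or site. This is because the same content is often available at each of these URLs, and we realize that most webmasters and searchers don't want these duplicates appearing in search results. In short, the URL removal tool should not be used as a canonicalization tool. It won't keep your favorite version; it will remove all versions (http/https and www/non-www) of a URL.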
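To make the last point concrete, the sketch below lists the four variants that a single removal request would cover according to the description above. It is an illustration only, written in Python with the standard `urllib.parse` module; the helper name `removal_variants` is hypothetical, and the code assumes a plain hostname with no port or credentials.

```python
from urllib.parse import urlsplit, urlunsplit


def removal_variants(url):
    """List the http/https and www/non-www variants of a URL.

    Assumes the network location is a plain hostname (no port, no userinfo).
    """
    parts = urlsplit(url)
    host = parts.netloc
    # Strip a leading "www." so both host forms can be rebuilt below.
    bare_host = host[4:] if host.lower().startswith("www.") else host
    variants = []
    for scheme in ("http", "https"):
        for netloc in (bare_host, "www." + bare_host):
            variants.append(
                urlunsplit((scheme, netloc, parts.path, parts.query, parts.fragment))
            )
    return variants


for variant in removal_variants("https://www.example.com/tattoo.html"):
    print(variant)
```

If any of the printed URLs is one you want to keep in the index, a removal request is the wrong tool for the job.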
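We hope this series has answered your questions about removing content from Google's search results, and helped you troubleshoot any issues that may arise. Join us in our Help Forum if you still have questions.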
Posted by Susan Moskwa, Webmaster Trends Analyst