Feedfetcher
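Feedfetcher is how Google crawls RSS or Atom feeds for Google News and PubSubHubbub. Feedfetcher stores and periodically refreshes feeds that are requested by users of an app or service. Only podcast feeds are indexed in Google Search; however, a feed that doesn't follow the Atom or RSS specification may still be indexed. The answers below cover the most common questions about how this user-controlled feed fetcher works.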
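How do I request that Google not retrieve some or all of my site's feeds?
When users add a service or app that uses Feedfetcher data, Google's Feedfetcher attempts to obtain the content of the feed so that it can be displayed. Because Feedfetcher requests come from explicit action by human users, and not from automated crawlers, Feedfetcher ignores robots.txt rules.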
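If your feed is publicly available, Google can't restrict users from accessing it. One solution is to configure your site to serve a 404, 410, or other error status message to the Feedfetcher-Google user agent.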
If your feed is provided by a blog or site hosting service, work directly with that service to restrict access to your feed.
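The user-agent block described above can be sketched as a small piece of WSGI middleware. This is a minimal illustration under stated assumptions, not Google's recommended configuration: `block_feedfetcher`, `feed_app`, and the choice of `410 Gone` are hypothetical names and choices made for this example, and the case-insensitive substring match is an assumption about how the `Feedfetcher-Google` token appears in the request header.

```python
BLOCKED_AGENT_TOKEN = "Feedfetcher-Google"  # user agent token named in the text


def block_feedfetcher(app, status="410 Gone"):
    """Wrap a WSGI app so that Feedfetcher requests get an error status.

    `status` is an assumption for this sketch; "404 Not Found" works the same way.
    """
    def middleware(environ, start_response):
        user_agent = environ.get("HTTP_USER_AGENT", "")
        # Match case-insensitively, since header capitalization may vary.
        if BLOCKED_AGENT_TOKEN.lower() in user_agent.lower():
            body = b"Feed not available.\n"
            start_response(status, [
                ("Content-Type", "text/plain; charset=utf-8"),
                ("Content-Length", str(len(body))),
            ])
            return [body]
        # Every other client is passed through to the wrapped application.
        return app(environ, start_response)
    return middleware


def feed_app(environ, start_response):
    """Hypothetical stand-in for the application that serves the feed."""
    body = b"<rss version='2.0'><channel></channel></rss>"
    start_response("200 OK", [
        ("Content-Type", "application/rss+xml"),
        ("Content-Length", str(len(body))),
    ])
    return [body]


application = block_feedfetcher(feed_app)
```

The `application` object can be served by any WSGI server, for example the standard library's `wsgiref.simple_server.make_server`. Sites that don't run Python would express the same rule in their web server's own configuration.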
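How often will Feedfetcher retrieve my feeds?
Feedfetcher shouldn't retrieve feeds from most sites more than once every hour on average. Some frequently updated sites may be refreshed more often. Note, however, that because of network delays, Feedfetcher may briefly appear to retrieve your feeds more frequently.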
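Why is Feedfetcher trying to download incorrect links from my server, or from a domain that doesn't exist?
Feedfetcher retrieves feeds at the request of services or apps installed by users. It is possible that a user has requested a feed URL that does not exist.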
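Why is Feedfetcher downloading information from my "secret" web server?
Feedfetcher retrieves feeds at the request of services or apps installed by users. The request may have come from a user who knows about your "secret" server, or from one who typed its address by mistake.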
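Why isn't Feedfetcher obeying my robots.txt file?
Feedfetcher retrieves feeds only after users have explicitly started a service or app that requests data from the feed. Feedfetcher behaves as a direct agent of the human user, not as a robot, so it ignores robots.txt entries. Because Feedfetcher acts as an agent for multiple users, it conserves bandwidth by requesting a common feed only once on behalf of all users who requested it through an app or service. The common feed formats are RSS and Atom.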
You can prevent Feedfetcher from crawling your site by configuring your server to serve a 404, 410, or other error status message to the Feedfetcher-Google user agent.
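Why are there visits from multiple machines at Google.com, all with user-agent Feedfetcher?
Feedfetcher is designed to be distributed across several machines to improve performance and to scale as the web grows. To cut down on bandwidth usage, the machines used are often located near the sites that they're retrieving in the network.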
Can you tell me the IP addresses from which Feedfetcher makes requests so that I can filter my logs?
The IP addresses used by Feedfetcher are included in the user-triggered-fetchers-google.json object.
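For log filtering, a check against the published ranges might look like the sketch below. It assumes the JSON object has a top-level `prefixes` list whose entries carry either an `ipv4Prefix` or an `ipv6Prefix` key; confirm that shape against the actual file before relying on it. The helper names `load_networks` and `is_listed` are hypothetical, and the sample prefixes are reserved documentation ranges, not Google's real ones.

```python
import ipaddress
import json

# Sample data in the shape ASSUMED for user-triggered-fetchers-google.json.
# The prefixes are reserved documentation ranges, not real Google ranges.
SAMPLE_JSON = """
{
  "creationTime": "2025-01-01T00:00:00.000000",
  "prefixes": [
    {"ipv4Prefix": "192.0.2.0/24"},
    {"ipv6Prefix": "2001:db8::/32"}
  ]
}
"""


def load_networks(json_text):
    """Parse the assumed `prefixes` list into ip_network objects."""
    data = json.loads(json_text)
    networks = []
    for entry in data.get("prefixes", []):
        prefix = entry.get("ipv4Prefix") or entry.get("ipv6Prefix")
        if prefix:
            networks.append(ipaddress.ip_network(prefix))
    return networks


def is_listed(ip_text, networks):
    """Return True if the address falls inside any of the given networks."""
    address = ipaddress.ip_address(ip_text)
    # An IPv4 address is never contained in an IPv6 network, and vice versa.
    return any(address in network for network in networks)


networks = load_networks(SAMPLE_JSON)
print(is_listed("192.0.2.17", networks))
print(is_listed("198.51.100.1", networks))
```

In practice the JSON text would come from a downloaded copy of the file, and `is_listed` would be applied to the client address of each log entry whose user agent claims to be Feedfetcher.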
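Why is Feedfetcher downloading the same page on my site multiple times?
In general, Feedfetcher downloads only one copy of each file from your site during a given feed retrieval. Very occasionally, the machines are stopped and restarted, which may cause Feedfetcher to retrieve again pages that it visited recently.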
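What kinds of links does Feedfetcher crawl?
Unlike normal web crawlers, Feedfetcher doesn't discover links to crawl at all; instead, it crawls a single URL that's provided to it by users of a service or app that uses Feedfetcher.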
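My Feedfetcher question isn't answered here. Where can I get more help?
If you're still having trouble, try posting your question in the Search Central forum.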
Last updated 2025-08-04 UTC.