Baidu official webmaster help comments-webpage inclusion issues
The day before this Baidu update, I found out when searching the money site website
Tip: The number of relevant web pages found is an estimate, does not represent the actual number of results, and is for reference only. Webmaster help
This webmaster help is worth reading. Baidu added this sentence to remind everyone. But as far as I know, many SEO people, even Baidu SEO people, have not studied it carefully, so Souqian feels it is necessary to take everyone to read it and make comments at the same time.
As for why you should read it? Give me an example. Do you know what those at home and abroad who study China’s current affairs and politics must read? It’s the People’s Daily! Because the People’s Daily is the most authoritative official newspaper. Especially the People's Daily editorial.
So let's read Baidu Webmaster Help together, the full name is Baidu Search Help Center - Web Search Help - Webmaster FAQ, URL: http://www.baidu.com/search/guide.html
Today’s content is about webpage inclusion issues, and the emphasis is on Souqian SEO’s comments.
How do I get my website (independent website or blog) to be indexed by Baidu? How do I check whether my website is indexed by Baidu?
Looking at this title, the explanation of website is an independent website or blog. The blog here refers to the blog opened by the webmaster on major blog sites. In the past, Baidu had a relatively high weight on this kind of blog, especially Baidu Space and Sina Blog. But recently Baidu has lowered the weight of this kind of blog. If you want to use a blog, it is better to use an independent blog.
Baidu will include websites and web pages that meet the user's search experience.
This sentence actually answers the question above. Complying with user search experience is key. Both websites and web pages are mentioned here. So Souqian emphasizes that there are two levels of inclusion. The first is website inclusion. The main measure is the inclusion of the main domain name, such as www.seolabs.net.cn; the second is the inclusion of individual web pages.
To help Baidu Spider discover your site faster, you can also submit your website's entrance URL to us. The submission address is: http://www.baidu.com/search/url_submit.html. You only need to submit the homepage, no detailed content pages are required.
/This sentence tells everyone to go to the Baidu login portal to submit your website. The word "faster" means that even if you don't submit it, it may still be included, as long as Baidu spider can reach your website along a certain path.
Baidu's webpage inclusion mechanism is only related to the value of the webpage and has nothing to do with commercial factors such as bidding ranking.
The reason for this statement is that it is generally believed that there is something fishy about Baidu’s natural search.
Whether Baidu has included your website can be checked by executing site syntax. Directly enter site:your domain name in Baidu search, such as site:www.baidu.com. If the site syntax query can query the results, then your website has been Included by Baidu.
The number of search results obtained by site syntax is only an estimate and is for reference only.
This sentence is very important. The first is to tell you how to determine whether your site has been included, using site syntax. Some netizens directly enter the domain name of their website, and when they find that it is not included, they jump to the conclusion that their website is not included or has been K. Only by remembering the site can the problem be explained. The second is this example site:www.baidu.com. It must be mentioned here that site:www.baidu.com and site:baidu.com are different. The reason is that the one without www and the one with www are two domain names. Generally, a result page such as site:baidu.com includes the inclusion of site:www.baidu.com and other subdomains. Third, it is just an estimated value for reference only. This sentence is true, but it is actually useless. What else can we see if we don’t look at the site?
How to prevent my webpage from being indexed by Baidu?
Not many people care about this issue, because many sites are not included in Baidu at all.
Baidu strictly follows the search engine Robots protocol (for details, see http://www.robotstxt.org/).
You can set up a Robots file to restrict all web pages or web pages in some directories of your website from being included in Baidu. For specific writing methods, see: How to write Robots files.
If you set up a Robots file to disable crawling after your website has been indexed by Baidu, the new Robots file will usually take effect within 48 hours, and new web pages after it takes effect will no longer be indexed. It should be noted that robots.txt prohibits the inclusion of content that has been previously included by Baidu, and it may take several months to be removed from the search results.
This sentence answers why some URLs that no longer exist can still exist on Baidu for several months. Google has done a good job in this regard. It supports users to delete included pages themselves.
If your need to refuse inclusion is very urgent, you can also send an email to [email protected] to request processing.
Why are some private pages on my website without links, even pages that require access permission, included in Baidu?
The important thing about this question is that it actually answers why some sites can be included without external or internal links.
Baidu Spider crawls web pages through links between web pages.
The types of links between web pages include, in addition to page links within the site, there are also mutual links between different websites. Therefore, even if some web pages cannot be accessed through internal links on your website, if there are links to these pages on other people's websites, these pages will still be indexed by search engines.
The reason why it is included is that there is such a link path. Not necessarily something you can easily spot. Personally, I think Baidu will crawl by referring to domain name resolution data. So some people say that even though their website is not linked to, Baidu has included it, as if they are awesome. In fact, you didn't notice it, there must be a link path. Is Baidu Spider airborne?
Baidu Spider's access rights are the same as those of ordinary users. Therefore, spiders do not have permission to access content that ordinary users do not have permission to access. There are two reasons why it seems that some access-restricted content is included in Baidu:
A. The content has no permission restrictions when accessed by Spider, but after crawling, the permissions of the content have changed.
B. This content has permission restrictions, but due to website security vulnerabilities, users can directly access it through some special paths. Once such a path is published on the Internet, Spider will follow this path to grab restricted content.
If you do not want these private contents to be included in Baidu, on the one hand, you can restrict it through the Robots agreement; on the other hand, you can also contact [email protected] for resolution.
Why are my website inclusion numbers decreasing?
A must-read for webmasters who are often troubled by Baidu inclusion issues.
The server where your website is located is unstable. Spider was unable to crawl the web page when checking for updates and was temporarily removed.
Your site doesn't match the user's search experience.
The first sentence tells you that server stability is very important for inclusion, but it also tells you that this is temporary and will come back when your server is stable. Therefore, webmasters who have this kind of problem should solve the server problem immediately, and then have a good attitude, because they will come back. The second sentence is still a matter of user experience, which will be discussed in detail later.
Why does my webpage disappear from Baidu search results?
A must read for everyone who works on Baidu. It is normal to be raped on Baidu. You must carefully study the reasons for being raped. If the following conditions are not met, then you should have a better attitude and you will come back.
Baidu does not promise that all web pages can be searched from Baidu.
If your webpage cannot be searched from Baidu for a long time, or suddenly disappears from Baidu's search results, the possible reasons are:
There are actually two problems with this. One is that the new site is not included in Baidu; the other is that it is deleted by Baidu.
A. Your webpage does not meet the user’s search experience
B. The server where your website is located is unstable and has been temporarily removed by Baidu. After it becomes stable, the problem will be solved.
C. The content of your webpage does not comply with national laws and regulations.
D. Other technical issues
A will not talk about it for now, but B also mentioned the server issue, so I would like to remind everyone that the stability of server space is always the first priority, and under this premise, speed should be considered. C is a matter of content. Pornography, gambling, drug abuse, and reactionary activities are not acceptable in the country. Even if Baidu reluctantly includes it, it will be killed by the filing. D. Other technical problems are actually Baidu's problems in many cases. Baidu's technology is not as good as Google's and is inherently unstable.
The following statements are false and baseless:
A. If you participate in Baidu PPC but do not renew, you will disappear from Baidu search results.
B. Participating in advertising projects of other search engines will disappear from Baidu search results.
C. It competes with Baidu-owned websites and will disappear from Baidu search results.
D. The traffic obtained from Baidu is too large and will disappear from Baidu search results.
I won’t talk about these things because they are said to be fishy.
What kind of web pages will be considered by Baidu to be of no value and will not be included in Baidu or disappear from existing search results?
This paragraph is actually a counterexample to the user experience.
Baidu only includes web pages that are valuable to users. Any change in the presence or absence of any web page in the search results is the result of calculation and adjustment by machine algorithms. Baidu will definitely not welcome the following types of web pages:
First, value is the essence of user experience; second, they are all the result of machine algorithm calculation and adjustment. I don’t agree with this statement. Baidu’s manual intervention is very serious. The third is to make it clear that what is below is a minefield and should not be touched.
A. The web page does a lot of processing for search engines rather than users, so that the content users see in the search results is completely different from the actual content of the page, or the web page obtains inappropriate rankings in the search results, causing users to Feeling cheated.
If there are many such pages in your website, this may affect the page inclusion and ranking of your entire website.
B. Web pages are highly repetitive content copied from the Internet.
C. The webpage contains content that does not comply with Chinese laws and regulations.
A is don’t cheat. B is don’t copy it yourself, at least pseudo-original. C means the content is legal.
If my website disappears from Baidu search results due to cheating, is there any chance it will be re-included?
Any website that has been completely revised will have a chance to be re-included by Baidu. Baidu will automatically evaluate the processed sites on a regular basis and re-include those that meet the conditions.
If you tell the webmaster who was cheated, you still have a chance. But the word "regular" tells you that the time to be re-included can be accurate, because you don't know how it is scheduled.
It should be noted that Baidu’s technology and product departments are only responsible for user search experience. The following statements are wrong and baseless:
A. If I become an advertiser or affiliate website of Baidu, I can be re-included.
B. If I give Baidu some money, I can be included again.
C. If I know someone from Baidu, I can be re-included
My website has been updated, but the content included in Baidu has not been updated. What should I do?
Baidu will automatically update all web pages regularly (including removing dead links, updating domain name changes, and updating content changes). So please wait patiently for a while, and the changes on your website will be noticed and corrected by Baidu.
Baidu update problem. Mention the word "regular". In fact, it is the update rule we are talking about. At present, there are four small updates every week, and the Thursday at the end of the month is the big update. For example, May 28th is a Thursday at the end of May.
Why is the number of my website included in Baidu different from other search engines?
Under normal circumstances, this is a normal phenomenon. Different search engines have different algorithms for judging the value of web pages.
Therefore, many people often ask Souqian why their own website is included more or less on Baidu than on Google. The reason is that the algorithm is different.
Souqian SEO will continue to comment on Baidu Webmaster Help in the next time, and after the end, it will be compiled into an e-book to share with everyone.
Please indicate when reprinting it from SEO Lab.