Web page inclusion issues
How do I get my website (independent website or blog) to be indexed by Baidu? How do I check whether my website is indexed by Baidu?
Baidu will include websites and web pages that meet the user's search experience.
To help Baidu Spider discover your site faster, you can also submit your website's entrance URL to us. The submission address is: http://www.baidu.com/search/url_submit.html. You only need to submit the homepage, no detailed content pages are required.
Baidu's webpage inclusion mechanism is only related to the value of the webpage and has nothing to do with commercial factors such as bidding ranking.
Whether Baidu has included your website can be checked by executing site syntax. Directly enter site:your domain name in Baidu search, such as site:www.baidu.com. If the site syntax query can query the results, then your website has been Included by Baidu.
The number of search results obtained by site syntax is only an estimate and is for reference only.
How to prevent my webpage from being indexed by Baidu?
Baidu strictly follows the search engine Robots protocol (for details, see http://www.robotstxt.org/).
You can set up a Robots file to restrict all web pages or web pages in some directories of your website from being included in Baidu. For specific writing methods, see: How to write Robots files.
If you set up a Robots file to disable crawling after your website has been indexed by Baidu, the new Robots file will usually take effect within 48 hours, and new web pages after it takes effect will no longer be indexed. It should be noted that robots.txt prohibits the inclusion of content that has been previously included by Baidu, and it may take several months to be removed from the search results.
If your need to refuse inclusion is very urgent, you can also send an email to [email protected] to request processing.
Why are some private pages on my website without links, even pages that require access permission, included in Baidu?
Baidu Spider crawls web pages through links between web pages.
The types of links between web pages include, in addition to page links within the site, there are also mutual links between different websites. Therefore, even if some web pages cannot be accessed through internal links on your website, if there are links to these pages on other people's websites, these pages will still be indexed by search engines.
Baidu Spider's access rights are the same as those of ordinary users. Therefore, spiders do not have permission to access content that ordinary users do not have permission to access. There are two reasons why it seems that some access-restricted content is included in Baidu:
A. The content has no permission restrictions when accessed by Spider, but after crawling, the permissions of the content have changed.
B. This content has permission restrictions, but due to website security vulnerabilities, users can directly access it through some special paths. Once such a path is published on the Internet, Spider will follow this path to grab restricted content.
If you do not want these private contents to be included in Baidu, on the one hand, you can restrict it through the Robots agreement; on the other hand, you can also contact [email protected] for resolution.
Why are my website inclusion numbers decreasing?
The server where your website is located is unstable. Spider was unable to crawl the web page when checking for updates and was temporarily removed.
Your site doesn't match the user's search experience.
Why does my webpage disappear from Baidu search results?
Baidu does not promise that all web pages can be searched from Baidu.
If your webpage cannot be searched from Baidu for a long time, or suddenly disappears from Baidu's search results, the possible reasons are:
A. Your webpage does not meet the user’s search experience
B. The server where your website is located is unstable and has been temporarily removed by Baidu. After it becomes stable, the problem will be solved.
C. The content of your webpage does not comply with national laws and regulations.
D. Other technical issues
The following statements are false and baseless:
A. If you participate in Baidu PPC but do not renew, you will disappear from Baidu search results.
B. Participating in advertising projects of other search engines will disappear from Baidu search results.
C. It competes with Baidu-owned websites and will disappear from Baidu search results.
D. The traffic obtained from Baidu is too large and will disappear from Baidu search results.
What kind of web pages will be considered by Baidu to be of no value and will not be included in Baidu or disappear from existing search results?
Baidu only includes web pages that are valuable to users. Any change in the presence or absence of any web page in the search results is the result of calculation and adjustment by machine algorithms. Baidu will definitely not welcome the following types of web pages:
A. The web page does a lot of processing for search engines rather than users, so that the content users see in the search results is completely different from the actual content of the page, or the web page obtains inappropriate rankings in the search results, causing users to Feeling cheated.
If there are many such pages in your website, this may affect the page inclusion and ranking of your entire website.
B. Web pages are highly repetitive content copied from the Internet.
C. The webpage contains content that does not comply with Chinese laws and regulations.
If my website disappears from Baidu search results due to cheating, is there any chance it will be re-included?
Any website that has been completely revised will have a chance to be re-included by Baidu. Baidu will automatically evaluate the processed sites on a regular basis and re-include those that meet the conditions.
It should be noted that Baidu’s technology and product departments are only responsible for user search experience. The following statements are wrong and baseless:
A. If I become an advertiser or affiliate website of Baidu, I can be re-included.
B. If I give Baidu some money, I can be included again.
C. If I know someone from Baidu, I can be re-included
My website has been updated, but the content included in Baidu has not been updated. What should I do?
Baidu will automatically update all web pages regularly (including removing dead links, updating domain name changes, and updating content changes). So please wait patiently for a while, and the changes on your website will be noticed and corrected by Baidu.
Why is the number of my website included in Baidu different from other search engines?
Under normal circumstances, this is a normal phenomenon. Different search engines have different algorithms for judging the value of web pages.
Web page sorting problem
My website homepage has been included, but when searching for the website name, it doesn’t rank first. What should I do?
Answer: The sorting algorithm is very complex. Our goal is to allow users to search for the information they need at the lowest cost through algorithm improvements. There will still be various unsatisfactory aspects in this process. We will very much welcome your feedback to us about any confusions and problems you encounter. Our engineers will carefully track and analyze every problem in order to finally solve it. On the right side of the search box at the bottom of the Baidu search results page, there is a "Conversation with Baidu" link, where you can submit your questions or send your questions to [email protected] to help us improve.
We have been improving the search algorithm to make Baidu's search results more in line with users' search needs.
When searching for a certain keyword, the ranking of my webpage in Baidu search results changes drastically in a short period of time. Is this normal?
A: Typically, this is a normal change. Generally speaking, there are three types of reasons for changes in ordering:
A. Your web pages related to specific keywords have changed
B. Other web pages related to specific keywords have changed
C. Baidu’s sorting algorithm has changed
When searching for a certain keyword, the ranking position of my webpage in Baidu is very different from the ranking position in other search engines. Is this normal?
Answer: Normally, this is a normal phenomenon. Because the algorithms of different search engines are different.
What are the consequences if I hire some “SEO” to optimize my website or web page?
Answer: For reasonable search engine optimization, see Baidu’s “Website Building Suggestions for Webmasters”.
Many outside companies or individuals under the banner of SEO may be able to bring short-term ranking benefits to your website, but this will expose you to the risk of greater losses. After you entrust your website resources to others, many SEOs will use cheating techniques to improve rankings, and will even use your resources to carry out their own personal operating projects, ultimately causing damage to your interests.
Don’t risk entrusting your website to SEOs because of what they say:
A. I’m very familiar with the people at Baidu. I can do whatever I want with no risk.
B. I am a search engine expert and I know Baidu’s algorithm clearly. It doesn’t matter if I play with fire.
C. I have ranked first for keywords such as xxx, yyy, and zzz, so I am a great person.
You can also complain to Baidu about spam websites or web pages encountered in searches to help Baidu maintain the quality of search results.
Business customer related questions
I am a PPC customer of Baidu. If I do not renew my subscription, will Baidu punish me for this?
Answer: This is absolutely impossible.
The only criterion for Baidu's web search strategy lies in the user's search experience. Bidding ranking and web search natural ranking are two completely independent technical service systems. Whether a website is a Baidu PPC client has no impact on the natural ranking of web searches.
If you receive any threatening remarks, please report it directly to [email protected].
I am a PPC customer of Baidu. Why did my website disappear from Baidu after I stopped renewing?
Answer: Whether a website can be indexed by Baidu is only related to the quality of your website and has nothing to do with bidding ranking. The bidding ranking in the web search results does not mean that your website is included in Baidu. If your website has disappeared from Baidu, please refer to the instructions for web inclusion issues.
My website disappeared from Baidu due to cheating. Can it be re-included by Baidu by becoming a Baidu PPC customer, advertiser or affiliate site?
Answer: No. Our only criterion for including websites is user search experience. For instructions on re-indexing the punished website into Baidu, see the description in Web Page Index Question 7.
If my website joins Baidu PPC, Baidu Alliance, or becomes an advertiser of Baidu, will it receive special consideration in the inclusion and ranking of web pages?
Answer: Impossible.
Back to top
Website building suggestions for webmasters
Add an appropriate title to each web page. If it is the homepage of the website, it is recommended that the title use the name of the site or the name of the company or institution represented by the site; for the rest of the content pages, the title is recommended to be refined and summarized with the main text content, which allows you to of potential users quickly reach your page through the title in search engine results.
Make full use of the description tag on the website homepage or channel homepage to provide a summary description of the content of this webpage in the form of , which will help users and search engines enhance their understanding of your website and web pages.
The website should have clear navigation and hierarchical structure. Important web pages on the website should be found from relatively shallow locations on the website, ensuring that each page can be reached through at least one text link.
Try to use text instead of flash, Javascript, etc. to display important content or links. Baidu is temporarily unable to recognize the content in Flash and Javascript. This part of the content may not be found in Baidu searches; only include web pages pointed to by links in flash and Javascript. Baidu may not be able to include it.
Try to use frame and iframe frame structures as little as possible. Content displayed through iframe may be discarded by Baidu.
If the website uses dynamic web pages, reducing the number of parameters and controlling the length of parameters will be beneficial to inclusion.
When the website is revised or the links to important pages within the website change, the page before the revision should be permanently redirected 301 to the page after the revision.
When a website changes its domain name, all pages in the old domain name should be permanently redirected 301 to the corresponding pages in the new domain name.
Only when there is a tacit balance of interests between search engines, webmasters, and Internet users can this industry develop smoothly. An exhaustive website construction will only make you farther and farther away from users and search engines. Search engines and webmasters should develop harmoniously and embrace a beautiful vision together.
Here are some of our website quality recommendations:
The content of the website should be user-oriented, and search engines are just ordinary visitors to the website. Placing any content that is invisible to users or deceives users may be regarded as cheating by search engines. These behaviors include but are not limited to: In web pages Add hidden text or hidden links; add keywords that are irrelevant to the web page content; have deceptive jumps or redirects; create bridge pages specifically for search engines; use program-generated content for search engines; have a large number of duplications and unnecessary content. Valuable content; filled with a large number of malicious advertisements or malicious code, etc.
Baidu prefers unique and original content. If your site content is just collected and copied from various places, it is likely that it will not be included by Baidu.
Set your friendly links carefully. If most of the friendly links on your website point to spam sites, your site may be negatively affected.
Be cautious about joining plans such as channel co-building and content alliances that cannot or rarely generate original content, unless you can create original content for the content alliance.
Baidu will try its best to include web pages that provide different information. If the same content on your website can be displayed in different forms (such as a simplified version of the forum page, a printed page), you can use robots.txt to prevent spiders from crawling the forms you do not want to display to users. , which also helps save your bandwidth.
Internet forum includes open protocols
"Internet Forum Inclusion Open Protocol" is a forum content inclusion standard formulated by Baidu Web Search. The forum website can make the posts published in the forum into XML format web pages that comply with this open protocol for search engine indexing. The forum posts can be automatically and Notify Baidu search engine promptly. The adoption of the "Internet Forum Collection Open Agreement" is equivalent to the posts in the forum being subscribed by search engines. Through the platform of Baidu, the world's largest Chinese search engine, netizens will be able to access it in a wider range and with higher frequency. Posts in your website's forums, thereby driving potential traffic to your website.
Visit the Internet Forum Collection Open Protocol page
other
Will I receive a timely response if I send online feedback to Baidu or send an email to [email protected]?
Answer: Although Baidu's staff responsible for web search quality cannot respond to feedback and emails, they will carefully read and classify every online feedback and email, and forward it to the corresponding responsible department for processing in a timely manner.