Background information:
Mr. Ce: special judge of Ceen's "World Famous Shoes Taobao Customer" promotion competition
During the first season of the promotion competition, he gave concise weekly reviews of the contestants' performance and specialized in solving their thorniest problems, which made him well known among contestants. Low-key and mysterious, he has focused on website promotion for several years. He is currently a judge for the second season of Ceen's "World Famous Shoes Taobao Customer" promotion competition, though his whereabouts are unclear.
When it comes to Chinese word segmentation, I think anyone who does SEO work should understand it in depth: how search engines identify the relationships between words, and how they determine the meaning of a sentence. Only by fully understanding Chinese word segmentation and applying it consistently can you be comfortable with keyword collection, analysis, and layout, including title writing, and write better soft articles. Chinese word segmentation is now core SEO knowledge.
SEO practitioners must understand how search engines work. This is both important and fundamental.
Let’s first understand the concept of Chinese word segmentation. What is Chinese word segmentation?
Chinese word segmentation belongs to the field of natural language processing. Given a sentence, a person can use their own knowledge to tell which character sequences are words and which are not. But how can a computer do the same? The procedure it uses is the word segmentation algorithm.
At present, the Chinese word segmentation methods used by search engines fall into three categories: methods based on string matching, methods based on understanding, and methods based on statistics.
1. Word segmentation method based on string matching
When a search engine receives a user's query, it first checks whether the "words" entered by the user match "words" in its database, and how closely they match.
Depending on the length of the input keywords or sentence, matching is prioritized as either maximum (longest) matching or minimum (shortest) matching.
The main variants are as follows:
1) Forward maximum matching method (direction from left to right);
Example: Ceen's "World Famous Shoes Taobao Customer" promotion competition.
The keywords produced by forward maximum matching are: Ceen, World Famous Shoes Taobao Customer, promotion competition (no screenshot is included here; you can search for the phrase on Baidu and check the cached snapshot).
2) Reverse maximum matching method (direction from right to left). This works the same way as the forward method, except that the text is scanned from the end. For most text the two directions produce the same words; where they disagree, the text is ambiguous.
3) Minimum segmentation (minimize the number of words cut from each sentence);
Example: Ceen's "World Famous Shoes Taobao Customer" promotion competition.
The minimum-segmentation result is: Ceen, world, famous shoes, Taobao customer, promotion, competition;
4) Bidirectional maximum matching method (two scans from left to right and from right to left)
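The matching methods listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how Baidu or any other search engine actually implements segmentation: the dictionary `VOCAB` and the sample phrase are hypothetical, and the tie-breaking rule in `bidirectional_max_match` (prefer fewer words, then fewer single-character words) is one common heuristic among several.

```python
# Hypothetical toy dictionary; a real engine would use a very large lexicon.
VOCAB = {"研究", "研究生", "生命", "起源"}


def forward_max_match(text, vocab):
    """Scan left to right, always taking the longest dictionary word."""
    max_len = max(len(w) for w in vocab)
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is accepted even if it is not in the dictionary.
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


def reverse_max_match(text, vocab):
    """Scan right to left, always taking the longest dictionary word."""
    max_len = max(len(w) for w in vocab)
    tokens, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                j -= size
                break
    tokens.reverse()  # words were collected from the end, so restore order
    return tokens


def bidirectional_max_match(text, vocab):
    """Run both scans and keep the result that looks more plausible."""
    fwd = forward_max_match(text, vocab)
    rev = reverse_max_match(text, vocab)
    if len(fwd) != len(rev):
        return fwd if len(fwd) < len(rev) else rev

    def singles(tokens):
        return sum(len(t) == 1 for t in tokens)

    # Same word count: prefer fewer single-character words, else the reverse scan.
    return fwd if singles(fwd) < singles(rev) else rev


phrase = "研究生命起源"
print(forward_max_match(phrase, VOCAB))        # ['研究生', '命', '起源']
print(reverse_max_match(phrase, VOCAB))        # ['研究', '生命', '起源']
print(bidirectional_max_match(phrase, VOCAB))  # ['研究', '生命', '起源']
```

The sample phrase is deliberately ambiguous, so the forward and reverse scans disagree; that disagreement is exactly the case the bidirectional method is meant to catch.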
2. Word segmentation method based on understanding
This method recognizes words by having the computer simulate how a person understands a sentence. The basic idea is to perform syntactic and semantic analysis while segmenting, and to use that syntactic and semantic information to resolve ambiguity. It has long worked against the practice of pseudo-original content, that is, lightly rewritten copies of existing articles. In 2009, swapping in synonyms was generally enough, and Baidu would index the pseudo-original pages. In 2010, the rewriting had to be more extensive to be effective. After a major Baidu update at the end of June this year, many peers found that scraper sites and pages with only light pseudo-original rewriting were dropped by Baidu in large numbers and were no longer indexed. This suggests that Baidu's word segmentation technology is maturing, and that passing off rewritten content as original is becoming increasingly difficult.
3. Word segmentation method based on statistics
Formally, a word is a stable combination of characters, so the more often adjacent characters appear together in context, the more likely they are to form a word. The frequency or probability with which adjacent characters co-occur therefore reflects how credible a candidate word is. Search engines surface the results of this method in the search "drop-down box" and in "related searches". In recent years, many Internet companies have offered a service called "brushing Baidu related keywords": when users type a certain keyword, a keyword chosen by the client appears in the Baidu drop-down box. That service relies on this statistics-based segmentation principle.
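To make the co-occurrence idea concrete, here is a minimal sketch that counts how often adjacent characters appear together in a small corpus and scores each pair with pointwise mutual information (PMI). The corpus, the function name `adjacent_pair_scores`, and the choice of PMI as the scoring formula are assumptions made for illustration; the article does not say which statistic a search engine actually uses.

```python
import math
from collections import Counter


def adjacent_pair_scores(corpus):
    """Return {pair: (count, pmi)} for every adjacent character pair."""
    char_counts = Counter()
    pair_counts = Counter()
    for sentence in corpus:
        char_counts.update(sentence)
        pair_counts.update(sentence[i:i + 2] for i in range(len(sentence) - 1))

    total_chars = sum(char_counts.values())
    total_pairs = sum(pair_counts.values())
    scores = {}
    for pair, count in pair_counts.items():
        p_pair = count / total_pairs
        p_first = char_counts[pair[0]] / total_chars
        p_second = char_counts[pair[1]] / total_chars
        # PMI is high when the two characters occur together more often
        # than their individual frequencies would predict.
        scores[pair] = (count, math.log(p_pair / (p_first * p_second)))
    return scores


# Hypothetical toy corpus of search phrases.
corpus = ["休闲皮鞋", "休闲男鞋", "男士皮鞋", "皮鞋保养"]
scores = adjacent_pair_scores(corpus)
for pair in ("皮鞋", "闲皮"):
    count, pmi = scores[pair]
    print(pair, count, round(pmi, 3))
```

In this toy corpus the pair "皮鞋" (leather shoes) occurs three times and scores higher than "闲皮", which only appears once where two real words happen to meet, so the statistics favor treating "皮鞋" as a word.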
Finally, let’s talk about the application of keywords in titles and understand the relationship between Chinese word segmentation and website titles.
1. The search terms exactly match the website title.
When users search on Baidu, if the search terms exactly match the title of your website and your website has reached a certain weight, it will rank well. This is why keywords must appear in the title. Conversely, if the search terms do not exactly match your title, the search engine segments them, using a combination of the Chinese word segmentation methods described above.
2. Word segmentation is performed only if the search terms do not match.
When the search terms do not match the title of your website, they are segmented. The segmentation may involve forward and reverse matching, statistics, understanding, synonyms, and the handling of new and ambiguous words. The query is then harder for the search engine to interpret, which affects the ranking of the web page.
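The two cases above can be sketched as a simple check: try an exact match against the title first, and fall back to segmentation only when that fails. This is a simplified illustration, not Baidu's actual ranking logic; the dictionary, the sample title, and the names `segment` and `match_title` are all hypothetical, and only forward maximum matching is used for the fallback.

```python
def segment(text, vocab):
    """Forward maximum matching; unknown characters become single tokens."""
    max_len = max(len(w) for w in vocab)
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


def match_title(query, title, vocab):
    """Return the match type and the query terms found in the title."""
    if query in title:
        # Case 1: the search terms appear verbatim in the title.
        return "exact", [query]
    # Case 2: no verbatim match, so segment the query and match word by word.
    terms = segment(query, vocab)
    return "segmented", [t for t in terms if t in title]


# Hypothetical dictionary and page title.
vocab = {"休闲", "皮鞋", "休闲皮鞋", "韩版", "时尚", "男士"}
title = "休闲皮鞋 时尚男士韩版"

print(match_title("休闲皮鞋", title, vocab))      # ('exact', ['休闲皮鞋'])
print(match_title("韩版休闲皮鞋", title, vocab))  # ('segmented', ['韩版', '休闲皮鞋'])
```

The second query does not appear verbatim in the title, so it is only matched after being split into separate words, which is the harder case described above.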
This brings me to the hot Taobao search keywords I previously posted in QQ group 189321234, a group for the Ceen Taobao Customer competition. In a recent interview with me, one Ceen Taobao customer said that these words have a very large search volume on Taobao but almost none on Baidu. At the time, I suggested optimizing for keywords such as "industry word + Ceen" and "industry word + Ceen leather shoes", and said I would explain this properly in a later article. Here is another example to make it clearer.
Title: Casual Leather Shoes Ceen Bowen, Fashionable Men's Korean Style
Under the maximum matching method, this title is split into: casual leather shoes, Ceen Bowen, fashionable men's, and Korean style. It covers at least four search terms: "casual leather shoes", "casual leather shoes Ceen", "Ceen Bowen", and "Ceen leather shoes Bowen". The title therefore optimizes for the popular Baidu term "casual leather shoes" and the precise term "casual leather shoes Ceen", and at the same time for the hot Taobao terms and precise terms "Ceen Bowen" and "Ceen leather shoes Bowen", so that it matches what users actually search for as closely as possible. This is what I meant at the time.
I would also like to recommend a forum dedicated to Taobao customers: Tuike SEO Forum, http://www.tuikeseo.com. It covers Taobao customer activities, discussion, and experience sharing, and offers free Taobao customer templates, programs, promotion software, and tutorial downloads. It is worth visiting often.
Season 2 of the A5 Webmaster Network & CEEN "World Famous Shoes Taobao Customer" Promotion Competition is now in full swing.