K style web search system (.NET)

K style web search system (.NET)

Search link

v2.2

No Resources Available

SP1 improvements: Correct automatic recognition of web page encoding, improve hashing to make spider crawling more comprehensive, correct warehousing errors in special circumstances, etc.;
K-PageSearch is a professional web search engine system independently developed by Kwindsoft. It has advanced intelligent analysis and massive data retrieval technology. Its core consists of four parts: multi-threaded collection system, intelligent analysis system, massive indexing system, and full-text retrieval system. The system adopts a professional-level search engine system architecture and supports millisecond-level full-text retrieval of massive data. It is a professional full-text retrieval product designed mainly for large and medium-sized industry search engines, local search engines, specialized information search engines and other application fields, providing users with ideal solutions for full-text retrieval applications of massive data.
Main improvements of the V2.1 version: using .NET technology to develop Web front-end programs, using UTF-8 web page encoding, a new indexing system, and opening the source code of management tools;
Functional features: Multi-threaded network spider, web page directional acquisition, multi-language web page coding, automatic recognition, hash table, web page deduplication, intelligent web page text extraction, lexicon-based intelligent Chinese word segmentation, Chinese word segmentation, lexicon management, massive data, millisecond-level full-text retrieval, caching technology, web page snapshot, advanced search bidding Ranking web spiders

Web spiders use multi-threads to concurrently collect web pages, combined with efficient collection mechanisms and strategic deployment, to maximize the efficiency of web page collection. Supports targeted collection of web pages, a key technology for vertical search engines to improve data quality and relevance. Users can customize collection rules to collect specific web pages. Supports collection of multiple dynamic and static web page types, and automatic identification of multi-language web page encodings. It uses hash table web page deduplication technology, which has the characteristics of high performance and low system usage, allowing web spiders to run efficiently and stably. Supports single or batch website collection, automatic collection, and automatic update functions.

Text extraction

Intelligent web page text extraction technology, its function is to extract the central theme content of a web page and filter information unrelated to the web page theme (advertising, navigation, copyright and other non-web page body content information). This technology effectively improves the quality of web page information collection and retrieval relevance, intelligent automatic identification, accurate web page text extraction, and an accuracy rate of over 95%.

Chinese word segmentation

Intelligent Chinese word segmentation technology based on thesaurus supports multiple intelligent analysis technologies such as Chinese and English segmentation, Chinese simplified and traditional font conversion, full-width and half-width conversion, and Chinese name recognition. Users can expand and maintain the vocabulary library according to their own application needs to achieve the best word segmentation effect.

Full text search

It adopts massive data indexing system architecture and advanced full-text retrieval algorithm technology, combined with efficient retrieval optimization strategies, to support millisecond-level retrieval speeds of massive data and multi-user concurrent retrieval. Advanced search supports customized search methods to meet users' different search needs. Adopt efficient caching technology strategies to improve system stability and load capacity, reduce system burden, and cache data is automatically updated according to specific conditions.

Applicable objects

Suitable for internal website groups or Internet website groups such as enterprises, government agencies, schools, etc. to establish web search engines;
Suitable for website groups in various industries and fields to establish industry web search engines;
Suitable for local website groups such as provinces, cities, and districts to establish local web search engines;

Expand

Additional Information

Version v2.2
Type Search link
Update Time 2010-10-29
size 3006464
Language Simplified Chinese

Related Applications

K style web search (.NET) v2.2 SP3

2024-11-15
K style web search engine system 2.2 SP4

2022-07-04
K style web search (.NET)

2011-11-28
K style web search K-PageSearch

2011-06-28
K-wind web search engine system K-PageSearch Engine

2010-10-11
K-YellowPage Engine

2009-04-23

Recommended for You

Google Chrome

Home page browsing

3.0.190.0 build 18892 绿色多语版_Google Chrome浏览器
Google Chrome

Home page browsing

3.0.182.3 Dev 多国语言官方安装版
Google Chrome

Home page browsing

3.0.182.3 Dev 多国语言绿色便携版
WeChat Taobao Wealth Edition

E-commerce

v1.0
wordpress retains Chinese automatic interception summary format plug-in (Bobaiyou) optimized version

Blog X guest
Stroke color effect jpg format

Image animation
Partition format conversion tool

Disk tools

1.0 绿色版_轻松实现转换各个分区的格式
Apache2.0 Chinese manual (chm format)

Server tutorial
SWF/FLV file format and AMF standard documentation

Image animation

Related Information All

User Comments