IKAnalyzer open source Chinese word segmentation 2012u6-source code download-A5 download

IK Analyzer open source Chinese word segmentation 2012 u6

CMS system

v0

No Resources Available

IKAnalyzer is an open source, lightweight Chinese word segmentation toolkit developed based on Java language. Since the launch of version 1.0 in December 2006, IKAnalyzer has launched 4 major versions. Initially, it was a Chinese word segmentation component based on the open source project Luence, which combined dictionary word segmentation and grammatical analysis algorithms. Starting from version 3.0, IK has developed into a public word segmentation component for Java, independent of the Lucene project, and provides a default optimized implementation of Lucene. In the 2012 version, IK implemented a simple word segmentation ambiguity elimination algorithm, marking the evolution of the IK word segmenter from simple dictionary segmentation to simulated semantic word segmentation.
IKAnalyzer2012 features:
It adopts a unique "forward iteration of the finest-grained segmentation algorithm" and supports two segmentation modes: fine-grained and intelligent word segmentation;
In the system environment: Core2i73.4G dual-core, 4G memory, window764-bit, SunJDK1.6_2964-bit ordinary PC environment test, IK2012 has a high-speed processing capability of 1.6 million words/second (3000KB/S).
The 2012 version of the intelligent word segmentation mode supports simple word segmentation disambiguation processing and quantifier merging output.
It adopts a multi-subprocessor analysis mode, supports: word segmentation processing of English letters, numbers, Chinese vocabulary, etc., is compatible with Korean and Japanese character optimized dictionary storage, and has a smaller memory footprint. Supports user dictionary extended definitions. In particular, in the 2012 version, the dictionary supports Chinese, English, and digital mixed words.

Expand

Additional Information

Version v0
Type CMS system
Update Time 2022-05-28
size 125MB

Related Applications

jsp probe v2016

2022-05-22
SiteServer v3.4.4 for .net1.1

2024-11-14
Yinku open source online school system source code v2.0.6

2022-05-19
Several WeChat Moments test mini-games v1.0

2023-04-18
KesionICMS intelligent website building system v3.7 official version

2024-11-05
Ext JS v3.1.1

2022-05-31

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
jsp probe v2016

CMS system

v0
SiteServer v3.4.4 for .net1.1

CMS system

3.4.4
Yinku open source online school system source code v2.0.6

CMS system

v0
waymo open dataset

Other source code

December 2023 Update
wp functions

Other categories

1.0.0
termwind

Other categories

v2.3.0

Related Information All

IK Analyzer open source Chinese word segmentation 2012 u6