The original configuration and vocabulary are modified from the af2480b commit of Rime Pinyin.
The words in the Rime thesaurus are relatively complete, but not very complete. The main problem is that the word frequency and word frequency are not right, and there are a lot of useless words, so I remade it.
Mainly maintain the vocabulary and word frequency. Based on the rime lexicon, unhealthy words have been deleted, a large number of uncommon words (words with frequency == 1 and the word segmenter cannot distinguish them) have been deleted, and non-words such as "Deba" and "DeLe" have been deleted/adjusted. of words. Manually modified a large number of word frequencies. The first step is to do subtraction.
Then use high-quality corpus of 745,396,750 words to perform word segmentation, re-count word frequency and word frequency, and normalize to achieve better input effects. Both full and double spells can be used.
The usage method is basically the same as Wusong Pinyin, with some fine-tuning of the trigger instructions and the addition of Lua auxiliary code support. The auxiliary code is optional. Press ` to turn it on and it will not affect normal typing.
Symbol/fh For more symbols, see https://github.com/gaboolic/rime-frost/blob/master/symbols_v.yaml
Tonal finals /a /e /u, etc.
Date and time rq sj xq dt ts
Turn on the auxiliary code` Moqi auxiliary code splitting instructions
Part splitting and reverse search uU
unicode character U
Numeric amounts with a capital R
Lunar calendar N
Calculator V
Download the compressed package Code - Download ZIP of this warehouse (or download the latest source-code.zip of releases), and unzip it to the following path
Windows: %APPDATA%Rime
(you can right-click the Xiaolanghao input method in the lower right corner to open the menu and select the user folder). After copying, go to the input method settings and select Baishuang Pinyin, and then redeploy
Mac
The whisker path is ~/Library/Rime
The path of fcitx5-Mac version is ~/.local/share/fcitx5/rime
Linux
The fcitx5-rime path is ~/.local/share/fcitx5/rime
The path of fcitx5 flatpak version ~/.var/app/org.fcitx.Fcitx5/data/fcitx5/rime
The ibus-rime path is ~/.config/ibus/rime
Android
The path of fcitx5-Android version is /Android/data/org.fcitx.fcitx5.android/files/data/rime
The same text path is /rime
Swift has built-in Hoarfrost thesaurus word frequency, you can install it directly and use it.
The iOS warehouse input method is currently built-in, and Baishuang Pinyin can also be updated through [Input scheme settings - plus sign in the upper right corner - Scheme download - Overwrite and deploy].
First time installation:
Depending on the system used by the user and the software installed, first cd to the parent directory of the corresponding configuration file (for example, %APPDATA%
for Windows, ~/Library/
for Mac), and then execute the following command:
git clone --depth 1 https://github.com/gaboolic/rime-frost Rime
Subsequent updates:
Just execute git pull
in the Rime folder.
Mac: cd ~/Library/Rime && git pull
Windows: cd "$env:APPDATARime" && git pull
Other systems can be deduced similarly
Select the recipe (others/recipes/*.recipe.yaml) to install or update:
℞ To install or update all files, execute bash rime-install gaboolic/rime-frost:others/recipes/full
After sorting out the word segmentation, words with high word frequency but not in the vocabulary are added.
Typos and misspellings reminder Lua
Write an automated script to convert the sentences into Pinyin, then call rime_api to generate sentences using Pinyin, compare the accuracy, and iterate
In addition, the first and last glyphs of the Moqi code are reversely checked. For example, if you type the amq guide character and then type mu cun, you can type out the words "village cupboard tree" and so on.
Add the function of Lua boot auxiliary code
2 words, verb + noun structure, add "LE" and "END" in the middle, and add "MI" at the end to automatically derive vocabulary.
For example: Awesome → Isn’t it awesome?
Tooth extraction: Tooth extracted, tooth extracted, tooth not extracted, tooth not extracted, tooth extracted or not
For example:
Add "not" between adjectives:
Add words like "the fourth column = the fourth column"
Statistics of English word frequency in Chinese corpus
Train an intelligent language model
Rime thesaurus https://github.com/iDvel/rime-ice The initial thesaurus and most configurations of the hoarfrost thesaurus come from the rime thesaurus
Stuttering Chinese word segmentation https://github.com/fxsjy/jieba
Convert Chinese characters to Pinyin (pypinyin) https://github.com/mozillazg/python-pinyin
MNBVC ultra-large-scale Chinese corpus https://github.com/esbatmop/MNBVC currently has 33TB of data volume
kenlm https://github.com/kpu/kenlm official website https://kheafield.com/code/kenlm/
kenlm tutorial, python call https://github.com/mattzheng/py-kenlm-model
Mascot (picked up on 2024-10-12):
Solution using Baishuang thesaurus
Moqi sound shape https://github.com/gaboolic/rime-shuangpin-fuzhuma
Moqi Wubi Sentence https://github.com/gaboolic/rime-wubi-sentence
Mint Pinyin https://github.com/Mintimate/oh-my-rime
Yuyan input method https://github.com/gurecn/YuyanIme An out-of-the-box Android input method with built-in Baishuang vocabulary library