penelope下載 - penelope原始碼下載

penelope

其他源碼

v3.1.3

下載

佩內洛普

Penelope是一款用於創建、編輯和轉換詞典的多功能工具，尤其適用於電子閱讀器設備。

版本：3.1.3
日期：2016-09-23
開發商：阿爾貝托‧佩塔林
許可證：麻省理工學院許可證（MIT）
聯絡方式：點這裡

使用目前版本，您可以：

將字典轉換為以下格式：
- Bookeen Cybook 奧德賽 (讀/寫)
- CSV（讀/寫）
- EPUB（僅限 W）
- MOBI（限 Kindle、W）
- Kobo（僅限 R 索引，僅限 W 未加密/未混淆）
- 星際譯王 (讀/寫)
- XML（讀/寫）
將多個相同類型的字典合併為一個字典
合併同一詞條的多個定義
按詞條和/或定義排序
定義您自己的輸入解析器來合併/排序/編輯定義
定義您自己的排序函數（僅限bookeen輸出格式）
輸出包含字典的 EPUB 檔案（例如，為了回應電子閱讀器缺乏搜尋功能的情況）
輸出 MOBI (Kindle) 字典

重要更新

2016-04-17 可悲的是，我不能再花時間在 Penelope 上工作，因為我的其他 FLOSS 專案佔用了我 100% 的 FLOSS 時間，而且我仍然需要支付租金和賬單，與家人和朋友共度時光等.，和其他人一樣。因此，我不會處理問題或拉取請求，請不要指望它們會被處理。我正在積極尋找其他開發商來接手這個專案。（當發生轉換時，應刪除此通知。）如果您需要轉換字典並且當前版本的 Penelope 不適合您，您可能需要查看PyGlossary 。對於造成您的不便，我深表歉意。

安裝

使用點

開啟控制台並輸入：
```
$ [sudo] pip install penelope
```
就是這樣！只需不帶參數運行（或使用-h或--help ）即可取得手冊：
```
$ penelope
```

此過程將安裝lxml和marisa-trie 。您可能需要單獨安裝dictzip （StarDict 輸出）和kindlegen （MOBI 輸出），請參閱下文。

來自原始碼

取得原始碼：
- 使用git克隆此儲存庫：
```
$ git clone https://github.com/pettarin/penelope.git
```
- 或下載最新版本並將其解壓縮到某處，
- 或下載目前的主 ZIP 並將其解壓縮到某處。
開啟控制台並進入penelope （克隆）目錄：
```
$ cd /path/to/penelope
```
就是這樣！只需不帶參數運行（或使用-h或--help ）即可取得手冊：
```
$ python -m penelope
```

此過程不會安裝任何依賴項：您需要手動執行此操作，請參閱下文。

依賴關係

Python，版本 2.7.x 或 3.4.x（或更高版本）
編寫 StarDict 字典： dictzip可執行文件，可在$PATH中使用或使用--dictzip-path指定：
```
$ [sudo] apt-get install dictzip
```
讀/寫 Kobo 字典：Python 模組marisa-trie ：
```
$ [sudo] pip install marisa-trie
```
或$PATH中可用的或用--marisa-bin-path指定的 MARISA 執行檔
編寫 MOBI Kindle 字典：kindlegen 可執行文件，可在$PATH中使用或使用--kindlegen-path指定
讀/寫 XML 字典：Python 模組lxml ：
```
$ [sudo] pip install lxml
```

用法

 usage: 
  $ penelope -h
  $ penelope -i INPUT_FILE -j INPUT_FORMAT -f LANGUAGE_FROM -t LANGUAGE_TO -p OUTPUT_FORMAT -o OUTPUT_FILE [OPTIONS]
  $ penelope -i IN1,IN2[,IN3...] -j INPUT_FORMAT -f LANGUAGE_FROM -t LANGUAGE_TO -p OUTPUT_FORMAT -o OUTPUT_FILE [OPTIONS]

description:
  Convert dictionary file(s) with file name prefix INPUT_FILE from format INPUT_FORMAT to format OUTPUT_FORMAT, saving it as OUTPUT_FILE.
  The dictionary is from LANGUAGE_FROM to LANGUAGE_TO, possibly the same.
  You can merge several dictionaries (with the same format), by providing a list of comma-separated prefixes, as shown by the third synopsis above.

optional arguments:
  -h, --help            show this help message and exit
  -d, --debug           enable debug mode (default: False)
  -f LANGUAGE_FROM, --language-from LANGUAGE_FROM
                        from language (ISO 639-1 code)
  -i INPUT_FILE, --input-file INPUT_FILE
                        input file name prefix(es). Multiple prefixes must be
                        comma-separated.
  -j INPUT_FORMAT, --input-format INPUT_FORMAT
                        from format (values: bookeen|csv|kobo|stardict|xml)
  -k, --keep            keep temporary files (default: False)
  -o OUTPUT_FILE, --output-file OUTPUT_FILE
                        output file name
  -p OUTPUT_FORMAT, --output-format OUTPUT_FORMAT
                        to format (values:
                        bookeen|csv|epub|kobo|mobi|stardict|xml)
  -t LANGUAGE_TO, --language-to LANGUAGE_TO
                        to language (ISO 639-1 code)
  -v, --version         print version and exit
  --author AUTHOR       author string
  --copyright COPYRIGHT
                        copyright string
  --cover-path COVER_PATH
                        path of the cover image file
  --description DESCRIPTION
                        description string
  --email EMAIL         email string
  --identifier IDENTIFIER
                        identifier string
  --license LICENSE     license string
  --title TITLE         title string
  --website WEBSITE     website string
  --year YEAR           year string
  --apply-css APPLY_CSS
                        apply the given CSS file (epub and mobi output only)
  --bookeen-collation-function BOOKEEN_COLLATION_FUNCTION
                        use the specified collation function
  --bookeen-install-file
                        create *.install file (default: False)
  --csv-fs CSV_FS       CSV field separator (default: ',')
  --csv-ignore-first-line
                        ignore the first line of the input CSV file(s)
                        (default: False)
  --csv-ls CSV_LS       CSV line separator (default: 'n')
  --dictzip-path DICTZIP_PATH
                        path to dictzip executable
  --epub-no-compress    do not create the compressed container (epub output
                        only, default: False)
  --escape-strings      escape HTML strings (default: False)
  --flatten-synonyms    flatten synonyms, creating a new entry with
                        headword=synonym and using the definition of the
                        original headword (default: False)
  --group-by-prefix-function GROUP_BY_PREFIX_FUNCTION
                        compute the prefix of headwords using the given prefix
                        function file
  --group-by-prefix-length GROUP_BY_PREFIX_LENGTH
                        group headwords by prefix of given length (default: 2)
  --group-by-prefix-merge-across-first
                        merge headword groups even when the first character
                        changes (default: False)
  --group-by-prefix-merge-min-size GROUP_BY_PREFIX_MERGE_MIN_SIZE
                        merge headword groups until the given minimum number
                        of headwords is reached (default: 0, meaning no merge
                        will take place)
  --ignore-case         ignore headword case, all headwords will be lowercased
                        (default: False)
  --ignore-synonyms     ignore synonyms, not reading/writing them if present
                        (default: False)
  --include-index-page  include an index page (epub and mobi output only,
                        default: False)
  --input-file-encoding INPUT_FILE_ENCODING
                        use the specified encoding for reading the raw
                        contents of input file(s) (default: 'utf-8')
  --input-parser INPUT_PARSER
                        use the specified parser function after reading the
                        raw contents of input file(s)
  --kindlegen-path KINDLEGEN_PATH
                        path to kindlegen executable
  --marisa-bin-path MARISA_BIN_PATH
                        path to MARISA bin directory
  --marisa-index-size MARISA_INDEX_SIZE
                        maximum size of the MARISA index (default: 1000000)
  --merge-definitions   merge definitions for the same headword (default:
                        False)
  --merge-separator MERGE_SEPARATOR
                        add this string between merged definitions (default: '
                        | ')
  --mobi-no-kindlegen   do not run kindlegen, keep .opf and .html files
                        (default: False)
  --no-definitions      do not output definitions for EPUB and MOBI formats
                        (default: False)
  --sd-ignore-sametypesequence
                        ignore the value of sametypesequence in StarDict .ifo
                        files (default: False)
  --sd-no-dictzip       do not compress the .dict file in StarDict files
                        (default: False)
  --sort-after          sort after merging/flattening (default: False)
  --sort-before         sort before merging/flattening (default: False)
  --sort-by-definition  sort by definition (default: False)
  --sort-by-headword    sort by headword (default: False)
  --sort-ignore-case    ignore case when sorting (default: False)
  --sort-reverse        reverse the sort order (default: False)

examples:

  $ penelope -i dict.csv -j csv -f en -t it -p stardict -o output.zip
    Convert en->it dictionary dict.csv (in CSV format) into output.zip (in StarDict format)

  $ penelope -i dict.csv -j csv -f en -t it -p stardict -o output.zip --merge-definitions
    As above, but also merge definitions

  $ penelope -i d1,d2,d3 -j csv -f en -t it -p csv -o output.csv --sort-after --sort-by-headword
    Merge CSV dictionaries d1, d2, and d3 into output.csv, sorting by headword

  $ penelope -i d1,d2,d3 -j csv -f en -t it -p csv -o output.csv --sort-after --sort-by-headword --sort-ignore-case
    As above, but ignore case for sorting

  $ penelope -i d1,d2,d3 -j csv -f en -t it -p csv -o output.csv --sort-after --sort-by-headword --sort-reverse
    As above, but reverse the order

  $ penelope -i dict.zip -j stardict -f en -t it -p csv -o output.csv
    Convert en->it dictionary dict.zip (in StarDict format) into output.csv (in CSV format)

  $ penelope -i dict.zip -j stardict -f en -t it -p csv -o output.csv --ignore-synonyms
    As above, but do not read the .syn synonym file if present

  $ penelope -i dict.zip -j stardict -f en -t it -p csv -o output.csv --flatten-synonyms
    As above, but flatten synonyms

  $ penelope -i dict.zip -j stardict -f en -t it -p bookeen -o output
    Convert dict.zip into output.dict.idx and output.dict for Bookeen devices

  $ penelope -i dict.zip -j stardict -f en -t it -p kobo -o dicthtml-en-it
    Convert dict.zip into dicthtml-en-it.zip for Kobo devices

  $ penelope -i dict.csv -j csv -f en -t it -p mobi -o output.mobi --cover-path mycover.png --title "My English->Italian Dictionary"
    Convert dict.csv into a MOBI (Kindle) dictionary, using the specified cover image and title

  $ penelope -i dict.xml -j xml -f en -t it -p mobi -o output.epub
    Convert dict.xml into an EPUB dictionary

  $ penelope -i dict.xml -j xml -f en -t it -p mobi -o output.epub --epub-output-definitions
    As above, but also output definitions

您可以在此處找到 ISO 639-1 語言代碼。

安裝字典

Bookeen Odyssey 設備

例如，假設您想要使用 IT -> EN 字典。

在您的 PC 上，產生/下載 IT -> EN 字典檔案it-en.dict和it-en.dict.idx 。
透過 USB 連接線將 Odyssey 裝置連接到 PC。
使用檔案管理器，將it-en.dict和it-en.dict.idx這兩個檔案從 PC 複製到 Odyssey 裝置上的Dictionaries/目錄中。
重新啟動您的奧德賽，打開一本義大利語書籍並選擇一個單字：應該會出現英文定義。（對於此測試，請選擇一個常用單字，以便確保它出現在字典中！）

請注意，Bookeen 字典軟體將透過讀取電子書的dc:language元資料來選擇要使用的字典。確保您的電子書具有正確的dc:language元數據，否則可能無法載入正確的字典。

工房設備

在撰寫本文時（2016-02-16），只有當文件具有官方 Kobo 字典的檔案名稱時，Kobo 設備才會載入字典，即：

dicthtml.zip （EN）
dicthtml-de.zip (德語), dicthtml-de-en.zip (德語 -> EN), dicthtml-en-de.zip (EN -> 德語),
dicthtml-es.zip (ES), dicthtml-es-en.zip (ES -> EN), dicthtml-en-es.zip (EN -> ES),
dicthtml-fr.zip (法文), dicthtml-fr-en.zip (法文 -> EN), dicthtml-en-fr.zip (EN -> 法文),
dicthtml-it.zip (IT), dicthtml-it-en.zip (IT -> EN), dicthtml-en-it.zip (EN -> IT),
dicthtml-nl.zip (NL)
dicthtml-ja.zip (JA), dicthtml-en-ja.zip (EN -> JA),
dicthtml-pt.zip (PT)、 dicthtml-pt-en.zip (PT -> EN)、 dicthtml-en-pt.zip (EN -> PT)

（請參閱此 MobileRead 執行緒）

因此，如果您想安裝由 Penelope 製作的自訂字典，您必須選擇涵蓋其中一本官方 Kobo 字典，這實際上失去了使用後者的可能性。

例如，假設您想使用波蘭語字典 ( dicthtml-pl.zip )，而您對使用官方葡萄牙語字典 ( dicthtml-pt.zip ) 不感興趣。

在您的 PC 上，產生/下載波蘭語字典dicthtml-pl.zip 。
在您的 Kobo 裝置中，前往設定並啟動葡萄牙語字典。
透過 USB 線將 Kobo 裝置連接到 PC。
使用檔案管理器，將dicthtml-pl.zip從 PC 複製到 Kobo 裝置上的.kobo/dict/目錄中。（請注意， .kobo是一個隱藏目錄：您可能需要啟用檔案管理器的「顯示隱藏檔案/目錄」設定。）
將dicthtml-pl.zip重新命名為dicthtml-pt.zip 。
重新啟動 Kobo，打開一本波蘭語書籍並選擇一個單字：應該會出現定義。（對於此測試，請選擇一個常用單字，以便確保它出現在字典中！）

請注意，如果您更新 Kobo 的韌體，自訂字典可能會被官方字典覆蓋。因此，請將自訂字典的備份副本保存在安全的地方，例如您的 PC 或 SD 卡。

您可以在此 MobileRead 線程中找到自訂字典列表，其中大部分是由 Penelope 完成的。

執照

Penelope自 2.0.0 版 (2014-06-30) 起在 MIT 授權下發布。

先前的版本由 Google Code 託管，根據 GNU GPL 3 授權發布。

限制和缺少的功能

Bookeen 沒有關於其字典格式的官方文件（它已被逆向工程），YMMV
Kobo 沒有關於其字典格式的官方文件（它已被逆向工程），YMMV
部分支援讀取 Kobo 字典（讀取索引，但不讀取定義，因為它們已加密/混淆）
不支援閱讀EPUB(3)字典；寫作部分需要潤飾/重構
不支援閱讀 PRC/MOBI (Kindle) 字典
可以讀取的 StarDict 檔案有一些限制（請參閱format_stardict.py中的註解）
文檔不完整
缺少單元測試

贊助商

2015 年 12 月：IngleseXpress.it，“Grazie per averci aiutato a pubblicare per Kindle il Dizionario Inglese-Italiano della Pronuncia Scritta Semplificata！”