penelope ดาวน์โหลด - penelope พีดาวน์โหลดซอร์สโค้ด

penelope

ซอร์สโค้ดอื่น ๆ

v3.1.3

ดาวน์โหลด

เพเนโลพี

Penelope เป็นเครื่องมือที่หลากหลายสำหรับการสร้าง แก้ไข และแปลงพจนานุกรม โดยเฉพาะสำหรับอุปกรณ์ eReader

เวอร์ชัน: 3.1.3
วันที่: 23-09-2016
ผู้พัฒนา: อัลเบอร์โต เพตตาริน
ใบอนุญาต: ใบอนุญาต MIT (MIT)
ติดต่อ: คลิกที่นี่

ด้วยเวอร์ชันปัจจุบัน คุณสามารถ:

แปลงพจนานุกรมจาก/เป็นรูปแบบต่อไปนี้:
- Bookeen Cybook Odyssey (ขวา/ซ้าย)
- CSV (ขวา/ซ้าย)
- EPUB (W เท่านั้น)
- MOBI (Kindle, W เท่านั้น)
- Kobo (ดัชนี R เท่านั้น, W ไม่เข้ารหัส/ไม่ซับซ้อนเท่านั้น)
- สตาร์ดิกท์ (R/W)
- XML (R/W)
รวมพจนานุกรมประเภทเดียวกันหลายฉบับไว้ในพจนานุกรมเดียว
รวมคำจำกัดความหลายคำสำหรับคำสำคัญเดียวกัน
จัดเรียงตามคำสำคัญและ/หรือตามคำจำกัดความ
กำหนดตัวแยกวิเคราะห์อินพุตของคุณเองเพื่อผสาน/จัดเรียง/แก้ไขคำจำกัดความ
กำหนดฟังก์ชันการจัดเรียงของคุณเอง (รูปแบบเอาต์พุต bookeen เท่านั้น)
ส่งออกไฟล์ EPUB ที่มีพจนานุกรม (เช่น เพื่อรับมือกับการขาดฟังก์ชันการค้นหาของ eReader ของคุณ)
ส่งออกพจนานุกรม MOBI (Kindle)

การอัปเดตที่สำคัญ

17-04-2559 น่าเศร้าที่ฉันไม่สามารถใช้เวลาทำงานกับเพเนโลพีได้อีกต่อไป เนื่องจากโครงการ FLOSS อื่นๆ ของฉันใช้เวลา 100% ของเวลา FLOSS ของฉัน และฉันยังต้องจ่ายค่าเช่าและค่าใช้จ่าย ใช้เวลากับครอบครัวและเพื่อนฝูง ฯลฯ ., เช่นเดียวกับคนอื่นๆ. ดังนั้น ฉันจะไม่ดำเนินการแก้ไขปัญหาหรือดึงคำขอ โปรดอย่าคาดหวังว่าพวกเขาจะได้รับการจัดการเลย ฉันกำลัง มองหานักพัฒนาคนอื่น ๆ ที่จะเข้ามารับช่วงต่อโครงการนี้ (ประกาศนี้ควรถูกลบออกเมื่อมีการเปลี่ยนแปลง) หากคุณต้องการแปลงพจนานุกรมและเวอร์ชันปัจจุบันของ Penelope ไม่เหมาะกับคุณ คุณอาจต้องการดูที่ PyGlossary ฉันขอโทษอย่างจริงใจที่สุดสำหรับความไม่สะดวก

การติดตั้ง

การใช้ pip

เปิดคอนโซลแล้วพิมพ์:
```
$ [sudo] pip install penelope
```
แค่นั้นแหละ! เพียงทำงานโดยไม่มีข้อโต้แย้ง (หรือด้วย -h หรือ --help ) เพื่อรับคู่มือ:
```
$ penelope
```

ขั้นตอนนี้จะติดตั้ง lxml และ marisa-trie คุณอาจต้องติดตั้ง dictzip (เอาต์พุต StarDict) และ kindlegen (เอาต์พุต MOBI) แยกกัน ดูด้านล่าง

จากซอร์สโค้ด

รับซอร์สโค้ด:
- โคลน repo นี้ด้วย git :
```
$ git clone https://github.com/pettarin/penelope.git
```
- หรือดาวน์โหลดรุ่นล่าสุดและคลายการบีบอัดที่ไหนสักแห่ง
- หรือดาวน์โหลด ZIP หลักปัจจุบันแล้วคลายการบีบอัดที่ไหนสักแห่ง
เปิดคอนโซลและเข้าสู่ไดเร็กทอรี penelope (โคลน):
```
$ cd /path/to/penelope
```
แค่นั้นแหละ! เพียงทำงานโดยไม่มีข้อโต้แย้ง (หรือด้วย -h หรือ --help ) เพื่อรับคู่มือ:
```
$ python -m penelope
```

ขั้นตอนนี้จะไม่ติดตั้งการขึ้นต่อกันใด ๆ คุณจะต้องทำการติดตั้งด้วยตนเอง ดูด้านล่าง

การพึ่งพาอาศัยกัน

Python เวอร์ชัน 2.7.x หรือ 3.4.x (หรือสูงกว่า)
ในการเขียนพจนานุกรม StarDict: ไฟล์ปฏิบัติการ dictzip มีอยู่ใน $PATH ของคุณหรือระบุด้วย --dictzip-path :
```
$ [sudo] apt-get install dictzip
```
เพื่ออ่าน/เขียนพจนานุกรม Kobo: โมดูล Python marisa-trie :
```
$ [sudo] pip install marisa-trie
```
หรือไฟล์ปฏิบัติการ MARISA ที่มีอยู่ใน $PATH ของคุณหรือระบุด้วย --marisa-bin-path
เพื่อเขียนพจนานุกรม MOBI Kindle: ไฟล์ปฏิบัติการ Kindlegen มีอยู่ใน $PATH ของคุณหรือระบุด้วย --kindlegen-path
เพื่ออ่าน/เขียนพจนานุกรม XML: โมดูล Python lxml :
```
$ [sudo] pip install lxml
```

การใช้งาน

 usage: 
  $ penelope -h
  $ penelope -i INPUT_FILE -j INPUT_FORMAT -f LANGUAGE_FROM -t LANGUAGE_TO -p OUTPUT_FORMAT -o OUTPUT_FILE [OPTIONS]
  $ penelope -i IN1,IN2[,IN3...] -j INPUT_FORMAT -f LANGUAGE_FROM -t LANGUAGE_TO -p OUTPUT_FORMAT -o OUTPUT_FILE [OPTIONS]

description:
  Convert dictionary file(s) with file name prefix INPUT_FILE from format INPUT_FORMAT to format OUTPUT_FORMAT, saving it as OUTPUT_FILE.
  The dictionary is from LANGUAGE_FROM to LANGUAGE_TO, possibly the same.
  You can merge several dictionaries (with the same format), by providing a list of comma-separated prefixes, as shown by the third synopsis above.

optional arguments:
  -h, --help            show this help message and exit
  -d, --debug           enable debug mode (default: False)
  -f LANGUAGE_FROM, --language-from LANGUAGE_FROM
                        from language (ISO 639-1 code)
  -i INPUT_FILE, --input-file INPUT_FILE
                        input file name prefix(es). Multiple prefixes must be
                        comma-separated.
  -j INPUT_FORMAT, --input-format INPUT_FORMAT
                        from format (values: bookeen|csv|kobo|stardict|xml)
  -k, --keep            keep temporary files (default: False)
  -o OUTPUT_FILE, --output-file OUTPUT_FILE
                        output file name
  -p OUTPUT_FORMAT, --output-format OUTPUT_FORMAT
                        to format (values:
                        bookeen|csv|epub|kobo|mobi|stardict|xml)
  -t LANGUAGE_TO, --language-to LANGUAGE_TO
                        to language (ISO 639-1 code)
  -v, --version         print version and exit
  --author AUTHOR       author string
  --copyright COPYRIGHT
                        copyright string
  --cover-path COVER_PATH
                        path of the cover image file
  --description DESCRIPTION
                        description string
  --email EMAIL         email string
  --identifier IDENTIFIER
                        identifier string
  --license LICENSE     license string
  --title TITLE         title string
  --website WEBSITE     website string
  --year YEAR           year string
  --apply-css APPLY_CSS
                        apply the given CSS file (epub and mobi output only)
  --bookeen-collation-function BOOKEEN_COLLATION_FUNCTION
                        use the specified collation function
  --bookeen-install-file
                        create *.install file (default: False)
  --csv-fs CSV_FS       CSV field separator (default: ',')
  --csv-ignore-first-line
                        ignore the first line of the input CSV file(s)
                        (default: False)
  --csv-ls CSV_LS       CSV line separator (default: 'n')
  --dictzip-path DICTZIP_PATH
                        path to dictzip executable
  --epub-no-compress    do not create the compressed container (epub output
                        only, default: False)
  --escape-strings      escape HTML strings (default: False)
  --flatten-synonyms    flatten synonyms, creating a new entry with
                        headword=synonym and using the definition of the
                        original headword (default: False)
  --group-by-prefix-function GROUP_BY_PREFIX_FUNCTION
                        compute the prefix of headwords using the given prefix
                        function file
  --group-by-prefix-length GROUP_BY_PREFIX_LENGTH
                        group headwords by prefix of given length (default: 2)
  --group-by-prefix-merge-across-first
                        merge headword groups even when the first character
                        changes (default: False)
  --group-by-prefix-merge-min-size GROUP_BY_PREFIX_MERGE_MIN_SIZE
                        merge headword groups until the given minimum number
                        of headwords is reached (default: 0, meaning no merge
                        will take place)
  --ignore-case         ignore headword case, all headwords will be lowercased
                        (default: False)
  --ignore-synonyms     ignore synonyms, not reading/writing them if present
                        (default: False)
  --include-index-page  include an index page (epub and mobi output only,
                        default: False)
  --input-file-encoding INPUT_FILE_ENCODING
                        use the specified encoding for reading the raw
                        contents of input file(s) (default: 'utf-8')
  --input-parser INPUT_PARSER
                        use the specified parser function after reading the
                        raw contents of input file(s)
  --kindlegen-path KINDLEGEN_PATH
                        path to kindlegen executable
  --marisa-bin-path MARISA_BIN_PATH
                        path to MARISA bin directory
  --marisa-index-size MARISA_INDEX_SIZE
                        maximum size of the MARISA index (default: 1000000)
  --merge-definitions   merge definitions for the same headword (default:
                        False)
  --merge-separator MERGE_SEPARATOR
                        add this string between merged definitions (default: '
                        | ')
  --mobi-no-kindlegen   do not run kindlegen, keep .opf and .html files
                        (default: False)
  --no-definitions      do not output definitions for EPUB and MOBI formats
                        (default: False)
  --sd-ignore-sametypesequence
                        ignore the value of sametypesequence in StarDict .ifo
                        files (default: False)
  --sd-no-dictzip       do not compress the .dict file in StarDict files
                        (default: False)
  --sort-after          sort after merging/flattening (default: False)
  --sort-before         sort before merging/flattening (default: False)
  --sort-by-definition  sort by definition (default: False)
  --sort-by-headword    sort by headword (default: False)
  --sort-ignore-case    ignore case when sorting (default: False)
  --sort-reverse        reverse the sort order (default: False)

examples:

  $ penelope -i dict.csv -j csv -f en -t it -p stardict -o output.zip
    Convert en->it dictionary dict.csv (in CSV format) into output.zip (in StarDict format)

  $ penelope -i dict.csv -j csv -f en -t it -p stardict -o output.zip --merge-definitions
    As above, but also merge definitions

  $ penelope -i d1,d2,d3 -j csv -f en -t it -p csv -o output.csv --sort-after --sort-by-headword
    Merge CSV dictionaries d1, d2, and d3 into output.csv, sorting by headword

  $ penelope -i d1,d2,d3 -j csv -f en -t it -p csv -o output.csv --sort-after --sort-by-headword --sort-ignore-case
    As above, but ignore case for sorting

  $ penelope -i d1,d2,d3 -j csv -f en -t it -p csv -o output.csv --sort-after --sort-by-headword --sort-reverse
    As above, but reverse the order

  $ penelope -i dict.zip -j stardict -f en -t it -p csv -o output.csv
    Convert en->it dictionary dict.zip (in StarDict format) into output.csv (in CSV format)

  $ penelope -i dict.zip -j stardict -f en -t it -p csv -o output.csv --ignore-synonyms
    As above, but do not read the .syn synonym file if present

  $ penelope -i dict.zip -j stardict -f en -t it -p csv -o output.csv --flatten-synonyms
    As above, but flatten synonyms

  $ penelope -i dict.zip -j stardict -f en -t it -p bookeen -o output
    Convert dict.zip into output.dict.idx and output.dict for Bookeen devices

  $ penelope -i dict.zip -j stardict -f en -t it -p kobo -o dicthtml-en-it
    Convert dict.zip into dicthtml-en-it.zip for Kobo devices

  $ penelope -i dict.csv -j csv -f en -t it -p mobi -o output.mobi --cover-path mycover.png --title "My English->Italian Dictionary"
    Convert dict.csv into a MOBI (Kindle) dictionary, using the specified cover image and title

  $ penelope -i dict.xml -j xml -f en -t it -p mobi -o output.epub
    Convert dict.xml into an EPUB dictionary

  $ penelope -i dict.xml -j xml -f en -t it -p mobi -o output.epub --epub-output-definitions
    As above, but also output definitions

คุณสามารถค้นหารหัสภาษา ISO 639-1 ได้ที่นี่

การติดตั้งพจนานุกรม

อุปกรณ์ Bookeen Odyssey

ตัวอย่างเช่น สมมติว่าคุณต้องการใช้พจนานุกรม IT -> EN

บนพีซีของคุณ ให้ผลิต/ดาวน์โหลดไฟล์พจนานุกรม IT -> EN it-en.dict และ it-en.dict.idx
เชื่อมต่ออุปกรณ์ Odyssey ของคุณเข้ากับพีซีผ่านสาย USB
ใช้ตัวจัดการไฟล์ของคุณ คัดลอกสองไฟล์ it-en.dict และ it-en.dict.idx จากพีซีของคุณไปยังไดเร็กทอรี Dictionaries/ บนอุปกรณ์ Odyssey ของคุณ
รีบูต Odyssey ของคุณ เปิดหนังสือเป็นภาษาอิตาลีแล้วเลือกคำ: คำจำกัดความในภาษาอังกฤษควรปรากฏขึ้น (สำหรับการทดสอบนี้ ให้เลือกคำทั่วไปเพื่อให้แน่ใจว่ามีอยู่ในพจนานุกรม!)

โปรดทราบว่าซอฟต์แวร์พจนานุกรม Bookeen จะเลือกพจนานุกรมที่จะใช้โดยการอ่านข้อมูลเมตา dc:language ของ eBook ของคุณ ตรวจสอบให้แน่ใจว่า eBook ของคุณมีข้อมูลเมตา dc:language ที่เหมาะสม ไม่เช่นนั้นอาจโหลดพจนานุกรมที่ถูกต้องไม่ได้

โคโบ ดีไวซ์

ในขณะที่เขียนบทความนี้ (16-02-2559) อุปกรณ์ Kobo จะโหลดพจนานุกรมเฉพาะในกรณีที่ไฟล์มีชื่อไฟล์ของพจนานุกรม Kobo อย่างเป็นทางการ ซึ่งได้แก่:

dicthtml.zip (EN)
dicthtml-de.zip (DE), dicthtml-de-en.zip (DE -> EN), dicthtml-en-de.zip (EN -> DE)
dicthtml-es.zip (ES), dicthtml-es-en.zip (ES -> EN), dicthtml-en-es.zip (EN -> ES)
dicthtml-fr.zip (FR), dicthtml-fr-en.zip (FR -> EN), dicthtml-en-fr.zip (EN -> FR)
dicthtml-it.zip (ไอที), dicthtml-it-en.zip (ไอที -> EN), dicthtml-en-it.zip (EN -> มัน)
dicthtml-nl.zip (NL)
dicthtml-ja.zip (JA), dicthtml-en-ja.zip (EN -> JA)
dicthtml-pt.zip (PT), dicthtml-pt-en.zip (PT -> EN), dicthtml-en-pt.zip (EN -> PT)

(ดูหัวข้อ MobileRead นี้)

ดังนั้น หากคุณต้องการติดตั้งพจนานุกรมแบบกำหนดเองที่สร้างด้วย Penelope คุณต้องเลือกที่จะเขียนทับพจนานุกรม Kobo อย่างเป็นทางการรายการใดรายการหนึ่ง ซึ่งจะทำให้สูญเสียความเป็นไปได้ในการใช้พจนานุกรมอย่างหลังอย่างมีประสิทธิภาพ

ตัวอย่างเช่น สมมติว่าคุณต้องการใช้พจนานุกรมภาษาโปแลนด์ ( dicthtml-pl.zip ) ในขณะที่คุณไม่สนใจใช้พจนานุกรมภาษาโปรตุเกสอย่างเป็นทางการ ( dicthtml-pt.zip )

บนพีซีของคุณ ให้ผลิต/ดาวน์โหลดพจนานุกรมภาษาโปแลนด์ dicthtml-pl.zip
ในอุปกรณ์ Kobo ของคุณ ไปที่การตั้งค่าและเปิดใช้งานพจนานุกรมภาษาโปรตุเกส
เชื่อมต่ออุปกรณ์ Kobo ของคุณกับพีซีผ่านสาย USB
ใช้ตัวจัดการไฟล์ของคุณ คัดลอก dicthtml-pl.zip จากพีซีของคุณไปยังไดเร็กทอรี .kobo/dict/ บนอุปกรณ์ Kobo ของคุณ (โปรดทราบว่า .kobo เป็นไดเร็กทอรีที่ซ่อนอยู่: คุณอาจต้องเปิดใช้งานการตั้งค่า "แสดงไฟล์/ไดเร็กทอรีที่ซ่อน" ของตัวจัดการไฟล์ของคุณ)
เปลี่ยนชื่อ dicthtml-pl.zip เป็น dicthtml-pt.zip
รีบูท Kobo ของคุณ เปิดหนังสือเป็นภาษาโปแลนด์แล้วเลือกคำ: คำจำกัดความควรปรากฏขึ้น (สำหรับการทดสอบนี้ ให้เลือกคำทั่วไปเพื่อให้แน่ใจว่ามีอยู่ในพจนานุกรม!)

โปรดทราบว่าหากคุณอัปเดตเฟิร์มแวร์ของ Kobo พจนานุกรมที่กำหนดเองอาจถูกเขียนทับด้วยพจนานุกรมอย่างเป็นทางการ ดังนั้น ให้เก็บสำเนาสำรองของพจนานุกรมที่คุณกำหนดเองไว้ในที่ปลอดภัย เช่น พีซีหรือการ์ด SD

คุณสามารถดูรายการพจนานุกรมแบบกำหนดเองซึ่งส่วนใหญ่ใช้ Penelope ได้ในหัวข้อ MobileRead นี้

ใบอนุญาต

Penelope เปิดตัวภายใต้ใบอนุญาต MIT ตั้งแต่เวอร์ชัน 2.0.0 (2014-06-30)

เวอร์ชันก่อนหน้าซึ่งโฮสต์โดย Google Code ได้รับการเผยแพร่ภายใต้ใบอนุญาต GNU GPL 3

ข้อจำกัดและคุณสมบัติที่ขาดหายไป

Bookeen ไม่มีเอกสารอย่างเป็นทางการสำหรับรูปแบบพจนานุกรม (มีวิศวกรรมย้อนกลับ), YMMV
Kobo ไม่มีเอกสารอย่างเป็นทางการสำหรับรูปแบบพจนานุกรม (ได้รับการออกแบบทางวิศวกรรมย้อนกลับ), YMMV
รองรับการอ่านพจนานุกรม Kobo บางส่วน (ดัชนีถูกอ่าน แต่คำจำกัดความไม่ได้ถูกเข้ารหัส เนื่องจากมีการเข้ารหัส/ทำให้สับสน)
ไม่รองรับการอ่านพจนานุกรม EPUB (3) ส่วนการเขียนจำเป็นต้องขัดเกลา/ปรับโครงสร้างใหม่
ไม่รองรับการอ่านพจนานุกรม PRC/MOBI (Kindle)
มีข้อ จำกัด บางประการเกี่ยวกับไฟล์ StarDict ที่สามารถอ่านได้ (ดูความคิดเห็นใน format_stardict.py )
เอกสารประกอบไม่ครบถ้วน
การทดสอบหน่วยหายไป

สปอนเซอร์

ธันวาคม 2015 : IngleseXpress.it, "Grazie per averci aiutato a pubblicare for Kindle il Dizionario Inglese-Italiano della Pronuncia Scritta Semplificata!"

รับทราบ

ขอบคุณมากที่:

uwelovesdonna สำหรับการสนับสนุนแนวคิดในการปรับปรุงโค้ดและสำหรับการตั้งค่าหลายหน้าของวิกิโครงการ
Jens Sadowski สำหรับการชี้ให้เห็นจุดบกพร่องด้วยชื่อไฟล์ Unicode และสำหรับการแนะนำให้ใช้ multiset dict() แทน set dict() ;
oldnat สำหรับการชี้ให้เห็นจุดบกพร่องใน Windows และ Python 3
Wolfgang Miller-Reichling ที่ให้รหัสสำหรับอ่านพจนานุกรม CSV
branok สำหรับจัดเตรียมแนวคิดและโค้ดเริ่มต้นสำหรับฟังก์ชันการจัดเรียงภาษาเยอรมัน
เพื่อน สำหรับการแนะนำให้ผ่าน -l เปลี่ยนเป็น MARISA_BUILD ;
Lukas Brückner สำหรับการแนะนำการหลบหนี & < > เมื่อส่งออกในรูปแบบ XML;
Stephan Lichtenhagen สำหรับการแนะนำการบังคับให้เข้ารหัส UTF-8 บน Python 3;
niconavarrete สำหรับการชี้ให้เห็นการพึ่งพาจาก $CWD (ฉบับที่ 1) แก้ไขใน v2.0.1;
elchamaco สำหรับจัดเตรียมพจนานุกรม StarDict พร้อมไฟล์ .syn สำหรับการทดสอบ

ขยาย

ข้อมูลเพิ่มเติม

เวอร์ชัน v3.1.3
ประเภท ซอร์สโค้ดอื่น ๆ
เวลาอัปเดต 2024-12-19
ขนาด 58.75KB
มาจาก Github

แอปที่เกี่ยวข้อง

waymo open dataset

2024-11-18
Sunamu

2024-12-14
MySchedule.py

2024-12-15
chat.petals.dev

2024-11-30
SmartTube

2024-12-14
viptools for eslam

2024-12-15

แนะนำสำหรับคุณ

chat.petals.dev

ซอร์สโค้ดอื่น ๆ

1.0.0
GPT Prompt Templates

ซอร์สโค้ดอื่น ๆ

1.0.0
GPTyped

ซอร์สโค้ดอื่น ๆ

GPTyped 1.0.5
waymo open dataset

ซอร์สโค้ดอื่น ๆ

December 2023 Update
Sunamu

ซอร์สโค้ดอื่น ๆ

Release 2.2.0
MySchedule.py

ซอร์สโค้ดอื่น ๆ

Updates to the fetching of week codes
waymo open dataset

ซอร์สโค้ดอื่น ๆ

December 2023 Update
termwind

หมวดหมู่อื่นๆ

v2.3.0
wp functions

หมวดหมู่อื่นๆ

1.0.0

ข้อมูลที่เกี่ยวข้อง ทั้งหมด