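DS-1000 Data Science Code Generation

Official data and code release for the paper DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation. You can refer to our project page for more examples and baselines.

News (04/2024): DS-1000 has now been simplified and is hosted on Huggingface. You can also refer to the original format as described in our paper.

DS-1000 in simplified format

This is a reformatting of the DS-1000 dataset that improves its usability without changing the contents. The dataset is hosted on Huggingface and in data/ds1000.jsonl.gz.

Usage

There are two options for loading the dataset: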

# Option 1: load from Huggingface
from datasets import load_dataset
ds1000 = list(load_dataset("xlangai/DS-1000")["test"])

# Option 2: load from the raw jsonl.gz (one JSON object per line)
import gzip
import json
ds1000 = [json.loads(l) for l in gzip.open("data/ds1000.jsonl.gz", "rt").readlines()]

The provided test_ds1000.py script evaluates DS-1000 on data/codex002-answers.jsonl:

conda env create -f environment.yml
conda activate ds1000-3.10
# the test code also needs: pip install datasets tqdm
python test_ds1000.py

Expected output:

           count   mean
lib                    
Matplotlib   155  0.548
Numpy        220  0.432
Pandas       291  0.265
Pytorch       68  0.397
Scipy        106  0.349
Sklearn      115  0.435
Tensorflow    45  0.378

DS-1000 overall
mean      0.388

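See also the results of newer models in the results folder.

The test script executes generated code, so running it in your own sandbox is strongly encouraged, although the reference code and the provided solutions seem safe to run. Your sandbox has to allow some file operations (e.g. saving plots in matplotlib) and OS operations (e.g. tensorflow, sklearn).

Format

The field prompt contains the question in context, along with a simple instruction such as result = ... # put solution this variable. This is the standard prompt for the model, used for consistency.

The field code_context defines test_execution(solution: str) and test_string(solution: str), which take the generated code as an argument and evaluate its correctness. test_execution works by replacing the example input included in the prompt with inputs generated by generate_test_case(test_case_id) and then checking the outputs (generated by the reference solution, which is also present in code_context). A solution passes if it passes both test_execution and test_string (if the latter exists). The execution context (exec_context) in code_context is identical to the one provided by the prompt.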

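Changes from the original

- The test process was cleaned up and merged entirely into code_context.
  - code_context defines test_execution(solution: str) and test_string(solution: str), which take code as an argument and evaluate its correctness
  - Removed: cached pickles, reading/writing pickles, reading multiple files, separate test scripts.
  - Data size reduced from 162MB to 3.4MB
- Python was upgraded to 3.10 and the libraries were upgraded to their latest versions (environment.yml)
  - Checked that performance is not too sensitive to library versions and Python versions (3.7 - 3.11)
- Removed the obsolete parser module and simplified string testing using tokenize
- Insertion (infilling) mode was removed, since the suffix was only used to specify the output format. A simple instruction for the output format in completion mode is easy for current models to follow.
- The dataset is now a simple list of 1000 problems; library information is in metadata.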
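Because the problems form a flat list with the library recorded in metadata, a per-library summary like the expected output shown earlier can be computed with a simple grouping. The sketch below assumes the library name is stored under metadata["library"] and uses made-up toy records and pass/fail flags; summarize_by_library is a hypothetical helper, not part of the repository.

```python
from collections import defaultdict


def summarize_by_library(problems, passed):
    """Return ({library: (count, mean pass rate)}, overall mean)."""
    buckets = defaultdict(list)
    for problem, ok in zip(problems, passed):
        # Assumption: the library name lives under metadata["library"].
        buckets[problem["metadata"]["library"]].append(1.0 if ok else 0.0)
    per_lib = {
        lib: (len(scores), sum(scores) / len(scores))
        for lib, scores in sorted(buckets.items())
    }
    overall = sum(1.0 for ok in passed if ok) / len(passed)
    return per_lib, overall


# Toy records and outcomes (fabricated purely for illustration).
toy_problems = [
    {"metadata": {"library": "Numpy"}},
    {"metadata": {"library": "Numpy"}},
    {"metadata": {"library": "Pandas"}},
    {"metadata": {"library": "Pandas"}},
]
toy_passed = [True, False, True, True]

per_lib, overall = summarize_by_library(toy_problems, toy_passed)
for lib, (count, mean) in per_lib.items():
    print(f"{lib:<12}{count:>5}{mean:>7.3f}")
print(f"overall mean {overall:.3f}")
```

The overall figure is the mean over all problems rather than the mean of the per-library means, so libraries with more problems weigh more.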

Editing the dataset

To convert ds1000 into editable files, use the provided edit_ds1000.py script to split the problem list with this command:

 # specify --edit_dir according to your needs
python edit_ds1000.py --to_edit --edit_dir data/ds1000_edit

The problems are split into the following structure:

 data/ds1000_edit
├── 0 # problem id
    ├── prompt.txt
    ├── reference_code.txt
    ├── metadata.json
    └── code_context.py
├── 1
├── ...

After editing, rebuild the dataset into data/ds1000.jsonl with this command:

python edit_ds1000.py --to_jsonl --edit_dir data/ds1000_edit
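For readers who want to inspect edited problems without rebuilding the jsonl file, the following sketch reads the directory layout shown above back into a list of records. It is only an approximation of what a merge step could look like, not the actual logic of edit_ds1000.py; read_problem and read_edit_dir are hypothetical helpers, and the record keys are assumed to mirror the file names.

```python
import json
import tempfile
from pathlib import Path


def read_problem(problem_dir: Path) -> dict:
    """Load one problem directory into a record keyed like the file names."""
    return {
        "prompt": (problem_dir / "prompt.txt").read_text(encoding="utf-8"),
        "reference_code": (problem_dir / "reference_code.txt").read_text(encoding="utf-8"),
        "metadata": json.loads((problem_dir / "metadata.json").read_text(encoding="utf-8")),
        "code_context": (problem_dir / "code_context.py").read_text(encoding="utf-8"),
    }


def read_edit_dir(edit_dir: Path) -> list[dict]:
    """Read every numbered problem directory, ordered by numeric problem id."""
    problem_dirs = sorted(
        (d for d in edit_dir.iterdir() if d.is_dir() and d.name.isdigit()),
        key=lambda d: int(d.name),  # numeric sort, so "2" comes before "10"
    )
    return [read_problem(d) for d in problem_dirs]
```

A call such as read_edit_dir(Path("data/ds1000_edit")) would then return the problems in id order, assuming the split command above was run with that --edit_dir.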

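Known issues

A small percentage of executions are stateful (tensorflow and matplotlib), so each problem should be executed in an independent process. The same problem arises when problems are tested sequentially within one process.
As with most human-labelled data, a small number of errors are to be expected. Furthermore, while we removed the ambiguities we could see, we cannot anticipate every possible correct or incorrect interpretation of a prompt. More details are in the paper.
There is a minor inconsistency with the original set due to importing. For instance, import a; a.b.c() is expected to fail, but it does not fail if import a.b is used anywhere else in the process. This only affects the few solutions that are correct except for their imports.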
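One way to follow the advice about independent processes is to launch a fresh interpreter for every test program, so that state left behind by one problem cannot reach the next. The sketch below is a minimal version of that idea; run_isolated and the 120-second default timeout are assumptions for illustration, and a separate process by itself is not a security sandbox.

```python
import subprocess
import sys


def run_isolated(test_program: str, timeout: float = 120.0) -> bool:
    """Run one test program in a fresh interpreter; pass iff it exits with 0."""
    try:
        completed = subprocess.run(
            [sys.executable, "-c", test_program],
            capture_output=True,  # keep the child's output off our stdout
            timeout=timeout,      # hypothetical limit; tune per problem
        )
    except subprocess.TimeoutExpired:
        return False
    return completed.returncode == 0


# State set by one program is not visible to the next one.
first = run_isolated("import builtins; builtins.leaked = 1")
second = run_isolated("import builtins; assert not hasattr(builtins, 'leaked')")
print(first, second)
```

Starting an interpreter per problem costs some time, but it avoids both the stateful-execution issue and the import leakage described above.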

Citation

If you find our work helpful, please cite it as

 @article{Lai2022DS1000,
  title={DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation},
  author={Yuhang Lai and Chengxi Li and Yiming Wang and Tianyi Zhang and Ruiqi Zhong and Luke Zettlemoyer and Scott Wen-tau Yih and Daniel Fried and Sida Wang and Tao Yu},
  journal={ArXiv},
  year={2022},
  volume={abs/2211.11501}
}
