self refine
1.0.0
通过 Self-Refine,法学硕士可以生成有关其工作的反馈,用它来改进输出,并重复此过程。
目录
斯托克斯定理示例
独角兽示例
prompt-lib
: git clone https://github.com/reasoning-machines/prompt-lib
pip install prompt-lib/
根据您的默认设置,您可能需要使用以下命令来设置 PYTHONPATH:
export PYTHONPATH= " .:../:.:src:../:../../:.:prompt-lib "
python -u src/acronym/run.py " Using language models of code for few-shot commonsense "
0 INIT > Using language models of code for few - shot commonsense
0 GEN > CLoCK
0 SCORES > * Ease of pronunciation : CLoCK is pronounced " clah-k " . This is easy acronym to pronounce . 4 / 5
* Ease of spelling : CLoCK is easy to spell . 5 / 5
* Relation to title : CLoCK stands for " Code Language Models of Commonsense Knowledge " which is related to the title . 5 / 5
* Positive connotation : CLoCK is a positive acronym . It implies accuracy and efficiency . 5 / 5
* Well - known : CLoCK is not a well - known acronym . 2 / 5
* Total score : 21 / 25
1 GEN > CMNSC
1 SCORES > * Ease of pronunciation : CMNSC is pronounced " see-min-es-see " . This is a difficult acronym to pronounce . 1 / 5
* Ease of spelling : CMNSC is difficult to spell . 1 / 5
* Relation to title : CMNSC stands for " Commonsense " which is related to the title . 4 / 5
* Positive connotation : CMNSC is a neutral acronym . 3 / 5
* Well - known : CMNSC is not a well - known acronym . 1 / 5
* Total score : 10 / 25
2 GEN > COMMIT
2 SCORES > * Ease of pronunciation : COMMIT is pronounced " com-mit " . This is an easy acronym to pronounce . 4 / 5
* Ease of spelling : COMMIT is easy to spell . 5 / 5
* Relation to title : COMMIT stands for " Commonsense Model of Code for Intuitive Training " which is related to the title . 5 / 5
* Positive connotation : COMMIT is a positive acronym . It implies commitment and dedication . 5 / 5
* Well - known : COMMIT is not a well - known acronym . 2 / 5
* Total score : 21 / 25
PYTHONPATH= " . " python -u src/responsegen/run.py --output < OUTPUT FILE > --size < DATA SIZE >
PYTHONPATH= " . " python -u src/readability/readability.py --output < OUTPUT FILE >
PYTHONPATH= " . " python -u src/readability/{count_comment | count_function | count_meaningful_var}.py --file < INPUT FILE >
data/prompt/commongen
中。您可以通过运行以下命令来下载数据: python -u src/commongen/run.py cmd stair bubble team dryer puppy aliens cat
python -u src/gsm/run.py
输出将保存在data/tasks/gsm/gsm_outputs.jsonl
中
评估输出:
python src/gsm/gsm_selfref_eval.py --path data/tasks/gsm/gsm_outputs.jsonl
data/tasks/gsm/gsm_outputs.jsonl.reports.txt
),显示错误生成、反馈和细化反馈生成的示例。 python -u src/sentiment_transfer_sr/run.py data/tasks/yelp/yelp-extreme.jso
nl 4 none
data/tasks/yelp/
python -u src/pie/run.py --slow_programs_file data/tasks/pie/codenet-python-test-1k.jsonl --max_attempts 4 --outfile data/tasks/pie/output --feedback_type rich
Init
:用于初始化任务。这就是初始输出的生成方式。
Feedback
:用于从模型获取关于中间结果的反馈。
Iterate
:用于根据反馈从模型中获取下一次迭代。
每个任务都有一个run.py
,用于初始化提示并运行任务。
以commongen为例,提示如下:
python src/commongen/task_init.py
python src/commongen/feedback.py
python src/commongen/task_iterate.py
您还可以在我们的网站上看到这些提示。
@misc{madaan2023selfrefine,
title = {Self - Refine: Iterative Refinement with Self - Feedback},
author = {Aman Madaan and Niket Tandon and Prakhar Gupta and Skyler Hallinan and Luyu Gao and Sarah Wiegreffe and Uri Alon and Nouha Dziri and Shrimai Prabhumoye and Yiming Yang and Sean Welleck and Bodhisattwa Prasad Majumder and Shashank Gupta and Amir Yazdanbakhsh and Peter Clark},
year = { 2023 },
eprint = { 2303 . 17651 },
archivePrefix = {arXiv},
primaryClass = { cs . CL }
}
流程图LR
生成器-->|初始化|粗
Critic_1 --> Critique_fb
... --> Critique_fb
Critic_k --> Critique_fb
Critique_fb --> 未精炼{要精炼的输出}
未精炼 --> 精炼者
精炼器 --> |R: y_t, x, fb| Refined_Output{精炼输出}
Refined_Output --> |未满足停止条件|粗