calm textgame下载 - calm textgame源代码下载

calm textgame

Ai源码

1.0.0

下载

上下文动作语言模型 (CALM) 和 ClubFloyd 数据集

EMNLP 2020 论文《Keep CALM and Explore: Language Models for Action Generation in Text-based Games》的代码和数据。

概述

我们的ClubFloyd 数据集( calm/lm_data.zip ) 是从 ClubFloyd 网站抓取的，包含 426 个人类游戏记录，涵盖 590 个不同类型和风格的基于文本的游戏。

该数据由 223,527 个上下文-操作对组成，格式为[CLS] observation [SEP] action [SEP] next observation [SEP] next action [SEP] 。我们使用[CLS] observation [SEP] action [SEP] next observation [SEP]作为上下文来训练语言模型（n-gram，GPT-2）来预测next action [SEP] ，并表明这种动作生成能力泛化与强化学习相结合时，可以识别未见过的游戏并支持游戏玩法。

入门

克隆存储库并安装依赖项：

pip install torch==1.4 transformers==2.5.1 jericho fasttext wandb importlib_metadata
git clone https://github.com/princeton-nlp/calm-textgame && cd calm-textgame
ln -s ../lm calm && ln -s ../lm drrn

（如果 fasttext 的 pip 安装失败，请尝试此处的构建步骤：https://github.com/facebookresearch/fastText#building-fasttext-for-python）

火车平静：

 cd calm
unzip lm_data.zip
python train.py

可以在此处下载 GPT-2 和 n-gram 模型的训练模型权重。

然后使用经过训练的 CALM 训练 DRRN：

 cd ../drrn
python train.py --rom_path ../games/ ${GAME} --lm_path ${PATH_TO_CALM} --lm_type ${gpt_or_ngram}

要快速尝试 GPT-2 CALM 模型：

 from lm import GPT2LM
model = GPT2LM ( "model_weights/gpt2" )
print ( model . generate ( "[CLS] observation [SEP] action [SEP] next observation [SEP]" , k = 30 ))

引文

 @inproceedings{yao2020calm,
    title={Keep CALM and Explore: Language Models for Action Generation in Text-based Games},
    author={Yao, Shunyu and Rao, Rohan and Hausknecht, Matthew and Narasimhan, Karthik},
    booktitle={Empirical Methods in Natural Language Processing (EMNLP)},
    year={2020}
}