DSTC6 End to End Conversation Modeling Download - DSTC6 End to End Conversation Modeling Source code download

DSTC6 End to End Conversation Modeling

AI Source Code

1.0.0

Download

DSTC6: End-to-End Conversation Modeling Track

Registration

Please register: https://goo.gl/forms/Fxy061gHuSOZGC1i2

News

Evaluation analysis package: Jan 19 2018

The package includes all references generated by 11 humans, hypotheses of 20 systems, and evaluation results in DSTC6 end-to-end conversation modeling track. https://www.dropbox.com/s/oh1trbos0tjzn7t/dstc6_t2_evaluation.tgz
Download the official training data: Sep 7-18 2017
Test data distribution: Sep 25 2017
Submission: Oct 8 2017

Easy 3 Step Data Collection

Track Description

Main task (mandatory): Customer service dialog using Twitter

(*) The tools to download the twitter data and transform to the dialog format from the data are provided.

Task A: Full or part of the training data will be used to train conversation models.

Task B: Any open data, e.g. from web, are available as external knowledge to generate informative sentences. But they should not overlap with the training, validation and test data provided by organizers.
Pilot task: Movie scenario dialog using OpenSubtitle

Please cite the following paper if you will publish the results using this setup:

https://arxiv.org/pdf/1706.07440.pdf

@article{DSTC6_End-to-End_Conversation_Modeling,
  Author = {Chiori Hori and Takaaki Hori},
  Title = {End-to-end Conversation Modeling Track in DSTC6},    
  Journal = {arXiv:1706.07440},    
  Year = {2017}
}

Necessary steps

Preparation

Most tools are written in python, which were tested on python2.7.6+ and python3.4.1+, and some bash scripts are also used to execute those tools.

For data preparation, you will need additional python modules as follows:

six
tqdm
nltk

which can be installed by

pip install <module-name>

or

pip install <module-name> -t <some-directory>

where <some-directory> is a directory storing python modules and needs to be accessible from python, e.g. by including it in PYTHONPATH environment variable.

If you try the baseline system, you will need Chainer http://chainer.org ,a deep learning toolkit, to perform training and evaluation of neural conversation models. Please follow the instruction in ChatbotBaseline/README.md.

Twitter task

prepare data set using collect_twitter_dialogs scripts.
```
$ cd collect_twitter_dialogs
$ collect.sh
```
(a twitter account and access keys are necessary to run the script. follow the instruction in collect_twitter_dialogs/README.md)
extract training, development and test sets from stored twitter dialog data
```
$ cd ../tasks/twitter
$ make_trial_data.sh
```
Note: the extracted data are trial data at this moment.
run baseline system (optional)
```
$ cd ../../ChatbotBaseline/egs/twitter
$ run.sh
```
(see ChatbotBaseline/README.md)

OpenSubtitles task

download OpenSubtitles2016 data

$ cd tasks/opensubs
$ wget http://opus.lingfil.uu.se/download.php?f=OpenSubtitles2016/en.tar.gz
$ tar zxvf en.tar.gz

extract training, development and test sets from stored subtitle data
```
$ make_trial_data.sh
```
Note: the extracted data are trial data at this moment.
run baseline system (optional)
```
$ cd ../../ChatbotBaseline/egs/opensubs
$ run.sh
```
(see ChatbotBaseline/README.md)

Directories and files

README.md : this file
tasks : data preparation for each subtask
collect_twitter_dialogs : scripts to collect twitter data
ChatbotBaseline : a neural conversation model baseline system

Contact Information

You can get the latest updates and participate in discussions on DSTC mailing list

To join the mailing list, send an email to: ([email protected]) putting "subscribe DSTC" in the body of the message (without the quotes). To post a message, send your message to: ([email protected]).

Expand

Additional Information

Version 1.0.0
Type AI Source Code
Update Time 2024-12-16
size 241.85KB
From Github

Related Applications

Aeon’s End

2022-08-28
Stars End

2022-08-24
Dead End Job

2022-08-22
The End: Inari’s Quest

2022-08-20
Watchmen: The End Is Nigh

2022-08-10
Fateful End: True Case Files

2022-07-24

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
node telegram bot api

AI Source Code

v0.50.0
typebot.io

AI Source Code

v3.1.2
python wechaty getting started

AI Source Code

1.0.0
waymo open dataset

Other source code

December 2023 Update
termwind

Other categories

v2.3.0
wp functions

Other categories

1.0.0

Related Information All