This repository contains a new adversarial training method for our Generative Conversational Agent (GCA).
Further details on this new training method can be found in the paper: Oswaldo Ludwig, "End-to-end Adversarial Learning for Generative Conversational Agents," arXiv:1711.10122 [cs.CL], Nov 2017. If you publish work that uses ideas or pieces of code from this repository, please cite this paper.
Our method treats the GCA as a generator that aims to fool a discriminator which labels dialogues as human-generated or machine-generated. In our approach, the discriminator performs token-level classification, i.e. it indicates whether the current token was generated by a human or by the machine. To do so, the discriminator also receives as input the context utterances (the dialogue history) and the incomplete answer up to the current token. This makes end-to-end training by backpropagation possible. A self-conversation process produces a more diverse set of generated data for the adversarial training. This approach improves performance on questions not related to the training data.
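For intuition, the token-level discrimination can be sketched as follows (a minimal illustration with hypothetical names, not the repository's code): the discriminator scores every prefix of the generated answer given the padded dialogue context.

```python
from keras.preprocessing.sequence import pad_sequences

# Hypothetical helper: `discriminator` is assumed to be a Keras model taking
# [padded context, padded partial answer] and returning a human/machine
# probability. It is called once per prefix of the answer, so one score is
# produced per generated token.
def token_level_scores(discriminator, context, answer_ids, maxlen_answer):
    scores = []
    for t in range(1, len(answer_ids) + 1):
        partial = pad_sequences([answer_ids[:t]], maxlen=maxlen_answer)  # incomplete answer up to token t
        p_human = discriminator.predict([context, partial], verbose=0)[0, 0]
        scores.append(float(p_human))
    return scores
```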
The pre-trained model available here was trained on a dataset collected from dialogues of online English courses, which is also available here.
Our GCA model can be explained by the following flowchart:
while the following pseudocode explains our GCA algorithm:
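In Python terms, that token-by-token decoding loop amounts roughly to the following (a hedged sketch only; `gca_model`, `BOS`, `EOS` and the padding lengths are assumed names, not the repository's exact identifiers):

```python
import numpy as np
from keras.preprocessing.sequence import pad_sequences

# Greedy decoding sketch: the model is assumed to take the padded context and
# the incomplete answer so far, and to return a distribution over the next token.
def generate_answer(gca_model, context, maxlen_answer, BOS, EOS):
    answer_ids = [BOS]                                   # start with the begin-of-sentence token
    for _ in range(maxlen_answer - 1):
        partial = pad_sequences([answer_ids], maxlen=maxlen_answer)
        probs = gca_model.predict([context, partial], verbose=0)[0]
        next_id = int(np.argmax(probs))                  # greedy choice of the next token
        answer_ids.append(next_id)
        if next_id == EOS:                               # stop at the end-of-sentence token
            break
    return answer_ids
```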
Our new end-to-end adversarial training can be explained by the following Keras model (implemented in the file train_bot_GAN.py), which is composed of the generator and the discriminator. The yellow blocks belong to the GCA (the generator), while the green blocks belong to the discriminator. The white blocks are shared between the generator and the discriminator:
while the following pseudocode explains the new algorithm (see the paper for the definition of the variables):
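As an illustration only, a heavily simplified Keras stand-in for this composite model might look like the sketch below. Layer types, sizes, and names here are assumptions, not the architecture of train_bot_GAN.py; the point is that the generator's softmax output feeds directly into the discriminator, so gradients flow end-to-end through both.

```python
from keras.models import Model
from keras.layers import Input, Embedding, LSTM, Dense, concatenate

VOCAB, DIM, MAXLEN = 7000, 100, 50                      # assumed sizes

context_in = Input(shape=(MAXLEN,), name="context")
answer_in = Input(shape=(MAXLEN,), name="partial_answer")

shared_emb = Embedding(VOCAB, DIM, name="shared_embedding")      # "white" block: shared
ctx_enc = LSTM(256)(shared_emb(context_in))                       # "yellow" blocks: generator encoders
ans_enc = LSTM(256)(shared_emb(answer_in))
next_token = Dense(VOCAB, activation="softmax",
                   name="generator_output")(concatenate([ctx_enc, ans_enc]))

disc_hidden = Dense(128, activation="relu")(                       # "green" blocks: discriminator
    concatenate([ctx_enc, ans_enc, next_token]))
human_prob = Dense(1, activation="sigmoid", name="discriminator_output")(disc_hidden)

adversarial_model = Model([context_in, answer_in], [next_token, human_prob])
adversarial_model.compile(optimizer="adam",
                          loss=["categorical_crossentropy", "binary_crossentropy"])
```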
To chat with the pre-trained models:
To evaluate dialog lines using the pre-trained discriminator:
To train end-to-end using the new adversarial method:
If you want to start the adversarial training from scratch, make the weight file my_model_weights.h5 (pre-trained with the new adversarial method) equal to my_model_weights20.h5 (pre-trained by teacher forcing) and run train_script.py.
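For example, assuming both weight files and train_script.py sit in the repository root, those two steps can be done like this:

```python
import shutil
import subprocess

# Reuse the teacher-forcing weights as the starting point for adversarial training,
# then launch the training script.
shutil.copyfile("my_model_weights20.h5", "my_model_weights.h5")
subprocess.run(["python", "train_script.py"], check=True)
```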