Building a Large-scale Persona Dialog Dataset

Yinhe Zheng, G. Chen, Minlie Huang

Research output: Contribution to conferenceAbstractAcademic

Abstract

We proposed a primary version of a large scale multi-turn dialogue dataset in Chinese that contains over 25 million sessions of dialogues crawled from Weibo1. Diversified personality traits for each dialogue participant are collected to facilitate modelling persona in dialogues. Our dataset fills the blank of the resources for
building personalised dialogue systems in open-domain conversations and can also
serves as an important resource for a wide range of studies.
Original languageEnglish
Publication statusPublished - 8 Nov 2018
EventThe workshop on natural language generation for human robot interaction - Tilburg University, Tilburg, Netherlands
Duration: 8 Nov 20188 Nov 2018
https://hbuschme.github.io/nlg-hri-workshop-2018/

Workshop

WorkshopThe workshop on natural language generation for human robot interaction
Country/TerritoryNetherlands
CityTilburg
Period8/11/188/11/18
Internet address

Fingerprint

Dive into the research topics of 'Building a Large-scale Persona Dialog Dataset'. Together they form a unique fingerprint.

Cite this