The Japanese-English business conversation corpus, namely Business Scene Dialogue corpus, was constructed in 3 steps:
Half of the monolingual scenarios were written in Japanese and the other half were written in English. The whole construction process was supervised by a person who satisfies the following conditions to guarantee the conversations to be natural:
The BSD corpus is split into balanced training, development and evaluation sets. The documents in these sets are balanced in terms of scenes and original languages. In this repository we publicly share the full development and evaluation sets and a part of the training data set.
Source: BSDPaper | Code | Results | Date | Stars |
---|