Recently, data-driven task-oriented dialogue systems have achieved promising performance in English.
Spoken Language Understanding (SLU) mainly involves two tasks, intent detection and slot filling, which are generally modeled jointly in existing works.
Computing author intent from multimodal data like Instagram posts requires modeling a complex relationship between text and image.
The subject area of our system is very specific, which is why training data is scarce.