no code implementations • 30 May 2021 • Jianning Wu, Zhuqing Jiang, Shiping Wen, Aidong Men, Haiying Wang
For multimodal tasks, a good feature extraction network should extract information as much as possible and ensure that the extracted feature embedding and other modal feature embedding have an excellent mutual understanding.