no code implementations • Findings (ACL) 2022 • Zihan Wang, Jiuxiang Gu, Jason Kuen, Handong Zhao, Vlad Morariu, Ruiyi Zhang, Ani Nenkova, Tong Sun, Jingbo Shang
We present a comprehensive study of sparse attention patterns in Transformer models.
no code implementations • EMNLP 2021 • Zihan Wang, Chengyu Dong, Jingbo Shang
In this paper, we present an empirical property of these representations: "average" approximates "first principal component".
no code implementations • 25 Apr 2024 • Xiang Li, Yiqun Yao, Xin Jiang, Xuezhi Fang, Chao Wang, Xinzhang Liu, Zihan Wang, Yu Zhao, Xin Wang, Yuyao Huang, Shuangyong Song, Yongxiang Li, Zheng Zhang, Bo Zhao, Aixin Sun, Yequan Wang, Zhongjiang He, Zhongyuan Wang, Xuelong Li, Tiejun Huang
Large language models (LLMs) have showcased profound capabilities in language understanding and generation, facilitating a wide array of applications.
no code implementations • 10 Apr 2024 • Chenyang An, Zhibo Chen, Qihao Ye, Emily First, Letian Peng, Jiayun Zhang, Zihan Wang, Sorin Lerner, Jingbo Shang
Recent advances in Automated Theorem Proving have shown the effectiveness of leveraging a (large) language model that generates tactics (i.e., proof steps) to search through proof states.
1 code implementation • 9 Apr 2024 • Zihan Wang, Siyang Song, Cheng Luo, Songhe Deng, Weicheng Xie, Linlin Shen
Human facial action units (AUs) are mutually related in a hierarchical manner: not only are they associated with each other in both the spatial and temporal domains, but AUs located in the same or nearby facial regions also show stronger relationships than those in different facial regions.
2 code implementations • 7 Apr 2024 • Zihan Wang, Bowen Li, Chen Wang, Sebastian Scherer
Few-shot object detection has drawn increasing attention in the field of robotic exploration, where robots are required to find unseen objects with a few online provided examples.
1 code implementation • 3 Apr 2024 • Yifan Xu, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Wenyi Zhao, Jie Tang, Yuxiao Dong
Large language models (LLMs) have shown excellent mastery of human language, but still struggle in real-world applications that require mathematical problem-solving.
1 code implementation • 2 Apr 2024 • Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Junjie Hu, Ming Jiang, Shuqiang Jiang
Vision-and-language navigation (VLN) enables the agent to navigate to a remote location following the natural language instruction in 3D environments.
1 code implementation • 30 Mar 2024 • Letian Peng, Zilong Wang, Feng Yao, Zihan Wang, Jingbo Shang
We construct the distillation dataset via sampling sentences from language model pre-training datasets (e.g., OpenWebText in our implementation) and prompting an LLM to identify the typed spans of "important information".
1 code implementation • 17 Mar 2024 • Zihan Wang, Fanheng Kong, Shi Feng, Ming Wang, Han Zhao, Daling Wang, Yifei Zhang
Furthermore, we conduct extensive experiments to delve deeper into the potential of Mamba compared to the Transformer in time series forecasting (TSF).
no code implementations • 16 Mar 2024 • Zihan Wang, Jiayu Xiao, Mengxiang Li, Zhongjiang He, Yongxiang Li, Chao Wang, Shuangyong Song
In our dynamic world where data arrives in a continuous stream, continual learning enables us to incrementally add new tasks/domains without the need to retrain from scratch.
no code implementations • 11 Mar 2024 • Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj
Foundation models are usually pre-trained on large-scale datasets and then adapted to downstream tasks through tuning.
1 code implementation • 29 Feb 2024 • Zihan Wang, Peiyi Wang, Houfeng Wang
Hierarchical text classification (HTC) is a challenging subtask of multi-label classification due to its complex taxonomic structure.
no code implementations • 21 Feb 2024 • Yongquan He, Zihan Wang, Peng Zhang, Zhaopeng Tu, Zhaochun Ren
To address this issue, recent works apply the graph neural network on the existing neighbors of the unseen entities.
1 code implementation • 15 Feb 2024 • Letian Peng, Yuwei Zhang, Zilong Wang, Jayanth Srinivasa, Gaowen Liu, Zihan Wang, Jingbo Shang
This work aims to build a text embedder that can capture characteristics of texts specified by user instructions.
no code implementations • 13 Feb 2024 • Sheng Liu, Zihan Wang, Qi Lei
In this work, we propose a strong reconstruction attack in the setting of federated learning.
no code implementations • 5 Feb 2024 • Zihan Wang, Yunxuan Li, Yuexin Wu, Liangchen Luo, Le Hou, Hongkun Yu, Jingbo Shang
Process supervision, using a trained verifier to evaluate the intermediate steps generated by a reasoner, has demonstrated significant improvements in multi-step problem solving.
1 code implementation • 15 Jan 2024 • Dan Zhang, Ziniu Hu, Sining Zhoubian, Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, Jie Tang
To bridge these gaps, we introduce SciGLM, a suite of scientific language models able to conduct college-level scientific reasoning.
no code implementations • 13 Jan 2024 • Mengtian Li, Shaohui Lin, Zihan Wang, Yunhang Shen, Baochang Zhang, Lizhuang Ma
Semi-supervised learning (SSL), thanks to the significant reduction of data annotation costs, has been an active research topic for large-scale 3D scene understanding.
no code implementations • 8 Jan 2024 • Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang, Bingkai Yang, Xuewei he, Zhuoru Jiang, Qiyi Xie, Yanhan Zhang, Zhongqiu Li, Lingling Shi, Weiwei Fu, Yin Zhang, Zilu Huang, Sishi Xiong, Yuxiang Zhang, Chao Wang, Shuangyong Song
Subsequently, the model undergoes fine-tuning to align with human preferences, following a detailed methodology that we describe.
2 code implementations • 19 Dec 2023 • Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun
They endow Large Language Models (LLMs) with powerful capabilities in visual understanding, enabling them to tackle diverse multi-modal tasks.
1 code implementation • 14 Dec 2023 • Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxuan Zhang, Juanzi Li, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang
People are spending an enormous amount of time on digital devices through graphical user interfaces (GUIs), e.g., computer or smartphone screens.
Ranked #14 on Visual Question Answering on MM-Vet
1 code implementation • 10 Dec 2023 • Yougang Lyu, Jitai Hao, Zihan Wang, Kai Zhao, Shen Gao, Pengjie Ren, Zhumin Chen, Fang Wang, Zhaochun Ren
Multiple defendants in a criminal fact description generally exhibit complex interactions, and cannot be well handled by existing Legal Judgment Prediction (LJP) methods which focus on predicting judgment results (e.g., law articles, charges, and terms of penalty) for single-defendant cases.
1 code implementation • 6 Nov 2023 • Letian Peng, Zihan Wang, Jingbo Shang
We study the named entity recognition (NER) problem under the extremely weak supervision (XWS) setting, where only one example entity per type is given in a context-free way.
1 code implementation • 3 Nov 2023 • Letian Peng, Zilong Wang, Hang Liu, Zihan Wang, Jingbo Shang
With the rapid development of the internet, online social media welcomes people with different backgrounds through its diverse content.
no code implementations • 31 Oct 2023 • Max Balsells, Marcel Torne, Zihan Wang, Samedh Desai, Pulkit Agrawal, Abhishek Gupta
We evaluate this system on a suite of robotic tasks in simulation and demonstrate its effectiveness at learning behaviors both in simulation and the real world.
no code implementations • 26 Oct 2023 • Zi Lin, Zihan Wang, Yongqi Tong, Yangkun Wang, Yuxin Guo, Yujia Wang, Jingbo Shang
This benchmark contains the rich, nuanced phenomena that can be tricky for current toxicity detection models to identify, revealing a significant domain difference compared to social media content.
no code implementations • 20 Oct 2023 • Xinyu Hu, Pengfei Tang, Simiao Zuo, Zihan Wang, Bowen Song, Qiang Lou, Jian Jiao, Denis Charles
In Evoke, there are two instances of the same LLM: one acts as a reviewer (LLM-Reviewer) and scores the current prompt; the other acts as an author (LLM-Author) and edits the prompt by considering the edit history and the reviewer's feedback.
1 code implementation • 15 Oct 2023 • Zihan Wang, Ziqi Zhao, Zhumin Chen, Pengjie Ren, Maarten de Rijke, Zhaochun Ren
To address this limitation, recent studies enable generalization to an unseen target domain with only a few labeled examples using data augmentation techniques.
1 code implementation • 4 Oct 2023 • Xiaohan Fu, Zihan Wang, Shuheng Li, Rajesh K. Gupta, Niloofar Mireshghallah, Taylor Berg-Kirkpatrick, Earlence Fernandes
Large Language Models (LLMs) are being enhanced with the ability to use tools and to process multiple modalities.
no code implementations • 4 Oct 2023 • An Yan, Yu Wang, Yiwu Zhong, Zexue He, Petros Karypis, Zihan Wang, Chengyu Dong, Amilcare Gentili, Chun-Nan Hsu, Jingbo Shang, Julian McAuley
Medical image classification is a critical problem for healthcare, with the potential to alleviate the workload of doctors and facilitate diagnoses of patients.
1 code implementation • 19 Sep 2023 • Xingyao Wang, Zihan Wang, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng, Heng Ji
However, current evaluation protocols often emphasize benchmark performance with single-turn exchanges, neglecting the nuanced interactions among the user, LLMs, and external tools, while also underestimating the importance of natural language feedback from users.
1 code implementation • ICCV 2023 • Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Shuqiang Jiang
Vision-and-language navigation (VLN) enables the agent to navigate to a remote location following the natural language instruction in 3D environments.
1 code implementation • 20 Jul 2023 • Marcel Torne, Max Balsells, Zihan Wang, Samedh Desai, Tao Chen, Pulkit Agrawal, Abhishek Gupta
This procedure can leverage noisy, asynchronous human feedback to learn policies with no hand-crafted reward design or exploration bonuses.
1 code implementation • 13 Jun 2023 • Liyang Liu, Zihan Wang, Minh Hieu Phan, BoWen Zhang, Jinchao Ge, Yifan Liu
Current knowledge distillation approaches in semantic segmentation tend to adopt a holistic approach that treats all spatial locations equally.
1 code implementation • 8 Jun 2023 • Jiongnan Liu, Jiajie Jin, Zihan Wang, Jiehan Cheng, Zhicheng Dou, Ji-Rong Wen
To support research in this area and facilitate the development of retrieval-augmented LLM systems, we develop RETA-LLM, a RETrieval-Augmented LLM toolkit.
no code implementations • 25 May 2023 • Zihan Wang, Arthur Jacot
The $L_{2}$-regularized loss of Deep Linear Networks (DLNs) with more than one hidden layer has multiple local minima, corresponding to matrices with different ranks.
no code implementations • 25 May 2023 • Zihan Wang, Yang Yang, Zhi Liu, Yifan Zheng
Our related research reviews multiple recently proposed works and compares the advantages and disadvantages of the derived deep learning frameworks, rather than machine learning frameworks.
1 code implementation • 24 May 2023 • Yuwei Zhang, Zihan Wang, Jingbo Shang
First, we prompt ChatGPT for insights on the clustering perspective by constructing hard triplet questions <does A better correspond to B than C>, where A, B and C are similar data points that belong to different clusters according to a small embedder.
1 code implementation • 24 May 2023 • Chengyu Dong, Zihan Wang, Jingbo Shang
We show that the limited performance of seed matching is largely due to the label bias injected by the simple seed-match rule, which prevents the classifier from learning reliable confidence for selecting high-quality pseudo-labels.
1 code implementation • 23 May 2023 • Zihan Wang, Jingbo Shang, Ruiqi Zhong
We propose a new task formulation, "Goal-Driven Clustering with Explanations" (GoalEx), which represents both the goal and the explanations as free-form language descriptions.
1 code implementation • 22 May 2023 • Zihan Wang, Tianle Wang, Dheeraj Mekala, Jingbo Shang
Extremely Weakly Supervised Text Classification (XWS-TC) refers to text classification based on minimal high-level human guidance, such as a few label-indicative seed words or classification instructions.
1 code implementation • 21 May 2023 • Tianle Wang, Zihan Wang, Weitang Liu, Jingbo Shang
State-of-the-art weakly supervised text classification methods, while significantly reducing the required human supervision, still require the supervision to cover all the classes of interest.
1 code implementation • 17 May 2023 • Zihan Wang, Kai Zhao, Yongquan He, Zhumin Chen, Pengjie Ren, Maarten de Rijke, Zhaochun Ren
Recent work on knowledge graph completion (KGC) focused on learning embeddings of entities and relations in knowledge graphs.
no code implementations • 18 Apr 2023 • Zihan Wang, Gang Wu, Haotong Wang
First, inter-session dependencies are not differentiated at the factor-level.
no code implementations • 18 Apr 2023 • Zihan Wang, Gang Wu, Haotong Wang
At the factor level, we employ Disentangled Representation Learning to obtain finer-grained data (e.g., factor-level embeddings), with which we can construct factor-level convolution channels.
2 code implementations • 30 Mar 2023 • Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang
Large pre-trained code generation models, such as OpenAI Codex, can generate syntax- and function-correct code, making the coding of programmers more productive and our pursuit of artificial general intelligence closer.
Ranked #81 on Code Generation on MBPP
1 code implementation • CVPR 2023 • Xiangyang Li, Zihan Wang, Jiahao Yang, YaoWei Wang, Shuqiang Jiang
The proposed KERM can automatically select and gather crucial and relevant cues, obtaining more accurate action prediction.
1 code implementation • 19 Mar 2023 • Zihan Wang, Siyang Song, Cheng Luo, Yuzhi Zhou, Shiling Wu, Weicheng Xie, Linlin Shen
This paper presents our Facial Action Units (AUs) detection submission to the fifth Affective Behavior Analysis in-the-wild Competition (ABAW).
1 code implementation • CVPR 2023 • Anthony Chen, Kevin Zhang, Renrui Zhang, Zihan Wang, Yuheng Lu, Yandong Guo, Shanghang Zhang
Masked Autoencoders learn strong visual representations and achieve state-of-the-art results in several independent modalities, yet very few works have addressed their capabilities in multi-modality settings.
1 code implementation • 13 Feb 2023 • Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas
Reinforcement learning algorithms typically struggle in the absence of a dense, well-shaped reward function.
no code implementations • 21 Dec 2022 • Zihan Wang, Naoki Yoshinaga
Therefore, in this study, we introduce a task of generating game commentaries from structured data records to address the problem.
no code implementations • 7 Dec 2022 • Zihan Wang, Jason D. Lee, Qi Lei
Understanding when and how much a model gradient leaks information about the training sample is an important question in privacy.
no code implementations • 14 Nov 2022 • Weijie Sun, Sunil Vasu Kalmady, Nariman Sepehrvand, Luan Manh Chu, Zihan Wang, Amir Salimi, Abram Hindle, Russell Greiner, Padma Kaul
Pandemic outbreaks such as COVID-19 occur unexpectedly, and need immediate action due to their potential devastating consequences on global health.
no code implementations • 31 Oct 2022 • Zihan Wang, Qi Meng, HaiFeng Lan, Xinrui Zhang, Kehao Guo, Akshat Gupta
While Speech Emotion Recognition (SER) is a common application for popular languages, it continues to be a problem for low-resourced languages, i.e., languages with no pretrained speech-to-text recognition models.
10 code implementations • 5 Oct 2022 • Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, WenGuang Chen, Peng Zhang, Yuxiao Dong, Jie Tang
We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model with 130 billion parameters.
Ranked #1 on Language Modelling on CLUE (OCNLI_50K)
no code implementations • 5 Oct 2022 • Yufan Zhuang, Zihan Wang, Fangbo Tao, Jingbo Shang
Recent works show that learning attention in the Fourier space can improve the long sequence learning capability of Transformers.
no code implementations • 16 Sep 2022 • Yilun Hao, Ruinan Wang, Zhangjie Cao, Zihan Wang, Yuchen Cui, Dorsa Sadigh
Specifically, we design a masked policy network with a binary mask to block certain modalities.
1 code implementation • 15 Sep 2022 • Pingyi Hu, Zihan Wang, Ruoxi Sun, Hu Wang, Minhui Xue
To achieve this, we propose Multi-modal Models Membership Inference (M^4I) with two attack methods to infer the membership status, named metric-based (MB) M^4I and feature-based (FB) M^4I, respectively.
1 code implementation • 25 Jun 2022 • Akide Liu, Zihan Wang
This competition focuses on Urban-Sense Segmentation based on the vehicle camera view.
1 code implementation • 24 Jun 2022 • Zihan Wang, Na Huang, Fei Sun, Pengjie Ren, Zhumin Chen, Hengliang Luo, Maarten de Rijke, Zhaochun Ren
To address the above limitations, we propose a Debiasing Learning for Membership Inference Attacks against recommender systems (DL-MIA) framework that has four main components: (1) a difference vector generator, (2) a disentangled encoder, (3) a weight estimator, and (4) an attack model.
no code implementations • 4 Jun 2022 • Zihan Wang, Ruimin Chen, Mengxuan Liu, Guanfang Dong, Anup Basu
We propose a method SPGNet for 3D human pose estimation that mixes multi-dimensional re-projection into supervised learning.
Ranked #46 on 3D Human Pose Estimation on Human3.6M
1 code implementation • 28 May 2022 • Ziang Li, Ming Ding, Weikai Li, Zihan Wang, Ziyu Zeng, Yukuo Cen, Jie Tang
graph benchmark (IGB) consisting of 4 datasets.
no code implementations • 24 May 2022 • Lesheng Jin, Zihan Wang, Jingbo Shang
Inspired by this observation, in WeDef, we define the reliability of samples based on whether the predictions of the weak classifier agree with their labels in the poisoned training set.
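The agreement criterion in the sentence above can be sketched in a few lines. This is only an illustration of "prediction agrees with label", not the WeDef implementation: the function name `split_by_reliability`, the keyword-based `weak_predict` classifier, and the toy data are all hypothetical.

```python
from typing import Callable, List, Sequence, Tuple


def split_by_reliability(
    samples: Sequence[str],
    labels: Sequence[str],
    weak_predict: Callable[[str], str],
) -> Tuple[List[int], List[int]]:
    """Return (reliable, unreliable) sample indices.

    A sample counts as reliable when the weak classifier's prediction
    agrees with its (possibly poisoned) training label.
    """
    reliable: List[int] = []
    unreliable: List[int] = []
    for index, (text, label) in enumerate(zip(samples, labels)):
        if weak_predict(text) == label:
            reliable.append(index)
        else:
            unreliable.append(index)
    return reliable, unreliable


def weak_predict(text: str) -> str:
    # Hypothetical weak classifier: a single seed-word rule.
    return "pos" if "great" in text else "neg"


# Toy training set; the label of sample 2 is assumed to be flipped.
samples = ["great movie", "terrible plot", "great acting", "awful pacing"]
labels = ["pos", "neg", "neg", "neg"]

reliable, unreliable = split_by_reliability(samples, labels, weak_predict)
```

Here the disagreeing sample (index 2) ends up in the unreliable set; what is then done with the two sets is outside the scope of this sketch.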
1 code implementation • 24 May 2022 • Zihan Wang, Kewen Zhao, Zilong Wang, Jingbo Shang
Fine-tuning pre-trained language models has recently become a common practice in building NLP models for various tasks, especially few-shot tasks.
1 code implementation • ACL 2022 • Jinyu Guo, Kai Shuang, Jijie Li, Zihan Wang, Yixuan Liu
However, no matter how the dialogue history is used, each existing model uses its own consistent dialogue history during the entire state tracking process, regardless of which slot is updated.
no code implementations • 9 May 2022 • Zihan Wang, Gang Wu, Yan Wang
The RNN often used in previous work is not suitable to process short sessions, because RNN only focuses on the sequential relationship, which we find is not the only relationship between items in short sessions.
1 code implementation • 28 Apr 2022 • Zihan Wang, Peiyi Wang, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui, Houfeng Wang
However, in this paradigm, there exists a huge gap between classification tasks with a sophisticated label hierarchy and the masked language model (MLM) pretraining tasks of PLMs, and thus the potential of PLMs cannot be fully tapped.
1 code implementation • ACL 2022 • Zihan Wang, Peiyi Wang, Lianzhe Huang, Xin Sun, Houfeng Wang
Hierarchical text classification is a challenging subtask of multi-label classification due to its complex label hierarchy.
no code implementations • 2 Mar 2022 • Zihan Wang, Zhangjie Cao, Yilun Hao, Dorsa Sadigh
Correspondence learning is a fundamental problem in robotics, which aims to learn a mapping between state, action pairs of agents of different dynamics or embodiments.
no code implementations • 7 Feb 2022 • Zhangjie Cao, Zihan Wang, Dorsa Sadigh
Existing learning from demonstration algorithms usually assume access to expert demonstrations.
1 code implementation • 9 Nov 2021 • Zihan Wang, Jialin Lu, Oliver Snow, Martin Ester
Despite recent progress in artificial intelligence and machine learning, many state-of-the-art methods suffer from a lack of explainability and transparency.
1 code implementation • 16 Sep 2021 • Minxing Zhang, Zhaochun Ren, Zihan Wang, Pengjie Ren, Zhumin Chen, Pengfei Hu, Yang Zhang
In this paper, we make the first attempt at quantifying the privacy leakage of recommender systems through the lens of membership inference.
1 code implementation • ACL 2021 • Jinyu Guo, Kai Shuang, Jijie Li, Zihan Wang
However, the overwhelming majority of the slots in each turn should simply inherit the slot values from the previous turn.
no code implementations • 20 Jul 2021 • Zihan Wang, Olivia Byrnes, Hu Wang, Ruoxi Sun, Congbo Ma, Huaming Chen, Qi Wu, Minhui Xue
The advancement of secure communication and identity verification fields has significantly increased through the use of deep learning techniques for data hiding.
2 code implementations • 28 May 2021 • Xiaotao Gu, Zihan Wang, Zhenyu Bi, Yu Meng, Liyuan Liu, Jiawei Han, Jingbo Shang
Training a conventional neural tagger based on silver labels usually faces the risk of overfitting phrase surface names.
Ranked #1 on Phrase Tagging on KPTimes
no code implementations • 13 May 2021 • Zihan Wang, Hongye Song, Zhaochun Ren, Pengjie Ren, Zhumin Chen, Xiaozhong Liu, Hongsong Li, Maarten de Rijke
First, contract elements are far more fine-grained than named entities, which hinders the transfer of extractors.
1 code implementation • 22 Apr 2021 • Runlong Yu, Yuyang Ye, Qi Liu, Zihan Wang, Chunfeng Yang, Yucheng Hu, Enhong Chen
Motivated by this, we propose a novel Extreme Cross Network, abbreviated XCrossNet, which aims at learning dense and sparse feature interactions in an explicit manner.
Ranked #22 on Click-Through Rate Prediction on Criteo
1 code implementation • 18 Apr 2021 • Zihan Wang, Chengyu Dong, Jingbo Shang
In this paper, we present an empirical property of these representations -- "average" approximates "first principal component".
no code implementations • 6 Nov 2020 • Zhendong Ai, Zihan Wang, Wei Cui
The ECG monitoring device, abbreviated as ECGM, is designed around a ferroelectric microprocessor that provides ultra-low power consumption, and contains four parts: MCU, BLE, Sensors and Power.
3 code implementations • NAACL 2021 • Zihan Wang, Dheeraj Mekala, Jingbo Shang
Finally, we pick the most confident documents from each cluster to train a text classifier.
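The selection step described above can be sketched as a per-cluster top-k filter. This is a minimal illustration under assumptions: the function name `most_confident_per_cluster`, the `(doc_id, cluster_id, confidence)` tuple layout, and the `top_k` cutoff are hypothetical and not taken from the paper.

```python
from collections import defaultdict
from typing import Dict, Hashable, Iterable, List, Tuple


def most_confident_per_cluster(
    assignments: Iterable[Tuple[str, Hashable, float]],
    top_k: int,
) -> Dict[Hashable, List[str]]:
    """Keep the top_k highest-confidence document ids in each cluster.

    assignments holds (doc_id, cluster_id, confidence) triples.
    """
    by_cluster: Dict[Hashable, List[Tuple[float, str]]] = defaultdict(list)
    for doc_id, cluster_id, confidence in assignments:
        by_cluster[cluster_id].append((confidence, doc_id))

    selected: Dict[Hashable, List[str]] = {}
    for cluster_id, scored in by_cluster.items():
        # Sort by confidence only, highest first.
        scored.sort(key=lambda pair: pair[0], reverse=True)
        selected[cluster_id] = [doc_id for _, doc_id in scored[:top_k]]
    return selected


# Toy cluster assignments with made-up confidence scores.
assignments = [
    ("d1", 0, 0.90),
    ("d2", 0, 0.40),
    ("d3", 0, 0.70),
    ("d4", 1, 0.80),
    ("d5", 1, 0.95),
]
training_docs = most_confident_per_cluster(assignments, top_k=2)
```

The selected documents, paired with their cluster ids as pseudo-labels, would then form the training set for the classifier.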
no code implementations • 10 Sep 2020 • Sarah E. Finch, James D. Finch, Ali Ahmadvand, Ingyu Choi, Xiangjue Dong, Ruixiang Qi, Harshita Sahijwani, Sergey Volokhin, Zihan Wang, ZiHao Wang, Jinho D. Choi
Inspired by studies on the overwhelming presence of experience-sharing in human-human conversations, Emora, the social chatbot developed by Emory University, aims to bring such experience-focused interaction to the current field of conversational AI.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Zihan Wang, Karthikeyan K, Stephen Mayhew, Dan Roth
Multilingual BERT (M-BERT) has been a huge success in both supervised and zero-shot cross-lingual transfer learning.
no code implementations • ICLR 2020 • Karthikeyan K, Zihan Wang, Stephen Mayhew, Dan Roth
Recent work has exhibited the surprising cross-lingual abilities of multilingual BERT (M-BERT) -- surprising since it is trained without any cross-lingual objective and with no aligned data.
no code implementations • 11 Nov 2019 • Yunan Zhang, Xiang Cheng, Yufeng Zhang, Zihan Wang, Zhengqi Fang, Xiaoyan Wang, Zhenya Huang, ChengXiang Zhai
Answering complex questions involving multiple entities and relations is a challenging task.
1 code implementation • IJCNLP 2019 • Zihan Wang, Jingbo Shang, Liyuan Liu, Lihao Lu, Jiacheng Liu, Jiawei Han
Therefore, we manually correct these label mistakes and form a cleaner test set.
Ranked #3 on Named Entity Recognition (NER) on CoNLL++ (using extra training data)
1 code implementation • 20 Aug 2019 • Yu Meng, Jiaxin Huang, Guangyuan Wang, Zihan Wang, Chao Zhang, Yu Zhang, Jiawei Han
We propose a new task, discriminative topic mining, which leverages a set of user-provided category names to mine discriminative topics from text corpora.
1 code implementation • 14 Aug 2019 • Liyuan Liu, Zihan Wang, Jingbo Shang, Dandong Yin, Heng Ji, Xiang Ren, Shaowen Wang, Jiawei Han
Our model neither requires the conversion from character sequences to word sequences, nor assumes that the tokenizer can correctly detect all word boundaries.
no code implementations • 11 May 2019 • Zihan Wang, Yaoguang Li, Wei Cui
By applying various existing learning methods to our ECG dataset, we find that current methods, which support the identification of individuals at rest well, do not deliver satisfactory ECGID performance during exercise, exposing the deficiency of existing ECG identification methods.
no code implementations • 11 Oct 2018 • Homanga Bharadhwaj, Zihan Wang, Yoshua Bengio, Liam Paull
Learning effective visuomotor policies for robots purely from data is challenging, but also appealing since a learning-based system should not require manual tuning or calibration.