Search Results for author: Zihan Wang

Found 90 papers, 51 papers with code

Learning Adaptive Axis Attentions in Fine-tuning: Beyond Fixed Sparse Attention Patterns

no code implementations • Findings (ACL) 2022 • Zihan Wang, Jiuxiang Gu, Jason Kuen, Handong Zhao, Vlad Morariu, Ruiyi Zhang, Ani Nenkova, Tong Sun, Jingbo Shang

We present a comprehensive study of sparse attention patterns in Transformer models.

“Average” Approximates “First Principal Component”? An Empirical Analysis on Representations from Neural Language Models

no code implementations • EMNLP 2021 • Zihan Wang, Chengyu Dong, Jingbo Shang

In this paper, we present an empirical property of these representations: “average” approximates “first principal component”.

Tele-FLM Technical Report

no code implementations • 25 Apr 2024 • Xiang Li, Yiqun Yao, Xin Jiang, Xuezhi Fang, Chao Wang, Xinzhang Liu, Zihan Wang, Yu Zhao, Xin Wang, Yuyao Huang, Shuangyong Song, Yongxiang Li, Zheng Zhang, Bo Zhao, Aixin Sun, Yequan Wang, Zhongjiang He, Zhongyuan Wang, Xuelong Li, Tiejun Huang

Large language models (LLMs) have showcased profound capabilities in language understanding and generation, facilitating a wide array of applications.

Learn from Failure: Fine-Tuning LLMs with Trial-and-Error Data for Intuitionistic Propositional Logic Proving

no code implementations • 10 Apr 2024 • Chenyang An, Zhibo Chen, Qihao Ye, Emily First, Letian Peng, Jiayun Zhang, Zihan Wang, Sorin Lerner, Jingbo Shang

Recent advances in Automated Theorem Proving have shown the effectiveness of leveraging a (large) language model that generates tactics (i.e., proof steps) to search through proof states.

Automated Theorem Proving Language Modelling +1

Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition

1 code implementation • 9 Apr 2024 • Zihan Wang, Siyang Song, Cheng Luo, Songhe Deng, Weicheng Xie, Linlin Shen

Human facial action units (AUs) are mutually related in a hierarchical manner: not only are they associated with each other in both spatial and temporal domains, but AUs located in the same or nearby facial regions also show stronger relationships than those in different facial regions.

AirShot: Efficient Few-Shot Detection for Autonomous Exploration

2 code implementations • 7 Apr 2024 • Zihan Wang, Bowen Li, Chen Wang, Sebastian Scherer

Few-shot object detection has drawn increasing attention in the field of robotic exploration, where robots are required to find unseen objects with a few online provided examples.

Few-Shot Object Detection object-detection

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

1 code implementation • 3 Apr 2024 • Yifan Xu, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Wenyi Zhao, Jie Tang, Yuxiao Dong

Large language models (LLMs) have shown excellent mastering of human language, but still struggle in real-world applications that require mathematical problem-solving.

Math

Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation

1 code implementation • 2 Apr 2024 • Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Junjie Hu, Ming Jiang, Shuqiang Jiang

Vision-and-language navigation (VLN) enables the agent to navigate to a remote location following the natural language instruction in 3D environments.

Navigate Vision and Language Navigation +1

MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks

1 code implementation • 30 Mar 2024 • Letian Peng, Zilong Wang, Feng Yao, Zihan Wang, Jingbo Shang

We construct the distillation dataset via sampling sentences from language model pre-training datasets (e.g., OpenWebText in our implementation) and prompting an LLM to identify the typed spans of "important information".

Language Modelling named-entity-recognition +2

Is Mamba Effective for Time Series Forecasting?

1 code implementation • 17 Mar 2024 • Zihan Wang, Fanheng Kong, Shi Feng, Ming Wang, Han Zhao, Daling Wang, Yifei Zhang

Furthermore, we conduct extensive experiments to delve deeper into the potential of Mamba compared to the Transformer in the TSF.

Time Series Time Series Forecasting

Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization

no code implementations • 16 Mar 2024 • Zihan Wang, Jiayu Xiao, Mengxiang Li, Zhongjiang He, Yongxiang Li, Chao Wang, Shuangyong Song

In our dynamic world where data arrives in a continuous stream, continual learning enables us to incrementally add new tasks/domains without the need to retrain from scratch.

Continual Learning Data Augmentation +1

Learning with Noisy Foundation Models

no code implementations • 11 Mar 2024 • Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

Foundation models are usually pre-trained on large-scale datasets and then adapted to downstream tasks through tuning.

Utilizing Local Hierarchy with Adversarial Training for Hierarchical Text Classification

1 code implementation • 29 Feb 2024 • Zihan Wang, Peiyi Wang, Houfeng Wang

Hierarchical text classification (HTC) is a challenging subtask of multi-label classification due to its complex taxonomic structure.

Multi-Label Classification text-classification +1

VN Network: Embedding Newly Emerging Entities with Virtual Neighbors

no code implementations • 21 Feb 2024 • Yongquan He, Zihan Wang, Peng Zhang, Zhaopeng Tu, Zhaochun Ren

To address this issue, recent works apply the graph neural network on the existing neighbors of the unseen entities.

Knowledge Graph Completion Network Embedding

Answer is All You Need: Instruction-following Text Embedding via Answering the Question

1 code implementation • 15 Feb 2024 • Letian Peng, Yuwei Zhang, Zilong Wang, Jayanth Srinivasa, Gaowen Liu, Zihan Wang, Jingbo Shang

This work aims to build a text embedder that can capture characteristics of texts specified by user instructions.

abstractive question answering Instruction Following +1

Data Reconstruction Attacks and Defenses: A Systematic Evaluation

no code implementations • 13 Feb 2024 • Sheng Liu, Zihan Wang, Qi Lei

In this work, we propose a strong reconstruction attack in the setting of federated learning.

Federated Learning Reconstruction Attack

Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision

no code implementations • 5 Feb 2024 • Zihan Wang, Yunxuan Li, Yuexin Wu, Liangchen Luo, Le Hou, Hongkun Yu, Jingbo Shang

Process supervision, using a trained verifier to evaluate the intermediate steps generated by a reasoner, has demonstrated significant improvements in multi-step problem solving.

GSM8K Math

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning

1 code implementation • 15 Jan 2024 • Dan Zhang, Ziniu Hu, Sining Zhoubian, Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, Jie Tang

To bridge these gaps, we introduce SciGLM, a suite of scientific language models able to conduct college-level scientific reasoning.

Math Mathematical Reasoning

Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization

no code implementations • 13 Jan 2024 • Mengtian Li, Shaohui Lin, Zihan Wang, Yunhang Shen, Baochang Zhang, Lizhuang Ma

Semi-supervised learning (SSL), thanks to the significant reduction of data annotation costs, has been an active research topic for large-scale 3D scene understanding.

Pseudo Label Representation Learning +2

TeleChat Technical Report

no code implementations • 8 Jan 2024 • Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang, Bingkai Yang, Xuewei He, Zhuoru Jiang, Qiyi Xie, Yanhan Zhang, Zhongqiu Li, Lingling Shi, Weiwei Fu, Yin Zhang, Zilu Huang, Sishi Xiong, Yuxiang Zhang, Chao Wang, Shuangyong Song

Subsequently, the model undergoes fine-tuning to align with human preferences, following a detailed methodology that we describe.

Code Generation Question Answering

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

2 code implementations • 19 Dec 2023 • Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

They endow Large Language Models (LLMs) with powerful capabilities in visual understanding, enabling them to tackle diverse multi-modal tasks.

Visual Reasoning

CogAgent: A Visual Language Model for GUI Agents

1 code implementation • 14 Dec 2023 • Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxuan Zhang, Juanzi Li, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang

People are spending an enormous amount of time on digital devices through graphical user interfaces (GUIs), e.g., computer or smartphone screens.

Ranked #14 on Visual Question Answering on MM-Vet

Language Modelling Visual Question Answering

Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning

1 code implementation • 10 Dec 2023 • Yougang Lyu, Jitai Hao, Zihan Wang, Kai Zhao, Shen Gao, Pengjie Ren, Zhumin Chen, Fang Wang, Zhaochun Ren

Multiple defendants in a criminal fact description generally exhibit complex interactions, and cannot be well handled by existing Legal Judgment Prediction (LJP) methods which focus on predicting judgment results (e.g., law articles, charges, and terms of penalty) for single-defendant cases.

Less than One-shot: Named Entity Recognition via Extremely Weak Supervision

1 code implementation • 6 Nov 2023 • Letian Peng, Zihan Wang, Jingbo Shang

We study the named entity recognition (NER) problem under the extremely weak supervision (XWS) setting, where only one example entity per type is given in a context-free way.

named-entity-recognition Named Entity Recognition +1

EmojiLM: Modeling the New Emoji Language

1 code implementation • 3 Nov 2023 • Letian Peng, Zilong Wang, Hang Liu, Zihan Wang, Jingbo Shang

With the rapid development of the internet, online social media welcomes people with different backgrounds through its diverse content.

Language Modelling Large Language Model

Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback

no code implementations • 31 Oct 2023 • Max Balsells, Marcel Torne, Zihan Wang, Samedh Desai, Pulkit Agrawal, Abhishek Gupta

We evaluate this system on a suite of robotic tasks in simulation and demonstrate its effectiveness at learning behaviors both in simulation and the real world.

reinforcement-learning Self-Supervised Learning

ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation

no code implementations • 26 Oct 2023 • Zi Lin, Zihan Wang, Yongqi Tong, Yangkun Wang, Yuxin Guo, Yujia Wang, Jingbo Shang

This benchmark contains the rich, nuanced phenomena that can be tricky for current toxicity detection models to identify, revealing a significant domain difference compared to social media content.

Chatbot

Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing

no code implementations • 20 Oct 2023 • Xinyu Hu, Pengfei Tang, Simiao Zuo, Zihan Wang, Bowen Song, Qiang Lou, Jian Jiao, Denis Charles

In Evoke, there are two instances of the same LLM: one acts as a reviewer (LLM-Reviewer) and scores the current prompt; the other acts as an author (LLM-Author) and edits the prompt by considering the edit history and the reviewer's feedback.

Logical Fallacy Detection

Generalizing Few-Shot Named Entity Recognizers to Unseen Domains with Type-Related Features

1 code implementation • 15 Oct 2023 • Zihan Wang, Ziqi Zhao, Zhumin Chen, Pengjie Ren, Maarten de Rijke, Zhaochun Ren

To address this limitation, recent studies enable generalization to an unseen target domain with only a few labeled examples using data augmentation techniques.

Data Augmentation few-shot-ner +5

Misusing Tools in Large Language Models With Visual Adversarial Examples

1 code implementation • 4 Oct 2023 • Xiaohan Fu, Zihan Wang, Shuheng Li, Rajesh K. Gupta, Niloofar Mireshghallah, Taylor Berg-Kirkpatrick, Earlence Fernandes

Large Language Models (LLMs) are being enhanced with the ability to use tools and to process multiple modalities.

SSIM

Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models

no code implementations • 4 Oct 2023 • An Yan, Yu Wang, Yiwu Zhong, Zexue He, Petros Karypis, Zihan Wang, Chengyu Dong, Amilcare Gentili, Chun-Nan Hsu, Jingbo Shang, Julian McAuley

Medical image classification is a critical problem for healthcare, with the potential to alleviate the workload of doctors and facilitate diagnoses of patients.

Image Classification Language Modelling +1

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

1 code implementation • 19 Sep 2023 • Xingyao Wang, Zihan Wang, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng, Heng Ji

However, current evaluation protocols often emphasize benchmark performance with single-turn exchanges, neglecting the nuanced interactions among the user, LLMs, and external tools, while also underestimating the importance of natural language feedback from users.

Decision Making

GridMM: Grid Memory Map for Vision-and-Language Navigation

1 code implementation • ICCV 2023 • Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Shuqiang Jiang

Vision-and-language navigation (VLN) enables the agent to navigate to a remote location following the natural language instruction in 3D environments.

Navigate Vision and Language Navigation

Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback

1 code implementation • 20 Jul 2023 • Marcel Torne, Max Balsells, Zihan Wang, Samedh Desai, Tao Chen, Pulkit Agrawal, Abhishek Gupta

This procedure can leverage noisy, asynchronous human feedback to learn policies with no hand-crafted reward design or exploration bonuses.

Decision Making reinforcement-learning +1

BPKD: Boundary Privileged Knowledge Distillation For Semantic Segmentation

1 code implementation • 13 Jun 2023 • Liyang Liu, Zihan Wang, Minh Hieu Phan, BoWen Zhang, Jinchao Ge, Yifan Liu

Current knowledge distillation approaches in semantic segmentation tend to adopt a holistic approach that treats all spatial locations equally.

Knowledge Distillation Segmentation +1

RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit

1 code implementation • 8 Jun 2023 • Jiongnan Liu, Jiajie Jin, Zihan Wang, Jiehan Cheng, Zhicheng Dou, Ji-Rong Wen

To support research in this area and facilitate the development of retrieval-augmented LLM systems, we develop RETA-LLM, a RETrieval-Augmented LLM toolkit.

Answer Generation Fact Checking +5

Implicit bias of SGD in $L_{2}$-regularized linear DNNs: One-way jumps from high to low rank

no code implementations • 25 May 2023 • Zihan Wang, Arthur Jacot

The $L_{2}$-regularized loss of Deep Linear Networks (DLNs) with more than one hidden layer has multiple local minima, corresponding to matrices with different ranks.

Matrix Completion

Deep Neural Networks in Video Human Action Recognition: A Review

no code implementations • 25 May 2023 • Zihan Wang, Yang Yang, Zhi Liu, Yifan Zheng

Our current related research addresses multiple novel proposed research works and compares their advantages and disadvantages between the derived deep learning frameworks rather than machine learning frameworks.

Action Recognition Optical Flow Estimation +1

ClusterLLM: Large Language Models as a Guide for Text Clustering

1 code implementation • 24 May 2023 • Yuwei Zhang, Zihan Wang, Jingbo Shang

First, we prompt ChatGPT for insights on clustering perspective by constructing hard triplet questions <does A better correspond to B than C>, where A, B and C are similar data points that belong to different clusters according to a small embedder.

Clustering Language Modelling +2

Debiasing Made State-of-the-art: Revisiting the Simple Seed-based Weak Supervision for Text Classification

1 code implementation • 24 May 2023 • Chengyu Dong, Zihan Wang, Jingbo Shang

We show that the limited performance of seed matching is largely due to the label bias injected by the simple seed-match rule, which prevents the classifier from learning reliable confidence for selecting high-quality pseudo-labels.

text-classification Text Classification

Goal-Driven Explainable Clustering via Language Descriptions

1 code implementation • 23 May 2023 • Zihan Wang, Jingbo Shang, Ruiqi Zhong

We propose a new task formulation, "Goal-Driven Clustering with Explanations" (GoalEx), which represents both the goal and the explanations as free-form language descriptions.

Clustering Language Modelling

A Benchmark on Extremely Weakly Supervised Text Classification: Reconcile Seed Matching and Prompting Approaches

1 code implementation • 22 May 2023 • Zihan Wang, Tianle Wang, Dheeraj Mekala, Jingbo Shang

Extremely Weakly Supervised Text Classification (XWS-TC) refers to text classification based on minimal high-level human guidance, such as a few label-indicative seed words or classification instructions.

Benchmarking text-classification +1

WOT-Class: Weakly Supervised Open-world Text Classification

1 code implementation • 21 May 2023 • Tianle Wang, Zihan Wang, Weitang Liu, Jingbo Shang

State-of-the-art weakly supervised text classification methods, while significantly reducing the required human supervision, still require the supervision to cover all the classes of interest.

Image Classification text-classification +1

Iteratively Learning Representations for Unseen Entities with Inter-Rule Correlations

1 code implementation • 17 May 2023 • Zihan Wang, Kai Zhao, Yongquan He, Zhumin Chen, Pengjie Ren, Maarten de Rijke, Zhaochun Ren

Recent work on knowledge graph completion (KGC) focused on learning embeddings of entities and relations in knowledge graphs.

Link Prediction Triple Classification

Discreetly Exploiting Inter-session Information for Session-based Recommendation

no code implementations • 18 Apr 2023 • Zihan Wang, Gang Wu, Haotong Wang

First, inter-session dependencies are not differentiated at the factor-level.

Session-Based Recommendations

Dual-Granularity Contrastive Learning for Session-based Recommendation

no code implementations • 18 Apr 2023 • Zihan Wang, Gang Wu, Haotong Wang

At factor-level, we employ Disentangled Representation Learning to obtain finer-grained data (e.g., factor-level embeddings), with which we can construct factor-level convolution channels.

Contrastive Learning Data Augmentation +2

CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X

2 code implementations • 30 Mar 2023 • Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang

Large pre-trained code generation models, such as OpenAI Codex, can generate syntax- and function-correct code, making the coding of programmers more productive and our pursuit of artificial general intelligence closer.

Ranked #81 on Code Generation on MBPP

Code Generation

KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation

1 code implementation • CVPR 2023 • Xiangyang Li, Zihan Wang, Jiahao Yang, YaoWei Wang, Shuqiang Jiang

The proposed KERM can automatically select and gather crucial and relevant cues, obtaining more accurate action prediction.

Navigate Vision and Language Navigation

Spatio-Temporal AU Relational Graph Representation Learning For Facial Action Units Detection

1 code implementation • 19 Mar 2023 • Zihan Wang, Siyang Song, Cheng Luo, Yuzhi Zhou, Shiling Wu, Weicheng Xie, Linlin Shen

This paper presents our Facial Action Units (AUs) detection submission to the fifth Affective Behavior Analysis in-the-wild Competition (ABAW).

Graph Learning Graph Representation Learning

PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection

1 code implementation • CVPR 2023 • Anthony Chen, Kevin Zhang, Renrui Zhang, Zihan Wang, Yuheng Lu, Yandong Guo, Shanghang Zhang

Masked Autoencoders learn strong visual representations and achieve state-of-the-art results in several independent modalities, yet very few works have addressed their capabilities in multi-modality settings.

3D Object Detection object-detection +2

Guiding Pretraining in Reinforcement Learning with Large Language Models

1 code implementation • 13 Feb 2023 • Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas

Reinforcement learning algorithms typically struggle in the absence of a dense, well-shaped reward function.

Common Sense Reasoning Language Modelling +2

Esports Data-to-commentary Generation on Large-scale Data-to-text Dataset

no code implementations • 21 Dec 2022 • Zihan Wang, Naoki Yoshinaga

Therefore, in this study, we introduce a task of generating game commentaries from structured data records to address the problem.

Reconstructing Training Data from Model Gradient, Provably

no code implementations • 7 Dec 2022 • Zihan Wang, Jason D. Lee, Qi Lei

Understanding when and how much a model gradient leaks information about the training sample is an important question in privacy.

Federated Learning Tensor Decomposition

Improving ECG-based COVID-19 diagnosis and mortality predictions using pre-pandemic medical records at population-scale

no code implementations • 14 Nov 2022 • Weijie Sun, Sunil Vasu Kalmady, Nariman Sepehrvand, Luan Manh Chu, Zihan Wang, Amir Salimi, Abram Hindle, Russell Greiner, Padma Kaul

Pandemic outbreaks such as COVID-19 occur unexpectedly, and need immediate action due to their potential devastating consequences on global health.

COVID-19 Diagnosis Transfer Learning

Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search

no code implementations • 31 Oct 2022 • Zihan Wang, Qi Meng, HaiFeng Lan, Xinrui Zhang, Kehao Guo, Akshat Gupta

While Speech Emotion Recognition (SER) is a common application for popular languages, it continues to be a problem for low-resourced languages, i.e., languages with no pretrained speech-to-text recognition models.

Neural Architecture Search Speech Emotion Recognition

GLM-130B: An Open Bilingual Pre-trained Model

10 code implementations • 5 Oct 2022 • Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, WenGuang Chen, Peng Zhang, Yuxiao Dong, Jie Tang

We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model with 130 billion parameters.

Ranked #1 on Language Modelling on CLUE (OCNLI_50K)

Language Modelling Long-Context Understanding +2

WavSpA: Wavelet Space Attention for Boosting Transformers' Long Sequence Learning Ability

no code implementations • 5 Oct 2022 • Yufan Zhuang, Zihan Wang, Fangbo Tao, Jingbo Shang

Recent works show that learning attention in the Fourier space can improve the long sequence learning capability of Transformers.

Masked Imitation Learning: Discovering Environment-Invariant Modalities in Multimodal Demonstrations

no code implementations • 16 Sep 2022 • Yilun Hao, Ruinan Wang, Zhangjie Cao, Zihan Wang, Yuchen Cui, Dorsa Sadigh

Specifically, we design a masked policy network with a binary mask to block certain modalities.

Imitation Learning

M^4I: Multi-modal Models Membership Inference

1 code implementation • 15 Sep 2022 • Pingyi Hu, Zihan Wang, Ruoxi Sun, Hu Wang, Minhui Xue

To achieve this, we propose Multi-modal Models Membership Inference (M^4I) with two attack methods to infer the membership status, named metric-based (MB) M^4I and feature-based (FB) M^4I, respectively.

Image Captioning Inference Attack +2

CV 3315 Is All You Need : Semantic Segmentation Competition

1 code implementation • 25 Jun 2022 • Akide Liu, Zihan Wang

This competition focuses on Urban-Sense Segmentation based on the vehicle camera view.

Segmentation Semantic Segmentation

Debiasing Learning for Membership Inference Attacks Against Recommender Systems

1 code implementation • 24 Jun 2022 • Zihan Wang, Na Huang, Fei Sun, Pengjie Ren, Zhumin Chen, Hengliang Luo, Maarten de Rijke, Zhaochun Ren

To address the above limitations, we propose a Debiasing Learning for Membership Inference Attacks against recommender systems (DL-MIA) framework that has four main components: (1) a difference vector generator, (2) a disentangled encoder, (3) a weight estimator, and (4) an attack model.

Recommendation Systems

SPGNet: Spatial Projection Guided 3D Human Pose Estimation in Low Dimensional Space

no code implementations • 4 Jun 2022 • Zihan Wang, Ruimin Chen, Mengxuan Liu, Guanfang Dong, Anup Basu

We propose a method SPGNet for 3D human pose estimation that mixes multi-dimensional re-projection into supervised learning.

Ranked #46 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation Position

Rethinking the Setting of Semi-supervised Learning on Graphs

1 code implementation • 28 May 2022 • Ziang Li, Ming Ding, Weikai Li, Zihan Wang, Ziyu Zeng, Yukuo Cen, Jie Tang

graph benchmark (IGB) consisting of 4 datasets.

WeDef: Weakly Supervised Backdoor Defense for Text Classification

no code implementations • 24 May 2022 • Lesheng Jin, Zihan Wang, Jingbo Shang

Inspired by this observation, in WeDef, we define the reliability of samples based on whether the predictions of the weak classifier agree with their labels in the poisoned training set.

backdoor defense text-classification +1

Formulating Few-shot Fine-tuning Towards Language Model Pre-training: A Pilot Study on Named Entity Recognition

1 code implementation • 24 May 2022 • Zihan Wang, Kewen Zhao, Zilong Wang, Jingbo Shang

Fine-tuning pre-trained language models has recently become a common practice in building NLP models for various tasks, especially few-shot tasks.

Few-shot NER Language Modelling +2

Beyond the Granularity: Multi-Perspective Dialogue Collaborative Selection for Dialogue State Tracking

1 code implementation • ACL 2022 • Jinyu Guo, Kai Shuang, Jijie Li, Zihan Wang, Yixuan Liu

However, no matter how the dialogue history is used, each existing model uses its own consistent dialogue history during the entire state tracking process, regardless of which slot is updated.

Dialogue State Tracking

Effectively Using Long and Short Sessions for Multi-Session-based Recommendations

no code implementations • 9 May 2022 • Zihan Wang, Gang Wu, Yan Wang

The RNN often used in previous work is not suitable to process short sessions, because RNN only focuses on the sequential relationship, which we find is not the only relationship between items in short sessions.

Session-Based Recommendations

HPT: Hierarchy-aware Prompt Tuning for Hierarchical Text Classification

1 code implementation • 28 Apr 2022 • Zihan Wang, Peiyi Wang, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui, Houfeng Wang

However, in this paradigm, there exists a huge gap between the classification tasks with sophisticated label hierarchy and the masked language model (MLM) pretraining tasks of PLMs, and thus the potential of PLMs cannot be fully tapped.

Language Modelling Multi-Label Classification +2

Incorporating Hierarchy into Text Encoder: a Contrastive Learning Approach for Hierarchical Text Classification

1 code implementation • ACL 2022 • Zihan Wang, Peiyi Wang, Lianzhe Huang, Xin Sun, Houfeng Wang

Hierarchical text classification is a challenging subtask of multi-label classification due to its complex label hierarchy.

Contrastive Learning Multi-Label Classification +2

Weakly Supervised Correspondence Learning

no code implementations • 2 Mar 2022 • Zihan Wang, Zhangjie Cao, Yilun Hao, Dorsa Sadigh

Correspondence learning is a fundamental problem in robotics, which aims to learn a mapping between state, action pairs of agents of different dynamics or embodiments.

Learning from Imperfect Demonstrations via Adversarial Confidence Transfer

no code implementations • 7 Feb 2022 • Zhangjie Cao, Zihan Wang, Dorsa Sadigh

Existing learning from demonstration algorithms usually assume access to expert demonstrations.

An Interactive Visualization Tool for Understanding Active Learning

1 code implementation • 9 Nov 2021 • Zihan Wang, Jialin Lu, Oliver Snow, Martin Ester

Despite recent progress in artificial intelligence and machine learning, many state-of-the-art methods suffer from a lack of explainability and transparency.

Active Learning BIG-bench Machine Learning

Membership Inference Attacks Against Recommender Systems

1 code implementation • 16 Sep 2021 • Minxing Zhang, Zhaochun Ren, Zihan Wang, Pengjie Ren, Zhumin Chen, Pengfei Hu, Yang Zhang

In this paper, we make the first attempt at quantifying the privacy leakage of recommender systems through the lens of membership inference.

Recommendation Systems

Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking

1 code implementation • ACL 2021 • Jinyu Guo, Kai Shuang, Jijie Li, Zihan Wang

However, the overwhelming majority of the slots in each turn should simply inherit the slot values from the previous turn.

Dialogue State Tracking

Data Hiding with Deep Learning: A Survey Unifying Digital Watermarking and Steganography

no code implementations • 20 Jul 2021 • Zihan Wang, Olivia Byrnes, Hu Wang, Ruoxi Sun, Congbo Ma, Huaming Chen, Qi Wu, Minhui Xue

The advancement of secure communication and identity verification fields has significantly increased through the use of deep learning techniques for data hiding.

UCPhrase: Unsupervised Context-aware Quality Phrase Tagging

2 code implementations • 28 May 2021 • Xiaotao Gu, Zihan Wang, Zhenyu Bi, Yu Meng, Liyuan Liu, Jiawei Han, Jingbo Shang

Training a conventional neural tagger based on silver labels usually faces the risk of overfitting phrase surface names.

Ranked #1 on Phrase Tagging on KPTimes

Keyphrase Extraction Language Modelling +3

Cross-Domain Contract Element Extraction with a Bi-directional Feedback Clause-Element Relation Network

no code implementations • 13 May 2021 • Zihan Wang, Hongye Song, Zhaochun Ren, Pengjie Ren, Zhumin Chen, Xiaozhong Liu, Hongsong Li, Maarten de Rijke

First, contract elements are far more fine-grained than named entities, which hinders the transfer of extractors.

Cross-Domain Named Entity Recognition named-entity-recognition +4

XCrossNet: Feature Structure-Oriented Learning for Click-Through Rate Prediction

1 code implementation • 22 Apr 2021 • Runlong Yu, Yuyang Ye, Qi Liu, Zihan Wang, Chunfeng Yang, Yucheng Hu, Enhong Chen

Motivated by this, we propose a novel Extreme Cross Network, abbreviated XCrossNet, which aims at learning dense and sparse feature interactions in an explicit manner.

Ranked #22 on Click-Through Rate Prediction on Criteo

Click-Through Rate Prediction Feature Engineering +1

Paper
Code

"Average" Approximates "First Principal Component"? An Empirical Analysis on Representations from Neural Language Models

1 code implementation • 18 Apr 2021 • Zihan Wang, Chengyu Dong, Jingbo Shang

In this paper, we present an empirical property of these representations -- "average" approximates "first principal component".

Paper
Code

Low-Power Wireless Wearable ECG Monitoring Chestbelt Based on Ferroelectric Microprocessor

no code implementations • 6 Nov 2020 • Zhendong Ai, Zihan Wang, Wei Cui

The ECG monitoring device, abbreviated as ECGM, is designed around a ferroelectric microprocessor, which provides ultra-low power consumption, and contains four parts: MCU, BLE, sensors, and power.

Paper
Add Code

X-Class: Text Classification with Extremely Weak Supervision

3 code implementations • NAACL 2021 • Zihan Wang, Dheeraj Mekala, Jingbo Shang

Finally, we pick the most confident documents from each cluster to train a text classifier.

Clustering General Classification +3

Paper
Code

Emora: An Inquisitive Social Chatbot Who Cares For You

no code implementations • 10 Sep 2020 • Sarah E. Finch, James D. Finch, Ali Ahmadvand, Ingyu Choi, Xiangjue Dong, Ruixiang Qi, Harshita Sahijwani, Sergey Volokhin, Zihan Wang, ZiHao Wang, Jinho D. Choi

Inspired by studies on the overwhelming presence of experience-sharing in human-human conversations, Emora, the social chatbot developed by Emory University, aims to bring such experience-focused interaction to the current field of conversational AI.

Chatbot intent-classification +1

Paper
Add Code

Extending Multilingual BERT to Low-Resource Languages

no code implementations • Findings of the Association for Computational Linguistics 2020 • Zihan Wang, Karthikeyan K, Stephen Mayhew, Dan Roth

Multilingual BERT (M-BERT) has been a huge success in both supervised and zero-shot cross-lingual transfer learning.

named-entity-recognition Named Entity Recognition +3

Paper
Add Code

Cross-Lingual Ability of Multilingual BERT: An Empirical Study

no code implementations • ICLR 2020 • Karthikeyan K, Zihan Wang, Stephen Mayhew, Dan Roth

Recent work has exhibited the surprising cross-lingual abilities of multilingual BERT (M-BERT) -- surprising since it is trained without any cross-lingual objective and with no aligned data.

named-entity-recognition Named Entity Recognition +2

Paper
Add Code

Learning to Order Sub-questions for Complex Question Answering

no code implementations • 11 Nov 2019 • Yunan Zhang, Xiang Cheng, Yufeng Zhang, Zihan Wang, Zhengqi Fang, Xiaoyan Wang, Zhenya Huang, ChengXiang Zhai

Answering complex questions involving multiple entities and relations is a challenging task.

Question Answering Reinforcement Learning (RL)

Paper
Add Code

CrossWeigh: Training Named Entity Tagger from Imperfect Annotations

1 code implementation • IJCNLP 2019 • Zihan Wang, Jingbo Shang, Liyuan Liu, Lihao Lu, Jiacheng Liu, Jiawei Han

Therefore, we manually correct these label mistakes and form a cleaner test set.

Ranked #3 on Named Entity Recognition (NER) on CoNLL++ (using extra training data)

named-entity-recognition Named Entity Recognition +1

Paper
Code

Discriminative Topic Mining via Category-Name Guided Text Embedding

1 code implementation • 20 Aug 2019 • Yu Meng, Jiaxin Huang, Guangyuan Wang, Zihan Wang, Chao Zhang, Yu Zhang, Jiawei Han

We propose a new task, discriminative topic mining, which leverages a set of user-provided category names to mine discriminative topics from text corpora.

Document Classification General Classification +3

Paper
Code

Raw-to-End Name Entity Recognition in Social Media

1 code implementation • 14 Aug 2019 • Liyuan Liu, Zihan Wang, Jingbo Shang, Dandong Yin, Heng Ji, Xiang Ren, Shaowen Wang, Jiawei Han

Our model neither requires the conversion from character sequences to word sequences, nor assumes that the tokenizer can correctly detect all word boundaries.

named-entity-recognition Named Entity Recognition +1

Paper
Code

ECG Identification under Exercise and Rest Situations via Various Learning Methods

no code implementations • 11 May 2019 • Zihan Wang, Yaoguang Li, Wei Cui

By applying various existing learning methods to our ECG dataset, we find that current methods, which identify individuals well at rest, do not achieve satisfactory ECGID performance under exercise, exposing a deficiency of existing ECG identification methods.

Paper
Add Code

A Data-Efficient Framework for Training and Sim-to-Real Transfer of Navigation Policies

no code implementations • 11 Oct 2018 • Homanga Bharadhwaj, Zihan Wang, Yoshua Bengio, Liam Paull

Learning effective visuomotor policies for robots purely from data is challenging, but also appealing since a learning-based system should not require manual tuning or calibration.

Meta-Learning

Paper
Add Code
