Search Results for author: JianGuo Zhang

Found 40 papers, 18 papers with code

Are Pre-trained Transformers Robust in Intent Classification? A Missing Ingredient in Evaluation of Out-of-Scope Intent Detection

no code implementations • NLP4ConvAI (ACL) 2022 • JianGuo Zhang, Kazuma Hashimoto, Yao Wan, Zhiwei Liu, Ye Liu, Caiming Xiong, Philip Yu

Pre-trained Transformer-based models were reported to be robust in intent classification.

intent-classification Intent Classification +2

Paper
Add Code

Gradient-Guided Modality Decoupling for Missing-Modality Robustness

1 code implementation • 26 Feb 2024 • Hao Wang, Shengda Luo, Guosheng Hu, JianGuo Zhang

In aid of this indicator, we present a novel Gradient-guided Modality Decoupling (GMD) method to decouple the dependency on dominating modalities.

Sentiment Analysis

Paper
Code

AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System

1 code implementation • 23 Feb 2024 • Zhiwei Liu, Weiran Yao, JianGuo Zhang, Liangwei Yang, Zuxin Liu, Juntao Tan, Prafulla K. Choubey, Tian Lan, Jason Wu, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese

Thus, we open-source a new AI agent library, AgentLite, which simplifies this process by offering a lightweight, user-friendly platform for innovating LLM agent reasoning, architectures, and applications with ease.

275

Paper
Code

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

2 code implementations • 23 Feb 2024 • JianGuo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Tulika Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong

It meticulously standardizes and unifies these trajectories into a consistent format, streamlining the creation of a generic data loader optimized for agent training.

Paper
Code

Using Left and Right Brains Together: Towards Vision and Language Planning

no code implementations • 16 Feb 2024 • Jun Cen, Chenfei Wu, Xiao Liu, Shengming Yin, Yixuan Pei, Jinglong Yang, Qifeng Chen, Nan Duan, JianGuo Zhang

Large Language Models (LLMs) and Large Multi-modality Models (LMMs) have demonstrated remarkable decision masking capabilities on a variety of tasks.

Paper
Add Code

Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit

no code implementations • 30 Dec 2023 • Yao Wan, Yang He, Zhangqian Bi, JianGuo Zhang, Hongyu Zhang, Yulei Sui, Guandong Xu, Hai Jin, Philip S. Yu

We also benchmark several state-of-the-art neural models for code intelligence, and provide an open-source toolkit tailored for the rapid prototyping of deep-learning-based code intelligence models.

Representation Learning

Paper
Add Code

Video Understanding with Large Language Models: A Survey

1 code implementation • 29 Dec 2023 • Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, JianGuo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu

With the burgeoning growth of online video platforms and the escalating volume of video content, the demand for proficient video understanding tools has intensified markedly.

Video Understanding

650

Paper
Code

DRDT: Dynamic Reflection with Divergent Thinking for LLM-based Sequential Recommendation

no code implementations • 18 Dec 2023 • Yu Wang, Zhiwei Liu, JianGuo Zhang, Weiran Yao, Shelby Heinecke, Philip S. Yu

With our principle, we managed to outperform GPT-Turbo-3. 5 on three datasets using 7b models e. g., Vicuna-7b and Openchat-7b on NDCG@10.

In-Context Learning Sequential Recommendation

Paper
Add Code

Semi-supervised Semantic Segmentation via Boosting Uncertainty on Unlabeled Data

no code implementations • 30 Nov 2023 • Daoan Zhang, Yunhao Luo, JianGuo Zhang

We first figure out that the distribution gap between labeled and unlabeled datasets cannot be ignored, even though the two datasets are sampled from the same distribution.

Segmentation Semi-Supervised Semantic Segmentation

Paper
Add Code

Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System

no code implementations • 16 Aug 2023 • JianGuo Zhang, Stephen Roller, Kun Qian, Zhiwei Liu, Rui Meng, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong

End-to-end task-oriented dialogue (TOD) systems have achieved promising performance by leveraging sophisticated natural language understanding and natural language generation capabilities of pre-trained models.

Natural Language Understanding Retrieval +1

Paper
Add Code

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

2 code implementations • 11 Aug 2023 • Zhiwei Liu, Weiran Yao, JianGuo Zhang, Le Xue, Shelby Heinecke, Rithesh Murthy, Yihao Feng, Zeyuan Chen, Juan Carlos Niebles, Devansh Arpit, ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese

The massive successes of large language models (LLMs) encourage the emerging exploration of LLM-augmented Autonomous Agents (LAAs).

Benchmarking Decision Making

275

Paper
Code

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

no code implementations • 4 Aug 2023 • Weiran Yao, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Yihao Feng, Le Xue, Rithesh Murthy, Zeyuan Chen, JianGuo Zhang, Devansh Arpit, ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese

This demonstrates that using policy gradient optimization to improve language agents, for which we believe our work is one of the first, seems promising and can be applied to optimize other models in the agent architecture to enhance agent performances over time.

Language Modelling

Paper
Add Code

Cross Contrasting Feature Perturbation for Domain Generalization

2 code implementations • ICCV 2023 • Chenming Li, Daoan Zhang, Wenjian Huang, JianGuo Zhang

Domain generalization (DG) aims to learn a robust model from source domains that generalize well on unseen target domains.

Domain Generalization

Paper
Code

Strip-MLP: Efficient Token Interaction for Vision MLP

1 code implementation • ICCV 2023 • Guiping Cao, Shengda Luo, Wenjian Huang, Xiangyuan Lan, Dongmei Jiang, YaoWei Wang, JianGuo Zhang

Finally, based on the Strip MLP layer, we propose a novel \textbf{L}ocal \textbf{S}trip \textbf{M}ixing \textbf{M}odule (LSMM) to boost the token interaction power in the local region.

Paper
Code

Class Attention to Regions of Lesion for Imbalanced Medical Image Recognition

no code implementations • 19 Jul 2023 • Jia-Xin Zhuang, Jiabin Cai, JianGuo Zhang, Wei-Shi Zheng, Ruixuan Wang

The CARE framework needs bounding boxes to represent the lesion regions of rare diseases.

Image Classification Medical Image Classification

Paper
Add Code

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI

1 code implementation • 19 Jul 2023 • JianGuo Zhang, Kun Qian, Zhiwei Liu, Shelby Heinecke, Rui Meng, Ye Liu, Zhou Yu, Huan Wang, Silvio Savarese, Caiming Xiong

Despite advancements in conversational AI, language models encounter challenges to handle diverse conversational tasks, and existing dialogue dataset collections often lack diversity and comprehensiveness.

Few-Shot Learning Language Modelling +1

432

Paper
Code

DNAGPT: A Generalized Pre-trained Tool for Versatile DNA Sequence Analysis Tasks

no code implementations • 11 Jul 2023 • Daoan Zhang, Weitong Zhang, Yu Zhao, JianGuo Zhang, Bing He, Chenchen Qin, Jianhua Yao

Pre-trained large language models demonstrate potential in extracting information from DNA sequences, yet adapting to a variety of tasks and data modalities remains a challenge.

Binary Classification DNA analysis +1

Paper
Add Code

Inter-Rater Uncertainty Quantification in Medical Image Segmentation via Rater-Specific Bayesian Neural Networks

1 code implementation • 28 Jun 2023 • Qingqiao Hu, Hao Wang, Jing Luo, Yunhao Luo, Zhiheng Zhangg, Jan S. Kirschke, Benedikt Wiestler, Bjoern Menze, JianGuo Zhang, Hongwei Bran Li

We introduce a novel Bayesian neural network-based architecture to estimate inter-rater uncertainty in medical image segmentation.

Image Segmentation Medical Image Segmentation +3

Paper
Code

When SAM Meets Sonar Images

1 code implementation • 25 Jun 2023 • Lin Wang, Xiufen Ye, Liqiang Zhu, Weijie Wu, JianGuo Zhang, Huiming Xing, Chao Hu

Notably, there is a lack of research on the application of SAM to sonar imaging.

Segmentation Semantic Segmentation

Paper
Code

Efficient Deep Spiking Multi-Layer Perceptrons with Multiplication-Free Inference

no code implementations • 21 Jun 2023 • Boyan Li, Luziwei Leng, Ran Cheng, Shuaijie Shen, Kaixuan Zhang, JianGuo Zhang, Jianxing Liao

An expanded version of our network challenges the performance of the spiking VGG-16 network with a 71. 64% top-1 accuracy, all while operating with a model capacity 2. 1 times smaller.

Image Classification

Paper
Add Code

Zero-shot Item-based Recommendation via Multi-task Product Knowledge Graph Pre-Training

no code implementations • 12 May 2023 • Ziwei Fan, Zhiwei Liu, Shelby Heinecke, JianGuo Zhang, Huan Wang, Caiming Xiong, Philip S. Yu

This paper presents a novel paradigm for the Zero-Shot Item-based Recommendation (ZSIR) task, which pre-trains a model on product knowledge graph (PKG) to refine the item features from PLMs.

Recommendation Systems

Paper
Add Code

Feature Alignment and Uniformity for Test Time Adaptation

1 code implementation • CVPR 2023 • Shuai Wang, Daoan Zhang, Zipei Yan, JianGuo Zhang, Rui Li

Test time adaptation (TTA) aims to adapt deep neural networks when receiving out of distribution test domain samples.

Domain Generalization Image Segmentation +3

Paper
Code

Bootstrap The Original Latent: Learning a Private Model from a Black-box Model

no code implementations • 7 Mar 2023 • Shuai Wang, Daoan Zhang, JianGuo Zhang, Weiwei Zhang, Rui Li

In this paper, considering the balance of data/model privacy of model owners and user needs, we propose a new setting called Back-Propagated Black-Box Adaptation (BPBA) for users to better train their private models via the guidance of the back-propagated results of a Black-box foundation/source model.

Paper
Add Code

Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems

2 code implementations • 20 Feb 2023 • Yihao Feng, Shentao Yang, Shujian Zhang, JianGuo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang

Prior works mainly focus on adopting advanced RL techniques to train the ToD agents, while the design of the reward function is not well studied.

Learning-To-Rank Reinforcement Learning (RL) +2

819

Paper
Code

Aggregation of Disentanglement: Reconsidering Domain Variations in Domain Generalization

no code implementations • 5 Feb 2023 • Daoan Zhang, Mingkai Chen, Chenming Li, Lingyun Huang, JianGuo Zhang

Different from learning domain invariant features from source domains, we decouple the input images into Domain Expert Features and noise.

Contrastive Learning Disentanglement +1

Paper
Add Code

AugTriever: Unsupervised Dense Retrieval by Scalable Data Augmentation

no code implementations • 17 Dec 2022 • Rui Meng, Ye Liu, Semih Yavuz, Divyansh Agarwal, Lifu Tu, Ning Yu, JianGuo Zhang, Meghana Bhat, Yingbo Zhou

Dense retrievers have made significant strides in text retrieval and open-domain question answering, even though most achievements were made possible only with large amounts of human supervision.

Data Augmentation Open-Domain Question Answering +2

Paper
Add Code

A Domain-specific Perceptual Metric via Contrastive Self-supervised Representation: Applications on Natural and Medical Images

no code implementations • 3 Dec 2022 • Hongwei Bran Li, Chinmay Prabhakar, Suprosanna Shit, Johannes Paetzold, Tamaz Amiranashvili, JianGuo Zhang, Daniel Rueckert, Juan Eugenio Iglesias, Benedikt Wiestler, Bjoern Menze

We find that in the natural image domain, CSR behaves on par with the supervised one on several perceptual tests as a metric, and in the medical domain, CSR better quantifies perceptual similarity concerning the experts' ratings.

Image Generation

Paper
Add Code

Rethinking Alignment and Uniformity in Unsupervised Image Semantic Segmentation

no code implementations • 26 Nov 2022 • Daoan Zhang, Chenming Li, Haoquan Li, Wenjian Huang, Lingyun Huang, JianGuo Zhang

Experimental results on multiple semantic segmentation benchmarks show that our unsupervised segmentation framework specializes in catching semantic representations, which outperforms all the unpretrained and even several pretrained methods.

Ranked #1 on Unsupervised Semantic Segmentation on COCO-Stuff-3

Representation Learning Segmentation +2

Paper
Add Code

Partial Least Square Regression via Three-factor SVD-type Manifold Optimization for EEG Decoding

no code implementations • 9 Aug 2022 • Wanguang Yin, Zhichao Liang, JianGuo Zhang, Quanying Liu

To this end, we propose a new method to solve the partial least square regression, named PLSR via optimization on bi-Grassmann manifold (PLSRbiGr).

EEG Eeg Decoding +3

Paper
Add Code

Domain-Adaptive 3D Medical Image Synthesis: An Efficient Unsupervised Approach

1 code implementation • 2 Jul 2022 • Qingqiao Hu, Hongwei Li, JianGuo Zhang

This work focuses on exploring domain adaptation (DA) of 3D image-to-image synthesis models.

Domain Adaptation Image Generation

Paper
Code

Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning

1 code implementation • CVPR 2022 • Jiahao Xia, Weiwei qu, Wenjian Huang, JianGuo Zhang, Xi Wang, Min Xu

The SLPT generates the representation of each single landmark from a local patch and aggregates them by an adaptive inherent relation based on the attention mechanism.

Ranked #2 on Face Alignment on COFW-68

Face Alignment Relation +1

Paper
Code

Discrete Time Convolution for Fast Event-Based Stereo

1 code implementation • CVPR 2022 • Kaixuan Zhang, Kaiwei Che, JianGuo Zhang, Jie Cheng, Ziyang Zhang, Qinghai Guo, Luziwei Leng

Inspired by continuous dynamics of biological neuron models, we propose a novel encoding method for sparse events - continuous time convolution (CTC) - which learns to model the spatial feature of the data with intrinsic dynamics.

Depth Estimation Stereo Matching

Paper
Code

Detect Faces Efficiently: A Survey and Evaluations

2 code implementations • 3 Dec 2021 • Yuantao Feng, Shiqi Yu, Hanyang Peng, Yan-ran Li, JianGuo Zhang

However, with the tremendous increase in images and videos with variations in face scale, appearance, expression, occlusion and pose, traditional face detectors are challenged to detect various "in the wild" faces.

Face Detection Face Recognition +3

12,041

Paper
Code

Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning

2 code implementations • EMNLP 2021 • JianGuo Zhang, Trung Bui, Seunghyun Yoon, Xiang Chen, Zhiwei Liu, Congying Xia, Quan Hung Tran, Walter Chang, Philip Yu

In this work, we focus on a more challenging few-shot intent detection scenario where many intents are fine-grained and semantically similar.

Contrastive Learning Intent Detection

123

Paper
Code

Are Pretrained Transformers Robust in Intent Classification? A Missing Ingredient in Evaluation of Out-of-Scope Intent Detection

1 code implementation • 8 Jun 2021 • JianGuo Zhang, Kazuma Hashimoto, Yao Wan, Zhiwei Liu, Ye Liu, Caiming Xiong, Philip S. Yu

Pre-trained Transformer-based models were reported to be robust in intent classification.

intent-classification Intent Classification +2

123

Paper
Code

Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation

no code implementations • EACL 2021 • Ye Liu, Yao Wan, JianGuo Zhang, Wenting Zhao, Philip Yu

In this paper, we claim that the syntactic and semantic structures among natural language are critical for non-autoregressive machine translation and can further improve the performance.

Machine Translation Translation

Paper
Add Code

Imbalance-Aware Self-Supervised Learning for 3D Radiomic Representations

no code implementations • 6 Mar 2021 • Hongwei Li, Fei-Fei Xue, Krishna Chaitanya, Shengda Luo, Ivan Ezhov, Benedikt Wiestler, JianGuo Zhang, Bjoern Menze

Radiomic representations can quantify properties of regions of interest in medical image data.

Representation Learning Self-Supervised Learning

Paper
Add Code

Deep Class-Specific Affinity-Guided Convolutional Network for Multimodal Unpaired Image Segmentation

no code implementations • 5 Jan 2021 • Jingkun Chen, Wenqi Li, Hongwei Li, JianGuo Zhang

Our affinity matrix does not depend on spatial alignments of the visual features and thus allows us to train with unpaired, multimodal inputs.

Image Segmentation Medical Image Segmentation +2

Paper
Add Code

Sign-Agnostic Implicit Learning of Surface Self-Similarities for Shape Modeling and Reconstruction from Raw Point Clouds

no code implementations • CVPR 2021 • Wenbin Zhao, Jiabao Lei, Yuxin Wen, JianGuo Zhang, Kui Jia

Motivated from a universal phenomenon that self-similar shape patterns of local surface patches repeat across the entire surface of an object, we aim to push forward the data-driven strategies and propose to learn a local implicit surface network for a shared, adaptive modeling of the entire surface for a direct surface reconstruction from raw point cloud; we also enhance the leveraging of surface self-similarities by improving correlations among the optimized latent codes of individual surface patches.

Surface Reconstruction

Paper
Add Code

Deep CNNs for HEp-2 Cells Classification : A Cross-specimen Analysis

no code implementations • 20 Apr 2016 • Hongwei Li, Wei-Shi Zheng, JianGuo Zhang

Automatic classification of Human Epithelial Type-2 (HEp-2) cells staining patterns is an important and yet a challenging problem.

Classification General Classification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.