Search Results for author: Le Zhang

Found 68 papers, 36 papers with code

Robust Visual Tracking Using Oblique Random Forests

1 code implementation CVPR 2017 Le Zhang, Jagannadan Varadarajan, Ponnuthurai Nagaratnam Suganthan, Narendra Ahuja, Pierre Moulin

Unlike conventional orthogonal decision trees that use a single feature and heuristic measures to obtain a split at each node, we propose to use a more powerful proximal SVM to obtain oblique hyperplanes to capture the geometric structure of the data better.

General Classification Image Classification +5

Image Matching: An Application-oriented Benchmark

no code implementations12 Sep 2017 Jia-Wang Bian, Le Zhang, Yun Liu, Wen-Yan Lin, Ming-Ming Cheng, Ian D. Reid

To this end, we present a uniform benchmark with novel evaluation metrics and a large-scale dataset for evaluating the overall performance of image matching methods.

Attribute Benchmarking

Kernel Cross-Correlator

3 code implementations12 Sep 2017 Chen Wang, Le Zhang, Lihua Xie, Junsong Yuan

Cross-correlator plays a significant role in many visual perception tasks, such as object detection and tracking.

Human Activity Recognition object-detection +2

Semantic Edge Detection with Diverse Deep Supervision

1 code implementation9 Apr 2018 Yun Liu, Ming-Ming Cheng, Deng-Ping Fan, Le Zhang, Jiawang Bian, DaCheng Tao

Semantic edge detection (SED), which aims at jointly extracting edges as well as their category information, has far-reaching applications in domains such as semantic segmentation, object proposal generation, and object recognition.

Edge Detection Object Proposal Generation +2

A Deep Network for Arousal-Valence Emotion Prediction with Acoustic-Visual Cues

1 code implementation2 May 2018 Songyou Peng, Le Zhang, Yutong Ban, Meng Fang, Stefan Winkler

In this paper, we comprehensively describe the methodology of our submissions to the One-Minute Gradual-Emotion Behavior Challenge 2018.

Learning Pixel-wise Labeling from the Internet without Human Interaction

no code implementations19 May 2018 Yun Liu, Yujun Shi, Jia-Wang Bian, Le Zhang, Ming-Ming Cheng, Jiashi Feng

Collecting sufficient annotated data is very expensive in many applications, especially for pixel-level prediction tasks such as semantic segmentation.

Segmentation Semantic Segmentation

MatchBench: An Evaluation of Feature Matchers

no code implementations7 Aug 2018 Jia-Wang Bian, Ruihan Yang, Yun Liu, Le Zhang, Ming-Ming Cheng, Ian Reid, WenHai Wu

This leads to a critical absence in this field that there is no standard datasets and evaluation metrics to evaluate different feature matchers fairly.

Automatic Assessment of Full Left Ventricular Coverage in Cardiac Cine Magnetic Resonance Imaging with Fisher-Discriminative 3D CNN

no code implementations6 Nov 2018 Le Zhang, Ali Gooya, Marco Pereanez, Bo Dong, Stefan K. Piechnik, Stefan Neubauer, Steffen E. Petersen, Alejandro F. Frangi

Full coverage of the left ventricle (LV), from base to apex, is a basic criterion for CMR image quality and necessary for accurate measurement of cardiac volume and functional assessment.

Salient Object Detection via High-to-Low Hierarchical Context Aggregation

no code implementations28 Dec 2018 Yun Liu, Yu Qiu, Le Zhang, Jia-Wang Bian, Guang-Yu Nie, Ming-Ming Cheng

In this paper, we observe that the contexts of a natural image can be well expressed by a high-to-low self-learning of side-output convolutional features.

object-detection RGB Salient Object Detection +4

A Deep Framework for Bone Age Assessment based on Finger Joint Localization

no code implementations7 May 2019 Xiaoman Zhang, Ziyuan Zhao, Cen Chen, Songyou Peng, Min Wu, Zhongyao Cheng, Singee Teo, Le Zhang, Zeng Zeng

In this study, we applied powerful deep neural network and explored a process in the forecast of skeletal bone age with the specifically combine joints images to increase the performance accuracy compared with the whole hand images.

Robust Regression via Deep Negative Correlation Learning

no code implementations24 Aug 2019 Le Zhang, Zenglin Shi, Ming-Ming Cheng, Yun Liu, Jia-Wang Bian, Joey Tianyi Zhou, Guoyan Zheng, Zeng Zeng

Nonlinear regression has been extensively employed in many computer vision problems (e. g., crowd counting, age estimation, affective computing).

Age Estimation Crowd Counting +2

An Evaluation of Feature Matchers for Fundamental Matrix Estimation

no code implementations26 Aug 2019 Jia-Wang Bian, Yu-Huan Wu, Ji Zhao, Yun Liu, Le Zhang, Ming-Ming Cheng, Ian Reid

According to this, we propose three high-quality matching systems and a Coarse-to-Fine RANSAC estimator.

A novel centroid update approach for clustering-based superpixel methods and superpixel-based edge detection

2 code implementations18 Oct 2019 Houwang Zhang, Chong Wu, Le Zhang, Hanying Zheng

Then according to the statistical features of noise, we propose a novel centroid update approach to enhance the robustness of clustering-based superpixel methods.

Clustering Edge Detection

AdaSample: Adaptive Sampling of Hard Positives for Descriptor Learning

no code implementations27 Nov 2019 Xin-Yu Zhang, Le Zhang, Zao-Yi Zheng, Yun Liu, Jia-Wang Bian, Ming-Ming Cheng

The effectiveness of the triplet loss heavily relies on the triplet selection, in which a common practice is to first sample intra-class patches (positives) from the dataset for batch construction and then mine in-batch negatives to form triplets.

Informativeness

Ordered or Orderless: A Revisit for Video based Person Re-Identification

no code implementations24 Dec 2019 Le Zhang, Zenglin Shi, Joey Tianyi Zhou, Ming-Ming Cheng, Yun Liu, Jia-Wang Bian, Zeng Zeng, Chunhua Shen

Specifically, with a diagnostic analysis, we show that the recurrent structure may not be effective to learn temporal dependencies than what we expected and implicitly yields an orderless representation.

Video-Based Person Re-Identification

The 2019 BBN Cross-lingual Information Retrieval System

no code implementations LREC 2020 Le Zhang, Damianos Karakos, William Hartmann, Manaj Srivastava, Lee Tarlin, David Akodes, Sanjay Krishna Gouda, Numra Bathool, Lingjun Zhao, Zhuolin Jiang, Richard Schwartz, John Makhoul

In this paper, we describe a cross-lingual information retrieval (CLIR) system that, given a query in English, and a set of audio and text documents in a foreign language, can return a scored list of relevant documents, and present findings in a summary form in English.

Cross-Lingual Information Retrieval Machine Translation +4

Regularized Densely-connected Pyramid Network for Salient Instance Segmentation

1 code implementation28 Aug 2020 Yu-Huan Wu, Yun Liu, Le Zhang, Wang Gao, Ming-Ming Cheng

Much of the recent efforts on salient object detection (SOD) have been devoted to producing accurate saliency maps without being aware of their instance labels.

Instance Segmentation object-detection +3

Generalized Zero-Shot Learning via VAE-Conditioned Generative Flow

1 code implementation1 Sep 2020 Yu-Chao Gu, Le Zhang, Yun Liu, Shao-Ping Lu, Ming-Ming Cheng

Recent generative methods formulate GZSL as a missing data problem, which mainly adopts GANs or VAEs to generate visual features for unseen classes.

Generalized Zero-Shot Learning

Disentangling Human Error from Ground Truth in Segmentation of Medical Images

1 code implementation NeurIPS 2020 Le Zhang, Ryutaro Tanno, MouCheng Xu, Chen Jin, Joseph Jacob, Olga Cicarrelli, Frederik Barkhof, Daniel Alexander

In all cases, our method outperforms competing methods and relevant baselines particularly in cases where the number of annotations is small and the amount of disagreement is large.

Medical Image Segmentation Segmentation

EDN: Salient Object Detection via Extremely-Downsampled Network

1 code implementation24 Dec 2020 Yu-Huan Wu, Yun Liu, Le Zhang, Ming-Ming Cheng, Bo Ren

In this paper, we tap into this gap and show that enhancing high- level features is essential for SOD as well.

Object object-detection +3

Unsupervised Scale-consistent Depth Learning from Video

2 code implementations25 May 2021 Jia-Wang Bian, Huangying Zhan, Naiyan Wang, Zhichao Li, Le Zhang, Chunhua Shen, Ming-Ming Cheng, Ian Reid

We propose a monocular depth estimator SC-Depth, which requires only unlabelled videos for training and enables the scale-consistent prediction at inference time.

Monocular Depth Estimation Monocular Visual Odometry +1

Vision Transformers with Hierarchical Attention

3 code implementations6 Jun 2021 Yun Liu, Yu-Huan Wu, Guolei Sun, Le Zhang, Ajad Chhatkuli, Luc van Gool

This paper tackles the high computational/space complexity associated with Multi-Head Self-Attention (MHSA) in vanilla vision transformers.

Image Classification Instance Segmentation +4

Free Lunch for Co-Saliency Detection: Context Adjustment

no code implementations4 Aug 2021 Lingdong Kong, Prakhar Ganesh, Tan Wang, Junhao Liu, Le Zhang, Yao Chen

We hope that the scale, diversity, and quality of our dataset can benefit researchers in this area and beyond.

counterfactual Saliency Detection +1

Boosting Salient Object Detection with Transformer-based Asymmetric Bilateral U-Net

1 code implementation17 Aug 2021 Yu Qiu, Yun Liu, Le Zhang, Jing Xu

The asymmetric bilateral encoder has a transformer path and a lightweight CNN path, where the two paths communicate at each encoder stage to learn complementary global contexts and local spatial details, respectively.

Object object-detection +2

Learning to Iteratively Solve Routing Problems with Dual-Aspect Collaborative Transformer

2 code implementations NeurIPS 2021 Yining Ma, Jingwen Li, Zhiguang Cao, Wen Song, Le Zhang, Zhenghua Chen, Jing Tang

Moreover, the positional features are embedded through a novel cyclic positional encoding (CPE) method to allow Transformer to effectively capture the circularity and symmetry of VRP solutions (i. e., cyclic sequences).

Traveling Salesman Problem

Probing Simile Knowledge from Pre-trained Language Models

1 code implementation ACL 2022 WeiJie Chen, Yongzhu Chang, Rongsheng Zhang, Jiashu Pu, Guandan Chen, Le Zhang, Yadong Xi, Yijiang Chen, Chang Su

In this paper, we probe simile knowledge from PLMs to solve the SI and SG tasks in the unified framework of simile triple completion for the first time.

Language Modelling Position +1

Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics Driven Reinforcement Learning

no code implementations9 Aug 2022 Xin Jin, Shu Zhao, Le Zhang, Xin Zhao, Qiang Deng, Chaoen Xiao

In recent years, image generation has made great strides in improving the quality of images, producing high-fidelity ones.

Attribute Face Generation +3

Aesthetic Visual Question Answering of Photographs

no code implementations10 Aug 2022 Xin Jin, Wu Zhou, Xinghui Zhou, Shuai Cui, Le Zhang, Jianwen Lv, Shu Zhao

In this paper, we propose a new task of aesthetic language assessment: aesthetic visual question and answering (AVQA) of images.

Question Answering Sentiment Analysis +1

Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in Driving Scenes

no code implementations18 Aug 2022 Yu-Huan Wu, Da Zhang, Le Zhang, Xin Zhan, Dengxin Dai, Yun Liu, Ming-Ming Cheng

Current efficient LiDAR-based detection frameworks are lacking in exploiting object relations, which naturally present in both spatial and temporal manners.

3D Object Detection Object +2

AIA: Attention in Attention Within Collaborate Domains

1 code implementation Pattern Recognition and Computer Vision 2022 Le Zhang, Qi Feng, Yao Lu, Chang Liu, and Guangming Lu

Attention mechanisms can effectively improve the performance of the mobile networks with a limited computational complexity cost.

Deep Attention Position

Deep Negative Correlation Classification

no code implementations14 Dec 2022 Le Zhang, Qibin Hou, Yun Liu, Jia-Wang Bian, Xun Xu, Joey Tianyi Zhou, Ce Zhu

Ensemble learning serves as a straightforward way to improve the performance of almost any machine learning algorithm.

Classification Ensemble Learning

FedTADBench: Federated Time-Series Anomaly Detection Benchmark

1 code implementation19 Dec 2022 Fanxing Liu, Cheng Zeng, Le Zhang, Yingjie Zhou, Qing Mu, Yanru Zhang, Ling Zhang, Ce Zhu

We would like to answer the following questions: (1)How is the performance of time series anomaly detection algorithms when meeting federated learning?

Anomaly Detection Federated Learning +2

GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples

1 code implementation13 May 2023 Tian Gao, Cheng-Zhong Xu, Le Zhang, Hui Kong

Compared with the full-precision one, the model with the binarization method replaces complex tensor multiplication with simple bit-wise binary operations and represents full-precision model parameters and activations with only 1-bit ones, which potentially solves the problem of model size and computational complexity, respectively.

Binarization Knowledge Distillation +2

Enhancing the Protein Tertiary Structure Prediction by Multiple Sequence Alignment Generation

2 code implementations2 Jun 2023 Le Zhang, Jiayang Chen, Tao Shen, Yu Li, Siqi Sun

The field of protein folding research has been greatly advanced by deep learning methods, with AlphaFold2 (AF2) demonstrating exceptional performance and atomic-level precision.

Language Modelling Multiple Sequence Alignment +2

CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation

1 code implementation7 Jun 2023 Boyuan Sun, YuQi Yang, Le Zhang, Ming-Ming Cheng, Qibin Hou

Motivated by these, we aim to improve the use efficiency of unlabeled data by designing two novel label propagation strategies.

Segmentation Semi-Supervised Semantic Segmentation

Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering

1 code implementation16 Jun 2023 Rabiul Awal, Le Zhang, Aishwarya Agrawal

In summary, our research sheds light on the intricacies of prompting strategies in VLMs for VQA, emphasizing the synergistic use of captions, templates, and pre-processing to enhance model efficacy.

Image Captioning Question Answering +1

A Comprehensive Survey of Artificial Intelligence Techniques for Talent Analytics

no code implementations3 Jul 2023 Chuan Qin, Le Zhang, Rui Zha, Dazhong Shen, Qi Zhang, Ying Sun, Chen Zhu, HengShu Zhu, Hui Xiong

To this end, we present an up-to-date and comprehensive survey on AI technologies used for talent analytics in the field of human resource management.

Decision Making Management

Joint adjustment image steganography networks

1 code implementation Signal Processing: Image Communication 2023 Le Zhang, Yao Lu, Tong Li, Guangming Lu

Thus, the security and quality of stego and revealed secret images still have much room for promotion, especially for large-capacity image steganography.

Image Steganography Steganographics

Sudowoodo: a Chinese Lyric Imitation System with Source Lyrics

no code implementations9 Aug 2023 Yongzhu Chang, Rongsheng Zhang, Lin Jiang, Qihang Chen, Le Zhang, Jiashu Pu

In this paper, we introduce \textbf{\textit{Sudowoodo}}, a Chinese lyrics imitation system that can generate new lyrics based on the text of source lyrics.

Text Generation

Feature Modulation Transformer: Cross-Refinement of Global Representation via High-Frequency Prior for Image Super-Resolution

1 code implementation ICCV 2023 Ao Li, Le Zhang, Yun Liu, Ce Zhu

Transformer-based methods have exhibited remarkable potential in single image super-resolution (SISR) by effectively extracting long-range dependencies.

Image Super-Resolution

Irregular Traffic Time Series Forecasting Based on Asynchronous Spatio-Temporal Graph Convolutional Network

no code implementations31 Aug 2023 Weijia Zhang, Le Zhang, Jindong Han, Hao liu, Jingbo Zhou, Yu Mei, Hui Xiong

Accurate traffic forecasting at intersections governed by intelligent traffic signals is critical for the advancement of an effective intelligent traffic signal control system.

Time Series Time Series Forecasting

Multi-Task Cooperative Learning via Searching for Flat Minima

no code implementations21 Sep 2023 Fuping Wu, Le Zhang, Yang Sun, Yuanhan Mo, Thomas Nichols, Bartlomiej W. Papiez

In this work, we propose to formulate MTL as a multi/bi-level optimization problem, and therefore force features to learn from each task in a cooperative approach.

Multi-Task Learning

Low-Resolution Self-Attention for Semantic Segmentation

no code implementations8 Oct 2023 Yu-Huan Wu, Shi-Chen Zhang, Yun Liu, Le Zhang, Xin Zhan, Daquan Zhou, Jiashi Feng, Ming-Ming Cheng, Liangli Zhen

Semantic segmentation tasks naturally require high-resolution information for pixel-wise segmentation and global context information for class prediction.

Segmentation Semantic Segmentation

MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model

1 code implementation20 Oct 2023 Le Zhang, Yihong Wu, Fengran Mo, Jian-Yun Nie, Aishwarya Agrawal

To enable LLMs to tackle the task in a zero-shot manner, we introduce MoqaGPT, a straightforward and flexible framework.

Language Modelling Large Language Model +2

A Novel Deep Clustering Framework for Fine-Scale Parcellation of Amygdala Using dMRI Tractography

no code implementations25 Nov 2023 Haolin He, Ce Zhu, Le Zhang, Yipeng Liu, Xiao Xu, Yuqian Chen, Leo Zekelman, Jarrett Rushmore, Yogesh Rathi, Nikos Makris, Lauren J. O'Donnell, Fan Zhang

The amygdala plays a vital role in emotional processing and exhibits structural diversity that necessitates fine-scale parcellation for a comprehensive understanding of its anatomico-functional correlations.

Clustering Deep Clustering +1

Learning Triangular Distribution in Visual World

1 code implementation30 Nov 2023 Ping Chen, Xingpeng Zhang, Chengtao Zhou, Dichao Fan, Peng Tu, Le Zhang, Yanlin Qian

Convolution neural network is successful in pervasive vision tasks, including label distribution learning, which usually takes the form of learning an injection from the non-linear visual features to the well-defined labels.

Learning to See Low-Light Images via Feature Domain Adaptation

no code implementations11 Dec 2023 Qirui Yang, Qihua Cheng, Huanjing Yue, Le Zhang, Yihao Liu, Jingyu Yang

To solve this problem, we propose a single-stage network empowered by Feature Domain Adaptation (FDA) to decouple the denoising and color mapping tasks in raw LLIE.

Denoising Domain Adaptation +1

ReliCD: A Reliable Cognitive Diagnosis Framework with Confidence Awareness

no code implementations29 Dec 2023 Yunfei Zhang, Chuan Qin, Dazhong Shen, Haiping Ma, Le Zhang, Xingyi Zhang, HengShu Zhu

To address this, in this paper, we propose a novel Reliable Cognitive Diagnosis(ReliCD) framework, which can quantify the confidence of the diagnosis feedback and is flexible for different cognitive diagnostic functions.

cognitive diagnosis

Increasing SAM Zero-Shot Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation

no code implementations24 Feb 2024 Zekun Jiang, Dongjie Cheng, Ziyuan Qin, Jun Gao, Qicheng Lao, Kang Li, Le Zhang

This study develops and evaluates a novel multimodal medical image zero-shot segmentation algorithm named Text-Visual-Prompt SAM (TV-SAM) without any manual annotations.

Descriptive Language Modelling +3

DGR: A General Graph Desmoothing Framework for Recommendation via Global and Local Perspectives

no code implementations7 Mar 2024 Leilei Ding, Dazhong Shen, Chao Wang, Tianfu Wang, Le Zhang, Hui Xiong, Yanyong Zhang

Graph Convolutional Networks (GCNs) have become pivotal in recommendation systems for learning user and item embeddings by leveraging the user-item interaction graph's node information and topology.

Recommendation Systems

RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection

1 code implementation25 Mar 2024 Zhiwei Lin, Zhe Liu, Zhongyu Xia, Xinhao Wang, Yongtao Wang, Shengxiang Qi, Yang Dong, Nan Dong, Le Zhang, Ce Zhu

In the dual-stream radar backbone, a point-based encoder and a transformer-based encoder are proposed to extract radar features, with an injection and extraction module to facilitate communication between the two encoders.

Autonomous Driving Object +2

QiuNiu: A Chinese Lyrics Generation System with Passage-Level Input

no code implementations ACL 2022 Le Zhang, Rongsheng Zhang, Xiaoxi Mao, Yongzhu Chang

In this paper, we demonstrate the QiuNiu, a Chinese lyrics generation system which is conditioned on passage-level text rather than a few attributes or keywords.

Text Generation Unsupervised Machine Translation

Cannot find the paper you are looking for? You can Submit a new open access paper.