Search Results for author: Yan Huang

Found 102 papers, 31 papers with code

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation

1 code implementation • CVPR 2023 • Zhengxiong Luo, Dayou Chen, Yingya Zhang, Yan Huang, Liang Wang, Yujun Shen, Deli Zhao, Jingren Zhou, Tieniu Tan

A diffusion probabilistic model (DPM), which constructs a forward diffusion process by gradually adding noise to data points and learns the reverse denoising process to generate new samples, has been shown to handle complex data distribution.

Ranked #7 on Video Generation on UCF-101

Code Generation Denoising +4

6,039

Paper
Code

Cyclic Differentiable Architecture Search

3 code implementations • 18 Jun 2020 • Hongyuan Yu, Houwen Peng, Yan Huang, Jianlong Fu, Hao Du, Liang Wang, Haibin Ling

First, the search network generates an initial architecture for evaluation, and the weights of the evaluation network are optimized.

Ranked #17 on Neural Architecture Search on NAS-Bench-201, CIFAR-10

Neural Architecture Search

1,561

Paper
Code

Unfolding the Alternating Optimization for Blind Super Resolution

1 code implementation • NeurIPS 2020 • Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

More importantly, \textit{Restorer} is trained with the kernel estimated by \textit{Estimator}, instead of ground-truth kernel, thus \textit{Restorer} could be more tolerant to the estimation error of \textit{Estimator}.

Ranked #2 on Blind Super-Resolution on Set5 - 2x upscaling

Blind Super-Resolution Burst Image Super-Resolution +1

229

Paper
Code

End-to-end Alternating Optimization for Blind Super Resolution

1 code implementation • 14 May 2021 • Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

More importantly, \textit{Restorer} is trained with the kernel estimated by \textit{Estimator}, instead of the ground-truth kernel, thus \textit{Restorer} could be more tolerant to the estimation error of \textit{Estimator}.

Ranked #2 on Blind Super-Resolution on DIV2KRK - 4x upscaling

Blind Super-Resolution Super-Resolution

229

Paper
Code

End-to-end Alternating Optimization for Real-World Blind Super Resolution

2 code implementations • 17 Aug 2023 • Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

To address this issue, instead of considering these two problems independently, we adopt an alternating optimization algorithm, which can estimate the degradation and restore the SR image in a single model.

Blind Super-Resolution Super-Resolution

229

Paper
Code

BEVBert: Multimodal Map Pre-training for Language-guided Navigation

1 code implementation • ICCV 2023 • Dong An, Yuankai Qi, Yangguang Li, Yan Huang, Liang Wang, Tieniu Tan, Jing Shao

Concretely, we build a local metric map to explicitly aggregate incomplete observations and remove duplicates, while modeling navigation dependency in a global topological map.

Ranked #2 on Visual Navigation on R2R

Vision and Language Navigation Visual Navigation

162

Paper
Code

1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)

1 code implementation • 23 Jun 2022 • Dong An, Zun Wang, Yangguang Li, Yi Wang, Yicong Hong, Yan Huang, Liang Wang, Jing Shao

Our model consists of three modules: the candidate waypoints predictor (CWP), the history enhanced planner and the tryout controller.

Data Augmentation Vision and Language Navigation

161

Paper
Code

ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments

1 code implementation • 6 Apr 2023 • Dong An, Hanqing Wang, Wenguan Wang, Zun Wang, Yan Huang, Keji He, Liang Wang

To develop a robust VLN-CE agent, we propose a new navigation framework, ETPNav, which focuses on two critical skills: 1) the capability to abstract environments and generate long-range navigation plans, and 2) the ability of obstacle-avoiding control in continuous environments.

Autonomous Navigation Navigate +1

161

Paper
Code

Learning the Degradation Distribution for Blind Image Super-Resolution

1 code implementation • CVPR 2022 • Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

Compared with previous deterministic degradation models, PDM could model more diverse degradations and generate HR-LR pairs that may better cover the various degradations of test images, and thus prevent the SR model from over-fitting to specific ones.

Image Super-Resolution

158

Paper
Code

Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation

1 code implementation • CVPR 2021 • Zhengxiong Luo, Zhicheng Wang, Yan Huang, Tieniu Tan, Erjin Zhou

However, for bottom-up methods, which need to handle a large variance of human scales and labeling ambiguities, the current practice seems unreasonable.

Pose Estimation regression

121

Paper
Code

Mask-Guided Contrastive Attention Model for Person Re-Identification

1 code implementation • CVPR 2018 • Chunfeng Song, Yan Huang, Wanli Ouyang, Liang Wang

We may be the first one to successfully introduce the binary mask into person ReID task and the first one to propose region-level contrastive learning.

Contrastive Learning Person Re-Identification

Paper
Code

Neighbor-view Enhanced Model for Vision and Language Navigation

1 code implementation • 15 Jul 2021 • Dong An, Yuankai Qi, Yan Huang, Qi Wu, Liang Wang, Tieniu Tan

Specifically, our NvEM utilizes a subject module and a reference module to collect contexts from neighbor views.

Ranked #82 on Vision and Language Navigation on VLN Challenge

Navigate Vision and Language Navigation

Paper
Code

Regularized Graph Structure Learning with Semantic Knowledge for Multi-variates Time-Series Forecasting

1 code implementation • 12 Oct 2022 • Hongyuan Yu, Ting Li, Weichen Yu, Jianguo Li, Yan Huang, Liang Wang, Alex Liu

In this paper, we propose Regularized Graph Structure Learning (RGSL) model to incorporate both explicit prior structure and implicit structure together, and learn the forecasting deep networks along with the graph structure.

Graph Generation Graph structure learning +2

Paper
Code

Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning

1 code implementation • 1 Dec 2023 • Shaohua Dong, Yunhe Feng, Qing Yang, Yan Huang, Dongfang Liu, Heng Fan

Existing approaches often fully fine-tune a dual-branch encoder-decoder framework with a complicated feature fusion strategy for achieving multimodal semantic segmentation, which is training-costly due to the massive parameter updates in feature extraction and fusion.

Ranked #2 on Semantic Segmentation on SUN-RGBD (using extra training data)

object-detection Object Detection +6

Paper
Code

Bag of Tricks for Training Data Extraction from Language Models

1 code implementation • 9 Feb 2023 • Weichen Yu, Tianyu Pang, Qian Liu, Chao Du, Bingyi Kang, Yan Huang, Min Lin, Shuicheng Yan

With the advance of language models, privacy protection is receiving more attention.

Text Generation

Paper
Code

Context-Guided Spatio-Temporal Video Grounding

1 code implementation • 3 Jan 2024 • Xin Gu, Heng Fan, Yan Huang, Tiejian Luo, Libo Zhang

The key of CG-STVG lies in two specially designed modules, including instance context generation (ICG), which focuses on discovering visual context information (in both appearance and motion) of the instance, and instance context refinement (ICR), which aims to improve the instance context from ICG by eliminating irrelevant or even harmful information from the context.

Ranked #1 on Spatio-Temporal Video Grounding on HC-STVG1

Object Spatio-Temporal Video Grounding +1

Paper
Code

3D Shape Temporal Aggregation for Video-Based Clothing-Change Person Re-Identication

1 code implementation • Asian Conference on Computer Vision 2023 • Ke Han, Shaogang Gong, Yan Huang, Liang Wang, Tieniu Tan

However, existing Re-ID methods usually generate 3D body shapes without considering identity modeling, which severely weakens the discriminability of 3D human shapes.

3D Shape Generation Person Re-Identification

Paper
Code

Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training

1 code implementation • 15 Jun 2023 • Chong Liu, Yuqi Zhang, Hongsong Wang, Weihua Chen, Fan Wang, Yan Huang, Yi-Dong Shen, Liang Wang

Most previous works either simply learn coarse-grained representations of the overall image and text, or elaborately establish the correspondence between image regions or pixels and text words.

Representation Learning Retrieval +1

Paper
Code

DGSSC: A Deep Generative Spectral-Spatial Classifier for Imbalanced Hyperspectral Imagery

1 code implementation • IEEE Transactions on Circuits and Systems for Video Technology 2022 • Bobo Xi, Jiaojiao Li, Yan Diao, Yunsong Li, Zan Li, Yan Huang, Jocelyn Chanussot

Specifically, the DGSSC comprises three components, a two-stage encoder, a decoder, and a classifier, which are trained in an end-to-end manner.

Data Augmentation Hyperspectral Image Classification

Paper
Code

Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision

1 code implementation • NeurIPS 2021 • Keji He, Yan Huang, Qi Wu, Jianhua Yang, Dong An, Shuanglin Sima, Liang Wang

In Vision-and-Language Navigation (VLN) task, an agent is asked to navigate inside 3D indoor environments following given instructions.

Navigate Vision and Language Navigation

Paper
Code

Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation

2 code implementations • 22 Aug 2023 • Yifei Su, Dong An, Yuan Xu, Kehan Chen, Yan Huang

This report details the methods of the winning entry of the AVDN Challenge in ICCV CLVL 2023.

Visual Grounding

Paper
Code

Box-driven Class-wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation

1 code implementation • CVPR 2019 • Chunfeng Song, Yan Huang, Wanli Ouyang, Liang Wang

To address this problem, it is a good choice to learn to segment with weak supervision from bounding boxes.

Weakly-supervised Learning Weakly supervised Semantic Segmentation +1

Paper
Code

SIN:Superpixel Interpolation Network

1 code implementation • 17 Oct 2021 • Qing Yuan, Songfeng Lu, Yan Huang, Wuxin Sha

The former is non-differentiable and the latter needs a non-differentiable post-processing step to enforce connectivity, which constraints the integration of superpixels and downstream tasks.

Computational Efficiency Superpixels +1

Paper
Code

GLARE: A Dataset for Traffic Sign Detection in Sun Glare

1 code implementation • 19 Sep 2022 • Nicholas Gray, Megan Moraes, Jiang Bian, Alex Wang, Allen Tian, Kurt Wilson, Yan Huang, Haoyi Xiong, Zhishan Guo

It provides an essential enrichment to the widely used LISA Traffic Sign dataset.

object-detection Object Detection +2

Paper
Code

A Hierarchical Contextual Attention-based GRU Network for Sequential Recommendation

1 code implementation • 14 Nov 2017 • Qiang Cui, Shu Wu, Yan Huang, Liang Wang

We fuse the current hidden state and a contextual hidden state built by the attention mechanism, which leads to a more suitable user's overall interest.

Sequential Recommendation

Paper
Code

CMF: Cascaded Multi-model Fusion for Referring Image Segmentation

1 code implementation • 16 Jun 2021 • Jianhua Yang, Yan Huang, Zhanyu Ma, Liang Wang

To solve this problem, we propose a simple yet effective Cascaded Multi-modal Fusion (CMF) module, which stacks multiple atrous convolutional layers in parallel and further introduces a cascaded branch to fuse visual and linguistic features.

Image Segmentation Segmentation +1

Paper
Code

VI-Diff: Unpaired Visible-Infrared Translation Diffusion Model for Single Modality Labeled Visible-Infrared Person Re-identification

1 code implementation • 6 Oct 2023 • Han Huang, Yan Huang, Liang Wang

In this paper, we propose VI-Diff, a diffusion model that effectively addresses the task of Visible-Infrared person image translation.

Image-to-Image Translation Person Re-Identification +1

Paper
Code

Co-Driven Recognition of Semantic Consistency via the Fusion of Transformer and HowNet Sememes Knowledge

1 code implementation • 21 Feb 2023 • Fan Chen, Yan Huang, Xinfang Zhang, Kang Luo, Jinxuan Zhu, Ruixian He

Multi-level encoding of internal sentence structures via data-driven is carried out firstly by Transformer, sememes knowledge base HowNet is introduced for knowledge-driven to model the semantic knowledge association among sentence pairs.

Paraphrase Identification Sentence +1

Paper
Code

Illumination Distillation Framework for Nighttime Person Re-Identification and A New Benchmark

1 code implementation • 31 Aug 2023 • Andong Lu, Zhang Zhang, Yan Huang, Yifan Zhang, Chenglong Li, Jin Tang, Liang Wang

The illumination enhancement branch first estimates an enhanced image from the nighttime image using a nonlinear curve mapping method and then extracts the enhanced features.

Person Re-Identification

Paper
Code

Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching

1 code implementation • IEEE Transactions on Multimedia 2021 • Aihua Zheng, Menglan Hu, Bo Jiang *, Yan Huang, Yan Yan, and Bin Luo

AML aims to generate a modality-independent representation for each person in each modality via adversarial learning, while simultaneously learns a robust similarity measure for cross-modality matching via metric learning.

audio-visual learning Metric Learning +1

Paper
Code

Relational Network for Skeleton-Based Action Recognition

no code implementations • 7 May 2018 • Wu Zheng, Lin Li, Zhao-Xiang Zhang, Yan Huang, Liang Wang

We introduce the Recurrent Relational Network to learn the spatial features in a single skeleton, followed by a multi-layer LSTM to learn the temporal features in the skeleton sequences.

Ranked #95 on Skeleton Based Action Recognition on NTU RGB+D

Action Recognition Skeleton Based Action Recognition +1

Paper
Add Code

Multi-pseudo Regularized Label for Generated Data in Person Re-Identification

no code implementations • 21 Jan 2018 • Yan Huang, Jinsong Xu, Qiang Wu, Zhedong Zheng, Zhao-Xiang Zhang, Jian Zhang

Unlike the traditional label which usually is a single integral number, the virtual label proposed in this work is a set of weight-based values each individual of which is a number in (0, 1] called multi-pseudo label and reflects the degree of relation between each generated data to every pre-defined class of real data.

Generative Adversarial Network Person Re-Identification +1

Paper
Add Code

Learning Semantic Concepts and Order for Image and Sentence Matching

no code implementations • CVPR 2018 • Yan Huang, Qi Wu, Liang Wang

This mainly arises from that the representation of pixel-level image usually lacks of high-level semantic information as in its matched sentence.

Ranked #11 on Image Retrieval on Flickr30K 1K test

Cross-Modal Retrieval Sentence

Paper
Add Code

Multimodal Memory Modelling for Video Captioning

no code implementations • 17 Nov 2016 • Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan

In this paper, we propose a Multimodal Memory Model (M3) to describe videos, which builds a visual and textual shared memory to model the long-term visual-textual dependency and further guide global visual attention on described targets.

Sentence Video Captioning

Paper
Add Code

Instance-aware Image and Sentence Matching with Selective Multimodal LSTM

no code implementations • CVPR 2017 • Yan Huang, Wei Wang, Liang Wang

Based on the observation that such a global similarity arises from a complex aggregation of multiple local similarities between pairwise instances of image (objects) and sentence (words), we propose a selective multimodal Long Short-Term Memory network (sm-LSTM) for instance-aware image and sentence matching.

Ranked #14 on Image Retrieval on Flickr30K 1K test

Paper
Add Code

Anchoring and Agreement in Syntactic Annotations

no code implementations • EMNLP 2016 • Yevgeni Berzak, Yan Huang, Andrei Barbu, Anna Korhonen, Boris Katz

Our agreement results control for parser bias, and are consequential in that they are on par with state of the art parsing performance for English newswire.

Decision Making Dependency Parsing

Paper
Add Code

DeepMove: Learning Place Representations through Large Scale Movement Data

no code implementations • 11 Jul 2018 • Yang Zhou, Yan Huang

DeepMove is spatial and temporal context aware.

Clustering

Paper
Add Code

RetGK: Graph Kernels based on Return Probabilities of Random Walks

no code implementations • NeurIPS 2018 • Zhen Zhang, Mianzhi Wang, Yijian Xiang, Yan Huang, Arye Nehorai

Graph-structured data arise in wide applications, such as computer vision, bioinformatics, and social networks.

Computational Efficiency General Classification +1

Paper
Add Code

Bidirectional Recurrent Convolutional Networks for Multi-Frame Super-Resolution

no code implementations • NeurIPS 2015 • Yan Huang, Wei Wang, Liang Wang

Super resolving a low-resolution video is usually handled by either single-image super-resolution (SR) or multi-frame SR. Single-Image SR deals with each video frame independently, and ignores intrinsic temporal dependency of video frames which actually plays a very important role in video super-resolution.

Ranked #18 on Video Super-Resolution on Vid4 - 4x upscaling

Multi-Frame Super-Resolution Optical Flow Estimation +2

Paper
Add Code

Aligning Infinite-Dimensional Covariance Matrices in Reproducing Kernel Hilbert Spaces for Domain Adaptation

no code implementations • CVPR 2018 • Zhen Zhang, Mianzhi Wang, Yan Huang, Arye Nehorai

Domain shift, which occurs when there is a mismatch between the distributions of training (source) and testing (target) datasets, usually results in poor performance of the trained model on the target domain.

Domain Adaptation

Paper
Add Code

M3: Multimodal Memory Modelling for Video Captioning

no code implementations • CVPR 2018 • Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan

Inspired by the facts that memory modelling poses potential advantages to long-term sequential problems [35] and working memory is the key factor of visual attention [33], we propose a Multimodal Memory Model (M3) to describe videos, which builds a visual and textual shared memory to model the long-term visual-textual dependency and further guide visual attention on described visual targets to solve visual-textual alignments.

Sentence Video Captioning

Paper
Add Code

Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking

1 code implementation • ECCV 2018 • Chenglong Li, Chengli Zhu, Yan Huang, Jin Tang, Liang Wang

To address this problem, this paper presents a novel approach to suppress background effects for RGB-T tracking.

Object Object Tracking +2

Paper
Code

Sparse Coding for Classification via Discrimination Ensemble

no code implementations • CVPR 2016 • Yuhui Quan, Yong Xu, Yuping Sun, Yan Huang, Hui Ji

Discriminative sparse coding has emerged as a promising technique in image analysis and recognition, which couples the process of classifier training and the process of dictionary learning for improving the discriminability of sparse codes.

Classification Dictionary Learning +1

Paper
Add Code

See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-Identification

no code implementations • CVPR 2017 • Zhen Zhou, Yan Huang, Wei Wang, Liang Wang, Tieniu Tan

Accordingly, a demanding need is to recognize a person under different cameras, which is called person re-identification.

Metric Learning Video-Based Person Re-Identification

Paper
Add Code

Dynamic Texture Recognition via Orthogonal Tensor Dictionary Learning

no code implementations • ICCV 2015 • Yuhui Quan, Yan Huang, Hui Ji

In addition, based on the proposed dictionary learning method, a DT descriptor is developed, which has better adaptivity, discriminability and scalability than the existing approaches.

Dictionary Learning Dynamic Texture Recognition

Paper
Add Code

Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning

no code implementations • ICCV 2015 • Yan Huang, Wei Wang, Liang Wang

Relation learning is a fundamental operation in many computer vision tasks.

Face Verification General Classification +2

Paper
Add Code

Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments

no code implementations • 23 Jun 2019 • Kai Niu, Yan Huang, Wanli Ouyang, Liang Wang

Firstly, the global-global alignment in the Global Contrast (GC) module is for matching the global contexts of images and descriptions.

Ranked #19 on Text based Person Retrieval on CUHK-PEDES

Person Re-Identification Text based Person Retrieval

Paper
Add Code

AIG Investments.AI at the FinSBD Task: Sentence Boundary Detection through Sequence Labelling and BERT Fine-tuning

no code implementations • WS 2019 • Jinhua Du, Yan Huang, Karo Moilanen

Boundary Detection Sentence

Paper
Add Code

Learning Compact Target-Oriented Feature Representations for Visual Tracking

no code implementations • 5 Aug 2019 • Chenglong Li, Yan Huang, Liang Wang, Jin Tang, Liang Lin

Many state-of-the-art trackers usually resort to the pretrained convolutional neural network (CNN) model for correlation filtering, in which deep features could usually be redundant, noisy and less discriminative for some certain instances, and the tracking performance might thus be affected.

Visual Tracking

Paper
Add Code

SBSGAN: Suppression of Inter-Domain Background Shift for Person Re-Identification

no code implementations • ICCV 2019 • Yan Huang, Qiang Wu, JingSong Xu, Yi Zhong

We observe that if backgrounds in the training and testing datasets are very different, it dramatically introduces difficulties to extract robust pedestrian features, and thus compromises the cross-domain person re-ID performance.

Generative Adversarial Network Person Re-Identification

Paper
Add Code

ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching

no code implementations • ICCV 2019 • Yan Huang, Liang Wang

Image and sentence matching has drawn much attention recently, but due to the lack of sufficient pairwise data for training, most previous methods still cannot well associate those challenging pairs of images and sentences containing rarely appeared regions and words, i. e., few-shot content.

Sentence

Paper
Add Code

Advances in Online Audio-Visual Meeting Transcription

no code implementations • 10 Dec 2019 • Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou

This increases marginally to 1. 6% when 50% of the attendees are unknown to the system.

speaker-diarization Speaker Diarization +2

Paper
Add Code

T-Drive: Driving Directions Based on Taxi Trajectories

no code implementations • ACM SIGSPATIAL GIS 2010 2010 • Jing Yuan, Yu Zheng, Chengyang Zhang, Wenlei Xie, Xing Xie, Guangzhong Sun, Yan Huang

GPS-equipped taxis can be regarded as mobile sensors probing traffic flows on road surfaces, and taxi drivers are usually experienced in finding the fastest (quickest) route to a destination based on their knowledge.

Clustering

Paper
Add Code

Large-scale Real-time Personalized Similar Product Recommendations

no code implementations • 12 Apr 2020 • Zhi Liu, Yan Huang, Jing Gao, Li Chen, Dong Li

Similar product recommendation is one of the most common scenes in e-commerce.

Collaborative Filtering Product Recommendation

Paper
Add Code

Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness

no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Zheng Zhang, Lizi Liao, Xiaoyan Zhu, Tat-Seng Chua, Zitao Liu, Yan Huang, Minlie Huang

Most existing approaches for goal-oriented dialogue policy learning used reinforcement learning, which focuses on the target agent policy and simply treat the opposite agent policy as part of the environment.

Decision Making

Paper
Add Code

L-Vector: Neural Label Embedding for Domain Adaptation

no code implementations • 25 Apr 2020 • Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee

We propose a novel neural label embedding (NLE) scheme for the domain adaptation of a deep neural network (DNN) acoustic model with unpaired data samples from source and target domains.

Domain Adaptation

Paper
Add Code

Crowd, Lending, Machine, and Bias

no code implementations • 20 Jul 2020 • Runshan Fu, Yan Huang, Param Vir Singh

We then use the machine to make investment decisions, and find that the machine benefits not only the lenders but also the borrowers.

Fairness

Paper
Add Code

Recurrent Deconvolutional Generative Adversarial Networks with Application to Text Guided Video Generation

no code implementations • 13 Aug 2020 • Hongyuan Yu, Yan Huang, Lihong Pi, Liang Wang

The RDN is a deconvolutional version of conventional recurrent neural network, which can well model the long-range temporal dependency of generated video frames and make good use of conditional information.

Generative Adversarial Network Video Classification +2

Paper
Add Code

Algorithmic Transparency with Strategic Users

no code implementations • 21 Aug 2020 • Qiaochu Wang, Yan Huang, Stefanus Jasin, Param Vir Singh

We show that, in some cases, even the predictive power of machine learning algorithms may increase if the firm makes them transparent.

BIG-bench Machine Learning Decision Making

Paper
Add Code

Towards Part-aware Monocular 3D Human Pose Estimation: An Architecture Search Approach

no code implementations • ECCV 2020 • Zerui Chen, Yan Huang, Hongyuan Yu, Bin Xue, Ke Han, Yiru Guo, Liang Wang

With roughly the same computational complexity as previous models, our approach achieves state-of-the-art results on both the single-person and multi-person 3D pose estimation benchmarks.

3D Pose Estimation Monocular 3D Human Pose Estimation

Paper
Add Code

Prediction and Recovery for Adaptive Low-Resolution Person Re-Identification

no code implementations • ECCV 2020 • Ke Han, Yan Huang, Zerui Chen, Liang Wang, Tieniu Tan

In this paper, we propose a novel Prediction, Recovery and Identification (PRI) model for LR re-id, which adaptively recovers missing details by predicting a preferable scale factor based on the image content.

Person Re-Identification Super-Resolution

Paper
Add Code

Actor and Action Modular Network for Text-based Video Segmentation

no code implementations • 2 Nov 2020 • Jianhua Yang, Yan Huang, Kai Niu, Linjiang Huang, Zhanyu Ma, Liang Wang

Previous methods fail to explicitly align the video content with the textual query in a fine-grained manner according to the actor and its action, due to the problem of \emph{semantic asymmetry}.

Ranked #9 on Referring Expression Segmentation on J-HMDB

Action Segmentation Action Understanding +5

Paper
Add Code

Collaborative City Digital Twin For Covid-19 Pandemic: A Federated Learning Solution

no code implementations • 5 Nov 2020 • Junjie Pang, Jianbo Li, Zhenzhen Xie, Yan Huang, Zhipeng Cai

In this work, we propose a collaborative city digital twin based on FL, a novel paradigm that allowing multiple city DT to share the local strategy and status in a timely manner.

Federated Learning Management

Paper
Add Code

Observation of Magnetic Droplets in Magnetic Tunnel Junctions

no code implementations • 10 Dec 2020 • Kewen Shi, Wenlong Cai, Sheng Jiang, Daoqian Zhu, Kaihua Cao, Zongxia Guo, Jiaqi Wei, Ao Du, Zhi Li, Yan Huang, Jialiang Yin, Johan Akerman, Weisheng Zhao

Magnetic droplets, a class of highly non-linear magnetodynamical solitons, can be nucleated and stabilized in nanocontact spin-torque nano-oscillators where they greatly increase the microwave output power.

Applied Physics

Paper
Add Code

Efficient Human Pose Estimation by Learning Deeply Aggregated Representations

no code implementations • 13 Dec 2020 • Zhengxiong Luo, Zhicheng Wang, Yuanhao Cai, GuanAn Wang, Yan Huang, Liang Wang, Erjin Zhou, Tieniu Tan, Jian Sun

Instead, we focus on exploiting multi-scale information from layers with different receptive-field sizes and then making full of use this information by improving the fusion method.

Pose Estimation

Paper
Add Code

FWB-Net:Front White Balance Network for Color Shift Correction in Single Image Dehazing via Atmospheric Light Estimation

no code implementations • 21 Jan 2021 • Cong Wang, Yan Huang, Yuexian Zou, Yong Xu

However, for images taken in real-world, the illumination is not uniformly distributed over whole image which brings model mismatch and possibly results in color shift of the deep models using ASM.

Image Dehazing Single Image Dehazing

Paper
Add Code

FDAN: Flow-guided Deformable Alignment Network for Video Super-Resolution

no code implementations • 12 May 2021 • Jiayi Lin, Yan Huang, Liang Wang

Recently, deformable alignment has drawn extensive attention in VSR community for its remarkable performance, which can adaptively align neighboring frames with the reference one.

Optical Flow Estimation Video Super-Resolution

Paper
Add Code

Hate Speech Detection in Clubhouse

no code implementations • 24 Jun 2021 • Hadi Mansourifar, Dana Alsagheer, Reza Fathi, Weidong Shi, Lan Ni, Yan Huang

This makes the hate speech detection challenging in new social media like Clubhouse.

Hate Speech Detection

Paper
Add Code

Statistical Analysis of Perspective Scores on Hate Speech Detection

no code implementations • 22 Jun 2021 • Hadi Mansourifar, Dana Alsagheer, Weidong Shi, Lan Ni, Yan Huang

It has proven that, state-of-the-art hate speech classifiers are efficient only when tested on the data with the same feature distribution as training data.

Hate Speech Detection

Paper
Add Code

Adaptive Dilated Convolution For Human Pose Estimation

no code implementations • 22 Jul 2021 • Zhengxiong Luo, Zhicheng Wang, Yan Huang, Liang Wang, Tieniu Tan, Erjin Zhou

It can generate and fuse multi-scale features of the same spatial sizes by setting different dilation rates for different channels.

Pose Estimation

Paper
Add Code

Fully Non-Homogeneous Atmospheric Scattering Modeling with Convolutional Neural Networks for Single Image Dehazing

no code implementations • 25 Aug 2021 • Cong Wang, Yan Huang, Yuexian Zou, Yong Xu

However, it is noted that ASM-based SIDM degrades its performance in dehazing real world hazy images due to the limited modelling ability of ASM where the atmospheric light factor (ALF) and the angular scattering coefficient (ASC) are assumed as constants for one image.

Image Dehazing Single Image Dehazing

Paper
Add Code

Pointing to Select: A Fast Pointer-LSTM for Long Text Classification

no code implementations • COLING 2020 • Jinhua Du, Yan Huang, Karo Moilanen

Recurrent neural networks (RNNs) suffer from well-known limitations and complications which include slow inference and vanishing gradients when processing long sequences in text classification.

text-classification Text Classification

Paper
Add Code

Clothing Status Awareness for Long-Term Person Re-Identification

no code implementations • ICCV 2021 • Yan Huang, Qiang Wu, Jingsong Xu, Yi Zhong, Zhaoxiang Zhang

This work argues that these approaches in fact are not aware of clothing status (i. e., change or no-change) of a pedestrian.

Person Re-Identification

Paper
Add Code

AI in Human-computer Gaming: Techniques, Challenges and Opportunities

no code implementations • 15 Nov 2021 • Qiyue Yin, Jun Yang, Kaiqi Huang, Meijing Zhao, Wancheng Ni, Bin Liang, Yan Huang, Shu Wu, Liang Wang

Through this survey, we 1) compare the main difficulties among different kinds of games and the corresponding techniques utilized for achieving professional human level AIs; 2) summarize the mainstream frameworks and techniques that can be properly relied on for developing AIs for complex human-computer gaming; 3) raise the challenges or drawbacks of current techniques in the successful AIs; and 4) try to point out future trends in human-computer gaming AIs.

Decision Making

Paper
Add Code

Uncovering the Source of Machine Bias

no code implementations • 9 Jan 2022 • Xiyang Hu, Yan Huang, Beibei Li, Tian Lu

We find two types of biases in gender, preference-based bias and belief-based bias, are present in human evaluators' decisions.

counterfactual

Paper
Add Code

Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption

no code implementations • 1 Mar 2022 • Ke Han, Chenyang Si, Yan Huang, Liang Wang, Tieniu Tan

In this paper, we investigate the generalization problem of person re-identification (re-id), whose major challenge is the distribution shift on an unseen domain.

Generalizable Person Re-identification

Paper
Add Code

Generative Compression for Face Video: A Hybrid Scheme

no code implementations • 21 Apr 2022 • Anni Tang, Yan Huang, Jun Ling, ZhiYu Zhang, Yiwei Zhang, Rong Xie, Li Song

As the latest video coding standard, versatile video coding (VVC) has shown its ability in retaining pixel quality.

Paper
Add Code

A Closer Look at Personalization in Federated Image Classification

no code implementations • 22 Apr 2022 • Changxing Jing, Yan Huang, Yihong Zhuang, Liyan Sun, Yue Huang, Zhenlong Xiao, Xinghao Ding

This paper shows that it is possible to achieve flexible personalization after the convergence of the global model by introducing representation learning.

Classification Edge-computing +3

Paper
Add Code

Intra Encoding Complexity Control with a Time-Cost Model for Versatile Video Coding

no code implementations • 13 Jun 2022 • Yan Huang, Jizheng Xu, Li Zhang, Yan Zhao, Li Song

Inspired by rate control algorithms, we propose a scheme to precisely control the intra encoding complexity of VVC.

Paper
Add Code

Tackling Data Heterogeneity: A New Unified Framework for Decentralized SGD with Sample-induced Topology

no code implementations • 8 Jul 2022 • Yan Huang, Ying Sun, Zehan Zhu, Changzhi Yan, Jinming Xu

We develop a general framework unifying several gradient-based stochastic optimization methods for empirical risk minimization problems both in centralized and distributed scenarios.

Stochastic Optimization

Paper
Add Code

Neural-FacTOR: Neural Representation Learning for Website Fingerprinting Attack over TOR Anonymity

no code implementations • 26 Sep 2022 • Haili Sun, Yan Huang, Lansheng Han, Xiang Long, Hongle Liu, Chunjie Zhou

TOR (The Onion Router) network is a widely used open source anonymous communication tool, the abuse of TOR makes it difficult to monitor the proliferation of online crimes such as to access criminal websites.

Representation Learning

Paper
Add Code

CNTN: Cyclic Noise-tolerant Network for Gait Recognition

no code implementations • 13 Oct 2022 • Weichen Yu, Hongyuan Yu, Yan Huang, Chunshui Cao, Liang Wang

Gait recognition aims to identify individuals by recognizing their walking patterns.

Gait Recognition Memorization

Paper
Add Code

Generalized Inter-class Loss for Gait Recognition

no code implementations • 13 Oct 2022 • Weichen Yu, Hongyuan Yu, Yan Huang, Liang Wang

The proposed method can be generalized to different gait recognition networks and achieves significant improvements.

Gait Recognition

Paper
Add Code

CFNet: Conditional Filter Learning with Dynamic Noise Estimation for Real Image Denoising

no code implementations • 26 Nov 2022 • Yifan Zuo, Jiacheng Xie, Yuming Fang, Yan Huang, Wenhui Jiang

A mainstream type of the state of the arts (SOTAs) based on convolutional neural network (CNN) for real image denoising contains two sub-problems, i. e., noise estimation and non-blind denoising.

Image Denoising Noise Estimation

Paper
Add Code

Few-shot Detection of Anomalies in Industrial Cyber-Physical System via Prototypical Network and Contrastive Learning

no code implementations • 21 Feb 2023 • Haili Sun, Yan Huang, Lansheng Han, Chunjie Zhou

The rapid development of Industry 4. 0 has amplified the scope and destructiveness of industrial Cyber-Physical System (CPS) by network attacks.

Anomaly Detection Contrastive Learning

Paper
Add Code

PlanarTrack: A Large-scale Challenging Benchmark for Planar Object Tracking

no code implementations • ICCV 2023 • Xinran Liu, Xiaoqiong Liu, Ziruo Yi, Xin Zhou, Thanh Le, Libo Zhang, Yan Huang, Qing Yang, Heng Fan

In addition, we further derive a variant named PlanarTrack$_{\mathbf{BB}}$ for generic object tracking from PlanarTrack.

Object Tracking

Paper
Add Code

Inclusive FinTech Lending via Contrastive Learning and Domain Adaptation

no code implementations • 10 May 2023 • Xiyang Hu, Yan Huang, Beibei Li, Tian Lu

We use contrastive learning to train our feature extractor on unapproved (unlabeled) loan applications and use domain adaptation to generalize the performance of our label predictor.

Contrastive Learning Decision Making +1

Paper
Add Code

Clothing-Change Feature Augmentation for Person Re-Identification

no code implementations • CVPR 2023 • Ke Han, Shaogang Gong, Yan Huang, Liang Wang, Tieniu Tan

Specifically, to formulate meaningful clothing variations in the feature space, our method first estimates a clothing-change normal distribution with intra-ID cross-clothing variances.

Person Re-Identification

Paper
Add Code

A Unified Framework to Super-Resolve Face Images of Varied Low Resolutions

no code implementations • 6 Jun 2023 • Qiuyu Peng, Zifei Jiang, Yan Huang, Jingliang Peng

By contrast, we explore in this work a unified framework that is trained once and then used to super-resolve input face images of varied low resolutions.

Image Super-Resolution

Paper
Add Code

On the Computation-Communication Trade-Off with A Flexible Gradient Tracking Approach

no code implementations • 12 Jun 2023 • Yan Huang, Jinming Xu

We propose a flexible gradient tracking approach with adjustable computation and communication steps for solving distributed stochastic optimization problem over networks.

Stochastic Optimization

Paper
Add Code

A 3D grain-based reconstruction method from a 2D surface image for the Distinct Lattice Spring Model

no code implementations • Numerical and Analytical Methods in Geomechanics 2023 • Xin-DongWei, Zhi-Qiang Deng, Qin Li, Yan Huang, Gao-Feng Zhao

The 3D GBM reconstruction is generated by a simulated annealing algorithm, with the Monte Carlo algorithm extending the calculation of the two-point probability function as the target function to the random particle model.

Computational Efficiency

Paper
Add Code

Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models

no code implementations • 20 Jul 2023 • Somayeh Ghanbarzadeh, Yan Huang, Hamid Palangi, Radames Cruz Moreno, Hamed Khanpour

Recent studies have revealed that the widely-used Pre-trained Language Models (PLMs) propagate societal biases from the large unmoderated pre-training corpora.

Language Modelling Masked Language Modeling

Paper
Add Code

Improving the Reusability of Pre-trained Language Models in Real-world Applications

no code implementations • 19 Jul 2023 • Somayeh Ghanbarzadeh, Hamid Palangi, Yan Huang, Radames Cruz Moreno, Hamed Khanpour

The reusability of state-of-the-art Pre-trained Language Models (PLMs) is often limited by their generalization problem, where their performance drastically decreases when evaluated on examples that differ from the training dataset, known as Out-of-Distribution (OOD)/unseen examples.

Language Modelling Masked Language Modeling

Paper
Add Code

Robust Fully-Asynchronous Methods for Distributed Training over General Architecture

no code implementations • 21 Jul 2023 • Zehan Zhu, Ye Tian, Yan Huang, Jinming Xu, Shibo He

Perfect synchronization in distributed machine learning problems is inefficient and even impossible due to the existence of latency, package losses and stragglers.

Paper
Add Code

Trajectory Generation and Tracking based on Energy Minimization for a Four-Link Brachiation Robot

no code implementations • 11 Aug 2023 • Zishang Ji, Xuanyu Zhang, Xuanzhe Wang, Yan Huang

Aiming to mimic the brachiation locomotion of primates, we establish a brachiation robot model capable of swinging between different bars.

Model Predictive Control

Paper
Add Code

Free Lunch for Gait Recognition: A Novel Relation Descriptor

no code implementations • 22 Aug 2023 • Jilong Wang, Saihui Hou, Yan Huang, Chunshui Cao, Xu Liu, Yongzhen Huang, Tianzhu Zhang, Liang Wang

Gait recognition is to seek correct matches for query individuals by their unique walking patterns.

Dimensionality Reduction Gait Recognition +1

Paper
Add Code

Neural Network-Based Histologic Remission Prediction In Ulcerative Colitis

no code implementations • 28 Aug 2023 • Yemin li, Zhongcheng Liu, Xiaoying Lou, Mirigual Kurban, Miao Li, Jie Yang, Kaiwei Che, Jiankun Wang, Max Q. -H Meng, Yan Huang, Qin Guo, Pinjin Hu

A total of 5105 images of 154 intestinal segments from 87 patients undergoing EC treatment at a center in China between March 2022 and March 2023 are scored according to the Geboes score.

Specificity

Paper
Add Code

MTS-DVGAN: Anomaly Detection in Cyber-Physical Systems using a Dual Variational Generative Adversarial Network

no code implementations • 4 Nov 2023 • Haili Sun, Yan Huang, Lansheng Han, Cai Fu, Hongle Liu, Xiang Long

Then, by exploiting the distribution property and modeling the normal patterns of multivariate time series, a variational autoencoder is introduced to force the generative adversarial network (GAN) to generate diverse samples.

Anomaly Detection Generative Adversarial Network +1

Paper
Add Code

Joint Design of ISAC Waveform under PAPR Constraints

no code implementations • 20 Nov 2023 • Yating Chen, Cai Wen, Yan Huang, Le Liang, Jie Li, HUI ZHANG, Wei Hong

In this paper, we formulate the precoding problem of integrated sensing and communication (ISAC) waveform as a non-convex quadratically constrainted quadratic program (QCQP), in which the weighted sum of communication multi-user interference (MUI) and the gap between dual-use waveform and ideal radar waveform is minimized with peak-to-average power ratio (PAPR) constraints.

Paper
Add Code

TDeLTA: A Light-weight and Robust Table Detection Method based on Learning Text Arrangement

no code implementations • 18 Dec 2023 • Yang Fan, XiangPing Wu, Qingcai Chen, Heng Li, Yan Huang, Zhixiang Cai, Qitian Wu

The diversity of tables makes table detection a great challenge, leading to existing models becoming more tedious and complex.

Optical Character Recognition (OCR) Table Detection +2

Paper
Add Code

Unsupervised Spatio-Temporal State Estimation for Fine-grained Adaptive Anomaly Diagnosis of Industrial Cyber-physical Systems

no code implementations • 5 Mar 2024 • Haili Sun, Yan Huang, Lansheng Han, Cai Fu, Chunjie Zhou

Subsequently, based on these two types of state matrices, a three-branch structure of series-temporal-spatial attention module is designed to simultaneously capture the series, temporal, and space dependencies among MTS.

Paper
Add Code

HCL-MTSAD: Hierarchical Contrastive Consistency Learning for Accurate Detection of Industrial Multivariate Time Series Anomalies

no code implementations • 12 Apr 2024 • Haili Sun, Yan Huang, Lansheng Han, Cai Fu, Chunjie Zhou

To address this issue, we propose a novel self-supervised hierarchical contrastive consistency learning method for detecting anomalies in MTS, named HCL-MTSAD.

Anomaly Detection Contrastive Learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.