Search Results for author: Ming Yang

Found 69 papers, 27 papers with code

BSN: Boundary Sensitive Network for Temporal Action Proposal Generation

17 code implementations ECCV 2018 Tianwei Lin, Xu Zhao, Haisheng Su, Chongjing Wang, Ming Yang

Temporal action proposal generation is an important yet challenging problem, since temporal proposals with rich action content are indispensable for analysing real-world videos with long duration and high proportion irrelevant content.

Action Detection Temporal Action Proposal Generation

Instance-level Human Parsing via Part Grouping Network

1 code implementation ECCV 2018 Ke Gong, Xiaodan Liang, Yicheng Li, Yimin Chen, Ming Yang, Liang Lin

Instance-level human parsing towards real-world human analysis scenarios is still under-explored due to the absence of sufficient data resources and technical difficulty in parsing multiple instances in a single pass.

Edge Detection Human Parsing +2

Bi-Directional Cascade Network for Perceptual Edge Detection

2 code implementations CVPR 2019 Jianzhong He, Shiliang Zhang, Ming Yang, Yanhu Shan, Tiejun Huang

Exploiting multi-scale representations is critical to improve edge detection for objects at different scales.

Edge Detection

Attention Guided Network for Retinal Image Segmentation

2 code implementations25 Jul 2019 Shihao Zhang, Huazhu Fu, Yuguang Yan, Yubing Zhang, Qingyao Wu, Ming Yang, Mingkui Tan, Yanwu Xu

Learning structural information is critical for producing an ideal result in retinal image segmentation.

Image Segmentation Segmentation +1

Tactics2D: A Reinforcement Learning Environment Library with Generative Scenarios for Driving Decision-making

2 code implementations18 Nov 2023 Yueyuan Li, Songan Zhang, Mingyang Jiang, Xingyuan Chen, Ming Yang

For access to the source code and participation in discussions, visit the official GitHub page for Tactcis2D at https://github. com/WoodOxen/Tactics2D.

Autonomous Driving Decision Making +3

Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

1 code implementation CVPR 2021 Bowen Cheng, Lu Sheng, Shaoshuai Shi, Ming Yang, Dong Xu

Inspired by the back-tracing strategy in the conventional Hough voting methods, in this work, we introduce a new 3D object detection method, named as Back-tracing Representative Points Network (BRNet), which generatively back-traces the representative points from the vote centers and also revisits complementary seed points around these generated points, so as to better capture the fine local structural features surrounding the potential objects from the raw point clouds.

3D Object Detection Object +1

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

1 code implementation27 Feb 2024 ZiCheng Zhang, Ruobing Zheng, Ziwen Liu, Congying Han, Tianqi Li, Meng Wang, Tiande Guo, Jingdong Chen, Bonan Li, Ming Yang

Recent works in implicit representations, such as Neural Radiance Fields (NeRF), have advanced the generation of realistic and animatable head avatars from video sequences.

Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs

2 code implementations1 Oct 2023 Shiyu Xuan, Qingpei Guo, Ming Yang, Shiliang Zhang

Specifically, we present a new method for constructing the instruction tuning dataset at a low cost by leveraging annotations in existing datasets.

Referring Expression

EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints

1 code implementation21 Aug 2023 Yutao Chen, Xingning Dong, Tian Gan, Chunluan Zhou, Ming Yang, Qingpei Guo

Compared with images, we conjecture that videos necessitate more constraints to preserve the temporal consistency during editing.

Video Editing

Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning

1 code implementation20 Sep 2023 Chen Jiang, Hong Liu, Xuzheng Yu, Qing Wang, Yuan Cheng, Jia Xu, Zhongyi Liu, Qingpei Guo, Wei Chu, Ming Yang, Yuan Qi

We thereby present a new Triplet Partial Margin Contrastive Learning (TPM-CL) module to construct partial order triplet samples by automatically generating fine-grained hard negatives for matched text-video pairs.

Contrastive Learning Retrieval +3

M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval

1 code implementation31 Jan 2024 Xingning Dong, Zipeng Feng, Chunluan Zhou, Xuzheng Yu, Ming Yang, Qingpei Guo

We then summarize this empirical study into the M2-RAAP recipe, where our technical contributions lie in 1) the data filtering and text re-writing pipeline resulting in 1M high-quality bilingual video-text pairs, 2) the replacement of video inputs with key-frames to accelerate pre-training, and 3) the Auxiliary-Caption-Guided (ACG) strategy to enhance video features.

Retrieval Text Retrieval +1

Resolution-invariant Person Re-Identification

1 code implementation24 Jun 2019 Shunan Mao, Shiliang Zhang, Ming Yang

RIFE adopts two feature extraction streams weighted by a dual-attention block to learn features for low and high resolution images, respectively.

Person Re-Identification Super-Resolution

SSAP: Single-Shot Instance Segmentation With Affinity Pyramid

2 code implementations ICCV 2019 Naiyu Gao, Yanhu Shan, Yupei Wang, Xin Zhao, Yinan Yu, Ming Yang, Kaiqi Huang

Moreover, incorporating with the learned affinity pyramid, a novel cascaded graph partition module is presented to sequentially generate instances from coarse to fine.

Instance Segmentation Segmentation +1

Recall and Learn: A Memory-augmented Solver for Math Word Problems

1 code implementation Findings (EMNLP) 2021 Shifeng Huang, Jiawei Wang, Jiao Xu, Da Cao, Ming Yang

Specifically, given a math word problem, the model first retrieves similar questions by a memory module and then encodes the unsolved problem and each retrieved question using a representation module.

Math Math Word Problem Solving

Threshold-adaptive Unsupervised Focal Loss for Domain Adaptation of Semantic Segmentation

1 code implementation23 Aug 2022 Weihao Yan, Yeqiang Qian, Chunxiang Wang, Ming Yang

In stage one, we design a threshold-adaptative unsupervised focal loss to regularize the prediction in the target domain, which has a mild gradient neutralization mechanism and mitigates the problem that hard samples are barely optimized in entropy-based methods.

Data Augmentation Segmentation +2

Efficient Generalization Improvement Guided by Random Weight Perturbation

1 code implementation21 Nov 2022 Tao Li, Weihao Yan, Zehao Lei, Yingwen Wu, Kun Fang, Ming Yang, Xiaolin Huang

To fully uncover the great potential of deep neural networks (DNNs), various learning algorithms have been developed to improve the model's generalization ability.

Monocular Pedestrian Orientation Estimation Based on Deep 2D-3D Feedforward

1 code implementation24 Sep 2019 Chenchen Zhao, Yeqiang Qian, Ming Yang

The 2D and 3D dimensions of pedestrians are determined from the camera captures and further utilized through two feedforward links connected to the orientation estimator.

Autonomous Driving Collision Avoidance

Top-N Recommendation on Graphs

1 code implementation27 Sep 2016 Zhao Kang, Chong Peng, Ming Yang, Qiang Cheng

To alleviate this problem, this paper proposes a simple recommendation algorithm that fully exploits the similarity information among users and items and intrinsic structural information of the user-item matrix.

Collaborative Filtering Recommendation Systems

Tensor Robust PCA with Nonconvex and Nonlocal Regularization

1 code implementation4 Nov 2022 Xiaoyu Geng, Qiang Guo, Shuaixiong Hui, Ming Yang, Caiming Zhang

To this end, we integrate nonlocal self-similarity into N-TRPCA, and further develop a nonconvex and nonlocal TRPCA (NN-TRPCA) model.

Probabilistic Latent Factor Model for Collaborative Filtering with Bayesian Inference

1 code implementation7 Dec 2020 Jiansheng Fang, Xiaoqing Zhang, Yan Hu, Yanwu Xu, Ming Yang, Jiang Liu

Latent Factor Model (LFM) is one of the most successful methods for Collaborative filtering (CF) in the recommendation system, in which both users and items are projected into a joint latent factor space.

Bayesian Inference Collaborative Filtering +1

Deep View Synthesis via Self-Consistent Generative Network

1 code implementation19 Jan 2021 Zhuoman Liu, Wei Jia, Ming Yang, Peiyao Luo, Yong Guo, Mingkui Tan

To address the above issues, in this paper, we propose a novel deep generative model, called Self-Consistent Generative Network (SCGN), which synthesizes novel views from the given input views without explicitly exploiting the geometric information.

Group-based Interleaved Pipeline Parallelism for Large-scale DNN Training

1 code implementation ICLR 2022 Pengcheng Yang, XiaoMing Zhang, Wenpeng Zhang, Ming Yang, Hong Wei

The recent trend of using large-scale deep neural networks (DNN) to boost performance has propelled the development of the parallel pipelining technique for efficient DNN training, which has resulted in the development of several prominent pipelines such as GPipe, PipeDream, and PipeDream-2BW.

Cross-to-merge training with class balance strategy for learning with noisy labels

1 code implementation Expert Systems with Applications 2024 Qian Zhang, Yi Zhu, Ming Yang, Ge Jin, YingWen Zhu, Qiu Chen

Although sample selection is a mainstream method in the field of learning with noisy labels, which aims to mitigate the impact of noisy labels during model training, the testing performance of these methods exhibits significant fluctuations across different noise rates and types.

Learning with noisy labels

Restricted Deformable Convolution based Road Scene Semantic Segmentation Using Surround View Cameras

no code implementations2 Jan 2018 Liuyuan Deng, Ming Yang, Hao Li, Tianyi Li, Bing Hu, Chunxiang Wang

Finally, an RDC based semantic segmentation model is built; the model is trained for real-world surround view images through a multi-task learning architecture by combining real-world images with transformed images.

Autonomous Driving Multi-Task Learning +2

Joint Calibration of Panoramic Camera and Lidar Based on Supervised Learning

no code implementations9 Sep 2017 Mingwei Cao, Ming Yang, Chunxiang Wang, Yeqiang Qian, Bing Wang

In view of contemporary panoramic camera-laser scanner system, the traditional calibration method is not suitable for panoramic cameras whose imaging model is extremely nonlinear.

Translation

A Survey of Multi-View Representation Learning

no code implementations3 Oct 2016 Yingming Li, Ming Yang, Zhongfei Zhang

Consequently, we first review the representative methods and theories of multi-view representation learning based on the perspective of alignment, such as correlation-based alignment.

Representation Learning

A Multi-model Combination Approach for Probabilistic Wind Power Forecasting

no code implementations13 Feb 2017 You Lin, Ming Yang, Can Wan, Jianhui Wang, Yonghua Song

Therefore, a novel multi-model combination (MMC) approach for short-term probabilistic wind generation forecasting is proposed in this paper to exploit the advantages of different forecasting models.

Density Estimation

Web-Scale Training for Face Identification

no code implementations CVPR 2015 Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf

Scaling machine learning methods to very large datasets has attracted considerable attention in recent years, thanks to easy access to ubiquitous sensing and data from the web.

Face Identification Face Recognition +1

Compressing Deep Convolutional Networks using Vector Quantization

no code implementations18 Dec 2014 Yunchao Gong, Liu Liu, Ming Yang, Lubomir Bourdev

In this paper, we tackle this model storage issue by investigating information theoretical vector quantization methods for compressing the parameters of CNNs.

Classification Clustering +6

Conditional Generative Adversarial Network for Structured Domain Adaptation

no code implementations CVPR 2018 Weixiang Hong, Zhenzhen Wang, Ming Yang, Junsong Yuan

In recent years, deep neural nets have triumphed over many computer vision problems, including semantic segmentation, which is a critical task in emerging autonomous driving and medical image diagnostics applications.

Autonomous Driving Domain Adaptation +2

Deep Reinforcement Learning with Iterative Shift for Visual Tracking

no code implementations ECCV 2018 Liangliang Ren, Xin Yuan, Jiwen Lu, Ming Yang, Jie Zhou

Visual tracking is confronted by the dilemma to locate a target both}accurately and efficiently, and make decisions online whether and how to adapt the appearance model or even restart tracking.

Motion Estimation Object +4

Generating Synthesized Computed Tomography (CT) from Cone-Beam Computed Tomography (CBCT) using CycleGAN for Adaptive Radiation Therapy

no code implementations31 Oct 2018 Xiao Liang, Liyuan Chen, Dan Nguyen, Zhiguo Zhou, Xuejun Gu, Ming Yang, Jing Wang, Steve Jiang

Dose calculation accuracy using sCT images has been improved over the original CBCT images, with the average Gamma Index passing rate increased from 95. 4% to 97. 4% for 1 mm/1% criteria.

Medical Physics

A Novel Demodulation and Estimation Algorithm for Blackout Communication: Extract Principal Components with Deep Learning

no code implementations27 May 2019 Haoyan Liu, Yanming Liu, Ming Yang, Xiaoping Li

For reentry or near space communication, owing to the influence of the time-varying plasma sheath channel environment, the received IQ baseband signals are severely rotated on the constellation.

RFBNet: Deep Multimodal Networks with Residual Fusion Blocks for RGB-D Semantic Segmentation

no code implementations29 Jun 2019 Liuyuan Deng, Ming Yang, Tianyi Li, Yuesheng He, Chunxiang Wang

To instantiate this structure, the paper proposes a residual fusion block (RFB) to formulate the interdependences of the encoders.

Semantic Segmentation

Map-Enhanced Ego-Lane Detection in the Missing Feature Scenarios

no code implementations2 Apr 2020 Xiaoliang Wang, Yeqiang Qian, Chunxiang Wang, Ming Yang

As one of the most important tasks in autonomous driving systems, ego-lane detection has been extensively studied and has achieved impressive results in many scenarios.

Autonomous Driving Lane Detection

Class Distribution Alignment for Adversarial Domain Adaptation

no code implementations20 Apr 2020 Wanqi Yang, Tong Ling, Chengmei Yang, Lei Wang, Yinghuan Shi, Luping Zhou, Ming Yang

To address this issue, we propose a novel approach called Conditional ADversarial Image Translation (CADIT) to explicitly align the class distributions given samples between the two domains.

General Classification Translation +1

MAFF-Net: Filter False Positive for 3D Vehicle Detection with Multi-modal Adaptive Feature Fusion

no code implementations23 Sep 2020 Zehan Zhang, Ming Zhang, Zhidong Liang, Xian Zhao, Ming Yang, Wenming Tan, ShiLiang Pu

Experimental results on the KITTI dataset demonstrate significant improvement in filtering false positive over the approach using only point cloud data.

Autonomous Driving

The Period-Luminosity Relations of Red Supergiants in M33 and M31

no code implementations20 Feb 2019 Yi Ren, B. W. Jiang, Ming Yang, Jian Gao

The period-luminosity (P-L) relation is analyzed for the RSGs in the fundamental mode.

Solar and Stellar Astrophysics Astrophysics of Galaxies

Evolved Massive Stars at Low-metallicity IV. Using 1.6 $μ$m "H-bump" to identify red supergiant stars: a case study of NGC 6822

no code implementations21 Jan 2021 Ming Yang, Alceste Z. Bonanos, Biwei Jiang, Man I Lam, Jian Gao, Panagiotis Gavras, Grigoris Maravelias, Shu Wang, Xiao-Dian Chen, Frank Tramper, Yi Ren, Zoi T. Spetsieri

Further separating RSG candidates from the rest of the LSG candidates is done by using semi-empirical criteria on NIR CMDs and resulted in 323 RSG candidates.

Solar and Stellar Astrophysics Astrophysics of Galaxies

Momentum Accelerates the Convergence of Stochastic AUPRC Maximization

no code implementations2 Jul 2021 Guanghui Wang, Ming Yang, Lijun Zhang, Tianbao Yang

In this paper, we further improve the stochastic optimization of AURPC by (i) developing novel stochastic momentum methods with a better iteration complexity of $O(1/\epsilon^4)$ for finding an $\epsilon$-stationary solution; and (ii) designing a novel family of stochastic adaptive methods with the same iteration complexity, which enjoy faster convergence in practice.

imbalanced classification Stochastic Optimization

Few-shot Unsupervised Domain Adaptation with Image-to-class Sparse Similarity Encoding

no code implementations6 Aug 2021 Shengqi Huang, Wanqi Yang, Lei Wang, Luping Zhou, Ming Yang

Inspired by the recent local descriptor based few-shot learning (FSL), our general UDA model is fully built upon local descriptors (LDs) for image classification and domain adaptation.

Few-Shot Learning Image Classification +1

Stacked Homography Transformations for Multi-View Pedestrian Detection

no code implementations ICCV 2021 Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan

This task is confronted with two challenges: how to establish the 3D correspondences from views to the BEV map and how to assemble occupancy information across views.

Multiview Detection Pedestrian Detection

Self-supervised Contrastive Attributed Graph Clustering

no code implementations15 Oct 2021 Wei Xia, Quanxue Gao, Ming Yang, Xinbo Gao

Thus, for the OOS nodes, SCAGC can directly calculate their clustering labels.

Attribute Clustering +3

BAANet: Learning Bi-directional Adaptive Attention Gates for Multispectral Pedestrian Detection

no code implementations4 Dec 2021 Xiaoxiao Yang, Yeqian Qiang, Huijie Zhu, Chunxiang Wang, Ming Yang

Thermal infrared (TIR) image has proven effectiveness in providing temperature cues to the RGB features for multispectral pedestrian detection.

Pedestrian Detection Specificity

SUNet: Scale-aware Unified Network for Panoptic Segmentation

no code implementations7 Sep 2022 Weihao Yan, Yeqiang Qian, Chunxiang Wang, Ming Yang

Panoptic segmentation combines the advantages of semantic and instance segmentation, which can provide both pixel-level and instance-level environmental perception information for intelligent vehicles.

Instance Segmentation Panoptic Segmentation +1

Double-Ended Palindromic Trees: A Linear-Time Data Structure and Its Applications

no code implementations5 Oct 2022 Qisheng Wang, Ming Yang, Xinrui Zhu

eertree) is a linear-size data structure that provides access to all palindromic substrings of a string.

FedSiam-DA: Dual-aggregated Federated Learning via Siamese Network under Non-IID Data

no code implementations17 Nov 2022 Ming Yang, Yanhan Wang, Xin Wang, Zhenyong Zhang, Xiaoming Wu, Peng Cheng

Federated learning is a distributed learning that allows each client to keep the original data locally and only upload the parameters of the local model to the server.

Contrastive Learning Federated Learning

High-level semantic feature matters few-shot unsupervised domain adaptation

no code implementations5 Jan 2023 Lei Yu, Wanqi Yang, Shengqi Huang, Lei Wang, Ming Yang

However, the goal of FS-UDA and FSL are relevant yet distinct, since FS-UDA aims to classify the samples in target domain rather than source domain.

Few-Shot Learning Unsupervised Domain Adaptation +1

Spatial Attention and Syntax Rule Enhanced Tree Decoder for Offine Handwritten Mathematical Expression Recognition

no code implementations13 Mar 2023 Zihao Lin, Jinrong Li, Fan Yang, Shuangping Huang, Xu Yang, Jianmin Lin, Ming Yang

In this paper, we propose a novel model called Spatial Attention and Syntax Rule Enhanced Tree Decoder (SS-TD), which is equipped with spatial attention mechanism to alleviate the prediction error of tree structure and use syntax masks (obtained from the transformation of syntax rules) to constrain the occurrence of ungrammatical mathematical expression.

Low-Light Image Enhancement by Learning Contrastive Representations in Spatial and Frequency Domains

no code implementations23 Mar 2023 Yi Huang, Xiaoguang Tu, Gui Fu, Tingting Liu, Bokai Liu, Ming Yang, Ziliang Feng

Images taken under low-light conditions tend to suffer from poor visibility, which can decrease image quality and even reduce the performance of the downstream tasks.

Contrastive Learning Low-Light Image Enhancement

Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms

no code implementations7 Jul 2023 Ming Yang, Xiyuan Wei, Tianbao Yang, Yiming Ying

Then, we establish the compositional uniform stability results for two popular stochastic compositional gradient descent algorithms, namely SCGD and SCSC.

Learning Theory Meta-Learning

Evolutionary Alternating Direction Method of Multipliers for Constrained Multi-Objective Optimization with Unknown Constraints

no code implementations2 Jan 2024 Shuang Li, Ke Li, Wei Li, Ming Yang

Constrained multi-objective optimization problems (CMOPs) pervade real-world applications in science, engineering, and design.

SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment

no code implementations4 Jan 2024 Ziping Ma, Furong Xu, Jian Liu, Ming Yang, Qingpei Guo

To achieve multimodal alignment from both global and local perspectives, this paper proposes Symmetrizing Contrastive Captioners (SyCoCa), which introduces bidirectional interactions on images and texts across the global and local representation levels.

Image Captioning Image Classification +6

A Survey for Foundation Models in Autonomous Driving

no code implementations2 Feb 2024 Haoxiang Gao, Yaqian Li, Kaiwen Long, Ming Yang, Yiqing Shen

The advent of foundation models has revolutionized the fields of natural language processing and computer vision, paving the way for their application in autonomous driving (AD).

3D Object Detection Autonomous Driving +2

Anchor-free Clustering based on Anchor Graph Factorization

no code implementations24 Feb 2024 Shikun Mei, Fangfang Li, Quanxue Gao, Ming Yang

Additionally, we evolve the concept of the membership matrix between cluster centers and samples in FKM into an anchor graph encompassing multiple anchor points and samples.

Clustering

One-Step Multi-View Clustering Based on Transition Probability

no code implementations3 Mar 2024 Wenhui Zhao, Quanxue Gao, Guangfei Li, Cheng Deng, Ming Yang

Despite their successes, current methods lack interpretability in the clustering process and do not sufficiently consider the complementary information across different views.

Clustering

Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

no code implementations17 Mar 2024 Kangyang Xie, BinBin Yang, Hao Chen, Meng Wang, Cheng Zou, Hui Xue, Ming Yang, Chunhua Shen

Beyond the superiority of the text-to-image diffusion model in generating high-quality images, recent studies have attempted to uncover its potential for adapting the learned semantic knowledge to visual perception tasks.

Image Generation

Interpretable Multi-View Clustering Based on Anchor Graph Tensor Factorization

no code implementations1 Apr 2024 Jing Li, Quanxue Gao, Cheng Deng, Qianqian Wang, Ming Yang

Nevertheless, existing multi-view clustering methods based on anchor graph factorization lack adequate cluster interpretability for the decomposed matrix and often overlook the inter-view information.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.