Search Results for author: Ming Yang

Found 69 papers, 27 papers with code

DeepFace: Closing the Gap to Human-Level Performance in Face Verification

2 code implementations • Conference on Computer Vision and Pattern Recognition (CVPR) 2014 • Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf

In modern face recognition, the conventional pipeline consists of four stages: detect => align => represent => classify.

Ranked #1 on 3D Face Modelling on LFW

3D Face Modelling Face Recognition +1

9,894

BSN: Boundary Sensitive Network for Temporal Action Proposal Generation

17 code implementations • ECCV 2018 • Tianwei Lin, Xu Zhao, Haisheng Su, Chongjing Wang, Ming Yang

Temporal action proposal generation is an important yet challenging problem, since temporal proposals with rich action content are indispensable for analysing real-world videos with long duration and a high proportion of irrelevant content.

Ranked #3 on Temporal Action Proposal Generation on THUMOS'14

Action Detection Temporal Action Proposal Generation

6,703

Track to Detect and Segment: An Online Multi-Object Tracker

1 code implementation • CVPR 2021 • Jialian Wu, Jiale Cao, Liangchen Song, Yu Wang, Ming Yang, Junsong Yuan

Most online multi-object trackers perform object detection stand-alone in a neural net without any input from tracking.

Ranked #1 on Instance Segmentation on nuScenes

3D Multi-Object Tracking Instance Segmentation +7

545

Instance-level Human Parsing via Part Grouping Network

1 code implementation • ECCV 2018 • Ke Gong, Xiaodan Liang, Yicheng Li, Yimin Chen, Ming Yang, Liang Lin

Instance-level human parsing towards real-world human analysis scenarios is still under-explored due to the absence of sufficient data resources and technical difficulty in parsing multiple instances in a single pass.

Ranked #6 on Human Part Segmentation on CIHP

Edge Detection Human Parsing +2

408

Bi-Directional Cascade Network for Perceptual Edge Detection

2 code implementations • CVPR 2019 • Jianzhong He, Shiliang Zhang, Ming Yang, Yanhu Shan, Tiejun Huang

Exploiting multi-scale representations is critical to improve edge detection for objects at different scales.

Ranked #2 on Edge Detection on BRIND

Edge Detection

337

Attention Guided Network for Retinal Image Segmentation

2 code implementations • 25 Jul 2019 • Shihao Zhang, Huazhu Fu, Yuguang Yan, Yubing Zhang, Qingyao Wu, Ming Yang, Mingkui Tan, Yanwu Xu

Learning structural information is critical for producing an ideal result in retinal image segmentation.

Image Segmentation Segmentation +1

162

Tactics2D: A Reinforcement Learning Environment Library with Generative Scenarios for Driving Decision-making

2 code implementations • 18 Nov 2023 • Yueyuan Li, Songan Zhang, Mingyang Jiang, Xingyuan Chen, Ming Yang

For access to the source code and participation in discussions, visit the official GitHub page for Tactics2D at https://github.com/WoodOxen/Tactics2D.

Autonomous Driving Decision Making +3

Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

1 code implementation • CVPR 2021 • Bowen Cheng, Lu Sheng, Shaoshuai Shi, Ming Yang, Dong Xu

Inspired by the back-tracing strategy in the conventional Hough voting methods, in this work, we introduce a new 3D object detection method, named as Back-tracing Representative Points Network (BRNet), which generatively back-traces the representative points from the vote centers and also revisits complementary seed points around these generated points, so as to better capture the fine local structural features surrounding the potential objects from the raw point clouds.

Ranked #17 on 3D Object Detection on ScanNetV2

3D Object Detection Object +1

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

1 code implementation • 27 Feb 2024 • ZiCheng Zhang, Ruobing Zheng, Ziwen Liu, Congying Han, Tianqi Li, Meng Wang, Tiande Guo, Jingdong Chen, Bonan Li, Ming Yang

Recent works in implicit representations, such as Neural Radiance Fields (NeRF), have advanced the generation of realistic and animatable head avatars from video sequences.

Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs

2 code implementations • 1 Oct 2023 • Shiyu Xuan, Qingpei Guo, Ming Yang, Shiliang Zhang

Specifically, we present a new method for constructing the instruction tuning dataset at a low cost by leveraging annotations in existing datasets.

Referring Expression

EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints

1 code implementation • 21 Aug 2023 • Yutao Chen, Xingning Dong, Tian Gan, Chunluan Zhou, Ming Yang, Qingpei Guo

Compared with images, we conjecture that videos necessitate more constraints to preserve the temporal consistency during editing.

Video Editing

Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning

1 code implementation • 20 Sep 2023 • Chen Jiang, Hong Liu, Xuzheng Yu, Qing Wang, Yuan Cheng, Jia Xu, Zhongyi Liu, Qingpei Guo, Wei Chu, Ming Yang, Yuan Qi

We thereby present a new Triplet Partial Margin Contrastive Learning (TPM-CL) module to construct partial order triplet samples by automatically generating fine-grained hard negatives for matched text-video pairs.

Ranked #4 on Video Retrieval on MSR-VTT-1kA

Contrastive Learning Retrieval +3

M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining

1 code implementation • 29 Jan 2024 • Qingpei Guo, Furong Xu, Hanxiao Zhang, Wang Ren, Ziping Ma, Lin Ju, Jian Wang, Jingdong Chen, Ming Yang

Vision-language foundation models like CLIP have revolutionized the field of artificial intelligence.

Ranked #1 on Zero-shot Image Retrieval on Flickr30k-CN (using extra training data)

Zero-Shot Cross-Modal Retrieval Zero-shot Image Retrieval +3

M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval

1 code implementation • 31 Jan 2024 • Xingning Dong, Zipeng Feng, Chunluan Zhou, Xuzheng Yu, Ming Yang, Qingpei Guo

We then summarize this empirical study into the M2-RAAP recipe, where our technical contributions lie in 1) the data filtering and text re-writing pipeline resulting in 1M high-quality bilingual video-text pairs, 2) the replacement of video inputs with key-frames to accelerate pre-training, and 3) the Auxiliary-Caption-Guided (ACG) strategy to enhance video features.

Retrieval Text Retrieval +1

Resolution-invariant Person Re-Identification

1 code implementation • 24 Jun 2019 • Shunan Mao, Shiliang Zhang, Ming Yang

RIFE adopts two feature extraction streams weighted by a dual-attention block to learn features for low and high resolution images, respectively.

Person Re-Identification Super-Resolution

SSAP: Single-Shot Instance Segmentation With Affinity Pyramid

2 code implementations • ICCV 2019 • Naiyu Gao, Yanhu Shan, Yupei Wang, Xin Zhao, Yinan Yu, Ming Yang, Kaiqi Huang

Moreover, incorporating with the learned affinity pyramid, a novel cascaded graph partition module is presented to sequentially generate instances from coarse to fine.

Instance Segmentation Segmentation +1

Recall and Learn: A Memory-augmented Solver for Math Word Problems

1 code implementation • Findings (EMNLP) 2021 • Shifeng Huang, Jiawei Wang, Jiao Xu, Da Cao, Ming Yang

Specifically, given a math word problem, the model first retrieves similar questions by a memory module and then encodes the unsolved problem and each retrieved question using a representation module.

Ranked #7 on Math Word Problem Solving on Math23K

Math Math Word Problem Solving

SAM4UDASS: When SAM Meets Unsupervised Domain Adaptive Semantic Segmentation in Intelligent Vehicles

1 code implementation • 22 Nov 2023 • Weihao Yan, Yeqiang Qian, Xingyuan Chen, Hanyang Zhuang, Chunxiang Wang, Ming Yang

It involves Semantic-Guided Mask Labeling, which assigns semantic labels to unlabeled SAM masks using UDA pseudo-labels.

Semantic Segmentation Unsupervised Domain Adaptation

Threshold-adaptive Unsupervised Focal Loss for Domain Adaptation of Semantic Segmentation

1 code implementation • 23 Aug 2022 • Weihao Yan, Yeqiang Qian, Chunxiang Wang, Ming Yang

In stage one, we design a threshold-adaptive unsupervised focal loss to regularize the prediction in the target domain, which has a mild gradient neutralization mechanism and mitigates the problem that hard samples are barely optimized in entropy-based methods.

Data Augmentation Segmentation +2

Efficient Generalization Improvement Guided by Random Weight Perturbation

1 code implementation • 21 Nov 2022 • Tao Li, Weihao Yan, Zehao Lei, Yingwen Wu, Kun Fang, Ming Yang, Xiaolin Huang

To fully uncover the great potential of deep neural networks (DNNs), various learning algorithms have been developed to improve the model's generalization ability.

Monocular Pedestrian Orientation Estimation Based on Deep 2D-3D Feedforward

1 code implementation • 24 Sep 2019 • Chenchen Zhao, Yeqiang Qian, Ming Yang

The 2D and 3D dimensions of pedestrians are determined from the camera captures and further utilized through two feedforward links connected to the orientation estimator.

Autonomous Driving Collision Avoidance

Top-N Recommendation on Graphs

1 code implementation • 27 Sep 2016 • Zhao Kang, Chong Peng, Ming Yang, Qiang Cheng

To alleviate this problem, this paper proposes a simple recommendation algorithm that fully exploits the similarity information among users and items and intrinsic structural information of the user-item matrix.

Collaborative Filtering Recommendation Systems

Tensor Robust PCA with Nonconvex and Nonlocal Regularization

1 code implementation • 4 Nov 2022 • Xiaoyu Geng, Qiang Guo, Shuaixiong Hui, Ming Yang, Caiming Zhang

To this end, we integrate nonlocal self-similarity into N-TRPCA, and further develop a nonconvex and nonlocal TRPCA (NN-TRPCA) model.

Probabilistic Latent Factor Model for Collaborative Filtering with Bayesian Inference

1 code implementation • 7 Dec 2020 • Jiansheng Fang, Xiaoqing Zhang, Yan Hu, Yanwu Xu, Ming Yang, Jiang Liu

Latent Factor Model (LFM) is one of the most successful methods for Collaborative filtering (CF) in the recommendation system, in which both users and items are projected into a joint latent factor space.

Bayesian Inference Collaborative Filtering +1

Deep View Synthesis via Self-Consistent Generative Network

1 code implementation • 19 Jan 2021 • Zhuoman Liu, Wei Jia, Ming Yang, Peiyao Luo, Yong Guo, Mingkui Tan

To address the above issues, in this paper, we propose a novel deep generative model, called Self-Consistent Generative Network (SCGN), which synthesizes novel views from the given input views without explicitly exploiting the geometric information.

Group-based Interleaved Pipeline Parallelism for Large-scale DNN Training

1 code implementation • ICLR 2022 • Pengcheng Yang, XiaoMing Zhang, Wenpeng Zhang, Ming Yang, Hong Wei

The recent trend of using large-scale deep neural networks (DNN) to boost performance has propelled the development of the parallel pipelining technique for efficient DNN training, which has resulted in the development of several prominent pipelines such as GPipe, PipeDream, and PipeDream-2BW.

Cross-to-merge training with class balance strategy for learning with noisy labels

1 code implementation • Expert Systems with Applications 2024 • Qian Zhang, Yi Zhu, Ming Yang, Ge Jin, YingWen Zhu, Qiu Chen

Although sample selection is a mainstream method in the field of learning with noisy labels, which aims to mitigate the impact of noisy labels during model training, the testing performance of these methods exhibits significant fluctuations across different noise rates and types.

Ranked #2 on Learning with noisy labels on Clothing1M

Learning with noisy labels

Restricted Deformable Convolution based Road Scene Semantic Segmentation Using Surround View Cameras

no code implementations • 2 Jan 2018 • Liuyuan Deng, Ming Yang, Hao Li, Tianyi Li, Bing Hu, Chunxiang Wang

Finally, an RDC based semantic segmentation model is built; the model is trained for real-world surround view images through a multi-task learning architecture by combining real-world images with transformed images.

Autonomous Driving Multi-Task Learning +2

Joint Calibration of Panoramic Camera and Lidar Based on Supervised Learning

no code implementations • 9 Sep 2017 • Mingwei Cao, Ming Yang, Chunxiang Wang, Yeqiang Qian, Bing Wang

In view of contemporary panoramic camera-laser scanner system, the traditional calibration method is not suitable for panoramic cameras whose imaging model is extremely nonlinear.

Translation

A Survey of Multi-View Representation Learning

no code implementations • 3 Oct 2016 • Yingming Li, Ming Yang, Zhongfei Zhang

Consequently, we first review the representative methods and theories of multi-view representation learning based on the perspective of alignment, such as correlation-based alignment.

Representation Learning

A Multi-model Combination Approach for Probabilistic Wind Power Forecasting

no code implementations • 13 Feb 2017 • You Lin, Ming Yang, Can Wan, Jianhui Wang, Yonghua Song

Therefore, a novel multi-model combination (MMC) approach for short-term probabilistic wind generation forecasting is proposed in this paper to exploit the advantages of different forecasting models.

Density Estimation

Web-Scale Training for Face Identification

no code implementations • CVPR 2015 • Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf

Scaling machine learning methods to very large datasets has attracted considerable attention in recent years, thanks to easy access to ubiquitous sensing and data from the web.

Face Identification Face Recognition +1

Compressing Deep Convolutional Networks using Vector Quantization

no code implementations • 18 Dec 2014 • Yunchao Gong, Liu Liu, Ming Yang, Lubomir Bourdev

In this paper, we tackle this model storage issue by investigating information theoretical vector quantization methods for compressing the parameters of CNNs.

Classification Clustering +6

Conditional Generative Adversarial Network for Structured Domain Adaptation

no code implementations • CVPR 2018 • Weixiang Hong, Zhenzhen Wang, Ming Yang, Junsong Yuan

In recent years, deep neural nets have triumphed over many computer vision problems, including semantic segmentation, which is a critical task in emerging autonomous driving and medical image diagnostics applications.

Autonomous Driving Domain Adaptation +2

Image Blind Denoising With Generative Adversarial Network Based Noise Modeling

no code implementations • CVPR 2018 • Jingwen Chen, Jia-Wei Chen, Hongyang Chao, Ming Yang

In this paper, we consider a typical image blind denoising problem, which is to remove unknown noise from noisy images.

Denoising Generative Adversarial Network

Deep Reinforcement Learning with Iterative Shift for Visual Tracking

no code implementations • ECCV 2018 • Liangliang Ren, Xin Yuan, Jiwen Lu, Ming Yang, Jie Zhou

Visual tracking is confronted by the dilemma to locate a target both accurately and efficiently, and make decisions online whether and how to adapt the appearance model or even restart tracking.

Motion Estimation Object +4

Generating Synthesized Computed Tomography (CT) from Cone-Beam Computed Tomography (CBCT) using CycleGAN for Adaptive Radiation Therapy

no code implementations • 31 Oct 2018 • Xiao Liang, Liyuan Chen, Dan Nguyen, Zhiguo Zhou, Xuejun Gu, Ming Yang, Jing Wang, Steve Jiang

Dose calculation accuracy using sCT images has been improved over the original CBCT images, with the average Gamma Index passing rate increased from 95.4% to 97.4% for 1 mm/1% criteria.

Medical Physics

Bidirectional Long Short-Term Memory Networks for Relation Classification

no code implementations • PACLIC 2015 • Shu Zhang, Dequan Zheng, Xinchen Hu, Ming Yang

Ranked #29 on Relation Extraction on SemEval-2010 Task-8

Classification General Classification +4

3D Graph Embedding Learning with a Structure-aware Loss Function for Point Cloud Semantic Instance Segmentation

no code implementations • 14 Feb 2019 • Zhidong Liang, Ming Yang, Chunxiang Wang

As a result, our framework can output both the semantic prediction and the instance prediction.

3D Instance Segmentation 3D Semantic Instance Segmentation +2

A Novel Demodulation and Estimation Algorithm for Blackout Communication: Extract Principal Components with Deep Learning

no code implementations • 27 May 2019 • Haoyan Liu, Yanming Liu, Ming Yang, Xiaoping Li

For reentry or near space communication, owing to the influence of the time-varying plasma sheath channel environment, the received IQ baseband signals are severely rotated on the constellation.

RFBNet: Deep Multimodal Networks with Residual Fusion Blocks for RGB-D Semantic Segmentation

no code implementations • 29 Jun 2019 • Liuyuan Deng, Ming Yang, Tianyi Li, Yuesheng He, Chunxiang Wang

To instantiate this structure, the paper proposes a residual fusion block (RFB) to formulate the interdependences of the encoders.

Ranked #3 on Semantic Segmentation on ScanNetV2

Semantic Segmentation

Map-Enhanced Ego-Lane Detection in the Missing Feature Scenarios

no code implementations • 2 Apr 2020 • Xiaoliang Wang, Yeqiang Qian, Chunxiang Wang, Ming Yang

As one of the most important tasks in autonomous driving systems, ego-lane detection has been extensively studied and has achieved impressive results in many scenarios.

Autonomous Driving Lane Detection

Class Distribution Alignment for Adversarial Domain Adaptation

no code implementations • 20 Apr 2020 • Wanqi Yang, Tong Ling, Chengmei Yang, Lei Wang, Yinghuan Shi, Luping Zhou, Ming Yang

To address this issue, we propose a novel approach called Conditional ADversarial Image Translation (CADIT) to explicitly align the class distributions given samples between the two domains.

General Classification Translation +1

MAFF-Net: Filter False Positive for 3D Vehicle Detection with Multi-modal Adaptive Feature Fusion

no code implementations • 23 Sep 2020 • Zehan Zhang, Ming Zhang, Zhidong Liang, Xian Zhao, Ming Yang, Wenming Tan, ShiLiang Pu

Experimental results on the KITTI dataset demonstrate significant improvement in filtering false positive over the approach using only point cloud data.

Autonomous Driving

The Period-Luminosity Relations of Red Supergiants in M33 and M31

no code implementations • 20 Feb 2019 • Yi Ren, B. W. Jiang, Ming Yang, Jian Gao

The period-luminosity (P-L) relation is analyzed for the RSGs in the fundamental mode.

Solar and Stellar Astrophysics Astrophysics of Galaxies

Evolved Massive Stars at Low-metallicity IV. Using 1.6 $μ$m "H-bump" to identify red supergiant stars: a case study of NGC 6822

no code implementations • 21 Jan 2021 • Ming Yang, Alceste Z. Bonanos, Biwei Jiang, Man I Lam, Jian Gao, Panagiotis Gavras, Grigoris Maravelias, Shu Wang, Xiao-Dian Chen, Frank Tramper, Yi Ren, Zoi T. Spetsieri

Further separating RSG candidates from the rest of the LSG candidates is done by using semi-empirical criteria on NIR CMDs and resulted in 323 RSG candidates.

Solar and Stellar Astrophysics Astrophysics of Galaxies

Enhancement of Superconductivity Linked with Linear-in-Temperature/Field Resistivity in Ion-Gated FeSe Films

no code implementations • 11 Mar 2021 • Xingyu Jiang, Mingyang Qin, Xinjian Wei, Zhongpei Feng, Jiezun Ke, Haipeng Zhu, Fucong Chen, Liping Zhang, Li Xu, Xu Zhang, Ruozhou Zhang, Zhongxu Wei, Peiyu Xiong, Qimei Liang, Chuanying Xi, Zhaosheng Wang, Jie Yuan, Beiyi Zhu, Kun Jiang, Ming Yang, Junfeng Wang, Jiangping Hu, Tao Xiang, Brigitte Leridon, Rong Yu, Qihong Chen, Kui Jin, Zhongxian Zhao

Iron selenide (FeSe), the structurally simplest iron-based superconductor, has attracted tremendous interest in the past years.

Superconductivity

Momentum Accelerates the Convergence of Stochastic AUPRC Maximization

no code implementations • 2 Jul 2021 • Guanghui Wang, Ming Yang, Lijun Zhang, Tianbao Yang

In this paper, we further improve the stochastic optimization of AUPRC by (i) developing novel stochastic momentum methods with a better iteration complexity of $O(1/\epsilon^4)$ for finding an $\epsilon$-stationary solution; and (ii) designing a novel family of stochastic adaptive methods with the same iteration complexity, which enjoy faster convergence in practice.

imbalanced classification Stochastic Optimization

Few-shot Unsupervised Domain Adaptation with Image-to-class Sparse Similarity Encoding

no code implementations • 6 Aug 2021 • Shengqi Huang, Wanqi Yang, Lei Wang, Luping Zhou, Ming Yang

Inspired by the recent local descriptor based few-shot learning (FSL), our general UDA model is fully built upon local descriptors (LDs) for image classification and domain adaptation.

Few-Shot Learning Image Classification +1

Stacked Homography Transformations for Multi-View Pedestrian Detection

no code implementations • ICCV 2021 • Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan

This task is confronted with two challenges: how to establish the 3D correspondences from views to the BEV map and how to assemble occupancy information across views.

Ranked #7 on Multiview Detection on MultiviewX

Multiview Detection Pedestrian Detection

Self-supervised Contrastive Attributed Graph Clustering

no code implementations • 15 Oct 2021 • Wei Xia, Quanxue Gao, Ming Yang, Xinbo Gao

Thus, for the OOS nodes, SCAGC can directly calculate their clustering labels.

Attribute Clustering +3

BAANet: Learning Bi-directional Adaptive Attention Gates for Multispectral Pedestrian Detection

no code implementations • 4 Dec 2021 • Xiaoxiao Yang, Yeqian Qiang, Huijie Zhu, Chunxiang Wang, Ming Yang

Thermal infrared (TIR) image has proven effectiveness in providing temperature cues to the RGB features for multispectral pedestrian detection.

Pedestrian Detection Specificity

SUNet: Scale-aware Unified Network for Panoptic Segmentation

no code implementations • 7 Sep 2022 • Weihao Yan, Yeqiang Qian, Chunxiang Wang, Ming Yang

Panoptic segmentation combines the advantages of semantic and instance segmentation, which can provide both pixel-level and instance-level environmental perception information for intelligent vehicles.

Instance Segmentation Panoptic Segmentation +1

Double-Ended Palindromic Trees: A Linear-Time Data Structure and Its Applications

no code implementations • 5 Oct 2022 • Qisheng Wang, Ming Yang, Xinrui Zhu

The palindromic tree (a.k.a. eertree) is a linear-size data structure that provides access to all palindromic substrings of a string.

FedSiam-DA: Dual-aggregated Federated Learning via Siamese Network under Non-IID Data

no code implementations • 17 Nov 2022 • Ming Yang, Yanhan Wang, Xin Wang, Zhenyong Zhang, Xiaoming Wu, Peng Cheng

Federated learning is a distributed learning that allows each client to keep the original data locally and only upload the parameters of the local model to the server.

Contrastive Learning Federated Learning

High-level semantic feature matters few-shot unsupervised domain adaptation

no code implementations • 5 Jan 2023 • Lei Yu, Wanqi Yang, Shengqi Huang, Lei Wang, Ming Yang

However, the goals of FS-UDA and FSL are related yet distinct, since FS-UDA aims to classify samples in the target domain rather than the source domain.

Few-Shot Learning Unsupervised Domain Adaptation +1

Spatial Attention and Syntax Rule Enhanced Tree Decoder for Offline Handwritten Mathematical Expression Recognition

no code implementations • 13 Mar 2023 • Zihao Lin, Jinrong Li, Fan Yang, Shuangping Huang, Xu Yang, Jianmin Lin, Ming Yang

In this paper, we propose a novel model called Spatial Attention and Syntax Rule Enhanced Tree Decoder (SS-TD), which is equipped with spatial attention mechanism to alleviate the prediction error of tree structure and use syntax masks (obtained from the transformation of syntax rules) to constrain the occurrence of ungrammatical mathematical expression.

Vehicle Sequencing at Signal-Free Intersections: Analytical Performance Guarantees Based on PDMP Formulation

no code implementations • 21 Mar 2023 • Xiangchen Cheng, Wei Tang, Ming Yang, Li Jin

Signal-free intersections are a representative application of smart and connected vehicle technologies.

Autonomous Driving Trajectory Planning

Low-Light Image Enhancement by Learning Contrastive Representations in Spatial and Frequency Domains

no code implementations • 23 Mar 2023 • Yi Huang, Xiaoguang Tu, Gui Fu, Tingting Liu, Bokai Liu, Ming Yang, Ziliang Feng

Images taken under low-light conditions tend to suffer from poor visibility, which can decrease image quality and even reduce the performance of the downstream tasks.

Contrastive Learning Low-Light Image Enhancement

Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms

no code implementations • 7 Jul 2023 • Ming Yang, Xiyuan Wei, Tianbao Yang, Yiming Ying

Then, we establish the compositional uniform stability results for two popular stochastic compositional gradient descent algorithms, namely SCGD and SCSC.

Learning Theory Meta-Learning

Choose Your Simulator Wisely: A Review on Open-source Simulators for Autonomous Driving

no code implementations • 18 Nov 2023 • Yueyuan Li, Wei Yuan, Songan Zhang, Weihao Yan, Qiyuan Shen, Chunxiang Wang, Ming Yang

Simulators play a crucial role in autonomous driving, offering significant time, cost, and labor savings.

Autonomous Driving

SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery

no code implementations • 15 Dec 2023 • Xin Guo, Jiangwei Lao, Bo Dang, Yingying Zhang, Lei Yu, Lixiang Ru, Liheng Zhong, Ziyuan Huang, Kang Wu, Dingxiang Hu, Huimei He, Jian Wang, Jingdong Chen, Ming Yang, Yongjun Zhang, Yansheng Li

Prior studies on Remote Sensing Foundation Model (RSFM) reveal immense potential towards a generic model for Earth Observation.

Contrastive Learning Earth Observation +1

Evolutionary Alternating Direction Method of Multipliers for Constrained Multi-Objective Optimization with Unknown Constraints

no code implementations • 2 Jan 2024 • Shuang Li, Ke Li, Wei Li, Ming Yang

Constrained multi-objective optimization problems (CMOPs) pervade real-world applications in science, engineering, and design.

SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment

no code implementations • 4 Jan 2024 • Ziping Ma, Furong Xu, Jian Liu, Ming Yang, Qingpei Guo

To achieve multimodal alignment from both global and local perspectives, this paper proposes Symmetrizing Contrastive Captioners (SyCoCa), which introduces bidirectional interactions on images and texts across the global and local representation levels.

Image Captioning Image Classification +6

A Survey for Foundation Models in Autonomous Driving

no code implementations • 2 Feb 2024 • Haoxiang Gao, Yaqian Li, Kaiwen Long, Ming Yang, Yiqing Shen

The advent of foundation models has revolutionized the fields of natural language processing and computer vision, paving the way for their application in autonomous driving (AD).

3D Object Detection Autonomous Driving +2

Anchor-free Clustering based on Anchor Graph Factorization

no code implementations • 24 Feb 2024 • Shikun Mei, Fangfang Li, Quanxue Gao, Ming Yang

Additionally, we evolve the concept of the membership matrix between cluster centers and samples in FKM into an anchor graph encompassing multiple anchor points and samples.

Clustering

One-Step Multi-View Clustering Based on Transition Probability

no code implementations • 3 Mar 2024 • Wenhui Zhao, Quanxue Gao, Guangfei Li, Cheng Deng, Ming Yang

Despite their successes, current methods lack interpretability in the clustering process and do not sufficiently consider the complementary information across different views.

Clustering

Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

no code implementations • 17 Mar 2024 • Kangyang Xie, BinBin Yang, Hao Chen, Meng Wang, Cheng Zou, Hui Xue, Ming Yang, Chunhua Shen

Beyond the superiority of the text-to-image diffusion model in generating high-quality images, recent studies have attempted to uncover its potential for adapting the learned semantic knowledge to visual perception tasks.

Image Generation

Interpretable Multi-View Clustering Based on Anchor Graph Tensor Factorization

no code implementations • 1 Apr 2024 • Jing Li, Quanxue Gao, Cheng Deng, Qianqian Wang, Ming Yang

Nevertheless, existing multi-view clustering methods based on anchor graph factorization lack adequate cluster interpretability for the decomposed matrix and often overlook the inter-view information.

Clustering
