no code implementations • EMNLP 2021 • Xiaobao Guo, Adams Kong, Huan Zhou, Xianfeng Wang, Min Wang
Specifically, to improve unimodal representations, a unimodal refinement module is designed to refine modality-specific learning via iteratively updating the distribution with transformer-based attention layers.
1 code implementation • 5 Mar 2025 • Ping Chen, Xingpeng Zhang, Zhaoxiang Liu, Huan Hu, Xiang Liu, Kai Wang, Min Wang, Yanlin Qian, Shiguo Lian
In this research, we propose a novel denoising diffusion model based on shortest-path modeling that optimizes residual propagation to enhance both denoising efficiency and quality. Drawing on Denoising Diffusion Implicit Models (DDIM) and insights from graph theory, our model, termed the Shortest Path Diffusion Model (ShortDF), treats the denoising process as a shortest-path problem aimed at minimizing reconstruction error.
no code implementations • 19 Dec 2024 • Min Wang, Xin Huang, Guoqing Zhou, Qifeng Guo, Qing Wang
Neural Radiance Fields (NeRFs) have demonstrated prominent performance in novel view synthesis.
no code implementations • 18 Nov 2024 • Jiayi Li, XiLe Zhao, Jianli Wang, Chao Wang, Min Wang
Recently, implicit neural representations (INRs) have attracted increasing attention for multi-dimensional data recovery.
no code implementations • 18 Oct 2024 • Mengqi Wang, Krishna Kumar, Y. T. Feng, Tongming Qu, Min Wang
Artificial intelligence (AI) has become a buzz word since Google's AlphaGo beat a world champion in 2017.
no code implementations • 17 Sep 2024 • Weiye Xu, Min Wang, Wengang Zhou, Houqiang Li
Embodied Everyday Task is a popular task in the embodied AI community, requiring agents to make a sequence of actions based on natural language instructions and visual observations.
1 code implementation • 31 Jul 2024 • Shoujin Huang, GuanXiong Luo, Yunlin Zhao, Yilong Liu, Yuwan Wang, Kexin Yang, Jingzhe Liu, Hua Guo, Min Wang, Lingyan Zhang, Mengye Lyu
Simultaneous multislice (SMS) imaging is a powerful technique for accelerating magnetic resonance imaging (MRI) acquisitions.
1 code implementation • 23 Jul 2024 • Longtao Jiang, Min Wang, Zecheng Li, Yao Fang, Wengang Zhou, Houqiang Li
Furthermore, existing RGB-based sign retrieval works suffer from the huge memory cost of dense visual data embedding in end-to-end training, and adopt offline RGB encoder instead, leading to suboptimal feature representation.
1 code implementation • 10 Jul 2024 • Wei Dai, Rui Liu, Zixuan Wu, Tianyi Wu, Min Wang, Junxian Zhou, Yixuan Yuan, Jun Liu
Early detection and accurate diagnosis can predict the risk of malignant disease transformation, thereby increasing the probability of effective treatment.
no code implementations • 7 Jul 2024 • Ting-Wei Zhou, Xi-Le Zhao, Jian-Li Wang, Yi-Si Luo, Min Wang, Xiao-Xuan Bai, Hong Yan
Especially, the deep latent generative module can faithfully generate the latent tensor as compared with shallow matrix factorization.
no code implementations • 28 Jun 2024 • Hao Yue, Yingtao Wu, Min Wang, Hesuan Hu, Weimin Wu, Jihui Zhang
Cloud manufacturing system is a service-oriented and knowledge-based one, which can provide solutions for the large-scale customized production.
1 code implementation • 15 Jun 2024 • Weichao Zhao, Wengang Zhou, Hezhen Hu, Min Wang, Houqiang Li
On one hand, since the semantics of sign language are expressed by the cooperation of fine-grained hands and coarse-grained trunks, we utilize both granularity information and encode them into latent spaces.
1 code implementation • 31 May 2024 • Weichao Zhao, Hezhen Hu, Wengang Zhou, Yunyao Mao, Min Wang, Houqiang Li
To this end, we propose a Motion-Aware masked autoencoder with Semantic Alignment (MASA) that integrates rich motion cues and global semantic information in a self-supervised learning paradigm for SLR.
no code implementations • 22 Apr 2024 • Mingxuan Gao, Min Wang, Maoyin Chen
Deep learning has shown the great power in the field of fault detection.
1 code implementation • 1 Apr 2024 • Yuanhao Zeng, Min Wang, Yihang Wang, Yingxia Shao
With the same amount of task data, TELL leads in improving task performance compared to SFT.
1 code implementation • 18 Mar 2024 • Xiaohan Lei, Min Wang, Wengang Zhou, Houqiang Li
In embodied vision, Instance ImageGoal Navigation (IIN) requires an agent to locate a specific object depicted in a goal image within an unexplored environment.
no code implementations • 18 Mar 2024 • Zhiyang Guo, Wengang Zhou, Li Li, Min Wang, Houqiang Li
To address the above problem, we propose a novel motion-aware enhancement framework for dynamic scene reconstruction, which mines useful motion cues from optical flow to improve different paradigms of dynamic 3DGS.
no code implementations • 4 Mar 2024 • Jingyu Gong, Min Wang, Wentao Liu, Chen Qian, Zhizhong Zhang, Yuan Xie, Lizhuang Ma
To handle this problem, we propose the first Dynamic Environment MOtion Synthesis framework (DEMOS) to predict future motion instantly according to the current scene, and use it to dynamically update the latent motion for final motion synthesis.
no code implementations • 3 Mar 2024 • Yongchao Du, Min Wang, Wengang Zhou, Shuping Hui, Houqiang Li
To tackle the above problems, we propose Image2Sentence based Asymmetric zero-shot composed image retrieval (ISA), which takes advantage of the VL model and only relies on unlabeled images for composition learning.
1 code implementation • 1 Mar 2024 • Hui Wu, Min Wang, Wengang Zhou, Houqiang Li
The centroid vectors in the quantizer serve as anchor points in the embedding space of the gallery model to characterize its structure.
no code implementations • CVPR 2023 • Hui Wu, Min Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li
Then, a dynamic mixer is introduced to aggregate these features into compact embedding for efficient search.
1 code implementation • CVPR 2024 • Xiaohan Lei, Min Wang, Wengang Zhou, Li Li, Houqiang Li
As a new embodied vision task, Instance ImageGoal Navigation (IIN) aims to navigate to a specified object depicted by a goal image in an unexplored environment.
1 code implementation • 17 Aug 2023 • Ziyin Zhang, Ning Lu, Minghui Liao, Yongshuai Huang, Cheng Li, Min Wang, Wei Peng
It incorporates a framewise regularization term in CTC loss to emphasize individual supervision, and leverages the maximizing-a-posteriori of latent alignment to solve the inconsistency problem that arises in distillation between CTC-based models.
no code implementations • 16 Jun 2023 • Shuai Xiao, Chen Pan, Min Wang, Xinxin Zhu, Siqiao Xue, Jing Wang, Yunhua Hu, James Zhang, Jinghua Feng
To this end, we formulate the problem as a partially observable Markov decision problem (POMDP) and employ an environment correction algorithm based on the characteristics of the business.
Hierarchical Reinforcement Learning
reinforcement-learning
+1
no code implementations • 26 Mar 2023 • Yingda Guan, Zhengyang Feng, Huiying Chang, Kuo Du, TingTing Li, Min Wang
We present SDTracker, a method that harnesses the potential of synthetic data for multi-object tracking of real-world scenes in a domain generalization and semi-supervised fashion.
no code implementations • CVPR 2023 • Zhiyang Guo, Wengang Zhou, Min Wang, Li Li, Houqiang Li
We propose a novel framework to reconstruct accurate appearance and geometry with neural radiance fields (NeRF) for interacting hands, enabling the rendering of photo-realistic images and videos for gesture animation from arbitrary views.
no code implementations • 13 Feb 2023 • Mengxuan Li, Peng Peng, Min Wang, Hongwei Wang
The novelty of HDLCNN lies in its capability of processing tabular data with features of arbitrary order without seeking the optimal order, due to the ability to agglomerate correlated features of feature clustering and the large receptive field of dilated convolution.
no code implementations • 24 Aug 2022 • Min Wang, Ata Mahjoubfar, Anupama Joshi
We see that using the same transformer for encoding the question and decoding the answer, as in language models, achieves maximum accuracy, showing that visual language models (VLMs) make the best visual question answering systems for our dataset.
2 code implementations • 20 Aug 2022 • Jun Zhang, Sirui Liu, Mengyun Chen, Haotian Chu, Min Wang, Zidong Wang, Jialiang Yu, Ningxi Ni, Fan Yu, Diqing Chen, Yi Isaac Yang, Boxin Xue, Lijiang Yang, YuAn Liu, Yi Qin Gao
Data-driven predictive methods which can efficiently and accurately transform protein sequences into biologically active structures are highly valuable for scientific research and medical development.
1 code implementation • Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022 • Xiang Huang, Hongsheng Liu, Beiji Shi, Zidong Wang, Kang Yang, Yang Li, Min Wang, Haotian Chu, Jing Zhou, Fan Yu, Bei Hua, Bin Dong, Lei Chen
In recent years, deep learning technology has been used to solve partial differential equations (PDEs), among which the physics-informed neural networks (PINNs)method emerges to be a promising method for solving both forward and inverse PDE problems.
2 code implementations • 24 Jun 2022 • Sirui Liu, Jun Zhang, Haotian Chu, Min Wang, Boxin Xue, Ningxi Ni, Jialiang Yu, Yuhao Xie, Zhenyu Chen, Mengyun Chen, YuAn Liu, Piya Patra, Fan Xu, Jie Chen, Zidong Wang, Lijiang Yang, Fan Yu, Lei Chen, Yi Qin Gao
We provide in addition the benchmark training procedure for SOTA protein structure prediction model on this dataset.
1 code implementation • IEEE Transactions on Systems, Man, and Cybernetics: Systems 2022 • Min Wang, Chunyu Yang, Fei Zhao, Fan Min, XiZhao Wang
A reasonable machine learning scenario involves obtaining certain values and labels at cost on request.
2 code implementations • 9 May 2022 • Wei Dai, Rui Liu, Tianyi Wu, Min Wang, Jianqin Yin, Jun Liu
Visual features of skin lesions vary significantly because the images are collected from patients with different lesion colours and morphologies by using dissimilar imaging equipment.
1 code implementation • CVPR 2022 • Zhengyang Feng, Shaohua Guo, Xin Tan, Ke Xu, Min Wang, Lizhuang Ma
This paper presents a novel parametric curve-based method for lane detection in RGB images.
Ranked #2 on
Lane Detection
on LLAMAS
1 code implementation • 13 Feb 2022 • Wanglong Lu, Hanli Zhao, Xianta Jiang, Xiaogang Jin, YongLiang Yang, Min Wang, Jiankai Lyu, Kaijie Shi
We introduce a novel attribute similarity metric to encourage networks to learn the style of facial attributes from the exemplar in a self-supervised way.
no code implementations • 1 Jan 2022 • Hao Yang, Min Wang, Zhengfei Yu, Yun Zhou
Extensive experiments on well-known white- and black-box attacks show that MFDV-SNN achieves a significant improvement over existing methods, which indicates that it is a simple but effective method to improve model robustness.
no code implementations • CVPR 2022 • Hui Wu, Min Wang, Wengang Zhou, Houqiang Li, Qi Tian
To this end, we propose a flexible contextual similarity distillation framework to enhance the small query model and keep its output feature compatible with that of large gallery model, which is crucial with asymmetric retrieval.
1 code implementation • 12 Dec 2021 • Hui Wu, Min Wang, Wengang Zhou, Yang Hu, Houqiang Li
Next, a refinement block is introduced to enhance the visual tokens with self-attention and cross-attention.
Ranked #3 on
Image Retrieval
on RParis (Medium)
1 code implementation • 25 Nov 2021 • Jiachen Xu, Min Wang, Jingyu Gong, Wentao Liu, Chen Qian, Yuan Xie, Lizhuang Ma
Prior plays an important role in providing the plausible constraint on human motion.
no code implementations • 15 Nov 2021 • Xiang Huang, Zhanhong Ye, Hongsheng Liu, Beiji Shi, Zidong Wang, Kang Yang, Yang Li, Bingya Weng, Min Wang, Haotian Chu, Fan Yu, Bei Hua, Lei Chen, Bin Dong
Many important problems in science and engineering require solving the so-called parametric partial differential equations (PDEs), i. e., PDEs with different physical parameters, boundary conditions, shapes of computation domains, etc.
no code implementations • 2 Nov 2021 • Xiang Huang, Hongsheng Liu, Beiji Shi, Zidong Wang, Kang Yang, Yang Li, Bingya Weng, Min Wang, Haotian Chu, Jing Zhou, Fan Yu, Bei Hua, Lei Chen, Bin Dong
In recent years, deep learning technology has been used to solve partial differential equations (PDEs), among which the physics-informed neural networks (PINNs) emerges to be a promising method for solving both forward and inverse PDE problems.
1 code implementation • NeurIPS 2021 • Jianbo Ouyang, Hui Wu, Min Wang, Wengang Zhou, Houqiang Li
Since our re-ranking model is not directly involved with the visual feature used in the initial retrieval, it is ready to be applied to retrieval result lists obtained from various retrieval algorithms.
no code implementations • 12 Sep 2021 • Libing Wu, Min Wang, Dan Wu, Jia Wu
Then, to efficiently utilize the historical state information of the intersection, we design a sequence model with the temporal convolutional network (TCN) to capture the historical information and further merge it with the spatial information to improve its performance.
no code implementations • 23 Aug 2021 • Jian Zhao, Gang Wang, Jianan Li, Lei Jin, Nana Fan, Min Wang, Xiaojuan Wang, Ting Yong, Yafeng Deng, Yandong Guo, Shiming Ge, Guodong Guo
The 2nd Anti-UAV Workshop \& Challenge aims to encourage research in developing novel and accurate methods for multi-scale object tracking.
1 code implementation • CVPR 2021 • Zedong Tang, Fenlong Jiang, Maoguo Gong, Hao Li, Yue Wu, Fan Yu, Zidong Wang, Min Wang
For the fully connected layers, by utilizing the low-rank property of Kronecker factors of Fisher information matrix, our method only requires inverting a small matrix to approximate the curvature with desirable accuracy.
no code implementations • 3 Jun 2021 • Jie He, Min Wang
How does the control power of corporate shareholder arise?
1 code implementation • Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021 • Zedong Tang, Fenlong Jiang, Maoguo Gong, Hao Li, Yue Wu, Fan Yu, Zidong Wang, Min Wang
For the fully connected layers, by utilizing the low-rank property of Kronecker factors of Fisher information matrix, our method only requires inverting a small matrix to approximate the curvature with desirable accuracy.
no code implementations • AAAI Technical Track on Machine Learning 2021 • Mengyun Chen, Kaixin Gao, Xiaolei Liu, Zidong Wang, Ningxi Ni, Qian Zhang, Lei Chen, Chao Ding, ZhengHai Huang, Min Wang, Shuangling Wang, Fan Yu, Xinyuan Zhao, Dachuan Xu
It is well-known that second-order optimizer can accelerate the training of deep neural networks, however, the huge computation cost of second-order optimization makes it impractical to apply in real practice.
no code implementations • 28 Mar 2021 • Min Wang, Shanchen Pang, Tong Ding, Sibo Qiao, Xue Zhai, Shuo Wang, Neal N. Xiong, Zhengwen Huang
In addition, we design a utility prediction model for SSF based on the Generative Adversarial Networks (GAN) and Fully Connected Neural Network (FCNN).
no code implementations • 5 Jan 2021 • Jianfeng Lu, Yulong Lu, Min Wang
This paper concerns the a priori generalization analysis of the Deep Ritz Method (DRM) [W. E and B. Yu, 2017], a popular neural-network-based method for solving high dimensional partial differential equations.
1 code implementation • ICCV 2021 • Hui Wu, Min Wang, Wengang Zhou, Houqiang Li
To this end, we propose a novel deep local feature learning architecture to simultaneously focus on multiple discriminative local patterns in an image.
no code implementations • 24 Dec 2020 • Zedong Tang, Fenlong Jiang, Junke Song, Maoguo Gong, Hao Li, Fan Yu, Zidong Wang, Min Wang
Optimizers that further adjust the scale of gradient, such as Adam, Natural Gradient (NG), etc., despite widely concerned and used by the community, are often found poor generalization performance, compared with Stochastic Gradient Descent (SGD).
no code implementations • 27 Nov 2020 • Kai-Xin Gao, Xiao-Lei Liu, Zheng-Hai Huang, Min Wang, Shuangling Wang, Zidong Wang, Dachuan Xu, Fan Yu
Using second-order optimization methods for training deep neural networks (DNNs) has attracted many researchers.
no code implementations • 26 Nov 2020 • Sibo Qiao, Shanchen Pang, Gang Luo, Silin Pan, Xun Wang, Min Wang, Xue Zhai, Taotao Chen
The first step to automatically analyze fetal FC views is locating the fetal four crucial chambers of heart in a US image.
no code implementations • 21 Nov 2020 • Kai-Xin Gao, Xiao-Lei Liu, Zheng-Hai Huang, Min Wang, Zidong Wang, Dachuan Xu, Fan Yu
There have been many attempts to use second-order optimization methods for training deep neural networks.
no code implementations • 24 Mar 2020 • Min Wang, Feng Qiu, Wentao Liu, Chen Qian, Xiaowei Zhou, Lizhuang Ma
In this paper, we introduce body part segmentation as critical supervision.
Ranked #104 on
3D Human Pose Estimation
on Human3.6M
(PA-MPJPE metric)
no code implementations • 6 Jul 2018 • Chanseok Park, Min Wang
Based on the median and the median absolute deviation estimators, and the Hodges-Lehmann and Shamos estimators, robustified analogues of the conventional $t$-test statistic are proposed.
Applications
no code implementations • 2 Jul 2018 • Sihao Xue, Zhenyi Ying, Fan Mo, Min Wang, Jue Sun
Besides this, at most of time, ASR system is used to deal with real-time problem such as keyword spotting (KWS).
no code implementations • 13 Jun 2018 • Yating Wang, Siu Wun Cheung, Eric T. Chung, Yalchin Efendiev, Min Wang
Numerical results show that using deep learning and multiscale models, we can improve the forward models, which are conditioned to the available data.
no code implementations • SEMEVAL 2018 • Min Wang, Xiaobing Zhou
We perform the LSTM and BiLSTM model for the emotion intensity prediction.
no code implementations • 23 May 2018 • Min Wang, Xipeng Chen, Wentao Liu, Chen Qian, Liang Lin, Lizhuang Ma
In this paper, we propose a two-stage depth ranking based method (DRPose3D) to tackle the problem of 3D human pose estimation.
no code implementations • IJCNLP 2017 • Min Wang, Qingxun Liu, Peng Ding, Yongbin Li, Xiaobing Zhou
In this paper, we perform convolutional neural networks (CNN) to learn the joint representations of question-answer pairs first, then use the joint representations as the inputs of the long short-term memory (LSTM) with attention to learn the answer sequence of a question for labeling the matching quality of each answer.
1 code implementation • 15 Aug 2016 • Min Wang, Baoyuan Liu, Hassan Foroosh
A topological subdivisioning is adopted to reduce the connection between the input channels and output channels.
no code implementations • CVPR 2015 • Baoyuan Liu, Min Wang, Hassan Foroosh, Marshall Tappen, Marianna Pensky
Deep neural networks have achieved remarkable performance in both image classification and object detection problems, at the cost of a large number of parameters and computational complexity.