no code implementations • 19 May 2025 • Yuanbo Wang, Zhaoxuan Zhang, Jiajin Qiu, Dilong Sun, Zhengyu Meng, Xiaopeng Wei, Xin Yang
To overcome these limitations, we utilize tactile images to capture the local 3D information and propose a Touch2Shape model, which leverages a touch-conditioned diffusion model to explore and reconstruct the target shape from touch.
1 code implementation • 15 May 2025 • Xiang He, Dongcheng Zhao, Yang Li, Qingqun Kong, Xin Yang, Yi Zeng
Inspired by this biological mechanism, we explore the relationship between multimodal output and information from individual modalities, proposing an inverse effectiveness driven multimodal fusion (IEMF) strategy.
no code implementations • 12 May 2025 • Yi Zhang, Wenye Zhou, Ruonan Lin, Xin Yang, Hao Zheng
Traffic accident prediction and detection are critical for enhancing road safety, and vision-based traffic accident anticipation (Vision-TAA) has emerged as a promising approach in the era of deep learning. This paper reviews 147 recent studies, focusing on the application of supervised, unsupervised, and hybrid deep learning models for accident prediction, alongside the use of real-world and synthetic datasets. Current methodologies are categorized into four key approaches: image and video feature-based prediction, spatiotemporal feature-based prediction, scene understanding, and multimodal data fusion. While these methods demonstrate significant potential, challenges such as data scarcity, limited generalization to complex scenarios, and real-time performance constraints remain prevalent.
1 code implementation • 24 Apr 2025 • Lutao Jiang, Jiantao Lin, Kanghao Chen, Wenhang Ge, Xin Yang, Yifan Jiang, Yuanhuiyi Lyu, Xu Zheng, Yingcong Chen
As for texture branch, we use RGB images as input to obtain the textured mesh.
1 code implementation • 2 Apr 2025 • Jijun Xiang, Xuan Zhu, Xianqi Wang, Yu Wang, Hong Zhang, Fei Guo, Xin Yang
To address these challenges, we propose a novel completion-based method, named DEPTHOR, featuring advances in both the training strategy and model architecture.
no code implementations • 26 Mar 2025 • Yuhao Huang, Ao Chang, Haoran Dou, Xing Tao, Xinrui Zhou, Yan Cao, Ruobing Huang, Alejandro F Frangi, Lingyun Bao, Xin Yang, Dong Ni
(3) Implementation of a progressive curriculum learning strategy to enable agents to interact with the environment in a progressively challenging manner, thereby enhancing learning efficiency.
1 code implementation • 20 Mar 2025 • Xiangyu Li, Wanshu Fan, Yue Shen, Cong Wang, Wei Wang, Xin Yang, Qiang Zhang, Dongsheng Zhou
To address these limitations, we propose an Expectation Maximization Reconstruction Transformer (EMResformer) for single image rain streak removal.
no code implementations • 19 Mar 2025 • Yaofei Duan, Tao Tan, Zhiyuan Zhu, Yuhao Huang, Yuanji Zhang, Rui Gao, Patrick Cheong-Iao Pang, Xinru Gao, Guowei Tao, Xiang Cong, Zhou Li, Lianying Liang, Guangzhi He, Linliang Yin, Xuedong Deng, Xin Yang, Dong Ni
Fetal ultrasound (US) examinations require the acquisition of multiple planes, each providing unique diagnostic information to evaluate fetal development and screening for congenital anomalies.
no code implementations • 5 Mar 2025 • Gangwei Xu, Haotong Lin, Zhaoxing Zhang, Hongcheng Luo, Haiyang Sun, Xin Yang
Event cameras deliver visual information characterized by a high dynamic range and high temporal resolution, offering significant advantages in estimating optical flow for complex lighting conditions and fast-moving objects.
no code implementations • 5 Mar 2025 • Gangwei Xu, Jiaxin Liu, Xianqi Wang, Junda Cheng, Yong Deng, Jinliang Zang, Yurui Chen, Xin Yang
State-of-the-art stereo matching methods typically use costly 3D convolutions to aggregate a full cost volume, but their computational demands make mobile deployment challenging.
1 code implementation • 3 Mar 2025 • Jiantao Lin, Xin Yang, Meixi Chen, YingJie Xu, Dongyu Yan, Leyi Wu, Xinli Xu, Lie Xu, Shunsi Zhang, Ying-Cong Chen
The normal maps are then used to reconstruct a 3D mesh, and the multi-view images provide texture mapping, resulting in a complete 3D model.
1 code implementation • 3 Mar 2025 • Xuan Zhu, Jijun Xiang, Xianqi Wang, Longliang Liu, Yu Wang, Hong Zhang, Fei Guo, Xin Yang
However, due to the manufacturing constraints of compact devices and the inherent physical principles of imaging, dToF depth maps are sparse and noisy.
1 code implementation • 27 Feb 2025 • Yujie Li, Guannan Lai, Xin Yang, Yonghao Li, Marcello Bonsangue, Tianrui Li
Particularly, our HoliTrans effectively supports knowledge transfer for both known and unknown samples while dynamically updating representations of open samples during OWCL.
1 code implementation • 27 Feb 2025 • Guannan Lai, Yujie Li, Xiangkun Wang, Junbo Zhang, Tianrui Li, Xin Yang
In response, we first provide additional theoretical analysis, which reveals that when the similarity among a group of classes is lower, the model demonstrates increased robustness to the class order.
1 code implementation • 18 Feb 2025 • Xiang He, Dongcheng Zhao, Yiting Dong, Guobin Shen, Xin Yang, Yi Zeng
However, existing SNN models primarily focus on unimodal processing and lack efficient cross-modal information fusion, thereby limiting their effectiveness in real-world multimodal scenarios.
no code implementations • 14 Feb 2025 • Tao Fan, Hanlin Gu, Xuemei Cao, Chee Seng Chan, Qian Chen, Yiqiang Chen, Yihui Feng, Yang Gu, Jiaxiang Geng, Bing Luo, Shuoling Liu, Win Kent Ong, Chao Ren, Jiaqi Shao, Chuan Sun, Xiaoli Tang, Hong Xi Tae, Yongxin Tong, Shuyue Wei, Fan Wu, Wei Xi, Mingcong Xu, He Yang, Xin Yang, Jiangpeng Yan, Hao Yu, Han Yu, Teng Zhang, Yifei Zhang, Xiaojin Zhang, Zhenzhe Zheng, Lixin Fan, Qiang Yang
This paper provides a comprehensive summary of the ten challenging problems inherent in FedFMs, encompassing foundational theory, utilization of private data, continual learning, unlearning, Non-IID and graph data, bidirectional knowledge transfer, incentive mechanism design, game mechanism design, model watermarking, and efficiency.
no code implementations • 15 Jan 2025 • Xianqi Wang, Hao Yang, Gangwei Xu, Junda Cheng, Min Lin, Yong Deng, Jinliang Zang, Yurui Chen, Xin Yang
This pipeline utilizes arbitrary single images as left images and pseudo disparities generated by a monocular depth estimation model to synthesize high-quality corresponding right images.
1 code implementation • 15 Jan 2025 • Junda Cheng, Longliang Liu, Gangwei Xu, Xianqi Wang, Zhaoxing Zhang, Yong Deng, Jinliang Zang, Yurui Chen, Zhipeng Cai, Xin Yang
The refined monodepth is in turn guides stereo effectively at ill-posed regions.
1 code implementation • 9 Jan 2025 • Guannan Lai, Yihui Feng, Xin Yang, Xiaoyu Deng, Hao Yu, Shuyin Xia, Guoyin Wang, Tianrui Li
Federated Learning (FL) facilitates collaborative model training while prioritizing privacy by avoiding direct data sharing.
no code implementations • 30 Dec 2024 • Xin Yang, Xingrun Li, Heng Chang, Jinze Yang, Xihong Yang, Shengyu Tao, Ningkang Chang, Maiko Shigeno, Junfeng Wang, Dawei Yin, Erxue Min
The cold start problem is a challenging problem faced by most modern recommender systems.
no code implementations • 29 Dec 2024 • Xin Yang, Rachel Zheng, Madhumitha Mohan, Sonali Bhadra, Pansul Bhatt, Lingyu, Zhang, Rupesh Gupta
In the past, most search queries issued to a search engine were short and simple.
no code implementations • 24 Dec 2024 • Hao Yu, Xin Yang, Le Zhang, Hanlin Gu, Tianrui Li, Lixin Fan, Qiang Yang
Federated continual learning (FCL) allows each client to continually update its knowledge from task streams, enhancing the applicability of federated learning in real-world scenarios.
no code implementations • 23 Dec 2024 • Min Lin, Gangwei Xu, Yun Wang, Xianqi Wang, Xin Yang
In this paper, we propose a novel global-aware scene flow estimation network with global motion propagation, named FlowMamba.
no code implementations • 22 Dec 2024 • Zhaoxing Zhang, Junda Cheng, Gangwei Xu, Xiaoxiang Wang, Can Zhang, Xin Yang
Recent approaches to VO have significantly improved performance by using deep networks to predict optical flow between video frames.
1 code implementation • 20 Dec 2024 • Jun Ma, Feifei Li, Sumin Kim, Reza Asakereh, Bao-Hiep Le, Dang-Khoa Nguyen-Vu, Alexander Pfefferle, Muxin Wei, Ruochen Gao, Donghang Lyu, Songxiao Yang, Lennart Purucker, Zdravko Marinov, Marius Staring, Haisheng Lu, Thuy Thanh Dao, Xincheng Ye, Zhi Li, Gianluca Brugnara, Philipp Vollmuth, Martha Foltyn-Dumitru, Jaeyoung Cho, Mustafa Ahmed Mahmutoglu, Martin Bendszus, Irada Pflüger, Aditya Rastogi, Dong Ni, Xin Yang, Guang-Quan Zhou, Kaini Wang, Nicholas Heller, Nikolaos Papanikolopoulos, Christopher Weight, Yubing Tong, Jayaram K Udupa, Cahill J. Patrick, Yaqi Wang, Yifan Zhang, Francisco Contijoch, Elliot McVeigh, Xin Ye, Shucheng He, Robert Haase, Thomas Pinetz, Alexander Radbruch, Inga Krause, Erich Kobler, Jian He, Yucheng Tang, Haichun Yang, Yuankai Huo, Gongning Luo, Kaisar Kushibar, Jandos Amankulov, Dias Toleshbayev, Amangeldi Mukhamejan, Jan Egger, Antonio Pepe, Christina Gsaxner, Gijs Luijten, Shohei Fujita, Tomohiro Kikuchi, Benedikt Wiestler, Jan S. Kirschke, Ezequiel de la Rosa, Federico Bolelli, Luca Lumetti, Costantino Grana, Kunpeng Xie, Guomin Wu, Behrus Puladi, Carlos Martín-Isla, Karim Lekadir, Victor M. Campello, Wei Shao, Wayne Brisbane, Hongxu Jiang, Hao Wei, Wu Yuan, Shuangle Li, Yuyin Zhou, Bo wang
Promptable segmentation foundation models have emerged as a transformative approach to addressing the diverse needs in medical images, but most existing models require expensive computing, posing a big barrier to their adoption in clinical practice.
no code implementations • 19 Dec 2024 • Xin Yang, Omid Ardakanian
Sensor data collected by Internet of Things (IoT) devices carries detailed information about individuals in their vicinity.
no code implementations • 18 Dec 2024 • Yichen Li, Haozhao Wang, Wenchao Xu, Tianzhe Xiao, Hong Liu, Minzhu Tu, Yuying Wang, Xin Yang, Rui Zhang, Shui Yu, Song Guo, Ruixuan Li
To achieve high reliability and scalability in deploying this paradigm in distributed systems, it is essential to conquer challenges stemming from both spatial and temporal dimensions, manifesting as distribution shifts, catastrophic forgetting, heterogeneity, and privacy issues.
1 code implementation • 18 Dec 2024 • Yanhua Li, Xiaocao Ouyang, Chaofan Pan, Jie Zhang, Sen Zhao, Shuyin Xia, Xin Yang, Guoyin Wang, Tianrui Li
To tackle these issues, we propose a Multi-granularity Open intent classification method via adaptive Granular-Ball decision boundary (MOGB).
no code implementations • 16 Dec 2024 • Junda Cheng, Zhipeng Cai, Zhaoxing Zhang, Wei Yin, Matthias Muller, Michael Paulitsch, Xin Yang
We propose Robust Metric Visual Odometry (RoMeO), a novel method that resolves these issues leveraging priors from pre-trained depth models.
no code implementations • 13 Nov 2024 • YuTao Shen, HongYu Zhou, Xin Yang, Xuqi Lu, Ziyue Guo, Lixi Jiang, Yong He, Haiyan Cen
The SAM module achieved high segmentation accuracy, with a mean intersection over union (mIoU) of 0. 961 and an F1-score of 0. 980.
no code implementations • 13 Nov 2024 • Xiaoxiang Wang, Jiaxin Liu, Miaojie Feng, Zhaoxing Zhang, Xin Yang
3D Multi-Object Tracking (MOT), a fundamental component of environmental perception, is essential for intelligent systems like autonomous driving and robotic sensing.
no code implementations • 12 Oct 2024 • Hritam Basak, Hadi Tabatabaee, Shreekant Gayaka, Ming-Feng Li, Xin Yang, Cheng-Hao Kuo, Arnie Sen, Min Sun, Zhaozheng Yin
We propose bridging the gap between 2D and 3D diffusion models to address this limitation by integrating a two-stage frequency-based distillation loss with Gaussian Splatting.
1 code implementation • 5 Sep 2024 • Xixi Jiang, Dong Zhang, Xiang Li, Kangyi Liu, Kwang-Ting Cheng, Xin Yang
However, the limited availability of labeled foreground organs and the absence of supervision to distinguish unlabeled foreground organs from the background pose a significant challenge, which leads to a distribution mismatch between labeled and unlabeled pixels.
2 code implementations • 1 Sep 2024 • Gangwei Xu, Xianqi Wang, Zhaoxing Zhang, Junda Cheng, Chunyuan Liao, Xin Yang
We further propose a selective geometry feature fusion module to adaptively fuse multi-range and multi-granularity geometry features in MGEV.
no code implementations • 4 Aug 2024 • Xin Yang, Xuqi Lu, Pengyao Xie, Ziyue Guo, Hui Fang, Haowei Fu, Xiaochun Hu, Zhenbiao Sun, Haiyan Cen
The rice panicle traits significantly influence grain yield, making them a primary target for rice phenotyping studies.
1 code implementation • 4 Aug 2024 • Xiang He, Xiangxi Liu, Yang Li, Dongcheng Zhao, Guobin Shen, Qingqun Kong, Xin Yang, Yi Zeng
Specifically, we have enhanced the model's ability to discern subtle differences between event and background and improved the accuracy of event classification in our model.
no code implementations • 31 Jul 2024 • Junxuan Yu, Rusi Chen, Yongsong Zhou, Yanlin Chen, Yaofei Duan, Yuhao Huang, Han Zhou, Tan Tao, Xin Yang, Dong Ni
In this context, we propose an explainable and controllable method for echocardiography video generation, taking an initial frame and a motion curve as guidance.
no code implementations • 31 Jul 2024 • Zhe Liu, Xiliang Zhu, Tong Han, Yuhao Huang, Jian Wang, Lian Liu, Fang Wang, Dong Ni, Zhongshan Gou, Xin Yang
Since MR data is limited and has large intra-class variability, we propose an unsupervised out-of-distribution (OOD) detection method to identify MR rather than building a deep classifier.
Out-of-Distribution Detection
Out of Distribution (OOD) Detection
no code implementations • 31 Jul 2024 • Yuhao Huang, Xin Yang, Han Zhou, Yan Cao, Haoran Dou, Fajin Dong, Dong Ni
In this study, we propose a novel Robust Box prompt based SAM (\textbf{RoBox-SAM}) to ensure SAM's segmentation performance under prompts with different qualities.
1 code implementation • 20 Jul 2024 • Junjie Shi, Caozhi Shang, Zhaobin Sun, Li Yu, Xin Yang, Zengqiang Yan
In this paper, we, for the first time, formulate such a challenging setting and propose Preference-Aware Self-diStillatION (PASSION) for incomplete multi-modal medical image segmentation under imbalanced missing rates.
no code implementations • 5 Jul 2024 • Zhongnuo Yan, Xin Yang, Mingyuan Luo, Jiongquan Chen, Rusi Chen, Lian Liu, Dong Ni
In this context, we propose a novel method to exploit the long-range dependency management capabilities of the state space model (SSM) to address the above challenge.
1 code implementation • 2 Jul 2024 • Yangyang Xiang, Nannan Wu, Li Yu, Xin Yang, Kwang-Ting Cheng, Zengqiang Yan
We begin by evaluating the completeness of annotations at the client level using a designed indicator.
1 code implementation • 27 Jun 2024 • Hao Yu, Xin Yang, Xin Gao, Yan Kang, Hao Wang, Junbo Zhang, Tianrui Li
In addition, we design a selective prompt fusion mechanism for aggregating knowledge of global prompts distilled from different clients.
1 code implementation • 27 Jun 2024 • Zhaobin Sun, Nannan Wu, Junjie Shi, Li Yu, Xin Yang, Kwang-Ting Cheng, Zengqiang Yan
Experiments on two publicly-available medical datasets validate the superiority of FedMLP against the state-of-the-art both federated semi-supervised and noisy label learning approaches under task heterogeneity.
no code implementations • 25 Jun 2024 • Xin Yang, Heng Chang, Zhijian Lai, Jinze Yang, Xingrun Li, Yu Lu, Shuaiqiang Wang, Dawei Yin, Erxue Min
Cross-Domain Recommendation (CDR) seeks to utilize knowledge from different domains to alleviate the problem of data sparsity in the target recommendation domain, and it has been gaining more attention in recent years.
no code implementations • 20 Jun 2024 • Yang Wang, Haiyang Mei, Qirui Bao, Ziqi Wei, Mike Zheng Shou, Haizhou Li, Bo Dong, Xin Yang
We introduce a novel multimodality synergistic knowledge distillation scheme tailored for efficient single-eye motion recognition tasks.
1 code implementation • 12 Jun 2024 • Fengtian Lang, Ruiye Ming, Zikang Yuan, Xin Yang
In this work, we propose a fast and robust Image Feature Triangle Descriptor (IFTD) based on the STD method, aimed at improving the efficiency and accuracy of place recognition in driving scenarios.
1 code implementation • 3 Jun 2024 • Zehui Lin, Zhuoneng Zhang, Xindi Hu, Zhifan Gao, Xin Yang, Yue Sun, Dong Ni, Tao Tan
Ultrasound is widely used in clinical practice due to its affordability, portability, and safety.
no code implementations • 20 May 2024 • Jinxin Xu, Haixin Wu, Yu Cheng, Liyang Wang, Xin Yang, Xintong Fu, Yuelong Su
This paper addresses the optimization of scheduling for workers at a logistics depot using a combination of genetic algorithm and simulated annealing algorithm.
no code implementations • 4 May 2024 • Xin Gao, Xin Yang, Hao Yu, Yan Kang, Tianrui Li
Federated Class-Incremental Learning (FCIL) focuses on continually transferring the previous knowledge to learn new classes in dynamic Federated Learning (FL).
no code implementations • 17 Mar 2024 • Jingcheng Jiang, Haiyin Piao, Yu Fu, Yihang Hao, Chuanlu Jiang, Ziqi Wei, Xin Yang
Furthermore, we construct a dogfight scenario for aerial agents to demonstrate the practicality of the PDO algorithm.
1 code implementation • 15 Mar 2024 • Xuemei Cao, Xin Yang, Shuyin Xia, Guoyin Wang, Tianrui Li
To this end, the proposed CFS method combines the strengths of continual learning (CL) with granular-ball computing (GBC), which focuses on constructing a granular-ball knowledge base to detect unknown classes and facilitate the transfer of previously learned knowledge for further feature selection.
1 code implementation • CVPR 2024 • Junda Cheng, Wei Yin, Kaixuan Wang, Xiaozhi Chen, Shijie Wang, Xin Yang
In this work, we propose a new robustness benchmark to evaluate the depth estimation system under various noisy pose settings.
Ranked #1 on
Monocular Depth Estimation
on DDAD
no code implementations • CVPR 2024 • Gangwei Xu, Yujin Wang, Jinwei Gu, Tianfan Xue, Xin Yang
HDRFlow has three novel designs: an HDR-domain alignment loss (HALoss), an efficient flow network with a multi-size large kernel (MLK), and a new HDR flow training scheme.
1 code implementation • CVPR 2024 • Xianqi Wang, Gangwei Xu, Hao Jia, Xin Yang
Stereo matching methods based on iterative optimization, like RAFT-Stereo and IGEV-Stereo, have evolved into a cornerstone in the field of stereo matching.
no code implementations • 18 Feb 2024 • Jian Wang, Xin Yang, Xiaohong Jia, Wufeng Xue, Rusi Chen, Yanlin Chen, Xiliang Zhu, Lian Liu, Yan Cao, Jianqiao Zhou, Dong Ni, Ning Gu
In this study, we proposed a multi-view contrastive self-supervised method to improve thyroid nodule classification and segmentation performance with limited manual labels.
no code implementations • 25 Jan 2024 • Chaofan Pan, Xin Yang, Hao Wang, Wei Wei, Tianrui Li
Despite the progress in continual reinforcement learning (CRL), existing methods often suffer from insufficient knowledge transfer, particularly when the tasks are diverse.
no code implementations • 15 Jan 2024 • Xin Yang, Wending Yan, Yuan Yuan, Michael Bi Mi, Robby T. Tan
They struggle to acquire new knowledge while also retaining previously learned knowledge. To address these problems, we propose a semantic segmentation method for multiple adverse weather conditions that incorporates adaptive knowledge acquisition, pseudolabel blending, and weather composition replay.
1 code implementation • 5 Jan 2024 • Wen Dong, Haiyang Mei, Ziqi Wei, Ao Jin, Sen Qiu, Qiang Zhang, Xin Yang
Car detection is an important task that serves as a crucial prerequisite for many automated driving functions.
no code implementations • 28 Dec 2023 • Miaojie Feng, Longliang Liu, Hao Jia, Gangwei Xu, Xin Yang
This paper introduces FlowDA, an unsupervised domain adaptive (UDA) framework for optical flow estimation.
1 code implementation • 28 Dec 2023 • Zikang Yuan, Jie Deng, Ruiye Ming, Fengtian Lang, Xin Yang
Existing LiDAR-inertial-visual odometry and mapping (LIV-SLAM) systems mainly utilize the LiDAR-inertial odometry (LIO) module for structure reconstruction and the visual-inertial odometry (VIO) module for color rendering.
no code implementations • 27 Dec 2023 • Xin Yang, Hao Yu, Xin Gao, Hao Wang, Junbo Zhang, Tianrui Li
The key objective of FCL is to fuse heterogeneous knowledge from different clients and retain knowledge of previous tasks while learning on new ones.
no code implementations • 22 Dec 2023 • Yujie Li, Xin Yang, Hao Wang, Xiangkun Wang, Tianrui Li
This paper studies the problem of continual learning in an open-world scenario, referred to as Open-world Continual Learning (OwCL).
1 code implementation • 6 Dec 2023 • Gangwei Xu, Shujun Chen, Hao Jia, Miaojie Feng, Xin Yang
The full 4D cost volume in Recurrent All-Pairs Field Transforms (RAFT) or global matching by Transformer achieves impressive performance for optical flow estimation.
1 code implementation • 30 Nov 2023 • Xin Yang, Elyssa Sliheet, Reece Iriye, Daniel Reynolds, Weihua Geng
By considering the advantages of current algorithms and computer hardware, we focus on the parallelization of the treecode-accelerated boundary integral (TABI) PB solver using the Message Passing Interface (MPI) on CPUs and the direct-sum boundary integral (DSBI) PB solver using KOKKOS on GPUs.
1 code implementation • CVPR 2024 • Yixun Liang, Xin Yang, Jiantao Lin, Haodong Li, Xiaogang Xu, Yingcong Chen
The recent advancements in text-to-3D generation mark a significant milestone in generative models, unlocking new possibilities for creating imaginative 3D assets across various real-world scenarios.
1 code implementation • 4 Nov 2023 • Miaojie Feng, Junda Cheng, Hao Jia, Longliang Liu, Gangwei Xu, Qingyong Hu, Xin Yang
This architecture mitigates the multi-peak distribution problem in matching through the multi-peak lookup strategy, and integrates the coarse-to-fine concept into the iterative framework via the cascade search range.
no code implementations • 30 Oct 2023 • Chaoyu Chen, Xin Yang, Yuhao Huang, Wenlong Shi, Yan Cao, Mingyuan Luo, Xindi Hu, Lei Zhue, Lequan Yu, Kejuan Yue, Yuanji Zhang, Yi Xiong, Dong Ni, Weijun Huang
However, accurately estimating the 3D fetal pose in US volume has several challenges, including poor image quality, limited GPU memory for tackling high dimensional data, symmetrical or ambiguous anatomical structures, and considerable variations in fetal poses.
no code implementations • 28 Oct 2023 • Hao Wang, Zhi-Qi Cheng, Jingdong Sun, Xin Yang, Xiao Wu, Hongyang Chen, Yan Yang
Multi-view or even multi-modal data is appealing yet challenging for real-world applications.
no code implementations • 22 Oct 2023 • Yingkai Fu, Meng Li, Wenxi Liu, Yuanchen Wang, Jiqing Zhang, BaoCai Yin, Xiaopeng Wei, Xin Yang
We demonstrate that our tracker has superior performance against the state-of-the-art trackers in terms of both accuracy and efficiency.
1 code implementation • 11 Oct 2023 • Zhiwei Wang, Qiang Hu, Hongkuan Shi, Li He, Man He, Wenxuan Dai, Yinjiao Tian, Xin Yang, Mei Liu, Qiang Li
In response, we propose two innovative learning fashions, Improved Box-dice (IBox) and Contrastive Latent-Anchors (CLA), and combine them to train a robust box-supervised model IBoxCLA.
1 code implementation • 11 Oct 2023 • Yuxuan Cai, Dingkang Liang, Dongliang Luo, Xinwei He, Xin Yang, Xiang Bai
To alleviate this issue, we present a Discrepancy Aware Framework (DAF), which demonstrates robust performance consistently with simple and cheap strategies across different anomaly detection benchmarks.
no code implementations • 10 Oct 2023 • Yang Wang, Bo Dong, Ke Xu, Haiyin Piao, Yufei Ding, BaoCai Yin, Xin Yang
Hence, given different inputs, it requires different time for converging to an adversarial sample.
1 code implementation • 6 Oct 2023 • Haiwei Zhang, Jiqing Zhang, Bo Dong, Pieter Peers, Wenwei Wu, Xiaopeng Wei, Felix Heide, Xin Yang
To the best of our knowledge, our method is the first eye-based emotion recognition method that leverages event-based cameras and spiking neural network.
1 code implementation • 29 Sep 2023 • Zhongnuo Yan, Tong Han, Yuhao Huang, Lian Liu, Han Zhou, Jiongquan Chen, Wenlong Shi, Yan Cao, Xin Yang, Dong Ni
In this paper, we propose the first foundation model, named iMOS, for MOS in medical images.
no code implementations • 27 Aug 2023 • Xin Yang, Yi Lin, Zhiwei Wang, Xin Li, Kwang-Ting Cheng
A method for measuring the synthesis complexity is proposed to automatically determine the synthesis order in our sequential GAN.
no code implementations • 26 Aug 2023 • Ao Chang, Xing Tao, Xin Yang, Yuhao Huang, Xinrui Zhou, Jiajun Zeng, Ruobing Huang, Dong Ni
It can prevent the highly unfavorable scenarios, such as encountering a blank mask as the initial input after the first interaction.
no code implementations • 26 Aug 2023 • Chaoyu Chen, Xin Yang, Rusi Chen, Junxuan Yu, Liwei Du, Jian Wang, Xindi Hu, Yan Cao, Yingying Liu, Dong Ni
In this paper, we introduce a novel Fourier-anchor-based DTS framework called Fourier Feature Pyramid Network (FFPN) to address the aforementioned issues.
no code implementations • 16 Aug 2023 • Han Zhou, Dong Ni, Ao Chang, Xinrui Zhou, Rusi Chen, Yanlin Chen, Lian Liu, Jiamin Liang, Yuhao Huang, Tong Han, Zhe Liu, Deng-Ping Fan, Xin Yang
Second, to better preserve the integrity and textural information of US images, we implemented a dual-decoder that decouples the content and textural features in the generator.
2 code implementations • 10 Aug 2023 • Jun Ma, Yao Zhang, Song Gu, Cheng Ge, Shihao Ma, Adamo Young, Cheng Zhu, Kangkang Meng, Xin Yang, Ziyan Huang, Fan Zhang, Wentao Liu, YuanKe Pan, Shoujin Huang, Jiacheng Wang, Mingze Sun, Weixin Xu, Dengqiang Jia, Jae Won Choi, Natália Alves, Bram de Wilde, Gregor Koehler, Yajun Wu, Manuel Wiesenfarth, Qiongjie Zhu, Guoqiang Dong, Jian He, the FLARE Challenge Consortium, Bo wang
The best-performing algorithms successfully generalized to holdout external validation sets, achieving a median DSC of 89. 5\%, 90. 9\%, and 88. 3\% on North American, European, and Asian cohorts, respectively.
no code implementations • 10 Aug 2023 • Jun Ma, Ronald Xie, Shamini Ayyadhury, Cheng Ge, Anubha Gupta, Ritu Gupta, Song Gu, Yao Zhang, Gihun Lee, Joonkee Kim, Wei Lou, Haofeng Li, Eric Upschulte, Timo Dickscheid, José Guilherme de Almeida, Yixin Wang, Lin Han, Xin Yang, Marco Labagnara, Vojislav Gligorovski, Maxime Scheder, Sahand Jamal Rahi, Carly Kempster, Alice Pollitt, Leon Espinosa, Tâm Mignot, Jan Moritz Middeke, Jan-Niklas Eckardt, Wangkai Li, Zhaoyang Li, Xiaochen Cai, Bizhe Bai, Noah F. Greenwald, David Van Valen, Erin Weisbart, Beth A. Cimini, Trevor Cheung, Oscar Brück, Gary D. Bader, Bo wang
This benchmark and the improved algorithm offer promising avenues for more accurate and versatile cell analysis in microscopy imaging.
no code implementations • 16 Jul 2023 • Yeqi Gao, Zhao Song, Xin Yang, Ruizhe Zhang
It is well-known that quantum machine has certain computational advantages compared to the classical machine.
1 code implementation • ICCV 2023 • Tianyi Shi, Xiaohuan Ding, Liang Zhang, Xin Yang
Curvilinear object segmentation is critical for many applications.
no code implementations • 28 Jun 2023 • Mingyuan Luo, Xin Yang, Zhongnuo Yan, Junyu Li, Yuanji Zhang, Jiongquan Chen, Xindi Hu, Jikuan Qian, Jun Cheng, Dong Ni
Ultrasound (US) imaging is a popular tool in clinical diagnosis, offering safety, repeatability, and real-time capabilities.
1 code implementation • 26 Jun 2023 • Haoran Dou, Ning Bi, Luyi Han, Yuhao Huang, Ritse Mann, Xin Yang, Dong Ni, Nishant Ravikumar, Alejandro F. Frangi, Yunzhi Huang
In this study, we construct a registration model based on the gradient surgery mechanism, named GSMorph, to achieve a hyperparameter-free balance on multiple losses.
1 code implementation • 21 Jun 2023 • Wentao Liu, Tong Tian, Lemeng Wang, Weijin Xu, Lei LI, Haoyuan Li, Wenyi Zhao, Siyu Tian, Xipeng Pan, Huihua Yang, Feng Gao, Yiming Deng, Xin Yang, Ruisheng Su
In this work, we introduce DIAS, a dataset specifically developed for IA segmentation in DSA sequences.
1 code implementation • 19 Jun 2023 • Zhiwei Wang, Junlin Xian, Kangyi Liu, Xin Li, Qiang Li, Xin Yang
Mammogram image is important for breast cancer screening, and typically obtained in a dual-view form, i. e., cranio-caudal (CC) and mediolateral oblique (MLO), to provide complementary information.
1 code implementation • 6 Jun 2023 • Lian Liu, Han Zhou, Jiongquan Chen, Sijing Liu, Wenlong Shi, Dong Ni, Deng-Ping Fan, Xin Yang
Deep neural networks have been widely applied in dichotomous medical image segmentation (DMIS) of many anatomical structures in several modalities, achieving promising performance.
1 code implementation • 5 Jun 2023 • Xinrui Zhou, Yuhao Huang, Wufeng Xue, Xin Yang, Yuxin Zou, Qilong Ying, Yuanji Zhang, Jia Liu, Jie Ren, Dong Ni
First, to avoid the requirement of laborious and unreliable annotation, we propose a novel and effective video classification network for weakly-supervised CSG.
no code implementations • 5 Jun 2023 • Yuhao Huang, Xin Yang, Xiaoqiong Huang, Xinrui Zhou, Haozhe Chi, Haoran Dou, Xindi Hu, Jian Wang, Xuedong Deng, Dong Ni
Second, we introduce a regularization technique that utilizes style interpolation consistency in the frequency space to encourage self-consistency in the logit space of the model output.
1 code implementation • CVPR 2023 • Jiqing Zhang, Yuanchen Wang, Wenxi Liu, Meng Li, Jinpeng Bai, BaoCai Yin, Xin Yang
The alignment module is responsible for cross-style and cross-frame-rate alignment between frame and event modalities under the guidance of the moving cues furnished by events.
no code implementations • 18 May 2023 • Yu Xiao, Xin Yang, Sijuan Huang, Lihua Guo
Medical image segmentation is particularly critical as a prerequisite for relevant quantitative analysis in the treatment of clinical diseases.
no code implementations • 11 May 2023 • Pengyao Xie, Zhihong Ma, Ruiming Du, Xin Yang, Haiyan Cen
The integrity of the whole-plant data was improved by an average of 23. 6% compared to the fixed viewpoints alone.
no code implementations • 8 May 2023 • Yeqi Gao, Zhao Song, Xin Yang, Yufa Zhou
Large language models (LLMs), especially those based on the Transformer architecture, have had a profound impact on various aspects of daily life, such as natural language processing, content generation, research methodologies, and more.
1 code implementation • 28 Apr 2023 • Yuhao Huang, Xin Yang, Lian Liu, Han Zhou, Ao Chang, Xinrui Zhou, Rusi Chen, Junxuan Yu, Jiongquan Chen, Chaoyu Chen, Sijing Liu, Haozhe Chi, Xindi Hu, Kejuan Yue, Lei LI, Vicente Grau, Deng-Ping Fan, Fajin Dong, Dong Ni
To fully validate SAM's performance on medical data, we collected and sorted 53 open-source datasets and built a large medical segmentation dataset with 18 modalities, 84 objects, 125 object-modality paired targets, 1050K 2D images, and 6033K masks.
no code implementations • 14 Apr 2023 • Sijing Liu, Qilong Ying, Shuangchi He, Xin Yang, Dong Ni, Ruobing Huang
Ultrasound is the primary modality to examine fetal growth during pregnancy, while the image quality could be affected by various factors.
1 code implementation • CVPR 2023 • Chenyang Qi, Xin Yang, Ka Leong Cheng, Ying-Cong Chen, Qifeng Chen
Then, an efficient frequency-aware decoder reconstructs a high-fidelity HR image from the LR one in real time.
1 code implementation • CVPR 2023 • Gangwei Xu, Xianqi Wang, Xiaohuan Ding, Xin Yang
The proposed IGEV-Stereo builds a combined geometry encoding volume that encodes geometry and context information as well as local matching details, and iteratively indexes it to update the disparity map.
Ranked #3 on
Omnnidirectional Stereo Depth Estimation
on Helvipad
no code implementations • 8 Mar 2023 • Sanju Xaviar, Xin Yang, Omid Ardakanian
Compared to 2 related robust fusion architectures, Centaur is more robust, achieving 11. 59-17. 52% higher accuracy in HAR, especially in the presence of consecutive missing data in multiple sensor channels.
no code implementations • 4 Mar 2023 • Yun Wang, Cheng Chi, Xin Yang
Scene flow estimation, which predicts the 3D motion of scene points from point clouds, is a core task in autonomous driving and many other 3D vision applications.
no code implementations • 13 Feb 2023 • Mimee Xu, Jiankai Sun, Xin Yang, Kevin Yao, Chong Wang
Without incurring the cost of re-training, and without degrading the model unnecessarily, we develop Unlearn-ALS by making a few key modifications to the fine-tuning procedure under Alternating Least Squares optimisation, thus applicable to any bi-linear models regardless of the training procedure.
1 code implementation • 18 Jan 2023 • Shangyu Xie, Xin Yang, Yuanshun Yao, Tianyi Liu, Taiqing Wang, Jiankai Sun
In this work, we step further to study the leakage in the scenario of the regression model, where the private labels are continuous numbers (instead of discrete labels in classification).
1 code implementation • 7 Jan 2023 • Gangwei Xu, Huan Zhou, Xin Yang
In this paper, we propose CGI-Stereo, a novel neural network architecture that can concurrently achieve real-time performance, competitive accuracy, and strong generalization ability.
1 code implementation • CVPR 2023 • Haiyang Mei, Zuowen Wang, Xin Yang, Xiaopeng Wei, Tobi Delbruck
The polarization event camera PDAVIS is a novel bio-inspired neuromorphic vision sensor that reports both conventional polarization frames and asynchronous, continuously per-pixel polarization brightness changes (polarization events) with fast temporal resolution and large dynamic range.
no code implementations • ICCV 2023 • Zhaoxuan Zhang, Bo Dong, Tong Li, Felix Heide, Pieter Peers, BaoCai Yin, Xin Yang
In this paper, we present Iterative Symmetry Completion Network (ISCNet), a single depth-image shape completion method that exploits reflective symmetry cues to obtain more detailed shapes.
no code implementations • ICCV 2023 • Yu Qiao, Bo Dong, Ao Jin, Yu Fu, Seung-Hwan Baek, Felix Heide, Pieter Peers, Xiaopeng Wei, Xin Yang
In this paper, we present the first polarization-guided video glass segmentation propagation solution (PGVS-Net) that can robustly and coherently propagate glass segmentation in RGB-P video sequences.
no code implementations • ICCV 2023 • Yun Wang, Cheng Chi, Min Lin, Xin Yang
This approach circulates high-resolution estimated information (scene flow and feature) from the preceding iteration back to the low-resolution layer of the current iteration.
1 code implementation • ICCV 2023 • Xin Yang, Xiaogang Xu, Yingcong Chen
In this paper, we propose a novel framework that enhances the fidelity of human face inversion by designing a new module to decompose the input images to ID and OOD partitions with invertibility masks.
1 code implementation • 24 Nov 2022 • Xin Yang, Michael Bi Mi, Yuan Yuan, Xin Wang, Robby T. Tan
In our DA framework, we retain the depth and background information during the domain feature alignment.
1 code implementation • 12 Nov 2022 • Tianyi Shi, Xiaohuan Ding, Wei Zhou, Feng Pan, Zengqiang Yan, Xiang Bai, Xin Yang
Vessel segmentation is crucial in many medical image applications, such as detecting coronary stenoses, retinal vessel diseases and brain aneurysms.
no code implementations • 20 Oct 2022 • Xiaowen Liu, Renhua Wang, Hongtao Huo, Xin Yang, Jing Li
The GAN-based infrared and visible image fusion methods have gained ever-increasing attention due to its effectiveness and superiority.
Generative Adversarial Network
Infrared And Visible Image Fusion
1 code implementation • 15 Oct 2022 • Hongkuan Shi, Zhiwei Wang, Ying Zhou, Dun Li, Xin Yang, Qiang Li
The learned knowledge flows across branches along two directions: a cross direction (disparity guides distribution in ACS) and a parallel direction (disparity guides disparity in APS).
no code implementations • 13 Oct 2022 • Yu Qiao, Ziqi Wei, Yuhao Liu, Yuxin Wang, Dongsheng Zhou, Qiang Zhang, Xin Yang
This paper reviews recent deep-learning-based matting research and conceives our wider and higher motivation for image matting.
no code implementations • 13 Oct 2022 • Yu Qiao, Yuhao Liu, Ziqi Wei, Yuxin Wang, Qiang Cai, Guofeng Zhang, Xin Yang
In this paper, we propose an end-to-end Hierarchical and Progressive Attention Matting Network (HAttMatting++), which can better predict the opacity of the foreground from single RGB images without additional input.
no code implementations • 12 Oct 2022 • Zhaoxuan Zhang, Xiaoguang Han, Bo Dong, Tong Li, BaoCai Yin, Xin Yang
Given a single RGB-D image, our method first predicts its semantic segmentation map and goes through the 3D volume branch to obtain a volumetric scene reconstruction as a guide to the next view inpainting step, which attempts to make up the missing information; the third step involves projecting the volume under the same view of the input, concatenating them to complete the current view RGB-D and segmentation map, and integrating all RGB-D and segmentation maps into the point cloud.
no code implementations • 12 Oct 2022 • Yuanyuan Liu, Chengjiang Long, Zhaoxuan Zhang, Bokai Liu, Qiang Zhang, BaoCai Yin, Xin Yang
3D scene graph generation (SGG) has been of high interest in computer vision.
1 code implementation • 9 Oct 2022 • Yu Cai, Hao Chen, Xin Yang, Yu Zhou, Kwang-Ting Cheng
Due to the high-cost annotations of abnormal images, most methods utilize only known normal images during training and identify samples deviating from the normal profile as anomalies in the testing phase.
1 code implementation • 24 Sep 2022 • Xin Yang, Omid Ardakanian
This paper proposes a sensor data anonymization model that is trained on decentralized data and strikes a desirable trade-off between data utility and privacy, even in heterogeneous settings where the sensor data have different underlying distributions.
3 code implementations • 23 Sep 2022 • Gangwei Xu, Yun Wang, Junda Cheng, Jinhui Tang, Xin Yang
In this paper, we present a novel cost volume construction method, named attention concatenation volume (ACV), which generates attention weights from correlation clues to suppress redundant information and enhance matching-related information in the concatenation volume.
1 code implementation • 21 Sep 2022 • Dong Zhang, Yi Lin, Hao Chen, Zhuotao Tian, Xin Yang, Jinhui Tang, Kwang Ting Cheng
Over the past few years, the rapid development of deep learning technologies for computer vision has significantly improved the performance of medical image segmentation (MedISeg).
no code implementations • 18 Sep 2022 • Ramchandra Rimal, Mitchell Brannon, Yingxin Wang, Xin Yang
The autism dataset is studied to identify the differences between autistic and healthy groups.
no code implementations • 10 Sep 2022 • Haiyang Mei, Xin Yang, Letian Yu, Qiang Zhang, Xiaopeng Wei, Rynson W. H. Lau
Glass is very common in our daily life.
no code implementations • 6 Sep 2022 • Letian Yu, Haiyang Mei, Wen Dong, Ziqi Wei, Li Zhu, Yuxin Wang, Xin Yang
First, we attempt to bridge the characteristic gap between different levels of features by developing a Discriminability Enhancement (DE) module which enables level-specific features to be a more discriminative representation, alleviating the features incompatibility for fusion.
1 code implementation • 25 Aug 2022 • Jiankai Sun, Xin Yang, Yuanshun Yao, Junyuan Xie, Di wu, Chong Wang
Federated learning (FL) has gained significant attention recently as a privacy-enhancing tool to jointly train a machine learning model by multiple participants.
1 code implementation • 14 Jul 2022 • Chenyang Qi, Junming Chen, Xin Yang, Qifeng Chen
Recent multi-output inference works propagate the bidirectional temporal feature with a parallel or recurrent framework, which either suffers from performance drops on the temporal edges of clips or can not achieve online inference.
Ranked #1 on
Video Denoising
on CRVD
no code implementations • 9 Jul 2022 • Van T. Manh, Jianqiao Zhou, Xiaohong Jia, Zehui Lin, Wenwen Xu, Zihan Mei, Yijie Dong, Xin Yang, Ruobing Huang, Dong Ni
To overcome this, we propose a novel deep learning framework called multi-attribute attention network (MAA-Net) that is designed to mimic the clinical diagnosis process.
no code implementations • 1 Jul 2022 • Jiamin Liang, Xin Yang, Yuhao Huang, Kai Liu, Xinrui Zhou, Xindi Hu, Zehui Lin, Huanjia Luo, Yuanji Zhang, Yi Xiong, Dong Ni
First, leveraging the advantages of self- and fully-supervised learning, our proposed system is trained in weakly-supervised manner for keypoint detection.
no code implementations • 1 Jul 2022 • Yuxin Zou, Haoran Dou, Yuhao Huang, Xin Yang, Jikuan Qian, Chaojiong Zhen, Xiaodan Ji, Nishant Ravikumar, Guoqiang Chen, Weijun Huang, Alejandro F. Frangi, Dong Ni
First, we formulate SP localization in 3D US as a tangent-point-based problem in RL to restructure the action space and significantly reduce the search space.
no code implementations • 1 Jul 2022 • Yuhao Huang, Xin Yang, Xiaoqiong Huang, Jiamin Liang, Xinrui Zhou, Cheng Chen, Haoran Dou, Xindi Hu, Yan Cao, Dong Ni
Deep segmentation models often face the failure risks when the testing image presents unseen distributions.
no code implementations • 1 Jul 2022 • Mingyuan Luo, Xin Yang, Hongzhang Wang, Liwei Du, Dong Ni
Our contribution is two-fold.
no code implementations • 1 Jul 2022 • Chaoyu Chen, Xin Yang, Ruobing Huang, Xindi Hu, Yankai Huang, Xiduo Lu, Xinrui Zhou, Mingyuan Luo, Yinyu Ye, Xue Shuang, Juzheng Miao, Yi Xiong, Dong Ni
In this work, we propose to revisit the classic regression tasks with novel investigations on directly optimizing the fine-grained correlation losses.
1 code implementation • 28 Jun 2022 • Nannan Wu, Li Yu, Xin Yang, Kwang-Ting Cheng, Zengqiang Yan
In this paper, we present a privacy-preserving FL method named FedIIC to combat class imbalance from two perspectives: feature learning and classifier learning.
no code implementations • 16 Jun 2022 • Ruihan Wu, Xin Yang, Yuanshun Yao, Jiankai Sun, Tianyi Liu, Kilian Q. Weinberger, Chong Wang
Differentially Private (DP) data release is a promising technique to disseminate data without compromising the privacy of data subjects.
1 code implementation • 8 Jun 2022 • Yu Cai, Hao Chen, Xin Yang, Yu Zhou, Kwang-Ting Cheng
Subsequently, inter-discrepancy between the two modules, and intra-discrepancy inside the module that is trained on only normal images are designed as anomaly scores to indicate anomalies.
no code implementations • 5 Jun 2022 • Zhiwei Wang, Jinxin Lv, Yunqiao Yang, Yuanhuai Liang, Yi Lin, Qiang Li, Xin Li, Xin Yang
Vertebral landmark localization is a crucial step for variant spine-related clinical applications, which requires detecting the corner points of 17 vertebrae.
no code implementations • 24 May 2022 • Jiankai Sun, Xin Yang, Yuanshun Yao, Junyuan Xie, Di wu, Chong Wang
In this work, we propose two evaluation algorithms that can more accurately compute the widely used AUC (area under curve) metric when using label DP in vFL.
1 code implementation • 4 May 2022 • Jeffry Wicaksana, Zengqiang Yan, Dong Zhang, Xijie Huang, Huimin Wu, Xin Yang, Kwang-Ting Cheng
To relax this assumption, in this work, we propose a label-agnostic unified federated learning framework, named FedMix, for medical image segmentation based on mixed image labels.
no code implementations • 14 Apr 2022 • Jiamin Liang, Xin Yang, Yuhao Huang, Haoming Li, Shuangchi He, Xindi Hu, Zejian Chen, Wufeng Xue, Jun Cheng, Dong Ni
Our main contributions include: 1) we present the first work that can synthesize realistic B-mode US images with high-resolution and customized texture editing features; 2) to enhance structural details of generated images, we propose to introduce auxiliary sketch guidance into a conditional GAN.
no code implementations • 14 Apr 2022 • Jikuan Qian, Rui Li, Xin Yang, Yuhao Huang, Mingyuan Luo, Zehui Lin, Wenhui Hong, Ruobing Huang, Haining Fan, Dong Ni, Jun Cheng
The hybrid framework consists of a pre-trained backbone and several searched cells (i. e., network building blocks), which takes advantage of the strengths of both NAS and the expert knowledge from existing convolutional neural networks.
1 code implementation • 12 Apr 2022 • Wenjun Chen, Chunling Yang, Xin Yang
In recent years, deep learning-based image compressive sensing (ICS) methods have achieved brilliant success.
1 code implementation • CVPR 2022 • Xin Tian, Ke Xu, Xin Yang, Lin Du, BaoCai Yin, Rynson W. H. Lau
We observe that spatial attention works concurrently with object-based attention in the human visual recognition system.
2 code implementations • CVPR 2022 • Gangwei Xu, Junda Cheng, Peng Guo, Xin Yang
Stereo matching is a fundamental building block for many vision and robotics applications.
Ranked #1 on
Stereo Depth Estimation
on Spring
no code implementations • 4 Mar 2022 • Xin Yang, Jiankai Sun, Yuanshun Yao, Junyuan Xie, Chong Wang
Split learning is a distributed training framework that allows multiple parties to jointly train a machine learning model over vertically partitioned data (partitioned by attributes).
no code implementations • 2 Mar 2022 • Jiankai Sun, Xin Yang, Yuanshun Yao, Chong Wang
As the raw labels often contain highly sensitive information, some recent work has been proposed to prevent the label leakage from the backpropagated gradients effectively in vFL.
1 code implementation • 14 Jan 2022 • Kai-Ni Wang, Xin Yang, Juzheng Miao, Lei LI, Jing Yao, Ping Zhou, Wufeng Xue, Guang-Quan Zhou, Xiahai Zhuang, Dong Ni
Extensive experimental results on a publicly available dataset from Myocardial pathology segmentation combining multi-sequence CMR (MyoPS 2020) demonstrate our method can achieve promising performance compared with other state-of-the-art methods.
no code implementations • CVPR 2022 • Jiqing Zhang, Bo Dong, Haiwei Zhang, Jianchuan Ding, Felix Heide, BaoCai Yin, Xin Yang
In particular, the proposed architecture features a transformer module to provide global spatial information and a spiking neural network (SNN) module for extracting temporal cues.
no code implementations • CVPR 2022 • Haiyang Mei, Bo Dong, Wen Dong, Jiaxi Yang, Seung-Hwan Baek, Felix Heide, Pieter Peers, Xiaopeng Wei, Xin Yang
Transparent and semi-transparent materials pose significant challenges for existing scene understanding and segmentation algorithms due to their lack of RGB texture which impedes the extraction of meaningful features.
Ranked #2 on
Camouflaged Object Segmentation
on PCOD_1200
no code implementations • 28 Dec 2021 • Zexi Huang, Lihua Guo, Xin Yang, Sijuan Huang
SECP-Net extracts global and multi-size information flow with se connection (SEC) modules and a pyramid structure of network for improving the segmentation performance, especially that of small organs.
1 code implementation • CVPR 2022 • Ziqi Zhou, Lei Qi, Xin Yang, Dong Ni, Yinghuan Shi
For medical image segmentation, imagine if a model was only trained using MR images in source domain, how about its performance to directly segment CT images in target domain?
no code implementations • 11 Dec 2021 • Yu Qiao, Jincheng Zhu, Chengjiang Long, Zeyao Zhang, Yuxin Wang, Zhenjun Du, Xin Yang
Acquiring the most representative examples via active learning (AL) can benefit many data-dependent computer vision tasks by minimizing efforts of image-level or pixel-wise annotations.
no code implementations • 19 Nov 2021 • Xin Tian, Ke Xu, Xin Yang, BaoCai Yin, Rynson W. H. Lau
However, it is non-trivial to use only class labels to learn instance-aware saliency information, as salient instances with high semantic affinities may not be easily separated by the labels.
no code implementations • 10 Nov 2021 • Yi Lin, Jianchao Su, Xiang Wang, Xiang Li, Jingen Liu, Kwang-Ting Cheng, Xin Yang
We have evaluated our approach using the 20 CTPA test dataset from the PE challenge, achieving a sensitivity of 78. 9%, 80. 7% and 80. 7% at 2 false positives per volume at 0mm, 2mm and 5mm localization error, which is superior to the state-of-the-art methods.
no code implementations • 12 Oct 2021 • Xin Yang, Qingling Chang, Xinlin Liu, Yan Cui
In order to mitigate the boundary blur problem, we focus on the above two impact factors.
2 code implementations • ICCV 2021 • Jiqing Zhang, Xin Yang, Yingkai Fu, Xiaopeng Wei, BaoCai Yin, Bo Dong
Our approach's effectiveness is enforced by a novel designed cross-domain attention schemes, which can effectively enhance features based on self- and cross-domain attention schemes; The adaptiveness is guarded by a specially designed weighting scheme, which can adaptively balance the contribution of the two domains.
Ranked #3 on
Object Tracking
on FE108
no code implementations • 2 Sep 2021 • Qiang Gao, Wei Wang, Kunpeng Zhang, Xin Yang, Congcong Miao
Although recent deep recursive models (e. g., RNN) are capable of alleviating these concerns, existing solutions hardly recognize the practical reality, such as the diversity of tourist demands, uncertainties in the trip generation, and the complex visiting preference.
no code implementations • 18 Aug 2021 • Yu Jiang, Lei Hu, Yongmei Zhang, Xin Yang
With the purpose of improving change detection effectiveness of the model in the multi-resolution data set, a weighted rich-scale inception coder network (WRICNet) is proposed in this article, which can make a great fusion of shallow multi-scale features, and deep multi-scale features.
no code implementations • 11 Aug 2021 • Shuangchi He, Zehui Lin, Xin Yang, Chaoyu Chen, Jian Wang, Xue Shuang, Ziwei Deng, Qin Liu, Yan Cao, Xiduo Lu, Ruobing Huang, Nishant Ravikumar, Alejandro Frangi, Yuanji Zhang, Yi Xiong, Dong Ni
In this study, we build a novel multi-label learning (MLL) scheme to identify multiple standard planes and corresponding anatomical structures of fetus simultaneously.
no code implementations • 10 Aug 2021 • Jiqing Zhang, Kai Zhao, Bo Dong, Yingkai Fu, Yuxin Wang, Xin Yang, BaoCai Yin
Jointly exploiting multiple different yet complementary domain information has been proven to be an effective way to perform robust object tracking.
no code implementations • 2 Aug 2021 • Yuhao Huang, Xin Yang, Yuxin Zou, Chaoyu Chen, Jian Wang, Haoran Dou, Nishant Ravikumar, Alejandro F Frangi, Jianqiao Zhou, Dong Ni
Weakly-supervised segmentation (WSS) can help reduce time-consuming and cumbersome manual annotation.
2 code implementations • 1 Aug 2021 • Zhoubo Xu, Puqing Chen, Romain Raveaux, Xin Yang, Huadong Liu
Graph matching is an important problem that has received widespread attention, especially in the field of computer vision.
no code implementations • 1 Aug 2021 • Zhendong Liu, Van Manh, Xin Yang, Xiaoqiong Huang, Karim Lekadir, Víctor Campello, Nishant Ravikumar, Alejandro F Frangi, Dong Ni
A style transfer model with style fusion is employed to generate the curriculum samples.
no code implementations • 31 Jul 2021 • Mingyuan Luo, Xin Yang, Xiaoqiong Huang, Yuhao Huang, Yuxin Zou, Xindi Hu, Nishant Ravikumar, Alejandro F Frangi, Dong Ni
In this paper, we propose a novel approach to sensorless freehand 3D US reconstruction considering the complex skill sequences.
no code implementations • 28 Jun 2021 • Yuhao Liu, Jiake Xie, Yu Qiao, Yong Tang and, Xin Yang
Image matting is an ill-posed problem that aims to estimate the opacity of foreground pixels in an image.
no code implementations • CVPR 2021 • Cheng Chi, Qingjie Wang, Tianyu Hao, Peng Guo, Xin Yang
In this paper, we show that effective feature-level collaboration of the networks for the three respective tasks could achieve much greater performance improvement for all three tasks than only loss-level joint optimization.
no code implementations • CVPR 2021 • Haiyang Mei, Bo Dong, Wen Dong, Pieter Peers, Xin Yang, Qiang Zhang, Xiaopeng Wei
To exploit depth information in mirror segmentation, we first construct a large-scale RGB-D mirror segmentation dataset, which we subsequently employ to train a novel depth-aware mirror segmentation framework.
1 code implementation • 12 Jun 2021 • Yi Lin, Yanfei Liu, Hao Chen, Xin Yang, Kai Ma, Yefeng Zheng, Kwang-Ting Cheng
To mitigate the complexity introduced by the model ensemble, we adopt the teacher-student paradigm, leveraging the diverse outputs from multiple learned networks as supervisory signals to guide the training of the student network.
no code implementations • 10 Jun 2021 • Xindi Hu, LiMin Wang, Xin Yang, Xu Zhou, Wufeng Xue, Yan Cao, Shengfeng Liu, Yuhao Huang, Shuangping Guo, Ning Shang, Dong Ni, Ning Gu
In this study, we propose a multi-task framework to learn the relationships among landmarks and structures jointly and automatically evaluate DDH.
no code implementations • 10 Jun 2021 • Jiankai Sun, Xin Yang, Yuanshun Yao, Aonan Zhang, Weihao Gao, Junyuan Xie, Chong Wang
In this paper, we propose a vFL framework based on Private Set Union (PSU) that allows each party to keep sensitive membership information to itself.
1 code implementation • 7 Jun 2021 • Xin Yang, Ning Zhang, Donglin Wang
Fourth, we generate three corresponding masks based on the 20 selected ROIs from group ICA, the 20 ROIs selected from dictionary learning, and the 40 combined ROIs selected from both.
no code implementations • 22 May 2021 • Xin Yang, Yuhao Huang, Ruobing Huang, Haoran Dou, Rui Li, Jikuan Qian, Xiaoqiong Huang, Wenlong Shi, Chaoyu Chen, Yuanji Zhang, Haixia Wang, Yi Xiong, Dong Ni
First, our proposed method is general and it can accurately localize multiple SPs in different challenging US datasets.
no code implementations • 19 May 2021 • Guang-Quan Zhou, Juzheng Miao, Xin Yang, Rui Li, En-Ze Huo, Wenlong Shi, Yuhao Huang, Jikuan Qian, Chaoyu Chen, Dong Ni
Our proposed framework is general and shows the potential to improve the efficiency of anatomical landmark detection.
no code implementations • 11 May 2021 • Baihe Huang, Xiaoxiao Li, Zhao Song, Xin Yang
Nevertheless, training analysis of neural networks in FL is non-trivial for two reasons: first, the objective loss function we are optimizing is non-smooth and non-convex, and second, we are even not updating in the gradient direction.
1 code implementation • CVPR 2021 • Haiyang Mei, Ge-Peng Ji, Ziqi Wei, Xin Yang, Xiaopeng Wei, Deng-Ping Fan
In this paper, we strive to embrace challenges towards effective and efficient COS. To this end, we develop a bio-inspired framework, termed Positioning and Focus Network (PFNet), which mimics the process of predation in nature.
Ranked #10 on
Camouflaged Object Segmentation
on PCOD_1200
Camouflaged Object Segmentation
Dichotomous Image Segmentation
+3
1 code implementation • 21 Apr 2021 • Jiqing Zhang, Chengjiang Long, Yuxin Wang, Haiyin Piao, Haiyang Mei, Xin Yang, BaoCai Yin
Recently, deep convolutional neural networks (CNNs) have been widely explored in single image super-resolution (SISR) and contribute remarkable progress.
no code implementations • 31 Mar 2021 • Xin Yang, Yu Qiao, Shaozhe Chen, Shengfeng He, BaoCai Yin, Qiang Zhang, Xiaopeng Wei, Rynson W. H. Lau
Image matting is an ill-posed problem that usually requires additional user input, such as trimaps or scribbles.
1 code implementation • 26 Mar 2021 • Xin Yang, Haoran Dou, Ruobing Huang, Wufeng Xue, Yuhao Huang, Jikuan Qian, Yuanji Zhang, Huanjia Luo, Huizhi Guo, Tianfu Wang, Yi Xiong, Dong Ni
2D US has to perform scanning for each SP, which is time-consuming and operator-dependent.
2 code implementations • ICLR 2022 • Oscar Li, Jiankai Sun, Xin Yang, Weihao Gao, Hongyi Zhang, Junyuan Xie, Virginia Smith, Chong Wang
Two-party split learning is a popular technique for learning a model across feature-partitioned data.
no code implementations • 26 Jan 2021 • Xin Yang, Zongliang Ma, Letian Yu, Ying Cao, BaoCai Yin, Xiaopeng Wei, Qiang Zhang, Rynson W. H. Lau
Finally, as opposed to using the same type of balloon as in previous works, we propose an emotion-aware balloon generation method to create different types of word balloons by analyzing the emotion of subtitles and audios.
1 code implementation • ICLR 2021 • Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vikas Chandra, Yingyan Celine Lin
In this paper, we attempt to explore low-precision training from a new perspective as inspired by recent findings in understanding DNN training: we conjecture that DNNs' precision might have a similar effect as the learning rate during DNN training, and advocate dynamic precision along the training trajectory for further boosting the time/energy efficiency of DNN training.
no code implementations • 11 Jan 2021 • Zhendong Liu, Xiaoqiong Huang, Xin Yang, Rui Gao, Rui Li, Yuanji Zhang, Yankai Huang, Guangquan Zhou, Yi Xiong, Alejandro F Frangi, Dong Ni
Deep segmentation models that generalize to images with unknown appearance are important for real-world medical image analysis.
no code implementations • 7 Jan 2021 • Yu Qiao, Yuhao Liu, Qiang Zhu, Xin Yang, Yuxin Wang, Qiang Zhang, Xiaopeng Wei
Image matting is a long-standing problem in computer graphics and vision, mostly identified as the accurate estimation of the foreground in input images.
1 code implementation • ICCV 2021 • Yuhao Liu, Jiake Xie, Xiao Shi, Yu Qiao, Yujie Huang, Yong Tang, Xin Yang
Regarding the nature of image matting, most researches have focused on solutions for transition regions.
1 code implementation • 13 Oct 2020 • Shujun Wang, Lequan Yu, Kang Li, Xin Yang, Chi-Wing Fu, Pheng-Ann Heng
Our DoFE framework dynamically enriches the image features with additional domain prior knowledge learned from multi-source domains to make the semantic features more discriminative.
no code implementations • 10 Oct 2020 • Haoming Li, Xin Yang, Jiamin Liang, Wenlong Shi, Chaoyu Chen, Haoran Dou, Rui Li, Rui Gao, Guangquan Zhou, Jinghui Fang, Xiaowen Liang, Ruobing Huang, Alejandro Frangi, Zhiyi Chen, Dong Ni
However, the lack of sharp boundaries in US images still remains an inherent challenge for segmentation.
no code implementations • 29 Sep 2020 • Xin Tian, Ke Xu, Xin Yang, Bao-Cai Yin, Rynson W. H. Lau
Inspired by this insight, we propose to use class and subitizing labels as weak supervision for the SID problem.
no code implementations • 24 Sep 2020 • Xiaoqiong Huang, Zejian Chen, Xin Yang, Zhendong Liu, Yuxin Zou, Mingyuan Luo, Wufeng Xue, Dong Ni
Based on the zero-shot style transfer to remove appearance shift and test-time augmentation to explore diverse underlying anatomy, our proposed method is effective in combating the appearance shift.
no code implementations • 26 Aug 2020 • Hammond Pearce, Xin Yang, Partha S. Roop, Marc Katzef, Tórur Biskopstø Strøm
This issue stems largely from the implementation strategies used within common neural network frameworks -- their underlying source code is often simply unsuitable for formal techniques such as static timing analysis.
no code implementations • 31 Jul 2020 • Junxiong Yu, Chaoyu Chen, Xin Yang, Yi Wang, Dan Yan, Jianxing Zhang, Dong Ni
The efficacy of our network is verified from a collected dataset of 418 patients with 145 benign tumors and 273 malignant tumors.
no code implementations • 30 Jul 2020 • Yuhao Huang, Xin Yang, Rui Li, Jikuan Qian, Xiaoqiong Huang, Wenlong Shi, Haoran Dou, Chaoyu Chen, Yuanji Zhang, Huanjia Luo, Alejandro Frangi, Yi Xiong, Dong Ni
In this study, we propose a novel Multi-Agent Reinforcement Learning (MARL) framework to localize multiple uterine SPs in 3D US simultaneously.
no code implementations • ECCV 2020 • Sucheng Ren, Chu Han, Xin Yang, Guoqiang Han, Shengfeng He
In this paper, we propose a simple yet effective approach, named Triple Excitation Network, to reinforce the training of video salient object detection (VSOD) from three aspects, spatial, temporal, and online excitations.
1 code implementation • 28 Apr 2020 • Xin Yang, Xu Wang, Yi Wang, Haoran Dou, Shengli Li, Huaxuan Wen, Yi Lin, Pheng-Ann Heng, Dong Ni
In this paper, we propose the first fully-automated solution to segment the whole fetal head in US volumes.
2 code implementations • 27 Apr 2020 • Haoran Dou, Davood Karimi, Caitlin K. Rollins, Cynthia M. Ortinau, Lana Vasung, Clemente Velasco-Annis, Abdelhakim Ouaalam, Xin Yang, Dong Ni, Ali Gholipour
Automatic segmentation of the cortical plate, on the other hand, is challenged by the relatively low resolution of the reconstructed fetal brain MRI scans compared to the thin structure of the cortical plate, partial voluming, and the wide range of variations in the morphology of the cortical plate as the brain matures during gestation.
1 code implementation • 26 Apr 2020 • Zhaohan Xiong, Qing Xia, Zhiqiang Hu, Ning Huang, Cheng Bian, Yefeng Zheng, Sulaiman Vesal, Nishant Ravikumar, Andreas Maier, Xin Yang, Pheng-Ann Heng, Dong Ni, Caizi Li, Qianqian Tong, Weixin Si, Elodie Puybareau, Younes Khoudli, Thierry Geraud, Chen Chen, Wenjia Bai, Daniel Rueckert, Lingchao Xu, Xiahai Zhuang, Xinzhe Luo, Shuman Jia, Maxime Sermesant, Yashu Liu, Kuanquan Wang, Davide Borra, Alessandro Masci, Cristiana Corsi, Coen de Vente, Mitko Veta, Rashed Karim, Chandrakanth Jayachandran Preetha, Sandy Engelhardt, Menyun Qiao, Yuanyuan Wang, Qian Tao, Marta Nunez-Garcia, Oscar Camara, Nicolo Savioli, Pablo Lamata, Jichao Zhao
Segmentation of cardiac images, particularly late gadolinium-enhanced magnetic resonance imaging (LGE-MRI) widely used for visualizing diseased cardiac structures, is a crucial first step for clinical diagnosis and treatment.
no code implementations • 1 Apr 2020 • Chaoyu Chen, Xin Yang, Ruobing Huang, Wenlong Shi, Shengfeng Liu, Mingrong Lin, Yuhao Huang, Yong Yang, Yuanji Zhang, Huanjia Luo, Yankai Huang, Yi Xiong, Dong Ni
The performance of the proposed framework is evaluated on a 3D US dataset to detect five key fetal facial landmarks.
no code implementations • 1 Apr 2020 • Jiamin Liang, Xin Yang, Haoming Li, Yi Wang, Manh The Van, Haoran Dou, Chaoyu Chen, Jinghui Fang, Xiaowen Liang, Zixin Mai, Guowen Zhu, Zhiyi Chen, Dong Ni
Efficiently synthesizing realistic, editable and high resolution US images can solve the problems.
no code implementations • 23 Feb 2020 • Yingyu Liang, Zhao Song, Mengdi Wang, Lin F. Yang, Xin Yang
We show that our approach obtains small error and is efficient in both space and time.
no code implementations • 14 Feb 2020 • Zhendong Liu, Xin Yang, Rui Gao, Shengfeng Liu, Haoran Dou, Shuangchi He, Yuhao Huang, Yankai Huang, Huanjia Luo, Yuanji Zhang, Yi Xiong, Dong Ni
In this paper, we propose a novel and intuitive framework to remove the appearance shift, and hence improve the generalization ability of DNNs.
no code implementations • 23 Dec 2019 • Chong Huang, Yuanjie Dang, Peng Chen, Xin Yang, Kwang-Ting, Cheng
Imitation learning has been applied to mimic the operation of a human cameraman in several autonomous cinematography systems.
1 code implementation • 17 Dec 2019 • Tangxin Xie, Xin Yang, Yu Jia, Chen Zhu, Xiaochuan Li
For a better performance in single image super-resolution(SISR), we present an image super-resolution algorithm based on adaptive dense connection (ADCSR).
no code implementations • 11 Oct 2019 • Xin Yang, Wenlong Shi, Haoran Dou, Jikuan Qian, Yi Wang, Wufeng Xue, Shengli Li, Dong Ni, Pheng-Ann Heng
(i) This is the first work about 3D pose estimation of fetus in the literature.