no code implementations • 14 Dec 2024 • Di Xu, Xin Miao, Hengjie Liu, Jessica E. Scholey, Wensha Yang, Mary Feng, Michael Ohliger, Hui Lin, Yi Lao, Yang Yang, Ke Sheng
The inference times of CIRNet, CS, and Re-Con-GAN are 11s, 120s, and 0.15s, respectively.
no code implementations • 12 Nov 2024 • Di Xu, Yang Yang, Hengjie Liu, Qihui Lyu, Martina Descovich, Dan Ruan, Ke Sheng
Computed tomography (CT) provides high spatial resolution visualization of 3D structures for scientific and clinical applications.
no code implementations • 19 Jul 2024 • Kaibing Chen, Dong Shen, Hanwen Zhong, Huasong Zhong, Kui Xia, Di Xu, Wei Yuan, Yifei Hu, Bin Wen, Tianke Zhang, Changyi Liu, Dewen Fan, Huihui Xiao, JiaHong Wu, Fan Yang, Size Li, Di Zhang
However, when dealing with long sequences of visual signals or inputs such as videos, the self-attention mechanism of language models can lead to significant computational overhead.
no code implementations • 1 Jun 2024 • Xuanchen Li, Yuhao Cheng, Xingyu Ren, Haozhe Jia, Di Xu, Wenhan Zhu, Yichao Yan
To simplify this process, we propose Topo4D, a novel framework for automatic geometry and texture generation, which optimizes densely aligned 4D heads and 8K texture maps directly from calibrated multi-view time-series images.
no code implementations • 20 May 2024 • Di Xu, Xin Miao, Hengjie Liu, Jessica E. Scholey, Wensha Yang, Mary Feng, Michael Ohliger, Hui Lin, Yi Lao, Yang Yang, Ke Sheng
Re-Con-GAN processed the data (3D+t) as temporal slices (2D+t).
no code implementations • 12 Mar 2024 • Likun Li, Haoqi Zeng, Changpeng Yang, Haozhe Jia, Di Xu
The objective of personalization and stylization in text-to-image is to instruct a pre-trained diffusion model to analyze new concepts introduced by users and incorporate them into expected styles.
no code implementations • 12 Feb 2024 • Ashish Shenoy, Yichao Lu, Srihari Jayakumar, Debojeet Chatterjee, Mohsen Moslehpour, Pierce Chuang, Abhay Harpale, Vikas Bhardwaj, Di Xu, Shicong Zhao, Longfang Zhao, Ankit Ramchandani, Xin Luna Dong, Anuj Kumar
We introduce Lumos, the first end-to-end multimodal question-answering system with text understanding capabilities.
no code implementations • 4 Jan 2024 • Ecem Sogancioglu, Bram van Ginneken, Finn Behrendt, Marcel Bengs, Alexander Schlaefer, Miron Radu, Di Xu, Ke Sheng, Fabien Scalzo, Eric Marcus, Samuele Papa, Jonas Teuwen, Ernst Th. Scholten, Steven Schalekamp, Nils Hendrix, Colin Jacobs, Ward Hendrix, Clara I Sánchez, Keelin Murphy
To address this, we organized a public research challenge, NODE21, aimed at the detection and generation of lung nodules in chest X-rays.
no code implementations • CVPR 2024 • Yuhao Cheng, Zhuo Chen, Xingyu Ren, Wenhan Zhu, Zhengqin Xu, Di Xu, Changpeng Yang, Yichao Yan
To address the problem of distortion caused by tri-plane warping, we train a warp-aware encoder to project the warped face onto a standardized latent space.
no code implementations • 11 Dec 2023 • Haozhe Jia, Yan Li, Hengfei Cui, Di Xu, Yuwang Wang, Tao Yu
We identify the key challenge as the exploration of disentangled conditional control between high-level semantics and explicit parameters (e.g., 3DMM) in the generation process, and accordingly propose a novel diffusion-based editing framework, named DisControlFace.
1 code implementation • CVPR 2024 • Zhanfeng Liao, Yuelang Xu, Zhe Li, Qijing Li, Boyao Zhou, Ruifeng Bai, Di Xu, Hongwen Zhang, Yebin Liu
To address the problem of dynamic hair modeling, we introduce a hybrid head model into our avatar representation based on Gaussian Head Avatar, and a training method that considers timing information and an occlusion perception module to model the non-rigid motion of hair.
no code implementations • 31 Oct 2023 • Di Xu, Qihui Lyu, Dan Ruan, Ke Sheng
Deep learning (DL) methods have shown promise for improving MMD performance, but typical approaches that conduct DL-MMD in the image domain fail to fully utilize projection information or, under an iterative setup, are computationally inefficient in both training and prediction.
no code implementations • ICCV 2023 • Haiyang Ying, Baowei Jiang, Jinzhi Zhang, Di Xu, Tao Yu, Qionghai Dai, Lu Fang
This paper proposes a method for fast scene radiance field reconstruction with strong novel view synthesis performance and convenient scene editing functionality.
no code implementations • 19 Sep 2023 • Di Xu, Hengjie Liu, Dan Ruan, Ke Sheng
Dynamic magnetic resonance imaging (DMRI) is an effective imaging tool for diagnosis tasks that require motion tracking of a certain anatomy.
1 code implementation • 27 Jul 2023 • Lingdong Kong, Yaru Niu, Shaoyuan Xie, Hanjiang Hu, Lai Xing Ng, Benoit R. Cottereau, Liangjun Zhang, Hesheng Wang, Wei Tsang Ooi, Ruijie Zhu, Ziyang Song, Li Liu, Tianzhu Zhang, Jun Yu, Mohan Jing, Pengwei Li, Xiaohua Qi, Cheng Jin, Yingfeng Chen, Jie Hou, Jie Zhang, Zhen Kan, Qiang Ling, Liang Peng, Minglei Li, Di Xu, Changpeng Yang, Yuanqi Yao, Gang Wu, Jian Kuai, Xianming Liu, Junjun Jiang, Jiamian Huang, Baojun Li, Jiale Chen, Shuang Zhang, Sun Ao, Zhenyu Li, Runze Chen, Haiyong Luo, Fang Zhao, Jingze Yu
In this paper, we summarize the winning solutions from the RoboDepth Challenge -- an academic competition designed to facilitate and advance robust OoD depth estimation.
no code implementations • 16 May 2023 • Di Xu, Yang Zhao, Xiang Hao, Xin Meng
We introduce a novel dataset consisting of images depicting pink eggs that have been identified as Pomacea canaliculata eggs, accompanied by corresponding bounding box annotations.
1 code implementation • 8 May 2023 • Peng Xia, Di Xu, Ming Hu, Lie Ju, ZongYuan Ge
The long-tailed multi-label visual recognition (LTML) task is highly challenging due to label co-occurrence and imbalanced data distribution.
Ranked #1 on Long-tail Learning on COCO-MLT (using extra training data)
no code implementations • 20 Apr 2023 • Di Xu, Xiang He, Tonghua Su, Zhongjie Wang
This paper provides a comprehensive survey on the recent advances and challenges in DNN partition approaches over the cloud, edge, and end devices based on a detailed literature collection.
no code implementations • 19 Feb 2023 • Di Xu, Qifan Xu, Kevin Nhieu, Dan Ruan, Ke Sheng
Suppression of thoracic bone shadows on chest X-rays (CXRs) has been indicated to improve the diagnosis of pulmonary disease.
no code implementations • 8 Aug 2022 • Haoran Wang, Di Xu, Dongliang He, Fu Li, Zhong Ji, Jungong Han, Errui Ding
Video-text retrieval (VTR) is an attractive yet challenging task for multi-modal understanding, which aims to search for relevant video (text) given a query (video).
2 code implementations • 24 Dec 2021 • Gang Li, Di Xu, Xing Cheng, Lingyu Si, Changwen Zheng
Although vision Transformers have achieved excellent performance as backbone models in many vision tasks, most of them intend to capture global relations of all tokens in an image or a window, which disrupts the inherent spatial and local correlations between patches in 2D structure.
no code implementations • 11 Mar 2021 • Roy L. M. Op het Veld, Di Xu, Vanessa Schaller, Marcel A. Verheijen, Stan M. E. Peters, Jason Jung, Chuyao Tong, Qingzhen Wang, Michiel W. A. de Moor, Bart Hesselmann, Kiefer Vermeulen, Jouri D. S. Bommer, Joon Sue Lee, Andrey Sarikov, Mihir Pendharkar, Anna Marzegalli, Sebastian Koelling, Leo P. Kouwenhoven, Leo Miglio, Chris J. Palmstrøm, Hao Zhang, Erik P. A. M. Bakkers
Strong spin-orbit semiconductor nanowires coupled to a superconductor are predicted to host Majorana zero modes.
Mesoscale and Nanoscale Physics
no code implementations • 27 Jan 2021 • Hao Zhang, Michiel W. A. de Moor, Jouri D. S. Bommer, Di Xu, Guanzhong Wang, Nick van Loo, Chun-Xiao Liu, Sasa Gazibegovic, John A. Logan, Diana Car, Roy L. M. Op het Veld, Petrus J. van Veldhoven, Sebastian Koelling, Marcel A. Verheijen, Mihir Pendharkar, Daniel J. Pennachio, Borzoyeh Shojaei, Joon Sue Lee, Chris J. Palmstrøm, Erik P. A. M. Bakkers, S. Das Sarma, Leo P. Kouwenhoven
We report electron transport studies on InSb-Al hybrid semiconductor-superconductor nanowire devices.
Mesoscale and Nanoscale Physics
no code implementations • 30 Dec 2020 • Jillian M. Clements, Di Xu, Nooshin Yousefi, Dmitry Efimov
Machine learning plays an essential role in preventing financial losses in the banking industry.
no code implementations • 12 Jul 2020 • Di Xu, Zhen Li, Yanning Zhang, Qi Cao
This paper presents a learning-based illumination estimation method for virtual objects in real environments.
no code implementations • 6 Feb 2020 • Dmitry Efimov, Di Xu, Luyang Kong, Alexey Nefedov, Archana Anandakrishnan
Generative Adversarial Networks (GANs) have become very popular for generating realistic-looking images.
no code implementations • 19 Oct 2019 • Di Xu, Tianhang Long, Junbin Gao
Massive volumes of high-dimensional data that evolve over time are continuously collected by contemporary information processing systems, which raises the problem of organizing this data into clusters, i.e., achieving dimensionality reduction, while also learning its temporal evolution patterns.
no code implementations • 29 Jan 2019 • Di Xu, Manjing Fang, Xia Hong, Junbin Gao
A general framework of least squares support vector machine with low rank kernels, referred to as LR-LSSVM, is introduced in this paper.
no code implementations • CVPR 2014 • Di Xu, Qi Duan, Jianming Zheng, Juyong Zhang, Jianfei Cai, Tat-Jen Cham
As a result, our approach is robust, stable, and able to efficiently recover high-quality surface details even when starting with a coarse MVS.