no code implementations • 12 Mar 2024 • Likun Li, Haoqi Zeng, Changpeng Yang, Haozhe Jia, Di Xu
The objective of personalization and stylization in text-to-image generation is to instruct a pre-trained diffusion model to analyze new concepts introduced by users and incorporate them into expected styles.
no code implementations • 12 Feb 2024 • Ashish Shenoy, Yichao Lu, Srihari Jayakumar, Debojeet Chatterjee, Mohsen Moslehpour, Pierce Chuang, Abhay Harpale, Vikas Bhardwaj, Di Xu, Shicong Zhao, Longfang Zhao, Ankit Ramchandani, Xin Luna Dong, Anuj Kumar
We introduce Lumos, the first end-to-end multimodal question-answering system with text understanding capabilities.
no code implementations • 4 Jan 2024 • Ecem Sogancioglu, Bram van Ginneken, Finn Behrendt, Marcel Bengs, Alexander Schlaefer, Miron Radu, Di Xu, Ke Sheng, Fabien Scalzo, Eric Marcus, Samuele Papa, Jonas Teuwen, Ernst Th. Scholten, Steven Schalekamp, Nils Hendrix, Colin Jacobs, Ward Hendrix, Clara I Sánchez, Keelin Murphy
To address this, we organized a public research challenge, NODE21, aimed at the detection and generation of lung nodules in chest X-rays.
no code implementations • 11 Dec 2023 • Haozhe Jia, Yan Li, Hengfei Cui, Di Xu, Changpeng Yang, Yuwang Wang, Tao Yu
Our DisControlNet can perform robust editing on any facial image through training on large-scale 2D in-the-wild portraits and also supports low-cost fine-tuning with few additional images to further learn diverse personalized priors of a specific person.
no code implementations • 31 Oct 2023 • Di Xu, Qihui Lyu, Dan Ruan, Ke Sheng
Deep learning (DL) methods have shown promise to improve the MMD performance, but typical approaches of conducting DL-MMD in the image domain fail to fully utilize projection information or, under an iterative setup, are computationally inefficient in both training and prediction.
no code implementations • ICCV 2023 • Haiyang Ying, Baowei Jiang, Jinzhi Zhang, Di Xu, Tao Yu, Qionghai Dai, Lu Fang
This paper proposes a method for fast scene radiance field reconstruction with strong novel view synthesis performance and convenient scene editing functionality.
no code implementations • 19 Sep 2023 • Di Xu, Hengjie Liu, Dan Ruan, Ke Sheng
Dynamic magnetic resonance imaging (DMRI) is an effective imaging tool for diagnosis tasks that require motion tracking of a certain anatomy.
1 code implementation • 27 Jul 2023 • Lingdong Kong, Yaru Niu, Shaoyuan Xie, Hanjiang Hu, Lai Xing Ng, Benoit R. Cottereau, Ding Zhao, Liangjun Zhang, Hesheng Wang, Wei Tsang Ooi, Ruijie Zhu, Ziyang Song, Li Liu, Tianzhu Zhang, Jun Yu, Mohan Jing, Pengwei Li, Xiaohua Qi, Cheng Jin, Yingfeng Chen, Jie Hou, Jie Zhang, Zhen Kan, Qiang Ling, Liang Peng, Minglei Li, Di Xu, Changpeng Yang, Yuanqi Yao, Gang Wu, Jian Kuai, Xianming Liu, Junjun Jiang, Jiamian Huang, Baojun Li, Jiale Chen, Shuang Zhang, Sun Ao, Zhenyu Li, Runze Chen, Haiyong Luo, Fang Zhao, Jingze Yu
In this paper, we summarize the winning solutions from the RoboDepth Challenge -- an academic competition designed to facilitate and advance robust OoD depth estimation.
no code implementations • 16 May 2023 • Di Xu, Yang Zhao, Xiang Hao, Xin Meng
We introduce a novel dataset consisting of images depicting pink eggs that have been identified as Pomacea canaliculata eggs, accompanied by corresponding bounding box annotations.
1 code implementation • 8 May 2023 • Peng Xia, Di Xu, Lie Ju, Ming Hu, Jun Chen, ZongYuan Ge
The long-tailed multi-label visual recognition (LTML) task is highly challenging due to label co-occurrence and imbalanced data distribution.
Ranked #1 on Long-tail Learning on COCO-MLT (using extra training data)
no code implementations • 20 Apr 2023 • Di Xu, Xiang He, Tonghua Su, Zhongjie Wang
This paper provides a comprehensive survey on the recent advances and challenges in DNN partition approaches over the cloud, edge, and end devices based on a detailed literature collection.
no code implementations • 19 Feb 2023 • Di Xu, Qifan Xu, Kevin Nhieu, Dan Ruan, Ke Sheng
Suppression of thoracic bone shadows on chest X-rays (CXRs) has been indicated to improve the diagnosis of pulmonary disease.
no code implementations • 8 Aug 2022 • Haoran Wang, Di Xu, Dongliang He, Fu Li, Zhong Ji, Jungong Han, Errui Ding
Video-text retrieval (VTR) is an attractive yet challenging task for multi-modal understanding, which aims to search for relevant video (text) given a text (video) query.
2 code implementations • 24 Dec 2021 • Gang Li, Di Xu, Xing Cheng, Lingyu Si, Changwen Zheng
Although vision Transformers have achieved excellent performance as backbone models in many vision tasks, most of them intend to capture global relations of all tokens in an image or a window, which disrupts the inherent spatial and local correlations between patches in 2D structure.
no code implementations • 11 Mar 2021 • Roy L. M. Op het Veld, Di Xu, Vanessa Schaller, Marcel A. Verheijen, Stan M. E. Peters, Jason Jung, Chuyao Tong, Qingzhen Wang, Michiel W. A. de Moor, Bart Hesselmann, Kiefer Vermeulen, Jouri D. S. Bommer, Joon Sue Lee, Andrey Sarikov, Mihir Pendharkar, Anna Marzegalli, Sebastian Koelling, Leo P. Kouwenhoven, Leo Miglio, Chris J. Palmstrøm, Hao Zhang, Erik P. A. M. Bakkers
Strong spin-orbit semiconductor nanowires coupled to a superconductor are predicted to host Majorana zero modes.
Mesoscale and Nanoscale Physics
no code implementations • 27 Jan 2021 • Hao Zhang, Michiel W. A. de Moor, Jouri D. S. Bommer, Di Xu, Guanzhong Wang, Nick van Loo, Chun-Xiao Liu, Sasa Gazibegovic, John A. Logan, Diana Car, Roy L. M. Op het Veld, Petrus J. van Veldhoven, Sebastian Koelling, Marcel A. Verheijen, Mihir Pendharkar, Daniel J. Pennachio, Borzoyeh Shojaei, Joon Sue Lee, Chris J. Palmstrøm, Erik P. A. M. Bakkers, S. Das Sarma, Leo P. Kouwenhoven
We report electron transport studies on InSb-Al hybrid semiconductor-superconductor nanowire devices.
Mesoscale and Nanoscale Physics
no code implementations • 30 Dec 2020 • Jillian M. Clements, Di Xu, Nooshin Yousefi, Dmitry Efimov
Machine learning plays an essential role in preventing financial losses in the banking industry.
no code implementations • 12 Jul 2020 • Di Xu, Zhen Li, Yanning Zhang, Qi Cao
This paper presents a learning-based illumination estimation method for virtual objects in real environments.
no code implementations • 6 Feb 2020 • Dmitry Efimov, Di Xu, Luyang Kong, Alexey Nefedov, Archana Anandakrishnan
Generative Adversarial Networks (GANs) have become very popular for generating realistic-looking images.
no code implementations • 19 Oct 2019 • Di Xu, Tianhang Long, Junbin Gao
Massive volumes of high-dimensional data that evolve over time are continuously collected by contemporary information processing systems, which raises the problem of organizing this data into clusters, i.e., achieving dimensionality reduction, while learning its temporal evolution patterns.
no code implementations • 29 Jan 2019 • Di Xu, Manjing Fang, Xia Hong, Junbin Gao
A general framework of least squares support vector machine with low rank kernels, referred to as LR-LSSVM, is introduced in this paper.
no code implementations • CVPR 2014 • Di Xu, Qi Duan, Jianming Zheng, Juyong Zhang, Jianfei Cai, Tat-Jen Cham
As a result, our approach is robust and stable, and is able to efficiently recover high-quality surface details even when starting from a coarse MVS.