no code implementations • ECCV 2020 • Lezi Wang, Dong Liu, Rohit Puri, Dimitris N. Metaxas
We introduce a novel ranking network that utilizes the Co-Attention between movies and trailers as guidance to generate the training pairs, where the moments highly corrected with trailers are expected to be scored higher than the uncorrelated moments.
1 code implementation • 25 Sep 2023 • Bingyu Xin, Meng Ye, Leon Axel, Dimitris N. Metaxas
Then, we extend the baseline model to a prompt-based learning approach, PromptMR, for all-in-one MRI reconstruction from different views, contrasts, adjacent types, and acceleration factors.
no code implementations • ICCV 2023 • Di Liu, Xiang Yu, Meng Ye, Qilong Zhangli, Zhuowei Li, Zhixing Zhang, Dimitris N. Metaxas
Accurate 3D shape abstraction from a single 2D image is a long-standing problem in computer vision and graphics.
no code implementations • 2 Sep 2023 • Di Liu, Long Zhao, Qilong Zhangli, Yunhe Gao, Ting Liu, Dimitris N. Metaxas
The task of shape abstraction with semantic part consistency is challenging due to the complex geometries of natural objects.
1 code implementation • 18 Aug 2023 • Xiaoxiao He, Chaowei Tan, Ligong Han, Bo Liu, Leon Axel, Kang Li, Dimitris N. Metaxas
However, current cardiac MRI-based reconstruction technology used in clinical settings is 2D with limited through-plane resolution, resulting in low-quality reconstructed cardiac volumes.
no code implementations • 11 Aug 2023 • Shiyu Zhao, Samuel Schulter, Long Zhao, Zhixing Zhang, Vijay Kumar B. G, Yumin Suh, Manmohan Chandraker, Dimitris N. Metaxas
Second, a split-and-fusion (SAF) head is designed to remove the noise in localization of PLs, which is usually ignored in existing methods.
no code implementations • 9 Aug 2023 • Wentao Zhu, Yuan Jin, Gege Ma, Geng Chen, Jan Egger, Shaoting Zhang, Dimitris N. Metaxas
The accurate diagnosis on pathological subtypes for lung cancer is of significant importance for the follow-up treatments and prognosis managements.
1 code implementation • 22 Jul 2023 • Kexin Ding, Mu Zhou, Dimitris N. Metaxas, Shaoting Zhang
Survival outcome assessment is challenging and inherently associated with multiple clinical factors (e. g., imaging and genomics biomarkers) in cancer.
2 code implementations • 4 Jun 2023 • Yunhe Gao, Zhuowei Li, Di Liu, Mu Zhou, Shaoting Zhang, Dimitris N. Metaxas
Inspired by the training of medical residents, we explore universal medical image segmentation, whose goal is to learn from diverse medical imaging sources covering a range of clinical targets, body regions, and image modalities.
1 code implementation • 29 Apr 2023 • Ananya Jana, Aniruddha Maiti, Dimitris N. Metaxas
Tooth segmentation from intraoral scans is a crucial part of digital dentistry.
1 code implementation • CVPR 2023 • Yuxiao Chen, Jianbo Yuan, Yu Tian, Shijie Geng, Xinyu Li, Ding Zhou, Dimitris N. Metaxas, Hongxia Yang
However, direct aligning cross-modal information using such representations is challenging, as visual patches and text tokens differ in semantic levels and granularities.
1 code implementation • 25 Mar 2023 • Xiaoxiao He, Chaowei Tan, Bo Liu, Liping Si, Weiwu Yao, Liang Zhao, Di Liu, Qilong Zhangli, Qi Chang, Kang Li, Dimitris N. Metaxas
The supervised learning of the proposed method extracts features from limited labeled data in each client, while the unsupervised data is used to distill both feature and response-based knowledge from a national data repository to further improve the accuracy of the collaborative model and reduce the communication cost.
no code implementations • 16 Mar 2023 • Zhuowei Li, Long Zhao, Zizhao Zhang, Han Zhang, Di Liu, Ting Liu, Dimitris N. Metaxas
Prototype, as a representation of class embeddings, has been explored to reduce memory footprint or mitigate forgetting for continual learning scenarios.
1 code implementation • 25 Jan 2023 • Ananya Jana, Hrebesh Molly Subhash, Dimitris N. Metaxas
Summarizing of the mesh cell/triangle in this manner imposes an implicit structural constraint and makes it difficult to work with multiple resolutions which is done in many point cloud based deep learning algorithms.
1 code implementation • 22 Nov 2022 • Ran Gu, Guotai Wang, Jiangshan Lu, Jingyang Zhang, Wenhui Lei, Yinan Chen, Wenjun Liao, Shichuan Zhang, Kang Li, Dimitris N. Metaxas, Shaoting Zhang
First, a disentangle network is proposed to decompose an image into a domain-invariant anatomical representation and a domain-specific style code, where the former is sent to a segmentation model that is not affected by the domain shift, and the disentangle network is regularized by a decoder that combines the anatomical and style codes to reconstruct the input image.
no code implementations • 10 Oct 2022 • Yunhe Gao, Xingjian Shi, Yi Zhu, Hao Wang, Zhiqiang Tang, Xiong Zhou, Mu Li, Dimitris N. Metaxas
First, DePT plugs visual prompts into the vision Transformer and only tunes these source-initialized prompts during adaptation.
Ranked #2 on
Domain Adaptation
on VisDA2017
1 code implementation • 20 Jul 2022 • Yuxiao Chen, Long Zhao, Jianbo Yuan, Yu Tian, Zhaoyang Xia, Shijie Geng, Ligong Han, Dimitris N. Metaxas
Despite the success of fully-supervised human skeleton sequence modeling, utilizing self-supervised pre-training for skeleton sequence representation learning has been an active field because acquiring task-specific skeleton annotations at large scales is difficult.
1 code implementation • 8 Jun 2022 • Zhuowei Li, Yibo Gao, Zhenzhou Zha, Zhiqiang Hu, Qing Xia, Shaoting Zhang, Dimitris N. Metaxas
In this work, we propose the self-supervised and weight-preserving neural architecture search (SSWP-NAS) as an extension of the current NAS framework by allowing the self-supervision and retaining the concomitant weights discovered during the search stage.
2 code implementations • 28 Feb 2022 • Yunhe Gao, Mu Zhou, Di Liu, Zhennan Yan, Shaoting Zhang, Dimitris N. Metaxas
Transformers have demonstrated remarkable performance in natural language processing and computer vision.
1 code implementation • ICLR 2022 • Tingfeng Li, Shaobo Han, Martin Renqiang Min, Dimitris N. Metaxas
We propose a reinforcement learning based approach to query object localization, for which an agent is trained to localize objects of interest specified by a small exemplary set.
1 code implementation • 17 Dec 2021 • Bingyu Xin, Timothy S. Phan, Leon Axel, Dimitris N. Metaxas
Magnetic Resonance (MR) image reconstruction from highly undersampled $k$-space data is critical in accelerated MR imaging (MRI) techniques.
3 code implementations • 3 Nov 2021 • Xiangde Luo, Wenjun Liao, Jianghong Xiao, Jieneng Chen, Tao Song, Xiaofan Zhang, Kang Li, Dimitris N. Metaxas, Guotai Wang, Shaoting Zhang
Deep learning-based medical image segmentation has shown the potential to reduce manual delineation efforts, but it still requires a large-scale fine annotated dataset for training, and there is a lack of large-scale datasets covering the whole abdomen region with accurate and detailed annotations for the whole abdominal organ segmentation.
1 code implementation • ICCV 2021 • Andreas Voskou, Konstantinos P. Panousis, Dimitrios Kosmopoulos, Dimitris N. Metaxas, Sotirios Chatzis
In this paper, we attenuate this need, by introducing an end-to-end SLT model that does not entail explicit use of glosses; the model only needs text groundtruth.
1 code implementation • 2 Jul 2021 • Jiahui Li, Wen Chen, Xiaodi Huang, Zhiqiang Hu, Qi Duan, Hongsheng Li, Dimitris N. Metaxas, Shaoting Zhang
To handle this problem, we propose a hybrid supervision learning framework for this kind of high resolution images with sufficient image-level coarse annotations and a few pixel-level fine labels.
1 code implementation • 14 Jun 2021 • Jingru Yi, Pengxiang Wu, Hui Tang, Bo Liu, Qiaoying Huang, Hui Qu, Lianyi Han, Wei Fan, Daniel J. Hoeppner, Dimitris N. Metaxas
To deal with this problem, in this paper, we propose an object-guided instance segmentation method.
1 code implementation • NeurIPS 2021 • Long Zhao, Zizhao Zhang, Ting Chen, Dimitris N. Metaxas, Han Zhang
Attention-based models, exemplified by the Transformer, can effectively model long range dependency, but suffer from the quadratic complexity of self-attention operation, making them difficult to be adopted for high-resolution image generation based on Generative Adversarial Networks (GANs).
Ranked #2 on
Image Generation
on CelebA 256x256
(FID metric)
no code implementations • 20 May 2021 • Yuxiao Chen, Jianbo Yuan, Long Zhao, Tianlang Chen, Rui Luo, Larry Davis, Dimitris N. Metaxas
Cross-modal attention mechanisms have been widely applied to the image-text matching task and have achieved remarkable improvements thanks to its capability of learning fine-grained relevance across different modalities.
1 code implementation • ICLR 2021 • Yu Tian, Jian Ren, Menglei Chai, Kyle Olszewski, Xi Peng, Dimitris N. Metaxas, Sergey Tulyakov
We introduce a motion generator that discovers the desired trajectory, in which content and motion are disentangled.
Ranked #21 on
Video Generation
on UCF-101
1 code implementation • 12 Apr 2021 • Xiangde Luo, Tao Song, Guotai Wang, Jieneng Chen, Yinan Chen, Kang Li, Dimitris N. Metaxas, Shaoting Zhang
To overcome these problems, we propose a 3D sphere representation-based center-points matching detection network that is anchor-free and automatically predicts the position, radius, and offset of nodules without the manual design of nodule/anchor parameters.
1 code implementation • CVPR 2021 • Li SiYao, Shiyu Zhao, Weijiang Yu, Wenxiu Sun, Dimitris N. Metaxas, Chen Change Loy, Ziwei Liu
In the animation industry, cartoon videos are usually produced at low frame rate since hand drawing of such frames is costly and time-consuming.
1 code implementation • 5 Apr 2021 • Yunhe Gao, Rui Huang, Yiwei Yang, Jie Zhang, Kainan Shao, Changjuan Tao, YuanYuan Chen, Dimitris N. Metaxas, Hongsheng Li, Ming Chen
Radiotherapy is a treatment where radiation is used to eliminate cancer cells.
no code implementations • 11 Feb 2021 • Harris Partaourides, Andreas Voskou, Dimitrios Kosmopoulos, Sotirios Chatzis, Dimitris N. Metaxas
Memory-efficient continuous Sign Language Translation is a significant challenge for the development of assisted technologies with real-time applicability for the deaf.
no code implementations • ICCV 2021 • Alireza Naghizadeh, Hongye Xu, Mohab Mohamed, Dimitris N. Metaxas, Dongfang Liu
The importance of this subject is nested in the amount of training data that artificial neural networks need to accurately identify and segment objects in images and the infeasibility of acquiring a sufficient dataset within the biomedical field.
no code implementations • 1 Jan 2021 • Zhiqiang Tang, Yunhe Gao, Yi Zhu, Zhi Zhang, Mu Li, Dimitris N. Metaxas
CrossNorm exchanges styles between feature channels to perform style augmentation, diversifying the content and style mixtures.
no code implementations • 15 Dec 2020 • Qi Chang, Zhennan Yan, Lohendran Baskaran, Hui Qu, Yikai Zhang, Tong Zhang, Shaoting Zhang, Dimitris N. Metaxas
As deep learning technologies advance, increasingly more data is necessary to generate general and robust models for various tasks.
no code implementations • 19 Aug 2020 • Lezi Wang, Dong Liu, Rohit Puri, Dimitris N. Metaxas
A movie's key moments stand out of the screenplay to grab an audience's attention and make movie browsing efficient.
no code implementations • 10 Jul 2020 • Hui Qu, Pengxiang Wu, Qiaoying Huang, Jingru Yi, Zhennan Yan, Kang Li, Gregory M. Riedlinger, Subhajyoti De, Shaoting Zhang, Dimitris N. Metaxas
To alleviate such tedious and manual effort, in this paper we propose a novel weakly supervised segmentation framework based on partial points annotation, i. e., only a small portion of nuclei locations in each image are labeled.
no code implementations • CVPR 2020 • Long Zhao, Xi Peng, Yuxiao Chen, Mubbasir Kapadia, Dimitris N. Metaxas
Our key idea is to generalize the distilled cross-modal knowledge learned from a Source dataset, which contains paired examples from both modalities, to the Target dataset by modeling knowledge as priors on parameters of the Student.
1 code implementation • 9 Jan 2020 • Jingru Yi, Pengxiang Wu, Qiaoying Huang, Hui Qu, Dimitris N. Metaxas
The comparison results demonstrate the merits of our method in both Cobb angle measurement and landmark detection on low-contrast and ambiguous X-ray images.
no code implementations • 20 Nov 2019 • Jingru Yi, Hui Tang, Pengxiang Wu, Bo Liu, Daniel J. Hoeppner, Dimitris N. Metaxas, Lianyi Han, Wei Fan
Along with the instance normalization, the model is able to recover the target object distribution and suppress the distribution of neighboring attached objects.
1 code implementation • NeurIPS 2019 • Yu Tian, Long Zhao, Xi Peng, Dimitris N. Metaxas
Graph kernels are kernel methods measuring graph similarity and serve as a standard tool for graph classification.
Ranked #7 on
Link Prediction
on Cora
1 code implementation • 27 Sep 2019 • Jingru Yi, Pengxiang Wu, Dimitris N. Metaxas
This paper proposes a new deep neural network for object detection.
no code implementations • 13 Aug 2019 • Chaowei Tan, Zhennan Yan, Shaoting Zhang, Kang Li, Dimitris N. Metaxas
However, effective and efficient delineation of all the knee articular cartilages in large-sized and high-resolution 3D MR knee data is still an open challenge.
2 code implementations • 2 Aug 2019 • Alireza Naghizadeh, Mohammadsajad Abavisani, Dimitris N. Metaxas
This is a challenging problem and requires exploration for data augmentation policies to ensure their effectiveness in covering the search space.
1 code implementation • 22 Jul 2019 • Jingru Yi, Pengxiang Wu, Qiaoying Huang, Hui Qu, Bo Liu, Daniel J. Hoeppner, Dimitris N. Metaxas
In this paper, we propose a new box-based cell instance segmentation method.
1 code implementation • 20 Jul 2019 • Yuxiao Chen, Long Zhao, Xi Peng, Jianbo Yuan, Dimitris N. Metaxas
We propose a Dynamic Graph-Based Spatial-Temporal Attention (DG-STA) method for hand gesture recognition.
Ranked #3 on
Hand Gesture Recognition
on SHREC 2017
no code implementations • 16 Jul 2019 • Shijie Geng, Ji Zhang, Hang Zhang, Ahmed Elgammal, Dimitris N. Metaxas
We present a simple method that achieves unexpectedly superior performance for Complex Reasoning involved Visual Question Answering.
4 code implementations • CVPR 2019 • Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris N. Metaxas
In this paper, we study the problem of learning Graph Convolutional Networks (GCNs) for regression.
Ranked #21 on
Monocular 3D Human Pose Estimation
on Human3.6M
1 code implementation • ICCV 2019 • Lezi Wang, Ziyan Wu, Srikrishna Karanam, Kuan-Chuan Peng, Rajat Vikram Singh, Bo Liu, Dimitris N. Metaxas
Recent developments in gradient-based attention modeling have seen attention maps emerge as a powerful tool for interpreting convolutional neural networks.
1 code implementation • 20 Aug 2018 • Zhiqiang Tang, Xi Peng, Shijie Geng, Yizhe Zhu, Dimitris N. Metaxas
We design a new connectivity pattern for the U-Net architecture.
Ranked #30 on
Pose Estimation
on MPII Human Pose
1 code implementation • 28 Jun 2018 • Yu Tian, Xi Peng, Long Zhao, Shaoting Zhang, Dimitris N. Metaxas
Generating multi-view images from a single-view input is an essential yet challenging problem.
no code implementations • 16 Feb 2018 • Zachary A. Daniels, Dimitris N. Metaxas
The ability for computational agents to reason about the high-level content of real world scene images is important for many applications.
no code implementations • 17 Jan 2018 • Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas
We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model.
no code implementations • 25 Mar 2017 • Long Zhao, Fangda Han, Xi Peng, Xun Zhang, Mubbasir Kapadia, Vladimir Pavlovic, Dimitris N. Metaxas
We first recover the facial identity and expressions from the video by fitting a face morphable model for each frame.
no code implementations • ICML 2017 • Bo Liu, Xiao-Tong Yuan, Lezi Wang, Qingshan Liu, Dimitris N. Metaxas
It remains open to explore duality theory and algorithms in such a non-convex and NP-hard problem setting.
1 code implementation • 8 Nov 2016 • Jingjing Liu, Shaoting Zhang, Shu Wang, Dimitris N. Metaxas
Multispectral pedestrian detection is essential for around-the-clock applications, e. g., surveillance and autonomous driving.
no code implementations • 9 Sep 2016 • Xi Peng, Qiong Hu, Junzhou Huang, Dimitris N. Metaxas
Our approach takes advantage of part-based representation and cascade regression for robust and efficient alignment on each frame.
no code implementations • 19 Aug 2016 • Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas
We propose a novel recurrent encoder-decoder network model for real-time video-based face alignment.
no code implementations • 4 Feb 2016 • Shu Wang, Shaoting Zhang, Wei Liu, Dimitris N. Metaxas
In this paper, we propose a novel visual tracking framework that intelligently discovers reliable patterns from a wide range of video to resist drift error for long-term tracking tasks.
no code implementations • ICCV 2015 • Xi Peng, Shaoting Zhang, Yu Yang, Dimitris N. Metaxas
Face alignment, especially on real-time or large-scale sequential images, is a challenging task with broad applications.