no code implementations • 10 Sep 2024 • Xin Wang, Tao Tan, Yuan Gao, Eric Marcus, Luyi Han, Antonio Portaluri, Tianyu Zhang, Chunyao Lu, Xinglong Liang, Regina Beets-Tan, Jonas Teuwen, Ritse Mann
Precision breast cancer (BC) risk assessment is crucial for developing individualized screening and prevention.
1 code implementation • 2 Sep 2024 • Ryan Wen Liu, Yuxu Lu, Yuan Gao, Yu Guo, Wenqi Ren, Fenghua Zhu, Fei-Yue Wang
To promote the navigational safety of vessels, many computational methods have been presented to perform visual quality enhancement under poor weather conditions.
no code implementations • 1 Sep 2024 • Hao Shi, Yuan Gao, Zhaoheng Ni, Tatsuya Kawahara
Furthermore, we propose the serialized speech information guidance SOT (GEncSep) to further utilize the separated encodings.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 29 Jul 2024 • Tom Gunter, ZiRui Wang, Chong Wang, Ruoming Pang, Aonan Zhang, BoWen Zhang, Chen Chen, Chung-Cheng Chiu, David Qiu, Deepak Gopinath, Dian Ang Yap, Dong Yin, Feng Nan, Floris Weers, Guoli Yin, Haoshuo Huang, Jianyu Wang, Jiarui Lu, John Peebles, Ke Ye, Mark Lee, Nan Du, Qibin Chen, Quentin Keunebroek, Sam Wiseman, Syd Evans, Tao Lei, Vivek Rathod, Xiang Kong, Xianzhi Du, Yanghao Li, Yongqiang Wang, Yuan Gao, Zaid Ahmed, Zhaoyang Xu, Zhiyun Lu, Al Rashid, Albin Madappally Jose, Alec Doane, Alfredo Bencomo, Allison Vanderby, Andrew Hansen, Ankur Jain, Anupama Mann Anupama, Areeba Kamal, Bugu Wu, Carolina Brum, Charlie Maalouf, Chinguun Erdenebileg, Chris Dulhanty, Dominik Moritz, Doug Kang, Eduardo Jimenez, Evan Ladd, Fangping Shi, Felix Bai, Frank Chu, Fred Hohman, Hadas Kotek, Hannah Gillis Coleman, Jane Li, Jeffrey Bigham, Jeffery Cao, Jeff Lai, Jessica Cheung, Jiulong Shan, Joe Zhou, John Li, Jun Qin, Karanjeet Singh, Karla Vega, Kelvin Zou, Laura Heckman, Lauren Gardiner, Margit Bowler, Maria Cordell, Meng Cao, Nicole Hay, Nilesh Shahdadpuri, Otto Godwin, Pranay Dighe, Pushyami Rachapudi, Ramsey Tantawi, Roman Frigg, Sam Davarnia, Sanskruti Shah, Saptarshi Guha, Sasha Sirovica, Shen Ma, Shuang Ma, Simon Wang, Sulgi Kim, Suma Jayaram, Vaishaal Shankar, Varsha Paidi, Vivek Kumar, Xin Wang, Xin Zheng, Walker Cheng, Yael Shrager, Yang Ye, Yasu Tanaka, Yihao Guo, Yunsong Meng, Zhao Tang Luo, Zhi Ouyang, Alp Aygar, Alvin Wan, Andrew Walkingshaw, Andy Narayanan, Antonie Lin, Arsalan Farooq, Brent Ramerth, Colorado Reed, Chris Bartels, Chris Chaney, David Riazati, Eric Liang Yang, Erin Feldman, Gabriel Hochstrasser, Guillaume Seguin, Irina Belousova, Joris Pelemans, Karen Yang, Keivan Alizadeh Vahid, Liangliang Cao, Mahyar Najibi, Marco Zuliani, Max Horton, Minsik Cho, Nikhil Bhendawade, Patrick Dong, Piotr Maj, Pulkit Agrawal, Qi Shan, Qichen Fu, Regan Poston, Sam Xu, Shuangning Liu, Sushma Rao, Tashweena Heeramun, Thomas Merth, Uday Rayala, Victor Cui, Vivek Rangarajan Sridhar, Wencong Zhang, Wenqi Zhang, Wentao Wu, Xingyu Zhou, Xinwen Liu, Yang Zhao, Yin Xia, Zhile Ren, Zhongzheng Ren
We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute.
no code implementations • 18 Jul 2024 • Chunli Li, XiaoMing Zhang, Yuan Gao, Xiaoli Yin, Le Lu, Ling Zhang, Ke Yan, Yu Shi
Esophageal varices (EV), a serious health concern resulting from portal hypertension, are traditionally diagnosed through invasive endoscopic procedures.
no code implementations • 18 Jul 2024 • Wei Huang, Wei Liu, XiaoMing Zhang, Xiaoli Yin, Xu Han, Chunli Li, Yuan Gao, Yu Shi, Le Lu, Ling Zhang, Lei Zhang, Ke Yan
The early detection and precise diagnosis of liver tumors are tasks of critical clinical value, yet they pose significant challenges due to the high heterogeneity and variability of liver tumors.
no code implementations • 12 Jul 2024 • Dangxing Chen, Yuan Gao
Consequently, when applying machine learning models, we must ensure that the attribution methods reflect the underlying risks accurately.
1 code implementation • 6 Jul 2024 • Yuan Gao, Shuguo Jiang, Moran Li, Jin-Gang Yu, Gui-Song Xia
Given N tasks, we propose to simultaneously identify the best task groups from 2^N candidates and train the model weights simultaneously in one-shot, with the high-order task-affinity fully exploited.
1 code implementation • 5 Jul 2024 • Yu Guo, Yuan Gao, Yuxu Lu, Huilin Zhu, Ryan Wen Liu, Shengfeng He
In real-world scenarios, image impairments often manifest as composite degradations, presenting a complex interplay of elements such as low light, haze, rain, and snow.
1 code implementation • 3 Jul 2024 • Luyi Han, Tao Tan, Tianyu Zhang, Xin Wang, Yuan Gao, Chunyao Lu, Xinglong Liang, Haoran Dou, Yunzhi Huang, Ritse Mann
We propose a generative model that compresses discrete representations of each sequence to estimate the Gaussian distribution of vector-quantized common (VQC) latent space between multiple sequences.
1 code implementation • 26 Jun 2024 • Yuan Gao, Yajing Luo, Junhong Wang, Kui Jia, Gui-Song Xia
Motivated by this, we propose a novel 3D generalizable relative pose estimation method by elaborating (i) with a 2. 5D shape from an RGB-D reference, (ii) with an off-the-shelf differentiable renderer, and (iii) with semantic cues from a pretrained model like DINOv2.
no code implementations • 15 Jun 2024 • Yuan Gao, Zujing Liu, Weizhong Zhang, Bo Du, Gui-Song Xia
Compared to the moderate size of neural network models, structural weight pruning on the Large-Language Models (LLMs) imposes a novel challenge on the efficiency of the pruning algorithms, due to the heavy computation/memory demands of the LLMs.
1 code implementation • 6 Jun 2024 • Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Jingmin Chen, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui
Diffusion models have demonstrated great success in text-to-video (T2V) generation.
1 code implementation • 30 May 2024 • Rustem Islamov, Yuan Gao, Sebastian U. Stich
Communication efficiency has garnered significant attention as it is considered the main bottleneck for large-scale decentralized Machine Learning applications in distributed and federated settings.
1 code implementation • 24 May 2024 • Chunjiang Ge, Sijie Cheng, ZiMing Wang, Jiale Yuan, Yuan Gao, Jun Song, Shiji Song, Gao Huang, Bo Zheng
To enhance the capabilities of ConvLLaVA, we propose two critical optimizations.
Ranked #35 on Visual Question Answering on MM-Vet
1 code implementation • 9 May 2024 • Yuan Gao, Weizhong Zhang, Wenhan Luo, Lin Ma, Jin-Gang Yu, Gui-Song Xia, Jiayi Ma
We aim at exploiting additional auxiliary labels from an independent (auxiliary) task to boost the primary task performance which we focus on, while preserving a single task inference cost of the primary task.
no code implementations • 6 May 2024 • Jinwei Han, Yingguo Gao, Zhiwen Lin, Ke Yan, Shouhong Ding, Yuan Gao, Gui-Song Xia
Specifically, we introduce a Dual Attention Block (DAB) for visual-semantic relationship mining, which enriches visual information by multi-level feature fusion and conducts spatial attention for visual to semantic embedding.
no code implementations • CVPR 2024 • Jinwei Han, Zhiwen Lin, Zhongyisun Sun, Yingguo Gao, Ke Yan, Shouhong Ding, Yuan Gao, Gui-Song Xia
Specifically, two types of anchors are elaborated in our method, including i) text-compensated anchor which uses the images from the finetune set but enriches the text supervision from a pretrained captioner, ii) image-text-pair anchor which is retrieved from the dataset similar to pretraining data of CLIP according to the downstream task, associating with the original CLIP text with rich semantics.
no code implementations • 31 Mar 2024 • Yuan Gao, Jian Huang, Yuling Jiao, Shurong Zheng
We establish non-asymptotic error bounds for the distribution estimator based on CNFs, in terms of the Wasserstein-2 distance.
1 code implementation • CVPR 2024 • Xiao Lin, Wenfei Yang, Yuan Gao, Tianzhu Zhang
(2) The second design is a Geometric-Aware Feature Aggregation module, which can efficiently integrate the local and global geometric information into keypoint features.
1 code implementation • 19 Mar 2024 • Yuan Gao, Yiheng Zhu, Yuanbin Cao, Yinzhi Zhou, Zhen Wu, Yujie Chen, Shenglan Wu, Haoyuan Hu, Xinyu Dai
To alleviate this issue, we propose the Discriminate->Re-Compose->Re- Solve->Re-Decompose (Dr3) mechanism.
no code implementations • 19 Mar 2024 • Yuan Gao, SangWook Kim, David E Austin, Chris McIntosh
Medical vision-language pretraining models (VLPM) have achieved remarkable progress in fusing chest X-rays (CXR) with clinical texts, introducing image-text data binding approaches that enable zero-shot learning and downstream clinical tasks.
no code implementations • 17 Mar 2024 • Xuetong Li, Yuan Gao, Hong Chang, Danyang Huang, Yingying Ma, Rui Pan, Haobo Qi, Feifei Wang, Shuyuan Wu, Ke Xu, Jing Zhou, Xuening Zhu, Yingqiu Zhu, Hansheng Wang
A huge amount of statistical methods for massive data computation have been rapidly developed in the past decades.
no code implementations • CVPR 2024 • Yuan Gao, Kunyu Shi, Pengkai Zhu, Edouard Belval, Oren Nuriel, Srikar Appalaraju, Shabnam Ghadar, Vijay Mahadevan, Zhuowen Tu, Stefano Soatto
We propose Strongly Supervised pre-training with ScreenShots (S4) - a novel pre-training paradigm for Vision-Language Models using data from large-scale web screenshot rendering.
no code implementations • 5 Mar 2024 • Yuan Gao, Anton Rodomanov, Sebastian U. Stich
In this paper, we focus on the stochastic proximal gradient method with Polyak momentum.
no code implementations • 29 Feb 2024 • Guangyi Liu, Yu Wang, Zeyu Feng, Qiyu Wu, Liping Tang, Yuan Gao, Zhen Li, Shuguang Cui, Julian McAuley, Zichao Yang, Eric P. Xing, Zhiting Hu
The vast applications of deep generative models are anchored in three core capabilities -- generating new instances, reconstructing inputs, and learning compact representations -- across various data types, such as discrete text/protein sequences and continuous images.
no code implementations • 26 Feb 2024 • Yu Ming, Zihao Wu, Jie Yang, Danyi Li, Yuan Gao, Changxin Gao, Gui-Song Xia, Yuanqing Li, Li Liang, Jin-Gang Yu
In this paper, we propose to formulate annotation-efficient nucleus instance segmentation from the perspective of few-shot learning (FSL).
1 code implementation • 6 Feb 2024 • Yuxu Lu, Dong Yang, Yuan Gao, Ryan Wen Liu, Jun Liu, Yu Guo
Additionally, we suggest a multi-receptive field extraction module (MEM) to attenuate the loss of image texture details caused by GC nonlinear and OLS linear transformations.
no code implementations • 5 Feb 2024 • Yuan Gao, Haokun Chen, Xiang Wang, Zhicai Wang, Xue Wang, Jinyang Gao, Bolin Ding
Our research demonstrates the efficacy of leveraging AIGS and the DiffsFormer architecture to mitigate data scarcity in stock forecasting tasks.
no code implementations • 5 Feb 2024 • Junfeng Fang, Xinglin Li, Yongduo Sui, Yuan Gao, Guibin Zhang, Kun Wang, Xiang Wang, Xiangnan He
Graph representation learning on vast datasets, like web data, has made significant strides.
no code implementations • 30 Jan 2024 • Tiannan Wang, Jiamin Chen, Qingrui Jia, Shuai Wang, Ruoyu Fang, Huilin Wang, Zhaowei Gao, Chunzhao Xie, Chuou Xu, Jihong Dai, Yibin Liu, Jialong Wu, Shengwei Ding, Long Li, Zhiwei Huang, Xinle Deng, Teng Yu, Gangan Ma, Han Xiao, Zixin Chen, Danjun Xiang, Yunxia Wang, Yuanyuan Zhu, Yi Xiao, Jing Wang, Yiru Wang, Siran Ding, Jiayang Huang, Jiayi Xu, Yilihamu Tayier, Zhenyu Hu, Yuan Gao, Chengfeng Zheng, Yueshu Ye, Yihang Li, Lei Wan, Xinyue Jiang, Yujie Wang, Siyu Cheng, Zhule Song, Xiangru Tang, Xiaohua Xu, Ningyu Zhang, Huajun Chen, Yuchen Eleanor Jiang, Wangchunshu Zhou
Weaver is pre-trained on a carefully selected corpus that focuses on improving the writing capabilities of large language models.
1 code implementation • 25 Jan 2024 • Yuan Gao, Xiang Wang, Xiangnan He, Zhenguang Liu, Huamin Feng, Yongdong Zhang
Graph anomaly detection (GAD) is a challenging binary classification problem due to its different structural distribution between anomalies and normal nodes -- abnormal nodes are a minority, therefore holding high heterophily and low homophily compared to normal nodes.
1 code implementation • 17 Jan 2024 • Luyi Han, Tao Tan, Tianyu Zhang, Yuan Gao, Xin Wang, Valentina Longo, Sofía Ventura-Díaz, Anna D'Angelo, Jonas Teuwen, Ritse Mann
We use a clinical dataset with 1630 MRI scans from 314 patients treated with NAC.
1 code implementation • 8 Jan 2024 • Dong Yang, Wenyu Xu, Yuan Gao, Yuxu Lu, Jingming Zhang, Yu Guo
High-quality imaging is crucial for ensuring safety supervision and intelligent deployment in fields like transportation and industry.
1 code implementation • CVPR 2024 • Shihua Zhang, Zizhuo Li, Yuan Gao, Jiayi Ma
Specifically we first decompose the rough motion field that is contaminated by false matches into several different sub-fields which are highly smooth and contain the main energy of the original field.
no code implementations • CVPR 2024 • Yuan Gao, Yuqing Zhu, Xinjun Li, Yimin Du, Tianzhu Zhang
To address these challenges a novel event-based keypoint detection method is proposed by learning dynamic detectors and contextual descriptors in a self-supervised manner (SD2Event) including a contextual feature descriptor learning (CFDL) module and a dynamic keypoint detector learning (DKDL) module.
no code implementations • 27 Dec 2023 • Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Pengfei Wan, Di Zhang, Yufan Liu, Weiming Hu, ZhengJun Zha, Haibin Huang, Chongyang Ma
I2V-Adapter adeptly propagates the unnoised input image to subsequent noised frames through a cross-frame attention mechanism, maintaining the identity of the input image without any changes to the pretrained T2V model.
no code implementations • NeurIPS 2023 • Hong-Xing Yu, Yang Zheng, Yuan Gao, Yitong Deng, Bo Zhu, Jiajun Wu
Specifically, to deal with visual ambiguities of fluid velocity, we introduce a set of physics-based losses that enforce inferring a physically plausible velocity field, which is divergence-free and drives the transport of density.
no code implementations • 29 Nov 2023 • Xinshun Wang, Wanying Zhang, Can Wang, Yuan Gao, Mengyuan Liu
Graph Convolutional Networks (GCN) which typically follows a neural message passing framework to model dependencies among skeletal joints has achieved high success in skeleton-based human motion prediction task.
no code implementations • 21 Nov 2023 • Wenqing Wei, Zhengdong Yang, Yuan Gao, Jiyi Li, Chenhui Chu, Shogo Okada, Sheng Li
The early-stage Alzheimer's disease (AD) detection has been considered an important field of medical studies.
no code implementations • 20 Nov 2023 • Yuan Gao, Jian Huang, Yuling Jiao
Gaussian denoising has emerged as a powerful method for constructing simulation-free continuous normalizing flows for generative modeling.
no code implementations • 19 Nov 2023 • Yuan Gao, Junjie Jiao, Zhongkui Li, Sandra Hirche
The aim is to design a distributed protocol by dynamic output feedback that achieves state/output containment control while the associated H2 cost is smaller than an a priori given upper bound.
no code implementations • 6 Nov 2023 • Yuan Gao, Rustem Islamov, Sebastian Stich
Error Compensation (EC) is an extremely popular mechanism to mitigate the aforementioned issues during the training of models enhanced by contractive compression operators.
no code implementations • 2 Nov 2023 • Yuan Gao, Nobuyuki Morioka, Yu Zhang, Nanxin Chen
Instead, E3 TTS models the temporal structure of the waveform through the diffusion process.
2 code implementations • 15 Sep 2023 • Jingxiang Qu, Ryan Wen Liu, Yuan Gao, Yu Guo, Fenghua Zhu, Fei-Yue Wang
Real-time transportation surveillance is an essential part of the intelligent transportation system (ITS).
no code implementations • 9 Sep 2023 • Haiquan Zhao, Yuan Gao, Yingying Zhu
In this paper, a generalized minimum error with fiducial points criterion (GMEEF) is presented by adopting the Generalized Gaussian Density (GGD) function as kernel.
no code implementations • 17 Jul 2023 • Yan-Jie Zhou, Wei Liu, Yuan Gao, Jing Xu, Le Lu, Yuping Duan, Hao Cheng, Na Jin, Xiaoyong Man, Shuang Zhao, Yu Wang
Skin diseases are among the most prevalent health issues, and accurate computer-aided diagnosis methods are of importance for both dermatologists and patients.
1 code implementation • 17 Jul 2023 • Ke Yan, Xiaoli Yin, Yingda Xia, Fakai Wang, Shu Wang, Yuan Gao, Jiawen Yao, Chunli Li, Xiaoyu Bai, Jingren Zhou, Ling Zhang, Le Lu, Yu Shi
Liver tumor segmentation and classification are important tasks in computer aided diagnosis.
no code implementations • 17 Jul 2023 • Andrew Caines, Luca Benedetto, Shiva Taslimipoor, Christopher Davis, Yuan Gao, Oeistein Andersen, Zheng Yuan, Mark Elliott, Russell Moore, Christopher Bryant, Marek Rei, Helen Yannakoudakis, Andrew Mullooly, Diane Nicholls, Paula Buttery
The recent release of very large language models such as PaLM and GPT-4 has made an unprecedented impact in the popular media and public consciousness, giving rise to a mixture of excitement and fear as to their capabilities and potential uses, and shining a light on natural language processing research which had not previously received so much attention.
no code implementations • 6 Jul 2023 • Xin Wang, Tao Tan, Yuan Gao, Luyi Han, Tianyu Zhang, Chunyao Lu, Regina Beets-Tan, Ruisheng Su, Ritse Mann
The question of 'what the symmetrical Bi-MG would look like when the asymmetrical abnormalities have been removed ?'
1 code implementation • 3 Jul 2023 • Tianyu Zhang, Luyi Han, Anna D'Angelo, Xin Wang, Yuan Gao, Chunyao Lu, Jonas Teuwen, Regina Beets-Tan, Tao Tan, Ritse Mann
DWIs with different b-values are fused to efficiently utilize the difference features of DWIs.
1 code implementation • 3 Jul 2023 • Luyi Han, Tianyu Zhang, Yunzhi Huang, Haoran Dou, Xin Wang, Yuan Gao, Chunyao Lu, Tan Tao, Ritse Mann
Multi-sequence MRI is valuable in clinical settings for reliable diagnosis and treatment prognosis, but some sequences may be unusable or missing for various reasons.
1 code implementation • IEEE Transactions on Multimedia 2023 • Jinfu Liu, Xinshun Wang, Can Wang, Yuan Gao, Mengyuan Liu
Then, channel-dependent and temporal-dependent adjacency matrices corresponding to different channels and frames are calculated to capture the spatiotemporal dependencies between skeleton joints.
1 code implementation • 17 Apr 2023 • Yu Guo, Yuan Gao, Ryan Wen Liu, Yuxu Lu, Jingxiang Qu, Shengfeng He, Wenqi Ren
The presence of non-homogeneous haze can cause scene blurring, color distortion, low contrast, and other degradations that obscure texture details.
no code implementations • 5 Apr 2023 • Yuan Gao, Ruili Wang, Feng Hou
Machine translation relies heavily on the abilities of language understanding and generation.
no code implementations • 27 Feb 2023 • Yuan Gao, Stephanie Epstein, Murat Inalpolat, Yi-Ning Wu, Yan Gu
Explosive Ordnance Disposal (EOD) suits are widely used to protect human operators to execute emergency tasks such as bomb disposal and neutralization.
no code implementations • 23 Feb 2023 • Yuan Gao, Biao Jiang, Jietong Zhou
As a result, there is a need to develop a productive prediction model for better order execution and adaptability to different datasets.
1 code implementation • 3 Feb 2023 • Tianyu Zhang, Tao Tan, Luyi Han, Xin Wang, Yuan Gao, Jonas Teuwen, Regina Beets-Tan, Ritse Mann
Then the multi-parameter fusion with attention module enables the interaction of the encoded information from different parameters through a set of algorithmic strategies, and applies different weights to the information through the attention mechanism after information fusion to obtain refined representation information.
1 code implementation • 1 Feb 2023 • Luyi Han, Tao Tan, Tianyu Zhang, Yunzhi Huang, Xin Wang, Yuan Gao, Jonas Teuwen, Ritse Mann
Multi-sequence MRIs can be necessary for reliable diagnosis in clinical practice due to the complimentary information within sequences.
no code implementations • CVPR 2023 • Jianfeng He, Yuan Gao, Tianzhu Zhang, Zhe Zhang, Feng Wu
Second, the HKDL module can generate keypoint detectors in a hierarchical way, which is helpful for detecting keypoints with diverse levels of structures.
no code implementations • 28 Dec 2022 • Chih-Jung Tracy Chang, Yuan Gao, Beicheng Lou
In this paper, we introduce a novel variation of model-agnostic meta-learning, where an extra multiplicative parameter is introduced in the inner-loop adaptation.
1 code implementation • 27 Dec 2022 • Zhiwei Hu, Bo Chen, Yuan Gao, Zhilong Ji, Jinfeng Bai
The task of referring video object segmentation aims to segment the object in the frames of a given video to which the referring expressions refer.
1 code implementation • 12 Oct 2022 • Dehua Zheng, Xiaochen Zheng, Laurence T. Yang, Yuan Gao, Chenlu Zhu, Yiheng Ruan
In addition, our MFFN exploits the dependence and interaction between views and channels.
no code implementations • 29 Sep 2022 • Luofeng Liao, Yuan Gao, Christian Kroer
In resource allocation, it is crucial to quantify the variability of the resource received by the agents (such as blood banks and food banks) in addition to fairness and efficiency properties of the systems.
1 code implementation • 29 Aug 2022 • Junjie Hu, Chenyou Fan, Mete Ozay, Hua Feng, Yuan Gao, Tin Lun Lam
In this paper, we introduce the ground-to-aerial perception knowledge transfer and propose a progressive semi-supervised learning framework that enables drone perception using only labeled data of ground viewpoint and unlabeled data of flying viewpoints.
1 code implementation • 12 Aug 2022 • Junjie Li, Zilei Wang, Yuan Gao, Xiaoming Hu
Such a strategy can generate the object boundaries in target domain (edge of target-domain object areas) with the correct labels.
1 code implementation • 10 Aug 2022 • Zhongliang Jiang, Yuan Gao, Le Xie, Nassir Navab
Robotic ultrasound (US) imaging aims at overcoming some of the limitations of free-hand US examinations, e. g. difficulty in guaranteeing intra- and inter-operator repeatability.
no code implementations • 5 Aug 2022 • Jingtao Tang, Yuan Gao, Tin Lun Lam
In this paper, we focus on the multi-robot coverage path planning (mCPP) problem in large-scale planar areas with random dynamic interferers in the environment, where the robots have limited resources.
1 code implementation • 1 Aug 2022 • Guangyi Liu, Zeyu Feng, Yuan Gao, Zichao Yang, Xiaodan Liang, Junwei Bao, Xiaodong He, Shuguang Cui, Zhen Li, Zhiting Hu
This paper proposes a new efficient approach for composable text operations in the compact latent space of text.
Ranked #2 on Unsupervised Text Style Transfer on Yelp
no code implementations • CVPR 2022 • Jinke Li, Xiao He, Yang Wen, Yuan Gao, Xiaoqiang Cheng, Dan Zhang
As a rising task, panoptic segmentation is faced with challenges in both semantic segmentation and instance segmentation.
1 code implementation • 10 May 2022 • Yuan Bi, Zhongliang Jiang, Yuan Gao, Thomas Wendler, Angelos Karlas, Nassir Navab
The results demonstrate that proposed approach can effectively and accurately navigate the probe towards the longitudinal view of vessels.
1 code implementation • 9 May 2022 • Min Peng, Chongyang Wang, Yuan Gao, Yu Shi, Xiang-Dong Zhou
With a multiscale sampling, RMI iterates the interaction of appearance-motion information at each scale and the question embeddings to build the multilevel question-guided visual representations.
no code implementations • 19 Apr 2022 • Yuan Gao, Xiang Wang, Xiangnan He, Huamin Feng, Yongdong Zhang
At the core is to model the rumor characteristics inherent in rich information, such as propagation patterns in social network and semantic patterns in post content, and differentiate them from the truth.
1 code implementation • 16 Mar 2022 • Xi Chen, Ali Ghadirzadeh, Tianhe Yu, Yuan Gao, Jianhao Wang, Wenzhe Li, Bin Liang, Chelsea Finn, Chongjie Zhang
Offline reinforcement learning methods hold the promise of learning policies from pre-collected datasets without the need to query the environment for new transitions.
no code implementations • 25 Feb 2022 • Yuan Gao, Kaiyu Yang, Yuanlong Chen, Min Liu, Noureddine El Karoui
We establish a general optimization framework for the design of automated bidding agent in dynamic online marketplaces.
no code implementations • 14 Feb 2022 • Xupeng Shi, Pengfei Zheng, A. Adam Ding, Yuan Gao, Weizhong Zhang
Modern deep neural networks (DNNs) are vulnerable to adversarial attacks and adversarial training has been shown to be a promising method for improving the adversarial robustness of DNNs.
1 code implementation • CVPR 2022 • Jiafan Zhuang, Zilei Wang, Yuan Gao
For this task, we observe that the overfitting is surprisingly severe between labeled and unlabeled frames within a training video although they are very similar in style and contents.
no code implementations • 9 Dec 2021 • Qinghao Ye, Yuan Gao, Weiping Ding, Zhangming Niu, Chengjia Wang, Yinghui Jiang, Minhao Wang, Evandro Fei Fang, Wade Menpes-Smith, Jun Xia, Guang Yang
The multi-domain shift problem for the multi-center and multi-scanner studies is therefore nontrivial that is also crucial for a dependable recognition and critical for reproducible and objective diagnosis and prognosis.
no code implementations • 4 Nov 2021 • Sainan Liu, Vincent Nguyen, Yuan Gao, Subarna Tripathi, Zhuowen Tu
Our proposed panoptic 3D parsing framework points to a promising direction in computer vision.
no code implementations • 29 Oct 2021 • Maoguo Gong, Yuan Gao, Yue Wu, A. K. Qin
Inspired by the idea of dropout in neural networks, we introduce a network sampling strategy in the multi-party setting, which distributes different subnets of the central model to clients for updating, and the differentiable sampling rates allow each client to extract optimal local architecture from the supernet according to its private data distribution.
1 code implementation • 18 Oct 2021 • Fuqin Deng, Hua Feng, Mingjian Liang, Qi Feng, Ningbo Yi, Yong Yang, Yuan Gao, Junfeng Chen, Tin Lun Lam
The occupancy grid map is a critical component of autonomous positioning and navigation in the mobile robotic system, as many other systems' performance depends heavily on it.
1 code implementation • 18 Oct 2021 • Fuqin Deng, Hua Feng, Mingjian Liang, Hongmin Wang, Yong Yang, Yuan Gao, Junfeng Chen, Junjie Hu, Xiyue Guo, Tin Lun Lam
To better extract detail spatial information, we propose a two-stage Feature-Enhanced Attention Network (FEANet) for the RGB-T semantic segmentation task.
Ranked #13 on Semantic Segmentation on FMB Dataset
no code implementations • 6 Oct 2021 • Xingdong Feng, Yuan Gao, Jian Huang, Yuling Jiao, Xu Liu
We propose a relative entropy gradient sampler (REGS) for sampling from unnormalized distributions.
1 code implementation • 10 Sep 2021 • Min Peng, Chongyang Wang, Yuan Gao, Yu Shi, Xiang-Dong Zhou
Targeting these issues, this paper proposes a novel Temporal Pyramid Transformer (TPT) model with multimodal interaction for VideoQA.
no code implementations • 8 Sep 2021 • Chongyang Wang, Yuan Gao, Chenyou Fan, Junjie Hu, Tin Lun Lam, Nicholas D. Lane, Nadia Bianchi-Berthouze
For such issues, we propose a novel Learning to Agreement (Learn2Agree) framework to tackle the challenge of learning from multiple annotators without objective ground truth.
no code implementations • 28 Aug 2021 • Yuan Gao, Lok Hin Lee, Richard Droste, Rachel Craik, Sridevi Beriwal, Aris Papageorghiou, Alison Noble
This paper presents a novel approach to automatic fetal brain biometry motivated by needs in low- and medium- income countries.
1 code implementation • ICCV 2021 • Yuxiang Wei, Yupeng Shi, Xiao Liu, Zhilong Ji, Yuan Gao, Zhongqin Wu, WangMeng Zuo
It simply encourages the variation of output caused by perturbations on different latent dimensions to be orthogonal, and the Jacobian with respect to the input is calculated to represent this variation.
no code implementations • 2 Jul 2021 • Pengcheng Wang, Lingqiao Ji, Zhilong Ji, Yuan Gao, Xiao Liu
In this technical report, we briefly introduce the solution of our team "TAL-ai" for (Semi-) supervised Face detection in the low light condition in UG2+ Challenge in CVPR 2021.
2 code implementations • 13 May 2021 • Junjie Hu, Chenyou Fan, Hualie Jiang, Xiyue Guo, Yuan Gao, Xiangyong Lu, Tin Lun Lam
However, this KD process can be challenging and insufficient due to the large model capacity gap between the teacher and the student.
no code implementations • 14 Apr 2021 • Maoguo Gong, Yuan Gao, Yu Xie, A. K. Qin, Ke Pan, Yew-Soon Ong
The performance of machine learning algorithms heavily relies on the availability of a large amount of training data.
no code implementations • 14 Apr 2021 • Yuan Gao, Jiawei Li, Maoguo Gong, Yu Xie, A. K. Qin
Since the existing naive model parameter averaging method is contradictory to the learning paradigm of neural networks, we simulate the process of human cognition and communication, and analogy multi-party learning as a many-to-one knowledge sharing problem.
no code implementations • 14 Mar 2021 • Lok Hin Lee, Yuan Gao, J. Alison Noble
In this paper, we present an augmentation policy search method with the goal of improving model classification performance.
no code implementations • 17 Feb 2021 • Tao Liu, Xin-Yang Liu, Yuan Gao, Hai Jin, Jun He, Xian-Lei Sheng, Wentao Jin, Ziyu Chen, Wei Li
Strong fluctuations in the low-$T$ quantum critical regime can give rise to a large thermal entropy change and thus significant cooling effect when approaching the QCP.
Strongly Correlated Electrons
no code implementations • 4 Feb 2021 • Yuan Gao, Han Lin Shang, Yanrong Yang
We propose modeling raw functional data as a mixture of a smooth function and a highdimensional factor component.
Methodology Statistics Theory Statistics Theory
no code implementations • ICCV 2021 • Qinghao Ye, Xiyue Shen, Yuan Gao, ZiRui Wang, Qi Bi, Ping Li, Guang Yang
Video highlight detection plays an increasingly important role in social media content filtering, however, it remains highly challenging to develop automated video highlight detection methods because of the lack of temporal annotations (i. e., where the highlight moments are in long videos) for supervised learning.
1 code implementation • 17 Dec 2020 • Moran Li, Yuan Gao, Nong Sang
This is different from the previous methods where all the joints are considered holistically and share the same feature.
no code implementations • 11 Dec 2020 • Yuan Gao, Jian Huang, Yuling Jiao, Jin Liu, Xiliang Lu, Zhijian Yang
The key task in training is the estimation of the density ratios or differences that determine the residual maps.
1 code implementation • 3 Nov 2020 • Chongyang Wang, Yuan Gao, Akhil Mathur, Amanda C. De C. Williams, Nicholas D. Lane, Nadia Bianchi-Berthouze
Protective behavior exhibited by people with chronic pain (CP) during physical activities is the key to understanding their physical and emotional states.
7 code implementations • 11 Oct 2020 • Xiang An, Xuhan Zhu, Yang Xiao, Lan Wu, Ming Zhang, Yuan Gao, Bin Qin, Debing Zhang, Ying Fu
The experiment demonstrates no loss of accuracy when training with only 10\% randomly sampled classes for the softmax-based loss functions, compared with training with full classes using state-of-the-art models on mainstream benchmarks.
Ranked #2 on Face Identification on MegaFace
no code implementations • WMT (EMNLP) 2020 • Fandong Meng, Jianhao Yan, Yijin Liu, Yuan Gao, Xianfeng Zeng, Qinsong Zeng, Peng Li, Ming Chen, Jie zhou, Sifan Liu, Hao Zhou
We participate in the WMT 2020 shared news translation task on Chinese to English.
1 code implementation • 19 Sep 2020 • Min Peng, Chongyang Wang, Yuan Gao, Tao Bi, Tong Chen, Yu Shi, Xiang-Dong Zhou
As a spontaneous expression of emotion on face, micro-expression reveals the underlying emotion that cannot be controlled by human.
1 code implementation • 10 Sep 2020 • Jiehong Lin, Xian Shi, Yuan Gao, Ke Chen, Kui Jia
Point set is arguably the most direct approximation of an object or scene surface, yet its practical acquisition often suffers from the shortcoming of being noisy, sparse, and possibly incomplete, which restricts its use for a high-quality surface recovery.
5 code implementations • 23 Jul 2020 • Xiang Long, Kaipeng Deng, Guanzhong Wang, Yang Zhang, Qingqing Dang, Yuan Gao, Hui Shen, Jianguo Ren, Shumin Han, Errui Ding, Shilei Wen
We mainly try to combine various existing tricks that almost not increase the number of model parameters and FLOPs, to achieve the goal of improving the accuracy of detector as much as possible while ensuring that the speed is almost unchanged.
no code implementations • ECCV 2020 • Jian Wang, Xiang Long, Yuan Gao, Errui Ding, Shilei Wen
In the first stage, heatmap regression network is applied to obtain a rough localization result, and a set of proposal keypoints, called guided points, are sampled.
1 code implementation • NeurIPS 2020 • Yanli Liu, Yuan Gao, Wotao Yin
Furthermore, the role of dynamic parameters has not been addressed.
Optimization and Control
no code implementations • 22 May 2020 • Yuan Gao, Jian-Guo Liu, Nan Wu
To construct an efficient and stable approximation for the Langevin dynamics on $\mathcal{N}$, we leverage the corresponding Fokker-Planck equation on the manifold $\mathcal{N}$ in terms of the reaction coordinates $\mathsf{y}$.
no code implementations • 14 Apr 2020 • Shaoping Hu, Yuan Gao, Zhangming Niu, Yinghui Jiang, Lao Li, Xianglu Xiao, Minhao Wang, Evandro Fei Fang, Wade Menpes-Smith, Jun Xia, Hui Ye, Guang Yang
An outbreak of a novel coronavirus disease (i. e., COVID-19) has been recorded in Wuhan, China since late December 2019, which subsequently became pandemic around the world.
1 code implementation • CVPR 2020 • Yuan Gao, Haoping Bai, Zequn Jie, Jiayi Ma, Kui Jia, Wei Liu
We propose to incorporate neural architecture search (NAS) into general-purpose multi-task learning (GP-MTL).
no code implementations • ICML 2020 • Krzysztof Choromanski, David Cheikhi, Jared Davis, Valerii Likhosherstov, Achille Nazaret, Achraf Bahamou, Xingyou Song, Mrugank Akarte, Jack Parker-Holder, Jacob Bergquist, Yuan Gao, Aldo Pacchiano, Tamas Sarlos, Adrian Weller, Vikas Sindhwani
We present a new class of stochastic, geometrically-driven optimization algorithms on the orthogonal group $O(d)$ and naturally reductive homogeneous manifolds obtained from the action of the rotation group $SO(d)$.
no code implementations • 20 Mar 2020 • Yuan Gao, Robert Bregovic, Atanas Gotchev
Specifically, CycleST is composed of an encoder-decoder network and a residual learning strategy that restore the shearlet coefficients of densely-sampled EPIs using EPI reconstruction and cycle consistency losses.
Signal Processing Multimedia Image and Video Processing
no code implementations • 19 Mar 2020 • Yuan Gao, Robert Bregovic, Reinhard Koch, Atanas Gotchev
Specifically, for an input sparsely-sampled EPI, DRST employs a deep fully Convolutional Neural Network (CNN) to predict the residuals of the shearlet coefficients in shearlet domain in order to reconstruct a densely-sampled EPI in image domain.
no code implementations • 13 Mar 2020 • Ziming Gao, Yuan Gao, Yi Hu, Zhengyong Jiang, Jionglong Su
This paper will introduce a strategy based on the classic Deep Reinforcement Learning algorithm, Deep Q-Network, for portfolio management in stock market.
no code implementations • 7 Feb 2020 • Yuan Gao, Jian Huang, Yuling Jiao, Jin Liu
We then solve the McKean-Vlasov equation numerically using the forward Euler iteration, where the forward Euler map depends on the density ratio (density difference) between the distribution at current iteration and the underlying target distribution.
no code implementations • 31 Dec 2019 • Yuan Gao, Yiqiang Han
By observing differential behaviors from three pre-trained models during each testing iteration, the input image that triggered erroneous feedback was registered as a corner-case.
1 code implementation • Geoscientific Model Development 2019 • Xiaomeng Huang, Xing Huang, Dong Wang, Qi Wu, Yi Li, Shixun Zhang, YuWen Chen, Mingqing Wang, Yuan Gao, Qiang Tang, Yue Chen, Zheng Fang, Zhenya Song, Guangwen Yang
In this work, we design a simple computing library to bridge the gap and decouple the work of ocean modeling from parallel computing.
3 code implementations • 9 Nov 2019 • Iddo Drori, Darshan Thaker, Arjun Srivatsa, Daniel Jeong, Yueqi Wang, Linyong Nan, Fan Wu, Dimitri Leggas, Jinhao Lei, Weiyi Lu, Weilong Fu, Yuan Gao, Sashank Karri, Anand Kannan, Antonio Moretti, Mohammed AlQuraishi, Chen Keasar, Itsik Pe'er
Our dataset consists of amino acid sequences, Q8 secondary structures, position specific scoring matrices, multiple sequence alignment co-evolutionary features, backbone atom distance matrices, torsion angles, and 3D coordinates.
no code implementations • 25 Sep 2019 • Xi Chen, Yuan Gao, Ali Ghadirzadeh, Marten Bjorkman, Ginevra Castellano, Patric Jensfelt
In this work, we introduce an exploration approach based on maximizing the entropy of the visited states while learning a goal-conditioned policy.
no code implementations • 12 Aug 2019 • Yuan Gao, Elena Sibirtseva, Ginevra Castellano, Danica Kragic
In socially assistive robotics, an important research area is the development of adaptation techniques and their effect on human-robot interaction.
no code implementations • 9 Apr 2019 • Yuan Gao, Zixiang Cai, Lei Yu
In this work, we propose Intra-Ensemble, an end-to-end ensemble strategy with stochastic channel recombination operations to train several sub-networks simultaneously within one neural network.
no code implementations • 26 Mar 2019 • Yuan Gao, Christian Kroer, Donald Goldfarb
In particular, the increasing averages consistently outperform the uniform averages in all test problems by orders of magnitude.
no code implementations • 25 Feb 2019 • Shunkang Zhang, Yuan Gao, Yuling Jiao, Jin Liu, Yang Wang, Can Yang
To address the challenges in learning deep generative models (e. g., the blurriness of variational auto-encoder and the instability of training generative adversarial networks, we propose a novel deep generative model, named Wasserstein-Wasserstein auto-encoders (WWAE).
1 code implementation • 24 Jan 2019 • Yuan Gao, Yuling Jiao, Yang Wang, Yao Wang, Can Yang, Shunkang Zhang
We propose a general framework to learn deep generative models via \textbf{V}ariational \textbf{Gr}adient Fl\textbf{ow} (VGrow) on probability spaces.
1 code implementation • 16 Oct 2018 • Yuan Gao, Fangkai Yang, Martin Frisk, Daniel Hernandez, Christopher Peters, Ginevra Castellano
Deep reinforcement learning has recently been widely applied in robotics to study tasks such as locomotion and grasping, but its application to social human-robot interaction (HRI) remains a challenge.
no code implementations • 15 Oct 2018 • Yuan Gao, Xingyuan Bu, Yang Hu, Hui Shen, Ti Bai, Xubin Li, Shilei Wen
This report demonstrates our solution for the Open Images 2018 Challenge.
55 code implementations • 7 Sep 2018 • Arthur Juliani, Vincent-Pierre Berges, Ervin Teng, Andrew Cohen, Jonathan Harper, Chris Elion, Chris Goy, Yuan Gao, Hunter Henry, Marwan Mattar, Danny Lange
Recent advances in artificial intelligence have been driven by the presence of increasingly realistic and complex simulated environments.
1 code implementation • CVPR 2019 • Yuan Gao, Jiayi Ma, Mingbo Zhao, Wei Liu, Alan L. Yuille
In this paper, we propose a novel Convolutional Neural Network (CNN) structure for general-purpose multi-task learning (MTL), which enables automatic feature fusing at every layer from different tasks.
Ranked #93 on Semantic Segmentation on NYU Depth v2
no code implementations • 3 Nov 2017 • Hengduo Li, Jun Liu, Guyue Zhang, Yuan Gao, Yirui Wu
In this paper, we propose a new Multi-Glimpse LSTM (MG-LSTM) network, in which multi-scale contextual information is sequentially integrated to promote the human detection performance.
1 code implementation • 6 Sep 2017 • Yuan Gao, Brij Mohan Lal Srivastava, James Salsman
We use automatic speech recognition to assess spoken English learner pronunciation based on the authentic intelligibility of the learners' spoken responses determined from support vector machine (SVM) classifier or deep learning neural network model predictions of transcription correctness.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 4 Mar 2017 • James V. Burke, Yuan Gao, Tim Hoheisel
Generalized matrix-fractional (GMF) functions are a class of matrix support functions introduced by Burke and Hoheisel as a tool for unifying a range of seemingly divergent matrix optimization problems associated with inverse problems, regularization and learning.
no code implementations • 22 Sep 2016 • Yuan Gao, Alan Yuille
This paper addresses the estimation of 3D structures of symmetric objects from multiple images of the same object category, e. g. different cars, seen from various viewpoints.
no code implementations • 12 Sep 2016 • Yuan Gao, Jiayi Ma, Alan L. Yuille
This is based on recent work on sparsity where faces are represented in terms of two dictionaries: a gallery dictionary consisting of one or more examples of each person, and a variation dictionary representing linear nuisance variables (e. g., different lighting conditions, different glasses).
no code implementations • CVPR 2017 • Yuan Gao, Alan L. Yuille
By assuming an orthographic projection model, this paper addresses the estimation of 3D structures and camera projection using symmetry and/or Manhattan structure cues, which occur when the input is single- or multiple-image from the same category, e. g., multiple different cars.
no code implementations • 11 Apr 2016 • Yuan Gao, Dorota Glowacka
This paper introduces two recurrent neural network structures called Simple Gated Unit (SGU) and Deep Simple Gated Unit (DSGU), which are general structures for learning long term dependencies.