no code implementations • 31 Dec 2024 • Jiasheng Zhang, Deqiang Ouyang, Shuang Liang, Jie Shao
Meanwhile, we propose a two-stage training approach to identify samples that deviate from the model's preferred patterns.
no code implementations • 5 Dec 2024 • Yongming Zhu, Longhao Zhang, Zhengkun Rong, Tianshu Hu, Shuang Liang, Zhipeng Ge
The second stage learns the mapping from the input dyadic audio to motion latent codes through denoising, leading to the audio-driven head generation in interactive scenarios.
no code implementations • 21 Nov 2024 • Yuke Zhu, Chi Xie, Shuang Liang, Bo Zheng, Sheng Guo
Recent advances on Multi-modal Large Language Models have demonstrated that high-resolution image input is crucial for model capabilities, especially for fine-grained tasks.
Ranked #101 on
Visual Question Answering
on MM-Vet
no code implementations • 5 Oct 2024 • Shuang Liang, Guido Montúfar
For a broad class of potential functions, we show that mirror flow exhibits lazy training and has the same implicit bias as ordinary gradient flow when the network width tends to infinity.
no code implementations • 9 Sep 2024 • Longhao Zhang, Shuang Liang, Zhipeng Ge, Tianshu Hu
In this paper, we present PersonaTalk, an attention-based two-stage framework, including geometry construction and face rendering, for high-fidelity and personalized visual dubbing.
no code implementations • 7 Sep 2024 • Ruiting Dai, Yuqiao Tan, Lisi Mo, Tao He, Ke Qin, Shuang Liang
To this end, we propose a novel Multi-step Adaptive Prompt Learning (MuAP) framework, aiming to generate multimodal prompts and perform multi-step prompt tuning, which adaptively learns knowledge by iteratively aligning modalities.
no code implementations • 22 Jul 2024 • Zhongxing Ma, Shuang Liang, Yongkun Wen, Weixin Lu, Guowei Wan
The core concept behind RoadPainter is to extract a set of points from each centerline mask to improve the accuracy of centerline prediction.
1 code implementation • 17 Jun 2024 • Jiasheng Zhang, Jialin Chen, Menglin Yang, Aosong Feng, Shuang Liang, Jie Shao, Rex Ying
Moreover, we conduct extensive benchmark experiments on DTGB, evaluating 7 popular dynamic graph learning algorithms and their variants of adapting to text attributes with LLM embeddings, along with 6 powerful large language models (LLMs).
no code implementations • 9 May 2024 • Ruiting Dai, Yuqiao Tan, Lisi Mo, Shuang Liang, Guohao Huo, Jiayi Luo, Yao Cheng
Afterward, a structure-aware frozen PLM is employed to fully incorporate the structured and textual information from the evidence graph, where the generation of prompts is driven by graph entities and relations.
no code implementations • 11 Apr 2024 • Saichao Liu, Geng Sun, Jiahui Li, Shuang Liang, Qingqing Wu, Pengfei Wang, Dusit Niyato
To improve the work efficiency of the UVAA, we formulate a UAV-enabled collaborative beamforming multi-objective optimization problem (UCBMOP) to simultaneously maximize the transmission rate of the UVAA and minimize the energy consumption of all UAVs by optimizing the positions and excitation current weights of all UAVs.
no code implementations • 6 Apr 2024 • Zemin Sun, Geng Sun, Long He, Fang Mei, Shuang Liang, Yanheng Liu
In the short time scale, we propose a price-incentive method for on-demand computing resource allocation and a matching mechanism-based method for computation offloading.
no code implementations • 23 Mar 2024 • Zemin Sun, Geng Sun, Qingqing Wu, Long He, Shuang Liang, Hongyang Pan, Dusit Niyato, Chau Yuen, Victor C. M. Leung
Since the problem is a non-convex and NP-hard mixed integer nonlinear programming (MINLP), we propose a two-timescale joint computing resource allocation, computation offloading, and trajectory control (TJCCT) approach for solving the problem.
1 code implementation • 11 Dec 2023 • Hongcai He, Anjie Zhu, Shuang Liang, Feiyu Chen, Jie Shao
We propose a framework called decoupled meta-reinforcement learning (DCMRL), which (1) contrastively restricts the learning of task contexts through pulling in similar task contexts within the same task and pushing away different task contexts of different tasks, and (2) utilizes a Gaussian quantization variational autoencoder (GQ-VAE) for clustering the Gaussian distributions of the task contexts and skills respectively, and decoupling the exploration and learning processes of their spaces.
1 code implementation • 6 Dec 2023 • Xu Yao, Shuang Liang, Songqiao Han, Hailiang Huang
To address data scarcity and imbalance in MPP, some studies have adopted Graph Neural Networks (GNN) as an encoder to extract commonalities from molecular graphs.
no code implementations • 3 Oct 2023 • Chuang Zhang, Geng Sun, Qingqing Wu, Jiahui Li, Shuang Liang, Dusit Niyato, Victor C. M. Leung
Unmanned aerial vehicles (UAVs) as aerial relays are practically appealing for assisting Internet of Things (IoT) network.
no code implementations • 30 Sep 2023 • Yanheng Liu, Hongyang Pan, Geng Sun, Aimin Wang, Jiahui Li, Shuang Liang
Wireless rechargeable sensor networks with a charging unmanned aerial vehicle (CUAV) have the broad application prospects in the power supply of the rechargeable sensor nodes (SNs).
no code implementations • 30 Sep 2023 • Hongyang Pan, Yanheng Liu, Geng Sun, Junsong Fan, Shuang Liang, Chau Yuen
For UTTOP, we first introduce a pretreatment method, and then use an improved particle swarm optimization with Normal distribution initialization, Genetic mechanism, Differential mechanism and Pursuit operator (PSO-NGDP) to deal with this sub optimization problem.
no code implementations • 17 Aug 2023 • Geng Sun, Long He, Zemin Sun, Qingqing Wu, Shuang Liang, Jiahui Li, Dusit Niyato, Victor C. M. Leung
Unmanned aerial vehicles (UAVs) play an increasingly important role in assisting fast-response post-disaster rescue due to their fast deployment, flexible mobility, and low cost.
no code implementations • 11 Aug 2023 • Zikun Zhuang, Ruihao Qian, Chi Xie, Shuang Liang
Human-object interaction (HOI) detection is an important part of understanding human activities and visual scenes.
no code implementations • 11 Aug 2023 • Yiyang Cai, Jiaming Lu, Jiewen Wang, Shuang Liang
UACTN decouples the representation learning of sketches and 3D shapes into two separate tasks: classification-based sketch uncertainty learning and 3D shape feature transfer.
1 code implementation • NeurIPS 2023 • Chi Xie, Zhao Zhang, Yixuan Wu, Feng Zhu, Rui Zhao, Shuang Liang
In this paper, we advance them to a more practical setting called Described Object Detection (DOD) by expanding category names to flexible language expressions for OVD and overcoming the limitation of REC only grounding the pre-existing object.
no code implementations • 18 Jul 2023 • Yuzhe He, Shuang Liang, Xiaofei Rui, Chengying Cai, Guowei Wan
The experimental results show that our method achieves centimeter-level localization accuracy, and outperforms existing methods using vectorized maps by a large margin.
1 code implementation • CVPR 2023 • Chi Xie, Fangao Zeng, Yue Hu, Shuang Liang, Yichen Wei
Unlike most previous HOI methods that focus on learning better human-object features, we propose a novel and complementary approach called category query learning.
Ranked #9 on
Human-Object Interaction Detection
on HICO-DET
no code implementations • IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022 • Wenyu Yu, Hui Kang, Geng Sun, Member, Shuang Liang, and Jiahui Li, Student Member, IEEE
Finally, the proposed ISSA is utilized to solve the objective function.
1 code implementation • Knowledge-Based Systems 2022 • Jiasheng Zhang, Shuang Liang, Yongpan Sheng, Jie Shao
Temporal knowledge graph (TKG) representation learning aims to project entities and relations in TKG to low-dimensional vector space while preserving the evolutionary nature of TKG.
3 code implementations • Knowledge-Based Systems 2022 • Anjie Zhu, Deqiang Ouyang, Shuang Liang, Jie Shao
Due to this one-to-many dilemma, enlarged action space and ignoring logical relationship between entity and relation increase the difficulty of learning.
1 code implementation • 4 Apr 2022 • Shuang Liang, Yinan Zou, Yong Zhou
Joint activity detection and channel estimation (JADCE) for grant-free random access is a critical issue that needs to be addressed to support massive connectivity in IoT networks.
1 code implementation • 1 Mar 2022 • Jinlai Zhang, Weiming Li, Shuang Liang, Hao Wang, Jihong Zhu
We also introduce a new U6DA-Linemod dataset for robustness study of the 6D pose estimation task.
no code implementations • 12 Dec 2021 • Weilin Liu, Ye Mu, Chao Yu, Xuefei Ning, Zhong Cao, Yi Wu, Shuang Liang, Huazhong Yang, Yu Wang
These scenarios indeed correspond to the vulnerabilities of the under-test driving policies, thus are meaningful for their further improvements.
no code implementations • 20 Aug 2021 • Shuang Liang, Yuanming Shi, Yong Zhou
Although an enhanced estimation performance in terms of the mean squared error (MSE) can be achieved, the weighted $l_1$-norm minimization algorithm is still a convex relaxation of the original group-sparse matrix estimation problem, yielding a suboptimal solution.
no code implementations • 1 Aug 2021 • Kai Wu, Vinit Kumar Chugh, Venkatramana D. Krishna, Arturo di Girolamo, Yongqiang Andrew Wang, Renata Saha, Shuang Liang, Maxim C-J Cheeran, Jian-Ping Wang
With the ongoing global pandemic of coronavirus disease 2019 (COVID-19), there is an increasing quest for more accessible, easy-to-use, rapid, inexpensive, and high accuracy diagnostic tools.
no code implementations • 8 Jul 2021 • Shuang Liang
In this paper, we present a hybrid deep learning framework named CTNet which combines convolutional neural network and transformer together for the detection of COVID-19 via 3D chest CT images.
no code implementations • 4 Mar 2021 • Jia-Xing Zhang, Lu Yang, Shuang Liang, Wei Chen, Qiang-Hua Wang
We study the possibility to realize Majorana zero mode that's robust and may be easily manipulated for braiding in quantum computing in the ground state of the Kitaev model in this work.
Strongly Correlated Electrons
1 code implementation • 25 Nov 2020 • Xuefei Ning, Changcheng Tang, Wenshuo Li, Songyi Yang, Tianchen Zhao, Niansong Zhang, Tianyi Lu, Shuang Liang, Huazhong Yang, Yu Wang
Neural Architecture Search (NAS) has received extensive attention due to its capability to discover neural network architectures in an automated manner.
no code implementations • 21 Nov 2020 • Tianchen Zhao, Xuefei Ning, Xiangsheng Shi, Songyi Yang, Shuang Liang, Peng Lei, Jianfei Chen, Huazhong Yang, Yu Wang
We also design the micro-level search space to strengthen the information flow for BNN.
no code implementations • 28 Sep 2020 • Xuefei Ning, Wenshuo Li, Zixuan Zhou, Tianchen Zhao, Shuang Liang, Yin Zheng, Huazhong Yang, Yu Wang
A major challenge in NAS is to conduct a fast and accurate evaluation of neural architectures.
no code implementations • 18 Sep 2020 • Siyuan Lu, Meiqi Wang, Shuang Liang, Jun Lin, Zhongfeng Wang
Designing hardware accelerators for deep neural networks (DNNs) has been much desired.
1 code implementation • NeurIPS 2021 • Xuefei Ning, Changcheng Tang, Wenshuo Li, Zixuan Zhou, Shuang Liang, Huazhong Yang, Yu Wang
Conducting efficient performance estimations of neural architectures is a major challenge in neural architecture search (NAS).
no code implementations • 11 Apr 2019 • Huawei Wei, Shuang Liang, Yichen Wei
Recently, 3D face reconstruction and face alignment tasks are gradually combined into one task: 3D dense face alignment.
1 code implementation • 20 Feb 2019 • Yu Xing, Shuang Liang, Lingzhi Sui, Xijie Jia, Jiantao Qiu, Xin Liu, Yushun Wang, Yu Wang, Yi Shan
On the Xilinx ZU2 @330 MHz and ZU9 @330 MHz, we achieve equivalently state-of-the-art performance on our benchmarks by na\"ive implementations without optimizations, and the throughput is further improved up to 1. 26x by leveraging heterogeneous optimizations in DNNVM.
no code implementations • CVPR 2018 • Xiangyun Zhao, Shuang Liang, Yichen Wei
In this work, we present a novel and effective framework to facilitate object detection with the instance-level segmentation information that is only supervised by bounding box annotation.
2 code implementations • ECCV 2018 • Xiao Sun, Bin Xiao, Fangyin Wei, Shuang Liang, Yichen Wei
State-of-the-art human pose estimation methods are based on heat map representation.
Ranked #23 on
Pose Estimation
on MPII Human Pose
1 code implementation • ICCV 2017 • Xiao Sun, Jiaxiang Shang, Shuang Liang, Yichen Wei
A central problem is that the structural information in the pose is not well exploited in the previous regression methods.
Ranked #36 on
Pose Estimation
on MPII Human Pose
no code implementations • 17 Sep 2016 • Xingyi Zhou, Xiao Sun, Wei zhang, Shuang Liang, Yichen Wei
In this work, we propose to directly embed a kinematic object model into the deep neutral network learning for general articulated object pose estimation.
Ranked #327 on
3D Human Pose Estimation
on Human3.6M
no code implementations • 5 Jul 2016 • Luwei Yang, Ligen Zhu, Yichen Wei, Shuang Liang, Ping Tan
Previous part-based attribute recognition approaches perform part detection and attribute recognition in separate steps.
no code implementations • CVPR 2015 • Chaoyang Wang, Long Zhao, Shuang Liang, Liqing Zhang, Jinyuan Jia, Yichen Wei
Hierarchical segmentation based object proposal methods have become an important step in modern object detection paradigm.
no code implementations • CVPR 2015 • Xiao Sun, Yichen Wei, Shuang Liang, Xiaoou Tang, Jian Sun
We extends the previous 2D cascaded object pose regression work [9] in two aspects so that it works better for 3D articulated objects.
no code implementations • CVPR 2014 • Wangjiang Zhu, Shuang Liang, Yichen Wei, Jian Sun
However, their usage of boundary prior is very simple, fragile, and the integration with other cues is mostly heuristic.