1 code implementation • 3 Mar 2025 • Xinsheng Wang, Mingqi Jiang, Ziyang Ma, Ziyu Zhang, Songxiang Liu, Linqin Li, Zheng Liang, Qixi Zheng, Rui Wang, Xiaoqin Feng, Weizhen Bian, Zhen Ye, Sitong Cheng, Ruibin Yuan, Zhixian Zhao, Xinfa Zhu, Jiahao Pan, Liumeng Xue, Pengcheng Zhu, Yunlin Chen, Zhifei Li, Xie Chen, Lei Xie, Yike Guo, Wei Xue
Recent advancements in large language models (LLMs) have driven significant progress in zero-shot text-to-speech (TTS) synthesis.
no code implementations • 20 Feb 2025 • Ran Ding, Ziyu Zhang, Ying Zhu, Ziqian Kong, Peilan Xu
To enhance tourists' experiences and immersion, this paper proposes a narrative-driven travel planning framework called NarrativeGuide, which generates a geoculturally-grounded narrative script for travelers, offering a novel, role-playing experience for their journey.
no code implementations • 25 Nov 2024 • Ziyu Zhang, Binbin Huang, Hanqing Jiang, Liyang Zhou, Xiaojun Xiang, Shunhan Shen
Recently, 3D Gaussian Splatting (3DGS) has attracted attention for its superior rendering quality and speed over Neural Radiance Fields (NeRF).
no code implementations • 31 Oct 2024 • Dake Guo, Jixun Yao, Xinfa Zhu, Kangxiang Xia, Zhao Guo, Ziyu Zhang, Yao Wang, Jie Liu, Lei Xie
Our system consists of two modules: a speech generator for Track 1 and a background audio generator for Track 2.
no code implementations • 8 Mar 2024 • Ziyu Zhang, Johann Laconte, Daniil Lisus, Timothy D. Barfoot
This paper presents a novel method to assess the resilience of the Iterative Closest Point (ICP) algorithm via deep-learning-based attacks on lidar point clouds.
2 code implementations • 15 Sep 2023 • Daniil Lisus, Johann Laconte, Keenan Burnett, Ziyu Zhang, Timothy D. Barfoot
This paper presents a novel deep-learning-based approach to improve localizing radar measurements against lidar maps.
1 code implementation • 6 May 2023 • Zeyu Cai, Jian Yu, Ziyu Zhang, Chengqian Jin, Feipeng Da
The reconstruction subnet in the network then learns the mapping of the residuals to the true values to improve reconstruction accuracy.
no code implementations • 7 Nov 2022 • Andrey Ignatov, Grigory Malivenko, Radu Timofte, Lukasz Treszczotko, Xin Chang, Piotr Ksiazek, Michal Lopuszynski, Maciej Pioro, Rafal Rudnicki, Maciej Smyl, Yujie Ma, Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang, XueChao Shi, Difan Xu, Yanan Li, Xiaotao Wang, Lei Lei, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, Gang Yu, Bin Fu, Jiaqi Li, Yiran Wang, Zihao Huang, Zhiguo Cao, Marcos V. Conde, Denis Sapozhnikov, Byeong Hyun Lee, Dongwon Park, Seongmin Hong, Joonhee Lee, Seunggyu Lee, Se Young Chun
Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks.
no code implementations • 15 Jul 2022 • Arvind V. Mahankali, David P. Woodruff, Ziyu Zhang
Our key technique is a method for obtaining subspace embeddings with a number of rows polynomial in $q$ for a matrix which is the flattening of a tensor train of $q$ tensors.
no code implementations • 12 Nov 2021 • Yanyi Ding, Zhiyi Kuang, Yuxin Pei, Jeff Tan, Ziyu Zhang, Joseph Konan
SARS-CoV-2 is an upper respiratory system RNA virus that has caused over 3 million deaths and infecting over 150 million worldwide as of May 2021.
no code implementations • 17 May 2021 • Andrey Ignatov, Grigory Malivenko, David Plowman, Samarth Shukla, Radu Timofte, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, Gang Yu, Bin Fu, Yiran Wang, Xingyi Li, Min Shi, Ke Xian, Zhiguo Cao, Jin-Hua Du, Pei-Lin Wu, Chao Ge, Jiaoyang Yao, Fangwen Tu, Bo Li, Jung Eun Yoo, Kwanggyoon Seo, Jialei Xu, Zhenyu Li, Xianming Liu, Junjun Jiang, Wei-Chi Chen, Shayan Joya, Huanhuan Fan, Zhaobing Kang, Ang Li, Tianpeng Feng, Yang Liu, Chuannan Sheng, Jian Yin, Fausto T. Benavide
While many solutions have been proposed for this task, they are usually very computationally expensive and thus are not applicable for on-device inference.
no code implementations • 31 Mar 2021 • Yi Yu, Feipeng Da, Ziyu Zhang
Without fine-tuning on the test set, the Rank-1 Recognition Rate (RR1) is achieved as follows: 98. 85% on FRGC v2. 0 dataset and 99. 33% on Bosphorus dataset, which proves the effectiveness and the potentiality of our method.
1 code implementation • CVPR 2018 • Ishan Deshpande, Ziyu Zhang, Alexander Schwing
While this is particularly true for early GAN formulations, there has been significant empirically motivated and theoretically founded progress to improve stability, for instance, by using the Wasserstein distance rather than the Jenson-Shannon divergence.
no code implementations • CVPR 2017 • Unnat Jain, Ziyu Zhang, Alexander Schwing
Generating diverse questions for given images is an important task for computational education, entertainment and AI assistants.
no code implementations • CVPR 2016 • Xiaozhi Chen, Kaustav Kundu, Ziyu Zhang, Huimin Ma, Sanja Fidler, Raquel Urtasun
The focus of this paper is on proposal generation.
Ranked #8 on
Vehicle Pose Estimation
on KITTI Cars Hard
no code implementations • CVPR 2016 • Ziyu Zhang, Sanja Fidler, Raquel Urtasun
Our aim is to provide a pixel-wise instance-level labeling of a monocular image in the context of autonomous driving.
no code implementations • ICCV 2015 • Ziyu Zhang, Alexander G. Schwing, Sanja Fidler, Raquel Urtasun
In this paper we tackle the problem of instance-level segmentation and depth ordering from a single monocular image.