1 code implementation • 3 Feb 2025 • Boyu Mi, Hanqing Wang, Tai Wang, Yilun Chen, Jiangmiao Pang
3D visual grounding (3DVG) is challenging because of the requirement of understanding on visual information, language and spatial relationships.
no code implementations • 2 Dec 2024 • Chunlin Yu, Hanqing Wang, Ye Shi, Haoyang Luo, Sibei Yang, Jingyi Yu, Jingya Wang
In this paper, we introduce the Sequential 3D Affordance Reasoning task, which extends the traditional paradigm by reasoning from cumbersome user intentions and then decomposing them into a series of segmentation maps.
no code implementations • 30 Oct 2024 • Xujia Wang, Haiyan Zhao, Shuo Wang, Hanqing Wang, Zhiyuan Liu
Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA have significantly improved the adaptation of LLMs to downstream tasks in a resource-efficient manner.
1 code implementation • 15 Jul 2024 • Hanqing Wang, Jiahe Chen, Wensi Huang, Qingwei Ben, Tai Wang, Boyu Mi, Tao Huang, Siheng Zhao, Yilun Chen, Sizhe Yang, Peizhou Cao, Wenye Yu, Zichao Ye, Jialun Li, Junfeng Long, ZiRui Wang, Huiling Wang, Ying Zhao, Zhongying Tu, Yu Qiao, Dahua Lin, Jiangmiao Pang
Recent works have been exploring the scaling laws in the field of Embodied AI.
1 code implementation • 13 Jun 2024 • Bowen Ping, Shuo Wang, Hanqing Wang, Xu Han, Yuzhuang Xu, Yukun Yan, Yun Chen, Baobao Chang, Zhiyuan Liu, Maosong Sun
Motivated by the long-tail distribution of singular values in the delta weights, we propose a delta quantization approach using mixed-precision.
1 code implementation • 13 Jun 2024 • Hanqing Wang, Yixia Li, Shuo Wang, Guanhua Chen, Yun Chen
It is observed that the minor matrix corresponds to the noisy or long-tail information, while the principal matrix contains important knowledge.
no code implementations • 18 Feb 2024 • Hanqing Wang, Bowen Ping, Shuo Wang, Xu Han, Yun Chen, Zhiyuan Liu, Maosong Sun
Most prior works on LoRA combination primarily rely on task-level weights for each involved LoRA, making different examples and tokens share the same LoRA weights.
1 code implementation • 26 Oct 2023 • Hanqing Wang, Yajing Luo, Boya Xiong, Guanhua Chen, Yun Chen
Stylistic headline generation is the task to generate a headline that not only summarizes the content of an article, but also reflects a desired style that attracts users.
no code implementations • ICCV 2023 • Hanqing Wang, Wei Liang, Luc van Gool, Wenguan Wang
VLN-CE is a recently released embodied task, where AI agents need to navigate a freely traversable environment to reach a distant target location, given language instructions.
1 code implementation • 6 Apr 2023 • Dong An, Hanqing Wang, Wenguan Wang, Zun Wang, Yan Huang, Keji He, Liang Wang
To develop a robust VLN-CE agent, we propose a new navigation framework, ETPNav, which focuses on two critical skills: 1) the capability to abstract environments and generate long-range navigation plans, and 2) the ability of obstacle-avoiding control in continuous environments.
1 code implementation • 28 Jan 2023 • Weikang Wang, Guanhua Chen, Hanqing Wang, Yue Han, Yun Chen
In this paper, we investigate whether multilingual sentence Transformer LaBSE is a strong multilingual word aligner.
no code implementations • CVPR 2023 • Chuanqi Zang, Hanqing Wang, Mingtao Pei, Wei Liang
For textual data, the model prefers the local phrase semantics which may deviate from the global semantics in long sentences.
1 code implementation • 30 Oct 2022 • Hanqing Wang, Wei Liang, Luc van Gool, Wenguan Wang
With the emergence of varied visual navigation tasks (e. g, image-/object-/audio-goal and vision-language navigation) that specify the target in different ways, the community has made appealing advances in training specialized agents capable of handling individual navigation tasks well.
1 code implementation • CVPR 2022 • Hanqing Wang, Wei Liang, Jianbing Shen, Luc van Gool, Wenguan Wang
Since the rise of vision-language navigation (VLN), great progress has been made in instruction following -- building a follower to navigate environments under the guidance of instructions.
no code implementations • 10 May 2021 • Hanqing Wang, Zan Wang, Wei Liang, Lap-Fai Yu
Scene Rearrangement Planning (SRP) is an interior task proposed recently.
1 code implementation • CVPR 2021 • Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen
Recently, numerous algorithms have been developed to tackle the problem of vision-language navigation (VLN), i. e., entailing an agent to navigate 3D environments through following linguistic instructions.
1 code implementation • ECCV 2020 • Hanqing Wang, Wenguan Wang, Tianmin Shu, Wei Liang, Jianbing Shen
Vision-language navigation (VLN) is the task of entailing an agent to carry out navigational instructions inside photo-realistic environments.
1 code implementation • 10 Sep 2018 • Hanqing Wang, Jiaolong Yang, Wei Liang, Xin Tong
The key idea of our method is to leverage object mask and pose estimation from CNNs to assist the 3D shape learning by constructing a probabilistic single-view visual hull inside of the network.
no code implementations • ICCV 2017 • Hanqing Wang, Wei Liang, Lap-Fai Yu
In the inference phase, given a scanned 3D scene with different object candidates and a dictionary of human poses, our approach infers the best object as a container together with human pose for transferring a given object.