no code implementations • 23 Dec 2024 • Hao Gui, Lin Hu, Rui Chen, Mingxiao Huang, Yuxin Yin, Jin Yang, Yong Wu, Chen Liu, Zhongxu Sun, Xueyang Zhang, Kun Zhan
3D Gaussian Splatting (3DGS) is increasingly attracting attention in both academia and industry owing to its superior visual quality and rendering speed.
no code implementations • 17 Dec 2024 • Yunzhi Yan, Zhen Xu, Haotong Lin, Haian Jin, Haoyu Guo, Yida Wang, Kun Zhan, Xianpeng Lang, Hujun Bao, Xiaowei Zhou, Sida Peng
This paper aims to tackle the problem of photorealistic view synthesis from vehicle sensor data.
no code implementations • 29 Nov 2024 • Chaojun Ni, Guosheng Zhao, XiaoFeng Wang, Zheng Zhu, Wenkang Qin, Guan Huang, Chen Liu, Yuyin Chen, Yida Wang, Xueyang Zhang, Yifei Zhan, Kun Zhan, Peng Jia, Xianpeng Lang, Xingang Wang, Wenjun Mei
This is complemented by a progressive data update strategy designed to ensure high-quality rendering for more complex maneuvers.
no code implementations • 18 Nov 2024 • Tianyi Yan, Dongming Wu, Wencheng Han, Junpeng Jiang, Xia Zhou, Kun Zhan, Cheng-Zhong Xu, Jianbing Shen
By providing a dynamic and realistic simulation environment, DrivingSphere enables comprehensive testing and validation of autonomous driving algorithms, ultimately advancing the development of more reliable autonomous cars.
1 code implementation • 21 Oct 2024 • Qiao Sun, Huimin Wang, Jiahao Zhan, Fan Nie, Xin Wen, Leimeng Xu, Kun Zhan, Peng Jia, Xianpeng Lang, Hang Zhao
These planners promise better generalizations on complicated and few-shot cases than previous methods.
no code implementations • 3 Sep 2024 • Junpeng Jiang, Gangyi Hong, Lijun Zhou, Enhui Ma, Hengtong Hu, Xia Zhou, Jie Xiang, Fan Liu, Kaicheng Yu, Haiyang Sun, Kun Zhan, Peng Jia, Miao Zhang
Generating high-fidelity, temporally consistent videos in autonomous driving scenarios faces a significant challenge, e. g. problematic maneuvers in corner cases.
1 code implementation • 14 Aug 2024 • Xue Xia, Kun Zhan, Yuming Fang, Wenhui Jiang, Fei Shen
To this end, we propose a CNN-based DR diagnosis network with attention mechanism involved, termed lesion-aware network, to better capture lesion information from imbalanced data.
no code implementations • 1 Aug 2024 • Honglei Miao, Fan Ma, Ruijie Quan, Kun Zhan, Yi Yang
Despite growing interest in T2M, few methods focus on safeguarding these models against adversarial attacks, with existing work on text-to-image models proving insufficient for the unique motion domain.
1 code implementation • 10 Jul 2024 • Jingsheng Li, Tianxiang Xue, Jiayi Zhao, Jingmin Ge, Yufang Min, Wei Su, Kun Zhan
The complexity of clouds, particularly in terms of texture detail at high resolutions, has not been well explored by most existing cloud detection networks.
no code implementations • 2 Jul 2024 • Shuaike Xu, Xiaolin Zhang, Peng Zhang, Kun Zhan
Secondly, SACN uniquely integrates the graph's structural information to achieve strong-to-strong consensus learning, improving the utilization of unlabeled data while maintaining multiview learning.
no code implementations • 7 Jun 2024 • Xiaobiao Du, Haiyang Sun, Shuyun Wang, Zhuojie Wu, Hongwei Sheng, Jiaying Ying, Ming Lu, Tianqing Zhu, Kun Zhan, Xin Yu
(1) \textbf{High-Volume}: 2, 500 cars are meticulously scanned by 3D scanners, obtaining car images and point clouds with real-world dimensions; (2) \textbf{High-Quality}: Each car is captured in an average of 200 dense, high-resolution 360-degree RGB-D views, enabling high-fidelity 3D reconstruction; (3) \textbf{High-Diversity}: The dataset contains various cars from over 100 brands, collected under three distinct lighting conditions, including reflective, standard, and dark.
no code implementations • 4 Jun 2024 • Lijun Zhou, Tao Tang, Pengkun Hao, Zihang He, Kalok Ho, Shuo Gu, Wenbo Hou, Zhihui Hao, Haiyang Sun, Kun Zhan, Peng Jia, Xianpeng Lang, Xiaodan Liang
Secondly, we propose an Uncertainty-guided Query Denoising strategy to further enhance the training process.
no code implementations • 3 Jun 2024 • Enhui Ma, Lijun Zhou, Tao Tang, Zhan Zhang, Dong Han, Junpeng Jiang, Kun Zhan, Peng Jia, Xianpeng Lang, Haiyang Sun, Di Lin, Kaicheng Yu
Instead of randomly generating new data, we further design a sampling policy to let Delphi generate new data that are similar to those failure cases to improve the sample efficiency.
1 code implementation • 17 Apr 2024 • Qi Han, Zhibo Tian, Chengwei Xia, Kun Zhan
To address this, we employ information entropy neural estimation to utilize the potential of unlabeled samples.
1 code implementation • 28 Mar 2024 • Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao
However, the exploration of 3D dense captioning in outdoor scenes is hindered by two major challenges: 1) the domain gap between indoor and outdoor scenes, such as dynamics and sparse visual inputs, makes it difficult to directly adapt existing indoor methods; 2) the lack of data with comprehensive box-caption pair annotations specifically tailored for outdoor scenes.
no code implementations • 19 Feb 2024 • Xiaoyu Tian, Junru Gu, Bailin Li, Yicheng Liu, Yang Wang, Zhiyong Zhao, Kun Zhan, Peng Jia, Xianpeng Lang, Hang Zhao
A primary hurdle of autonomous driving in urban environments is understanding complex and long-tail scenarios, such as challenging road conditions and delicate human behaviors.
no code implementations • 2 Jan 2024 • Tao Tang, Dafeng Wei, Zhengyu Jia, Tian Gao, Changwei Cai, Chengkai Hou, Peng Jia, Kun Zhan, Haiyang Sun, Jingchen Fan, Yixing Zhao, Fu Liu, Xiaodan Liang, Xianpeng Lang, Yang Wang
Furthermore, there lack of well-formed retrieval datasets for effective evaluation.
2 code implementations • 2 Jan 2024 • Yunzhi Yan, Haotong Lin, Chenxu Zhou, Weijie Wang, Haiyang Sun, Kun Zhan, Xianpeng Lang, Xiaowei Zhou, Sida Peng
Recent methods extend NeRF by incorporating tracked vehicle poses to animate vehicles, enabling photo-realistic view synthesis of dynamic urban street scenes.
1 code implementation • 27 Jul 2023 • Yixian Ma, Kun Zhan
AttM aggregates higher-order structure and feature information to get an excellent embedding, while DiFM balances the state of each node in the graph through Laplacian diffusion learning and allows the cooperative evolution of adjacency and feature information in the graph.
1 code implementation • 26 Jul 2023 • Zhibo Tain, Xiaolin Zhang, Peng Zhang, Kun Zhan
Semi-supervised semantic segmentation (SSS) is an important task that utilizes both labeled and unlabeled data to reduce expenses on labeling training examples.
1 code implementation • 26 Jul 2023 • Yixuan Ma, Xiaolin Zhang, Peng Zhang, Kun Zhan
In this paper, we theoretically illustrate that the entropy of a dataset can be approximated by maximizing the lower bound of the mutual information across different views of a graph, \ie, entropy is estimated by a neural network.
1 code implementation • 22 Jun 2023 • Yumou Tang, Kun Zhan, Zhibo Tian, Mingxuan Zhang, Saisai Wang, Xueming Wen
Pancreas segmentation is challenging due to the small proportion and highly changeable anatomical structure.
1 code implementation • 2 Dec 2021 • Chenghua Liu, Zhuolin Liao, Yixuan Ma, Kun Zhan
Meanwhile, instead of using auto-encoder in most unsupervised learning graph neural networks, SDSNE uses a co-supervised strategy with structure information to supervise the model learning.
Ranked #1 on Multiview Clustering on Multilingual Reuters
no code implementations • 1 Jan 2021 • Zhuolin Liao, Kun Zhan
The supervised loss uses the known labeled set, while a view-consistent loss is applied to the two views to obtain the consistent representation and a pseudo-label loss is designed by using the common high-confidence predictions.
no code implementations • 1 Jan 2021 • Chenhua Liu, Kun Zhan
We use two different branches, and inputs of the two branches are the same, which are composed of structure and feature information.
1 code implementation • 2 Sep 2020 • Kun Zhan, Chaoxi Niu
We propose a new training method named as mutual teaching, i. e., we train dual models and let them teach each other during each batch.
Ranked #1 on Node Classification on CiteSeer (0.5%)
1 code implementation • 19 Jun 2019 • Changlu Chen, Chaoxi Niu, Xia Zhan, Kun Zhan
Based on the pretrained model and the constructed graph, we add a self-expressive layer to complete the generative model and then fine-tune it with a new loss function, including the reconstruction loss and a deliberately defined locality-preserving loss.
no code implementations • 22 Sep 2017 • Xuanyi Dong, Guoliang Kang, Kun Zhan, Yi Yang
For most state-of-the-art architectures, Rectified Linear Unit (ReLU) becomes a standard component accompanied with each layer.
Ranked #11 on Image Classification on SVHN