1 code implementation • 10 Feb 2025 • Yuqi Lin, Hengjia Li, Wenqi Shao, Zheng Yang, Jun Zhao, Xiaofei He, Ping Luo, Kaipeng Zhang
In contrast to prior refinement techniques that are tailored to specific models or tasks in a closed-world manner, we propose SAMRefiner, a universal and efficient approach that adapts SAM to the mask refinement task.
no code implementations • 20 Dec 2024 • Hengjia Li, Yang Liu, Yibo Zhao, Haoran Cheng, Yang Yang, Linxuan Xia, Zekai Luo, Qibo Qiu, Boxi Wu, Tu Zheng, Zheng Yang, Deng Cai
To enhance pose and identity consistency, we further propose a hierarchical spatial consistency loss to align the spatial structure between the generated images in the source and target domains.
no code implementations • 26 Aug 2024 • Yiyang Jia, Guohong Peng, Zheng Yang, Tianhao Chen
In this survey, we provide an overview of category theory-derived machine learning from four mainstream perspectives: gradient-based learning, probability-based learning, invariance and equivalence-based learning, and topos-based learning.
no code implementations • 12 Jul 2024 • Linhan Xia, Yicheng Yang, Ziou Chen, Zheng Yang, Shengxin Zhu
This study proposes a multi-modal movie recommendation system that extracts features from each movie's well-designed poster and its narrative text description.
1 code implementation • 14 Apr 2024 • Guoxuan Chi, Zheng Yang, Chenshu Wu, Jingao Xu, Yuchong Gao, Yunhao Liu, Tony Xiao Han
In this work, inspired by the stellar achievements of the diffusion model in CV and NLP, we adapt it to the RF domain and propose RF-Diffusion.
no code implementations • 23 Jan 2024 • Hengjia Li, Yang Liu, Yuqi Lin, Zhanwei Zhang, Yibo Zhao, Weihang Pan, Tu Zheng, Zheng Yang, Yuchun Jiang, Boxi Wu, Deng Cai
In this paper, we propose UniHDA, a unified and versatile framework for generative hybrid domain adaptation with multi-modal references from multiple domains.
1 code implementation • 20 Dec 2023 • Yuqi Lin, Minghao Chen, Kaipeng Zhang, Hengjia Li, Mingming Li, Zheng Yang, Dongqin Lv, Binbin Lin, Haifeng Liu, Deng Cai
As a result, we dissect the preservation of patch-wise spatial information in CLIP and propose a local-to-global framework to obtain image tags.
no code implementations • 14 Dec 2023 • Yibo Zhao, Liang Peng, Yang Yang, Zekai Luo, Hengjia Li, Yao Chen, Zheng Yang, Xiaofei He, Wei Zhao, Qinglin Lu, Boxi Wu, Wei Liu
It focuses on controlling specific local regions according to user-defined image conditions, while the remaining regions are conditioned only by the original text prompt.
1 code implementation • 29 Nov 2023 • Liang Peng, Haoran Cheng, Zheng Yang, Ruisi Zhao, Linxuan Xia, Chaotian Song, Qinglin Lu, Boxi Wu, Wei Liu
By applying the loss to existing one-shot video tuning methods, we significantly improve the overall consistency and smoothness of the generated videos.
1 code implementation • 19 Nov 2023 • Ping Li, Chenhan Zhang, Zheng Yang, Xianghua Xu, Mingli Song
To this end, we present a Pair-wise Layer Attention with Spatial Masking (PLA-SM) framework for video prediction to capture the spatiotemporal dynamics, which reflect the motion trend.
1 code implementation • 30 Oct 2023 • Hengjia Li, Yang Liu, Linxuan Xia, Yuqi Lin, Tu Zheng, Zheng Yang, Wenxiao Wang, Xiaohui Zhong, Xiaobo Ren, Xiaofei He
Concretely, the distance loss blends the attributes of all target domains by reducing the distances from generated images to all target subspaces.
1 code implementation • 1 Aug 2023 • Zhihao Chi, Tu Zheng, Hengjia Li, Zheng Yang, Boxi Wu, Binbin Lin, Deng Cai
In this paper, we revisit the temperature hyper-parameter and find that, when it is a single value, it cannot sufficiently distill the knowledge from each sample.
1 code implementation • CVPR 2024 • Liang Peng, Junkai Xu, Haoran Cheng, Zheng Yang, Xiaopei Wu, Wei Qian, Wenxiao Wang, Boxi Wu, Deng Cai
Monocular 3D detection is a challenging task due to the lack of accurate 3D information.
no code implementations • 31 Mar 2023 • Hengjia Li, Tu Zheng, Zhihao Chi, Zheng Yang, Wenxiao Wang, Boxi Wu, Binbin Lin, Deng Cai
To tackle these problems, we propose Asymmetric Parallel Point Transformer (APPT).
no code implementations • 31 Oct 2022 • Litian Li, Zheng Yang, Ronggang Wang
Benefiting from flexible network designs and an end-to-end joint optimization approach, learned image compression (LIC) has demonstrated excellent coding performance and practical feasibility in recent years.
1 code implementation • 18 Jul 2022 • Liang Peng, Xiaopei Wu, Zheng Yang, Haifeng Liu, Deng Cai
Therefore, we propose to reformulate the instance depth as the combination of the instance visual surface depth (visual depth) and the instance attribute depth (attribute depth).
no code implementations • 24 Jun 2022 • Hao Wu, Yongqiang Cheng, Xixi Chen, Zheng Yang, Xiang Li, Hongqiang Wang
These advantages benefit from the geometry of the Toeplitz Hermitian positive definite (HPD) manifold $\mathcal{M}_{\mathcal{T}H_{++}}$, but the sophisticated geometry also results in some challenges for geometric detectors, such as the implementation of the enhanced detector to improve the SCR (signal-to-clutter ratio) and the analysis of the detection performance.
no code implementations • 20 Jun 2022 • Zheng Yang, Yi Zhang, Guoxuan Chi, Guidong Zhang
With the rapid development of wireless communication technology, wireless access points (AP) and internet of things (IoT) devices have been widely deployed in our surroundings.
4 code implementations • CVPR 2022 • Tu Zheng, Yifei Huang, Yang Liu, Wenjian Tang, Zheng Yang, Deng Cai, Xiaofei He
In this way, we can exploit more contextual information to detect lanes while leveraging local detailed lane features to improve localization accuracy.
Ranked #1 on Lane Detection on LLAMAS
1 code implementation • ICLR 2022 • Liang Peng, Senbo Yan, Boxi Wu, Zheng Yang, Xiaofei He, Deng Cai
This network is learned by minimizing our newly-proposed 3D alignment loss between the 3D box estimates and the corresponding RoI LiDAR points.
Ranked #3 on Weakly Supervised 3D Detection on KITTI-360
1 code implementation • 19 Apr 2021 • Liang Peng, Fei Liu, Zhengxu Yu, Senbo Yan, Dan Deng, Zheng Yang, Haifeng Liu, Deng Cai
We delve into this underlying mechanism and then empirically find that: concerning the label accuracy, the 3D location part in the label is preferred compared to other parts of labels.
1 code implementation • 18 Mar 2021 • Zili Liu, Guodong Xu, Honghui Yang, Minghao Chen, Kuoliang Wu, Zheng Yang, Haifeng Liu, Deng Cai
In this work, we propose a suppress-and-refine framework to remove these handcrafted components.
no code implementations • 24 Nov 2020 • Hongkang Shi, Yuqiong Cheng, Zheng Yang, Yuntian Chen, Shubo Wang
Optical isolation enables nonreciprocal manipulations of light with broad applications in optical communications.
Optics
no code implementations • 2 Oct 2020 • Yuan Hui, Zheng Yang, Hao Yu
The magnetization evolution of the free layer in an orthogonal spin-torque device is studied based on a macrospin model.
Mesoscale and Nanoscale Physics
4 code implementations • 31 Aug 2020 • Tu Zheng, Hao Fang, Yi Zhang, Wenjian Tang, Zheng Yang, Haifeng Liu, Deng Cai
Lane detection is one of the most important tasks in self-driving.
Ranked #5 on Lane Detection on TuSimple
no code implementations • 13 Jul 2020 • Wenying Wu, Pavlos Protopapas, Zheng Yang, Panagiotis Michalatos
We worked to increase classification accuracy and mitigate algorithmic biases on our baseline model trained on the augmented benchmark database.
no code implementations • 1 Apr 2020 • Guodong Xu, Wenxiao Wang, Zili Liu, Liang Xie, Zheng Yang, Haifeng Liu, Deng Cai
3D object detection based on point clouds has become more and more popular.
no code implementations • 14 Nov 2019 • Liang Xie, Chao Xiang, Zhengxu Yu, Guodong Xu, Zheng Yang, Deng Cai, Xiaofei He
Moreover, based on the PACF module, we propose a 3D multi-sensor multi-task network called Pointcloud-Image RCNN (PI-RCNN for short), which handles the image segmentation and 3D object detection tasks.
2 code implementations • NeurIPS 2019 • Shuai Zhao, Yang Wang, Zheng Yang, Deng Cai
In this paper, we develop a region mutual information (RMI) loss to model the dependencies among pixels more simply and efficiently.
6 code implementations • 2 Sep 2019 • Zili Liu, Tu Zheng, Guodong Xu, Zheng Yang, Haifeng Liu, Deng Cai
Experiments on MS COCO show that our TTFNet has great advantages in balancing training time, inference speed, and accuracy.
no code implementations • 16 Jul 2019 • Boxi Wu, Shuai Zhao, Wenqing Chu, Zheng Yang, Deng Cai
To be specific, our method explicitly requires the network to predict semantic segmentation as well as dilated affinity, which is a sparse version of pair-wise pixel affinity.
no code implementations • 17 May 2018 • Kaixuan Chen, Lina Yao, Xianzhi Wang, Dalin Zhang, Tao Gu, Zhiwen Yu, Zheng Yang
Multimodal features play a key role in wearable sensor-based human activity recognition (HAR).
no code implementations • 27 Mar 2018 • Zheng Yang, Hang Lei
This article presents the formal syntax and semantics for a large subset of the Solidity programming language developed for the Ethereum blockchain platform, based on our recent work on developing a general, extensible, and reusable formal memory (GERM) framework and an extension of the Curry-Howard isomorphism, denoted as execution-verification isomorphism (EVI).
Programming Languages
no code implementations • 27 Jan 2018 • Wei Li, Zheng Yang, Xu Sun
Traditional Chinese Medicine (TCM) is an influential form of medical treatment in China and surrounding areas.
no code implementations • 6 Nov 2017 • Wei Li, Zheng Yang
Traditional Chinese Medicine (TCM) has accumulated a large amount of valuable resources over its long history of development.
no code implementations • 6 Jun 2017 • Xiang Zhang, Lina Yao, Chaoran Huang, Tao Gu, Zheng Yang, Yunhao Liu
Biometric authentication involves various technologies to identify individuals by exploiting their unique, measurable physiological and behavioral characteristics.