Search Results for author: Zhengxin Li

Found 11 papers, 5 papers with code

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

2 code implementations19 Jan 2024 Chenyu Wang, Weixin Luo, Qianyu Chen, Haonan Mai, Jindi Guo, Sixun Dong, Xiaohua, Xuan, Zhengxin Li, Lin Ma, Shenghua Gao

Recently, the astonishing performance of large language models (LLMs) in natural language comprehension and generation tasks triggered lots of exploration of using them as central controllers to build agent systems.

Language Modelling Large Language Model

TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding

1 code implementation6 Nov 2023 Shuo Wang, Jing Li, Zibo Zhao, Dongze Lian, Binbin Huang, Xiaomei Wang, Zhengxin Li, Shenghua Gao

Holistic scene understanding includes semantic segmentation, surface normal estimation, object boundary detection, depth estimation, etc.

Boundary Detection Depth Estimation +5

P$^2$SDF for Neural Indoor Scene Reconstruction

no code implementations1 Mar 2023 Jing Li, Jinpeng Yu, Ruoyu Wang, Zhengxin Li, Zhengyu Zhang, Lina Cao, Shenghua Gao

As the unsupervised plane segments are usually noisy and inaccurate, we propose to assign different weights to the sampled points on the plane in plane estimation as well as the regularization loss.

Indoor Scene Reconstruction Surface Reconstruction

TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting

1 code implementation CVPR 2022 Huazhang Hu, Sixun Dong, Yiqun Zhao, Dongze Lian, Zhengxin Li, Shenghua Gao

Existing methods focus on performing repetitive action counting in short videos, which is tough for dealing with longer videos in more realistic scenarios.

Repetitive Action Counting

Proxy-bridged Image Reconstruction Network for Anomaly Detection in Medical Images

no code implementations5 Oct 2021 Kang Zhou, Jing Li, Weixin Luo, Zhengxin Li, Jianlong Yang, Huazhu Fu, Jun Cheng, Jiang Liu, Shenghua Gao

To mitigate this problem, in this paper, we propose a novel Proxy-bridged Image Reconstruction Network (ProxyAno) for anomaly detection in medical images.

Anomaly Detection Image Reconstruction

Crowd Counting With Partial Annotations in an Image

1 code implementation ICCV 2021 Yanyu Xu, Ziming Zhong, Dongze Lian, Jing Li, Zhengxin Li, Xinxing Xu, Shenghua Gao

To fully leverage the data captured from different scenes with different view angles while reducing the annotation cost, this paper studies a novel crowd counting setting, i. e. only using partial annotations in each image as training data.

Active Learning Crowd Counting

Sparse PCA via $l_{2,p}$-Norm Regularization for Unsupervised Feature Selection

no code implementations29 Dec 2020 Zhengxin Li, Feiping Nie, Jintang Bian, Xuelong Li

However, real-world data contain a large number of noise samples and features, making the similarity matrix constructed by original data cannot be completely reliable.

feature selection

Exact Indexing of Time Series under Dynamic Time Warping

no code implementations11 Feb 2020 Zhengxin Li

Unfortunately, there is still a lack of an effective lower bounding distance that can measure unequal-length time series and has desirable tightness.

Dynamic Time Warping Time Series +1

Cannot find the paper you are looking for? You can Submit a new open access paper.