Search Results for author: Yiming Ren

Found 20 papers, 6 papers with code

NPC: Neural Predictive Control for Fuel-Efficient Autonomous Trucks

no code implementations18 Dec 2024 Jiaping Ren, Jiahao Xiang, Hongfei Gao, Jinchuan Zhang, Yiming Ren, Yuexin Ma, Yi Wu, Ruigang Yang, Wei Li

Fuel efficiency is a crucial aspect of long-distance cargo transportation by oil-powered trucks that economize on costs and decrease carbon emissions.

FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments

no code implementations7 Nov 2024 Aoru Xue, Yiming Ren, Zining Song, Mao Ye, Xinge Zhu, Yuexin Ma

We propose a novel hybrid calibration-free method FreeCap to accurately capture global multi-person motions in open environments.

Motion Estimation

Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance

1 code implementation21 Oct 2024 Zhangwei Gao, Zhe Chen, Erfei Cui, Yiming Ren, Weiyun Wang, Jinguo Zhu, Hao Tian, Shenglong Ye, Junjun He, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Jifeng Dai, Wenhai Wang

Multimodal large language models (MLLMs) have demonstrated impressive performance in vision-language tasks across a broad spectrum of domains.

Autonomous Driving

Towards Practical Human Motion Prediction with LiDAR Point Clouds

no code implementations15 Aug 2024 Xiao Han, Yiming Ren, Yichen Yao, Yujing Sun, Yuexin Ma

In this paper, we propose \textit{LiDAR-HMP}, the first single-LiDAR-based 3D human motion prediction approach, which receives the raw LiDAR point cloud as input and forecasts future 3D human poses directly.

Human motion prediction motion prediction +1

LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment

no code implementations13 Jul 2024 Yiming Ren, Xiao Han, Yichen Yao, Xiaoxiao Long, Yujing Sun, Yuexin Ma

LiDAR-based human motion capture has garnered significant interest in recent years for its practicability in large-scale and unconstrained environments.

Needle In A Multimodal Haystack

1 code implementation11 Jun 2024 Weiyun Wang, Shuibo Zhang, Yiming Ren, Yuchen Duan, Tiantong Li, Shuo Liu, Mengkang Hu, Zhe Chen, Kaipeng Zhang, Lewei Lu, Xizhou Zhu, Ping Luo, Yu Qiao, Jifeng Dai, Wenqi Shao, Wenhai Wang

In this work, we present Needle In A Multimodal Haystack (MM-NIAH), the first benchmark specifically designed to systematically evaluate the capability of existing MLLMs to comprehend long multimodal documents.

Retrieval

A Unified Framework for Human-centric Point Cloud Video Understanding

no code implementations CVPR 2024 Yiteng Xu, Kecheng Ye, Xiao Han, Yiming Ren, Xinge Zhu, Yuexin Ma

Human-centric Point Cloud Video Understanding (PVU) is an emerging field focused on extracting and interpreting human-related features from sequences of human point clouds, further advancing downstream human-centric tasks and applications.

3D Pose Estimation Action Recognition +4

The All-Seeing Project V2: Towards General Relation Comprehension of the Open World

1 code implementation29 Feb 2024 Weiyun Wang, Yiming Ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, Yu Qiao, Jifeng Dai

In addition, we design a new benchmark, termed Circular-based Relation Probing Evaluation (CRPE) for comprehensively evaluating the relation comprehension capabilities of MLLMs.

Hallucination Object Localization +3

LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment

no code implementations CVPR 2024 Yiming Ren, Xiao Han, Chengfeng Zhao, Jingya Wang, Lan Xu, Jingyi Yu, Yuexin Ma

For human-centric large-scale scenes, fine-grained modeling for 3D human global pose and shape is significant for scene understanding and can benefit many real-world applications.

Scene Understanding

3D-Mol: A Novel Contrastive Learning Framework for Molecular Property Prediction with 3D Information

no code implementations28 Sep 2023 Taojie Kuang, Yiming Ren, Zhixiang Ren

Molecular property prediction, crucial for early drug candidate screening and optimization, has seen advancements with deep learning-based methods.

Contrastive Learning Molecular Property Prediction +2

GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

1 code implementation14 Aug 2023 PengFei Liu, Yiming Ren, Jun Tao, Zhixiang Ren

Large language models have made significant strides in natural language processing, enabling innovative applications in molecular science by processing textual representations of molecules.

Image Captioning Language Modeling +6

Weakly Supervised 3D Multi-person Pose Estimation for Large-scale Scenes based on Monocular Camera and Single LiDAR

no code implementations30 Nov 2022 Peishan Cong, Yiteng Xu, Yiming Ren, Juze Zhang, Lan Xu, Jingya Wang, Jingyi Yu, Yuexin Ma

Motivated by this, we propose a monocular camera and single LiDAR-based method for 3D multi-person pose estimation in large-scale scenes, which is easy to deploy and insensitive to light.

3D Multi-Person Pose Estimation 3D Pose Estimation +2

Predicting Football Match Outcomes with eXplainable Machine Learning and the Kelly Index

no code implementations28 Nov 2022 Yiming Ren, Teo Susnjak

An approach was developed that minimises risk by combining the Kelly Index with the predefined confidence thresholds of the predictive models.

Benchmarking

Gait Recognition in Large-scale Free Environment via Single LiDAR

no code implementations22 Nov 2022 Xiao Han, Yiming Ren, Peishan Cong, Yujing Sun, Jingya Wang, Lan Xu, Yuexin Ma

Human gait recognition is crucial in multimedia, enabling identification through walking patterns without direct interaction, enhancing the integration across various media forms in real-world applications like smart homes, healthcare and non-intrusive security.

Gait Recognition in the Wild

LiDAR-aid Inertial Poser: Large-scale Human Motion Capture by Sparse Inertial and LiDAR Sensors

no code implementations30 May 2022 Yiming Ren, Chengfeng Zhao, Yannan He, Peishan Cong, Han Liang, Jingyi Yu, Lan Xu, Yuexin Ma

We propose a multi-sensor fusion method for capturing challenging 3D human motions with accurate consecutive local poses and global trajectories in large-scale scenarios, only using single LiDAR and 4 IMUs, which are set up conveniently and worn lightly.

Sensor Fusion Translation

STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes

1 code implementation CVPR 2022 Peishan Cong, Xinge Zhu, Feng Qiao, Yiming Ren, Xidong Peng, Yuenan Hou, Lan Xu, Ruigang Yang, Dinesh Manocha, Yuexin Ma

In addition, considering the property of sparse global distribution and density-varying local distribution of pedestrians, we further propose a novel method, Density-aware Hierarchical heatmap Aggregation (DHA), to enhance pedestrian perception in crowded scenes.

3D Object Detection Pedestrian Detection +1

Self-supervised Point Cloud Completion on Real Traffic Scenes via Scene-concerned Bottom-up Mechanism

no code implementations20 Mar 2022 Yiming Ren, Peishan Cong, Xinge Zhu, Yuexin Ma

In this paper, we propose a self-supervised point cloud completion method (TraPCC) for vehicles in real traffic scenes without any complete data.

Point Cloud Completion

Train and Deploy an Image Classifier for Disaster Response

no code implementations12 May 2020 Jianyu Mao, Kiana Harris, Nae-Rong Chang, Caleb Pennell, Yiming Ren

With Deep Learning Image Classification becoming more powerful each year, it is apparent that its introduction to disaster response will increase the efficiency that responders can work with.

Disaster Response Image Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.