Search Results for author: Fan Lu

Found 31 papers, 14 papers with code

Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition

1 code implementation 17 Feb 2025 Jianyi Peng, Fan Lu, Bin Li, Yuan Huang, Sanqing Qu, Guang Chen

Compared to single-modal VPR, this approach benefits from the widespread availability of RGB cameras and the robustness of point clouds in providing accurate spatial geometry and distance information.

Re-Ranking Triplet +1

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

1 code implementation 11 Dec 2024 Fan Lu, Wei Wu, Kecheng Zheng, Shuailei Ma, Biao Gong, Jiawei Liu, Wei Zhai, Yang Cao, Yujun Shen, Zheng-Jun Zha

Generating detailed captions comprehending text-rich visual content in images has received growing attention for Large Vision-Language Models (LVLMs).

Attribute Benchmarking +2

Learning Visual Generative Priors without Text

no code implementations 10 Dec 2024 Shuailei Ma, Kecheng Zheng, Ying WEI, Wei Wu, Fan Lu, Yifei Zhang, Chen-Wei Xie, Biao Gong, Jiapeng Zhu, Yujun Shen

Although text-to-image (T2I) models have recently thrived as visual generative priors, their reliance on high-quality text-image pairs makes scaling up expensive.

Image to 3D Philosophy

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

no code implementations 7 Oct 2024 Wei Wu, Kecheng Zheng, Shuailei Ma, Fan Lu, Yuxin Guo, Yifei Zhang, Wei Chen, Qingpei Guo, Yujun Shen, Zheng-Jun Zha

Then, after incorporating corner tokens to aggregate diverse textual information, we manage to help the model catch up to its original level of short text understanding yet greatly enhance its capability of long text understanding.

Image Classification Image Retrieval

GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields

no code implementations 8 Jul 2024 Weiyi Xue, Zehan Zheng, Fan Lu, Haiyun Wei, Guang Chen, Changjun Jiang

Based on this, we propose Geometry guided Neural LiDAR Fields (GeoNLF), a hybrid framework that alternates between global neural reconstruction and pure geometric pose optimization.

NeRF Novel View Synthesis +2

RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-based 3D Neural Modeling

no code implementations 27 May 2024 Tianhang Wang, Fan Lu, Zehan Zheng, Guang Chen, Changjun Jiang

To address the above problems, we propose RCDN, a Robust Camera-insensitivity collaborative perception method with a novel Dynamic feature-based 3D Neural modeling mechanism.

Neural Rendering

Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior

no code implementations 10 Apr 2024 Fan Lu, Kwan-Yee Lin, Yan Xu, Hongsheng Li, Guang Chen, Changjun Jiang

(2) To handle the unbounded nature of urban scenes, we represent the 3D scene with a Scalable Hash Grid structure, which incrementally adapts to the growing scale of urban scenes.

3D Generation Model Optimization +2

DreamLIP: Language-Image Pre-training with Long Captions

1 code implementation 25 Mar 2024 Kecheng Zheng, Yifei Zhang, Wei Wu, Fan Lu, Shuailei Ma, Xin Jin, Wei Chen, Yujun Shen

Motivated by this, we propose to dynamically sample sub-captions from the text label to construct multiple positive pairs, and introduce a grouping loss to match the embeddings of each sub-caption with its corresponding local image patches in a self-supervised manner.

Contrastive Learning Image-text Retrieval +5

PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds

no code implementations 29 Feb 2024 Haotian Liu, Sanqing Qu, Fan Lu, Zongtao Bu, Florian Roehrbein, Alois Knoll, Guang Chen

Therefore, existing complementary learning approaches for MDE fuse intensity information from images and scene details from event data for better scene understanding.

Depth Prediction Monocular Depth Estimation +2

Edge-Enabled Anomaly Detection and Information Completion for Social Network Knowledge Graphs

no code implementations 13 Jan 2024 Fan Lu, Quan Qi, Huaibin Qin

Firstly, we introduce a lightweight distributed knowledge graph completion architecture that utilizes knowledge graph embedding for data analysis.

Anomaly Detection Edge-computing +1

Joint Extraction of Uyghur Medicine Knowledge with Edge Computing

no code implementations 13 Jan 2024 Fan Lu, Quan Qi, Huaibin Qin

To address these challenges, a joint extraction model with parameter sharing in edge computing is proposed, named CoEx-Bert.

Edge-computing Relation Extraction +1

Likelihood-Aware Semantic Alignment for Full-Spectrum Out-of-Distribution Detection

1 code implementation 4 Dec 2023 Fan Lu, Kai Zhu, Kecheng Zheng, Wei Zhai, Yang Cao

Full-spectrum out-of-distribution (F-OOD) detection aims to accurately recognize in-distribution (ID) samples while encountering semantic and covariate shifts simultaneously.

Out-of-Distribution Detection

HDMNet: A Hierarchical Matching Network with Double Attention for Large-scale Outdoor LiDAR Point Cloud Registration

no code implementations 29 Oct 2023 Weiyi Xue, Fan Lu, Guang Chen

Specifically, a novel feature consistency enhanced double-soft matching network is introduced to achieve two-stage matching with high flexibility while enlarging the receptive field with high efficiency in a patch-to-patch manner, which significantly improves the registration performance.

Point Cloud Registration Pose Estimation

Convex Q Learning in a Stochastic Environment: Extended Version

no code implementations 10 Sep 2023 Fan Lu, Sean Meyn

The main contributions firstly concern properties of the relaxation, described as a deterministic convex program: we identify conditions for a bounded solution, and a significant relationship between the solution to the new convex program, and the solution to standard Q-learning.

Q-Learning

Urban Radiance Field Representation with Deformable Neural Mesh Primitives

1 code implementation ICCV 2023 Fan Lu, Yan Xu, Guang Chen, Hongsheng Li, Kwan-Yee Lin, Changjun Jiang

To construct urban-level radiance fields efficiently, we design Deformable Neural Mesh Primitive (DNMP), and propose to parameterize the entire scene with such primitives.

Image Generation Novel View Synthesis

NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation

1 code implementation CVPR 2023 Zehan Zheng, Danni Wu, Ruisi Lu, Fan Lu, Guang Chen, Changjun Jiang

In light of these issues, we present NeuralPCI: an end-to-end 4D spatio-temporal Neural field for 3D Point Cloud Interpolation, which implicitly integrates multi-frame information to handle nonlinear large motions for both indoor and outdoor scenarios.

3D Point Cloud Interpolation Autonomous Driving

Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection

1 code implementation CVPR 2023 Fan Lu, Kai Zhu, Wei Zhai, Kecheng Zheng, Yang Cao

Semantically coherent out-of-distribution (SCOOD) detection aims to discern outliers from the intended data distribution with access to an unlabeled extra set.

Out-of-Distribution Detection

Sufficient Exploration for Convex Q-learning

no code implementations 17 Oct 2022 Fan Lu, Prashant Mehta, Sean Meyn, Gergely Neu

The main contributions follow: (i) The dual of convex Q-learning is not precisely Manne's LP or a version of logistic Q-learning, but has similar structure that reveals the need for regularization to avoid over-fitting.

OpenAI Gym Q-Learning

Model-Free Characterizations of the Hamilton-Jacobi-Bellman Equation and Convex Q-Learning in Continuous Time

no code implementations 14 Oct 2022 Fan Lu, Joel Mathias, Sean Meyn, Karanjit Kalsi

Convex Q-learning is a recent approach to reinforcement learning, motivated by the possibility of a firmer theory for convergence, and the possibility of making use of greater a priori knowledge regarding policy or value function structure.

Q-Learning

Modeling User Behavior with Graph Convolution for Personalized Product Search

1 code implementation 12 Feb 2022 Fan Lu, Qimai Li, Bo Liu, Xiao-Ming Wu, Xiaotong Zhang, Fuyu Lv, Guli Lin, Sen Li, Taiwei Jin, Keping Yang

Our approach can be seamlessly integrated with existing latent space based methods and be potentially applied in any product retrieval method that uses purchase history to model user preferences.

Learning Semantic Representations Retrieval

HRegNet: A Hierarchical Network for Large-scale Outdoor LiDAR Point Cloud Registration

1 code implementation ICCV 2021 Fan Lu, Guang Chen, Yinlong Liu, Lijun Zhang, Sanqing Qu, Shu Liu, Rongqi Gu

Extensive experiments are conducted on two large-scale outdoor LiDAR point cloud datasets to demonstrate the high accuracy and efficiency of the proposed HRegNet.

Point Cloud Registration

PointINet: Point Cloud Frame Interpolation Network

1 code implementation 18 Dec 2020 Fan Lu, Guang Chen, Sanqing Qu, Zhijun Li, Yinlong Liu, Alois Knoll

Generally, the frame rates of mechanical LiDAR sensors are 10 to 20 Hz, which is much lower than those of other commonly used sensors such as cameras.

3D Point Cloud Interpolation

MoNet: Motion-based Point Cloud Prediction Network

no code implementations 21 Nov 2020 Fan Lu, Guang Chen, Yinlong Liu, Zhijun Li, Sanqing Qu, Tianpei Zou

3D point clouds accurately model 3D information of the surrounding environment and are crucial for intelligent vehicles to perceive the scene.

Autonomous Driving Prediction

LAP-Net: Adaptive Features Sampling via Learning Action Progression for Online Action Detection

no code implementations 16 Nov 2020 Sanqing Qu, Guang Chen, Dan Xu, Jinhu Dong, Fan Lu, Alois Knoll

At each time step, this sampling strategy first estimates the current action progression and then decides which temporal ranges should be used to aggregate the optimal supplementary features.

Online Action Detection

RSKDD-Net: Random Sample-based Keypoint Detector and Descriptor

1 code implementation NeurIPS 2020 Fan Lu, Guang Chen, Yinlong Liu, Zhongnan Qu, Alois Knoll

To tackle the information loss of random sampling, we exploit a novel random dilation cluster strategy to enlarge the receptive field of each sampled point and an attention mechanism to aggregate the positions and features of neighbor points.

Point Cloud Registration Saliency Prediction

Zap Q-Learning With Nonlinear Function Approximation

no code implementations NeurIPS 2020 Shuhang Chen, Adithya M. Devraj, Fan Lu, Ana Bušić, Sean P. Meyn

Based on multiple experiments with a range of neural network sizes, it is found that the new algorithms converge quickly and are robust to choice of function approximation architecture.

OpenAI Gym Q-Learning +1
