Search Results for author: Yue Wang

Found 343 papers, 133 papers with code

Sentence-Level Resampling for Named Entity Recognition

1 code implementation NAACL 2022 Xiaochen Wang, Yue Wang

As a fundamental task in natural language processing, named entity recognition (NER) aims to locate and classify named entities in unstructured text.

Data Augmentation named-entity-recognition +3

基于词信息嵌入的汉语构词结构识别研究(Chinese Word-Formation Prediction based on Representations of Word-Related Features)

no code implementations CCL 2021 Hua Zheng, Yaqi Yan, Yue Wang, Damai Dai, Yang Liu

“作为一种意合型语言, 汉语中的构词结构刻画了构词成分之间的组合关系, 是认知、理解词义的关键。在中文信息处理领域, 此前的构词结构识别工作大多沿用句法层面的粗粒度标签, 且主要基于上下文等词间信息建模, 忽略了语素义、词义等词内信息对构词结构识别的作用。本文采用语言学视域下的构词结构标签体系, 构建汉语构词结构及相关信息数据集, 提出了一种基于Bi-LSTM和Self-attention的模型, 以此来探究词内、词间等多方面信息对构词结构识别的潜在影响和能达到的性能。实验取得了良好的预测效果, 准确率77. 87%, F1值78. 36%;同时, 对比测试揭示, 词内的语素义信息对构词结构识别具有显著的贡献, 而词间的上下文信息贡献较弱且带有较强的不稳定性。该预测方法与数据集, 将为中文信息处理的多种任务, 如语素和词结构分析、词义识别与生成、语言文字研究与词典编纂等提供新的观点和方案。”

How to Re-enable PDE Loss for Physical Systems Modeling Under Partial Observation

no code implementations12 Dec 2024 Haodong Feng, Yue Wang, Dixia Fan

Effectively integrating PDE loss as a constraint of system transition can improve the model's prediction by overcoming generalization issues due to data scarcity, especially when data acquisition is costly.

LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models

no code implementations10 Dec 2024 Ziqi Lu, Heng Yang, Danfei Xu, Boyi Li, Boris Ivanovic, Marco Pavone, Yue Wang

Emerging 3D geometric foundation models, such as DUSt3R, offer a promising approach for in-the-wild 3D vision tasks.

3D Reconstruction Pose Estimation

Wavelet Diffusion Neural Operator

1 code implementation6 Dec 2024 Peiyan Hu, Rui Wang, Xiang Zheng, Tao Zhang, Haodong Feng, Ruiqi Feng, Long Wei, Yue Wang, Zhi-Ming Ma, Tailin Wu

Recently, diffusion generative models have emerged as a competitive class of methods for these tasks due to their ability to capture long-term dependencies and model high-dimensional states.

Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning

no code implementations29 Nov 2024 Yang You, Yixin Li, Congyue Deng, Yue Wang, Leonidas Guibas

Vision foundation models, particularly the ViT family, have revolutionized image understanding by providing rich semantic features.

Pose Estimation

Break the ID-Language Barrier: An Adaption Framework for Sequential Recommendation

no code implementations27 Nov 2024 Xiaohan Yu, Li Zhang, Xin Zhao, Yue Wang

The recent breakthrough of large language models (LLMs) in natural language processing has sparked exploration in recommendation systems, however, their limited domain-specific knowledge remains a critical bottleneck.

Sequential Recommendation

Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting

no code implementations21 Nov 2024 Jingyi Xu, Xieyuanli Chen, Junyi Ma, Jiawei Huang, Jintao Xu, Yue Wang, Ling Pei

Existing 3D OCF approaches struggle to predict plausible spatial details for movable objects and suffer from slow inference speeds due to neglecting the bias and uneven distribution of changing occupancy states in both space and time.

Guided MRI Reconstruction via Schrödinger Bridge

no code implementations21 Nov 2024 Yue Wang, Tian Zhou, Zhuo-Xu Cui, Bingsheng Huang, Hairong Zheng, Dong Liang, Yanjie Zhu

This study proposes an SB-based, multi-contrast image-guided reconstruction framework that establishes a diffusion bridge between the guiding and target image distributions.

MRI Reconstruction

AnyECG: Foundational Models for Electrocardiogram Analysis

no code implementations17 Nov 2024 Yue Wang, Xu Cao, Yaojun Hu, Haochao Ying, James Matthew Rehg, Jimeng Sun, Jian Wu, Jintai Chen

Electrocardiogram (ECG), a non-invasive and affordable tool for cardiac monitoring, is highly sensitive in detecting acute heart attacks.

Anomaly Detection Arrhythmia Detection

Large Spatial Model: End-to-end Unposed Images to Semantic 3D

no code implementations24 Oct 2024 Zhiwen Fan, Jian Zhang, Wenyan Cong, Peihao Wang, Renjie Li, Kairun Wen, Shijie Zhou, Achuta Kadambi, Zhangyang Wang, Danfei Xu, Boris Ivanovic, Marco Pavone, Yue Wang

To tackle the scarcity of labeled 3D semantic data and enable natural language-driven scene manipulation, we incorporate a pre-trained 2D language-based segmentation model into a 3D-consistent semantic feature field.

3D Reconstruction Attribute

AttentionPainter: An Efficient and Adaptive Stroke Predictor for Scene Painting

no code implementations21 Oct 2024 Yizhe Tang, Yue Wang, Teng Hu, Ran Yi, Xin Tan, Lizhuang Ma, Yu-Kun Lai, Paul L. Rosin

Stroke-based Rendering (SBR) aims to decompose an input image into a sequence of parameterized strokes, which can be rendered into a painting that resembles the input image.

reinforcement-learning Reinforcement Learning

Multimodal Policies with Physics-informed Representations

no code implementations20 Oct 2024 Haodong Feng, Peiyan Hu, Yue Wang, Dixia Fan

It leverages PDE loss to fit the neural network and data loss calculated on the observations with random quantities and modalities to propagate the information of initial conditions and boundary conditions into the inputs.

HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks

1 code implementation16 Oct 2024 Fengji Zhang, Linquan Wu, Huiyu Bai, Guancheng Lin, Xiao Li, Xiao Yu, Yue Wang, Bei Chen, Jacky Keung

Despite the progress in Large Multimodal Models (LMMs), which extend LLMs with visual perception and understanding capabilities, there remains a notable lack of coding benchmarks that rigorously assess these models, particularly in tasks that emphasize visual reasoning.

Code Generation HumanEval +1

LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images

no code implementations15 Oct 2024 Yuzhou Cheng, Jianhao Jiao, Yue Wang, Dimitrios Kanoulas

Visual localization involves estimating a query image's 6-DoF (degrees of freedom) camera pose, which is a fundamental component in various computer vision and robotic tasks.

Image Retrieval Novel View Synthesis +3

TRENDY: Gene Regulatory Network Inference Enhanced by Transformer

no code implementations14 Oct 2024 Xueying Tian, Yash Patel, Yue Wang

Gene regulatory networks (GRNs) play a crucial role in the control of cellular functions.

Neural Eulerian Scene Flow Fields

no code implementations2 Oct 2024 Kyle Vedder, Neehar Peri, Ishan Khatri, Siyi Li, Eric Eaton, Mehmet Kocamaz, Yue Wang, Zhiding Yu, Deva Ramanan, Joachim Pehserl

We reframe scene flow as the task of estimating a continuous space-time ODE that describes motion for an entire observation sequence, represented with a neural prior.

Autonomous Driving Point Tracking +1

HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting

1 code implementation27 Sep 2024 Nian Ran, Peng Xiao, Yue Wang, Wesley Shi, Jianxin Lin, Qi Meng, Richard Allmendinger

The application of large deep learning models in weather forecasting has led to significant advancements in the field, including higher-resolution forecasting and extended prediction periods exemplified by models such as Pangu and Fuxi.

Deep Learning Weather Forecasting

Safety Verification and Navigation for Autonomous Vehicles based on Signal Temporal Logic Constraints

no code implementations16 Sep 2024 Aditya Parameshwaran, Yue Wang

It inputs a simplified AV state space model and a set of STL specifications, for which it constructs a closed-loop controller.

Autonomous Vehicles Model Predictive Control +1

Multi-scale decomposition of sea surface height snapshots using machine learning

1 code implementation11 Sep 2024 Jingwen Lyu, Yue Wang, Christian Pedersen, Spencer Jones, Dhruv Balwada

Knowledge of ocean circulation is important for understanding and predicting weather and climate, and managing the blue economy.

Data Augmentation Image-to-Image Translation

RING#: PR-by-PE Global Localization with Roto-translation Equivariant Gram Learning

no code implementations30 Aug 2024 Sha Lu, Xuecheng Xu, Yuxuan Wu, Haojian Lu, Xieyuanli Chen, Rong Xiong, Yue Wang

To address this, we introduce a new paradigm, PR-by-PE localization, which bypasses the need for separate place recognition by directly deriving it from pose estimation.

Autonomous Driving Pose Estimation +1

OmniRe: Omni Urban Scene Reconstruction

no code implementations29 Aug 2024 Ziyu Chen, Jiawei Yang, Jiahui Huang, Riccardo de Lutio, Janick Martinez Esturo, Boris Ivanovic, Or Litany, Zan Gojcic, Sanja Fidler, Marco Pavone, Li Song, Yue Wang

To that end, we propose a comprehensive 3DGS framework for driving scenes, named OmniRe, that allows for accurate, full-length reconstruction of diverse dynamic objects in a driving log.

A high-order focus interaction model and oral ulcer dataset for oral ulcer segmentation

1 code implementation Scientific Reports 2024 Chenghao Jiang, Renkai Wu, Yinghao Liu, Yue Wang, Qing Chang, Pengchen Liang, Yuan Fan

Therefore to address this challenge, in this paper a multi-tasking oral ulcer dataset (Autooral) containing two major tasks of lesion segmentation and classification is proposed and made publicly available.

Image Segmentation Lesion Segmentation +2

MambaCapsule: Towards Transparent Cardiac Disease Diagnosis with Electrocardiography Using Mamba Capsule Network

no code implementations30 Jul 2024 Yinlong Xu, Xiaoqiang Liu, Zitai Kong, Yixuan Wu, Yue Wang, Yingzhou Lu, Honghao Gao, Jian Wu, Hongxia Xu

Cardiac arrhythmia, a condition characterized by irregular heartbeats, often serves as an early indication of various heart ailments.

Mamba

Gene Regulatory Network Inference from Pre-trained Single-Cell Transcriptomics Transformer with Joint Graph Learning

no code implementations25 Jul 2024 Sindhura Kommu, Yizhi Wang, Yue Wang, Xuan Wang

Inferring gene regulatory networks (GRNs) from single-cell RNA sequencing (scRNA-seq) data is a complex challenge that requires capturing the intricate relationships between genes and their regulatory interactions.

Graph Learning

Scale Disparity of Instances in Interactive Point Cloud Segmentation

no code implementations19 Jul 2024 Chenrui Han, Xuan Yu, Yuxuan Xie, Yili Liu, Sitong Mao, Shunbo Zhou, Rong Xiong, Yue Wang

However, in the realm of interactive segmentation, the meaning of instance diverges from that in instance segmentation, because users might desire to segment instances of both thing and stuff categories that vary greatly in scale.

Instance Segmentation Interactive Segmentation +3

Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction

no code implementations10 Jul 2024 Yili Liu, Linzhan Mou, Xuan Yu, Chenrui Han, Sitong Mao, Rong Xiong, Yue Wang

Accurate perception of the dynamic environment is a fundamental task for autonomous driving and robot systems.

Autonomous Driving Optical Flow Estimation

DiffPhyCon: A Generative Approach to Control Complex Physical Systems

1 code implementation9 Jul 2024 Long Wei, Peiyan Hu, Ruiqi Feng, Haodong Feng, Yixuan Du, Tao Zhang, Rui Wang, Yue Wang, Zhi-Ming Ma, Tailin Wu

In this work, we introduce Diffusion Physical systems Control (DiffPhyCon), a new class of method to address the physical systems control problem.

reinforcement-learning Reinforcement Learning

RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation

1 code implementation5 Jul 2024 Yuxuan Kuang, Junjie Ye, Haoran Geng, Jiageng Mao, Congyue Deng, Leonidas Guibas, He Wang, Yue Wang

First, RAM extracts unified affordance at scale from diverse sources of demonstrations including robotic data, human-object interaction (HOI) data, and custom data to construct a comprehensive affordance memory.

Human-Object Interaction Detection Retrieval

PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction

no code implementations1 Jul 2024 Xuan Yu, Yili Liu, Chenrui Han, Sitong Mao, Shunbo Zhou, Rong Xiong, Yiyi Liao, Yue Wang

We tackle both challenges by propagating partial labels with the aid of dense generalized features and building a 3D instance graph for associating 2D instance IDs.

3D Panoptic Segmentation Instance Segmentation +4

TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets

1 code implementation30 Jun 2024 Jintai Chen, Yaojun Hu, Yue Wang, Yingzhou Lu, Xu Cao, Miao Lin, Hongxia Xu, Jian Wu, Cao Xiao, Jimeng Sun, Lucas Glass, Kexin Huang, Marinka Zitnik, Tianfan Fu

Clinical trials are pivotal for developing new medical treatments, yet they typically pose some risks such as patient mortality, adverse events, and enrollment failure that waste immense efforts spanning over a decade.

Model-Free Robust Reinforcement Learning with Sample Complexity Analysis

no code implementations24 Jun 2024 Yudan Wang, Shaofeng Zou, Yue Wang

We develop algorithms for uncertainty sets defined by total variation, Chi-square divergence, and KL divergence, and provide finite sample analyses under all three cases.

reinforcement-learning Reinforcement Learning

Fantastic Copyrighted Beasts and How (Not) to Generate Them

1 code implementation20 Jun 2024 Luxi He, Yangsibo Huang, Weijia Shi, Tinghao Xie, Haotian Liu, Yue Wang, Luke Zettlemoyer, Chiyuan Zhang, Danqi Chen, Peter Henderson

Our evaluation systematically shows that both image and video generation models can still generate characters even if characters' names are not explicitly mentioned in the prompt, sometimes with only two generic keywords (e. g., prompting with "videogame, plumber" consistently generates Nintendo's Mario character).

Image Generation Video Generation

WENDY: Covariance Dynamics Based Gene Regulatory Network Inference

1 code implementation17 Jun 2024 Yue Wang, Peng Zheng, Yu-Chen Cheng, Zikun Wang, Aleksandr Aravkin

Determining gene regulatory network (GRN) structure is a central problem in biology, with a variety of inference methods available for different types of data.

DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features

no code implementations17 Jun 2024 Letian Wang, Seung Wook Kim, Jiawei Yang, Cunjun Yu, Boris Ivanovic, Steven L. Waslander, Yue Wang, Sanja Fidler, Marco Pavone, Peter Karkus

Our method is a generalizable feedforward model that predicts a rich neural scene representation from sparse, single-frame multi-view camera inputs with limited view overlap, and is trained self-supervised with differentiable rendering to reconstruct RGB, depth, or feature images.

3D geometry 3D Semantic Occupancy Prediction +5

An Effective Software Risk Prediction Management Analysis of Data Using Machine Learning and Data Mining Method

no code implementations13 Jun 2024 Jinxin Xu, Yue Wang, Ruisi Li, Ziyue Wang, Qian Zhao

The results of our experiments show that, when compared to other current methods, our integrative fuzzy techniques may perform more accurately and effectively in the evaluation of software project risks.

Management

Mmm whatcha say? Uncovering distal and proximal context effects in first and second-language word perception using psychophysical reverse correlation

1 code implementation8 Jun 2024 Paige Tuttösí, H. Henny Yeung, Yue Wang, Fenqi Wang, Guillaume Denis, Jean-Julien Aucouturier, Angelica Lim

Acoustic context effects, where surrounding changes in pitch, rate or timbre influence the perception of a sound, are well documented in speech perception, but how they interact with language background remains unclear.

A novel robust meta-analysis model using the $t$ distribution for outlier accommodation and detection

no code implementations6 Jun 2024 Yue Wang, Jianhua Zhao, Fen Jiang, Lei Shi, Jianxin Pan

Although robust modeling using the $t$ distribution is an appealing idea, the existing work, that explores the use of the $t$ distribution only for random effects, involves complicated numerical integration and numerical optimization.

Numerical Integration

Why is "Problems" Predictive of Positive Sentiment? A Case Study of Explaining Unintuitive Features in Sentiment Classification

no code implementations5 Jun 2024 Jiaming Qu, Jaime Arguello, Yue Wang

We propose approaches for (1) automatically detecting associations that can appear unintuitive to users and (2) generating explanations to help users understand why an unintuitive feature is predictive.

Sentiment Analysis Sentiment Classification

Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation

no code implementations3 Jun 2024 Yudan Wang, Yue Wang, Yi Zhou, Shaofeng Zou

Specifically, existing studies show that AC converges to an $\epsilon+\varepsilon_{\text{critic}}$ neighborhood of stationary points with the best known sample complexity of $\mathcal{O}(\epsilon^{-2})$ (up to a log factor), and NAC converges to an $\epsilon+\varepsilon_{\text{critic}}+\sqrt{\varepsilon_{\text{actor}}}$ neighborhood of the global optimum with the best known sample complexity of $\mathcal{O}(\epsilon^{-3})$, where $\varepsilon_{\text{critic}}$ is the approximation error of the critic and $\varepsilon_{\text{actor}}$ is the approximation error induced by the insufficient expressive power of the parameterized policy class.

Yuan 2.0-M32: Mixture of Experts with Attention Router

1 code implementation28 May 2024 Shaohua Wu, Jiangang Luo, Xi Chen, Lingjun Li, Xudong Zhao, Tong Yu, Chao Wang, Yue Wang, Fei Wang, Weixu Qiao, Houbo He, Zeru Zhang, Zeyu Sun, Junxiong Mao, Chong Shen

Yuan 2. 0-M32, with a similar base architecture as Yuan-2. 0 2B, uses a mixture-of-experts architecture with 32 experts of which 2 experts are active.

ARC Math

Reliable Object Tracking by Multimodal Hybrid Feature Extraction and Transformer-Based Fusion

1 code implementation28 May 2024 Hongze Sun, Rui Liu, Wuque Cai, Jun Wang, Yue Wang, Huajin Tang, Yan Cui, Dezhong Yao, Daqing Guo

In this study, we propose a novel multimodal hybrid tracker (MMHT) that utilizes frame-event-based data for reliable single object tracking.

Object Visual Object Tracking

Memorize What Matters: Emergent Scene Decomposition from Multitraverse

1 code implementation27 May 2024 Yiming Li, Zehong Wang, Yue Wang, Zhiding Yu, Zan Gojcic, Marco Pavone, Chen Feng, Jose M. Alvarez

Humans naturally retain memories of permanent elements, while ephemeral moments often slip through the cracks of memory.

3D Reconstruction Neural Rendering +2

HC-GAE: The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation Learning

no code implementations23 May 2024 Zhuo Xu, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Edwin R. Hancock

To this end, during the encoding process, we commence by utilizing the hard node assignment to decompose a sample graph into a family of separated subgraphs.

Decoder Graph Classification +2

A K-means Algorithm for Financial Market Risk Forecasting

no code implementations21 May 2024 Jinxin Xu, Kaixian Xu, Yue Wang, Qinyan Shen, Ruisi Li

In today's society, there are problems of high error rate and low precision in financial market risk prediction, which greatly affect the accuracy of financial market risk prediction.

ENADPool: The Edge-Node Attention-based Differentiable Pooling for Graph Neural Networks

no code implementations16 May 2024 Zhehan Zhao, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Lixiang Xu, Edwin R. Hancock

In this paper, we propose a new hierarchical pooling operation, namely the Edge-Node Attention-based Differentiable Pooling (ENADPool), for GNNs to learn effective graph representations.

Graph Classification

OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning

1 code implementation9 May 2024 Dan Qiao, Yi Su, Pinzheng Wang, Jing Ye, Wenjing Xie, Yuechi Zhou, Yuyang Ding, Zecheng Tang, Jikai Wang, Yixin Ji, Yue Wang, Pei Guo, Zechen Sun, Zikang Zhang, Juntao Li, Pingfu Chao, Wenliang Chen, Guohong Fu, Guodong Zhou, Qiaoming Zhu, Min Zhang

Large Language Models (LLMs) have played an important role in many fields due to their powerful capabilities. However, their massive number of parameters leads to high deployment requirements and incurs significant inference costs, which impedes their practical applications.

Common Sense Reasoning named-entity-recognition +2

Language-Image Models with 3D Understanding

no code implementations6 May 2024 Jang Hyun Cho, Boris Ivanovic, Yulong Cao, Edward Schmerling, Yue Wang, Xinshuo Weng, Boyi Li, Yurong You, Philipp Krähenbühl, Yan Wang, Marco Pavone

Our experiments on outdoor benchmarks demonstrate that Cube-LLM significantly outperforms existing baselines by 21. 3 points of AP-BEV on the Talk2Car dataset for 3D grounded reasoning and 17. 7 points on the DriveLM dataset for complex reasoning about driving scenarios, respectively.

Question Answering Visual Question Answering

$ν$-DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction

no code implementations29 Apr 2024 Yunxuan Mao, Bingqi Shen, Yifei Yang, Kai Wang, Rong Xiong, Yiyi Liao, Yue Wang

The joint optimization of the sensor trajectory and 3D map is a crucial characteristic of bundle adjustment (BA), essential for autonomous driving.

Autonomous Driving Novel View Synthesis +2

Forecasting the Future with Future Technologies: Advancements in Large Meteorological Models

no code implementations10 Apr 2024 Hailong Shu, Yue Wang, Weiwei Song, Huichuang Guo, Zhen Song

The field of meteorological forecasting has undergone a significant transformation with the integration of large models, especially those employing deep learning techniques.

Model Optimization

Learning 3D-Aware GANs from Unposed Images with Template Feature Field

no code implementations8 Apr 2024 Xinya Chen, Hanlei Guo, Yanrui Bin, Shangzhan Zhang, Yuanbo Yang, Yue Wang, Yujun Shen, Yiyi Liao

Collecting accurate camera poses of training images has been shown to well serve the learning of 3D-aware generative adversarial networks (GANs) yet can be quite expensive in practice.

Pose Estimation

Towards Realistic Scene Generation with LiDAR Diffusion Models

1 code implementation CVPR 2024 Haoxi Ran, Vitor Guizilini, Yue Wang

In this paper, we propose LiDAR Diffusion Models (LiDMs) to generate LiDAR-realistic scenes from a latent space tailored to capture the realism of LiDAR scenes by incorporating geometric priors into the learning pipeline.

3D geometry Image Generation +1

Distributed Swarm Learning for Edge Internet of Things

no code implementations29 Mar 2024 Yue Wang, Zhi Tian, FXin Fan, Zhipeng Cai, Cameron Nowzari, Kai Zeng

The rapid growth of Internet of Things (IoT) has led to the widespread deployment of smart IoT devices at wireless edge for collaborative machine learning tasks, ushering in a new era of edge learning.

Security Risks Concerns of Generative AI in the IoT

no code implementations29 Mar 2024 Honghui Xu, Yingshu Li, Olusesi Balogun, Shaoen Wu, Yue Wang, Zhipeng Cai

In an era where the Internet of Things (IoT) intersects increasingly with generative Artificial Intelligence (AI), this article scrutinizes the emergent security risks inherent in this integration.

SSHPool: The Separated Subgraph-based Hierarchical Pooling

no code implementations24 Mar 2024 Zhuo Xu, Lixin Cui, Ming Li, Yue Wang, Ziyu Lyu, Hangyuan Du, Lu Bai, Philip S. Yu, Edwin R. Hancock

We commence by assigning the nodes of a sample graph into different clusters, resulting in a family of separated subgraphs.

Graph Classification

PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors

1 code implementation14 Mar 2024 Tianyuan Yuan, Yucheng Mao, Jiawei Yang, Yicheng Liu, Yue Wang, Hang Zhao

Autonomous vehicles rely extensively on perception systems to navigate and interpret their surroundings.

Autonomous Driving Navigate

BEV$^2$PR: BEV-Enhanced Visual Place Recognition with Structural Cues

1 code implementation11 Mar 2024 Fudong Ge, Yiwei Zhang, Shuhan Shen, Yue Wang, Weiming Hu, Jin Gao

2) The lower layers of the pre-trained backbone from BEV generation are shared for visual and structural streams in VPR, facilitating the learning of fine-grained local features in the visual stream.

Visual Place Recognition

Yi: Open Foundation Models by 01.AI

1 code implementation7 Mar 2024 01. AI, :, Alex Young, Bei Chen, Chao Li, Chengen Huang, Ge Zhang, Guanwei Zhang, Heng Li, Jiangcheng Zhu, Jianqun Chen, Jing Chang, Kaidong Yu, Peng Liu, Qiang Liu, Shawn Yue, Senbin Yang, Shiming Yang, Tao Yu, Wen Xie, Wenhao Huang, Xiaohui Hu, Xiaoyi Ren, Xinyao Niu, Pengcheng Nie, Yuchi Xu, Yudong Liu, Yue Wang, Yuxuan Cai, Zhenyu Gu, Zhiyuan Liu, Zonghong Dai

The Yi model family is based on 6B and 34B pretrained language models, then we extend them to chat models, 200K long context models, depth-upscaled models, and vision-language models.

Attribute Chatbot +3

Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey

2 code implementations3 Mar 2024 Qizhi Pei, Lijun Wu, Kaiyuan Gao, Jinhua Zhu, Yue Wang, Zun Wang, Tao Qin, Rui Yan

The integration of biomolecular modeling with natural language (BL) has emerged as a promising interdisciplinary area at the intersection of artificial intelligence, chemistry and biology.

Property Prediction

Identify Critical Nodes in Complex Network with Large Language Models

no code implementations1 Mar 2024 Jinzhu Mao, Dongyun Zou, Li Sheng, Siyi Liu, Chen Gao, Yue Wang, Yong Li

Identifying critical nodes in networks is a classical decision-making task, and many methods struggle to strike a balance between adaptability and utility.

Decision Making Diversity

Explicit Interaction for Fusion-Based Place Recognition

2 code implementations27 Feb 2024 Jingyi Xu, Junyi Ma, Qi Wu, Zijie Zhou, Yue Wang, Xieyuanli Chen, Ling Pei

Fusion-based place recognition is an emerging technique jointly utilizing multi-modal perception data, to recognize previously visited places in GPS-denied scenarios for robots and autonomous vehicles.

Autonomous Vehicles

Parallelized Spatiotemporal Binding

no code implementations26 Feb 2024 Gautam Singh, Yue Wang, Jiawei Yang, Boris Ivanovic, Sungjin Ahn, Marco Pavone, Tong Che

While modern best practices advocate for scalable architectures that support long-range interactions, object-centric models are yet to fully embrace these architectures.

Decoder Object

Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior

1 code implementation23 Feb 2024 Kechun Xu, Zhongxiang Zhou, Jun Wu, Haojian Lu, Rong Xiong, Yue Wang

For the outer loop, we learn a grasp policy aware of object matching and grasp capability guided by task-level rewards.

Object

Social Physics Informed Diffusion Model for Crowd Simulation

1 code implementation8 Feb 2024 Hongyi Chen, Jingtao Ding, Yong Li, Yue Wang, Xiao-Ping Zhang

In this paper, we propose a social physics-informed diffusion model named SPDiff to mitigate the above gap.

Denoising Physics-informed machine learning

Driving Everywhere with Large Language Model Policy Adaptation

no code implementations CVPR 2024 Boyi Li, Yue Wang, Jiageng Mao, Boris Ivanovic, Sushant Veer, Karen Leung, Marco Pavone

Adapting driving behavior to new environments, customs, and laws is a long-standing problem in autonomous driving, precluding the widespread deployment of autonomous vehicles (AVs).

Autonomous Driving Language Modelling +2

RA-Rec: An Efficient ID Representation Alignment Framework for LLM-based Recommendation

no code implementations7 Feb 2024 Xiaohan Yu, Li Zhang, Xin Zhao, Yue Wang, Zhongrui Ma

To address this limitation, we propose a new paradigm, ID representation, which incorporates pre-trained ID embeddings into LLMs in a complementary manner.

Recommendation Systems

Denoising Vision Transformers

1 code implementation5 Jan 2024 Jiawei Yang, Katie Z Luo, Jiefeng Li, Congyue Deng, Leonidas Guibas, Dilip Krishnan, Kilian Q Weinberger, Yonglong Tian, Yue Wang

In the second stage, we train a lightweight transformer block to predict clean features from raw ViT outputs, leveraging the derived estimates of the clean features as supervision.

Denoising Depth Estimation +3

PARA-Drive: Parallelized Architecture for Real-time Autonomous Driving

no code implementations CVPR 2024 Xinshuo Weng, Boris Ivanovic, Yan Wang, Yue Wang, Marco Pavone

Recent works have proposed end-to-end autonomous vehicle (AV) architectures comprised of differentiable modules achieving state-of-the-art driving performance.

Autonomous Driving

GenoCraft: A Comprehensive, User-Friendly Web-Based Platform for High-Throughput Omics Data Analysis and Visualization

1 code implementation21 Dec 2023 Yingzhou Lu, Minjie Shen, Ling Yue, Chenhao Li, Lulu Chen, Fan Meng, Xiao Wang, David Herrington, Yue Wang, Yue Zhao, Tianfan Fu, Capucine van Rechem

With GenoCraft, researchers and data scientists have access to an array of cutting-edge bioinformatics tools under a user-friendly interface, making it a valuable resource for managing and analyzing large-scale omics data.

EDA: Evolving and Distinct Anchors for Multimodal Motion Prediction

1 code implementation15 Dec 2023 Longzhong Lin, Xuewu Lin, Tianwei Lin, Lichao Huang, Rong Xiong, Yue Wang

Motion prediction is a crucial task in autonomous driving, and one of its major challenges lands in the multimodality of future behaviors.

Autonomous Driving motion prediction +1

Better Neural PDE Solvers Through Data-Free Mesh Movers

2 code implementations9 Dec 2023 Peiyan Hu, Yue Wang, Zhi-Ming Ma

Based on DMM, to efficiently and accurately model dynamic systems, we develop a moving mesh based neural PDE solver (MM-PDE) that embeds the moving mesh with a two-branch architecture and a learnable interpolation framework to preserve information within the data.

Rethinking Directional Integration in Neural Radiance Fields

no code implementations28 Nov 2023 Congyue Deng, Jiawei Yang, Leonidas Guibas, Yue Wang

To that end, we introduce a modification to the NeRF rendering equation which is as simple as a few lines of code change for any NeRF variations, while greatly improving the rendering quality of view-dependent effects.

3D Reconstruction Decoder +3

Enhancing Few-shot CLIP with Semantic-Aware Fine-Tuning

no code implementations8 Nov 2023 Yao Zhu, Yuefeng Chen, Wei Wang, Xiaofeng Mao, Xiu Yan, Yue Wang, Zhigang Li, Wang Lu, Jindong Wang, Xiangyang Ji

Hence, we propose fine-tuning the parameters of the attention pooling layer during the training process to encourage the model to focus on task-specific semantics.

Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps

1 code implementation7 Nov 2023 Katie Z Luo, Xinshuo Weng, Yan Wang, Shuang Wu, Jie Li, Kilian Q Weinberger, Yue Wang, Marco Pavone

We propose a novel framework to integrate SD maps into online map prediction and propose a Transformer-based encoder, SD Map Encoder Representations from transFormers, to leverage priors in SD maps for the lane-topology prediction task.

Autonomous Driving Lane Detection

DORec: Decomposed Object Reconstruction and Segmentation Utilizing 2D Self-Supervised Features

no code implementations17 Oct 2023 Jun Wu, Sicheng Li, Sihui Ji, Yifei Yang, Yue Wang, Rong Xiong, Yiyi Liao

Recovering 3D geometry and textures of individual objects is crucial for many robotics applications, such as manipulation, pose estimation, and autonomous driving.

3D geometry Autonomous Driving +4

G-SPEED: General SParse Efficient Editing MoDel

1 code implementation16 Oct 2023 Haoke Zhang, Yue Wang, Juntao Li, Xiabing Zhou, Min Zhang

Large Language Models~(LLMs) have demonstrated incredible capabilities in understanding, generating, and manipulating languages.

GPT-Driver: Learning to Drive with GPT

1 code implementation2 Oct 2023 Jiageng Mao, Yuxi Qian, Junjie Ye, Hang Zhao, Yue Wang

In this paper, we propose a novel approach to motion planning that capitalizes on the strong reasoning capabilities and generalization potential inherent to Large Language Models (LLMs).

Autonomous Driving Decision Making +2

Sparsity-Based Channel Estimation Exploiting Deep Unrolling for Downlink Massive MIMO

no code implementations24 Sep 2023 An Chen, Wenbo Xu, Liyang Lu, Yue Wang

Massive multiple-input multiple-output (MIMO) enjoys great advantage in 5G wireless communication systems owing to its spectrum and energy efficiency.

Compressive Sensing

RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair

no code implementations12 Sep 2023 Weishi Wang, Yue Wang, Shafiq Joty, Steven C. H. Hoi

Automatic program repair (APR) is crucial to reduce manual debugging efforts for developers and improve software reliability.

Language Modelling Program Repair +1

RGAT: A Deeper Look into Syntactic Dependency Information for Coreference Resolution

no code implementations10 Sep 2023 Yuan Meng, Xuhao Pan, Jun Chang, Yue Wang

Our experiments on a public Gendered Ambiguous Pronouns (GAP) dataset show that with the supervision learning of the syntactic dependency graph and without fine-tuning the entire BERT, we increased the F1-score of the previous best model (RGCN-with-BERT) from 80. 3% to 82. 5%, compared to the F1-score by single BERT embeddings from 78. 5% to 82. 5%.

coreference-resolution Graph Attention

Toward High Quality Facial Representation Learning

1 code implementation7 Sep 2023 Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Liang Liu, Yabiao Wang, Chengjie Wang

To improve the facial representation quality, we use feature map of a pre-trained visual backbone as a supervision item and use a partially pre-trained decoder for mask image modeling.

Contrastive Learning Decoder +3

Harnessing the Power of David against Goliath: Exploring Instruction Data Generation without Using Closed-Source Models

no code implementations24 Aug 2023 Yue Wang, Xinrui Wang, Juntao Li, Jinxiong Chang, Qishen Zhang, Zhongyi Liu, Guannan Zhang, Min Zhang

Instruction tuning is instrumental in enabling Large Language Models~(LLMs) to follow user instructions to complete various open-domain tasks.

StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map Construction

1 code implementation24 Aug 2023 Tianyuan Yuan, Yicheng Liu, Yue Wang, Yilun Wang, Hang Zhao

This approach limits their stability and performance in complex scenarios such as occlusions, largely due to the absence of temporal information.

Autonomous Driving

Order-of-mutation effects on cancer progression: models for myeloproliferative neoplasm

no code implementations19 Aug 2023 Yue Wang, Blerta Shtylla, Tom Chou

In some patients with myeloproliferative neoplasms, two genetic mutations can be found, JAK2 V617F and TET2.

Exploiting Point-Wise Attention in 6D Object Pose Estimation Based on Bidirectional Prediction

no code implementations16 Aug 2023 Yuhao Yang, Jun Wu, Yue Wang, Guangjian Zhang, Rong Xiong

Traditional geometric registration based estimation methods only exploit the CAD model implicitly, which leads to their dependence on observation quality and deficiency to occlusion.

6D Pose Estimation using RGB

Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples

1 code implementation27 Jul 2023 Peng Li, Yeye He, Cong Yan, Yue Wang, Surajit Chaudhuri

Relational tables, where each row corresponds to an entity and each column corresponds to an attribute, have been the standard for tables in relational databases.

Attribute

Joint Radio Frequency Fingerprints Identification via Multi-antenna Receiver

no code implementations11 Jul 2023 Xiaofang Chen, Wenbo Xu, Yue Wang

When the number is small, the Mutual Information Weighting Scheme (MIWS) is developed by calculating the weighted voting of RFFI result at each antenna; when the number is moderate, the Distortions Filtering Scheme (DFS) is developed by filtering out the channel noise and receiver distortions; when the number is large enough, the Group-Distortions Filtering and Weighting Scheme (GDFWS) is developed, which integrates the advantages of MIWS and DFS.

Privately generating tabular data using language models

1 code implementation7 Jun 2023 Alexandre Sablayrolles, Yue Wang, Brian Karrer

Privately generating synthetic data from a table is an important brick of a privacy-first world.

Language Modelling Sentence

MathChat: Converse to Tackle Challenging Math Problems with LLM Agents

3 code implementations2 Jun 2023 Yiran Wu, Feiran Jia, Shaokun Zhang, Hangyu Li, Erkang Zhu, Yue Wang, Yin Tat Lee, Richard Peng, Qingyun Wu, Chi Wang

Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields.

Elementary Mathematics Math +1

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

1 code implementation31 May 2023 Nghi D. Q. Bui, Hung Le, Yue Wang, Junnan Li, Akhilesh Deepak Gotmare, Steven C. H. Hoi

In this paper, we present CodeTF, an open-source Transformer-based library for state-of-the-art Code LLMs and code intelligence.

Leveraging BEV Representation for 360-degree Visual Place Recognition

1 code implementation23 May 2023 Xuecheng Xu, Yanmei Jiao, Sha Lu, Xiaqing Ding, Rong Xiong, Yue Wang

In addition, the image and point cloud cues can be easily stated in the same coordinates, which benefits sensor fusion for place recognition.

Sensor Fusion Visual Place Recognition

'Tax-free' 3DMM Conditional Face Generation

no code implementations22 May 2023 Yiwen Huang, Zhiqiu Yu, Xinjie Yi, Yue Wang, James Tompkin

This results in a new model that effectively removes the quality tax between 3DMM conditioned face GANs and the unconditional StyleGAN.

Face Generation

Achieving the Asymptotically Optimal Sample Complexity of Offline Reinforcement Learning: A DRO-Based Approach

no code implementations22 May 2023 Yue Wang, JinJun Xiong, Shaofeng Zou

We show that an improved sample complexity of $\mathcal{O}(SC^{\pi^*}\epsilon^{-2}(1-\gamma)^{-3})$ can be obtained, which asymptotically matches with the minimax lower bound for offline reinforcement learning, and thus is asymptotically minimax optimal.

reinforcement-learning

Discounted Thompson Sampling for Non-Stationary Bandit Problems

no code implementations18 May 2023 Han Qi, Yue Wang, Li Zhu

Under mild assumptions, we show that DS-TS with Gaussian priors can achieve nearly optimal regret bound on the order of $\tilde{O}(\sqrt{TB_T})$ for abruptly changing and $\tilde{O}(T^{\beta})$ for smoothly changing, where $T$ is the number of time steps, $B_T$ is the number of breakpoints, $\beta$ is associated with the smoothly changing environment and $\tilde{O}$ hides the parameters independent of $T$ as well as logarithmic terms.

Thompson Sampling

Model-Free Robust Average-Reward Reinforcement Learning

no code implementations17 May 2023 Yue Wang, Alvaro Velasquez, George Atia, Ashley Prater-Bennette, Shaofeng Zou

Robust Markov decision processes (MDPs) address the challenge of model uncertainty by optimizing the worst-case performance over an uncertainty set of MDPs.

Q-Learning reinforcement-learning +1

GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training

1 code implementation CVPR 2023 Xiaoyu Tian, Haoxi Ran, Yue Wang, Hang Zhao

This paper tries to address a fundamental question in point cloud self-supervised learning: what is a good signal we should leverage to learn features from point clouds without annotations?

Decoder Multi-Object Tracking +4

CodeT5+: Open Code Large Language Models for Code Understanding and Generation

2 code implementations13 May 2023 Yue Wang, Hung Le, Akhilesh Deepak Gotmare, Nghi D. Q. Bui, Junnan Li, Steven C. H. Hoi

To address these limitations, we propose ``CodeT5+'', a family of encoder-decoder LLMs for code in which component modules can be flexibly combined to suit a wide range of downstream code tasks.

Arithmetic Reasoning Code Completion +6

On Uni-Modal Feature Learning in Supervised Multi-Modal Learning

1 code implementation2 May 2023 Chenzhuang Du, Jiaye Teng, Tingle Li, Yichen Liu, Tianyuan Yuan, Yue Wang, Yang Yuan, Hang Zhao

We abstract the features (i. e. learned representations) of multi-modal data into 1) uni-modal features, which can be learned from uni-modal training, and 2) paired features, which can only be learned from cross-modal interactions.

Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving

1 code implementation NeurIPS 2023 Xiaoyu Tian, Tao Jiang, Longfei Yun, Yucheng Mao, Huitong Yang, Yue Wang, Yilun Wang, Hang Zhao

3D occupancy prediction, which estimates the detailed occupancy states and semantics of a scene, is an emerging task to overcome these limitations.

3D geometry Autonomous Driving

How to Control Hydrodynamic Force on Fluidic Pinball via Deep Reinforcement Learning

1 code implementation23 Apr 2023 Haodong Feng, Yue Wang, Hui Xiang, Zhiyang Jin, Dixia Fan

The finding from this work can control hydrodynamic force on the operation of fluidic pinball system and potentially pave the way for exploring efficient active flow control strategies in other complex fluid dynamic problems.

Decision Making Deep Reinforcement Learning +3

Neural Map Prior for Autonomous Driving

no code implementations CVPR 2023 Xuan Xiong, Yicheng Liu, Tianyuan Yuan, Yue Wang, Yilun Wang, Hang Zhao

To the best of our knowledge, this is the first learning-based system for creating a global map prior.

Autonomous Driving Navigate

SPIRiT-Diffusion: Self-Consistency Driven Diffusion Model for Accelerated MRI

no code implementations11 Apr 2023 Zhuo-Xu Cui, Chentao Cao, Yue Wang, Sen Jia, Jing Cheng, Xin Liu, Hairong Zheng, Dong Liang, Yanjie Zhu

To overcome this challenge, we introduce a novel approach called SPIRiT-Diffusion, which is a diffusion model for k-space interpolation inspired by the iterative self-consistent SPIRiT method.

Image Generation MRI Reconstruction

How Does Imperfect Automatic Indexing Affect Semantic Search Performance?

no code implementations8 Apr 2023 Mengtian Guo, David Gotz, Yue Wang

In this work, we aim to understand the performance impact of using imperfectly assigned terms in Boolean semantic searches.

Simplifying Low-Light Image Enhancement Networks with Relative Loss Functions

1 code implementation6 Apr 2023 Yu Zhang, Xiaoguang Di, Junde Wu, Rao Fu, Yong Li, Yue Wang, Yanwu Xu, Guohui YANG, Chunhui Wang

In this paper, to make the learning easier in low-light image enhancement, we introduce FLW-Net (Fast and LightWeight Network) and two relative loss functions.

Low-Light Image Enhancement

Object-centric Inference for Language Conditioned Placement: A Foundation Model based Approach

no code implementations6 Apr 2023 Zhixuan Xu, Kechun Xu, Yue Wang, Rong Xiong

We focus on the task of language-conditioned object placement, in which a robot should generate placements that satisfy all the spatial relational constraints in language instructions.

Object

Optimal Smoothing Distribution Exploration for Backdoor Neutralization in Deep Learning-based Traffic Systems

no code implementations24 Mar 2023 Yue Wang, Wending Li, Michail Maniatakos, Saif Eddin Jabari

The effectiveness of the proposed method is verified on a simulated traffic system based on a microscopic traffic simulator, where experimental results showcase that the smoothed traffic controller can neutralize all trigger samples and maintain the performance of relieving traffic congestion

Autonomous Vehicles Deep Reinforcement Learning +1

Physical Backdoor Trigger Activation of Autonomous Vehicle using Reachability Analysis

no code implementations24 Mar 2023 Wenqing Li, Yue Wang, Muhammad Shafique, Saif Eddin Jabari

Recent studies reveal that Autonomous Vehicles (AVs) can be manipulated by hidden backdoors, causing them to perform harmful actions when activated by physical triggers.

Autonomous Vehicles

UrbanGIRAFFE: Representing Urban Scenes as Compositional Generative Neural Feature Fields

no code implementations ICCV 2023 Yuanbo Yang, Yifei Yang, Hanlei Guo, Rong Xiong, Yue Wang, Yiyi Liao

Generating photorealistic images with controllable camera pose and scene contents is essential for many applications including AR/VR and simulation.

3D-Aware Image Synthesis Object

GOOD: General Optimization-based Fusion for 3D Object Detection via LiDAR-Camera Object Candidates

no code implementations17 Mar 2023 Bingqi Shen, Shuwei Dai, Yuyin Chen, Rong Xiong, Yue Wang, Yanmei Jiao

In this paper, we propose GOOD, a general optimization-based fusion framework that can achieve satisfying detection without training additional models and is available for any combinations of 2D and 3D detectors to improve the accuracy and robustness of 3D detection.

3D Object Detection Autonomous Driving +2

FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization

2 code implementations CVPR 2023 Jiawei Yang, Marco Pavone, Yue Wang

One is to regularize the frequency range of NeRF's inputs, while the other is to penalize the near-camera density fields.

Neural Rendering Novel View Synthesis