Search Results for author: Ming Xu

Found 48 papers, 22 papers with code

MatPilot: an LLM-enabled AI Materials Scientist under the Framework of Human-Machine Collaboration

no code implementations10 Nov 2024 Ziqi Ni, Yahao Li, Kaijia Hu, Kunyuan Han, Ming Xu, Xingyu Chen, Fengqi Liu, Yicong Ye, Shuxin Bai

The rapid evolution of artificial intelligence, particularly large language models, presents unprecedented opportunities for materials science research.

Can We Predict Performance of Large Models across Vision-Language Tasks?

1 code implementation14 Oct 2024 Qinyu Zhao, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould

In this study, we propose a new framework for predicting unknown performance scores based on observed ones from other LVLMs or tasks.

Matrix Completion

VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition

1 code implementation28 Sep 2024 Ahmad Khaliq, Ming Xu, Stephen Hausler, Michael Milford, Sourav Garg

This paper addresses these limitations by introducing VLAD-BuFF with two novel contributions: i) a self-similarity based feature discounting mechanism to learn Burst-aware features within end-to-end VPR training, and ii) Fast Feature aggregation by reducing local feature dimensions specifically through PCA-initialized learnable pre-projection.

Image Retrieval Visual Localization +1

GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution

no code implementations12 Sep 2024 Liang Feng, Ming Xu, Lihua Wen, Zhixuan Shen

Pose estimation is a crucial task in computer vision, with wide applications in autonomous driving, human motion capture, and virtual reality.

Autonomous Driving Pose Estimation

GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions

no code implementations12 Sep 2024 Liang Feng, Zhixuan Shen, Lihua Wen, Shiyao Li, Ming Xu

This paper introduces GateAttentionPose, an innovative approach that enhances the UniRepLKNet architecture for pose estimation tasks.

Autonomous Driving Computational Efficiency +1

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

2 code implementations22 Jun 2024 Terry Yue Zhuo, Minh Chien Vu, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, Simon Brunner, Chen Gong, Thong Hoang, Armel Randy Zebaze, Xiaoheng Hong, Wen-Ding Li, Jean Kaddour, Ming Xu, Zhihan Zhang, Prateek Yadav, Naman jain, Alex Gu, Zhoujun Cheng, Jiawei Liu, Qian Liu, Zijian Wang, David Lo, Binyuan Hui, Niklas Muennighoff, Daniel Fried, Xiaoning Du, Harm de Vries, Leandro von Werra

Fulfilling both of these characteristics can pose a great challenge for LLMs. To assess how well LLMs can solve challenging and practical tasks via programs, we introduce BigCodeBench, a benchmark that challenges LLMs to invoke multiple function calls as tools from 139 libraries and 7 domains for 1, 140 fine-grained tasks.

Benchmarking Code Generation

Mix-Domain Contrastive Learning for Unpaired H&E-to-IHC Stain Translation

1 code implementation17 Jun 2024 Song Wang, Zhong Zhang, Huan Yan, Ming Xu, Guanghui Wang

H&E-to-IHC stain translation techniques offer a promising solution for precise cancer diagnosis, especially in low-resource regions where there is a shortage of health professionals and limited access to expensive equipment.

Contrastive Learning Translation

Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension

1 code implementation21 May 2024 Runwei Guan, RuiXiao Zhang, Ningwei Ouyang, Jianan Liu, Ka Lok Man, Xiaohao Cai, Ming Xu, Jeremy Smith, Eng Gee Lim, Yutao Yue, Hui Xiong

Moreover, we propose a novel model, T-RadarNet, for 3D REC on point clouds, achieving State-Of-The-Art (SOTA) performance on the Talk2Radar dataset compared to counterparts.

3D visual grounding Referring Expression +1

Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation

1 code implementation CVPR 2024 Ming Xu, Stephen Gould

We evaluate our segmentation approach and unsupervised learning pipeline on the Breakfast, 50-Salads, YouTube Instructions and Desktop Assembly datasets, yielding state-of-the-art results for the unsupervised video action segmentation task.

Segmentation Unsupervised Action Segmentation

Study of the mechanism of electroacupuncture regulating ferroptosis, inhibiting bladder neck fibrosis, and improving bladder urination function after suprasacral spinal cord injury using proteomics

no code implementations11 Mar 2024 Jin-Can Liu, Li-Ya Tang, Xiao-Ying Sun, Qi-Rui Qu, Qiong Liu, Lu Zhou, Hong Zhang, Bruce Song, Ming Xu, Kun Ai

Purpose The aim of this study was to explore whether electroacupuncture regulates phenotypic transformation of smooth muscle cells by inhibiting ferroptosis and inhibiting fibrosis, thereby improving bladder urination function after suprasacral spinal cord injury (SSCI).

TAG

Molecule Design by Latent Prompt Transformer

no code implementations27 Feb 2024 Deqian Kong, Yuhao Huang, Jianwen Xie, Edouardo Honig, Ming Xu, Shuanghong Xue, Pei Lin, Sanping Zhou, Sheng Zhong, Nanning Zheng, Ying Nian Wu

We propose the Latent Prompt Transformer (LPT), a novel generative model comprising three components: (1) a latent vector with a learnable prior distribution modeled by a neural transformation of Gaussian white noise; (2) a molecule generation model based on a causal Transformer, which uses the latent vector as a prompt; and (3) a property prediction model that predicts a molecule's target properties and/or constraint values using the latent prompt.

Property Prediction

Advanced Unstructured Data Processing for ESG Reports: A Methodology for Structured Transformation and Enhanced Analysis

1 code implementation4 Jan 2024 Jiahui Peng, Jing Gao, Xin Tong, Jing Guo, Hang Yang, Jianchuan Qi, Ruiqiao Li, Nan Li, Ming Xu

In the evolving field of corporate sustainability, analyzing unstructured Environmental, Social, and Governance (ESG) reports is a complex challenge due to their varied formats and intricate content.

A clean-label graph backdoor attack method in node classification task

no code implementations30 Dec 2023 Xiaogang Xing, Ming Xu, Yujing Bai, Dongdong Yang

To explore the backdoor vulnerability of GNNs and create a more stealthy backdoor attack method, a clean-label graph backdoor attack method(CGBA) in the node classification task is proposed in this paper.

Backdoor Attack Node Classification

Empowering Working Memory for Large Language Model Agents

no code implementations22 Dec 2023 Jing Guo, Nan Li, Jianchuan Qi, Hang Yang, Ruiqiao Li, Yuzhen Feng, Si Zhang, Ming Xu

The limitations of traditional LLM memory designs are analyzed, including their isolation of distinct dialog episodes and lack of persistent memory links.

Language Modeling Language Modelling +2

Global Feature Pyramid Network

no code implementations18 Dec 2023 Weilin Xiao, Ming Xu, Yonggui Lin

The visual feature pyramid has proven its effectiveness and efficiency in target detection tasks.

object-detection Object Detection

GAME: Generalized deep learning model towards multimodal data integration for early screening of adolescent mental disorders

no code implementations18 Sep 2023 Zhicheng Du, Chenyao Jiang, Xi Yuan, Shiyao Zhai, Zhengyang Lei, Shuyue Ma, Yang Liu, Qihui Ye, Chufan Xiao, Qiming Huang, Ming Xu, Dongmei Yu, Peiwu Qin

The timely identification of mental disorders in adolescents is a global public health challenge. Single factor is difficult to detect the abnormality due to its complex and subtle nature.

Data Integration

Towards Understanding Gradient Approximation in Equality Constrained Deep Declarative Networks

no code implementations24 Jun 2023 Stephen Gould, Ming Xu, Zhiwei Xu, Yanbin Liu

We explore conditions for when the gradient of a deep declarative node can be approximated by ignoring constraint terms and still result in a descent direction for the global loss function.

Adaptive Frequency Green Light Optimal Speed Advisory based on Hybrid Actor-Critic Reinforcement Learning

no code implementations7 Jun 2023 Ming Xu, Dongyu Zuo

Green Light Optimal Speed Advisory (GLOSA) system suggests speeds to vehicles to assist them in passing through intersections during green intervals, thus reducing traffic congestion and fuel consumption by minimizing the number of stops and idle times at intersections.

MGL2Rank: Learning to Rank the Importance of Nodes in Road Networks Based on Multi-Graph Fusion

no code implementations20 May 2023 Ming Xu, Jing Zhang

The identification of important nodes with strong propagation capabilities in road networks is a vital topic in urban planning.

Diversity Graph Learning +1

TraffNet: Learning Causality of Traffic Generation for What-if Prediction

2 code implementations28 Mar 2023 Ming Xu, Qiang Ai, Ruimin Li, Yunyi Ma, Geqi Qi, Xiangfu Meng, Haibo Jin

Real-time what-if traffic prediction is crucial for decision making in intelligent traffic management and control.

Decision Making Management +1

Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths

1 code implementation19 Mar 2023 Ming Xu, Sourav Garg, Michael Milford, Stephen Gould

An interesting byproduct of this formulation is that DecDTW outputs the optimal warping path between two time series as opposed to a soft approximation, recoverable from Soft-DTW.

Dynamic Time Warping Information Retrieval +4

Cognitive Semantic Communication Systems Driven by Knowledge Graph: Principle, Implementation, and Performance Evaluation

no code implementations15 Mar 2023 Fuhui Zhou, Yihao Li, Ming Xu, Lu Yuan, Qihui Wu, Rose Qingyang Hu, Naofal Al-Dhahir

Extensive simulation results conducted on a public dataset demonstrate that our proposed single-user and multi-user cognitive semantic communication systems are superior to benchmark communication systems in terms of the data compression rate and communication reliability.

Data Compression Semantic Communication

Residual Skill Policies: Learning an Adaptable Skill-based Action Space for Reinforcement Learning for Robotics

1 code implementation4 Nov 2022 Krishan Rana, Ming Xu, Brendan Tidd, Michael Milford, Niko Sünderhauf

Furthermore, the downstream RL agent is limited to learning structurally similar tasks to those used to construct the skill space.

Reinforcement Learning (RL)

3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian Localization

1 code implementation22 Jul 2022 Rui Qiu, Ming Xu, Yuyao Yan, Jeremy S. Smith, Xi Yang

Although deep-learning based methods for monocular pedestrian detection have made great progress, they are still vulnerable to heavy occlusions.

Data Augmentation Multiview Detection +1

Improving Road Segmentation in Challenging Domains Using Similar Place Priors

no code implementations27 May 2022 Connor Malone, Sourav Garg, Ming Xu, Thierry Peynot, Michael Milford

These approaches share one or more of three significant limitations: a reliance on large amounts of annotated training data that can be costly to obtain, both anticipation of and training data from the type of environmental conditions expected at inference time, and/or imagery captured from a previous visit to the location.

Domain Adaptation Road Segmentation +3

Mixed-UNet: Refined Class Activation Mapping for Weakly-Supervised Semantic Segmentation with Multi-scale Inference

no code implementations6 May 2022 Yang Liu, Ersi Zhang, Lulu Xu, Chufan Xiao, Xiaoyun Zhong, Lijin Lian, Fang Li, Bin Jiang, Yuhan Dong, Lan Ma, Qiming Huang, Ming Xu, Yongbing Zhang, Dongmei Yu, Chenggang Yan, Peiwu Qin

Deep learning techniques have shown great potential in medical image processing, particularly through accurate and reliable image segmentation on magnetic resonance imaging (MRI) scans or computed tomography (CT) scans, which allow the localization and diagnosis of lesions.

Computed Tomography (CT) Image Segmentation +3

FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction

1 code implementation CVPR 2022 Liang Gao, Huazhu Fu, Li Li, YingWen Chen, Ming Xu, Cheng-Zhong Xu

Federated learning (FL) allows multiple clients to collectively train a high-performance global model without sharing their private data.

Federated Learning Image Classification

Sharing Behavior in Ride-hailing Trips: A Machine Learning Inference Approach

no code implementations30 Jan 2022 Morteza Taiebat, Elham Amini, Ming Xu

Using ensemble machine learning models, we find that the travel impedance variables (trip cost, distance, and duration) collectively contribute to 95% and 91% of the predictive power in determining whether a trip is requested to share and whether it is successfully shared, respectively.

BIG-bench Machine Learning

Interactive Model with Structural Loss for Language-based Abductive Reasoning

no code implementations1 Dec 2021 Linhao Li, Ming Xu, Yongfeng Dong, Xin Li, Ao Wang

Therefore, we propose to group instead of ranking the hypotheses and design a structural loss called ``joint softmax focal loss'' in this paper.

Language Modeling Language Modelling +1

Probabilistic Appearance-Invariant Topometric Localization with New Place Awareness

1 code implementation16 Jul 2021 Ming Xu, Tobias Fischer, Niko Sünderhauf, Michael Milford

Probabilistic state-estimation approaches offer a principled foundation for designing localization systems, because they naturally integrate sequences of imperfect motion and exteroceptive sensor data.

Loop Closure Detection Visual Place Recognition

Probabilistic Visual Place Recognition for Hierarchical Localization

1 code implementation7 May 2021 Ming Xu, Niko Sünderhauf, Michael Milford

In this letter, we propose two methods which adapt image retrieval techniques used for visual place recognition to the Bayesian state estimation formulation for localization.

Image Retrieval Retrieval +2

Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition

4 code implementations CVPR 2021 Stephen Hausler, Sourav Garg, Ming Xu, Michael Milford, Tobias Fischer

Visual Place Recognition is a challenging task for robotics and autonomous systems, which must deal with the twin problems of appearance and viewpoint change in an always changing world.

Computational Efficiency Visual Localization +1

EPINE: Enhanced Proximity Information Network Embedding

no code implementations4 Mar 2020 Luoyi Zhang, Ming Xu

Unsupervised homogeneous network embedding (NE) represents every vertex of networks into a low-dimensional vector and meanwhile preserves the network information.

Link Prediction Network Embedding +1

Deep Attentive Features for Prostate Segmentation in 3D Transrectal Ultrasound

1 code implementation3 Jul 2019 Yi Wang, Haoran Dou, Xiao-Wei Hu, Lei Zhu, Xin Yang, Ming Xu, Jing Qin, Pheng-Ann Heng, Tianfu Wang, Dong Ni

Our attention module utilizes the attention mechanism to selectively leverage the multilevel features integrated from different layers to refine the features at each individual layer, suppressing the non-prostate noise at shallow layers of the CNN and increasing more prostate details into features at deep layers.

Image Segmentation Medical Image Segmentation +2

Variance reduction properties of the reparameterization trick

no code implementations27 Sep 2018 Ming Xu, Matias Quiroz, Robert Kohn, Scott A. Sisson

From this, we show that the marginal variances of the reparameterization gradient estimator are smaller than those of the score function gradient estimator.

Variational Inference

Discovery of Important Crossroads in Road Network using Massive Taxi Trajectories

no code implementations9 Jul 2014 Ming Xu, Jianping Wu, Yiman Du, Haohan Wang, Geqi Qi, Kezhen Hu, Yun-Peng Xiao

However, none of existing approaches addresses the problem of identifying network-wide important crossroads in real road network.

Adaptive Mesh Representation and Restoration of Biomedical Images

no code implementations27 Jun 2014 Ke Liu, Ming Xu, Zeyun Yu

The triangulation of images has become an active research area in recent years for its compressive representation and ease of image processing and visualization.

Image Restoration

Cannot find the paper you are looking for? You can Submit a new open access paper.