Search Results for author: Xin Huang

Found 80 papers, 25 papers with code

Unseen Entity Handling in Complex Question Answering over Knowledge Base via Language Generation

no code implementations Findings (EMNLP) 2021 Xin Huang, Jung-jae Kim, Bowei Zou

Complex question answering over knowledge base remains as a challenging task because it involves reasoning over multiple pieces of information, including intermediate entities/relations and other constraints.

Computational Efficiency Question Answering +1

Photon-Efficient 3D Imaging with A Non-Local Neural Network

1 code implementation ECCV 2020 Jiayong Peng, Zhiwei Xiong, Xin Huang, Zheng-Ping Li, Dong Liu, Feihu Xu

Photon-efficient imaging has enabled a number of applications relying on single-photon sensors that can capture a 3D image with as few as one photon per pixel.

Entity-level Cross-modal Learning Improves Multi-modal Machine Translation

no code implementations Findings (EMNLP) 2021 Xin Huang, Jiajun Zhang, Chengqing Zong

Inspired by the findings of (CITATION) that entities are most informative in the image, we propose an explicit entity-level cross-modal learning approach that aims to augment the entity representation.

Machine Translation Representation Learning +1

Towards Personalized Evaluation of Large Language Models with An Anonymous Crowd-Sourcing Platform

no code implementations13 Mar 2024 Mingyue Cheng, Hao Zhang, Jiqian Yang, Qi Liu, Li Li, Xin Huang, Liwei Song, Zhi Li, Zhenya Huang, Enhong Chen

Through this gateway, users have the opportunity to submit their questions, testing the models on a personalized and potentially broader range of capabilities.

Language Modelling Large Language Model

Point cloud-based registration and image fusion between cardiac SPECT MPI and CTA

no code implementations10 Feb 2024 Shaojie Tang, Penpen Miao, Xingyu Gao, Yu Zhong, Dantong Zhu, Haixing Wen, Zhihui Xu, Qiuyue Wei, Hongping Yao, Xin Huang, Rui Gao, Chen Zhao, Weihua Zhou

Fourthly, we employed ICP, SICP or CPD algorithm to achieve a fine registration for the point clouds (together with the special points of APIGs) of the LV epicardial surfaces (LVERs) in SPECT and CTA images.

Anatomy

MT-HCCAR: Multi-Task Deep Learning with Hierarchical Classification and Attention-based Regression for Cloud Property Retrieval

1 code implementation29 Jan 2024 Xingyan Li, Andrew M. Sayer, Ian T. Carroll, Xin Huang, Jianwu Wang

In response, this paper introduces MT-HCCAR, an end-to-end deep learning model employing multi-task learning to simultaneously tackle cloud masking, cloud phase retrieval (classification tasks), and COT prediction (a regression task).

Classification Model Selection +3

Medical Image Debiasing by Learning Adaptive Agreement from a Biased Council

no code implementations22 Jan 2024 Luyang Luo, Xin Huang, Minghao Wang, Zhuoyue Wan, Hao Chen

Specifically, the debiasing model is required to learn adaptive agreement with the biased council by agreeing on the correctly predicted samples and disagreeing on the wrongly predicted samples by the biased council.

Attribute Image Classification +1

GBSS:a global building semantic segmentation dataset for large-scale remote sensing building extraction

no code implementations2 Jan 2024 Yuping Hu, Xin Huang, Jiayi Li, Zhen Zhang

Semantic segmentation techniques for extracting building footprints from high-resolution remote sensing images have been widely used in many fields such as urban planning.

Segmentation Semantic Segmentation +1

Early ChatGPT User Portrait through the Lens of Data

no code implementations10 Dec 2023 Yuyang Deng, Ni Zhao, Xin Huang

Since its launch, ChatGPT has achieved remarkable success as a versatile conversational AI platform, drawing millions of users worldwide and garnering widespread recognition across academic, industrial, and general communities.

HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance

1 code implementation30 Nov 2023 Zhuohao Yin, Xin Huang

Visual Word Sense Disambiguation (VWSD) is a multi-modal task that aims to select, among a batch of candidate images, the one that best entails the target word's meaning within a limited context.

Image Retrieval Retrieval +1

HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation

no code implementations2 Oct 2023 Xin Huang, Ruizhi Shao, Qi Zhang, Hongwen Zhang, Ying Feng, Yebin Liu, Qing Wang

The main idea is to enhance the model's 2D perception of 3D geometry by learning a normal-adapted diffusion model and a normal-aligned diffusion model.

Text to 3D Texture Synthesis

Inverting the Imaging Process by Learning an Implicit Camera Model

no code implementations CVPR 2023 Xin Huang, Qi Zhang, Ying Feng, Hongdong Li, Qing Wang

In principle, our new implicit neural camera model has the potential to benefit a wide array of other inverse imaging tasks.

A Retrospect to Multi-prompt Learning across Vision and Language

no code implementations ICCV 2023 Ziliang Chen, Xin Huang, Quanlong Guan, Liang Lin, Weiqi Luo

The vision community is undergoing the unprecedented progress with the emergence of Vision-Language Pretraining Models (VLMs).

Privileged Prior Information Distillation for Image Matting

no code implementations25 Nov 2022 Cheng Lyu, Jiake Xie, Bo Xu, Cheng Lu, Han Huang, Xin Huang, Ming Wu, Chuang Zhang, Yong Tang

Performance of trimap-free image matting methods is limited when trying to decouple the deterministic and undetermined regions, especially in the scenes where foregrounds are semantically ambiguous, chromaless, or high transmittance.

Image Matting

Characterizing the Efficiency of Graph Neural Network Frameworks with a Magnifying Glass

1 code implementation6 Nov 2022 Xin Huang, Jongryool Kim, Bradley Rees, Chul-Ho Lee

In particular, unlike the traditional GNNs that are trained based on the entire graph in a full-batch manner, recent GNNs have been developed with different graph sampling techniques for mini-batch training of GNNs on large graphs.

Graph Sampling

P4P: Conflict-Aware Motion Prediction for Planning in Autonomous Driving

no code implementations3 Nov 2022 Qiao Sun, Xin Huang, Brian C. Williams, Hang Zhao

Motion prediction is crucial in enabling safe motion planning for autonomous vehicles in interactive scenarios.

Autonomous Driving Motion Planning +2

VectorFlow: Combining Images and Vectors for Traffic Occupancy and Flow Prediction

no code implementations9 Aug 2022 Xin Huang, Xiaoyu Tian, Junru Gu, Qiao Sun, Hang Zhao

Recently, the occupancy flow fields representation was proposed to represent joint future states of road agents through a combination of occupancy grid and flow, which supports efficient and consistent joint predictions.

Autonomous Driving

UniNet: Unified Architecture Search with Convolution, Transformer, and MLP

2 code implementations12 Jul 2022 Jihao Liu, Xin Huang, Guanglu Song, Hongsheng Li, Yu Liu

Finally, we integrate configurable operators and DSMs into a unified search space and search with a Reinforcement Learning-based search algorithm to fully explore the optimal combination of the operators.

Image Classification Neural Architecture Search

MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers

1 code implementation CVPR 2023 Jihao Liu, Xin Huang, Jinliang Zheng, Yu Liu, Hongsheng Li

In this paper, we propose Mixed and Masked AutoEncoder (MixMAE), a simple but efficient pretraining method that is applicable to various hierarchical Vision Transformers.

Image Classification Object Detection +2

Pyramid-BERT: Reducing Complexity via Successive Core-set based Token Selection

no code implementations ACL 2022 Xin Huang, Ashish Khetan, Rene Bidart, Zohar Karnin

Transformer-based language models such as BERT have achieved the state-of-the-art performance on various NLP tasks, but are computationally prohibitive.

HDR-NeRF: High Dynamic Range Neural Radiance Fields

no code implementations CVPR 2022 Xin Huang, Qi Zhang, Ying Feng, Hongdong Li, Xuan Wang, Qing Wang

The key to our method is to model the physical imaging process, which dictates that the radiance of a scene point transforms to a pixel value in the LDR image with two implicit functions: a radiance field and a tone mapper.

Vocal Bursts Intensity Prediction

Trajectory Prediction with Linguistic Representations

no code implementations19 Oct 2021 Yen-Ling Kuo, Xin Huang, Andrei Barbu, Stephen G. McGill, Boris Katz, John J. Leonard, Guy Rosman

Language allows humans to build mental models that interpret what is happening around them resulting in more accurate long-term predictions.

Trajectory Prediction

TIP: Task-Informed Motion Prediction for Intelligent Vehicles

no code implementations17 Oct 2021 Xin Huang, Guy Rosman, Ashkan Jasour, Stephen G. McGill, John J. Leonard, Brian C. Williams

When predicting trajectories of road agents, motion predictors usually approximate the future distribution by a limited number of samples.

Autonomous Driving Decision Making +1

A Generic Knowledge Based Medical Diagnosis Expert System

no code implementations9 Oct 2021 Xin Huang, Xuejiao Tang, Wenbin Zhang, Shichao Pei, Ji Zhang, Mingli Zhang, Zhen Liu, Ruijun Chen, Yiyi Huang

The proposed disease diagnosis system also uses a graphical user interface (GUI) to facilitate users to interact with the expert system.

Medical Diagnosis

UniNet: Unified Architecture Search with Convolution, Transformer, and MLP

no code implementations8 Oct 2021 Jihao Liu, Hongsheng Li, Guanglu Song, Xin Huang, Yu Liu

Recently, transformer and multi-layer perceptron (MLP) architectures have achieved impressive results on various vision tasks.

Image Classification object-detection +2

Fast nonlinear risk assessment for autonomous vehicles using learned conditional probabilistic models of agent futures

1 code implementation21 Sep 2021 Ashkan Jasour, Xin Huang, Allen Wang, Brian C. Williams

The presented methods address a wide range of representations for uncertain predictions including both Gaussian and non-Gaussian mixture models to predict both agent positions and control inputs conditioned on the scene contexts.

Autonomous Vehicles Position

Risk Conditioned Neural Motion Planning

1 code implementation4 Aug 2021 Xin Huang, Meng Feng, Ashkan Jasour, Guy Rosman, Brian Williams

In this paper, we propose an extension of soft actor critic model to estimate the execution risk of a plan through a risk critic and produce risk-bounded policies efficiently by adding an extra risk term in the loss function of the policy network.

Motion Planning

Sea Ice Forecasting using Attention-based Ensemble LSTM

1 code implementation27 Jul 2021 Sahara Ali, Yiyi Huang, Xin Huang, Jianwu Wang

Accurately forecasting Arctic sea ice from subseasonal to seasonal scales has been a major scientific effort with fundamental challenges at play.

PP-YOLOv2: A Practical Object Detector

1 code implementation21 Apr 2021 Xin Huang, Xinxin Wang, Wenyu Lv, Xiaying Bai, Xiang Long, Kaipeng Deng, Qingqing Dang, Shumin Han, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma, Osamu Yoshie

To meet these two concerns, we comprehensively evaluate a collection of existing refinements to improve the performance of PP-YOLO while almost keep the infer time unchanged.

Object Real-Time Object Detection

Query Driven-Graph Neural Networks for Community Search: From Non-Attributed, Attributed, to Interactive Attributed

no code implementations8 Apr 2021 Yuli Jiang, Yu Rong, Hong Cheng, Xin Huang, Kangfei Zhao, Junzhou Huang

In this paper, we propose Graph Neural Network models for both CS and ACS problems, i. e., Query Driven-GNN and Attributed Query Driven-GNN.

Attribute Community Search +2

LSTM Based Sentiment Analysis for Cryptocurrency Prediction

no code implementations27 Mar 2021 Xin Huang, Wenbin Zhang, Xuejiao Tang, Mingli Zhang, Jayachander Surbiryala, Vasileios Iosifidis, Zhen Liu, Ji Zhang

Recent studies in big data analytics and natural language processing develop automatic techniques in analyzing sentiment in the social media information.

Sentiment Analysis

Single-photon imaging over 200 km

no code implementations10 Mar 2021 Zheng-Ping Li, Jun-Tian Ye, Xin Huang, Peng-Yu Jiang, Yuan Cao, Yu Hong, Chao Yu, Jun Zhang, Qiang Zhang, Cheng-Zhi Peng, Feihu Xu, Jian-Wei Pan

Long-range active imaging has widespread applications in remote sensing and target recognition.

LLA: Loss-aware Label Assignment for Dense Pedestrian Detection

1 code implementation12 Jan 2021 Zheng Ge, JianFeng Wang, Xin Huang, Songtao Liu, Osamu Yoshie

A joint loss is then defined as the weighted summation of cls and reg losses as the assigning indicator.

object-detection Object Detection +1

A hybrid deep-learning approach for complex biochemical named entity recognition

no code implementations20 Dec 2020 Jian Liu, Lei Gao, Sujie Guo, Rui Ding, Xin Huang, Long Ye, Qinghua Meng, Asef Nazari, Dhananjay Thiruvady

In this approach, the MHATT mechanism aims to improve the recognition accuracy of abbreviations to efficiently deal with the problem of inconsistency in full-text labels.

Attribute Attribute Extraction +4

Graph Neural Networks: Taxonomy, Advances and Trends

no code implementations16 Dec 2020 Yu Zhou, Haixia Zheng, Xin Huang, Shufeng Hao, Dengao Li, Jumin Zhao

Graph neural networks provide a powerful toolkit for embedding real-world graphs into low-dimensional spaces according to specific tasks.

TabTransformer: Tabular Data Modeling Using Contextual Embeddings

11 code implementations11 Dec 2020 Xin Huang, Ashish Khetan, Milan Cvitkovic, Zohar Karnin

We propose TabTransformer, a novel deep tabular data modeling architecture for supervised and semi-supervised learning.

tabular-classification Unsupervised Pre-training

Budget Constrained Interactive Search for Multiple Targets

no code implementations3 Dec 2020 Xuliang Zhu, Xin Huang, Byron Choi, Jiaxin Jiang, Zhaonian Zou, Jianliang Xu

To address these two limitations, in this paper, we study a new problem of budget constrained interactive graph search for multiple targets called kBM-IGS-problem.

Image Classification Product Categorization Databases

Scenario-decomposition Solution Framework for Nonseparable Stochastic Control Problems

no code implementations18 Oct 2020 Xin Huang, Duan Li, Daniel Zhuoyu Long

When stochastic control problems do not possess separability and/or monotonicity, the dynamic programming pioneered by Bellman in 1950s fails to work as a time-decomposition solution method.

Optimization and Control Systems and Control Systems and Control Portfolio Management

Hyperbolic Capsule Networks for Multi-Label Classification

no code implementations ACL 2020 Boli Chen, Xin Huang, Lin Xiao, Liping Jing

Second, Hyperbolic Dynamic Routing (HDR) is introduced to aggregate hyperbolic capsules in a label-aware manner, so that the label-level discriminative information can be preserved along the depth of neural networks.

Classification General Classification +1

Fast Risk Assessment for Autonomous Vehicles Using Learned Models of Agent Futures

1 code implementation27 May 2020 Allen Wang, Xin Huang, Ashkan Jasour, Brian Williams

The presented methods address a wide range of representations for uncertain predictions including both Gaussian and non-Gaussian mixture models for predictions of both agent positions and controls.

Autonomous Vehicles Position

Delving into the Imbalance of Positive Proposals in Two-stage Object Detection

no code implementations23 May 2020 Zheng Ge, Zequn Jie, Xin Huang, Chengzheng Li, Osamu Yoshie

The first imbalance lies in the large number of low-quality RPN proposals, which makes the R-CNN module (i. e., post-classification layers) become highly biased towards the negative proposals in the early training stage.

object-detection Object Detection

NMS by Representative Region: Towards Crowded Pedestrian Detection by Proposal Pairing

no code implementations CVPR 2020 Xin Huang, Zheng Ge, Zequn Jie, Osamu Yoshie

To acquire the visible parts, a novel Paired-Box Model (PBM) is proposed to simultaneously predict the full and visible boxes of a pedestrian.

Pedestrian Detection

PS-RCNN: Detecting Secondary Human Instances in a Crowd via Primary Object Suppression

no code implementations16 Mar 2020 Zheng Ge, Zequn Jie, Xin Huang, Rong Xu, Osamu Yoshie

PS-RCNN first detects slightly/none occluded objects by an R-CNN module (referred as P-RCNN), and then suppress the detected instances by human-shaped masks so that the features of heavily occluded instances can stand out.

Human Detection Object Detection

DSSLP: A Distributed Framework for Semi-supervised Link Prediction

no code implementations27 Feb 2020 Dalong Zhang, Xianzheng Song, Ziqi Liu, Zhiqiang Zhang, Xin Huang, Lin Wang, Jun Zhou

Instead of training model on the whole graph, DSSLP is proposed to train on the \emph{$k$-hops neighborhood} of nodes in a mini-batch setting, which helps reduce the scale of the input graph and distribute the training procedure.

Link Prediction

Building Footprint Generation by IntegratingConvolution Neural Network with Feature PairwiseConditional Random Field (FPCRF)

no code implementations11 Feb 2020 Qingyu Li, Yilei Shi, Xin Huang, Xiao Xiang Zhu

Due to the complexity of buildings, the accurate and reliable generation of the building footprint from remote sensing imagery is still a challenging task.

Management

BERT-based Financial Sentiment Index and LSTM-based Stock Return Predictability

no code implementations21 Jun 2019 Joshua Zoen Git Hiew, Xin Huang, Hao Mou, Duan Li, Qi Wu, Yabo Xu

On the other hand, by combining with the other two commonly-used methods when it comes to building the sentiment index in the financial literature, i. e., the option-implied and the market-implied approaches, we propose a more general and comprehensive framework for the financial sentiment analysis, and further provide convincing outcomes for the predictability of individual stock return by combining LSTM (with a feature of a nonlinear mapping).

Sentiment Analysis

Revised Progressive-Hedging-Algorithm Based Two-layer Solution Scheme for Bayesian Reinforcement Learning

no code implementations21 Jun 2019 Xin Huang, Duan Li, Daniel Zhuoyu Long

Stochastic control with both inherent random system noise and lack of knowledge on system parameters constitutes the core and fundamental topic in reinforcement learning (RL), especially under non-episodic situations where online learning is much more demanding.

Reinforcement Learning (RL) Thompson Sampling

Fast Algorithm for K-Truss Discovery on Public-Private Graphs

no code implementations1 Jun 2019 Soroush Ebadian, Xin Huang

In public-private graphs, users share one public graph and have their own private graphs.

Hyperbolic Interaction Model For Hierarchical Multi-Label Classification

1 code implementation26 May 2019 Boli Chen, Xin Huang, Lin Xiao, Zixin Cai, Liping Jing

The main reason is that the tree-likeness of the hyperbolic space matches the complexity of symbolic data with hierarchical structures.

Classification General Classification +1

CRAD: Clustering with Robust Autocuts and Depth

1 code implementation8 Apr 2019 Xin Huang, Yulia R. Gel

We develop a new density-based clustering algorithm named CRAD which is based on a new neighbor searching function with a robust data depth as the dissimilarity measure.

Clustering Time Series +1

Simultaneous Spectral-Spatial Feature Selection and Extraction for Hyperspectral Images

no code implementations8 Apr 2019 Lefei Zhang, Qian Zhang, Bo Du, Xin Huang, Yuan Yan Tang, DaCheng Tao

In a feature representation point of view, a nature approach to handle this situation is to concatenate the spectral and spatial features into a single but high dimensional vector and then apply a certain dimension reduction technique directly on that concatenated vector before feed it into the subsequent classifier.

Dimensionality Reduction feature selection +2

Online Risk-Bounded Motion Planning for Autonomous Vehicles in Dynamic Environments

no code implementations4 Apr 2019 Xin Huang, Sungkweon Hong, Andreas Hofmann, Brian C. Williams

In this work, we model the motion planning problem as a partially observable Markov decision process (POMDP) and propose an online system that combines an intent recognition algorithm and a POMDP solver to generate risk-bounded plans for the ego vehicle navigating with a number of dynamic agent vehicles.

Autonomous Vehicles Intent Recognition +1

Uncertainty-Aware Driver Trajectory Prediction at Urban Intersections

no code implementations16 Jan 2019 Xin Huang, Stephen McGill, Brian C. Williams, Luke Fletcher, Guy Rosman

In this paper, we propose a variational neural network approach that predicts future driver trajectory distributions for the vehicle based on multiple sensors.

Trajectory Prediction

Boost Blockchain Broadcast Propagation with Tree Routing

2 code implementations30 Oct 2018 Jia Kan, Lingyi Zou, Bella Liu, Xin Huang

The research shows that the tree based routing can accelerate broadcast convergence time and reduce redundant traffic.

Distributed, Parallel, and Cluster Computing

Improve Blockchain Performance using Graph Data Structure and Parallel Mining

3 code implementations31 Aug 2018 Jia Kan, Shangzhe Chen, Xin Huang

Blockchain technology is ushering in another break-out year, the challenge of blockchain still remains to be solved.

Cryptography and Security Distributed, Parallel, and Cluster Computing

Deep Cross-media Knowledge Transfer

no code implementations CVPR 2018 Xin Huang, Yuxin Peng

For achieving the goal, this paper proposes deep cross-media knowledge transfer (DCKT) approach, which transfers knowledge from a large-scale cross-media dataset to promote the model training on another small-scale cross-media dataset.

Multimedia

MHTN: Modal-adversarial Hybrid Transfer Network for Cross-modal Retrieval

no code implementations8 Aug 2017 Xin Huang, Yuxin Peng, Mingkuan Yuan

Transfer learning is for relieving the problem of insufficient training data, but it mainly focuses on knowledge transfer only from large-scale datasets as single-modal source domain to single-modal target domain.

Cross-Modal Retrieval Representation Learning +2

Cross-modal Common Representation Learning by Hybrid Transfer Network

no code implementations1 Jun 2017 Xin Huang, Yuxin Peng, Mingkuan Yuan

Knowledge in source domain cannot be directly transferred to both two different modalities in target domain, and the inherent cross-modal correlation contained in target domain provides key hints for cross-modal retrieval which should be preserved during transfer process.

Cross-Modal Retrieval Representation Learning +1

Cross-media Similarity Metric Learning with Unified Deep Networks

no code implementations14 Apr 2017 Jinwei Qi, Xin Huang, Yuxin Peng

Motivated by the strong ability of deep neural network in feature representation and comparison functions learning, we propose the Unified Network for Cross-media Similarity Metric (UNCSM) to associate cross-media shared representation learning with distance metric in a unified framework.

Metric Learning Representation Learning +3

Cross-modal Deep Metric Learning with Multi-task Regularization

no code implementations21 Mar 2017 Xin Huang, Yuxin Peng

The quadruplet ranking loss can model the semantically similar and dissimilar constraints to preserve cross-modal relative similarity ranking information.

Cross-Modal Retrieval Metric Learning +4

Information-theoretic interpretation of tuning curves for multiple motion directions

no code implementations1 Feb 2017 Wentao Huang, Xin Huang, Kechen Zhang

We have developed an efficient information-maximization method for computing the optimal shapes of tuning curves of sensory neurons by optimizing the parameters of the underlying feedforward network model.

Cannot find the paper you are looking for? You can Submit a new open access paper.