1 code implementation • ECCV 2020 • Jiayong Peng, Zhiwei Xiong, Xin Huang, Zheng-Ping Li, Dong Liu, Feihu Xu
Photon-efficient imaging has enabled a number of applications relying on single-photon sensors that can capture a 3D image with as few as one photon per pixel.
no code implementations • Findings (EMNLP) 2021 • Xin Huang, Jiajun Zhang, Chengqing Zong
Inspired by the findings of (CITATION) that entities are most informative in the image, we propose an explicit entity-level cross-modal learning approach that aims to augment the entity representation.
no code implementations • Findings (EMNLP) 2021 • Xin Huang, Jung-jae Kim, Bowei Zou
Complex question answering over knowledge bases remains a challenging task because it involves reasoning over multiple pieces of information, including intermediate entities/relations and other constraints.
no code implementations • 25 Nov 2022 • Cheng Lyu, Jiake Xie, Bo Xu, Cheng Lu, Han Huang, Xin Huang, Ming Wu, Chuang Zhang, Yong Tang
The performance of trimap-free image matting methods is limited when trying to decouple the deterministic and undetermined regions, especially in scenes where foregrounds are semantically ambiguous, chromaless, or of high transmittance.
1 code implementation • 6 Nov 2022 • Xin Huang, Jongryool Kim, Bradley Rees, Chul-Ho Lee
In particular, unlike the traditional GNNs that are trained based on the entire graph in a full-batch manner, recent GNNs have been developed with different graph sampling techniques for mini-batch training of GNNs on large graphs.
no code implementations • 3 Nov 2022 • Qiao Sun, Xin Huang, Brian C. Williams, Hang Zhao
Motion prediction is crucial in enabling safe motion planning for autonomous vehicles in interactive scenarios.
no code implementations • 3 Oct 2022 • Majid Khonji, Rashid Alyassi, Wolfgang Merkt, Areg Karapetyan, Xin Huang, Sungkweon Hong, Jorge Dias, Brian Williams
In this paper, we propose a risk-aware intelligent intersection system for autonomous vehicles (AVs) as well as human-driven vehicles (HVs).
no code implementations • 9 Aug 2022 • Xin Huang, Xiaoyu Tian, Junru Gu, Qiao Sun, Hang Zhao
Recently, the occupancy flow fields representation was proposed to represent joint future states of road agents through a combination of occupancy grid and flow, which supports efficient and consistent joint predictions.
2 code implementations • 12 Jul 2022 • Jihao Liu, Xin Huang, Guanglu Song, Hongsheng Li, Yu Liu
Finally, we integrate configurable operators and DSMs into a unified search space and search with a Reinforcement Learning-based search algorithm to fully explore the optimal combination of the operators.
Ranked #12 on Neural Architecture Search on ImageNet
1 code implementation • 26 May 2022 • Jihao Liu, Xin Huang, Osamu Yoshie, Yu Liu, Hongsheng Li
In this study, we propose Mixed and Masked Image Modeling (MixMIM), a simple but efficient MIM method that is applicable to various hierarchical Vision Transformers.
Ranked #1 on Object Detection on COCO 2017 (mAP metric)
no code implementations • 1 Apr 2022 • Qi Zhang, Xin Huang, Ying Feng, Xue Wang, Hongdong Li, Qing Wang
A two-stage network is developed for novel view synthesis.
no code implementations • ACL 2022 • Xin Huang, Ashish Khetan, Rene Bidart, Zohar Karnin
Transformer-based language models such as BERT have achieved state-of-the-art performance on various NLP tasks, but are computationally prohibitive.
no code implementations • CVPR 2022 • Qiao Sun, Xin Huang, Junru Gu, Brian C. Williams, Hang Zhao
Predicting future motions of road participants is an important task for driving autonomously in urban scenes.
no code implementations • CVPR 2022 • Xin Huang, Qi Zhang, Ying Feng, Hongdong Li, Xuan Wang, Qing Wang
The key to our method is to model the physical imaging process, which dictates that the radiance of a scene point transforms to a pixel value in the LDR image with two implicit functions: a radiance field and a tone mapper.
no code implementations • 19 Oct 2021 • Yen-Ling Kuo, Xin Huang, Andrei Barbu, Stephen G. McGill, Boris Katz, John J. Leonard, Guy Rosman
Language allows humans to build mental models that interpret what is happening around them, resulting in more accurate long-term predictions.
no code implementations • 17 Oct 2021 • Xin Huang, Guy Rosman, Ashkan Jasour, Stephen G. McGill, John J. Leonard, Brian C. Williams
When predicting trajectories of road agents, motion predictors usually approximate the future distribution by a limited number of samples.
no code implementations • 9 Oct 2021 • Xin Huang, Xuejiao Tang, Wenbin Zhang, Shichao Pei, Ji Zhang, Mingli Zhang, Zhen Liu, Ruijun Chen, Yiyi Huang
The proposed disease diagnosis system also uses a graphical user interface (GUI) to help users interact with the expert system.
no code implementations • 8 Oct 2021 • Jihao Liu, Hongsheng Li, Guanglu Song, Xin Huang, Yu Liu
Recently, transformer and multi-layer perceptron (MLP) architectures have achieved impressive results on various vision tasks.
Ranked #209 on Image Classification on ImageNet
no code implementations • 5 Oct 2021 • Xin Huang, Guy Rosman, Igor Gilitschenski, Ashkan Jasour, Stephen G. McGill, John J. Leonard, Brian C. Williams
Modeling multi-modal high-level intent is important for ensuring diversity in trajectory prediction.
1 code implementation • 21 Sep 2021 • Ashkan Jasour, Xin Huang, Allen Wang, Brian C. Williams
The presented methods address a wide range of representations for uncertain predictions including both Gaussian and non-Gaussian mixture models to predict both agent positions and control inputs conditioned on the scene contexts.
1 code implementation • 4 Aug 2021 • Xin Huang, Meng Feng, Ashkan Jasour, Guy Rosman, Brian Williams
In this paper, we propose an extension of the soft actor-critic model to estimate the execution risk of a plan through a risk critic and produce risk-bounded policies efficiently by adding an extra risk term in the loss function of the policy network.
1 code implementation • 27 Jul 2021 • Sahara Ali, Yiyi Huang, Xin Huang, Jianwu Wang
Accurately forecasting Arctic sea ice from subseasonal to seasonal scales has been a major scientific effort with fundamental challenges at play.
1 code implementation • 4 Jul 2021 • Xuejiao Tang, Xin Huang, Wenbin Zhang, Travers B. Child, Qiong Hu, Zhen Liu, Ji Zhang
Moreover, the proposed model provides intuitive interpretation into visual commonsense reasoning.
1 code implementation • 21 Apr 2021 • Xin Huang, Xinxin Wang, Wenyu Lv, Xiaying Bai, Xiang Long, Kaipeng Deng, Qingqing Dang, Shumin Han, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Osamu Yoshie
To meet these two concerns, we comprehensively evaluate a collection of existing refinements to improve the performance of PP-YOLO while keeping the inference time almost unchanged.
no code implementations • 8 Apr 2021 • Yuli Jiang, Yu Rong, Hong Cheng, Xin Huang, Kangfei Zhao, Junzhou Huang
In this paper, we propose Graph Neural Network models for both CS and ACS problems, i.e., Query Driven-GNN and Attributed Query Driven-GNN.
no code implementations • 27 Mar 2021 • Xin Huang, Wenbin Zhang, Xuejiao Tang, Mingli Zhang, Jayachander Surbiryala, Vasileios Iosifidis, Zhen Liu, Ji Zhang
Recent studies in big data analytics and natural language processing develop automatic techniques for analyzing sentiment in social media information.
no code implementations • 10 Mar 2021 • Zheng-Ping Li, Jun-Tian Ye, Xin Huang, Peng-Yu Jiang, Yuan Cao, Yu Hong, Chao Yu, Jun Zhang, Qiang Zhang, Cheng-Zhi Peng, Feihu Xu, Jian-Wei Pan
Long-range active imaging has widespread applications in remote sensing and target recognition.
1 code implementation • 12 Jan 2021 • Zheng Ge, JianFeng Wang, Xin Huang, Songtao Liu, Osamu Yoshie
A joint loss is then defined as the weighted summation of cls and reg losses as the assigning indicator.
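The weighted summation described in this snippet can be sketched as follows. This is a minimal illustration, not the authors' implementation: the names `joint_loss`, `pick_lowest`, and `reg_weight` are hypothetical, and selecting the `k` candidates with the lowest joint loss is an assumed way of using the indicator that the snippet does not confirm.

```python
def joint_loss(cls_losses, reg_losses, reg_weight=1.0):
    """Weighted sum of per-candidate classification and regression losses.

    reg_weight is a hypothetical balancing factor (an assumption).
    """
    if len(cls_losses) != len(reg_losses):
        raise ValueError("cls_losses and reg_losses must have the same length")
    return [c + reg_weight * r for c, r in zip(cls_losses, reg_losses)]


def pick_lowest(indicator, k):
    """Return indices of the k candidates with the smallest indicator value.

    Assumption: a lower joint loss marks a better candidate for assignment.
    """
    order = sorted(range(len(indicator)), key=lambda i: indicator[i])
    return order[:k]


if __name__ == "__main__":
    indicator = joint_loss([0.2, 0.9, 0.4], [0.1, 0.3, 0.05], reg_weight=2.0)
    print(indicator)
    print(pick_lowest(indicator, k=2))
```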
no code implementations • 20 Dec 2020 • Jian Liu, Lei Gao, Sujie Guo, Rui Ding, Xin Huang, Long Ye, Qinghua Meng, Asef Nazari, Dhananjay Thiruvady
In this approach, the MHATT mechanism aims to improve the recognition accuracy of abbreviations to efficiently deal with the problem of inconsistency in full-text labels.
no code implementations • 16 Dec 2020 • Yu Zhou, Haixia Zheng, Xin Huang, Shufeng Hao, Dengao Li, Jumin Zhao
Graph neural networks provide a powerful toolkit for embedding real-world graphs into low-dimensional spaces according to specific tasks.
10 code implementations • 11 Dec 2020 • Xin Huang, Ashish Khetan, Milan Cvitkovic, Zohar Karnin
We propose TabTransformer, a novel deep tabular data modeling architecture for supervised and semi-supervised learning.
no code implementations • 6 Dec 2020 • Xuejiao Tang, Liuhua Zhang, Wenbin Zhang, Xin Huang, Vasileios Iosifidis, Zhen Liu, Mingli Zhang, Enza Messina, Ji Zhang
Early detection of breast cancer in X-ray mammography is believed to have effectively reduced the mortality rate.
no code implementations • 3 Dec 2020 • Xuliang Zhu, Xin Huang, Byron Choi, Jiaxin Jiang, Zhaonian Zou, Jianliang Xu
To address these two limitations, in this paper, we study a new problem of budget constrained interactive graph search for multiple targets called kBM-IGS-problem.
Image Classification • Product Categorization • Databases
no code implementations • 18 Oct 2020 • Xin Huang, Duan Li, Daniel Zhuoyu Long
When stochastic control problems do not possess separability and/or monotonicity, the dynamic programming pioneered by Bellman in the 1950s fails to work as a time-decomposition solution method.
Optimization and Control • Systems and Control • Portfolio Management
1 code implementation • Medical Image Computing and Computer Assisted Intervention 2020 • Donglai Wei, Zudi Lin, Daniel Franco-Barranco, Nils Wendt, Xingyu Liu, Wenjie Yin, Xin Huang, Aarush Gupta, Won-Dong Jang, Xueying Wang, Ignacio Arganda-Carreras, Jeff Lichtman, Hanspeter Pfister
On MitoEM, we find existing instance segmentation methods often fail to correctly segment mitochondria with complex shapes or close contacts with other instances.
Ranked #2 on 3D Instance Segmentation on MitoEM (AP75-R-Test metric)
2 code implementations • 27 Sep 2020 • Majed El Helou, Ruofan Zhou, Sabine Süsstrunk, Radu Timofte, Mahmoud Afifi, Michael S. Brown, Kele Xu, Hengxing Cai, Yuzhong Liu, Li-Wen Wang, Zhi-Song Liu, Chu-Tak Li, Sourya Dipta Das, Nisarg A. Shah, Akashdeep Jassal, Tongtong Zhao, Shanshan Zhao, Sabari Nathan, M. Parisa Beham, R. Suganya, Qing Wang, Zhongyun Hu, Xin Huang, Yaning Li, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Densen Puthussery, Hrishikesh P. S, Melvin Kuriakose, Jiji C. V, Yu Zhu, Liping Dong, Zhuolong Jiang, Chenghua Li, Cong Leng, Jian Cheng
The first track considered one-to-one relighting; the objective was to relight an input photo of a scene with a different color temperature and illuminant orientation (i.e., light source position).
no code implementations • ACL 2020 • Boli Chen, Xin Huang, Lin Xiao, Liping Jing
Second, Hyperbolic Dynamic Routing (HDR) is introduced to aggregate hyperbolic capsules in a label-aware manner, so that the label-level discriminative information can be preserved along the depth of neural networks.
1 code implementation • 27 May 2020 • Allen Wang, Xin Huang, Ashkan Jasour, Brian Williams
The presented methods address a wide range of representations for uncertain predictions including both Gaussian and non-Gaussian mixture models for predictions of both agent positions and controls.
no code implementations • 23 May 2020 • Zheng Ge, Zequn Jie, Xin Huang, Chengzheng Li, Osamu Yoshie
The first imbalance lies in the large number of low-quality RPN proposals, which makes the R-CNN module (i.e., post-classification layers) become highly biased towards the negative proposals in the early training stage.
no code implementations • CVPR 2020 • Xin Huang, Zheng Ge, Zequn Jie, Osamu Yoshie
To acquire the visible parts, a novel Paired-Box Model (PBM) is proposed to simultaneously predict the full and visible boxes of a pedestrian.
no code implementations • 18 Mar 2020 • Xin Huang, Stephen G. McGill, Jonathan A. DeCastro, Luke Fletcher, John J. Leonard, Brian C. Williams, Guy Rosman
Predicting driver intentions is a difficult and crucial task for advanced driver assistance systems.
no code implementations • 16 Mar 2020 • Zheng Ge, Zequn Jie, Xin Huang, Rong Xu, Osamu Yoshie
PS-RCNN first detects slightly occluded or non-occluded objects with an R-CNN module (referred to as P-RCNN), and then suppresses the detected instances with human-shaped masks so that the features of heavily occluded instances can stand out.
Ranked #2 on Object Detection on WiderPerson
no code implementations • 27 Feb 2020 • Dalong Zhang, Xianzheng Song, Ziqi Liu, Zhiqiang Zhang, Xin Huang, Lin Wang, Jun Zhou
Instead of training the model on the whole graph, DSSLP is proposed to train on the $k$-hop neighborhood of nodes in a mini-batch setting, which helps reduce the scale of the input graph and distribute the training procedure.
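Restricting training to the neighborhood of a mini-batch can be illustrated with a plain breadth-first search. The sketch below is only an assumption-laden illustration, not DSSLP itself: the function name `k_hop_neighborhood`, the adjacency-dictionary graph format, and the absence of any neighbor sampling are all choices made here for clarity.

```python
from collections import deque


def k_hop_neighborhood(adj, seeds, k):
    """Return all nodes within k hops of any seed node.

    adj   : dict mapping a node to an iterable of its neighbors (assumed format)
    seeds : nodes in the current mini-batch
    k     : maximum number of hops to expand
    """
    visited = set(seeds)
    frontier = deque((s, 0) for s in visited)
    while frontier:
        node, depth = frontier.popleft()
        if depth == k:
            continue  # do not expand beyond k hops
        for nb in adj.get(node, ()):
            if nb not in visited:
                visited.add(nb)
                frontier.append((nb, depth + 1))
    return visited


if __name__ == "__main__":
    # A path graph 0 - 1 - 2 - 3 - 4, stored as an undirected adjacency dict.
    path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
    print(sorted(k_hop_neighborhood(path, seeds=[0], k=2)))
```

Only the returned node set (and the edges among those nodes) would then need to be loaded for the mini-batch, which is what keeps the input graph small.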
no code implementations • 11 Feb 2020 • Qingyu Li, Yilei Shi, Xin Huang, Xiao Xiang Zhu
Due to the complexity of buildings, the accurate and reliable generation of the building footprint from remote sensing imagery is still a challenging task.
no code implementations • 28 Nov 2019 • Xin Huang, Stephen G. McGill, Jonathan A. DeCastro, Luke Fletcher, John J. Leonard, Brian C. Williams, Guy Rosman
Vehicle trajectory prediction is crucial for autonomous driving and advanced driver assistance systems.
1 code implementation • IJCNLP 2019 • Lin Xiao, Xin Huang, Boli Chen, Liping Jing
Multi-label text classification (MLTC) aims to tag a given document with the most relevant labels.
Ranked #1 on Multi-Label Text Classification on AAPD
no code implementations • 21 Jun 2019 • Xin Huang, Duan Li, Daniel Zhuoyu Long
Stochastic control with both inherent random system noise and lack of knowledge on system parameters constitutes the core and fundamental topic in reinforcement learning (RL), especially under non-episodic situations where online learning is much more demanding.
no code implementations • 21 Jun 2019 • Joshua Zoen Git Hiew, Xin Huang, Hao Mou, Duan Li, Qi Wu, Yabo Xu
On the other hand, by combining with the other two methods commonly used to build the sentiment index in the financial literature, i.e., the option-implied and the market-implied approaches, we propose a more general and comprehensive framework for financial sentiment analysis, and further provide convincing outcomes for the predictability of individual stock returns by combining LSTM (with a feature of a nonlinear mapping).
no code implementations • 1 Jun 2019 • Soroush Ebadian, Xin Huang
In public-private graphs, users share one public graph and have their own private graphs.
1 code implementation • 26 May 2019 • Boli Chen, Xin Huang, Lin Xiao, Zixin Cai, Liping Jing
The main reason is that the tree-likeness of the hyperbolic space matches the complexity of symbolic data with hierarchical structures.
1 code implementation • 24 May 2019 • Xin Huang, Boli Chen, Lin Xiao, Liping Jing
Extreme multi-label text classification (XMTC) aims at tagging a document with the most relevant labels from an extremely large-scale label set.
Ranked #1 on Multi-Label Text Classification on Amazon-12K
no code implementations • 30 Apr 2019 • Xianbin Hong, Gautam Pal, Sheng-Uei Guan, Prudence Wong, Dawei Liu, Ka Lok Man, Xin Huang
Lifelong machine learning is a novel machine learning paradigm which can continually accumulate knowledge during learning.
no code implementations • 8 Apr 2019 • Lefei Zhang, Qian Zhang, Bo Du, Xin Huang, Yuan Yan Tang, DaCheng Tao
From a feature representation point of view, a natural approach to handle this situation is to concatenate the spectral and spatial features into a single but high-dimensional vector and then apply a dimension reduction technique directly to that concatenated vector before feeding it into the subsequent classifier.
1 code implementation • 8 Apr 2019 • Xin Huang, Yulia R. Gel
We develop a new density-based clustering algorithm named CRAD which is based on a new neighbor searching function with a robust data depth as the dissimilarity measure.
no code implementations • 4 Apr 2019 • Xin Huang, Sungkweon Hong, Andreas Hofmann, Brian C. Williams
In this work, we model the motion planning problem as a partially observable Markov decision process (POMDP) and propose an online system that combines an intent recognition algorithm and a POMDP solver to generate risk-bounded plans for the ego vehicle navigating with a number of dynamic agent vehicles.
no code implementations • 16 Jan 2019 • Xin Huang, Stephen McGill, Brian C. Williams, Luke Fletcher, Guy Rosman
In this paper, we propose a variational neural network approach that predicts future driver trajectory distributions for the vehicle based on multiple sensors.
2 code implementations • 30 Oct 2018 • Jia Kan, Lingyi Zou, Bella Liu, Xin Huang
The research shows that tree-based routing can accelerate broadcast convergence time and reduce redundant traffic.
Distributed, Parallel, and Cluster Computing
3 code implementations • 31 Aug 2018 • Jia Kan, Shangzhe Chen, Xin Huang
Blockchain technology is ushering in another break-out year, but the challenges of blockchain still remain to be solved.
Cryptography and Security • Distributed, Parallel, and Cluster Computing
1 code implementation • ICML 2018 • Hanjun Dai, Hui Li, Tian Tian, Xin Huang, Lin Wang, Jun Zhu, Le Song
Deep learning on graph structures has shown exciting results in various applications.
no code implementations • CVPR 2018 • Xin Huang, Yuxin Peng
To achieve this goal, this paper proposes the deep cross-media knowledge transfer (DCKT) approach, which transfers knowledge from a large-scale cross-media dataset to promote model training on another small-scale cross-media dataset.
Multimedia
no code implementations • 8 Aug 2017 • Xin Huang, Yuxin Peng, Mingkuan Yuan
Transfer learning relieves the problem of insufficient training data, but it mainly focuses on knowledge transfer only from large-scale datasets as a single-modal source domain to a single-modal target domain.
no code implementations • 1 Jun 2017 • Xin Huang, Yuxin Peng, Mingkuan Yuan
Knowledge in the source domain cannot be directly transferred to the two different modalities in the target domain, and the inherent cross-modal correlation contained in the target domain provides key hints for cross-modal retrieval, which should be preserved during the transfer process.
no code implementations • 14 Apr 2017 • Jinwei Qi, Xin Huang, Yuxin Peng
Motivated by the strong ability of deep neural networks in feature representation and comparison function learning, we propose the Unified Network for Cross-media Similarity Metric (UNCSM) to associate cross-media shared representation learning with distance metric learning in a unified framework.
no code implementations • 21 Mar 2017 • Xin Huang, Yuxin Peng
The quadruplet ranking loss can model the semantically similar and dissimilar constraints to preserve cross-modal relative similarity ranking information.
no code implementations • 1 Feb 2017 • Wentao Huang, Xin Huang, Kechen Zhang
We have developed an efficient information-maximization method for computing the optimal shapes of tuning curves of sensory neurons by optimizing the parameters of the underlying feedforward network model.