Search Results for author: Yan Xu

Found 117 papers, 43 papers with code

CAiRE in DialDoc21: Data Augmentation for Information Seeking Dialogue System

1 code implementation • ACL (dialdoc) 2021 • Yan Xu, Etsuko Ishii, Genta Indra Winata, Zhaojiang Lin, Andrea Madotto, Zihan Liu, Peng Xu, Pascale Fung

Information-seeking dialogue systems, including knowledge identification and response generation, aim to respond to users with fluent, coherent, and informative responses based on users’ needs.

Data Augmentation Response Generation


Integrating Question Rewrites in Conversational Question Answering: A Reinforcement Learning Approach

no code implementations • ACL 2022 • Etsuko Ishii, Bryan Wilie, Yan Xu, Samuel Cahyawijaya, Pascale Fung

Resolving dependencies among dialogue history is one of the main obstacles in the research on conversational question answering (QA).

Conversational Question Answering reinforcement-learning +1


Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior

no code implementations • 10 Apr 2024 • Fan Lu, Kwan-Yee Lin, Yan Xu, Hongsheng Li, Guang Chen, Changjun Jiang

(2) To handle the unbounded nature of urban scenes, we represent the 3D scene with a Scalable Hash Grid structure, which incrementally adapts to the growing scale of urban scenes.

3D Generation Model Optimization +2


Super-Resolution of SOHO/MDI Magnetograms of Solar Active Regions Using SDO/HMI Data and an Attention-Aided Convolutional Neural Network

no code implementations • 27 Mar 2024 • Chunhui Xu, Jason T. L. Wang, Haimin Wang, Haodi Jiang, Qin Li, Yasser Abduallah, Yan Xu

Image super-resolution has been an important subject in image processing and recognition.

Image Super-Resolution SSIM


LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing

no code implementations • 15 Feb 2024 • Bryan Wang, Yuliang Li, Zhaoyang Lv, Haijun Xia, Yan Xu, Raj Sodhi

Based on these findings, we propose design implications to inform the future development of agent-assisted content editing.

Video Editing


Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras

no code implementations • 28 Jan 2024 • Yu-Jhe Li, Yan Xu, Rawal Khirodkar, Jinhyung Park, Kris Kitani

To evaluate our proposed pipeline, we collect three sets of RGBD videos recorded from multiple sparse-view depth cameras, with manually annotated ground-truth 3D poses.

3D Human Pose Estimation 3D Pose Estimation +2


Joint Planning of Active Distribution Network and EV Charging Stations Considering Vehicle-to-Grid Functionality and Reactive Power Support

no code implementations • 26 Dec 2023 • Yongheng Wang, Xinwei Shen, Yan Xu

This paper proposes a collaborative planning model for the active distribution network (ADN) and electric vehicle (EV) charging stations that fully considers the vehicle-to-grid (V2G) function and reactive power support of EVs in different regions.


Multi-View Person Matching and 3D Pose Estimation with Arbitrary Uncalibrated Camera Networks

no code implementations • 4 Dec 2023 • Yan Xu, Kris Kitani

The 2D human poses used in clustering are obtained through a pre-trained 2D pose detector, so our method does not require expensive 3D training data for each new scene.

3D Human Pose Estimation 3D Pose Estimation +1


Contrastive Learning for Inference in Dialogue

1 code implementation • 19 Oct 2023 • Etsuko Ishii, Yan Xu, Bryan Wilie, Ziwei Ji, Holy Lovenia, Willy Chung, Pascale Fung

Inference, especially when derived from inductive processes, is a crucial component of conversation that complements the information implicitly or explicitly conveyed by a speaker.

Contrastive Learning


Deep learning based on Transformer architecture for power system short-term voltage stability assessment with class imbalance

no code implementations • 18 Oct 2023 • Yang Li, Jiting Cao, Yan Xu, Lipeng Zhu, Zhao Yang Dong

This work proposes a Transformer-based STVSA method to address this challenge.

Clustering Generative Adversarial Network +1


Towards Mitigating Hallucination in Large Language Models via Self-Reflection

no code implementations • 10 Oct 2023 • Ziwei Ji, Tiezheng Yu, Yan Xu, Nayeon Lee, Etsuko Ishii, Pascale Fung

Large language models (LLMs) have shown promise for generative and knowledge-intensive tasks including question-answering (QA) tasks.

Answer Generation Hallucination +1


PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems

1 code implementation • 19 Sep 2023 • Bryan Wilie, Yan Xu, Willy Chung, Samuel Cahyawijaya, Holy Lovenia, Pascale Fung

Grounding dialogue response generation on external knowledge is proposed to produce informative and engaging responses.

Hallucination Language Modelling +1


Preserving Tumor Volumes for Unsupervised Medical Image Registration

1 code implementation • ICCV 2023 • Qihua Dong, Hao Du, Ying Song, Yan Xu, Jing Liao

Our approach balances image similarity and volume preservation in different regions, i.e., normal and tumor regions, by using soft tumor masks to adjust the imposition of volume-preserving loss on each one.

Anatomy Image Registration +1


Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images

1 code implementation • 14 Sep 2023 • Zhiyun Song, Penghui Du, Junpeng Yan, Kailu Li, Jianzhong Shou, Maode Lai, Yubo Fan, Yan Xu

Self-supervised pretraining attempts to enhance model performance by obtaining effective features from unlabeled data, and has demonstrated its effectiveness in the field of histopathology images.

Image-to-Image Translation Instance Segmentation +3


NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

no code implementations • 5 Sep 2023 • TaeHoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh, Jonghwan Mun, Solgil Oh, Kenan Emir Ak, Gwang-Gook Lee, Yan Xu, Mingwei Shen, Kyomin Hwang, Wonsik Shin, Kamin Lee, Wonhark Park, Dongkwan Lee, Nojun Kwak, Yujin Wang, Yimu Wang, Tiancheng Gu, Xingchang Lv, Mingmao Sun

In this report, we introduce the NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of the 2023 challenge.

Fairness Image Captioning


Physics-Informed Deep Learning to Reduce the Bias in Joint Prediction of Nitrogen Oxides

no code implementations • 14 Aug 2023 • Lianfa Li, Roxana Khalili, Frederick Lurmann, Nathan Pavlovic, Jun Wu, Yan Xu, Yisi Liu, Karl O'Sharkey, Beate Ritz, Luke Oman, Meredith Franklin, Theresa Bastain, Shohreh F. Farzan, Carrie Breton, Rima Habre

Atmospheric nitrogen oxides (NOx) primarily from fuel combustion have recognized acute and chronic health and environmental effects.


Partitioned Saliency Ranking with Dense Pyramid Transformers

1 code implementation • 1 Aug 2023 • Chengxiao Sun, Yan Xu, Jialun Pei, Haopeng Fang, He Tang

The ranking by partition paradigm alleviates ranking ambiguities in a general sense, as it consistently improves the performance of other saliency ranking models.

Saliency Ranking


Urban Radiance Field Representation with Deformable Neural Mesh Primitives

1 code implementation • ICCV 2023 • Fan Lu, Yan Xu, Guang Chen, Hongsheng Li, Kwan-Yee Lin, Changjun Jiang

To construct urban-level radiance fields efficiently, we design Deformable Neural Mesh Primitive (DNMP), and propose to parameterize the entire scene with such primitives.

Image Generation Novel View Synthesis


DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image Synthesis

1 code implementation • 11 Jul 2023 • Zhiwen Yang, Yang Zhou, Hui Zhang, Bingzheng Wei, Yubo Fan, Yan Xu

To address this, we develop a generalist model that shares architecture and parameters across centers to utilize the shared knowledge.

Image Generation


Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

1 code implementation • 30 Jun 2023 • Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

Foremost, our work demonstrates that the VLPM pre-trained on natural image-text pairs exhibits astonishing potential for downstream tasks in the medical field as well.

object-detection Object Detection


Elastically-Constrained Meta-Learner for Federated Learning

no code implementations • 29 Jun 2023 • Peng Lan, Donglai Chen, Chong Xie, Keshu Chen, Jinyuan He, Juntao Zhang, Yonghong Chen, Yan Xu

One of the challenges in federated learning is non-IID data across clients, as a single model cannot fit the data distribution of all clients.

Federated Learning Meta-Learning


A Novel Dual-pooling Attention Module for UAV Vehicle Re-identification

no code implementations • 25 Jun 2023 • Xiaoyan Guo, Jie Yang, Xinyu Jia, Chuanyan Zang, Yan Xu, Zhaoyang Chen

Therefore, this paper proposes a novel dual-pooling attention (DpA) module, which achieves the extraction and enhancement of locally important information about vehicles from both channel and spatial dimensions by constructing two branches of channel-pooling attention (CpA) and spatial-pooling attention (SpA), and employing multiple pooling operations to enhance the attention to fine-grained information of vehicles.

Single Particle Analysis Vehicle Re-Identification


Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation

1 code implementation • 5 Jun 2023 • Yang Zhou, Yongjian Wu, Zihua Wang, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

Experiments on three datasets demonstrate the good generality of our method, which outperforms other image-level weakly supervised methods for nuclei instance segmentation, and achieves comparable performance to fully-supervised methods.

Instance Segmentation Multi-Task Learning +4


Diverse and Faithful Knowledge-Grounded Dialogue Generation via Sequential Posterior Inference

1 code implementation • 1 Jun 2023 • Yan Xu, Deqian Kong, Dehong Xu, Ziwei Ji, Bo Pang, Pascale Fung, Ying Nian Wu

The capability to generate responses with diversity and faithfulness using factual knowledge is paramount for creating a human-like, trustworthy dialogue system.

Dialogue Generation Response Generation


Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following

no code implementations • 7 Apr 2023 • Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan

ECL consists of: (i) an instruction parser that translates the natural languages into executable programs; (ii) an embodied concept learner that grounds visual concepts based on language descriptions; (iii) a map constructor that estimates depth and constructs semantic maps by leveraging the learned concepts; and (iv) a program executor with deterministic policies to execute each program.

Instruction Following Self-Supervised Learning


KILM: Knowledge Injection into Encoder-Decoder Language Models

1 code implementation • 17 Feb 2023 • Yan Xu, Mahdi Namazifar, Devamanyu Hazarika, Aishwarya Padmakumar, Yang Liu, Dilek Hakkani-Tür

Large pre-trained language models (PLMs) have been shown to retain implicit knowledge within their parameters.

Entity Disambiguation


A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity

1 code implementation • 8 Feb 2023 • Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung

It is, for example, better at deductive than inductive reasoning.

Code Generation Hallucination +4


Optimization of Topology-Aware Job Allocation on a High-Performance Computing Cluster by Neural Simulated Annealing

1 code implementation • 6 Feb 2023 • Zekang Lan, Yan Xu, Yingkun Huang, Dian Huang, Shengzhong Feng

For the DCAS, an approach called neural simulated annealing (NSA), which is an extension to simulated annealing (SA) that learns a repair operator and employs it in a guided heuristic search, is proposed.

Scheduling


Weakly-Supervised 3D Medical Image Segmentation using Geometric Prior and Contrastive Similarity

no code implementations • 4 Feb 2023 • Hao Du, Qihua Dong, Yan Xu, Jing Liao

Furthermore, we propose contrastive similarity to encourage organ pixels to gather around in the contrastive embedding space, which helps better distinguish low-contrast tissues.

Image Segmentation Medical Image Segmentation +3


Exploring Semantic Perturbations on Grover

1 code implementation • 1 Feb 2023 • Pranav Kulkarni, Ziqing Ji, Yan Xu, Marko Neskovic, Kevin Nolan

With news and information being as easy to access as they currently are, it is more important than ever to ensure that people are not misled by what they read.

Fake News Detection


Variational Degeneration to Structural Refinement: A Unified Framework for Superimposed Image Decomposition

no code implementations • ICCV 2023 • Wenyu Li, Yan Xu, Yang Yang, Haoran Ji, Yue Lang

Several unified frameworks have been proposed that can handle different types of degradation in superimposed image decomposition.

Image Restoration Image Shadow Removal +3


NusaCrowd: Open Source Initiative for Indonesian NLP Resources

1 code implementation • 19 Dec 2022 • Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Ignatius, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Noor Fatyanosa, Ziwei Ji, Pascale Fung, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti

We present NusaCrowd, a collaborative initiative to collect and unify existing resources for Indonesian languages, including opening access to previously non-public resources.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1


Lightweight Facial Attractiveness Prediction Using Dual Label Distribution

no code implementations • 4 Dec 2022 • Shu Liu, Enquan Huang, Yan Xu, Kexuan Wang, Xiaoyan Kui, Tao Lei, Hongying Meng

To make the best use of the dataset, the manual ratings, attractiveness score, and standard deviation are aggregated explicitly to construct a dual label distribution, including the attractiveness distribution and the rating distribution.


An interpretable imbalanced semi-supervised deep learning framework for improving differential diagnosis of skin diseases

no code implementations • 20 Nov 2022 • Futian Weng, Yuanting Ma, Jinghan Sun, Shijun Shan, Qiyuan Li, Jianping Zhu, Yang Wang, Yan Xu

This paper presents the first study of the interpretability and imbalanced semi-supervised learning of the multiclass intelligent skin diagnosis framework (ISDL) using 58,457 skin images with 10,857 unlabeled samples.

Specificity


A Deep Learning Approach to Generating Photospheric Vector Magnetograms of Solar Active Regions for SOHO/MDI Using SDO/HMI and BBSO Data

no code implementations • 4 Nov 2022 • Haodi Jiang, Qin Li, Zhihang Hu, Nian Liu, Yasser Abduallah, Ju Jing, Genwei Zhang, Yan Xu, Wynne Hsu, Jason T. L. Wang, Haimin Wang

We propose a new deep learning method, named MagNet, to learn from combined LOS magnetograms, Bx and By taken by SDO/HMI along with H-alpha observations collected by the Big Bear Solar Observatory (BBSO), and to generate vector components Bx' and By', which would form vector magnetograms with observed LOS data.


Inferring Line-of-Sight Velocities and Doppler Widths from Stokes Profiles of GST/NIRIS Using Stacked Deep Neural Networks

no code implementations • 8 Oct 2022 • Haodi Jiang, Qin Li, Yan Xu, Wynne Hsu, Kwangsu Ahn, Wenda Cao, Jason T. L. Wang, Haimin Wang

Obtaining high-quality magnetic and velocity fields through Stokes inversion is crucial in solar physics.


NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields

no code implementations • 24 Sep 2022 • Jiankai Sun, Yan Xu, Mingyu Ding, Hongwei Yi, Chen Wang, Jingdong Wang, Liangjun Zhang, Mac Schwager

Using current NeRF training tools, a robot can train a NeRF environment model in real-time and, using our algorithm, identify 3D bounding boxes of objects of interest within the NeRF for downstream navigation or manipulation tasks.

Object Localization Robot Navigation


NeuralMarker: A Framework for Learning General Marker Correspondence

no code implementations • 19 Sep 2022 • Zhaoyang Huang, Xiaokun Pan, Weihong Pan, Weikang Bian, Yan Xu, Ka Chun Cheung, Guofeng Zhang, Hongsheng Li

We tackle the problem of estimating correspondences from a general marker, such as a movie poster, to an image that captures such a marker.

Video Editing


3D Segmentation Guided Style-based Generative Adversarial Networks for PET Synthesis

no code implementations • 18 May 2022 • Yang Zhou, Zhiwen Yang, Hui Zhang, Eric I-Chao Chang, Yubo Fan, Yan Xu

(2) We adopt a task-driven strategy that couples a segmentation task with a generative adversarial network (GAN) framework to improve the translation performance.

Generative Adversarial Network Translation


Transformer based multiple instance learning for weakly supervised histopathology image segmentation

1 code implementation • 18 May 2022 • Ziniu Qian, Kailu Li, Maode Lai, Eric I-Chao Chang, Bingzheng Wei, Yubo Fan, Yan Xu

Histopathological image segmentation algorithms play a critical role in computer-aided diagnosis technology.

Image Segmentation Multiple Instance Learning +4


Towards Answering Open-ended Ethical Quandary Questions

no code implementations • 12 May 2022 • Yejin Bang, Nayeon Lee, Tiezheng Yu, Leila Khalatbari, Yan Xu, Samuel Cahyawijaya, Dan Su, Bryan Wilie, Romain Barraud, Elham J. Barezi, Andrea Madotto, Hayden Kee, Pascale Fung

We explore the current capability of LLMs in providing an answer with a deliberative exchange of different perspectives to an ethical quandary, in the approach of Socratic philosophy, instead of providing a closed answer like an oracle.

Few-Shot Learning Generative Question Answering +2


Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning

1 code implementation • CVPR 2022 • Li Yang, Yan Xu, Chunfeng Yuan, Wei Liu, Bing Li, Weiming Hu

They base the visual grounding on the features from pre-generated proposals or anchors, and fuse these features with the text embeddings to locate the target mentioned by the text.

Attribute object-detection +2


Can Question Rewriting Help Conversational Question Answering?

1 code implementation • insights (ACL) 2022 • Etsuko Ishii, Yan Xu, Samuel Cahyawijaya, Bryan Wilie

Question rewriting (QR) is a subtask of conversational question answering (CQA) aiming to ease the challenges of understanding dependencies among dialogue history by reformulating questions in a self-contained form.

Question Rewriting reinforcement-learning +1


WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma

no code implementations • 13 Apr 2022 • Chu Han, Xipeng Pan, Lixu Yan, Huan Lin, Bingbing Li, Su Yao, Shanshan Lv, Zhenwei Shi, Jinhai Mai, Jiatai Lin, Bingchao Zhao, Zeyan Xu, Zhizhen Wang, Yumeng Wang, Yuan Zhang, Huihui Wang, Chao Zhu, Chunhui Lin, Lijian Mao, Min Wu, Luwen Duan, Jingsong Zhu, Dong Hu, Zijie Fang, Yang Chen, Yongbing Zhang, Yi Li, Yiwen Zou, Yiduo Yu, Xiaomeng Li, Haiming Li, Yanfen Cui, Guoqiang Han, Yan Xu, Jun Xu, Huihua Yang, Chunming Li, Zhenbing Liu, Cheng Lu, Xin Chen, Changhong Liang, Qingling Zhang, Zaiyi Liu

According to the technical reports of the top-tier teams, CAM is still the most popular approach in WSSS.

Data Augmentation Weakly supervised Semantic Segmentation +1


RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization

1 code implementation • CVPR 2022 • Yan Xu, Kwan-Yee Lin, Guofeng Zhang, Xiaogang Wang, Hongsheng Li

The correspondence field estimation and pose refinement are conducted alternately in each iteration to recover the object poses.

Ranked #1 on 6D Pose Estimation using RGB on LineMOD

6D Pose Estimation using RGB Object


VScript: Controllable Script Generation with Visual Presentation

no code implementations • 1 Mar 2022 • Ziwei Ji, Yan Xu, I-Tsun Cheng, Samuel Cahyawijaya, Rita Frieske, Etsuko Ishii, Min Zeng, Andrea Madotto, Pascale Fung

In order to offer a customized script tool and inspire professional scriptwriters, we present VScript.

Dialogue Generation Retrieval +1


Robust Self-Supervised LiDAR Odometry via Representative Structure Discovery and 3D Inherent Error Modeling

1 code implementation • 27 Feb 2022 • Yan Xu, Junyi Lin, Jianping Shi, Guofeng Zhang, Xiaogang Wang, Hongsheng Li

The correct ego-motion estimation basically relies on the understanding of correspondences between adjacent LiDAR scans.

Motion Estimation


Survey of Hallucination in Natural Language Generation

no code implementations • 8 Feb 2022 • Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Yejin Bang, Delong Chen, Ho Shu Chan, Wenliang Dai, Andrea Madotto, Pascale Fung

This advancement has led to more fluent and coherent NLG, leading to improved development in downstream tasks such as abstractive summarization, dialogue generation and data-to-text generation.

Abstractive Text Summarization Data-to-Text Generation +4


Pedestrian Trajectory Prediction via Spatial Interaction Transformer Network

no code implementations • 13 Dec 2021 • Tong Su, Yu Meng, Yan Xu

As a core technology of the autonomous driving system, pedestrian trajectory prediction can significantly enhance the function of active vehicle safety and reduce road traffic injuries.

Autonomous Driving Pedestrian Trajectory Prediction +1


Whole Brain Segmentation with Full Volume Neural Network

1 code implementation • 29 Oct 2021 • Yeshu Li, Jonathan Cui, Yilun Sheng, Xiao Liang, Jingdong Wang, Eric I-Chao Chang, Yan Xu

To address these issues, we propose to adopt a full volume framework, which feeds the full volume brain image into the segmentation network and directly outputs the segmentation result for the whole brain volume.

Brain Segmentation Representation Learning +1


Fair AutoML Through Multi-objective Optimization

no code implementations • 29 Sep 2021 • Steven Gardner, Oleg Golovidov, Joshua Griffin, Patrick Koch, Rui Shi, Brett Wujek, Yan Xu

There has been a recent surge of interest in fairness measurement and bias mitigation in machine learning, given the identification of significant disparities in predictions from models in many domains.

AutoML Fairness


Tracing Halpha Fibrils through Bayesian Deep Learning

no code implementations • 16 Jul 2021 • Haodi Jiang, Ju Jing, Jiasheng Wang, Chang Liu, Qin Li, Yan Xu, Jason T. L. Wang, Haimin Wang

Our method consists of a data pre-processing component that prepares training data from a threshold-based tool, a deep learning model implemented as a Bayesian convolutional neural network for probabilistic image segmentation with uncertainty quantification to predict fibrils, and a post-processing component containing a fibril-fitting algorithm to determine fibril orientations.

Image Segmentation Segmentation +2


CAiRE in DialDoc21: Data Augmentation for Information-Seeking Dialogue System

1 code implementation • 7 Jun 2021 • Etsuko Ishii, Yan Xu, Genta Indra Winata, Zhaojiang Lin, Andrea Madotto, Zihan Liu, Peng Xu, Pascale Fung

Data Augmentation Response Generation


VS-Net: Voting with Segmentation for Visual Localization

1 code implementation • CVPR 2021 • Zhaoyang Huang, Han Zhou, Yijin Li, Bangbang Yang, Yan Xu, Xiaowei Zhou, Hujun Bao, Guofeng Zhang, Hongsheng Li

To address this problem, we propose a novel visual localization framework that establishes 2D-to-3D correspondences between the query image and the 3D map with a series of learnable scene-specific landmarks.

Segmentation Semantic Segmentation +1


Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters

1 code implementation • dialdoc (ACL) 2022 • Yan Xu, Etsuko Ishii, Samuel Cahyawijaya, Zihan Liu, Genta Indra Winata, Andrea Madotto, Dan Su, Pascale Fung

This paper proposes KnowExpert, a framework to bypass the explicit retrieval process and inject knowledge into the pre-trained language models with lightweight adapters and adapt to the knowledge-grounded dialogue task.

Response Generation Retrieval


PDNet: Toward Better One-Stage Object Detection With Prediction Decoupling

1 code implementation • 28 Apr 2021 • Li Yang, Yan Xu, Shaoru Wang, Chunfeng Yuan, Ziqi Zhang, Bing Li, Weiming Hu

However, the most suitable positions for inferring different targets, i.e., the object category and boundaries, are generally different.

Object object-detection +1


Wide-Baseline Multi-Camera Calibration using Person Re-Identification

no code implementations • CVPR 2021 • Yan Xu, Yu-Jhe Li, Xinshuo Weng, Kris Kitani

We address the problem of estimating the 3D pose of a network of cameras for large-environment wide-baseline scenarios, e.g., cameras for construction sites, sports stadiums, and public spaces.

Camera Calibration Person Re-Identification


LIFE: Lighting Invariant Flow Estimation

no code implementations • 7 Apr 2021 • Zhaoyang Huang, Xiaokun Pan, Runsen Xu, Yan Xu, Ka Chun Cheung, Guofeng Zhang, Hongsheng Li

However, local image contents are inevitably ambiguous and error-prone during the cross-image feature matching process, which hinders downstream tasks.


Semi-supervised Variational Temporal Convolutional Network for IoT Communication Multi-anomaly Detection

no code implementations • 5 Apr 2021 • Yan Xu, Yongliang Cheng

However, these devices are insecure in practice, which means that the communication networks are exposed to attackers.

Anomaly Detection


Large Scale Image Completion via Co-Modulated Generative Adversarial Networks

1 code implementation • ICLR 2021 • Shengyu Zhao, Jonathan Cui, Yilun Sheng, Yue Dong, Xiao Liang, Eric I Chang, Yan Xu

To overcome this challenge, we propose a generic new approach that bridges the gap between image-conditional and recent modulated unconditional generative architectures via co-modulation of both conditional and stochastic style representations.

Ranked #3 on Image Inpainting on FFHQ 512 x 512

Image Inpainting Image-to-Image Translation +1


CelebA-Spoof Challenge 2020 on Face Anti-Spoofing: Methods and Results

1 code implementation • 25 Feb 2021 • Yuanhan Zhang, Zhenfei Yin, Jing Shao, Ziwei Liu, Shuo Yang, Yuanjun Xiong, Wei Xia, Yan Xu, Man Luo, Jian Liu, Jianshu Li, Zhijun Chen, Mingyu Guo, Hui Li, Junfu Liu, Pengfei Gao, Tianqi Hong, Hao Han, Shijie Liu, Xinhua Chen, Di Qiu, Cheng Zhen, Dashuang Liang, Yufeng Jin, Zhanlong Hao

It is the largest face anti-spoofing dataset in terms of the number of data samples and subjects.

Face Anti-Spoofing valid


Exploring the Galactic Anticenter substructure with LAMOST & Gaia DR2

no code implementations • 7 Jan 2021 • Jing Li, Xiang-Xiang Xue, Chao Liu, Bo Zhang, Hans-Walter Rix, Jeffrey L. Carlin, Chengqun Yang, Rene A. Mendez, Jing Zhong, Hao Tian, Lan Zhang, Yan Xu, Yaqian Wu, Gang Zhao, Ruixiang Chang

Their location in [$\alpha$/M] vs. [M/H] space is more metal poor than typical thin disk stars, with [$\alpha$/M] lower than the thick disk.

Astrophysics of Galaxies


Visio-Temporal Attention for Multi-Camera Multi-Target Association

no code implementations • ICCV 2021 • Yu-Jhe Li, Xinshuo Weng, Yan Xu, Kris M. Kitani

We propose an inter-tracklet (person to person) attention mechanism that learns a representation for a target tracklet while taking into account other tracklets across multiple views.


R-SAC: Reinforcement Sample Consensus

no code implementations • CUHK Course IERG5350 2020 • Zhaoyang Huang, Yan Xu

In contrast, a model estimated from more observations may be better than from a minimum set.


CrossNER: Evaluating Cross-Domain Named Entity Recognition

5 code implementations • 8 Dec 2020 • Zihan Liu, Yan Xu, Tiezheng Yu, Wenliang Dai, Ziwei Ji, Samuel Cahyawijaya, Andrea Madotto, Pascale Fung

Cross-domain named entity recognition (NER) models are able to cope with the scarcity issue of NER samples in target domains.

Cross-Domain Named Entity Recognition Domain Adaptation +3


Multi-hop Question Generation with Graph Convolutional Network

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Dan Su, Yan Xu, Wenliang Dai, Ziwei Ji, Tiezheng Yu, Pascale Fung

Multi-hop Question Generation (QG) aims to generate answer-related questions by aggregating and reasoning over multiple scattered evidence from different paragraphs.

Question Generation Question-Generation +1


SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks

no code implementations • 19 Oct 2020 • Yan Xu, Zhaoyang Huang, Kwan-Yee Lin, Xinge Zhu, Jianping Shi, Hujun Bao, Guofeng Zhang, Hongsheng Li

To suit our network to self-supervised learning, we design several novel loss functions that utilize the inherent properties of LiDAR point clouds.

Self-Supervised Learning


Microscopic fine-grained instance classification through deep attention

no code implementations • 6 Oct 2020 • Mengran Fan, Tapabrata Chakrabort, Eric I-Chao Chang, Yan Xu, Jens Rittscher

Fine-grained classification of microscopic image data with limited samples is an open problem in computer vision and biomedical imaging.

Classification Deep Attention +2


Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Andrea Madotto, Samuel Cahyawijaya, Genta Indra Winata, Yan Xu, Zihan Liu, Zhaojiang Lin, Pascale Fung

In this paper, we propose a method to embed the KB, of any size, directly into the model parameters.

Dialogue State Tracking Management +1


Few-Shot Learning with Intra-Class Knowledge Transfer

no code implementations • 22 Aug 2020 • Vivek Roy, Yan Xu, Yu-Xiong Wang, Kris Kitani, Ruslan Salakhutdinov, Martial Hebert

Recent works have proposed to solve this task by augmenting the training data of the few-shot classes using generative models with the few-shot training samples as the seeds.

Few-Shot Learning Transfer Learning


Machine Learning in Heliophysics and Space Weather Forecasting: A White Paper of Findings and Recommendations

no code implementations • 22 Jun 2020 • Gelu Nita, Manolis Georgoulis, Irina Kitiashvili, Viacheslav Sadykov, Enrico Camporeale, Alexander Kosovichev, Haimin Wang, Vincent Oria, Jason Wang, Rafal Angryk, Berkay Aydin, Azim Ahmadzadeh, Xiaoli Bai, Timothy Bastian, Soukaina Filali Boubrahimi, Bin Chen, Alisdair Davey, Sheldon Fereira, Gregory Fleishman, Dale Gary, Andrew Gerrard, Gregory Hellbourg, Katherine Herbert, Jack Ireland, Egor Illarionov, Natsuha Kuroda, Qin Li, Chang Liu, Yuexin Liu, Hyomin Kim, Dustin Kempton, Ruizhe Ma, Petrus Martens, Ryan McGranaghan, Edward Semones, John Stefan, Andrey Stejko, Yaireska Collado-Vega, Meiqi Wang, Yan Xu, Sijie Yu

The authors of this white paper met on 16-17 January 2020 at the New Jersey Institute of Technology, Newark, NJ, for a 2-day workshop that brought together a group of heliophysicists, data providers, expert modelers, and computer/data scientists.

BIG-bench Machine Learning Weather Forecasting

Paper
Add Code

EDGE COVID-19: A Web Platform to generate submission-ready genomes for SARS-CoV-2 sequencing efforts

1 code implementation • 15 Jun 2020 • Chien-Chi Lo, Migun Shakya, Karen Davenport, Mark Flynn, Adán Myers y Gutiérrez, Bin Hu, Po-E Li, Elais Player Jackson, Yan Xu, Patrick S. G. Chain

Using an intuitive web-based interface, this workflow automates SARS-CoV-2 reference-based genome assembly, variant calling, and lineage determination, and provides the ability to submit the consensus sequence and necessary metadata to GenBank or GISAID.

Decision Making

Paper
Code

A Public Website for the Automated Assessment and Validation of SARS-CoV-2 Diagnostic PCR Assays

no code implementations • 8 Jun 2020 • Po-E Li, Adán Myers y Gutiérrez, Karen Davenport, Mark Flynn, Bin Hu, Chien-Chi Lo, Elais Player Jackson, Migun Shakya, Yan Xu, Jason Gans, Patrick S. G. Chain

Summary: Polymerase chain reaction-based assays are the current gold standard for detecting and diagnosing SARS-CoV-2.

Paper
Add Code

Inferring Vector Magnetic Fields from Stokes Profiles of GST/NIRIS Using a Convolutional Neural Network

no code implementations • 8 May 2020 • Hao Liu, Yan Xu, Jiasheng Wang, Ju Jing, Chang Liu, Jason T. L. Wang, Haimin Wang

By learning the latent patterns in the training data prepared by the physics-based ME tool, the proposed CNN method is able to infer vector magnetic fields from the Stokes profiles of GST/NIRIS.

Solar and Stellar Astrophysics

Paper
Add Code

CAiRE-COVID: A Question Answering and Query-focused Multi-Document Summarization System for COVID-19 Scholarly Information Management

1 code implementation • EMNLP (NLP-COVID19) 2020 • Dan Su, Yan Xu, Tiezheng Yu, Farhad Bin Siddique, Elham J. Barezi, Pascale Fung

We present CAiRE-COVID, a real-time question answering (QA) and multi-document summarization system, which won one of the 10 tasks in the Kaggle COVID-19 Open Research Dataset Challenge, judged by medical experts.

Document Summarization Information Retrieval +3

Paper
Code

A Natural Language Processing Pipeline of Chinese Free-text Radiology Reports for Liver Cancer Diagnosis

no code implementations • 10 Apr 2020 • Honglei Liu, Yan Xu, Zhiqiang Zhang, Ni Wang, Yanqun Huang, Yanjun Hu, Zhenghan Yang, Rui Jiang, Hui Chen

Despite the rapid development of natural language processing (NLP) implementation in electronic medical records (EMRs), Chinese EMR processing remains challenging due to the limited corpus and specific grammatical characteristics, especially for radiology reports.

Computed Tomography (CT) Coreference Resolution +3

Paper
Add Code

SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds

1 code implementation • 6 Apr 2020 • Xinge Zhu, Yuexin Ma, Tai Wang, Yan Xu, Jianping Shi, Dahua Lin

Multi-class 3D object detection aims to localize and classify objects of multiple categories from point clouds.

3D Object Detection object-detection

Paper
Code

MaskFlownet: Asymmetric Feature Matching with Learnable Occlusion Mask

3 code implementations • CVPR 2020 • Shengyu Zhao, Yilun Sheng, Yue Dong, Eric I-Chao Chang, Yan Xu

In this paper, we propose an asymmetric occlusion-aware feature matching module, which can learn a rough occlusion mask that filters useless (occluded) areas immediately after feature warping without any explicit supervision.

Ranked #2 on Optical Flow Estimation on KITTI 2012

Optical Flow Estimation

Paper
Code

SiamSNN: Siamese Spiking Neural Networks for Energy-Efficient Object Tracking

no code implementations • 17 Mar 2020 • Yihao Luo, Min Xu, Caihong Yuan, Xiang Cao, Liangqi Zhang, Yan Xu, Tianjiang Wang, Qi Feng

Recently, spiking neural networks (SNNs), the third generation of neural networks, have shown remarkable capabilities for energy-efficient computing, making them a promising alternative to deep neural networks (DNNs) with high energy consumption.

Image Classification Visual Object Tracking

Paper
Add Code

Estimating 3D Camera Pose from 2D Pedestrian Trajectories

no code implementations • 12 Dec 2019 • Yan Xu, Vivek Roy, Kris Kitani

We propose an alternative strategy for extracting 3D information to solve for camera pose by using pedestrian trajectories.

Pose Estimation

Paper
Add Code

Zero-shot Cross-lingual Dialogue Systems with Transferable Latent Variables

no code implementations • IJCNLP 2019 • Zihan Liu, Jamin Shin, Yan Xu, Genta Indra Winata, Peng Xu, Andrea Madotto, Pascale Fung

Despite the surging demands for multilingual task-oriented dialog systems (e.g., Alexa, Google Home), there has been less research done in multilingual or cross-lingual scenarios.

Intent Detection Natural Language Understanding +2

Paper
Add Code

Generalizing Question Answering System with Pre-trained Language Model Fine-tuning

no code implementations • WS 2019 • Dan Su, Yan Xu, Genta Indra Winata, Peng Xu, Hyeondey Kim, Zihan Liu, Pascale Fung

With a large number of datasets being released and new techniques being proposed, question answering (QA) systems have witnessed great breakthroughs in reading comprehension (RC) tasks.

Language Modelling Multi-Task Learning +2

Paper
Add Code

Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints

no code implementations • ICCV 2019 • Yan Xu, Xinge Zhu, Jianping Shi, Guofeng Zhang, Hujun Bao, Hongsheng Li

Most existing methods directly train a network to learn a mapping from sparse depth inputs to dense depth maps, which has difficulties in utilizing 3D geometric constraints and handling practical sensor noise.

Autonomous Driving Depth Completion

Paper
Add Code

Incorporating Word and Subword Units in Unsupervised Machine Translation Using Language Model Rescoring

no code implementations • WS 2019 • Zihan Liu, Yan Xu, Genta Indra Winata, Pascale Fung

This paper describes CAiRE's submission to the unsupervised machine translation track of the WMT'19 news shared task from German to Czech.

Language Modelling NMT +2

Paper
Add Code

Constrained Multi-Objective Optimization for Automated Machine Learning

no code implementations • 14 Aug 2019 • Steven Gardner, Oleg Golovidov, Joshua Griffin, Patrick Koch, Wayne Thompson, Brett Wujek, Yan Xu

In this work, we present a framework called Autotune that effectively handles multiple objectives and constraints that arise in machine learning problems.

BIG-bench Machine Learning Distributed Computing

Paper
Add Code

Learning to Learn Sales Prediction with Social Media Sentiment

no code implementations • WS 2019 • Zhaojiang Lin, Andrea Madotto, Genta Indra Winata, Zihan Liu, Yan Xu, Cong Gao, Pascale Fung

Paper
Add Code

Recursive Cascaded Networks for Unsupervised Medical Image Registration

5 code implementations • ICCV 2019 • Shengyu Zhao, Yue Dong, Eric I-Chao Chang, Yan Xu

We present recursive cascaded networks, a general architecture that enables learning deep cascades, for deformable image registration.

Image Registration Medical Image Registration

Paper
Code

CAiRE_HKUST at SemEval-2019 Task 3: Hierarchical Attention for Dialogue Emotion Classification

no code implementations • 10 Jun 2019 • Genta Indra Winata, Andrea Madotto, Zhaojiang Lin, Jamin Shin, Yan Xu, Peng Xu, Pascale Fung

Detecting emotion from dialogue is a challenge that has not yet been extensively surveyed.

Emotion Classification Gaussian Processes +1

Paper
Add Code

CAiRE_HKUST at SemEval-2019 Task 3: Hierarchical Attention for Dialogue Emotion Classification

no code implementations • SEMEVAL 2019 • Genta Indra Winata, Andrea Madotto, Zhaojiang Lin, Jamin Shin, Yan Xu, Peng Xu, Pascale Fung

Detecting emotion from dialogue is a challenge that has not yet been extensively surveyed.

Emotion Classification Gaussian Processes +1

Paper
Add Code

Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables

1 code implementation • CVPR 2019 • Yan Xu, Baoyuan Wu, Fumin Shen, Yanbo Fan, Yong Zhang, Heng Tao Shen, Wei Liu

Due to the sequential dependencies among words in a caption, we formulate the generation of adversarial noises for targeted partial captions as a structured output learning problem with latent variables.

Adversarial Attack Image Captioning

Paper
Code

Unsupervised 3D End-to-End Medical Image Registration with Volume Tweening Network

6 code implementations • 13 Feb 2019 • Shengyu Zhao, Tingfung Lau, Ji Luo, Eric I-Chao Chang, Yan Xu

3D medical image registration is of great clinical importance.

Image Registration Medical Image Registration

Paper
Code

Look Across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition

1 code implementation • 2 Sep 2018 • Jian Zhao, Yu Cheng, Yi Cheng, Yang Yang, Haochong Lan, Fang Zhao, Lin Xiong, Yan Xu, Jianshu Li, Sugiri Pranata, ShengMei Shen, Junliang Xing, Hengzhu Liu, Shuicheng Yan, Jiashi Feng

Benchmarking our model on one of the most popular unconstrained face recognition datasets IJB-C additionally verifies the promising generalizability of AIM in recognizing faces in the wild.

Ranked #1 on Age-Invariant Face Recognition on MORPH Album2

Age-Invariant Face Recognition Benchmarking +4

Paper
Code

Predicting breast tumor proliferation from whole-slide images: the TUPAC16 challenge

no code implementations • 22 Jul 2018 • Mitko Veta, Yujing J. Heng, Nikolas Stathonikos, Babak Ehteshami Bejnordi, Francisco Beca, Thomas Wollmann, Karl Rohr, Manan A. Shah, Dayong Wang, Mikael Rousson, Martin Hedlund, David Tellez, Francesco Ciompi, Erwan Zerhouni, David Lanyi, Matheus Viana, Vassili Kovalev, Vitali Liauchuk, Hady Ahmady Phoulady, Talha Qaiser, Simon Graham, Nasir Rajpoot, Erik Sjöblom, Jesper Molin, Kyunghyun Paeng, Sangheum Hwang, Sunggyun Park, Zhipeng Jia, Eric I-Chao Chang, Yan Xu, Andrew H. Beck, Paul J. van Diest, Josien P. W. Pluim

The best-performing automatic method for the first task achieved a quadratic-weighted Cohen's kappa score of $\kappa = 0.567$, 95% CI [0.464, 0.671], between the predicted scores and the ground truth.

Mitosis Detection whole slide images

Paper
Add Code

Human-Interactive Subgoal Supervision for Efficient Inverse Reinforcement Learning

no code implementations • 22 Jun 2018 • Xinlei Pan, Eshed Ohn-Bar, Nicholas Rhinehart, Yan Xu, Yilin Shen, Kris M. Kitani

The learning process is interactive, with a human expert first providing input in the form of full demonstrations along with some subgoal states.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Model-based clustering for identifying disease-associated SNPs in case-control genome-wide association studies

no code implementations • 21 Jun 2018 • Yan Xu, Li Xing, Jessica Su, Xuekui Zhang, Weiliang Qiu

Genome-wide association studies (GWASs) aim to detect genetic risk factors for complex human diseases by identifying disease-associated single-nucleotide polymorphisms (SNPs).

Clustering

Paper
Add Code

Towards Pose Invariant Face Recognition in the Wild

no code implementations • CVPR 2018 • Jian Zhao, Yu Cheng, Yan Xu, Lin Xiong, Jianshu Li, Fang Zhao, Karlekar Jayashree, Sugiri Pranata, ShengMei Shen, Junliang Xing, Shuicheng Yan, Jiashi Feng

To this end, we propose a Pose Invariant Model (PIM) for face recognition in the wild, with three distinct novelties.

Face Recognition Generative Adversarial Network +1

Paper
Add Code

Autotune: A Derivative-free Optimization Framework for Hyperparameter Tuning

no code implementations • 20 Apr 2018 • Patrick Koch, Oleg Golovidov, Steven Gardner, Brett Wujek, Joshua Griffin, Yan Xu

For hyperparameter tuning, machine learning algorithms are complex black-boxes.

BIG-bench Machine Learning

Paper
Add Code

MRI Cross-Modality NeuroImage-to-NeuroImage Translation

no code implementations • 22 Jan 2018 • Qianye Yang, Nannan Li, Zixu Zhao, Xingyu Fan, Eric I-Chao Chang, Yan Xu

Based on our proposed framework, we first propose a method for cross-modality registration by fusing the deformation fields to adopt the cross-modality information from translated modalities.

MRI segmentation Segmentation +1

Paper
Add Code

Unsupervised Learning for Cell-level Visual Representation in Histopathology Images with Generative Adversarial Networks

4 code implementations • 30 Nov 2017 • Bo Hu, Ye Tang, Eric I-Chao Chang, Yubo Fan, Maode Lai, Yan Xu

The visual attributes of cells, such as the nuclear morphology and chromatin openness, are critical for histopathology image analysis.

Classification General Classification +2

Paper
Code

Unsupervised End-to-end Learning for Deformable Medical Image Registration

no code implementations • 23 Nov 2017 • Siyuan Shan, Wen Yan, Xiaoqing Guo, Eric I-Chao Chang, Yubo Fan, Yan Xu

The contributions of our algorithm are threefold: (1) We transplant traditional image registration algorithms to an end-to-end convolutional neural network framework, while maintaining the unsupervised nature of image registration problems.

Deformable Medical Image Registration Image Registration +1

Paper
Add Code

Sleep Stage Classification Based on Multi-level Feature Learning and Recurrent Neural Networks via Wearable Device

no code implementations • 2 Nov 2017 • Xin Zhang, Weixuan Kou, Eric I-Chao Chang, He Gao, Yubo Fan, Yan Xu

The feature learning framework is designed to extract low- and mid-level features.

Automatic Sleep Stage Classification General Classification +1

Paper
Add Code

A Good Practice Towards Top Performance of Face Recognition: Transferred Deep Feature Fusion

1 code implementation • 3 Apr 2017 • Lin Xiong, Jayashree Karlekar, Jian Zhao, Yi Cheng, Yan Xu, Jiashi Feng, Sugiri Pranata, ShengMei Shen

In this paper, we propose a unified learning framework named Transferred Deep Feature Fusion (TDFF) targeting the new IARPA Janus Benchmark A (IJB-A) face recognition dataset released by NIST face challenge.

Face Recognition Transfer Learning

Paper
Code

Constrained Deep Weak Supervision for Histopathology Image Segmentation

no code implementations • 3 Jan 2017 • Zhipeng Jia, Xingyi Huang, Eric I-Chao Chang, Yan Xu

(2) We develop a deep weak supervision formulation to exploit multi-scale learning under weak supervision within fully convolutional networks.

Image Segmentation Multiple Instance Learning +2

Paper
Add Code

Optimizing Quantiles in Preference-based Markov Decision Processes

no code implementations • 1 Dec 2016 • Hugo Gilbert, Paul Weng, Yan Xu

In the Markov decision process model, policies are usually evaluated by expected cumulative rewards.

Paper
Add Code

Learning Multi-level Features For Sensor-based Human Action Recognition

no code implementations • 22 Nov 2016 • Yan Xu, Zhengyang Shen, Xin Zhang, Yifan Gao, Shujian Deng, Yipei Wang, Yubo Fan, Eric I-Chao Chang

This paper proposes a multi-level feature learning framework for human action recognition using a single body-worn inertial sensor.

Action Recognition Temporal Action Localization

Paper
Add Code

Gland Instance Segmentation Using Deep Multichannel Neural Networks

no code implementations • 21 Nov 2016 • Yan Xu, Yang Li, Yipei Wang, Mingyuan Liu, Yubo Fan, Maode Lai, Eric I-Chao Chang

Methods: We leverage the idea of image-to-image prediction in recent deep learning by designing an algorithm that automatically exploits and fuses complex multichannel information - regional, location, and boundary cues - in gland histology images.

Instance Segmentation Segmentation +1

Paper
Add Code

End-to-End Subtitle Detection and Recognition for Videos in East Asian Languages via CNN Ensemble with Near-Human-Level Performance

no code implementations • 18 Nov 2016 • Yan Xu, Siyuan Shan, Ziming Qiu, Zhipeng Jia, Zhengyang Shen, Yipei Wang, Mengfei Shi, Eric I-Chao Chang

In this paper, we propose an innovative end-to-end subtitle detection and recognition system for videos in East Asian languages.

Paper
Add Code

Compressing Neural Language Models by Sparse Word Representations

1 code implementation • ACL 2016 • Yunchuan Chen, Lili Mou, Yan Xu, Ge Li, Zhi Jin

Such approaches are time- and memory-intensive because of the large numbers of parameters for word embeddings and the output layer.

Language Modelling Word Embeddings

Paper
Code

Gland Instance Segmentation by Deep Multichannel Neural Networks

no code implementations • 17 Jul 2016 • Yan Xu, Yang Li, Mingyuan Liu, Yipei Wang, Yubo Fan, Maode Lai, Eric I-Chao Chang

Here we leverage the idea of image-to-image prediction in recent deep learning by building a framework that automatically exploits and fuses complex multichannel information, regional, location and boundary patterns in gland histology images.

Instance Segmentation Segmentation +1

Paper
Add Code

Gland Instance Segmentation by Deep Multichannel Side Supervision

no code implementations • 12 Jul 2016 • Yan Xu, Yang Li, Mingyuan Liu, Yipei Wang, Maode Lai, Eric I-Chao Chang

In this paper, we propose a new image instance segmentation method that segments individual glands (instances) in colon histology images.

Instance Segmentation Segmentation +1

Paper
Add Code

How Transferable are Neural Networks in NLP Applications?

no code implementations • EMNLP 2016 • Lili Mou, Zhao Meng, Rui Yan, Ge Li, Yan Xu, Lu Zhang, Zhi Jin

Transfer learning aims to make use of valuable knowledge in a source domain to help model performance in a target domain.

Transfer Learning

Paper
Add Code

Improved Relation Classification by Deep Recurrent Neural Networks with Data Augmentation

no code implementations • COLING 2016 • Yan Xu, Ran Jia, Lili Mou, Ge Li, Yunchuan Chen, Yangyang Lu, Zhi Jin

However, existing neural networks for relation classification are usually of shallow architectures (e.g., one-layer convolutional neural networks or recurrent networks).

Ranked #2 on Relation Classification on SemEval 2010 Task 8

Classification Data Augmentation +3

Paper
Add Code

Natural Language Inference by Tree-Based Convolution and Heuristic Matching

no code implementations • ACL 2016 • Lili Mou, Rui Men, Ge Li, Yan Xu, Lu Zhang, Rui Yan, Zhi Jin

In this paper, we propose the TBCNN-pair model to recognize entailment and contradiction between two sentences.

Ranked #87 on Natural Language Inference on SNLI

Natural Language Inference Sentence

Paper
Add Code

Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths

no code implementations • EMNLP 2015 • Yan Xu, Lili Mou, Ge Li, Yunchuan Chen, Hao Peng, Zhi Jin

Question Answering Relation Classification

Paper
Add Code

Distilling Word Embeddings: An Encoding Approach

no code implementations • 15 Jun 2015 • Lili Mou, Ran Jia, Yan Xu, Ge Li, Lu Zhang, Zhi Jin

Distilling knowledge from a well-trained cumbersome network to a small one has recently become a new research topic, as lightweight neural networks with high performance are particularly needed in various resource-restricted systems.

Word Embeddings

Paper
Add Code

Discriminative Neural Sentence Modeling by Tree-Based Convolution

no code implementations • EMNLP 2015 • Lili Mou, Hao Peng, Ge Li, Yan Xu, Lu Zhang, Zhi Jin

This paper proposes a tree-based convolutional neural network (TBCNN) for discriminative sentence modeling.

Ranked #7 on Text Classification on TREC-6

General Classification Sentence +2

Paper
Add Code

Building Program Vector Representations for Deep Learning

1 code implementation • 11 Sep 2014 • Lili Mou, Ge Li, Yuxuan Liu, Hao Peng, Zhi Jin, Yan Xu, Lu Zhang

In this pioneering paper, we propose the "coding criterion" to build program vector representations, which are the premise of deep learning for program analysis.

Representation Learning

Paper
Code
