Search Results for author: Long Chen

Found 91 papers, 33 papers with code

Vietnamese event detection based on Chinese information and Vietnamese syntax guidance

no code implementations CCL 2021 Long Chen, Junjun Guo, Yafei Zhang, Chengxiang Gao, Zhengtao Yu

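“Current deep-learning-based event detection models all rely on a sufficient amount of annotated data, and the scarcity of annotated data together with event-type ambiguity poses great challenges for Vietnamese event detection. Based on the multilingual consistency property that ‘sentences expressing the same idea in different languages usually share the same or similar semantic components’, this paper proposes a Vietnamese event detection framework guided by Chinese information and Vietnamese syntax. First, Chinese information is integrated into Vietnamese through a shared-encoder strategy and a cross-attention network; next, a graph convolutional network incorporates Vietnamese dependency-syntax information; finally, Vietnamese event detection is performed under the guidance of Chinese event types. Experimental results show that Vietnamese event detection achieves good results under the guidance of Chinese information and Vietnamese syntax.”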

Event Detection

Rethinking the Two-Stage Framework for Grounded Situation Recognition

1 code implementation 10 Dec 2021 Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Tat-Seng Chua

Since each verb is associated with a specific set of semantic roles, all existing GSR methods resort to a two-stage framework: predicting the verb in the first stage and detecting the semantic roles in the second stage.

Object Recognition

Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs

no code implementations 8 Dec 2021 Kaifeng Gao, Long Chen, Yulei Niu, Jian Shao, Jun Xiao

To this end, we propose a new classification-then-grounding framework for VidSGG, which can avoid all the three overlooked drawbacks.

Predicate Classification

High-throughput Phenotyping of Nematode Cysts

no code implementations 13 Oct 2021 Long Chen, Matthias Daub, Hans-Georg Luigs, Marcus Jansen, Martin Strauch, Dorit Merhof

The beet cyst nematode (BCN) Heterodera schachtii is a plant pest responsible for crop loss on a global scale.

Instance Segmentation Semantic Segmentation

Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering

1 code implementation 3 Oct 2021 Long Chen, Yuhang Zheng, Yulei Niu, Hanwang Zhang, Jun Xiao

Specifically, CSST is composed of two parts: Counterfactual Samples Synthesizing (CSS) and Counterfactual Samples Training (CST).

Question Answering Visual Question Answering

Natural Language Video Localization with Learnable Moment Proposals

no code implementations EMNLP 2021 Shaoning Xiao, Long Chen, Jian Shao, Yueting Zhuang, Jun Xiao

Given an untrimmed video and a natural language query, Natural Language Video Localization (NLVL) aims to identify the video moment described by the query.

On Pursuit of Designing Multi-modal Transformer for Video Grounding

no code implementations EMNLP 2021 Meng Cao, Long Chen, Mike Zheng Shou, Can Zhang, Yuexian Zou

Almost all existing video grounding methods fall into two frameworks: 1) Top-down model: It predefines a set of segment candidates and then conducts segment classification and regression.

Instance-wise or Class-wise? A Tale of Neighbor Shapley for Concept-based Explanation

no code implementations 3 Sep 2021 Jiahui Li, Kun Kuang, Lin Li, Long Chen, Songyang Zhang, Jian Shao, Jun Xiao

Deep neural networks have demonstrated remarkable performance in many data-driven and prediction-oriented applications, and sometimes even perform better than humans.

Medical Diagnosis

Video Relation Detection via Tracklet based Visual Transformer

1 code implementation 19 Aug 2021 Kaifeng Gao, Long Chen, Yifeng Huang, Jun Xiao

Video Visual Relation Detection (VidVRD) has received significant attention from our community in recent years.

Video Visual Relation Detection

FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention

no code implementations NeurIPS 2021 Tan M. Nguyen, Vai Suliafu, Stanley J. Osher, Long Chen, Bao Wang

For instance, FMMformers achieve an average classification accuracy of $60.74\%$ over the five Long Range Arena tasks, which is significantly better than the standard transformer's average accuracy of $58.70\%$.

Language Modelling

CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention

2 code implementations 31 Jul 2021 Wenxiao Wang, Lu Yao, Long Chen, Binbin Lin, Deng Cai, Xiaofei He, Wei Liu

On the one hand, CEL blends each embedding with multiple patches of different scales, providing the self-attention module itself with cross-scale features.

Image Classification Instance Segmentation +2

Graph-based Label Propagation for Semi-Supervised Speaker Identification

no code implementations 15 Jun 2021 Long Chen, Venkatesh Ravichandran, Andreas Stolcke

We show in experiments on the VoxCeleb dataset that this approach makes effective use of unlabeled data and improves speaker identification accuracy compared to two state-of-the-art scoring methods as well as their semi-supervised variants based on pseudo-labels.

Speaker Identification Speaker Recognition

Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning

no code implementations 1 Jun 2021 Jiahui Li, Kun Kuang, Baoxiang Wang, Furui Liu, Long Chen, Fei Wu, Jun Xiao

Specifically, Shapley Value and its desired properties are leveraged in deep MARL to credit any combinations of agents, which grants us the capability to estimate the individual credit for each agent.

Multi-agent Reinforcement Learning Starcraft +1

SDNet: mutil-branch for single image deraining using swin

no code implementations 31 May 2021 Fuxiang Tan, YuTing Kong, Yingying Fan, Feng Liu, Daxin Zhou, Hao Zhang, Long Chen, Liang Gao, Yurong Qian

The former implements the basic rain pattern feature extraction, while the latter fuses different features to further extract and process the image features.

Autonomous Driving Single Image Deraining

What data do we need for training an AV motion planner?

no code implementations 26 May 2021 Long Chen, Lukas Platinsky, Stefanie Speichert, Blazej Osinski, Oliver Scheel, Yawei Ye, Hugo Grimmett, Luca Del Pero, Peter Ondruska

If cheaper sensors could be used for collection instead, data availability would go up, which is crucial in a field where data volume requirements are large and availability is small.

Imitation Learning Motion Planning

SimNet: Learning Reactive Self-driving Simulations from Real-world Observations

no code implementations 26 May 2021 Luca Bergamini, Yawei Ye, Oliver Scheel, Long Chen, Chih Hu, Luca Del Pero, Blazej Osinski, Hugo Grimmett, Peter Ondruska

We train our system directly from 1,000 hours of driving logs and measure both realism and reactivity as the two key properties of the simulation.

VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching

no code implementations 12 May 2021 Wenbo Ma, Long Chen, Hanwang Zhang, Jian Shao, Yueting Zhuang, Jun Xiao

In this paper, we argue that these methods overlook an obvious mismatch between the roles of proposals in the two stages: they generate proposals solely based on the detection confidence (i.e., query-agnostic), hoping that the proposals contain all instances mentioned in the text query (i.e., query-aware).

Text Matching

Textual Analysis of Communications in COVID-19 Infected Community on Social Media

no code implementations 3 May 2021 YuHan Liu, Yuhan Gao, Zhifan Nan, Long Chen

During the COVID-19 pandemic, people started to discuss pandemic-related topics on social media.

Conditional Training with Bounding Map for Universal Lesion Detection

no code implementations 23 Mar 2021 Han Li, Long Chen, Hu Han, S. Kevin Zhou

Universal Lesion Detection (ULD) in computed tomography plays an essential role in computer-aided diagnosis.

Human-like Controllable Image Captioning with Verb-specific Semantic Roles

1 code implementation CVPR 2021 Long Chen, Zhihong Jiang, Jun Xiao, Wei Liu

However, we argue that almost all existing objective control signals have overlooked two indispensable characteristics of an ideal control signal: 1) Event-compatible: all visual contents referred to in a single sentence should be compatible with the described activity.

Image Captioning Semantic Role Labeling

Boundary Proposal Network for Two-Stage Natural Language Video Localization

no code implementations 15 Mar 2021 Shaoning Xiao, Long Chen, Songyang Zhang, Wei Ji, Jian Shao, Lu Ye, Jun Xiao

State-of-the-art NLVL methods are almost all one-stage, and can typically be grouped into two categories: 1) anchor-based approach: it first pre-defines a series of video segment candidates (e.g., by sliding window), and then does classification for each candidate; 2) anchor-free approach: it directly predicts the probabilities for each video frame as a boundary or intermediate frame inside the positive segment.

A Closer Look at Temporal Sentence Grounding in Videos: Dataset and Metric

no code implementations 22 Jan 2021 Yitian Yuan, Xiaohan Lan, Xin Wang, Long Chen, Zhi Wang, Wenwu Zhu

All the results demonstrate that the re-organized dataset splits and new metric can better monitor the progress in TSGV.

The electric dipole moment of the tau lepton revisited

no code implementations 20 Jan 2021 Werner Bernreuther, Long Chen, Otto Nachtmann

We reconsider the issue of the search for a nonzero electric dipole form factor (EDM) $d_\tau(s)$ using optimal observables in $\tau^+\tau^-$ production by $e^+ e^-$ collisions in the center-of-mass energy range from the $\tau$-pair threshold to about $\sqrt{s} \sim 15$ GeV.

High Energy Physics - Phenomenology High Energy Physics - Experiment

Class balanced underwater object detection dataset generated by class-wise style augmentation

no code implementations 20 Jan 2021 Long Chen, Junyu Dong, Huiyu Zhou

CWSA is a new kind of data augmentation technique which augments the training data for the minority classes by generating various colors, textures and contrasts for the minority classes.

Data Augmentation Object Detection

Structured Context Enhancement Network for Mouse Pose Estimation

1 code implementation 1 Dec 2020 Feixiang Zhou, Zheheng Jiang, Zhihua Liu, Fang Chen, Long Chen, Lei Tong, Zhile Yang, Haikuan Wang, Minrui Fei, Ling Li, Huiyu Zhou

However, quantifying mouse behaviours from videos or images remains a challenging problem, where pose estimation plays an important role in describing mouse behaviours.

Animal Pose Estimation

$ZH$ production in gluon fusion: two-loop amplitudes with full top quark mass dependence

no code implementations 24 Nov 2020 Long Chen, Gudrun Heinrich, Stephen P. Jones, Matthias Kerner, Jonas Klappert, Johannes Schlenk

We present results for the two-loop helicity amplitudes entering the NLO QCD corrections to the production of a Higgs boson in association with a $Z$-boson in gluon fusion.

High Energy Physics - Phenomenology

Lightweight Single-Image Super-Resolution Network with Attentive Auxiliary Feature Learning

1 code implementation 13 Nov 2020 Xuehui Wang, Qing Wang, Yuzhi Zhao, Junchi Yan, Lei Fan, Long Chen

In this paper, we develop a computationally efficient yet accurate network based on the proposed attentive auxiliary features (A$^2$F) for SISR.

Image Super-Resolution

Multi-View Adaptive Fusion Network for 3D Object Detection

1 code implementation 2 Nov 2020 Guojun Wang, Bin Tian, Yachen Zhang, Long Chen, Dongpu Cao, Jian Wu

3D object detection based on LiDAR-camera fusion is becoming an emerging research theme for autonomous driving.

3D Object Detection Autonomous Driving +1

SWIPENET: Object detection in noisy underwater images

1 code implementation 19 Oct 2020 Long Chen, Feixiang Zhou, Shengke Wang, Junyu Dong, Ning Li, Haiping Ma, Xin Wang, Huiyu Zhou

Moreover, inspired by the human education process that drives the learning from easy to hard concepts, we here propose the CMA training paradigm that first trains a clean detector which is free from the influence of noisy data.

Small Object Detection

Accelerate CNNs from Three Dimensions: A Comprehensive Pruning Framework

no code implementations 10 Oct 2020 Wenxiao Wang, Minghao Chen, Shuai Zhao, Long Chen, Jinming Hu, Haifeng Liu, Deng Cai, Xiaofei He, Wei Liu

Specifically, it first casts the relationships between a certain model's accuracy and depth/width/resolution into a polynomial regression and then maximizes the polynomial to acquire the optimal values for the three dimensions.

Network Pruning Neural Architecture Search

Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding

1 code implementation 3 Sep 2020 Long Chen, Wenbo Ma, Jun Xiao, Hanwang Zhang, Shih-Fu Chang

The prevailing framework for solving referring expression grounding is based on a two-stage process: 1) detecting proposals with an object detector and 2) grounding the referent to one of the proposals.

Perceptual underwater image enhancement with deep learning and physical priors

1 code implementation 21 Aug 2020 Long Chen, Zheheng Jiang, Lei Tong, Zhihua Liu, Aite Zhao, Qianni Zhang, Junyu Dong, Huiyu Zhou

Underwater image enhancement, as a pre-processing step to improve the accuracy of the following object detection task, has drawn considerable attention in the field of underwater navigation and ocean exploration.

Image Enhancement Image Generation +1

Defining Digital Quadruplets in the Cyber-Physical-Social Space for Parallel Driving

no code implementations 26 Jul 2020 Teng Liu, Yang Xing, Long Chen, Dongpu Cao, Fei-Yue Wang

The objectives of the three virtual digital vehicles are interacting, guiding, simulating and improving with the real vehicles.

Digital Quadruplets for Cyber-Physical-Social Systems based Parallel Driving: From Concept to Applications

no code implementations 21 Jul 2020 Teng Liu, Xing Yang, Hong Wang, Xiaolin Tang, Long Chen, Huilong Yu, Fei-Yue Wang

The three virtual vehicles (descriptive, predictive, and prescriptive) dynamically interact with the real one in order to enhance the safety and performance of the real vehicle.

Comparison of Different Methods for Time Sequence Prediction in Autonomous Vehicles

no code implementations 16 Jul 2020 Teng Liu, Bin Tian, Yunfeng Ai, Long Chen, Fei Liu, Dongpu Cao

As a combination of various kinds of technologies, autonomous vehicles could complete a series of driving tasks by themselves, such as perception, decision-making, planning, and control.

Autonomous Vehicles Decision Making +1

CANet: Context Aware Network for 3D Brain Glioma Segmentation

1 code implementation 15 Jul 2020 Zhihua Liu, Lei Tong, Long Chen, Feixiang Zhou, Zheheng Jiang, Qianni Zhang, Yinhai Wang, Caifeng Shan, Ling Li, Huiyu Zhou

Automated segmentation of brain glioma plays an active role in diagnosis decision, progression monitoring and surgery planning.

Brain Tumor Segmentation Tumor Segmentation

CenterNet3D: An Anchor Free Object Detector for Point Cloud

2 code implementations 13 Jul 2020 Guojun Wang, Jian Wu, Bin Tian, Siyu Teng, Long Chen, Dongpu Cao

However, because of the inherent sparsity of point clouds, 3D object center points are likely to be in empty space, which makes it difficult to estimate accurate boundaries.

3D Object Detection Autonomous Driving

On Connections between Regularizations for Improving DNN Robustness

no code implementations 4 Jul 2020 Yiwen Guo, Long Chen, Yurong Chen, Chang-Shui Zhang

This paper analyzes regularization terms proposed recently for improving the adversarial robustness of deep neural networks (DNNs), from a theoretical point of view.

Adversarial Robustness Image Classification

A Benchmark dataset for both underwater image enhancement and underwater object detection

no code implementations 29 Jun 2020 Long Chen, Lei Tong, Feixiang Zhou, Zheheng Jiang, Zhenyang Li, Jialin Lv, Junyu Dong, Huiyu Zhou

To investigate how underwater image enhancement methods influence the subsequent underwater object detection tasks, in this paper we provide a large-scale underwater object detection dataset with both bounding box annotations and high-quality reference images, namely the OUC dataset.

Image Enhancement Image Quality Assessment +1

One Thousand and One Hours: Self-driving Motion Prediction Dataset

2 code implementations 25 Jun 2020 John Houston, Guido Zuidhof, Luca Bergamini, Yawei Ye, Long Chen, Ashesh Jain, Sammy Omari, Vladimir Iglovikov, Peter Ondruska

Motivated by the impact of large-scale datasets on ML systems we present the largest self-driving dataset for motion prediction to date, containing over 1,000 hours of data.

Autonomous Vehicles Motion Forecasting +2

Hierarchical Fashion Graph Network for Personalized Outfit Recommendation

1 code implementation 26 May 2020 Xingchen Li, Xiang Wang, Xiangnan He, Long Chen, Jun Xiao, Tat-Seng Chua

Fashion outfit recommendation has attracted increasing attention from online shopping services and fashion communities. Distinct from other scenarios (e.g., social networking or content sharing) which recommend a single item (e.g., a friend or picture) to a user, outfit recommendation predicts user preference on a set of well-matched fashion items. Hence, performing high-quality personalized outfit recommendation should satisfy two requirements: 1) good compatibility among fashion items and 2) consistency with user preference.

Underwater object detection using Invert Multi-Class Adaboost with deep learning

1 code implementation 23 May 2020 Long Chen, Zhihua Liu, Lei Tong, Zheheng Jiang, Shengke Wang, Junyu Dong, Huiyu Zhou

In addition, we propose a novel sample-weighted loss function which can model sample weights for SWIPENet, which uses a novel sample re-weighting algorithm, namely Invert Multi-Class Adaboost (IMA), to reduce the influence of noise on the proposed SWIPENet.

Small Object Detection

A CNN Framenwork Based on Line Annotations for Detecting Nematodes in Microscopic Images

no code implementations 21 Apr 2020 Long Chen, Martin Strauch, Matthias Daub, Xiaochen Jiang, Marcus Jansen, Hans-Georg Luigs, Susanne Schultz-Kuhlmann, Stefan Krüssel, Dorit Merhof

The endpoints serve to untangle the skeletons from which segmentation masks are reconstructed by estimating the body width at each location along the skeleton.

MixNet: Multi-modality Mix Network for Brain Segmentation

1 code implementation 21 Apr 2020 Long Chen, Dorit Merhof

Automated brain structure segmentation is important to many clinical quantitative analyses and diagnoses.

Brain Segmentation

In the Eyes of the Beholder: Analyzing Social Media Use of Neutral and Controversial Terms for COVID-19

no code implementations 21 Apr 2020 Long Chen, Hanjia Lyu, Tongyu Yang, Yu Wang, Jiebo Luo

To model the substantive difference of tweets with controversial terms and those with non-controversial terms, we apply topic modeling and LIWC-based sentiment analysis.

Sentiment Analysis

Instance Segmentation of Biomedical Images with an Object-aware Embedding Learned with Local Constraints

1 code implementation 21 Apr 2020 Long Chen, Martin Strauch, Dorit Merhof

The network is trained to output embedding vectors of similar directions for pixels from the same object, while adjacent objects are orthogonal in the embedding space, which effectively avoids the fusion of objects in a crowd.

Cell Segmentation Instance Segmentation +1

Location-Enabled IoT (LE-IoT): A Survey of Positioning Techniques, Error Sources, and Mitigation

no code implementations 7 Apr 2020 You Li, Yuan Zhuang, Xin Hu, Zhouzheng Gao, Jia Hu, Long Chen, Zhe He, Ling Pei, Kejie Chen, Maosong Wang, Xiaoji Niu, Ruizhi Chen, John Thompson, Fadhel Ghannouchi, Naser El-Sheimy

Compared to the related surveys, this paper has a more comprehensive and state-of-the-art review on IoT localization methods, an original review on IoT localization error sources and mitigation, an original review on IoT localization performance evaluation, and a more comprehensive review of IoT localization applications, opportunities, and challenges.

Networking and Internet Architecture Signal Processing

Multi-Task Learning via Co-Attentive Sharing for Pedestrian Attribute Recognition

no code implementations 7 Apr 2020 Haitian Zeng, Haizhou Ai, Zijie Zhuang, Long Chen

In this paper, we propose a novel Co-Attentive Sharing (CAS) module which extracts discriminative channels and spatial regions for more effective feature sharing in multi-task learning.

Multi-Task Learning Pedestrian Attribute Recognition

Distinguish Confusing Law Articles for Legal Judgment Prediction

1 code implementation ACL 2020 Nuo Xu, Pinghui Wang, Long Chen, Li Pan, Xiaoyan Wang, Junzhou Zhao

Legal Judgment Prediction (LJP) is the task of automatically predicting a law case's judgment results given a text describing its facts, which has excellent prospects in judicial assistance systems and convenient services for the public.

Counterfactual Samples Synthesizing for Robust Visual Question Answering

2 code implementations CVPR 2020 Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, ShiLiang Pu, Yueting Zhuang

To reduce the language biases, several recent works introduce an auxiliary question-only model to regularize the training of targeted VQA model, and achieve dominating performance on VQA-CP.

 Ranked #1 on Visual Question Answering on VQA-CP (using extra training data)

Question Answering Visual Question Answering

Transductive Zero-Shot Hashing for Multilabel Image Retrieval

1 code implementation 17 Nov 2019 Qin Zou, Zheng Zhang, Ling Cao, Long Chen, Song Wang

Given semantic annotations such as class labels and pairwise similarities of the training data, hashing methods can learn and generate effective and compact binary codes.

Multi-Label Image Retrieval Quantization

DEBUG: A Dense Bottom-Up Grounding Approach for Natural Language Video Localization

no code implementations IJCNLP 2019 Chujie Lu, Long Chen, Chilie Tan, Xiaolin Li, Jun Xiao

In this paper, we focus on natural language video localization: localizing (i.e., grounding) a natural language description in a long and untrimmed video sequence.

Learning Lightweight Pedestrian Detector with Hierarchical Knowledge Distillation

no code implementations 20 Sep 2019 Rui Chen, Haizhou Ai, Chong Shang, Long Chen, Zijie Zhuang

It remains very challenging to build a pedestrian detection system for real-world applications, which demand both accuracy and speed.

Knowledge Distillation Pedestrian Detection

Extreme Low Resolution Activity Recognition with Confident Spatial-Temporal Attention Transfer

no code implementations 9 Sep 2019 Yucai Bai, Qin Zou, Xieyuanli Chen, Lingxi Li, Zhengming Ding, Long Chen

Given that the same activity may be represented by videos in both high resolution (HR) and extreme low resolution (eLR), it is worth studying how to utilize the relevant HR data to improve eLR activity recognition.

Activity Recognition Transfer Learning

MR-GNN: Multi-Resolution and Dual Graph Neural Network for Predicting Structured Entity Interactions

1 code implementation 23 May 2019 Nuo Xu, Pinghui Wang, Long Chen, Jing Tao, Junzhou Zhao

To resolve these problems, we present MR-GNN, an end-to-end graph neural network with the following features: i) it uses a multi-resolution based architecture to extract node features from different neighborhoods of each node, and, ii) it uses dual graph-state long short-term memory networks (L-STMs) to summarize local features of each graph and extracts the interaction features between pairwise graphs.

A prescription for projectors to compute helicity amplitudes in D dimensions

no code implementations 1 Apr 2019 Long Chen

The usage of these D-dimensional polarized amplitude projectors results in helicity amplitudes that can be expressed solely in terms of external momenta, but different from those defined in the existing dimensional regularization schemes.

High Energy Physics - Phenomenology High Energy Physics - Theory

Robust Lane Detection from Continuous Driving Scenes Using Deep Neural Networks

2 code implementations 6 Mar 2019 Qin Zou, Hanwen Jiang, Qiyu Dai, Yuanhao Yue, Long Chen, Qian Wang

Specifically, information of each frame is abstracted by a CNN block, and the CNN features of multiple continuous frames, holding the property of time-series, are then fed into the RNN block for feature learning and lane prediction.

Lane Detection Time Series

Monocular Outdoor Semantic Mapping with a Multi-task Network

no code implementations 17 Jan 2019 Yucai Bai, Lei Fan, Ziyu Pan, Long Chen

First, with the correlation of underlying information between depth and semantic prediction, a novel multi-task Convolutional Neural Network (CNN) is designed for joint prediction.

3D Reconstruction Autonomous Driving +1

Counterfactual Critic Multi-Agent Training for Scene Graph Generation

no code implementations ICCV 2019 Long Chen, Hanwang Zhang, Jun Xiao, Xiangnan He, ShiLiang Pu, Shih-Fu Chang

CMAT is a multi-agent policy gradient method that frames objects as cooperative agents, and then directly maximizes a graph-level metric as the reward.

Graph Generation Scene Graph Generation +1

Cross-Resolution Person Re-identification with Deep Antithetical Learning

no code implementations 24 Oct 2018 Zijie Zhuang, Haizhou Ai, Long Chen, Chong Shang

One paradigm to deal with this problem is to use some complicated methods for mapping all images into an artificial image space, which however will disrupt the natural image distribution and requires heavy image preprocessing.

Person Re-Identification

End-to-end driving simulation via angle branched network

no code implementations 19 May 2018 Qing Wang, Long Chen, Wei Tian

Imitation learning for end-to-end autonomous driving has drawn attention from academic communities.

Autonomous Driving Imitation Learning

Self-Supervised Monocular Image Depth Learning and Confidence Estimation

no code implementations 14 Mar 2018 Long Chen, Wen Tang, Nigel John

Convolutional Neural Networks (CNNs) need large amounts of data with ground truth annotation, which is a challenging problem that has limited the development and fast deployment of CNNs for many computer vision tasks.

Depth Estimation

Context-Aware Mixed Reality: A Framework for Ubiquitous Interaction

no code implementations 14 Mar 2018 Long Chen, Wen Tang, Nigel John, Tao Ruan Wan, Jian Jun Zhang

Mixed Reality (MR) is a powerful interactive technology that yields new types of user experience.

Mixed Reality

Improved Deep Hashing with Soft Pairwise Similarity for Multi-label Image Retrieval

1 code implementation 8 Mar 2018 Zheng Zhang, Qin Zou, Yuewei Lin, Long Chen, Song Wang

In this paper, a new deep hashing method is proposed for multi-label image retrieval by re-defining the pairwise similarity into an instance similarity, where the instance similarity is quantified into a percentage based on the normalized semantic labels.

Multi-Label Image Retrieval

Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Networks

1 code implementation CVPR 2018 Long Chen, Hanwang Zhang, Jun Xiao, Wei Liu, Shih-Fu Chang

We propose a novel framework called Semantics-Preserving Adversarial Embedding Network (SP-AEN) for zero-shot visual recognition (ZSL), where test images and their classes are both unseen during training.

General Classification Zero-Shot Learning

Improving Negative Sampling for Word Representation using Self-embedded Features

no code implementations 26 Oct 2017 Long Chen, Fajie Yuan, Joemon M. Jose, Wei-Nan Zhang

Although the word-popularity based negative sampler has shown superb performance in the skip-gram model, the theoretical motivation behind oversampling popular (non-observed) words as negative samples is still not well understood.

Maximum Principle Based Algorithms for Deep Learning

1 code implementation 26 Oct 2017 Qianxiao Li, Long Chen, Cheng Tai, Weinan E

The continuous dynamical system approach to deep learning is explored in order to devise alternative frameworks for training algorithms.

Semantic Augmented Reality Environment with Material-Aware Physical Interactions

no code implementations 3 Aug 2017 Long Chen, Karl Francis, Wen Tang

In an Augmented Reality (AR) environment, realistic interactions between virtual and real objects play a crucial role in user experience.

Scene Understanding

Recent Developments and Future Challenges in Medical Mixed Reality

no code implementations 3 Aug 2017 Long Chen, Thomas Day, Wen Tang, Nigel W. John

Mixed Reality (MR) is of increasing interest within technology-driven modern medicine but is not yet used in everyday practice.

General Classification Mixed Reality

Real-time Geometry-Aware Augmented Reality in Minimally Invasive Surgery

no code implementations 3 Aug 2017 Long Chen, Wen Tang, Nigel W. John

The potential of Augmented Reality (AR) technology to assist minimally invasive surgeries (MIS) lies in its computational performance and accuracy in dealing with challenging MIS scenes.

Stereo Matching Stereo Matching Hand +1

Video Question Answering via Attribute-Augmented Attention Network Learning

no code implementations 20 Jul 2017 Yunan Ye, Zhou Zhao, Yimeng Li, Long Chen, Jun Xiao, Yueting Zhuang

Video Question Answering is a challenging problem in visual information retrieval, which provides the answer to the referenced video content according to the question.

Information Retrieval Question Answering +3

Planecell: Representing the 3D Space with Planes

no code implementations 30 Mar 2017 Lei Fan, Ziyu Pan, Long Chen, Kai Huang

Reconstruction based on the stereo camera has received considerable attention recently, but two particular challenges still remain.

Semantic Segmentation

Augmented Reality for Depth Cues in Monocular Minimally Invasive Surgery

no code implementations 1 Mar 2017 Long Chen, Wen Tang, Nigel W. John, Tao Ruan Wan, Jian Jun Zhang

In vivo laparoscopic videos used in the tests have demonstrated the robustness and accuracy of our proposed framework on both camera tracking and surface reconstruction, illustrating the potential of our algorithm for depth augmentation and depth-corrected augmented reality in MIS with monocular endoscopes.

Simultaneous Localization and Mapping Surface Reconstruction

Cascade one-vs-rest detection network for fine-grained recognition without part annotations

no code implementations 28 Feb 2017 Long Chen, Junyu Dong, Shengke Wang, Kin-Man Lam, Muwei Jian, Hua Zhang, Xiaochun Cao

To bridge this gap, we introduce a cascaded structure to eliminate background and exploit a one-vs-rest loss to capture more minute variances among different subordinate categories.

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

1 code implementation CVPR 2017 Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua

Existing visual attention models are generally spatial, i.e., the attention is modeled as spatial probabilities that re-weight the last conv-layer feature map of a CNN encoding an input image.

Image Captioning

Who Leads the Clothing Fashion: Style, Color, or Texture? A Computational Study

no code implementations 26 Aug 2016 Qin Zou, Zheng Zhang, Qian Wang, Qingquan Li, Long Chen, Song Wang

Specifically, a classification-based model is proposed to quantify the influence of different visual stimuli, in which each visual stimulus's influence is quantified by its corresponding accuracy in fashion classification.

General Classification
