no code implementations • CCL 2021 • Long Chen, Junjun Guo, Yafei Zhang, Chengxiang Gao, Zhengtao Yu
“当前基于深度学习的事件检测模型都依赖足够数量的标注数据, 而标注数据的稀缺及事件类型歧义为越南语事件检测带来了极大的挑战。根据“表达相同观点但语言不同的句子通常有相同或相似的语义成分”这一多语言一致性特征, 本文提出了一种基于中文信息与越南语句法指导的越南语事件检测框架。首先通过共享编码器策略和交叉注意力网络将中文信息融入到越南语中, 然后使用图卷积网络融入越南语依存句法信息, 最后在中文事件类型指导下实现越南语事件检测。实验结果表明, 在中文信息和越南语句法的指导下越南语事件检测取得了较好的效果。”
no code implementations • 20 Mar 2023 • Xingchen Li, Long Chen, Guikun Chen, Yinfu Feng, Yi Yang, Jun Xiao
To this end, we propose a novel Decomposed Prototype Learning (DPL).
no code implementations • 17 Mar 2023 • Siyu Teng, Xuemin Hu, Peng Deng, Bai Li, Yuchen Li, Zhe XuanYuan, Dongsheng Yang, Yunfeng Ai, Lingxi Li, Long Chen, Fenghua Zhu
In terms of pipeline methods, a survey of selecting algorithms is provided along with a discussion of the expansion and optimization mechanisms, whereas in end-to-end methods, the training approaches and verification scenarios of driving tasks are points of concern.
1 code implementation • 13 Mar 2023 • Wenxiao Wang, Wei Chen, Qibo Qiu, Long Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wei Liu
On the one hand, CEL blends each token with multiple patches of different scales, providing the self-attention module itself with cross-scale features.
no code implementations • 11 Mar 2023 • Zhen Wang, Jun Xiao, Lei Chen, Fei Gao, Jian Shao, Long Chen
Due to its simplicity, our ComPro can easily be extended to more complex combined control signals by concatenating these prompts.
1 code implementation • 1 Feb 2023 • Kaifeng Gao, Long Chen, Hanwang Zhang, Jun Xiao, Qianru Sun
Without bells and whistles, our RePro achieves a new state-of-the-art performance on two VidVRD benchmarks of not only the base training object and predicate categories, but also the unseen ones.
no code implementations • 28 Dec 2022 • Yuncong Yang, Jiawei Ma, Shiyuan Huang, Long Chen, Xudong Lin, Guangxing Han, Shih-Fu Chang
For long videos, given a paragraph of description where the sentences describe different segments of the video, by matching all sentence-clip pairs, the paragraph and the full video are aligned implicitly.
no code implementations • 26 Dec 2022 • Wei Ji, Long Chen, Yinwei Wei, Yiming Wu, Tat-Seng Chua
In this work, we propose a novel multi-resolution temporal video sentence grounding network: MRTNet, which consists of a multi-modal feature encoder, a Multi-Resolution Temporal (MRT) module, and a predictor module.
no code implementations • 7 Dec 2022 • Long Chen, Piyushimita, Thakuriah, Konstantinos Ampountolas
UberNet empploys a multivariate framework that utilises a number of temporal and spatial features that have been found in the literature to explain demand for ride-hailing services.
1 code implementation • 12 Nov 2022 • Luxi Li, Qin Zou, Fan Zhang, Hongkai Yu, Long Chen, Chengfang Song, Xianfeng Huang, Xiaoguang Wang
Mural image inpainting refers to repairing the damage or missing areas in a mural image to restore the visual appearance.
no code implementations • 6 Nov 2022 • Long Chen, Jinhua Xu
Person Search aims to simultaneously localize and recognize a target person from realistic and uncropped gallery images.
no code implementations • 23 Oct 2022 • Yulei Niu, Long Chen, Chang Zhou, Hanwang Zhang
The network response serves as additional supervision to formulate the machine domain, which uses the data collected from the human domain as a transfer set.
1 code implementation • 22 Oct 2022 • Long Chen, Yulei Niu, Brian Chen, Xudong Lin, Guangxing Han, Christopher Thomas, Hammad Ayyubi, Heng Ji, Shih-Fu Chang
Specifically, given an article and a relevant video, WSAG aims to localize all ``groundable'' sentences to the video, and these sentences are possibly at different semantic scales.
1 code implementation • 7 Oct 2022 • Long Chen, Yuli Wu, Dorit Merhof
Instance segmentation aims to delineate each individual object of interest in an image.
1 code implementation • 29 Sep 2022 • Ruchi Guo, Shuhao Cao, Long Chen
A Transformer-based deep direct sampling method is proposed for electrical impedance tomography, a well-known severely ill-posed nonlinear boundary value inverse problem.
no code implementations • 7 Aug 2022 • Lin Li, Long Chen, Hanrong Shi, Wenxiao Wang, Jian Shao, Yi Yang, Jun Xiao
To this end, we propose a novel model-agnostic Label Semantic Knowledge Distillation (LS-KD) for unbiased SGG.
no code implementations • 7 Aug 2022 • Feixiang Zhou, Xinyu Yang, Fang Chen, Long Chen, Zheheng Jiang, Hui Zhu, Reiko Heckel, Haikuan Wang, Minrui Fei, Huiyu Zhou
Furthermore, we design a novel Interaction-Aware Transformer (IAT) to dynamically learn the graph-level representation of social behaviours and update the node-level representation, guided by our proposed interaction-aware self-attention mechanism.
1 code implementation • 3 Aug 2022 • Xingchen Li, Long Chen, Wenbo Ma, Yi Yang, Jun Xiao
However, we argue that most existing WSSGG works only focus on object-consistency, which means the grounded regions should have the same object category label as text entities.
no code implementations • 3 Aug 2022 • Xingchen Li, Long Chen, Jian Shao, Shaoning Xiao, Songyang Zhang, Jun Xiao
Current Scene Graph Generation (SGG) methods tend to predict frequent predicate categories and fail to recognize rare ones due to the severe imbalanced distribution of predicates.
no code implementations • 28 Jul 2022 • Liqun Huang, Long Chen, Baihai Zhang, Senchun Chai
Our architecture consists of a generator and a discriminator, which are trained in min-max game progress.
no code implementations • 27 Jul 2022 • Lin Li, Long Chen, Hanrong Shi, Hanwang Zhang, Yi Yang, Wei Liu, Jun Xiao
To this end, we propose a novel NoIsy label CorrEction and Sample Training strategy for SGG: NICEST.
1 code implementation • 22 Jul 2022 • Yangjun Mao, Long Chen, Zhihong Jiang, Dong Zhang, Zhimeng Zhang, Jian Shao, Jun Xiao
Unfortunately, reference images used by existing Ref-DIC works are easy to distinguish: these reference images only resemble the target image at scene-level and have few common objects, such that a Ref-DIC model can trivially generate distinctive captions even without considering the reference images.
1 code implementation • 21 Jul 2022 • Meng Cao, Ji Jiang, Long Chen, Yuexian Zou
Extensive experiments demonstrate that our DCNet achieves state-of-the-art performance on both video and image REC benchmarks.
1 code implementation • 20 Jul 2022 • Zhen Wang, Long Chen, Wenbo Ma, Guangxing Han, Yulei Niu, Jian Shao, Jun Xiao
Given an image and a reference caption, the image caption editing task aims to correct the misalignment errors and generate a refined caption.
1 code implementation • 19 Jul 2022 • Long Chen, Yingying Xu, Fangyi Xu, Qian Hu, Zhenzhou Tang
In addition, this work fully considers the heterogeneity of SNs (i. e. differentiated sensing range and deployment cost) and three-dimensional (3-D) deployment scenarios.
1 code implementation • 18 Jul 2022 • Long Chen, Yuhang Zheng, Jun Xiao
Unfortunately, to guarantee augmented samples have reasonable ground-truth answers, they manually design a set of heuristic rules for several question types, which extremely limits its generalization abilities.
no code implementations • 8 Jul 2022 • Long Chen, Yixiong Meng, Venkatesh Ravichandran, Andreas Stolcke
Speaker identification (SID) in the household scenario (e. g., for smart speakers) is an important but challenging problem due to limited number of labeled (enrollment) utterances, confusable voices, and demographic imbalances.
no code implementations • 14 Jun 2022 • Hammad A. Ayyubi, Christopher Thomas, Lovish Chum, Rahul Lokesh, Yulei Niu, Xudong Lin, Long Chen, Jaywon Koo, Sounak Ray, Shih-Fu Chang
Recognizing that the visual `arrest' event is a subevent of the broader `protest' event is a challenging, yet important problem that prior work has not explored.
1 code implementation • CVPR 2022 • Lin Li, Long Chen, Yifeng Huang, Zhimeng Zhang, Songyang Zhang, Jun Xiao
Then, in Pos-NSD, we use a clustering-based algorithm to divide all positive samples into multiple sets, and treat the samples in the noisiest set as noisy positive samples.
no code implementations • 29 Apr 2022 • Long Chen, Mao Ye, Alistair Milne, John Hillier, Frances Oglesby
This report, commissioned by the WTW research network, investigates the use of AI in property risk assessment.
no code implementations • 25 Apr 2022 • Shaoning Xiao, Long Chen, Kaifeng Gao, Zhao Wang, Yi Yang, Zhimeng Zhang, Jun Xiao
From the view of feature, we break down the video into trajectories and first leverage trajectory feature in VideoQA to enhance the alignment between two modalities.
no code implementations • 19 Apr 2022 • Justin Baker, Hedi Xia, Yiwei Wang, Elena Cherkaev, Akil Narayan, Long Chen, Jack Xin, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang
Learning neural ODEs often requires solving very stiff ODE systems, primarily using explicit adaptive step size ODE solvers.
1 code implementation • 17 Apr 2022 • Yuhang He, Lin Chen, Junkun Xie, Long Chen
This motivates us to conduct a "task transfer" paradigm so that 3D semantic segmentation benefits from aggregating 2D semantic cues, albeit pose noises are contained in 2D image observations.
no code implementations • 16 Apr 2022 • Guangxing Han, Long Chen, Jiawei Ma, Shiyuan Huang, Rama Chellappa, Shih-Fu Chang
Our approach is motivated by the high-level conceptual similarity of (metric-based) meta-learning and prompt-based learning to learn generalizable few-shot and zero-shot object detection models respectively without fine-tuning.
1 code implementation • CVPR 2022 • Guangxing Han, Jiawei Ma, Shiyuan Huang, Long Chen, Shih-Fu Chang
Inspired by the recent work on vision transformers and vision-language transformers, we propose a novel Fully Cross-Transformer based model (FCT) for FSOD by incorporating cross-transformer into both the feature backbone and detection head.
no code implementations • 13 Mar 2022 • Han Li, Long Chen, Hu Han, S. Kevin Zhou
Universal Lesion Detection (ULD) in computed tomography plays an essential role in computer-aided diagnosis.
no code implementations • 10 Mar 2022 • Xiaohan Lan, Yitian Yuan, Xin Wang, Long Chen, Zhi Wang, Lin Ma, Wenwu Zhu
New benchmarking results indicate that our proposed evaluation protocols can better monitor the research progress.
no code implementations • 24 Feb 2022 • Kishan K C, Zhenning Tan, Long Chen, Minho Jin, Eunjung Han, Andreas Stolcke, Chul Lee
Household speaker identification with few enrollment utterances is an important yet challenging problem, especially when household members share similar voice characteristics and room acoustics.
no code implementations • CVPR 2022 • Yuchen Li, Zixuan Li, Siyu Teng, Yu Zhang, YuHang Zhou, Yuchang Zhu, Dongpu Cao, Bin Tian, Yunfeng Ai, Zhe XuanYuan, Long Chen
The main contributions of the AutoMine dataset are as follows: 1. The first autonomous driving dataset for perception and localization in mine scenarios.
1 code implementation • 10 Dec 2021 • Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Tat-Seng Chua
Since each verb is associated with a specific set of semantic roles, all existing GSR methods resort to a two-stage framework: predicting the verb in the first stage and detecting the semantic roles in the second stage.
Ranked #2 on
Situation Recognition
on imSitu
1 code implementation • CVPR 2022 • Kaifeng Gao, Long Chen, Yulei Niu, Jian Shao, Jun Xiao
To this end, we propose a new classification-then-grounding framework for VidSGG, which can avoid all the three overlooked drawbacks.
no code implementations • 9 Nov 2021 • Fengda Zhang, Kun Kuang, Yuxuan Liu, Long Chen, Chao Wu, Fei Wu, Jiaxun Lu, Yunfeng Shao, Jun Xiao
We validate the advantages of the FMDA-M algorithm with various kinds of distribution shift settings in experiments, and the results show that FMDA-M algorithm outperforms the existing fair FL algorithms on unified group fairness.
no code implementations • 13 Oct 2021 • Long Chen, Matthias Daub, Hans-Georg Luigs, Marcus Jansen, Martin Strauch, Dorit Merhof
The beet cyst nematode (BCN) Heterodera schachtii is a plant pest responsible for crop loss on a global scale.
1 code implementation • 3 Oct 2021 • Long Chen, Yuhang Zheng, Yulei Niu, Hanwang Zhang, Jun Xiao
Specifically, CSST is composed of two parts: Counterfactual Samples Synthesizing (CSS) and Counterfactual Samples Training (CST).
1 code implementation • EMNLP 2021 • Shaoning Xiao, Long Chen, Jian Shao, Yueting Zhuang, Jun Xiao
Given an untrimmed video and a natural language query, Natural Language Video Localization (NLVL) aims to identify the video moment described by the query.
no code implementations • 15 Sep 2021 • Kaige Wang, Long Chen, Tianming Wang, Qixiang Meng, Huatao Jiang, Lin Chang
Perception plays an important role in reliable decision-making for autonomous vehicles.
no code implementations • EMNLP 2021 • Meng Cao, Long Chen, Mike Zheng Shou, Can Zhang, Yuexian Zou
Almost all existing video grounding methods fall into two frameworks: 1) Top-down model: It predefines a set of segment candidates and then conducts segment classification and regression.
no code implementations • 3 Sep 2021 • Jiahui Li, Kun Kuang, Lin Li, Long Chen, Songyang Zhang, Jian Shao, Jun Xiao
Deep neural networks have demonstrated remarkable performance in many data-driven and prediction-oriented applications, and sometimes even perform better than humans.
1 code implementation • 19 Aug 2021 • Kaifeng Gao, Long Chen, Yifeng Huang, Jun Xiao
Video Visual Relation Detection (VidVRD), has received significant attention of our community over recent years.
no code implementations • 12 Aug 2021 • Meng Cao, Can Zhang, Long Chen, Mike Zheng Shou, Yuexian Zou
In this paper, we analyze that the motion cues behind the optical flow features are complementary informative.
Optical Flow Estimation
Weakly-supervised Temporal Action Localization
+1
no code implementations • NeurIPS 2021 • Tan M. Nguyen, Vai Suliafu, Stanley J. Osher, Long Chen, Bao Wang
For instance, FMMformers achieve an average classification accuracy of $60. 74\%$ over the five Long Range Arena tasks, which is significantly better than the standard transformer's average accuracy of $58. 70\%$.
2 code implementations • ICLR 2022 • Wenxiao Wang, Lu Yao, Long Chen, Binbin Lin, Deng Cai, Xiaofei He, Wei Liu
On the one hand, CEL blends each embedding with multiple patches of different scales, providing the self-attention module itself with cross-scale features.
Ranked #40 on
Semantic Segmentation
on ADE20K val
no code implementations • 15 Jun 2021 • Long Chen, Venkatesh Ravichandran, Andreas Stolcke
We show in experiments on the VoxCeleb dataset that this approach makes effective use of unlabeled data and improves speaker identification accuracy compared to two state-of-the-art scoring methods as well as their semi-supervised variants based on pseudo-labels.
no code implementations • 1 Jun 2021 • Jiahui Li, Kun Kuang, Baoxiang Wang, Furui Liu, Long Chen, Fei Wu, Jun Xiao
Specifically, Shapley Value and its desired properties are leveraged in deep MARL to credit any combinations of agents, which grants us the capability to estimate the individual credit for each agent.
Multi-agent Reinforcement Learning
reinforcement-learning
+3
no code implementations • 31 May 2021 • Fuxiang Tan, YuTing Kong, Yingying Fan, Feng Liu, Daxin Zhou, Hao Zhang, Long Chen, Liang Gao, Yurong Qian
The former implements the basic rain pattern feature extraction, while the latter fuses different features to further extract and process the image features.
no code implementations • 26 May 2021 • Long Chen, Lukas Platinsky, Stefanie Speichert, Blazej Osinski, Oliver Scheel, Yawei Ye, Hugo Grimmett, Luca Del Pero, Peter Ondruska
If cheaper sensors could be used for collection instead, data availability would go up, which is crucial in a field where data volume requirements are large and availability is small.
no code implementations • 26 May 2021 • Luca Bergamini, Yawei Ye, Oliver Scheel, Long Chen, Chih Hu, Luca Del Pero, Blazej Osinski, Hugo Grimmett, Peter Ondruska
We train our system directly from 1, 000 hours of driving logs and measure both realism, reactivity of the simulation as the two key properties of the simulation.
no code implementations • 26 May 2021 • Feifei Shao, Long Chen, Jian Shao, Wei Ji, Shaoning Xiao, Lu Ye, Yueting Zhuang, Jun Xiao
With the success of deep neural networks in object detection, both WSOD and WSOL have received unprecedented attention.
no code implementations • 12 May 2021 • Chenchi Zhang, Wenbo Ma, Jun Xiao, Hanwang Zhang, Jian Shao, Yueting Zhuang, Long Chen
In this paper, we argue that these methods overlook an obvious \emph{mismatch} between the roles of proposals in the two stages: they generate proposals solely based on the detection confidence (i. e., query-agnostic), hoping that the proposals contain all instances mentioned in the text query (i. e., query-aware).
no code implementations • 3 May 2021 • YuHan Liu, Yuhan Gao, Zhifan Nan, Long Chen
During the COVID-19 pandemic, people started to discuss about pandemic-related topics on social media.
no code implementations • 23 Mar 2021 • Han Li, Long Chen, Hu Han, S. Kevin Zhou
Universal Lesion Detection (ULD) in computed tomography plays an essential role in computer-aided diagnosis.
Ranked #3 on
Medical Object Detection
on DeepLesion
1 code implementation • CVPR 2021 • Long Chen, Zhihong Jiang, Jun Xiao, Wei Liu
However, we argue that almost all existing objective control signals have overlooked two indispensable characteristics of an ideal control signal: 1) Event-compatible: all visual contents referred to in a single sentence should be compatible with the described activity.
no code implementations • 15 Mar 2021 • Shaoning Xiao, Long Chen, Songyang Zhang, Wei Ji, Jian Shao, Lu Ye, Jun Xiao
State-of-the-art NLVL methods are almost in one-stage fashion, which can be typically grouped into two categories: 1) anchor-based approach: it first pre-defines a series of video segment candidates (e. g., by sliding window), and then does classification for each candidate; 2) anchor-free approach: it directly predicts the probabilities for each video frame as a boundary or intermediate frame inside the positive segment.
no code implementations • 22 Jan 2021 • Yitian Yuan, Xiaohan Lan, Xin Wang, Long Chen, Zhi Wang, Wenwu Zhu
All the results demonstrate that the re-organized dataset splits and new metric can better monitor the progress in TSGV.
no code implementations • 20 Jan 2021 • Long Chen, Junyu Dong, Huiyu Zhou
CWSA is a new kind of data augmentation technique which augments the training data for the minority classes by generating various colors, textures and contrasts for the minority classes.
no code implementations • 20 Jan 2021 • Werner Bernreuther, Long Chen, Otto Nachtmann
We reconsider the issue of the search for a nonzero electric dipole form factor (EDM) $d_\tau(s)$ using optimal observables in $\tau^+\tau^-$ production by $e^+ e^-$ collisions in the center-of-mass energy range from the $\tau$-pair threshold to about $\sqrt{s} \sim 15$ GeV.
High Energy Physics - Phenomenology High Energy Physics - Experiment
1 code implementation • NeurIPS 2020 • Long Chen, Yuan YAO, Feng Xu, Miao Xu, Hanghang Tong
Collaborative filtering has been widely used in recommender systems.
1 code implementation • 1 Dec 2020 • Feixiang Zhou, Zheheng Jiang, Zhihua Liu, Fang Chen, Long Chen, Lei Tong, Zhile Yang, Haikuan Wang, Minrui Fei, Ling Li, Huiyu Zhou
However, quantifying mouse behaviours from videos or images remains a challenging problem, where pose estimation plays an important role in describing mouse behaviours.
no code implementations • 24 Nov 2020 • Long Chen, Gudrun Heinrich, Stephen P. Jones, Matthias Kerner, Jonas Klappert, Johannes Schlenk
We present results for the two-loop helicity amplitudes entering the NLO QCD corrections to the production of a Higgs boson in association with a $Z$-boson in gluon fusion.
High Energy Physics - Phenomenology
1 code implementation • 13 Nov 2020 • Xuehui Wang, Qing Wang, Yuzhi Zhao, Junchi Yan, Lei Fan, Long Chen
In this paper, we develop a computation efficient yet accurate network based on the proposed attentive auxiliary features (A$^2$F) for SISR.
1 code implementation • 2 Nov 2020 • Guojun Wang, Bin Tian, Yachen Zhang, Long Chen, Dongpu Cao, Jian Wu
3D object detection based on LiDAR-camera fusion is becoming an emerging research theme for autonomous driving.
1 code implementation • 19 Oct 2020 • Long Chen, Feixiang Zhou, Shengke Wang, Junyu Dong, Ning li, Haiping Ma, Xin Wang, Huiyu Zhou
Moreover, inspired by the human education process that drives the learning from easy to hard concepts, we here propose the CMA training paradigm that first trains a clean detector which is free from the influence of noisy data.
no code implementations • 10 Oct 2020 • Wenxiao Wang, Minghao Chen, Shuai Zhao, Long Chen, Jinming Hu, Haifeng Liu, Deng Cai, Xiaofei He, Wei Liu
Specifically, it first casts the relationships between a certain model's accuracy and depth/width/resolution into a polynomial regression and then maximizes the polynomial to acquire the optimal values for the three dimensions.
3 code implementations • 15 Sep 2020 • Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Xiaochuan Li, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Abdul Muqeet, Jiwon Hwang, Subin Yang, JungHeum Kang, Sung-Ho Bae, Yongwoo Kim, Geun-Woo Jeon, Jun-Ho Choi, Jun-Hyuk Kim, Jong-Seok Lee, Steven Marty, Eric Marty, Dongliang Xiong, Siang Chen, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Haicheng Wang, Vineeth Bhaskara, Alex Levinshtein, Stavros Tsogkas, Allan Jepson, Xiangzhen Kong, Tongtong Zhao, Shanshan Zhao, Hrishikesh P. S, Densen Puthussery, Jiji C. V, Nan Nan, Shuai Liu, Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho, Xuehui Wang, Qiong Yan, Yuzhi Zhao, Long Chen, Jiangtao Zhang, Xiaotong Luo, Liang Chen, Yanyun Qu, Long Sun, Wenhao Wang, Zhenbing Liu, Rushi Lan, Rao Muhammad Umer, Christian Micheloni
This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results.
1 code implementation • 3 Sep 2020 • Long Chen, Wenbo Ma, Jun Xiao, Hanwang Zhang, Shih-Fu Chang
The prevailing framework for solving referring expression grounding is based on a two-stage process: 1) detecting proposals with an object detector and 2) grounding the referent to one of the proposals.
no code implementations • 21 Aug 2020 • Long Chen, Zheheng Jiang, Lei Tong, Zhihua Liu, Aite Zhao, Qianni Zhang, Junyu Dong, Huiyu Zhou
Underwater image enhancement, as a pre-processing step to improve the accuracy of the following object detection task, has drawn considerable attention in the field of underwater navigation and ocean exploration.
no code implementations • 26 Jul 2020 • Teng Liu, Yang Xing, Long Chen, Dongpu Cao, Fei-Yue Wang
The objectives of the three virtual digital vehicles are interacting, guiding, simulating and improving with the real vehicles.
no code implementations • 21 Jul 2020 • Teng Liu, Xing Yang, Hong Wang, Xiaolin Tang, Long Chen, Huilong Yu, Fei-Yue Wang
The three virtual vehicles (descriptive, predictive, and prescriptive) dynamically interact with the real one in order to enhance the safety and performance of the real vehicle.
1 code implementation • 18 Jul 2020 • Zhihua Liu, Lei Tong, Zheheng Jiang, Long Chen, Feixiang Zhou, Qianni Zhang, Xiangrong Zhang, Yaochu Jin, Huiyu Zhou
Brain tumor segmentation is one of the most challenging problems in medical image analysis.
no code implementations • 16 Jul 2020 • Teng Liu, Bin Tian, Yunfeng Ai, Long Chen, Fei Liu, Dongpu Cao
As a combination of various kinds of technologies, autonomous vehicles could complete a series of driving tasks by itself, such as perception, decision-making, planning, and control.
1 code implementation • 15 Jul 2020 • Zhihua Liu, Lei Tong, Long Chen, Feixiang Zhou, Zheheng Jiang, Qianni Zhang, Yinhai Wang, Caifeng Shan, Ling Li, Huiyu Zhou
Automated segmentation of brain glioma plays an active role in diagnosis decision, progression monitoring and surgery planning.
no code implementations • 13 Jul 2020 • Yuli Wu, Long Chen, Dorit Merhof
A distance regression module is incorporated into our architecture to generate seeds for fast clustering.
2 code implementations • 13 Jul 2020 • Guojun Wang, Jian Wu, Bin Tian, Siyu Teng, Long Chen, Dongpu Cao
However, because inherent sparsity of point clouds, 3D object center points are likely to be in empty space which makes it difficult to estimate accurate boundaries.
no code implementations • 4 Jul 2020 • Yiwen Guo, Long Chen, Yurong Chen, Chang-Shui Zhang
This paper analyzes regularization terms proposed recently for improving the adversarial robustness of deep neural networks (DNNs), from a theoretical point of view.
no code implementations • 29 Jun 2020 • Long Chen, Lei Tong, Feixiang Zhou, Zheheng Jiang, Zhenyang Li, Jialin Lv, Junyu Dong, Huiyu Zhou
To investigate how the underwater image enhancement methods influence the following underwater object detection tasks, in this paper, we provide a large-scale underwater object detection dataset with both bounding box annotations and high quality reference images, namely OUC dataset.
3 code implementations • 25 Jun 2020 • John Houston, Guido Zuidhof, Luca Bergamini, Yawei Ye, Long Chen, Ashesh Jain, Sammy Omari, Vladimir Iglovikov, Peter Ondruska
Motivated by the impact of large-scale datasets on ML systems we present the largest self-driving dataset for motion prediction to date, containing over 1, 000 hours of data.
1 code implementation • 26 May 2020 • Xingchen Li, Xiang Wang, Xiangnan He, Long Chen, Jun Xiao, Tat-Seng Chua
Fashion outfit recommendation has attracted increasing attentions from online shopping services and fashion communities. Distinct from other scenarios (e. g., social networking or content sharing) which recommend a single item (e. g., a friend or picture) to a user, outfit recommendation predicts user preference on a set of well-matched fashion items. Hence, performing high-quality personalized outfit recommendation should satisfy two requirements -- 1) the nice compatibility of fashion items and 2) the consistence with user preference.
1 code implementation • 23 May 2020 • Long Chen, Zhihua Liu, Lei Tong, Zheheng Jiang, Shengke Wang, Junyu Dong, Huiyu Zhou
In addition, we propose a novel sample-weighted loss function which can model sample weights for SWIPENet, which uses a novel sample re-weighting algorithm, namely Invert Multi-Class Adaboost (IMA), to reduce the influence of noise on the proposed SWIPENet.
no code implementations • 21 Apr 2020 • Long Chen, Martin Strauch, Matthias Daub, Xiaochen Jiang, Marcus Jansen, Hans-Georg Luigs, Susanne Schultz-Kuhlmann, Stefan Krüssel, Dorif Merhof
The endpoints serve to untangle the skeletons from which segmentation masks are reconstructed by estimating the body width at each location along the skeleton.
no code implementations • 21 Apr 2020 • Long Chen, Hanjia Lyu, Tongyu Yang, Yu Wang, Jiebo Luo
To model the substantive difference of tweets with controversial terms and those with non-controversial terms, we apply topic modeling and LIWC-based sentiment analysis.
2 code implementations • 21 Apr 2020 • Long Chen, Martin Strauch, Dorit Merhof
The network is trained to output embedding vectors of similar directions for pixels from the same object, while adjacent objects are orthogonal in the embedding space, which effectively avoids the fusion of objects in a crowd.
1 code implementation • 21 Apr 2020 • Long Chen, Dorit Merhof
Automated brain structure segmentation is important to many clinical quantitative analysis and diagnoses.
no code implementations • 10 Apr 2020 • Yaodong Cui, Ren Chen, Wenbo Chu, Long Chen, Daxin Tian, Ying Li, Dongpu Cao
Autonomous vehicles were experiencing rapid development in the past few years.
no code implementations • 7 Apr 2020 • Haitian Zeng, Haizhou Ai, Zijie Zhuang, Long Chen
In this paper, we propose a novel Co-Attentive Sharing (CAS) module which extracts discriminative channels and spatial regions for more effective feature sharing in multi-task learning.
no code implementations • 7 Apr 2020 • You Li, Yuan Zhuang, Xin Hu, Zhouzheng Gao, Jia Hu, Long Chen, Zhe He, Ling Pei, Kejie Chen, Maosong Wang, Xiaoji Niu, Ruizhi Chen, John Thompson, Fadhel Ghannouchi, Naser El-Sheimy
Compared to the related surveys, this paper has a more comprehensive and state-of-the-art review on IoT localization methods, an original review on IoT localization error sources and mitigation, an original review on IoT localization performance evaluation, and a more comprehensive review of IoT localization applications, opportunities, and challenges.
Networking and Internet Architecture Signal Processing
1 code implementation • ACL 2020 • Nuo Xu, Pinghui Wang, Long Chen, Li Pan, Xiaoyan Wang, Junzhou Zhao
Legal Judgment Prediction (LJP) is the task of automatically predicting a law case's judgment results given a text describing its facts, which has excellent prospects in judicial assistance systems and convenient services for the public.
2 code implementations • CVPR 2020 • Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, ShiLiang Pu, Yueting Zhuang
To reduce the language biases, several recent works introduce an auxiliary question-only model to regularize the training of targeted VQA model, and achieve dominating performance on VQA-CP.
Ranked #1 on
Visual Question Answering (VQA)
on VQA-CP
(using extra training data)
1 code implementation • CVPR 2020 • Long Chen, Haizhou Ai, Rui Chen, Zijie Zhuang, Shuang Liu
To further verify the scalability of our method, we propose a new large-scale multi-human dataset with 12 to 28 camera views.
Ranked #8 on
3D Multi-Person Pose Estimation
on Campus
1 code implementation • 17 Nov 2019 • Qin Zou, Zheng Zhang, Ling Cao, Long Chen, Song Wang
Given semantic annotations such as class labels and pairwise similarities of the training data, hashing methods can learn and generate effective and compact binary codes.
no code implementations • IJCNLP 2019 • Chujie Lu, Long Chen, Chilie Tan, Xiaolin Li, Jun Xiao
In this paper, we focus on natural language video localization: localizing (ie, grounding) a natural language description in a long and untrimmed video sequence.
no code implementations • 20 Sep 2019 • Rui Chen, Haizhou Ai, Chong Shang, Long Chen, Zijie Zhuang
It remains very challenging to build a pedestrian detection system for real world applications, which demand for both accuracy and speed.
no code implementations • 9 Sep 2019 • Yucai Bai, Qin Zou, Xieyuanli Chen, Lingxi Li, Zhengming Ding, Long Chen
Given the fact that one same activity may be represented by videos in both high resolution (HR) and extreme low resolution (eLR), it is worth studying to utilize the relevant HR data to improve the eLR activity recognition.
no code implementations • ACL 2019 • Wei Ye, Bo Li, Rui Xie, Zhonghao Sheng, Long Chen, Shikun Zhang
In practical scenario, relation extraction needs to first identify entity pairs that have relation and then assign a correct relation class.
no code implementations • 6 Jun 2019 • Zheheng Jiang, Zhihua Liu, Long Chen, Lei Tong, Xiangrong Zhang, Xiangyuan Lan, Danny Crookes, Ming-Hsuan Yang, Huiyu Zhou
The study of mouse social behaviours has been increasingly undertaken in neuroscience research.
1 code implementation • 2 Jun 2019 • Lei Tong, Zhihua Liu, Zheheng Jiang, Feixiang Zhou, Long Chen, Jialin Lyu, Xiangrong Zhang, Qianni Zhang, Abdul Sadka Senior, Yinhai Wang, Ling Li, Huiyu Zhou
Depression is one of the most common mental health disorders, and a large number of depressed people commit suicide each year.
2 code implementations • 23 May 2019 • Nuo Xu, Pinghui Wang, Long Chen, Jing Tao, Junzhou Zhao
To resolve these problems, we present MR-GNN, an end-to-end graph neural network with the following features: i) it uses a multi-resolution based architecture to extract node features from different neighborhoods of each node, and, ii) it uses dual graph-state long short-term memory networks (L-STMs) to summarize local features of each graph and extracts the interaction features between pairwise graphs.
no code implementations • 1 Apr 2019 • Long Chen
The usage of these D-dimensional polarized amplitude projectors results in helicity amplitudes that can be expressed solely in terms of external momenta, but different from those defined in the existing dimensional regularization schemes.
High Energy Physics - Phenomenology High Energy Physics - Theory
2 code implementations • 6 Mar 2019 • Qin Zou, Hanwen Jiang, Qiyu Dai, Yuanhao Yue, Long Chen, Qian Wang
Specifically, information of each frame is abstracted by a CNN block, and the CNN features of multiple continuous frames, holding the property of time-series, are then fed into the RNN block for feature learning and lane prediction.
no code implementations • 17 Jan 2019 • Yucai Bai, Lei Fan, Ziyu Pan, Long Chen
First, with the correlation of underlying information between depth and semantic prediction, a novel multi-task Convolutional Neural Network (CNN) is designed for joint prediction.
no code implementations • ICCV 2019 • Long Chen, Hanwang Zhang, Jun Xiao, Xiangnan He, ShiLiang Pu, Shih-Fu Chang
CMAT is a multi-agent policy gradient method that frames objects as cooperative agents, and then directly maximizes a graph-level metric as the reward.
no code implementations • 24 Oct 2018 • Zijie Zhuang, Haizhou Ai, Long Chen, Chong Shang
One paradigm to deal with this problem is to use some complicated methods for mapping all images into an artificial image space, which however will disrupt the natural image distribution and requires heavy image preprocessing.
3 code implementations • 12 Sep 2018 • Long Chen, Haizhou Ai, Zijie Zhuang, Chong Shang
Online multi-object tracking is a fundamental problem in time-critical video analysis applications.
Ranked #4 on
Online Multi-Object Tracking
on MOT16
Large-Scale Person Re-Identification
Multi-Object Tracking
+2
no code implementations • 19 May 2018 • Qing Wang, Long Chen, Wei Tian
Imitation learning for end-to-end autonomous driving has drawn attention from academic communities.
no code implementations • 19 Apr 2018 • Haixin Wang, Xingzhang Ren, Jinan Sun, Wei Ye, Long Chen, Muzhi Yu, Shikun Zhang
Specically, we propose to measure the quality of each leaf node of every decision tree in the random forest to determine hard examples.
no code implementations • 14 Mar 2018 • Long Chen, Wen Tang, Nigel John
Convolutional Neural Networks (CNNs) need large amounts of data with ground truth annotation, which is a challenging problem that has limited the development and fast deployment of CNNs for many computer vision tasks.
no code implementations • 14 Mar 2018 • Long Chen, Wen Tang, Nigel John, Tao Ruan Wan, Jian Jun Zhang
Mixed Reality (MR) is a powerful interactive technology that yields new types of user experience.
1 code implementation • 8 Mar 2018 • Zheng Zhang, Qin Zou, Yuewei Lin, Long Chen, Song Wang
In this paper, a new deep hashing method is proposed for multi-label image retrieval by re-defining the pairwise similarity into an instance similarity, where the instance similarity is quantified into a percentage based on the normalized semantic labels.
1 code implementation • CVPR 2018 • Long Chen, Hanwang Zhang, Jun Xiao, Wei Liu, Shih-Fu Chang
We propose a novel framework called Semantics-Preserving Adversarial Embedding Network (SP-AEN) for zero-shot visual recognition (ZSL), where test images and their classes are both unseen during training.
2 code implementations • 26 Oct 2017 • Qianxiao Li, Long Chen, Cheng Tai, Weinan E
The continuous dynamical system approach to deep learning is explored in order to devise alternative frameworks for training algorithms.
no code implementations • 26 Oct 2017 • Long Chen, Fajie Yuan, Joemon M. Jose, Wei-Nan Zhang
Although the word-popularity based negative sampler has shown superb performance in the skip-gram model, the theoretical motivation behind oversampling popular (non-observed) words as negative samples is still not well understood.
no code implementations • 3 Aug 2017 • Long Chen, Karl Francis, Wen Tang
In Augmented Reality (AR) environment, realistic interactions between the virtual and real objects play a crucial role in user experience.
no code implementations • 3 Aug 2017 • Long Chen, Thomas Day, Wen Tang, Nigel W. John
Mixed Reality (MR) is of increasing interest within technology-driven modern medicine but is not yet used in everyday practice.
no code implementations • 3 Aug 2017 • Long Chen, Wen Tang, Nigel W. John
The potential of Augmented Reality (AR) technology to assist minimally invasive surgeries (MIS) lies in its computational performance and accuracy in dealing with challenging MIS scenes.
no code implementations • 20 Jul 2017 • Yunan Ye, Zhou Zhao, Yimeng Li, Long Chen, Jun Xiao, Yueting Zhuang
Video Question Answering is a challenging problem in visual information retrieval, which provides the answer to the referenced video content according to the question.
no code implementations • 30 Mar 2017 • Lei Fan, Ziyu Pan, Long Chen, Kai Huang
Reconstruction based on the stereo camera has received considerable attention recently, but two particular challenges still remain.
no code implementations • 1 Mar 2017 • Long Chen, Wen Tang, Nigel W. John, Tao Ruan Wan, Jian Jun Zhang
In vivo laparoscopic videos used in the tests have demonstrated the robustness and accuracy of our proposed framework on both camera tracking and surface reconstruction, illustrating the potential of our algorithm for depth augmentation and depth-corrected augmented reality in MIS with monocular endoscopes.
Simultaneous Localization and Mapping
Surface Reconstruction
no code implementations • 28 Feb 2017 • Long Chen, Junyu Dong, Shengke Wang, Kin-Man Lam, Muwei Jian, Hua Zhang, Xiaochun Cao
To bridge this gap, we introduce a cascaded structure to eliminate background and exploit a one-vs-rest loss to capture more minute variances among different subordinate categories.
2 code implementations • CVPR 2017 • Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua
Existing visual attention models are generally spatial, i. e., the attention is modeled as spatial probabilities that re-weight the last conv-layer feature map of a CNN encoding an input image.
no code implementations • 26 Aug 2016 • Qin Zou, Zheng Zhang, Qian Wang, Qingquan Li, Long Chen, Song Wang
Specifically, a classification-based model is proposed to quantify the influence of different visual stimuli, in which each visual stimulus's influence is quantified by its corresponding accuracy in fashion classification.