Search Results for author: Bing Li

Found 46 papers, 14 papers with code

When does Further Pre-training MLM Help? An Empirical Study on Task-Oriented Dialog Pre-training

1 code implementation EMNLP (insights) 2021 Qi Zhu, Yuxian Gu, Lingxiao Luo, Bing Li, Cheng Li, Wei Peng, Minlie Huang, Xiaoyan Zhu

Further pre-training language models on in-domain data (domain-adaptive pre-training, DAPT) or task-relevant data (task-adaptive pre-training, TAPT) before fine-tuning has been shown to improve downstream tasks’ performances.

Fine-tuning

Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos

1 code implementation5 Nov 2021 Minglang Qiao, Yufan Liu, Mai Xu, Xin Deng, Bing Li, Weiming Hu, Ali Borji

In this paper, we propose a multitask learning method for visual-audio saliency prediction and sound source localization on multi-face video by leveraging visual, audio and face information.

Eye Tracking Saliency Prediction

Multimodal Semi-Supervised Learning for 3D Objects

no code implementations22 Oct 2021 Zhimin Chen, Longlong Jing, Yang Liang, YingLi Tian, Bing Li

This paper explores how the coherence of different modelities of 3D data (e. g. point cloud, image, and mesh) can be used to improve data efficiency for both 3D classification and retrieval tasks.

3D Classification

MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction

no code implementations13 Oct 2021 Linhan Zhang, Qian Chen, Wen Wang, Chong Deng, Shiliang Zhang, Bing Li, Wei Wang, Xin Cao

Keyphrases are phrases in a document providing a concise summary of core content, helping readers to understand what the article is talking about in a minute.

Document Embedding Keyphrase Extraction +1

Dimension Reduction and Data Visualization for Fréchet Regression

no code implementations1 Oct 2021 Qi Zhang, Lingzhou Xue, Bing Li

In this paper, we introduce a flexible sufficient dimension reduction (SDR) method for Fr\'echet regression to achieve two purposes: to mitigate the curse of dimensionality caused by high-dimensional predictors, and to provide a tool for data visualization for Fr\'echet regression.

Data Visualization Dimensionality Reduction

Self-Supervised Modality-Invariant and Modality-Specific Feature Learning for 3D Objects

no code implementations29 Sep 2021 Longlong Jing, Zhimin Chen, Bing Li, YingLi Tian

Our proposed novel self-supervised model learns two types of distinct features: modality-invariant features and modality-specific features.

3D Object Recognition Cross-Modal Retrieval

Advancing Self-supervised Monocular Depth Learning with Sparse LiDAR

2 code implementations20 Sep 2021 Ziyue Feng, Longlong Jing, Peng Yin, YingLi Tian, Bing Li

Unlike the existing methods that use sparse LiDAR mainly in a manner of time-consuming iterative post-processing, our model fuses monocular image features and sparse LiDAR features to predict initial depth maps.

Depth Completion Monocular 3D Object Detection +1

SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction

no code implementations18 Sep 2021 Zekun Li, Yufan Liu, Bing Li, Weiming Hu, Kebin Wu, Pei Wang

CDI builds the global attention and interaction among different levels in decoupled space which also solves the problem of heavy computation.

Sent2Span: Span Detection for PICO Extraction in the Biomedical Text without Span Annotations

1 code implementation Findings (EMNLP) 2021 Shifeng Liu, Yifang Sun, Bing Li, Wei Wang, Florence T. Bourgeois, Adam G. Dunn

The rapid growth in published clinical trials makes it difficult to maintain up-to-date systematic reviews, which requires finding all relevant trials.

PICO

Learn to Match: Automatic Matching Network Design for Visual Tracking

1 code implementation ICCV 2021 Zhipeng Zhang, Yihao Liu, Xiao Wang, Bing Li, Weiming Hu

Siamese tracking has achieved groundbreaking performance in recent years, where the essence is the efficient matching operator cross-correlation and its variants.

Visual Tracking

PSE-Match: A Viewpoint-free Place Recognition Method with Parallel Semantic Embedding

no code implementations1 Aug 2021 Peng Yin, Lingyun Xu, Ziyue Feng, Anton Egorov, Bing Li

Accurate localization on autonomous driving cars is essential for autonomy and driving safety, especially for complex urban streets and search-and-rescue subterranean environments where high-accurate GPS is not available.

Autonomous Driving

Automatic Construction of Enterprise Knowledge Base

no code implementations EMNLP (ACL) 2021 Junyi Chai, Yujie He, Homa Hashemi, Bing Li, Daraksha Parveen, Ranganath Kondapally, Wenjin Xu

In this paper, we present an automatic knowledge base construction system from large scale enterprise documents with minimal efforts of human intervention.

Document-level

SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation

no code implementations10 May 2021 Bing Li, Cheng Zheng, Silvio Giancola, Bernard Ghanem

We propose a novel scene flow estimation approach to capture and infer 3D motions from point clouds.

Scene Flow Estimation

A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking

no code implementations6 May 2021 Zhenbang Li, Yaya Shi, Jin Gao, Shaoru Wang, Bing Li, Pengpeng Liang, Weiming Hu

In this paper, we show the existence of universal perturbations that can enable the targeted attack, e. g., forcing a tracker to follow the ground-truth trajectory with specified offsets, to be video-agnostic and free from inference in a network.

Visual Tracking

PDNet: Towards Better One-stage Object Detection with Prediction Decoupling

no code implementations28 Apr 2021 Li Yang, Yan Xu, Shaoru Wang, Chunfeng Yuan, Ziqi Zhang, Bing Li, Weiming Hu

However, the most suitable positions for inferring different targets, i. e., the object category and boundaries, are generally different.

Object Detection

One More Check: Making "Fake Background" Be Tracked Again

1 code implementation19 Apr 2021 Chao Liang, Zhipeng Zhang, Xue Zhou, Bing Li, Yi Lu, Weiming Hu

The one-shot multi-object tracking, which integrates object detection and ID embedding extraction into a unified network, has achieved groundbreaking results in recent years.

Multi-Object Tracking Object Detection

Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model

1 code implementation ECCV 2020 Yufan Liu, Minglang Qiao, Mai Xu, Bing Li, Weiming Hu, Ali Borji

Inspired by the findings of our investigation, we propose a novel multi-modal video saliency model consisting of three branches: visual, audio and face.

Eye Tracking Saliency Prediction

Open-book Video Captioning with Retrieve-Copy-Generate Network

no code implementations CVPR 2021 Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Ying Deng, Weiming Hu

Due to the rapid emergence of short videos and the requirement for content understanding and creation, the video captioning task has received increasing attention in recent years.

Video Captioning

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation

3 code implementations24 Feb 2021 Bing Li, Yuanlue Zhu, Yitong Wang, Chia-Wen Lin, Bernard Ghanem, Linlin Shen

Specifically, a new generator architecture is proposed to simultaneously transfer color/texture styles and transform local facial shapes into anime-like counterparts based on the style of a reference anime-face, while preserving the global structure of the source photo-face.

Face Generation Translation

Generalized Forward Sufficient Dimension Reduction for Categorical and Ordinal Responses

no code implementations11 Feb 2021 Harris Quach, Bing Li

Like other forward regression-based sufficient dimension reduction methods, our approach avoids the relatively stringent distributional requirements necessary for inverse regression alternatives.

Dimensionality Reduction Methodology

Periodic repeating fast radio bursts: interaction between a magnetized neutron star and its planet in an eccentric orbit

no code implementations8 Feb 2021 Abudushataer Kuerban, Yong-Feng Huang, Jin-Jun Geng, Bing Li, Fan Xu, Xu Wang

The model can naturally explain the repeatability of FRBs with a period ranging from a few days to several hundred days, but it generally requires that the eccentricity of the planet orbit should be large enough.

High Energy Astrophysical Phenomena

Fine-Grained Named Entity Typing over Distantly Supervised Data via Refinement in Hyperbolic Space

no code implementations27 Jan 2021 Muhammad Asif Ali, Yifang Sun, Bing Li, Wei Wang

Fine-Grained Named Entity Typing (FG-NET) aims at classifying the entity mentions into a wide range of entity types (usually hundreds) depending upon the context.

Entity Typing

Cross-Lingual Named Entity Recognition Using Parallel Corpus: A New Approach Using XLM-RoBERTa Alignment

no code implementations26 Jan 2021 Bing Li, Yujie He, Wenjin Xu

We built an entity alignment model on top of XLM-RoBERTa to project the entities detected on the English part of the parallel data to the target language sentences, whose accuracy surpasses all previous unsupervised models.

Entity Alignment Named Entity Recognition +2

Named Entity Recognition in the Style of Object Detection

no code implementations26 Jan 2021 Bing Li

In this work, we propose a two-stage method for named entity recognition (NER), especially for nested NER.

NER Nested Named Entity Recognition +2

High Quality Disparity Remapping With Two-Stage Warping

no code implementations ICCV 2021 Bing Li, Chia-Wen Lin, Cheng Zheng, Shan Liu, Junsong Yuan, Bernard Ghanem, C.-C. Jay Kuo

In the second stage, we derive another warping model to refine warping results in less important regions by eliminating serious distortions in shape, disparity and 3D structure.

DSIC: Dynamic Sample-Individualized Connector for Multi-Scale Object Detection

no code implementations16 Nov 2020 Zekun Li, Yufan Liu, Bing Li, Weiming Hu

Furthermore, these two components are both plug-and-play and can be embedded in any backbone.

Object Detection

Towards Accurate Pixel-wise Object Tracking by Attention Retrieval

1 code implementation6 Aug 2020 Zhipeng Zhang, Bing Li, Weiming Hu, Houwen Peng

We first build a look-up-table (LUT) with the ground-truth mask in the starting frame, and then retrieves the LUT to obtain an attention map for spatial constraints.

Object Tracking

Object Relational Graph with Teacher-Recommended Learning for Video Captioning

no code implementations CVPR 2020 Ziqi Zhang, Yaya Shi, Chunfeng Yuan, Bing Li, Peijin Wang, Weiming Hu, Zheng-Jun Zha

In this paper, we propose a complete video captioning system including both a novel model and an effective training strategy.

 Ranked #1 on Video Captioning on MSR-VTT (using extra training data)

Language Modelling Video Captioning

HAMNER: Headword Amplified Multi-span Distantly Supervised Method for Domain Specific Named Entity Recognition

no code implementations3 Dec 2019 Shifeng Liu, Yifang Sun, Bing Li, Wei Wang, Xiang Zhao

To tackle Named Entity Recognition (NER) tasks, supervised methods need to obtain sufficient cleanly annotated data, which is labor and time consuming.

Boundary Detection Named Entity Recognition +1

An Overview of In-memory Processing with Emerging Non-volatile Memory for Data-intensive Applications

no code implementations15 Jun 2019 Bing Li, Bonan Yan, Hai, Li

The conventional von Neumann architecture has been revealed as a major performance and energy bottleneck for rising data-intensive applications.

Multimodal Semantic Attention Network for Video Captioning

no code implementations8 May 2019 Liang Sun, Bing Li, Chunfeng Yuan, Zheng-Jun Zha, Weiming Hu

Inspired by the fact that different modalities in videos carry complementary information, we propose a Multimodal Semantic Attention Network(MSAN), which is a new encoder-decoder framework incorporating multimodal semantic attributes for video captioning.

General Classification Multi-Label Classification +1

Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification

no code implementations ECCV 2018 Yang Du, Chunfeng Yuan, Bing Li, Lili Zhao, Yangxi Li, Weiming Hu

Furthermore, since different layers in a deep network capture feature maps of different scales, we use these feature maps to construct a spatial pyramid and then utilize multi-scale information to obtain more accurate attention scores, which are used to weight the local features in all spatial positions of feature maps to calculate attention maps.

Action Classification Classification +1

Depth-Aware Stereo Video Retargeting

no code implementations CVPR 2018 Bing Li, Chia-Wen Lin, Boxin Shi, Tiejun Huang, Wen Gao, C. -C. Jay Kuo

As compared with traditional video retargeting, stereo video retargeting poses new challenges because stereo video contains the depth information of salient objects and its time dynamics.

Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection From Videos

no code implementations CVPR 2017 Yang Du, Chunfeng Yuan, Bing Li, Weiming Hu, Stephen Maybank

In dynamic object detection, it is challenging to construct an effective model to sufficiently characterize the spatial-temporal properties of the background.

Object Detection

Linear Contour Learning: A Method for Supervised Dimension Reduction

no code implementations13 Aug 2014 Bing Li, Hongyuan Zha, Francesca Chiaromonte

We propose a novel approach to sufficient dimension reduction in regression, based on estimating contour directions of negligible variation for the response surface.

Dimensionality Reduction

Removing Mixture of Gaussian and Impulse Noise by Patch-Based Weighted Means

no code implementations11 Mar 2014 Haijuan Hu, Bing Li, Quansheng Liu

We first establish a law of large numbers and a convergence theorem in distribution to show the rate of convergence of the non-local means filter for removing Gaussian noise.

Illumination Estimation Based on Bilayer Sparse Coding

no code implementations CVPR 2013 Bing Li, Weihua Xiong, Weiming Hu, Houwen Peng

In this paper, we propose a novel bilayer sparse coding model for illumination estimation that considers image similarity in terms of both low level color distribution and high level image scene content simultaneously.

Color Constancy

Cannot find the paper you are looking for? You can Submit a new open access paper.