Search Results for author: Dong Liu

Found 113 papers, 47 papers with code

Photon-Efficient 3D Imaging with A Non-Local Neural Network

1 code implementation • ECCV 2020 • Jiayong Peng, Zhiwei Xiong, Xin Huang, Zheng-Ping Li, Dong Liu, Feihu Xu

Photon-efficient imaging has enabled a number of applications relying on single-photon sensors that can capture a 3D image with as few as one photon per pixel.

Learning Trailer Moments in Full-Length Movies with Co-Contrastive Attention

no code implementations • ECCV 2020 • Lezi Wang, Dong Liu, Rohit Puri, Dimitris N. Metaxas

We introduce a novel ranking network that utilizes the Co-Attention between movies and trailers as guidance to generate the training pairs, where the moments highly correlated with trailers are expected to be scored higher than the uncorrelated moments.

HiLo: Detailed and Robust 3D Clothed Human Reconstruction with High-and Low-Frequency Information of Parametric Models

2 code implementations • 7 Apr 2024 • Yifan Yang, Dong Liu, Shuhai Zhang, Zeshuai Deng, Zixiong Huang, Mingkui Tan

We empirically find that the high-frequency (HF) and low-frequency (LF) information from a parametric model has the potential to enhance geometry details and improve robustness to noise, respectively.

Virtual Try-on

104

KunquDB: An Attempt for Speaker Verification in the Chinese Opera Scenario

no code implementations • 20 Mar 2024 • Huali Zhou, Yuke Lin, Dong Liu, Ming Li

This work aims to promote Chinese opera research in both musical and speech domains, with a primary focus on overcoming the data limitations.

Domain Adaptation Speaker Verification

Object Segmentation-Assisted Inter Prediction for Versatile Video Coding

no code implementations • 18 Mar 2024 • Zhuoyuan Li, Zikun Yuan, Li Li, Dong Liu, Xiaohu Tang, Feng Wu

Moreover, the segmentation mask is considered in the joint rate-distortion optimization for motion estimation and partition estimation to derive the motion vectors of different regions and the partition more accurately.

Motion Compensation Motion Estimation +3

MEDPNet: Achieving High-Precision Adaptive Registration for Complex Die Castings

no code implementations • 15 Mar 2024 • Yu Du, Yu Song, Ce Guo, Xiaojing Tian, Dong Liu, Ming Cong

Due to their complex spatial structure and diverse geometric features, achieving high-precision and robust point cloud registration for complex Die Castings has been a significant challenge in the die-casting industry.

Computational Efficiency Point Cloud Registration

Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding

no code implementations • 9 Mar 2024 • Cunhui Dong, Haichuan Ma, Haotian Zhang, Changsheng Gao, Li Li, Dong Liu

Neural network-based image coding has been developing rapidly since its birth.

Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression

no code implementations • 29 Jan 2024 • Xihua Sheng, Li Li, Dong Liu, Houqiang Li

With the SDD-based motion model and long short-term temporal contexts fusion, our proposed learned video codec can obtain more accurate inter prediction.

Motion Estimation MS-SSIM +2

Language-Conditioned Robotic Manipulation with Fast and Slow Thinking

no code implementations • 8 Jan 2024 • Minjie Zhu, Yichen Zhu, Jinming Li, Junjie Wen, Zhiyuan Xu, Zhengping Che, Chaomin Shen, Yaxin Peng, Dong Liu, Feifei Feng, Jian Tang

The language-conditioned robotic manipulation aims to transfer natural language instructions into executable actions, from simple pick-and-place to tasks requiring intent recognition and visual reasoning.

Decision Making Intent Recognition +2

Object-Centric Instruction Augmentation for Robotic Manipulation

no code implementations • 5 Jan 2024 • Junjie Wen, Yichen Zhu, Minjie Zhu, Jinming Li, Zhiyuan Xu, Zhengping Che, Chaomin Shen, Yaxin Peng, Dong Liu, Feifei Feng, Jian Tang

Humans interpret scenes by recognizing both the identities and positions of objects in their observations.

Language Modelling Large Language Model +1

Enhancing CT Image synthesis from multi-modal MRI data based on a multi-task neural network framework

no code implementations • 13 Dec 2023 • Zhuoyao Xin, Christopher Wu, Dong Liu, Chunming Gu, Jia Guo, Jun Hua

Image segmentation, real-value prediction, and cross-modal translation are critical challenges in medical imaging.

Image Generation Image Segmentation +3

Towards Open-World Co-Salient Object Detection with Generative Uncertainty-aware Group Selective Exchange-Masking

1 code implementation • 16 Oct 2023 • Yang Wu, Shenglong Hu, Huihui Song, Kaihua Zhang, Bo Liu, Dong Liu

To simultaneously consider the uncertainty introduced by irrelevant images and the consensus features of the remaining relevant images in the group, we designed a latent variable generator branch and CoSOD transformer branch.

Co-Salient Object Detection object-detection +1

On Uniform Scalar Quantization for Learned Image Compression

no code implementations • 29 Sep 2023 • Haotian Zhang, Li Li, Dong Liu

In principle, we find two factors crucial: one is the discrepancy between the surrogate and rounding, leading to train-test mismatch; the other is gradient estimation risk due to the surrogate, which consists of bias and variance of the gradient estimation.

Image Compression Quantization

Learning Fine-Grained Features for Pixel-wise Video Correspondences

1 code implementation • ICCV 2023 • Rui Li, Shenglong Zhou, Dong Liu

We address the problem of learning features for establishing pixel-wise correspondences.

Computational Efficiency

DTF-Net: Category-Level Pose Estimation and Shape Reconstruction via Deformable Template Field

no code implementations • 4 Aug 2023 • Haowen Wang, Zhipeng Fan, Zhen Zhao, Zhengping Che, Zhiyuan Xu, Dong Liu, Feifei Feng, Yakun Huang, XIUQUAN QIAO, Jian Tang

We introduce a pose regression module that shares the deformation features and template codes from the fields to estimate the accurate 6D pose of each object in the scene.

Object Pose Estimation

On the Effectiveness of Spectral Discriminators for Perceptual Quality Improvement

1 code implementation • ICCV 2023 • Xin Luo, Yunan Zhu, Shunxin Xu, Dong Liu

We tackle this issue by examining the spectral discriminators in the context of perceptual image super-resolution (i.e., GAN-based SR), as SR image quality is susceptible to spectral changes.

Image Super-Resolution No-Reference Image Quality Assessment

Offline and Online Optical Flow Enhancement for Deep Video Compression

no code implementations • 11 Jul 2023 • Chuanbo Tang, Xihua Sheng, Zhuoyuan Li, Haotian Zhang, Li Li, Dong Liu

In the offline stage, we fine-tune a trained optical flow estimation network with the motion information provided by a traditional (non-deep) video compression scheme, e.g., H.266/VVC, as we believe the motion information of H.266/VVC achieves a better rate-distortion trade-off.

Motion Estimation Optical Flow Estimation +1

VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision

no code implementations • 19 Jun 2023 • Xihua Sheng, Li Li, Dong Liu, Houqiang Li

Such compact representations need to be decoded back to pixels before being displayed to humans and - as usual - before being enhanced/analyzed by machine vision algorithms.

Motion Compensation Motion Estimation +2

A Dataset for Deep Learning-based Bone Structure Analyses in Total Hip Arthroplasty

1 code implementation • 7 Jun 2023 • Kaidong Zhang, Ziyang Gan, Dong Liu, Xifu Shang

For THA, it is of clinical significance to analyze the bone structure from the CT images, especially to observe the structure of the acetabulum and femoral head, before the surgical procedure.

Active Learning Anatomy +3

Towards Interactive Image Inpainting via Sketch Refinement

1 code implementation • 1 Jun 2023 • Chang Liu, Shunxin Xu, Jialun Peng, Kaidong Zhang, Dong Liu

To address this problem, we propose a two-stage image inpainting method termed SketchRefiner.

Image Inpainting

Imbalance-Agnostic Source-Free Domain Adaptation via Avatar Prototype Alignment

no code implementations • 22 May 2023 • Hongbin Lin, Mingkui Tan, Yifan Zhang, Zhen Qiu, Shuaicheng Niu, Dong Liu, Qing Du, Yanxia Liu

To address this issue, we study a more practical SF-UDA task, termed imbalance-agnostic SF-UDA, where the class distributions of both the unseen source domain and unlabeled target domain are unknown and could be arbitrarily skewed.

Pseudo Label Source-Free Domain Adaptation +1

Late-Constraint Diffusion Guidance for Controllable Image Synthesis

1 code implementation • 19 May 2023 • Chang Liu, Dong Liu

Specifically, we train a lightweight condition adapter to establish the correlation between external conditions and internal representations of diffusion models.

Ranked #1 on Conditional Text-to-Image Synthesis on COCO 2017 val

Conditional Image Generation Conditional Text-to-Image Synthesis

Customized Segment Anything Model for Medical Image Segmentation

1 code implementation • 26 Apr 2023 • Kaidong Zhang, Dong Liu

Different from the previous methods, SAMed is built upon the large-scale image segmentation model, Segment Anything Model (SAM), to explore the new research paradigm of customizing large-scale models for medical image segmentation.

Image Segmentation Medical Image Segmentation +3

393

Mask-Based Modeling for Neural Radiance Fields

1 code implementation • 11 Apr 2023 • Ganlin Yang, Guoqiang Wei, Zhizheng Zhang, Yan Lu, Dong Liu

Most Neural Radiance Fields (NeRFs) exhibit limited generalization capabilities, which restrict their applicability in representing multiple scenes using a single model.

Representation Learning

PyramidFlow: High-Resolution Defect Contrastive Localization using Pyramid Normalizing Flow

2 code implementations • CVPR 2023 • Jiarui Lei, Xiaobo Hu, Yue Wang, Dong Liu

During industrial processing, unforeseen defects may arise in products due to uncontrollable factors.

Ranked #5 on Anomaly Detection on BTAD (using extra training data)

Anomaly Detection Vocal Bursts Intensity Prediction

Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting

2 code implementations • 24 Jan 2023 • Kaidong Zhang, Jialun Peng, Jingjing Fu, Dong Liu

Transformers have been widely used for video processing owing to the multi-head self attention (MHSA) mechanism.

Ranked #1 on Video Inpainting on DAVIS (SSIM (square) metric)

Optical Flow Estimation Video Inpainting

276

Co-Salient Object Detection With Uncertainty-Aware Group Exchange-Masking

no code implementations • CVPR 2023 • Yang Wu, Huihui Song, Bo Liu, Kaihua Zhang, Dong Liu

To address this issue, this paper presents a group exchange-masking (GEM) strategy for robust CoSOD model learning.

Co-Salient Object Detection object-detection +2

Unsupervised Video Object Segmentation with Online Adversarial Self-Tuning

no code implementations • ICCV 2023 • Tiankang Su, Huihui Song, Dong Liu, Bo Liu, Qingshan Liu

We integrate our offline training and online fine-tuning in a unified framework for unsupervised video object segmentation and dub our method Online Adversarial Self-Tuning (OAST).

Object Pseudo Label +4

Spatial-then-Temporal Self-Supervised Learning for Video Correspondence

1 code implementation • CVPR 2023 • Rui Li, Dong Liu

Specifically, we firstly extract spatial features from unlabeled images via contrastive learning, and secondly enhance the features by exploiting the temporal cues in unlabeled videos via reconstructive learning.

Contrastive Learning Self-Supervised Learning

Flow-Guided Transformer for Video Inpainting

1 code implementation • 14 Aug 2022 • Kaidong Zhang, Jingjing Fu, Dong Liu

Especially in spatial transformer, we design a dual perspective spatial MHSA, which integrates the global tokens to the window-based attention.

Retrieval Video Inpainting

276

Towards Hybrid-Optimization Video Coding

no code implementations • 12 Jul 2022 • Shuai Huo, Dong Liu, Li Li, Siwei Ma, Feng Wu, Wen Gao

Our idea is to provide multiple discrete starting points in the global space and to efficiently find the local optimum around each point with a numerical algorithm.

Recurrent Dynamic Embedding for Video Object Segmentation

1 code implementation • CVPR 2022 • Mingxing Li, Li Hu, Zhiwei Xiong, Bang Zhang, Pan Pan, Dong Liu

In this paper, we propose a Recurrent Dynamic Embedding (RDE) to build a memory bank of constant size.

Ranked #16 on Semi-Supervised Video Object Segmentation on MOSE

Object Semantic Segmentation +2

Neural Compression-Based Feature Learning for Video Restoration

no code implementations • CVPR 2022 • Cong Huang, Jiahao Li, Bin Li, Dong Liu, Yan Lu

The temporal features usually contain various noisy and uncorrelated information, and they may interfere with the restoration of the current frame.

Denoising Quantization +3

aiWave: Volumetric Image Compression with 3-D Trained Affine Wavelet-like Transform

no code implementations • 11 Mar 2022 • Dongmei Xue, Haichuan Ma, Li Li, Dong Liu, Zhiwei Xiong

Volumetric image compression has become an urgent task to effectively transmit and store images produced in biological research and clinical practice.

Image Compression

Retinal Vessel Segmentation with Pixel-wise Adaptive Filters

1 code implementation • 3 Feb 2022 • Mingxing Li, Shenglong Zhou, Chang Chen, Yueyi Zhang, Dong Liu, Zhiwei Xiong

Accurate retinal vessel segmentation is challenging because of the complex texture of retinal vessels and low imaging contrast.

Retinal Vessel Segmentation Segmentation

Motion-Focused Contrastive Learning of Video Representations

1 code implementation • ICCV 2021 • Rui Li, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu, Tao Mei

To this end, we compose a duet of exploiting the motion for data augmentation and feature learning in the regime of contrastive learning.

Contrastive Learning Data Augmentation +2

Inertia-Guided Flow Completion and Style Fusion for Video Inpainting

1 code implementation • CVPR 2022 • Kaidong Zhang, Jingjing Fu, Dong Liu

We propose a flow completion network to align and aggregate flow features from the consecutive flow sequences based on the inertia prior.

Optical Flow Estimation valid +1

Attribute Artifacts Removal for Geometry-based Point Cloud Compression

no code implementations • 1 Dec 2021 • Xihua Sheng, Li Li, Dong Liu, Zhiwei Xiong

In this paper, we propose a Multi-Scale Graph Attention Network (MS-GAT) to remove the artifacts of point cloud attributes compressed by G-PCC.

Attribute Graph Attention +2

Temporal Context Mining for Learned Video Compression

1 code implementation • 27 Nov 2021 • Xihua Sheng, Jiahao Li, Bin Li, Li Li, Dong Liu, Yan Lu

From the stored propagated features, we propose to learn multi-scale temporal contexts, and re-fill the learned temporal contexts into the modules of our compression scheme, including the contextual encoder-decoder, the frame generator, and the temporal context encoder.

MS-SSIM SSIM +1

310

Deep Reinforcement Learning Aided Packet-Routing For Aeronautical Ad-Hoc Networks Formed by Passenger Planes

no code implementations • 28 Oct 2021 • Dong Liu, Jingjing Cui, Jiankang Zhang, Chenyang Yang, Lajos Hanzo

Data packet routing in aeronautical ad-hoc networks (AANETs) is challenging due to their high-dynamic topology.

Deep Learning Aided Routing for Space-Air-Ground Integrated Networks Relying on Real Satellite, Flight, and Shipping Data

no code implementations • 28 Oct 2021 • Dong Liu, Jiankang Zhang, Jingjing Cui, Soon-Xin Ng, Robert G. Maunder, Lajos Hanzo

Current maritime communications mainly rely on satellites having meager transmission resources, hence suffering from poorer performance than modern terrestrial wireless networks.

Deep Learning Aided Packet Routing in Aeronautical Ad-Hoc Networks Relying on Real Flight Data: From Single-Objective to Near-Pareto Multi-Objective Optimization

no code implementations • 28 Oct 2021 • Dong Liu, Jiankang Zhang, Jingjing Cui, Soon-Xin Ng, Robert G. Maunder, Lajos Hanzo

Furthermore, we extend the DL-aided routing algorithm to a multi-objective scenario, where we aim for simultaneously minimizing the delay, maximizing the path capacity, and maximizing the path lifetime.

End-to-End Image Compression with Probabilistic Decoding

no code implementations • 30 Sep 2021 • Haichuan Ma, Dong Liu, Cunhui Dong, Li Li, Feng Wu

However, this nature was seldom considered in previous studies on image compression, which usually chose one possible image as reconstruction, e.g., the one with the maximal a posteriori probability.

Image Compression

Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, And No Retraining

1 code implementation • ICLR 2022 • Lu Miao, Xiaolong Luo, Tianlong Chen, Wuyang Chen, Dong Liu, Zhangyang Wang

Conventional methods often require (iterative) pruning followed by re-training, which not only incurs large overhead beyond the original DNN training but also can be sensitive to retraining hyperparameters.

iWave3D: End-to-end Brain Image Compression with Trainable 3-D Wavelet Transform

no code implementations • 18 Sep 2021 • Dongmei Xue, Haichuan Ma, Li Li, Dong Liu, Zhiwei Xiong

With the rapid development of whole brain imaging technology, a large number of brain images have been produced, which puts forward a great demand for efficient brain image compression methods.

Image Compression

Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images

1 code implementation • ICCV 2021 • Size Wu, Sheng Jin, Wentao Liu, Lei Bai, Chen Qian, Dong Liu, Wanli Ouyang

Following the top-down paradigm, we decompose the task into two stages, i.e., person localization and pose estimation.

Ranked #2 on 3D Multi-Person Pose Estimation on Panoptic (using extra training data)

3D Multi-Person Pose Estimation 3D Pose Estimation +1

CERL: A Unified Optimization Framework for Light Enhancement with Realistic Noise

1 code implementation • 1 Aug 2021 • Zeyuan Chen, Yifan Jiang, Dong Liu, Zhangyang Wang

We present Coordinated Enhancement for Real-world Low-light Noisy Images (CERL), that seamlessly integrates light enhancement and noise suppression parts into a unified and physics-grounded optimization framework.

Denoising

Learned Image Compression with Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules

1 code implementation • 14 Jul 2021 • Haisheng Fu, Feng Liang, Jianping Lin, Bing Li, Mohammad Akbari, Jie Liang, Guohe Zhang, Dong Liu, Chengjie Tu, Jingning Han

However, due to the vast diversity of images, it is not optimal to use one model for all images, even different regions within one image.

Image Compression MS-SSIM +1

THE DCASE 2021 CHALLENGE TASK 6 SYSTEM: AUTOMATED AUDIO CAPTIONING WITH WEAKLY SUPERVISED PRE-TRAING AND WORD SELECTION METHODS

no code implementations • DCASE workshop 2021 • Weiqiang Yuan, Qichen Han, Dong Liu, Xiang Li, Zhen Yang

Our solution focuses on solving two problems in automated audio captioning: data insufficiency and word selection indeterminacy.

Ranked #1 on Audio captioning on Clotho (using extra training data)

Audio captioning Caption Generation

Normalizing Flow based Hidden Markov Models for Classification of Speech Phones with Explainability

1 code implementation • 1 Jul 2021 • Anubhab Ghosh, Antoine Honoré, Dong Liu, Gustav Eje Henter, Saikat Chatterjee

For a standard speech phone classification setup involving 39 phones (classes) and the TIMIT dataset, we show that the use of standard features called mel-frequency cepstral coefficients (MFCCs), the proposed generative models, and the decision fusion together can achieve $86.6\%$ accuracy by generative training only.

Classification

PSD: Principled Synthetic-to-Real Dehazing Guided by Physical Priors

1 code implementation • CVPR 2021 • Zeyuan Chen, Yangchao Wang, Yang Yang, Dong Liu

Deep learning-based methods have achieved remarkable performance for image dehazing.

Image Dehazing

109

Structured Multi-Level Interaction Network for Video Moment Localization via Language Query

no code implementations • CVPR 2021 • Hao Wang, Zheng-Jun Zha, Liang Li, Dong Liu, Jiebo Luo

In particular, for cross-modal interaction, we let the sentence-level query interact with the whole moment and the word-level query interact with content and boundary, in a coarse-to-fine manner.

Sentence

Light Field Super-Resolution With Zero-Shot Learning

no code implementations • CVPR 2021 • Zhen Cheng, Zhiwei Xiong, Chang Chen, Dong Liu, Zheng-Jun Zha

To fill this gap, we propose a zero-shot learning framework for light field SR, which learns a mapping to super-resolve the reference view with examples extracted solely from the input low-resolution light field itself.

Super-Resolution Zero-Shot Learning

Adaptive Domain-Specific Normalization for Generalizable Person Re-Identification

no code implementations • 7 May 2021 • Jiawei Liu, Zhipeng Huang, Kecheng Zheng, Dong Liu, Xiaoyan Sun, Zheng-Jun Zha

It describes an unseen target domain as a combination of the known source ones, and explicitly learns a domain-specific representation with the target distribution to improve the model's generalization by a meta-learning pipeline.

Generalizable Person Re-identification Meta-Learning

Simultaneous Navigation and Construction Benchmarking Environments

1 code implementation • 31 Mar 2021 • Wenyu Han, Chen Feng, Haoran Wu, Alexander Gao, Armand Jordana, Dong Liu, Lerrel Pinto, Ludovic Righetti

We need intelligent robots for mobile construction, the process of navigating in an environment and modifying its structure according to a geometric design.

Benchmarking Reinforcement Learning (RL) +2

Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE

2 code implementations • CVPR 2021 • Jialun Peng, Dong Liu, Songcen Xu, Houqiang Li

We propose a two-stage model for diverse inpainting, where the first stage generates multiple coarse results each of which has a different structure, and the second stage refines each coarse result separately by augmenting texture.

Image Inpainting Quantization +1

172

Synergy Between Semantic Segmentation and Image Denoising via Alternate Boosting

no code implementations • 24 Feb 2021 • Shunxin Xu, Ke Sun, Dong Liu, Zhiwei Xiong, Zheng-Jun Zha

We observe that not only denoising helps combat the drop of segmentation accuracy due to noise, but also pixel-wise semantic information boosts the capability of denoising.

Image Denoising Segmentation +1

Robust Classification using Hidden Markov Models and Mixtures of Normalizing Flows

no code implementations • 15 Feb 2021 • Anubhab Ghosh, Antoine Honoré, Dong Liu, Gustav Eje Henter, Saikat Chatterjee

We test the robustness of a maximum-likelihood (ML) based classifier where sequential data as observation is corrupted by noise.

General Classification Robust classification +2

Deep Transport Network for Unsupervised Video Object Segmentation

no code implementations • ICCV 2021 • Kaihua Zhang, Zicheng Zhao, Dong Liu, Qingshan Liu, Bo Liu

The popular unsupervised video object segmentation methods fuse the RGB frame and optical flow via a two-stream network.

Ranked #4 on Unsupervised Video Object Segmentation on FBMS test

Object Optical Flow Estimation +3

An efficient Quasi-Newton method for nonlinear inverse problems via learned singular values

no code implementations • 14 Dec 2020 • Danny Smyl, Tyler N. Tallman, Dong Liu, Andreas Hauptmann

Here we present a highly efficient data-driven Quasi-Newton method applicable to nonlinear inverse problems.

Learning Trailer Moments in Full-Length Movies

no code implementations • 19 Aug 2020 • Lezi Wang, Dong Liu, Rohit Puri, Dimitris N. Metaxas

A movie's key moments stand out of the screenplay to grab an audience's attention and make movie browsing efficient.

Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation

5 code implementations • Interspeech 2020 • Jingjing Chen, Qirong Mao, Dong Liu

By introducing an improved transformer, elements in speech sequences can interact directly, which enables DPTNet to model the speech sequences with direct context-awareness.

Ranked #15 on Speech Separation on WSJ0-2mix

Speech Separation Audio and Speech Processing Sound

2,094

Graph Neural Networks for Massive MIMO Detection

1 code implementation • 11 Jul 2020 • Andrea Scotti, Nima N. Moghadam, Dong Liu, Karl Gafvert, Jinliang Huang

In this paper, we innovatively use graph neural networks (GNNs) to learn a message-passing solution for the inference task of massive multiple-input multiple-output (MIMO) detection in wireless communication.

Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates

1 code implementation • 28 Jun 2020 • Ke Sun, Zigang Geng, Depu Meng, Bin Xiao, Dong Liu, Zhao-Xiang Zhang, Jingdong Wang

The typical bottom-up human pose estimation framework includes two stages, keypoint detection and grouping.

Keypoint Detection Multi-Person Pose Estimation +1

141

$α$ Belief Propagation for Approximate Inference

1 code implementation • 27 Jun 2020 • Dong Liu, Minh Thành Vu, Zuxing Li, Lars K. Rasmussen

To gain a better understanding of BP in general graphs, we derive an interpretable belief propagation algorithm that is motivated by minimization of a localized $\alpha$-divergence.

Efficient Integer-Arithmetic-Only Convolutional Neural Networks

1 code implementation • 21 Jun 2020 • Hengrui Zhao, Dong Liu, Houqiang Li

Considering the tradeoff between activation quantization error and network learning ability, we set an empirical rule to tune the bound of each Bounded ReLU.

Image Super-Resolution Quantization

Region-based Energy Neural Network for Approximate Inference

1 code implementation • 17 Jun 2020 • Dong Liu, Ragnar Thobaben, Lars K. Rasmussen

We term our model Region-based Energy Neural Network (RENN).

Foreground-Background Imbalance Problem in Deep Object Detectors: A Review

no code implementations • 16 Jun 2020 • Joya Chen, Qi Wu, Dong Liu, Tong Xu

Recent years have witnessed the remarkable developments made by deep learning techniques for object detection, a fundamentally challenging problem of computer vision.

Object object-detection +1

Transferring and Regularizing Prediction for Semantic Segmentation

no code implementations • CVPR 2020 • Yiheng Zhang, Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Dong Liu, Tao Mei

In the view of extremely expensive expert labeling, recent research has shown that the models trained on photo-realistic synthetic data (e.g., computer games) with computer-generated annotations can be adapted to real images.

Ranked #17 on Domain Adaptation on SYNTHIA-to-Cityscapes

Domain Adaptation Segmentation +1

M-LVC: Multiple Frames Prediction for Learned Video Compression

1 code implementation • CVPR 2020 • Jianping Lin, Dong Liu, Houqiang Li, Feng Wu

To compensate for the compression error of the auto-encoders, we further design an MV refinement network and a residual refinement network, making use of the multiple reference frames as well.

MS-SSIM SSIM +1

Accelerating Deep Reinforcement Learning With the Aid of Partial Model: Energy-Efficient Predictive Video Streaming

no code implementations • 21 Mar 2020 • Dong Liu, Jianyu Zhao, Chenyang Yang, Lajos Hanzo

Predictive power allocation is conceived for energy-efficient video streaming over mobile networks using deep reinforcement learning.

Dual Temporal Memory Network for Efficient Video Object Segmentation

no code implementations • 13 Mar 2020 • Kaihua Zhang, Long Wang, Dong Liu, Bo Liu, Qingshan Liu, Zhu Li

We present an end-to-end network which stores short- and long-term video sequence information preceding the current frame as the temporal memories to address the temporal modeling in VOS.

Object One-shot visual object segmentation +4

Is There Tradeoff between Spatial and Temporal in Video Super-Resolution?

no code implementations • 13 Mar 2020 • Haochen Zhang, Dong Liu, Zhiwei Xiong

Recent advances of deep learning lead to great success of image and video super-resolution (SR) methods that are based on convolutional neural networks (CNN).

Video Super-Resolution

On Dominant Interference in Random Networks and Communication Reliability

1 code implementation • 3 Mar 2020 • Dong Liu, Baptiste Cavarec, Lars K. Rasmussen, Jing Yue

In this paper, we study the characteristics of dominant interference power with directional reception in a random network modelled by a Poisson Point Process.

Information Theory Signal Processing

Paper
Code

Optimizing Wireless Systems Using Unsupervised and Reinforced-Unsupervised Deep Learning

no code implementations • 3 Jan 2020 • Dong Liu, Chengjian Sun, Chenyang Yang, Lajos Hanzo

If the objective and constraint functions are unavailable, reinforcement learning can be applied to find the solution of a functional optimization problem, which is however not tailored to optimization problems in wireless networks.

Hidden Markov Models for sepsis detection in preterm infants

no code implementations • 30 Oct 2019 • Antoine Honore, Dong Liu, David Forsberg, Karen Coste, Eric Herlenius, Saikat Chatterjee, Mikael Skoglund

We explore the use of traditional and contemporary hidden Markov models (HMMs) for sequential physiological data analysis and sepsis prediction in preterm infants.

regression

Optimizing electrode positions in 2D Electrical Impedance Tomography using deep learning

no code implementations • 21 Oct 2019 • Danny Smyl, Dong Liu

Further, it is found that the use of optimized electrode positions computed using the approach derived herein can reduce errors in EIT reconstructions as well as improve the distinguishability of EIT measurements.

Powering Hidden Markov Model by Neural Network based Generative Models

1 code implementation • 13 Oct 2019 • Dong Liu, Antoine Honoré, Saikat Chatterjee, Lars K. Rasmussen

In the proposed GenHMM, each HMM hidden state is associated with a neural network based generative model that has tractability of exact likelihood and provides efficient likelihood computation.

Is Heuristic Sampling Necessary in Training Deep Object Detectors?

13 code implementations • 11 Sep 2019 • Joya Chen, Dong Liu, Tong Xu, Shiwei Wu, Yifei Cheng, Enhong Chen

In this paper, we challenge the necessity of such hard/soft sampling methods for training accurate deep object detectors.

General Classification Instance Segmentation +2

9,244

A Comprehensive Benchmark for Single Image Compression Artifacts Reduction

no code implementations • 9 Sep 2019 • Jiaying Liu, Dong Liu, Wenhan Yang, Sifeng Xia, Xiaoshuai Zhang, Yuanying Dai

We present a comprehensive study and evaluation of existing single image compression artifacts removal algorithms, using a new 4K resolution benchmark including diversified foreground objects and background scenes with rich structures, called Large-scale Ideal Ultra high definition 4K (LIU4K) benchmark.

4k Image Compression +1

Paper
Add Code

Customizable Architecture Search for Semantic Segmentation

no code implementations • CVPR 2019 • Yiheng Zhang, Zhaofan Qiu, Jingen Liu, Ting Yao, Dong Liu, Tao Mei

As a result, our CAS is able to search an optimized architecture with customized constraints.

Image Segmentation Segmentation +1

Paper
Add Code

Residual Objectness for Imbalance Reduction

no code implementations • 24 Aug 2019 • Joya Chen, Dong Liu, Bin Luo, Xuezheng Peng, Tong Xu, Enhong Chen

For a long time, object detectors have suffered from extreme imbalance between foregrounds and backgrounds.

Paper
Add Code

$α$ Belief Propagation as Fully Factorized Approximation

no code implementations • 23 Aug 2019 • Dong Liu, Nima N. Moghadam, Lars K. Rasmussen, Jinliang Huang, Saikat Chatterjee

Belief propagation (BP) can do exact inference in loop-free graphs, but its performance could be poor in graphs with loops, and the understanding of its solution is limited.

Paper
Add Code

Deep High-Resolution Representation Learning for Visual Recognition

42 code implementations • 20 Aug 2019 • Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, Wenyu Liu, Bin Xiao

High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection.

Ranked #1 on Object Detection on COCO test-dev (Hardware Burden metric)

Dichotomous Image Segmentation Face Alignment +7

27,678

Paper
Code

Neural Network based Explicit Mixture Models and Expectation-maximization based Learning

1 code implementation • 31 Jul 2019 • Dong Liu, Minh Thành Vu, Saikat Chatterjee, Lars K. Rasmussen

A single latent variable is used as the common input to all the neural networks.

Paper
Code

Model-Free Unsupervised Learning for Optimization Problems with Constraints

no code implementations • 30 Jul 2019 • Chengjian Sun, Dong Liu, Chenyang Yang

In many optimization problems in wireless communications, the expressions of objective function or constraints are hard or even impossible to derive, which makes the solutions difficult to find.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Composition-Aware Image Aesthetics Assessment

no code implementations • 25 Jul 2019 • Dong Liu, Rohit Puri, Nagendra Kamath, Subhabrata Bhattachary

In this work, we propose to model the image composition information as the mutual dependency of its local regions, and design a novel architecture to leverage such information to boost the performance of aesthetics assessment.

Aesthetics Quality Assessment Image Retrieval +2

Paper
Add Code

SSFN -- Self Size-estimating Feed-forward Network with Low Complexity, Limited Need for Human Intervention, and Consistent Behaviour across Trials

no code implementations • 17 May 2019 • Saikat Chatterjee, Alireza M. Javid, Mostafa Sadeghi, Shumpei Kikuta, Dong Liu, Partha P. Mitra, Mikael Skoglund

We design a self size-estimating feed-forward network (SSFN) using a joint optimization approach that estimates the number of layers and the number of nodes and learns the weight matrices.

Image Classification

Paper
Add Code

Deep Learning-Based Video Coding: A Review and A Case Study

1 code implementation • 29 Apr 2019 • Dong Liu, Yue Li, Jianping Lin, Houqiang Li, Feng Wu

For deep schemes, pixel probability modeling and auto-encoders are the two approaches, which can be viewed as predictive coding and transform coding schemes, respectively.

Multimedia Image and Video Processing

Paper
Code

On The Classification-Distortion-Perception Tradeoff

no code implementations • NeurIPS 2019 • Dong Liu, Haochen Zhang, Zhiwei Xiong

In this paper, we extend the previous perception-distortion tradeoff to the case of classification-distortion-perception (CDP) tradeoff, where we introduce the classification error rate of the restored signal in addition to distortion and perceptual difference.

Classification General Classification

Paper
Add Code

High-Resolution Representations for Labeling Pixels and Regions

39 code implementations • 9 Apr 2019 • Ke Sun, Yang Zhao, Borui Jiang, Tianheng Cheng, Bin Xiao, Dong Liu, Yadong Mu, Xinggang Wang, Wenyu Liu, Jingdong Wang

The proposed approach achieves superior results to existing single-model networks on COCO object detection.

Ranked #7 on Semantic Segmentation on LIP val

Face Alignment Facial Landmark Detection +5

12,012

Paper
Code

Two-Stream Action Recognition-Oriented Video Super-Resolution

1 code implementation • ICCV 2019 • Haochen Zhang, Dong Liu, Zhiwei Xiong

Tailored for two-stream action recognition networks, we propose two video SR methods for the spatial and temporal streams respectively.

Action Recognition Optical Flow Estimation +3

Paper
Code

Deep High-Resolution Representation Learning for Human Pose Estimation

39 code implementations • CVPR 2019 • Ke Sun, Bin Xiao, Dong Liu, Jingdong Wang

We start from a high-resolution subnetwork as the first stage, gradually add high-to-low resolution subnetworks one by one to form more stages, and connect the multi-resolution subnetworks in parallel.

Ranked #1 on Pose Estimation on BRACE

2D Human Pose Estimation Instance Segmentation +6

27,678

Paper
Code

Entropy-regularized Optimal Transport Generative Models

no code implementations • 16 Nov 2018 • Dong Liu, Minh Thành Vu, Saikat Chatterjee, Lars K. Rasmussen

We investigate the use of entropy-regularized optimal transport (EOT) cost in developing generative models to learn implicit distributions.

Image Generation

Paper
Add Code

Endowing Robots with Longer-term Autonomy by Recovering from External Disturbances in Manipulation through Grounded Anomaly Classification and Recovery Policies

no code implementations • 11 Sep 2018 • Hongmin Wu, Shuangqi Luo, Longxin Chen, Shuangda Duan, Sakmongkon Chumkamon, Dong Liu, Yisheng Guan, Juan Rojas

Robot manipulation is increasingly poised to interact with humans in co-shared workspaces.

Anomaly Classification General Classification +2

Paper
Add Code

DADA: Deep Adversarial Data Augmentation for Extremely Low Data Regime Classification

2 code implementations • 29 Aug 2018 • Xiaofeng Zhang, Zhangyang Wang, Dong Liu, Qing Ling

While many techniques have been developed to help combat overfitting given insufficient data, the challenge remains if one tries to train deep networks, especially in the ill-posed extremely low data regimes: only a small set of labeled data is available, and nothing else, not even unlabeled data.

Data Augmentation General Classification +2

Paper
Code

IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks

3 code implementations • 1 Jun 2018 • Ke Sun, Mingjie Li, Dong Liu, Jingdong Wang

In this paper, we are interested in building lightweight and efficient convolutional neural networks.

Image Classification object-detection +1

2,917

Paper
Code

Fully Convolutional Adaptation Networks for Semantic Segmentation

no code implementations • CVPR 2018 • Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu, Tao Mei

The recent advances in deep neural networks have convincingly demonstrated high capability in learning vision models on large datasets.

Domain Adaptation Semantic Segmentation

Paper
Add Code

Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse Coding

1 code implementation • 28 Feb 2018 • Dong Liu, Ke Sun, Zhangyang Wang, Runsheng Liu, Zheng-Jun Zha

We propose an interpretable deep structure, namely the Frank-Wolfe Network (F-W Net), whose architecture is inspired by unrolling and truncating the Frank-Wolfe algorithm for solving an $L_p$-norm constrained problem with $p\geq 1$.

Handwritten Digit Recognition Image Denoising +2

Paper
Code

A Learning-based Approach to Joint Content Caching and Recommendation at Base Stations

no code implementations • 22 Jan 2018 • Dong Liu, Chenyang Yang

We then formulate a joint caching and recommendation problem maximizing the successful offloading probability, which is a mixed integer programming problem.

Paper
Add Code

Human Pose Estimation using Global and Local Normalization

no code implementations • ICCV 2017 • Ke Sun, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Dong Liu, Jingdong Wang

We present a two-stage normalization scheme, human body normalization and limb normalization, to make the distribution of the relative joint locations compact, resulting in easier learning of convolutional spatial models and more accurate pose estimation.

Pose Estimation

Paper
Add Code

Neural network-based arithmetic coding of intra prediction modes in HEVC

no code implementations • 18 Sep 2017 • Rui Song, Dong Liu, Houqiang Li, Feng Wu

In this paper, we propose an arithmetic coding strategy by training neural networks, and make preliminary studies on coding of the intra prediction modes in HEVC.

Multimedia

Paper
Add Code

Snapshot Hyperspectral Light Field Imaging

no code implementations • CVPR 2017 • Zhiwei Xiong, Lizhi Wang, Huiqun Li, Dong Liu, Feng Wu

This paper presents the first snapshot hyperspectral light field imager in practice.

Paper
Add Code

A Convolutional Neural Network Approach for Half-Pel Interpolation in Video Coding

no code implementations • 10 Mar 2017 • Ning Yan, Dong Liu, Houqiang Li, Feng Wu

To further improve the coding efficiency, sub-pel motion compensation has been utilized, which requires interpolation of fractional samples.

Multimedia

Paper
Add Code

Convolutional Neural Network-Based Block Up-sampling for Intra Frame Coding

no code implementations • 22 Feb 2017 • Yue Li, Dong Liu, Houqiang Li, Li Li, Feng Wu, Hong Zhang, Haitao Yang

A block can be down-sampled before being compressed by normal intra coding, and then up-sampled to its original resolution.

Multimedia

Paper
Add Code

A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding

1 code implementation • 24 Aug 2016 • Yuanying Dai, Dong Liu, Feng Wu

Lossy image and video compression algorithms yield visually annoying artifacts including blocking, blurring, and ringing, especially at low bit-rates.

Multimedia

Paper
Code

Comparative Deep Learning of Hybrid Representations for Image Recommendations

no code implementations • CVPR 2016 • Chenyi Lei, Dong Liu, Weiping Li, Zheng-Jun Zha, Houqiang Li

In many image-related tasks, learning expressive and discriminative representations of images is essential, and deep learning has been studied for automating the learning of such representations.

Paper
Add Code

EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video

no code implementations • 8 Jun 2015 • Guangnan Ye, Yitong Li, Hongliang Xu, Dong Liu, Shih-Fu Chang

Extensive experiments on the zero-shot event retrieval task, where no training samples are available, show that the EventNet concept library consistently and significantly outperforms the state of the art (such as the 20K ImageNet concepts trained with CNN) by a large margin of up to 207%.

Event Detection Retrieval

Paper
Add Code

Building A Large Concept Bank for Representing Events in Video

no code implementations • 29 Mar 2014 • Yin Cui, Dong Liu, Jiawei Chen, Shih-Fu Chang

In this paper, we propose to build Concept Bank, the largest concept library consisting of 4,876 concepts specifically designed to cover 631 real-world events.

Event Detection Retrieval

Paper
Add Code

$\propto$SVM for learning with label proportions

no code implementations • 4 Jun 2013 • Felix X. Yu, Dong Liu, Sanjiv Kumar, Tony Jebara, Shih-Fu Chang

We study the problem of learning with label proportions in which the training data is provided in groups and only the proportion of each class in each group is known.

Paper
Add Code

Sample-Specific Late Fusion for Visual Category Recognition

no code implementations • CVPR 2013 • Dong Liu, Kuan-Ting Lai, Guangnan Ye, Ming-Syan Chen, Shih-Fu Chang

However, the existing methods generally use a fixed fusion weight for all the scores of a classifier, and thus fail to optimally determine the fusion weight for the individual samples.

Paper
Add Code

Robust Object Co-detection

no code implementations • CVPR 2013 • Xin Guo, Dong Liu, Brendan Jou, Mojun Zhu, Anni Cai, Shih-Fu Chang

Object co-detection aims at simultaneous detection of objects of the same category from a pool of related images by exploiting consistent visual patterns present in candidate objects in the images.

Clustering Object +2

Paper
Add Code

A Bayesian Approach to Multimodal Visual Dictionary Learning

no code implementations • CVPR 2013 • Go Irie, Dong Liu, Zhenguo Li, Shih-Fu Chang

[...] dictionary learning methods rely on image descriptors alone or together with class labels.

Bayesian Inference Clustering +3

Paper
Add Code
