Search Results for author: Xin Lu

Found 50 papers, 25 papers with code

Grid R-CNN

2 code implementations • CVPR 2019 • Xin Lu, Buyu Li, Yuxin Yue, Quanquan Li, Junjie Yan

This paper proposes a novel object detection framework named Grid R-CNN, which adopts a grid guided localization mechanism for accurate object detection.

Ranked #9 on 2D Object Detection on SARDet-100K

Novel Object Detection Object +3

27,716

Grid R-CNN Plus: Faster and Better

2 code implementations • 13 Jun 2019 • Xin Lu, Buyu Li, Yuxin Yue, Quanquan Li, Junjie Yan

Grid R-CNN is a well-performing object detection framework.

Object Detection regression

27,716

MMDetection: Open MMLab Detection Toolbox and Benchmark

144 code implementations • 17 Jun 2019 • Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin

In this paper, we introduce the various features of this toolbox.

Benchmarking Instance Segmentation +2

27,716

Free-Form Image Inpainting with Gated Convolution

30 code implementations • ICCV 2019 • Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas Huang

We present a generative image inpainting system to complete images with free-form mask and guidance.

Ranked #3 on Image Inpainting on Places2 val

feature selection Image Inpainting +1

3,162

Generative Image Inpainting with Contextual Attention

28 code implementations • CVPR 2018 • Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas S. Huang

Motivated by these observations, we propose a new deep generative model-based approach which can not only synthesize novel image structures but also explicitly utilize surrounding image features as references during network training to make better predictions.

Image Inpainting

3,162

Improving Image Restoration by Revisiting Global Information Aggregation

2 code implementations • 8 Dec 2021 • Xiaojie Chu, Liangyu Chen, Chengpeng Chen, Xin Lu

Our TLC converts global operations to local ones only during inference so that they aggregate features within local spatial regions rather than the entire large images.

Ranked #1 on Color Image Denoising on Urban100 sigma30

Color Image Denoising Deblurring +7

1,991

High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis

1 code implementation • CVPR 2017 • Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li

Recent advances in deep learning have shown exciting promise in filling large holes in natural images with semantically plausible and context aware details, impacting fundamental image manipulation tasks such as object removal.

Image Inpainting Image Manipulation +1

1,291

Universal Style Transfer via Feature Transforms

15 code implementations • NeurIPS 2017 • Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

The whitening and coloring transforms reflect a direct matching of feature covariance of the content image to a given style image, which shares similar spirits with the optimization of Gram matrix based cost in neural style transfer.

Image Reconstruction Style Transfer

590

The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition

1 code implementation • 11 Oct 2022 • Jingru Tan, Bo Li, Xin Lu, Yongqiang Yao, Fengwei Yu, Tong He, Wanli Ouyang

Long-tail distribution is widely spread in real-world applications.

Image Classification Long-tailed Object Detection +4

422

HINet: Half Instance Normalization Network for Image Restoration

2 code implementations • 13 May 2021 • Liangyu Chen, Xin Lu, Jie Zhang, Xiaojie Chu, Chengpeng Chen

Specifically, we present a novel block: Half Instance Normalization Block (HIN Block), to boost the performance of image restoration networks.

Ranked #3 on Single Image Deraining on Test2800

Deblurring Image Deblurring +3

354

MAttNet: Modular Attention Network for Referring Expression Comprehension

1 code implementation • CVPR 2018 • Licheng Yu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Mohit Bansal, Tamara L. Berg

In this paper, we address referring expression comprehension: localizing an image region described by a natural language expression.

Ranked #7 on Generalized Referring Expression Segmentation on gRefCOCO

Generalized Referring Expression Segmentation Referring Expression +1

291

A Unified Model for Multi-class Anomaly Detection

1 code implementation • 8 Jun 2022 • Zhiyuan You, Lei Cui, Yujun Shen, Kai Yang, Xin Lu, Yu Zheng, Xinyi Le

For example, when learning a unified model for 15 categories in MVTec-AD, we surpass the second competitor on the tasks of both anomaly detection (from 88.1% to 96.5%) and anomaly localization (from 89.5% to 96.8%).

Unsupervised Anomaly Detection

211

RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features

1 code implementation • CVPR 2021 • Gang Zhang, Xin Lu, Jingru Tan, Jianmin Li, Zhaoxiang Zhang, Quanquan Li, Xiaolin Hu

In this work, we propose a new method called RefineMask for high-quality instance segmentation of objects and scenes, which incorporates fine-grained features during the instance-wise segmenting process in a multi-stage manner.

Instance Segmentation Semantic Segmentation +1

210

Deep Image Harmonization

2 code implementations • CVPR 2017 • Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang

Compositing is one of the most common operations in photo editing.

Image Harmonization

149

Equalization Loss v2: A New Gradient Balance Approach for Long-tailed Object Detection

2 code implementations • CVPR 2021 • Jingru Tan, Xin Lu, Gang Zhang, Changqing Yin, Quanquan Li

To address the problem of imbalanced gradients, we introduce a new version of equalization loss, called equalization loss v2 (EQL v2), a novel gradient guided reweighing mechanism that re-balances the training process for each category independently and equally.

Ranked #12 on Instance Segmentation on LVIS v1.0 val

Instance Segmentation Long-tailed Object Detection +2

149

On Efficient Transformer-Based Image Pre-training for Low-Level Vision

1 code implementation • 19 Dec 2021 • Wenbo Li, Xin Lu, Shengju Qian, Jiangbo Lu, Xiangyu Zhang, Jiaya Jia

Pre-training has marked numerous state of the arts in high-level computer vision, while few attempts have ever been made to investigate how pre-training acts in image processing systems.

Ranked #5 on Image Super-Resolution on Set5 - 2x upscaling (using extra training data)

Denoising Image Super-Resolution

119

Few-shot Object Counting with Similarity-Aware Feature Enhancement

1 code implementation • 22 Jan 2022 • Zhiyuan You, Kai Yang, Wenhan Luo, Xin Lu, Lei Cui, Xinyi Le

This work studies the problem of few-shot object counting, which counts the number of exemplar objects (i.e., described by one or several support images) occurring in the query image.

Ranked #2 on Object Counting on CARPK

Crowd Counting Object Counting

113

Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers

3 code implementations • ICLR 2018 • Jianbo Ye, Xin Lu, Zhe Lin, James Z. Wang

Model pruning has become a useful technique that improves the computational efficiency of deep learning, making it possible to deploy solutions in resource-limited scenarios.

Computational Efficiency

Flow-Grounded Spatial-Temporal Video Prediction from Still Images

1 code implementation • ECCV 2018 • Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

Existing video prediction methods mainly rely on observing multiple historical frames or focus on predicting the next one-frame.

Video Prediction

Recurrent Multimodal Interaction for Referring Image Segmentation

1 code implementation • ICCV 2017 • Chenxi Liu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Alan Yuille

In this paper we are interested in the problem of image segmentation given natural language descriptions, i.e., referring expressions.

Image Segmentation Segmentation +1

Scene Parsing with Global Context Embedding

1 code implementation • ICCV 2017 • Wei-Chih Hung, Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang

We present a scene parsing method that utilizes global context information based on both the parametric and non-parametric models.

Scene Parsing

Improving Long-tailed Object Detection with Image-Level Supervision by Multi-Task Collaborative Learning

1 code implementation • 11 Oct 2022 • Bo Li, Yongqiang Yao, Jingru Tan, Xin Lu, Fengwei Yu, Ye Luo, Jianwei Lu

Specifically, there are an object detection task (consisting of an instance-classification task and a localization task) and an image-classification task in our framework, responsible for utilizing the two types of supervision.

Classification Contrastive Learning +4

Don't Lose Yourself! Empathetic Response Generation via Explicit Self-Other Awareness

1 code implementation • 8 Oct 2022 • Weixiang Zhao, Yanyan Zhao, Xin Lu, Bing Qin

As a critical step toward achieving human-like chatbots, empathetic response generation has attracted increasing interest.

Empathetic Response Generation Response Generation

Multiscale methods for signal selection in single-cell data

1 code implementation • 15 Jun 2022 • Renee S. Hoekzema, Lewis Marsh, Otto Sumray, Thomas M. Carroll, Xin Lu, Helen M. Byrne, Heather A. Harrington

Analysis of single-cell transcriptomics often relies on clustering cells and then performing differential gene expression (DGE) to identify genes that vary between these clusters.

feature selection

A Transition-based Parser for Unscoped Episodic Logical Forms

1 code implementation • IWCS (ACL) 2021 • Gene Louis Kim, Viet Duong, Xin Lu, Lenhart Schubert

"Episodic Logic: Unscoped Logical Form" (EL-ULF) is a semantic representation capturing predicate-argument structure as well as more challenging aspects of language within the Episodic Logic formalism.

Diversified Texture Synthesis with Feed-forward Networks

no code implementations • CVPR 2017 • Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

Recent progress in deep discriminative and generative modeling has shown promising results on texture synthesis.

Texture Synthesis

Contemplating Visual Emotions: Understanding and Overcoming Dataset Bias

no code implementations • ECCV 2018 • Rameswar Panda, Jianming Zhang, Haoxiang Li, Joon-Young Lee, Xin Lu, Amit K. Roy-Chowdhury

While machine learning approaches to visual emotion recognition offer great promise, current methods consider training and testing models on small scale datasets covering limited visual emotion concepts.

Emotion Recognition

Foreground-aware Image Inpainting

no code implementations • CVPR 2019 • Wei Xiong, Jiahui Yu, Zhe Lin, Jimei Yang, Xin Lu, Connelly Barnes, Jiebo Luo

We show that by such disentanglement, the contour completion model predicts reasonable contours of objects, and further substantially improves the performance of image inpainting.

Disentanglement Image Inpainting

Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation

no code implementations • ICCV 2015 • Xin Lu, Zhe Lin, Xiaohui Shen, Radomir Mech, James Z. Wang

We propose a deep multi-patch aggregation network training approach, which allows us to train models using multiple patches generated from one image.

Ranked #8 on Aesthetics Quality Assessment on AVA

Aesthetics Quality Assessment Image Quality Estimation

A deep learning framework for quality assessment and restoration in video endoscopy

no code implementations • 15 Apr 2019 • Sharib Ali, Felix Zhou, Adam Bailey, Barbara Braden, James East, Xin Lu, Jens Rittscher

Given the widespread use of endoscopy in different clinical applications, we contend that the robust and reliable identification of such artifacts and the automated restoration of corrupted video frames is a fundamental medical imaging problem.

Deblurring Image Restoration

Information Mandala: Statistical Distance Matrix with Clustering

no code implementations • 7 Jun 2020 • Xin Lu

In machine learning, observation features are measured in a metric space to obtain their distance function for optimization.

Clustering Object Recognition

MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection

no code implementations • ECCV 2020 • Xin Lu, Quanquan Li, Buyu Li, Junjie Yan

In this paper, we propose MimicDet, a novel and efficient framework to train a one-stage detector by directly mimicking the two-stage features, aiming to bridge the accuracy gap between one-stage and two-stage detectors.

object-detection Object Detection

Dirac quantum well engineering on the surface of topological insulator

no code implementations • 28 Jul 2020 • Xin Lu, Mark-Oliver Goerbig

We investigate possible hybridization between these interface states as a function of the width of the topological material and of the characteristic interface size.

Mesoscale and Nanoscale Physics High Energy Physics - Theory Quantum Physics

An Iterative Emotion Interaction Network for Emotion Recognition in Conversations

no code implementations • COLING 2020 • Xin Lu, Yanyan Zhao, Yang Wu, Yijian Tian, Huipeng Chen, Bing Qin

We noticed that the gold emotion labels of the context utterances can provide explicit and accurate emotion interaction, but it is impossible to input gold labels at inference time.

Ranked #41 on Emotion Recognition in Conversation on IEMOCAP

Emotion Recognition in Conversation

Growth, Electronic Structure and Superconductivity of Ultrathin Epitaxial CoSi2 Films

no code implementations • 21 Jan 2021 • Yuan Fang, Ding Wang, Peng Li, Hang Su, Tian Le, Yi Wu, Guo-Wei Yang, Hua-Li Zhang, Zhi-Guang Xiao, Yan-Qiu Sun, Si-Yuan Hong, Yan-Wu Xie, Huan-Hua Wang, Chao Cao, Xin Lu, Hui-Qiu Yuan, Yang Liu

We report growth, electronic structure and superconductivity of ultrathin epitaxial CoSi2 films on Si(111).

Mesoscale and Nanoscale Physics

Interplay between charge order and superconductivity in the kagome metal KV$_3$Sb$_5$

no code implementations • 22 Feb 2021 • Feng Du, Shuaishuai Luo, Brenden R. Ortiz, Ye Chen, Weiyin Duan, Dongting Zhang, Xin Lu, Stephen D. Wilson, Yu Song, Huiqiu Yuan

Beyond $p\approx10$ GPa, a second superconducting dome emerges with maximum $T_{\rm c}\approx1.0$ K at $p_{\rm c2}\approx22$ GPa, which becomes fully suppressed at $p\approx28$ GPa.

Superconductivity

Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report

no code implementations • 17 May 2021 • Andrey Ignatov, Kim Byeoung-su, Radu Timofte, Angeline Pouget, Fenglong Song, Cheng Li, Shuai Xiao, Zhongqian Fu, Matteo Maggioni, Yibin Huang, Shen Cheng, Xin Lu, Yifeng Zhou, Liangyu Chen, Donghao Liu, Xiangyu Zhang, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Bin Huang, Tianbao Zhou, Shuai Liu, Lei Lei, Chaoyu Feng, Liguang Huang, Zhikun Lei, Feifei Chen

A detailed description of all models developed in the challenge is provided in this paper.

Image Denoising

Dynamic Binary Neural Network by learning channel-wise thresholds

no code implementations • 8 Oct 2021 • Jiehua Zhang, Zhuo Su, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu

The experimental results prove that our method is an effective and straightforward way to reduce information loss and enhance performance of BNNs.

Retrieve, Discriminate and Rewrite: A Simple and Effective Framework for Obtaining Affective Response in Retrieval-Based Chatbots

no code implementations • Findings (EMNLP) 2021 • Xin Lu, Yijian Tian, Yanyan Zhao, Bing Qin

To address this problem, we propose a simple and effective Retrieve-Discriminate-Rewrite framework.

Retrieval

Neighbor Regularized Bayesian Optimization for Hyperparameter Optimization

no code implementations • 7 Oct 2022 • Lei Cui, Yangguang Li, Xin Lu, Dong An, Fenggang Liu

Bayesian Optimization (BO) is a common solution to search optimal hyperparameters based on sample observations of a machine learning model.

Bayesian Optimization Hyperparameter Optimization

Boosting Binary Neural Networks via Dynamic Thresholds Learning

no code implementations • 4 Nov 2022 • Jiehua Zhang, Xueyang Zhang, Zhuo Su, Zitong Yu, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu

For ViTs, DyBinaryCCT presents the superiority of the convolutional embedding layer in fully binarized ViTs and achieves 56.1% on the ImageNet dataset, which is nearly 9% higher than the baseline.

Binarization

Detecting Temporal shape changes with the Euler Characteristic Transform

no code implementations • 21 Dec 2022 • Lewis Marsh, Felix Y. Zhou, Xiao Qin, Xin Lu, Helen M. Byrne, Heather A. Harrington

Organoids are multi-cellular structures which are cultured in vitro from stem cells to resemble specific organs (e.g., brain, liver) in their three-dimensional composition.

Topological Data Analysis

Is ChatGPT Equipped with Emotional Dialogue Capabilities?

no code implementations • 19 Apr 2023 • Weixiang Zhao, Yanyan Zhao, Xin Lu, Shilong Wang, Yanpeng Tong, Bing Qin

This report presents a study on the emotional dialogue capability of ChatGPT, an advanced language model developed by OpenAI.

Dialogue Understanding Language Modelling

Text2Layer: Layered Image Generation using Latent Diffusion Model

no code implementations • 19 Jul 2023 • Xinyang Zhang, Wentian Zhao, Xin Lu, Jeff Chien

To achieve layered image generation, we train an autoencoder that is able to reconstruct layered images and train diffusion models on the latent representation.

Image Generation Image Segmentation +1

Hybrid of representation learning and reinforcement learning for dynamic and complex robotic motion planning

no code implementations • 7 Sep 2023 • Chengmin Zhou, Xin Lu, Jiapeng Dai, Bingding Huang, Xiaoxu Liu, Pasi Fränti

Reinforcement learning algorithms generate optimal or near-optimal time-sequential predictions.

Decision Making Motion Planning +1

Data-Centric Financial Large Language Models

no code implementations • 7 Oct 2023 • Zhixuan Chu, Huaiyu Guo, Xinyuan Zhou, Yijia Wang, Fei Yu, Hong Chen, Wanqing Xu, Xin Lu, Qing Cui, Longfei Li, Jun Zhou, Sheng Li

Large language models (LLMs) show promise for natural language tasks but struggle when applied directly to complex domains like finance.

Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval

no code implementations • 10 Nov 2023 • Xin Lu, Shikun Chen, Yichao Cao, Xin Zhou, Xiaobo Lu

To handle this limitation, we substitute convolutional descriptors for attention-guided features and propose an Attributes Grouping and Mining Hashing (AGMH), which groups and embeds the category-specific visual attributes in multiple descriptors to generate a comprehensive feature representation for efficient fine-grained image retrieval.

Image Retrieval Retrieval

How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider Transformer Models

no code implementations • 4 Mar 2024 • Xin Lu, Yanyan Zhao, Bing Qin

In this work, we attempt to explain and reverse the decline in base capabilities caused by the architecture of FFN-Wider Transformers, seeking to provide some insights.

Few-Shot Learning Language Modelling +1

Vanilla Transformers are Transfer Capability Teachers

no code implementations • 4 Mar 2024 • Xin Lu, Yanyan Zhao, Bing Qin

However, studies have indicated that MoE Transformers underperform vanilla Transformers in many downstream tasks, significantly diminishing the practical value of MoE models.

Computational Efficiency

A Dual-domain Regularization Method for Ring Artifact Removal of X-ray CT

no code implementations • 13 Mar 2024 • Hongyang Zhu, Xin Lu, Yanwei Qin, Xinran Yu, Tianjiao Sun, Yunsong Zhao

The proposed model corrects the vertical stripe artifacts on the sinogram by innovatively updating the response inconsistency compensation coefficients of detector units, which is achieved by employing the group sparse constraint and the projection-view direction sparse constraint on the stripe artifacts.
