Search Results for author: Xin Lu

Found 50 papers, 25 papers with code

Grid R-CNN

2 code implementations CVPR 2019 Xin Lu, Buyu Li, Yuxin Yue, Quanquan Li, Junjie Yan

This paper proposes a novel object detection framework named Grid R-CNN, which adopts a grid guided localization mechanism for accurate object detection.

Novel Object Detection Object +3

Grid R-CNN Plus: Faster and Better

2 code implementations 13 Jun 2019 Xin Lu, Buyu Li, Yuxin Yue, Quanquan Li, Junjie Yan

Grid R-CNN is a well-performing object detection framework.

Object Detection regression

Generative Image Inpainting with Contextual Attention

28 code implementations CVPR 2018 Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas S. Huang

Motivated by these observations, we propose a new deep generative model-based approach which can not only synthesize novel image structures but also explicitly utilize surrounding image features as references during network training to make better predictions.

Image Inpainting

Improving Image Restoration by Revisiting Global Information Aggregation

2 code implementations 8 Dec 2021 Xiaojie Chu, Liangyu Chen, Chengpeng Chen, Xin Lu

Our TLC converts global operations to local ones only during inference so that they aggregate features within local spatial regions rather than the entire large images.

Color Image Denoising Deblurring +7

High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis

1 code implementation CVPR 2017 Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li

Recent advances in deep learning have shown exciting promise in filling large holes in natural images with semantically plausible and context aware details, impacting fundamental image manipulation tasks such as object removal.

Image Inpainting Image Manipulation +1

Universal Style Transfer via Feature Transforms

15 code implementations NeurIPS 2017 Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

The whitening and coloring transforms reflect a direct matching of feature covariance of the content image to a given style image, which shares similar spirits with the optimization of Gram matrix based cost in neural style transfer.

Image Reconstruction Style Transfer

HINet: Half Instance Normalization Network for Image Restoration

2 code implementations 13 May 2021 Liangyu Chen, Xin Lu, Jie Zhang, Xiaojie Chu, Chengpeng Chen

Specifically, we present a novel block: Half Instance Normalization Block (HIN Block), to boost the performance of image restoration networks.

Deblurring Image Deblurring +3

A Unified Model for Multi-class Anomaly Detection

1 code implementation 8 Jun 2022 Zhiyuan You, Lei Cui, Yujun Shen, Kai Yang, Xin Lu, Yu Zheng, Xinyi Le

For example, when learning a unified model for 15 categories in MVTec-AD, we surpass the second competitor on the tasks of both anomaly detection (from 88.1% to 96.5%) and anomaly localization (from 89.5% to 96.8%).

Unsupervised Anomaly Detection

RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features

1 code implementation CVPR 2021 Gang Zhang, Xin Lu, Jingru Tan, Jianmin Li, Zhaoxiang Zhang, Quanquan Li, Xiaolin Hu

In this work, we propose a new method called RefineMask for high-quality instance segmentation of objects and scenes, which incorporates fine-grained features during the instance-wise segmenting process in a multi-stage manner.

Instance Segmentation Semantic Segmentation +1

Equalization Loss v2: A New Gradient Balance Approach for Long-tailed Object Detection

2 code implementations CVPR 2021 Jingru Tan, Xin Lu, Gang Zhang, Changqing Yin, Quanquan Li

To address the problem of imbalanced gradients, we introduce a new version of equalization loss, called equalization loss v2 (EQL v2), a novel gradient guided reweighing mechanism that re-balances the training process for each category independently and equally.

Instance Segmentation Long-tailed Object Detection +2

On Efficient Transformer-Based Image Pre-training for Low-Level Vision

1 code implementation 19 Dec 2021 Wenbo Li, Xin Lu, Shengju Qian, Jiangbo Lu, Xiangyu Zhang, Jiaya Jia

Pre-training has marked numerous state of the arts in high-level computer vision, while few attempts have ever been made to investigate how pre-training acts in image processing systems.

Ranked #5 on Image Super-Resolution on Set5 - 2x upscaling (using extra training data)

Denoising Image Super-Resolution

Few-shot Object Counting with Similarity-Aware Feature Enhancement

1 code implementation 22 Jan 2022 Zhiyuan You, Kai Yang, Wenhan Luo, Xin Lu, Lei Cui, Xinyi Le

This work studies the problem of few-shot object counting, which counts the number of exemplar objects (i.e., described by one or several support images) occurring in the query image.

Crowd Counting Object Counting

Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers

3 code implementations ICLR 2018 Jianbo Ye, Xin Lu, Zhe Lin, James Z. Wang

Model pruning has become a useful technique that improves the computational efficiency of deep learning, making it possible to deploy solutions in resource-limited scenarios.

Computational Efficiency

Flow-Grounded Spatial-Temporal Video Prediction from Still Images

1 code implementation ECCV 2018 Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

Existing video prediction methods mainly rely on observing multiple historical frames or focus on predicting the next one-frame.

Video Prediction

Recurrent Multimodal Interaction for Referring Image Segmentation

1 code implementation ICCV 2017 Chenxi Liu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Alan Yuille

In this paper we are interested in the problem of image segmentation given natural language descriptions, i.e., referring expressions.

Image Segmentation Segmentation +1

Scene Parsing with Global Context Embedding

1 code implementation ICCV 2017 Wei-Chih Hung, Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang

We present a scene parsing method that utilizes global context information based on both the parametric and non-parametric models.

Scene Parsing

Improving Long-tailed Object Detection with Image-Level Supervision by Multi-Task Collaborative Learning

1 code implementation 11 Oct 2022 Bo Li, Yongqiang Yao, Jingru Tan, Xin Lu, Fengwei Yu, Ye Luo, Jianwei Lu

Specifically, our framework contains an object detection task (consisting of an instance-classification task and a localization task) and an image-classification task, which are responsible for utilizing the two types of supervision.

Classification Contrastive Learning +4

Don't Lose Yourself! Empathetic Response Generation via Explicit Self-Other Awareness

1 code implementation 8 Oct 2022 Weixiang Zhao, Yanyan Zhao, Xin Lu, Bing Qin

As a critical step towards achieving human-like chatbots, empathetic response generation has attracted increasing interest.

Empathetic Response Generation Response Generation

Multiscale methods for signal selection in single-cell data

1 code implementation 15 Jun 2022 Renee S. Hoekzema, Lewis Marsh, Otto Sumray, Thomas M. Carroll, Xin Lu, Helen M. Byrne, Heather A. Harrington

Analysis of single-cell transcriptomics often relies on clustering cells and then performing differential gene expression (DGE) to identify genes that vary between these clusters.

feature selection

A Transition-based Parser for Unscoped Episodic Logical Forms

1 code implementation IWCS (ACL) 2021 Gene Louis Kim, Viet Duong, Xin Lu, Lenhart Schubert

"Episodic Logic: Unscoped Logical Form" (EL-ULF) is a semantic representation capturing predicate-argument structure as well as more challenging aspects of language within the Episodic Logic formalism.

Diversified Texture Synthesis with Feed-forward Networks

no code implementations CVPR 2017 Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

Recent progress in deep discriminative and generative modeling has shown promising results on texture synthesis.

Texture Synthesis

Contemplating Visual Emotions: Understanding and Overcoming Dataset Bias

no code implementations ECCV 2018 Rameswar Panda, Jianming Zhang, Haoxiang Li, Joon-Young Lee, Xin Lu, Amit K. Roy-Chowdhury

While machine learning approaches to visual emotion recognition offer great promise, current methods consider training and testing models on small scale datasets covering limited visual emotion concepts.

Emotion Recognition

Foreground-aware Image Inpainting

no code implementations CVPR 2019 Wei Xiong, Jiahui Yu, Zhe Lin, Jimei Yang, Xin Lu, Connelly Barnes, Jiebo Luo

We show that by such disentanglement, the contour completion model predicts reasonable contours of objects, and further substantially improves the performance of image inpainting.

Disentanglement Image Inpainting

A deep learning framework for quality assessment and restoration in video endoscopy

no code implementations 15 Apr 2019 Sharib Ali, Felix Zhou, Adam Bailey, Barbara Braden, James East, Xin Lu, Jens Rittscher

Given the widespread use of endoscopy in different clinical applications, we contend that the robust and reliable identification of such artifacts and the automated restoration of corrupted video frames is a fundamental medical imaging problem.

Deblurring Image Restoration

Information Mandala: Statistical Distance Matrix with Clustering

no code implementations 7 Jun 2020 Xin Lu

In machine learning, observation features are measured in a metric space to obtain their distance function for optimization.

Clustering Object Recognition

MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection

no code implementations ECCV 2020 Xin Lu, Quanquan Li, Buyu Li, Junjie Yan

In this paper, we propose MimicDet, a novel and efficient framework to train a one-stage detector by directly mimicking the two-stage features, aiming to bridge the accuracy gap between one-stage and two-stage detectors.

object-detection Object Detection

Dirac quantum well engineering on the surface of topological insulator

no code implementations 28 Jul 2020 Xin Lu, Mark-Oliver Goerbig

We investigate possible hybridization between these interface states as a function of the width of the topological material and of the characteristic interface size.

Mesoscale and Nanoscale Physics High Energy Physics - Theory Quantum Physics

An Iterative Emotion Interaction Network for Emotion Recognition in Conversations

no code implementations COLING 2020 Xin Lu, Yanyan Zhao, Yang Wu, Yijian Tian, Huipeng Chen, Bing Qin

We noticed that the gold emotion labels of the context utterances can provide explicit and accurate emotion interaction, but it is impossible to input gold labels at inference time.

Emotion Recognition in Conversation

Growth, Electronic Structure and Superconductivity of Ultrathin Epitaxial CoSi2 Films

no code implementations 21 Jan 2021 Yuan Fang, Ding Wang, Peng Li, Hang Su, Tian Le, Yi Wu, Guo-Wei Yang, Hua-Li Zhang, Zhi-Guang Xiao, Yan-Qiu Sun, Si-Yuan Hong, Yan-Wu Xie, Huan-Hua Wang, Chao Cao, Xin Lu, Hui-Qiu Yuan, Yang Liu

We report growth, electronic structure and superconductivity of ultrathin epitaxial CoSi2 films on Si(111).

Mesoscale and Nanoscale Physics

Interplay between charge order and superconductivity in the kagome metal KV$_3$Sb$_5$

no code implementations 22 Feb 2021 Feng Du, Shuaishuai Luo, Brenden R. Ortiz, Ye Chen, Weiyin Duan, Dongting Zhang, Xin Lu, Stephen D. Wilson, Yu Song, Huiqiu Yuan

Beyond $p\approx10$ GPa, a second superconducting dome emerges with maximum $T_{\rm c}\approx1.0$ K at $p_{\rm c2}\approx22$ GPa, which becomes fully suppressed at $p\approx28$ GPa.

Superconductivity

Dynamic Binary Neural Network by learning channel-wise thresholds

no code implementations 8 Oct 2021 Jiehua Zhang, Zhuo Su, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu

The experimental results prove that our method is an effective and straightforward way to reduce information loss and enhance performance of BNNs.

Neighbor Regularized Bayesian Optimization for Hyperparameter Optimization

no code implementations 7 Oct 2022 Lei Cui, Yangguang Li, Xin Lu, Dong An, Fenggang Liu

Bayesian Optimization (BO) is a common solution to search optimal hyperparameters based on sample observations of a machine learning model.

Bayesian Optimization Hyperparameter Optimization

Boosting Binary Neural Networks via Dynamic Thresholds Learning

no code implementations 4 Nov 2022 Jiehua Zhang, Xueyang Zhang, Zhuo Su, Zitong Yu, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu

For ViTs, DyBinaryCCT presents the superiority of the convolutional embedding layer in fully binarized ViTs and achieves 56.1% on the ImageNet dataset, which is nearly 9% higher than the baseline.

Binarization

Detecting Temporal shape changes with the Euler Characteristic Transform

no code implementations 21 Dec 2022 Lewis Marsh, Felix Y. Zhou, Xiao Qin, Xin Lu, Helen M. Byrne, Heather A. Harrington

Organoids are multi-cellular structures which are cultured in vitro from stem cells to resemble specific organs (e.g., brain, liver) in their three-dimensional composition.

Topological Data Analysis

Is ChatGPT Equipped with Emotional Dialogue Capabilities?

no code implementations 19 Apr 2023 Weixiang Zhao, Yanyan Zhao, Xin Lu, Shilong Wang, Yanpeng Tong, Bing Qin

This report presents a study on the emotional dialogue capability of ChatGPT, an advanced language model developed by OpenAI.

Dialogue Understanding Language Modelling

Text2Layer: Layered Image Generation using Latent Diffusion Model

no code implementations 19 Jul 2023 Xinyang Zhang, Wentian Zhao, Xin Lu, Jeff Chien

To achieve layered image generation, we train an autoencoder that is able to reconstruct layered images and train diffusion models on the latent representation.

Image Generation Image Segmentation +1

Data-Centric Financial Large Language Models

no code implementations 7 Oct 2023 Zhixuan Chu, Huaiyu Guo, Xinyuan Zhou, Yijia Wang, Fei Yu, Hong Chen, Wanqing Xu, Xin Lu, Qing Cui, Longfei Li, Jun Zhou, Sheng Li

Large language models (LLMs) show promise for natural language tasks but struggle when applied directly to complex domains like finance.

Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval

no code implementations 10 Nov 2023 Xin Lu, Shikun Chen, Yichao Cao, Xin Zhou, Xiaobo Lu

To handle this limitation, we substitute convolutional descriptors for attention-guided features and propose an Attributes Grouping and Mining Hashing (AGMH), which groups and embeds the category-specific visual attributes in multiple descriptors to generate a comprehensive feature representation for efficient fine-grained image retrieval.

Image Retrieval Retrieval

How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider Transformer Models

no code implementations 4 Mar 2024 Xin Lu, Yanyan Zhao, Bing Qin

In this work, we attempt to explain and reverse the decline in base capabilities caused by the architecture of FFN-Wider Transformers, seeking to provide some insights.

Few-Shot Learning Language Modelling +1

Vanilla Transformers are Transfer Capability Teachers

no code implementations 4 Mar 2024 Xin Lu, Yanyan Zhao, Bing Qin

However, studies have indicated that MoE Transformers underperform vanilla Transformers in many downstream tasks, significantly diminishing the practical value of MoE models.

Computational Efficiency

A Dual-domain Regularization Method for Ring Artifact Removal of X-ray CT

no code implementations 13 Mar 2024 Hongyang Zhu, Xin Lu, Yanwei Qin, Xinran Yu, Tianjiao Sun, Yunsong Zhao

The proposed model corrects the vertical stripe artifacts on the sinogram by innovatively updating the response inconsistency compensation coefficients of detector units, which is achieved by employing the group sparse constraint and the projection-view direction sparse constraint on the stripe artifacts.
