Search Results for author: Xiaofei He

Found 60 papers, 23 papers with code

Model Compression and Efficient Inference for Large Language Models: A Survey

no code implementations 15 Feb 2024 Wenxiao Wang, Wei Chen, Yicong Luo, Yongliu Long, Zhengkai Lin, Liye Zhang, Binbin Lin, Deng Cai, Xiaofei He

However, large language models have two prominent characteristics compared to smaller models: (1) Most compression algorithms require finetuning or even retraining the model after compression.

Knowledge Distillation Model Compression +1

Few-shot Hybrid Domain Adaptation of Image Generators

1 code implementation 30 Oct 2023 Hengjia Li, Yang Liu, Linxuan Xia, Yuqi Lin, Tu Zheng, Zheng Yang, Wenxiao Wang, Xiaohui Zhong, Xiaobo Ren, Xiaofei He

Concretely, the distance loss blends the attributes of all target domains by reducing the distances from generated images to all target subspaces.

Domain Adaptation Semantic Similarity +1

Few-Shot Domain Adaptation for Charge Prediction on Unprofessional Descriptions

no code implementations 29 Sep 2023 Jie Zhao, Ziyu Guan, Wei Zhao, Yue Jiang, Xiaofei He

Recent works considering professional legal-linguistic style (PLLS) texts have shown promising results on the charge prediction task.

Domain Adaptation

M$^3$CS: Multi-Target Masked Point Modeling with Learnable Codebook and Siamese Decoders

no code implementations 23 Sep 2023 Qibo Qiu, Honghui Yang, Wenxiao Wang, Shun Zhang, Haiming Gao, Haochao Ying, Wei Hua, Xiaofei He

Specifically, with masked point cloud as input, M$^3$CS introduces two decoders to predict masked representations and the original points simultaneously.


SelFLoc: Selective Feature Fusion for Large-scale Point Cloud-based Place Recognition

no code implementations 1 Jun 2023 Qibo Qiu, Haiming Gao, Wenxiao Wang, Zhiyi Su, Tian Xie, Wei Hua, Xiaofei He

To enhance message passing along particular axes, the Stacked Asymmetric Convolution Block (SACB) is designed, which is one of the main contributions of this paper.

Autonomous Vehicles

PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer

1 code implementation CVPR 2023 Honghui Yang, Wenxiao Wang, Minghao Chen, Binbin Lin, Tong He, Hua Chen, Xiaofei He, Wanli Ouyang

The key to associating the two different representations is our introduced input-dependent Query Initialization module, which could efficiently generate reference points and content queries.

Autonomous Driving Quantization

Denoising Multi-modal Sequential Recommenders with Contrastive Learning

no code implementations 3 May 2023 Dong Yao, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Wenqiao Zhang, Rui Zhang, Xiaofei He, Fei Wu

In contrast, modalities that do not cause users' behaviors are potential noises and might mislead the learning of a recommendation model.

Contrastive Learning Denoising +2

CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention

1 code implementation 13 Mar 2023 Wenxiao Wang, Wei Chen, Qibo Qiu, Long Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wei Liu

On the one hand, CEL blends each token with multiple patches of different scales, providing the self-attention module itself with cross-scale features.

Image Classification Instance Segmentation +3

GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds

1 code implementation CVPR 2023 Honghui Yang, Tong He, Jiaheng Liu, Hua Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wanli Ouyang

In contrast to previous 3D MAE frameworks, which either design a complex decoder to infer masked information from maintained regions or adopt sophisticated masking strategies, we instead propose a much simpler paradigm.


CCL4Rec: Contrast over Contrastive Learning for Micro-video Recommendation

no code implementations 17 Aug 2022 Shengyu Zhang, Bofang Li, Dong Yao, Fuli Feng, Jieming Zhu, Wenyan Fan, Zhou Zhao, Xiaofei He, Tat-Seng Chua, Fei Wu

Micro-video recommender systems suffer from the ubiquitous noise in users' behaviors, which might render the learned user representation indiscriminating and lead to trivial recommendations (e.g., popular items) or even weird ones that are far beyond users' interests.

Contrastive Learning Recommendation Systems

Motion-aware Memory Network for Fast Video Salient Object Detection

1 code implementation 1 Aug 2022 Xing Zhao, Haoran Liang, Peipei Li, Guodao Sun, Dongdong Zhao, Ronghua Liang, Xiaofei He

Moreover, inspired by the boundary supervision commonly used in image salient object detection (ISOD), we design a motion-aware loss for predicting object boundary motion and simultaneously perform multitask learning for VSOD and object motion prediction, which can further facilitate the model to extract spatiotemporal features accurately and maintain the object integrity.

motion prediction Object +4

Towards Efficient Adversarial Training on Vision Transformers

no code implementations 21 Jul 2022 Boxi Wu, Jindong Gu, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu

Vision Transformer (ViT), as a powerful alternative to Convolutional Neural Network (CNN), has received much attention.

CLRNet: Cross Layer Refinement Network for Lane Detection

3 code implementations CVPR 2022 Tu Zheng, Yifei HUANG, Yang Liu, Wenjian Tang, Zheng Yang, Deng Cai, Xiaofei He

In this way, we can exploit more contextual information to detect lanes while leveraging local detailed lane features to improve localization accuracy.

Lane Detection

WeakM3D: Towards Weakly Supervised Monocular 3D Object Detection

1 code implementation ICLR 2022 Liang Peng, Senbo Yan, Boxi Wu, Zheng Yang, Xiaofei He, Deng Cai

This network is learned by minimizing our newly-proposed 3D alignment loss between the 3D box estimates and the corresponding RoI LiDAR points.

Monocular 3D Object Detection Object +2

VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation

1 code implementation 21 Feb 2022 Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He

In this paper, the VLAD aggregation method is adopted to quantize local features with visual vocabulary locally partitioning the feature space, and hence preserve the local discriminability.

Face Presentation Attack Detection

MLSLT: Towards Multilingual Sign Language Translation

no code implementations CVPR 2022 Aoxiong Yin, Zhou Zhao, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He

In addition, we also explore zero-shot translation in sign language and find that our model can achieve comparable performance to the supervised BSLT model on some language pairs.

Sign Language Translation Translation

SimulSLT: End-to-End Simultaneous Sign Language Translation

no code implementations 8 Dec 2021 Aoxiong Yin, Zhou Zhao, Jinglin Liu, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He

Sign language translation, a technology with profound social significance, has attracted growing interest from researchers in recent years.

Decoder Sign Language Translation +1

Digging Into Output Representation for Monocular 3D Object Detection

no code implementations 29 Sep 2021 Liang Peng, Senbo Yan, Chenxi Huang, Xiaofei He, Deng Cai

This characteristic indicates that monocular 3D detection is inherently different from other typical detection tasks that have the same dimensional input and output.

Monocular 3D Object Detection Object +1

Why Do We Click: Visual Impression-aware News Recommendation

1 code implementation 26 Sep 2021 Jiahao Xun, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Qi Zhang, Jingjie Li, Xiuqiang He, Xiaofei He, Tat-Seng Chua, Fei Wu

In this work, inspired by the fact that users make their click decisions mostly based on the visual impression they perceive when browsing news, we propose to capture such visual impression information with visual-semantic modeling for news recommendation.

Decision Making News Recommendation

SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory

no code implementations 31 Aug 2021 Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He

Lip reading, aiming to recognize spoken sentences according to the given video of lip movements without relying on the audio stream, has attracted great interest due to its application in many scenarios.

Lip Reading

CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention

3 code implementations ICLR 2022 Wenxiao Wang, Lu Yao, Long Chen, Binbin Lin, Deng Cai, Xiaofei He, Wei Liu

On the one hand, CEL blends each embedding with multiple patches of different scales, providing the self-attention module itself with cross-scale features.

Image Classification Instance Segmentation +4

Learning to Affiliate: Mutual Centralized Learning for Few-shot Classification

1 code implementation CVPR 2022 Yang Liu, Weifeng Zhang, Chao Xiang, Tu Zheng, Deng Cai, Xiaofei He

Few-shot learning (FSL) aims to learn a classifier that can be easily adapted to accommodate new tasks not seen during training, given only a few examples.

Classification Few-Shot Learning

Salient Object Ranking with Position-Preserved Attention

1 code implementation ICCV 2021 Hao Fang, Daoxin Zhang, Yi Zhang, Minghao Chen, Jiawei Li, Yao Hu, Deng Cai, Xiaofei He

In this paper, we study the Salient Object Ranking (SOR) task, which aims to assign a ranking order to each detected object according to its visual saliency.

Image Cropping Instance Segmentation +7

Attacking Adversarial Attacks as A Defense

no code implementations 9 Jun 2021 Boxi Wu, Heng Pan, Li Shen, Jindong Gu, Shuai Zhao, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu

In this work, we find that the adversarial attacks can also be vulnerable to small perturbations.

Discriminative-Generative Dual Memory Video Anomaly Detection

no code implementations 29 Apr 2021 Xin Guo, Zhongming Jin, Chong Chen, Helei Nie, Jianqiang Huang, Deng Cai, Xiaofei He, Xiansheng Hua

In this paper, we propose a DiscRiminative-gEnerative duAl Memory (DREAM) anomaly detection model to take advantage of a few anomalies and solve data imbalance.

Anomaly Detection Video Anomaly Detection

OCM3D: Object-Centric Monocular 3D Object Detection

no code implementations 13 Apr 2021 Liang Peng, Fei Liu, Senbo Yan, Xiaofei He, Deng Cai

Image-only and pseudo-LiDAR representations are commonly used for monocular 3D object detection.

Monocular 3D Object Detection Object +1

X-view: Non-egocentric Multi-View 3D Object Detector

no code implementations 24 Mar 2021 Liang Xie, Guodong Xu, Deng Cai, Xiaofei He

3D object detection algorithms for autonomous driving reason about 3D obstacles from the 3D bird's-eye view, the perspective view, or both.

3D Object Detection Autonomous Driving +3

DMN4: Few-shot Learning via Discriminative Mutual Nearest Neighbor Neural Network

no code implementations 15 Mar 2021 Yang Liu, Tu Zheng, Jie Song, Deng Cai, Xiaofei He

In this paper, we argue that a Mutual Nearest Neighbor (MNN) relation should be established to explicitly select the query descriptors that are most relevant to each task and discard less relevant ones from aggregative clutters in FSL.

Few-Shot Learning

ES-Net: Erasing Salient Parts to Learn More in Re-Identification

no code implementations 10 Mar 2021 Dong Shen, Shuai Zhao, Jinming Hu, Hao Feng, Deng Cai, Xiaofei He

In this paper, we propose a novel network, Erasing-Salient Net (ES-Net), to learn comprehensive features by erasing the salient areas in an image.

Reducing the Teacher-Student Gap via Spherical Knowledge Disitllation

1 code implementation 15 Oct 2020 Jia Guo, Minghao Chen, Yao Hu, Chen Zhu, Xiaofei He, Deng Cai

We investigate this problem by studying the gap in confidence between teacher and student.

Knowledge Distillation

Accelerate CNNs from Three Dimensions: A Comprehensive Pruning Framework

no code implementations 10 Oct 2020 Wenxiao Wang, Minghao Chen, Shuai Zhao, Long Chen, Jinming Hu, Haifeng Liu, Deng Cai, Xiaofei He, Wei Liu

Specifically, it first casts the relationships between a certain model's accuracy and depth/width/resolution into a polynomial regression and then maximizes the polynomial to acquire the optimal values for the three dimensions.

Network Pruning Neural Architecture Search +1

Do Wider Neural Networks Really Help Adversarial Robustness?

1 code implementation NeurIPS 2021 Boxi Wu, Jinghui Chen, Deng Cai, Xiaofei He, Quanquan Gu

Previous empirical results suggest that adversarial training requires wider networks for better performance.

Adversarial Robustness

Apparel-invariant Feature Learning for Apparel-changed Person Re-identification

no code implementations 14 Aug 2020 Zhengxu Yu, Yilun Zhao, Bin Hong, Zhongming Jin, Jianqiang Huang, Deng Cai, Xiaofei He, Xian-Sheng Hua

Therefore, it is critical to learn an apparel-invariant person representation for cases such as a person changing clothes or several persons wearing similar clothes.

Person Re-Identification Representation Learning

Out-of-distribution Generalization via Partial Feature Decorrelation

no code implementations 30 Jul 2020 Xin Guo, Zhengxu Yu, Chao Xiang, Zhongming Jin, Jianqiang Huang, Deng Cai, Xiaofei He, Xian-Sheng Hua

Most deep-learning-based image classification methods assume that all samples are generated under an independent and identically distributed (IID) setting.

Classification General Classification +3

PI-RCNN: An Efficient Multi-sensor 3D Object Detector with Point-based Attentive Cont-conv Fusion Module

no code implementations 14 Nov 2019 Liang Xie, Chao Xiang, Zhengxu Yu, Guodong Xu, Zheng Yang, Deng Cai, Xiaofei He

Moreover, based on the PACF module, we propose a 3D multi-sensor multi-task network called Pointcloud-Image RCNN (PI-RCNN for short), which handles the image segmentation and 3D object detection tasks.

3D Object Detection Image Segmentation +4

IntentGC: a Scalable Graph Convolution Framework Fusing Heterogeneous Information for Recommendation

1 code implementation 24 Jul 2019 Jun Zhao, Zhou Zhou, Ziyu Guan, Wei Zhao, Wei Ning, Guang Qiu, Xiaofei He

In this work, we collect abundant relationships from common user behaviors and item information, and propose a novel framework named IntentGC to leverage both explicit preferences and heterogeneous relationships by graph convolutional networks.

Network Embedding

Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks

no code implementations 28 Jun 2019 Zhu Zhang, Zhou Zhao, Zhijie Lin, Jingkuan Song, Xiaofei He

Concretely, we first develop a hierarchical convolutional self-attention encoder to efficiently model long-form video contents, which builds the hierarchical structure for video sequences and captures question-aware long-range dependencies from video context.

Answer Generation Decoder +2

COP: Customized Deep Model Compression via Regularized Correlation-Based Filter-Level Pruning

1 code implementation 25 Jun 2019 Wenxiao Wang, Cong Fu, Jishun Guo, Deng Cai, Xiaofei He

2) Cross-layer filter comparison is unachievable since the importance is defined locally within each layer.

Neural Network Compression

Dialogue Act Recognition via CRF-Attentive Structured Network

no code implementations SIGIR 2018 Zheqian Chen, Rongqin Yang, Zhou Zhao, Deng Cai, Xiaofei He

Dialogue Act Recognition (DAR) is a challenging problem in dialogue interpretation, which aims to attach semantic labels to utterances and characterize the speaker's intention.

Dialogue Act Classification Dialogue Interpretation +1

Keyword-based Query Comprehending via Multiple Optimized-Demand Augmentation

no code implementations 1 Nov 2017 Boyuan Pan, Hao Li, Zhou Zhao, Deng Cai, Xiaofei He

In this paper, we propose a novel neural network system that consists of a Demand Optimization Model based on passage-attention neural machine translation and a Reader Model that can find the answer given the optimized question.

Machine Reading Comprehension Machine Translation +2

Smarnet: Teaching Machines to Read and Comprehend Like Human

no code implementations 8 Oct 2017 Zheqian Chen, Rongqin Yang, Bin Cao, Zhou Zhao, Deng Cai, Xiaofei He

Machine Comprehension (MC) is a challenging task in the field of Natural Language Processing, which aims to guide the machine to comprehend a passage and answer the given question.

Question Answering Reading Comprehension +1

Learning Graph-Level Representation for Drug Discovery

2 code implementations 12 Sep 2017 Junying Li, Deng Cai, Xiaofei He

Molecules can be represented as undirected graphs, and we can utilize graph convolution networks to predict molecular properties.

Drug Discovery General Classification +1

Scaling Up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction

1 code implementation ICML 2017 Weizhong Zhang, Bin Hong, Wei Liu, Jieping Ye, Deng Cai, Xiaofei He, Jie Wang

By noting that sparse SVMs induce sparsities in both feature and sample spaces, we propose a novel approach, which is based on accurate estimations of the primal and dual optima of sparse SVMs, to simultaneously identify the inactive features and samples that are guaranteed to be irrelevant to the outputs.

Geodesic Distance Function Learning via Heat Flow on Vector Fields

no code implementations 1 May 2014 Binbin Lin, Ji Yang, Xiaofei He, Jieping Ye

Based on our theoretical analysis, we propose to first learn the gradient field of the distance function and then learn the distance function itself.

O(logT) Projections for Stochastic Optimization of Smooth and Strongly Convex Functions

no code implementations 2 Apr 2013 Lijun Zhang, Tianbao Yang, Rong Jin, Xiaofei He

Traditional algorithms for stochastic optimization require projecting the solution at each iteration into a given domain to ensure its feasibility.

Stochastic Optimization

Multi-task Vector Field Learning

no code implementations NeurIPS 2012 Binbin Lin, Sen Yang, Chiyuan Zhang, Jieping Ye, Xiaofei He

MTVFL has the following key properties: (1) the vector fields we learn are close to the gradient fields of the prediction functions; (2) within each task, the vector field is required to be as parallel as possible, which is expected to span a low-dimensional subspace; (3) the vector fields from all tasks share a low-dimensional subspace.

Multi-Task Learning

Semi-supervised Regression via Parallel Field Regularization

no code implementations NeurIPS 2011 Binbin Lin, Chiyuan Zhang, Xiaofei He

To achieve this goal, we show that the second order smoothness measures the linearity of the function, and the gradient field of a linear function has to be a parallel vector field.

