Search Results for author: Xiaoyu Wang

Found 73 papers, 23 papers with code

Generic Object Detection With Dense Neural Patterns and Regionlets

no code implementations 16 Apr 2014 Will Y. Zou, Xiaoyu Wang, Miao Sun, Yuanqing Lin

This paper addresses the challenge of establishing a bridge between deep convolutional neural networks and conventional object detection frameworks for accurate and efficient generic object detection.

Object object-detection +1

Object-centric Sampling for Fine-grained Image Classification

no code implementations 10 Dec 2014 Xiaoyu Wang, Tianbao Yang, Guobin Chen, Yuanqing Lin

In contrast, this paper proposes an object-centric sampling (OCS) scheme that samples image windows based on the object location information.

Classification Fine-Grained Image Classification +4

Hyper-Class Augmented and Regularized Deep Learning for Fine-Grained Image Classification

no code implementations CVPR 2015 Saining Xie, Tianbao Yang, Xiaoyu Wang, Yuanqing Lin

We demonstrate the success of the proposed framework on two small-scale fine-grained datasets (Stanford Dogs and Stanford Cars) and on a large-scale car dataset that we collected.

Fine-Grained Image Classification General Classification +3

A Recurrent Encoder-Decoder Network for Sequential Face Alignment

no code implementations 19 Aug 2016 Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas

We propose a novel recurrent encoder-decoder network model for real-time video-based face alignment.

Face Alignment

Deep Reinforcement Learning-based Image Captioning with Embedding Reward

no code implementations CVPR 2017 Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li

The policy network serves as a local guidance by providing the confidence of predicting the next word according to the current state.

Decision Making Image Captioning +2

SEP-Nets: Small and Effective Pattern Networks

no code implementations 13 Jun 2017 Zhe Li, Xiaoyu Wang, Xutao Lv, Tianbao Yang

By doing this, we show that previous deep CNNs such as GoogLeNet and Inception-type Nets can be compressed dramatically with marginal drop in performance.

Binarization Quantization

RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

no code implementations 17 Jan 2018 Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas

We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model.

Face Alignment

An Aggressive Genetic Programming Approach for Searching Neural Network Structure Under Computational Constraints

no code implementations 3 Jun 2018 Zhe Li, Xuehan Xiong, Zhou Ren, Ning Zhang, Xiaoyu Wang, Tianbao Yang

In this paper, we study how to design a genetic programming approach for optimizing the structure of a CNN for a given task under limited computational resources yet without imposing strong restrictions on the search space.

Evolutionary Algorithms

Fast Stochastic AUC Maximization with $O(1/n)$-Convergence Rate

no code implementations ICML 2018 Mingrui Liu, Xiaoxuan Zhang, Zaiyi Chen, Xiaoyu Wang, Tianbao Yang

In this paper, we consider statistical learning with AUC (area under the ROC curve) maximization in the classical stochastic setting, where one random data point drawn from an unknown distribution is revealed at each iteration for updating the model.

Efficient Metropolitan Traffic Prediction Based on Graph Recurrent Neural Network

no code implementations 2 Nov 2018 Xiaoyu Wang, Cailian Chen, Yang Min, Jianping He, Bo Yang, Yang Zhang

Traffic prediction is a fundamental and vital task in Intelligent Transportation Systems (ITS), but it is very challenging to achieve high accuracy while maintaining low computational complexity due to the spatiotemporal characteristics of traffic flow, especially in metropolitan settings.

Traffic Prediction

Adaptive Negative Curvature Descent with Applications in Non-convex Optimization

no code implementations NeurIPS 2018 Mingrui Liu, Zhe Li, Xiaoyu Wang, Jin-Feng Yi, Tianbao Yang

The negative curvature descent (NCD) method has been utilized to design deterministic or stochastic algorithms for non-convex optimization aiming at finding second-order stationary points or local minima.

Joint Modeling of Dense and Incomplete Trajectories for Citywide Traffic Volume Inference

no code implementations 25 Feb 2019 Xianfeng Tang, Boqing Gong, Yanwei Yu, Huaxiu Yao, Yandong Li, Haiyong Xie, Xiaoyu Wang

In this paper, we propose a novel framework for the citywide traffic volume inference using both dense GPS trajectories and incomplete trajectories captured by camera surveillance systems.

Graph Embedding

Progressive Learning of Low-Precision Networks

no code implementations 28 May 2019 Zhengguang Zhou, Wengang Zhou, Xutao Lv, Xuan Huang, Xiaoyu Wang, Houqiang Li

Recent years have witnessed the great advance of deep learning in a variety of vision tasks.

AIBench: An Industry Standard Internet Service AI Benchmark Suite

no code implementations 13 Aug 2019 Wanling Gao, Fei Tang, Lei Wang, Jianfeng Zhan, Chunxin Lan, Chunjie Luo, Yunyou Huang, Chen Zheng, Jiahui Dai, Zheng Cao, Daoyi Zheng, Haoning Tang, Kunlin Zhan, Biao Wang, Defei Kong, Tong Wu, Minghe Yu, Chongkang Tan, Huan Li, Xinhui Tian, Yatao Li, Junchao Shao, Zhenyu Wang, Xiaoyu Wang, Hainan Ye

On the basis of the AIBench framework, abstracting the real-world data sets and workloads from one of the top e-commerce providers, we design and implement the first end-to-end Internet service AI benchmark, which contains the primary modules in the critical paths of an industry scale application and is scalable to deploy on different cluster scales.

Benchmarking Learning-To-Rank

Stochastic Optimization for Non-convex Inf-Projection Problems

no code implementations ICML 2020 Yan Yan, Yi Xu, Lijun Zhang, Xiaoyu Wang, Tianbao Yang

In this paper, we study a family of non-convex and possibly non-smooth inf-projection minimization problems, where the target objective function is equal to minimization of a joint function over another variable.

Stochastic Optimization

Distributed Generative Adversarial Net

1 code implementation 19 Nov 2019 Xiaoyu Wang, Ye Deng, Jinjun Wang

Recently the Generative Adversarial Network has become a hot topic.

Generative Adversarial Network

A Simple and Effective Framework for Pairwise Deep Metric Learning

1 code implementation ECCV 2020 Qi Qi, Yan Yan, Xiaoyu Wang, Tianbao Yang

To tackle this issue, we propose a simple and effective framework to sample pairs in a batch of data for updating the model.

Binary Classification Metric Learning

Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework

1 code implementation 5 Feb 2020 Chengchun Shi, Xiaoyu Wang, Shikai Luo, Hongtu Zhu, Jieping Ye, Rui Song

A/B testing, or online experimentation, is a standard business strategy to compare a new product with an old one in pharmaceutical, technological, and traditional industries.

reinforcement-learning Reinforcement Learning (RL)

Non-Convex Optimization via Non-Reversible Stochastic Gradient Langevin Dynamics

no code implementations 6 Apr 2020 Yuanhan Hu, Xiaoyu Wang, Xuefeng Gao, Mert Gurbuzbalaban, Lingjiong Zhu

In this paper, we study the non-reversible Stochastic Gradient Langevin Dynamics (NSGLD), which is based on discretization of the non-reversible Langevin diffusion.

Stochastic Optimization

Robust Partial Matching for Person Search in the Wild

no code implementations CVPR 2020 Yingji Zhong, Xiaoyu Wang, Shiliang Zhang

This paper also contributes a Large-Scale dataset for Person Search in the wild (LSPS), which is by far the largest and the most challenging dataset for person search.

Human Detection Person Search +1

Generalised Perceptron Learning

no code implementations 7 Dec 2020 Xiaoyu Wang, Martin Benning

We present a generalisation of Rosenblatt's traditional perceptron learning algorithm to the class of proximal activation functions and demonstrate how this generalisation can be interpreted as an incremental gradient method applied to a novel energy function.

A Reinforcement Learning Framework for Time Dependent Causal Effects Evaluation in A/B Testing

no code implementations 1 Jan 2021 Chengchun Shi, Xiaoyu Wang, Shikai Luo, Rui Song, Hongtu Zhu, Jieping Ye

A/B testing, or online experimentation, is a standard business strategy to compare a new product with an old one in pharmaceutical, technological, and traditional industries.

Reinforcement Learning (RL)

Asymmetric Heavy Tails and Implicit Bias in Gaussian Noise Injections

1 code implementation 13 Feb 2021 Alexander Camuto, Xiaoyu Wang, Lingjiong Zhu, Chris Holmes, Mert Gürbüzbalaban, Umut Şimşekli

In this paper we focus on the so-called 'implicit effect' of GNIs, which is the effect of the injected noise on the dynamics of SGD.

On the Convergence of Step Decay Step-Size for Stochastic Optimization

no code implementations NeurIPS 2021 Xiaoyu Wang, Sindri Magnússon, Mikael Johansson

The convergence of stochastic gradient descent is highly dependent on the step-size, especially on non-convex problems such as neural network training.

Stochastic Optimization

Removing Adversarial Noise in Class Activation Feature Space

no code implementations ICCV 2021 Dawei Zhou, Nannan Wang, Chunlei Peng, Xinbo Gao, Xiaoyu Wang, Jun Yu, Tongliang Liu

Then, we train a denoising model to minimize the distances between the adversarial examples and the natural examples in the class activation feature space.

Adversarial Robustness Denoising

Bandwidth-based Step-Sizes for Non-Convex Stochastic Optimization

no code implementations 5 Jun 2021 Xiaoyu Wang, Mikael Johansson

Many popular learning-rate schedules for deep neural networks combine a decaying trend with local perturbations that attempt to escape saddle points and bad local minima.

Stochastic Optimization

Improving White-box Robustness of Pre-processing Defenses via Joint Adversarial Training

no code implementations 10 Jun 2021 Dawei Zhou, Nannan Wang, Xinbo Gao, Bo Han, Jun Yu, Xiaoyu Wang, Tongliang Liu

However, pre-processing methods may suffer from the robustness degradation effect, in which the defense reduces rather than improves the adversarial robustness of a target model in a white-box setting.

Adversarial Defense Adversarial Robustness

Connecting Compression Spaces with Transformer for Approximate Nearest Neighbor Search

no code implementations 30 Jul 2021 Haokui Zhang, Buzhou Tang, Wenze Hu, Xiaoyu Wang

Specifically, based on transformer, we propose a new network structure to compress the feature into a low dimensional space, and an inhomogeneous neighborhood relationship preserving (INRP) loss that aims to maintain high search accuracy.

Feature Compression Information Retrieval +2

Multilevel Stochastic Optimization for Imputation in Massive Medical Data Records

no code implementations 19 Oct 2021 Wenrui Li, Xiaoyu Wang, Yuetian Sun, Snezana Milanovic, Mark Kon, Julio Enrique Castrillon-Candas

In this paper, we apply a recently developed multi-level stochastic optimization approach to the problem of imputation in massive medical records.

Imputation Stochastic Optimization

YMIR: A Rapid Data-centric Development Platform for Vision Applications

1 code implementation 19 Nov 2021 Phoenix X. Huang, Wenze Hu, William Brendel, Manmohan Chandraker, Li-Jia Li, Xiaoyu Wang

This paper introduces an open source platform to support the rapid development of computer vision applications at scale.

Active Learning

On Uniform Boundedness Properties of SGD and its Momentum Variants

no code implementations 25 Jan 2022 Xiaoyu Wang, Mikael Johansson

In this note, we investigate uniform boundedness properties of iterates and function values along the trajectories of the stochastic gradient descent algorithm and its important momentum variant.

regression Retrieval

Semi-parametric Makeup Transfer via Semantic-aware Correspondence

1 code implementation 4 Mar 2022 Mingrui Zhu, Yun Yi, Nannan Wang, Xiaoyu Wang, Xinbo Gao

The large discrepancy between the source non-makeup image and the reference makeup image is one of the key challenges in makeup transfer.

ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer

3 code implementations 8 Mar 2022 Haokui Zhang, Wenze Hu, Xiaoyu Wang

Experiment results show that the proposed ParC-Net achieves better performance than popular light-weight ConvNets and vision transformer based models in common vision tasks and datasets, while having fewer parameters and faster inference speed.

Image Classification object-detection +3

FaceMap: Towards Unsupervised Face Clustering via Map Equation

1 code implementation 21 Mar 2022 Xiaotian Yu, Yifan Yang, Aibo Wang, Ling Xing, Hanling Yi, Guangming Lu, Xiaoyu Wang

Face clustering is an essential task in computer vision due to the explosion of related applications such as augmented reality or photo album management.

Clustering Community Detection +3

Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin

1 code implementation CVPR 2022 Hangyu Li, Nannan Wang, Xi Yang, Xiaoyu Wang, Xinbo Gao

In this paper, we learn an Adaptive Confidence Margin (Ada-CM) to fully leverage all unlabeled data for semi-supervised deep facial expression recognition.

Facial Expression Recognition Facial Expression Recognition (FER)

Implementation of an Automated Learning System for Non-experts

1 code implementation 26 Mar 2022 Phoenix X. Huang, Zhiwei Zhao, Chao Liu, Jingyi Liu, Wenze Hu, Xiaoyu Wang

This paper details the engineering implementation of an automated machine learning system called YMIR, which relies entirely on a graphical interface to interact with users.

BIG-bench Machine Learning Management

Commonsense Knowledge Salience Evaluation with a Benchmark Dataset in E-commerce

1 code implementation 22 May 2022 Yincen Qu, Ningyu Zhang, Hui Chen, Zelin Dai, Zezhong Xu, Chengming Wang, Xiaoyu Wang, Qiang Chen, Huajun Chen

In addition to formulating the new task, we also release a new Benchmark dataset of Salience Evaluation in E-commerce (BSEE) and hope to promote related research on commonsense knowledge salience evaluation.

Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation

no code implementations 4 Jul 2022 Xiaoyu Wang, Xiangyu Kong, Xiulian Peng, Yan Lu

In this paper we propose a multi-modal multi-correlation learning framework targeting the task of audio-visual speech separation.

Contrastive Learning Speech Separation

Improving Adversarial Robustness via Mutual Information Estimation

1 code implementation 25 Jul 2022 Dawei Zhou, Nannan Wang, Xinbo Gao, Bo Han, Xiaoyu Wang, Yibing Zhan, Tongliang Liu

To alleviate this negative effect, in this paper, we investigate the dependence between outputs of the target model and input adversarial samples from the perspective of information theory, and propose an adversarial defense method.

Adversarial Defense Adversarial Robustness +1

ALBench: A Framework for Evaluating Active Learning in Object Detection

1 code implementation 27 Jul 2022 Zhanpeng Feng, Shiliang Zhang, Rinyoichi Takezoe, Wenze Hu, Manmohan Chandraker, Li-Jia Li, Vijay K. Narayanan, Xiaoyu Wang

To facilitate research in this field, this paper contributes an active learning benchmark framework named ALBench for evaluating active learning in object detection.

Active Learning Image Classification +4

Lifted Bregman Training of Neural Networks

no code implementations 18 Aug 2022 Xiaoyu Wang, Martin Benning

Instead of estimating the parameters with a combination of a first-order optimisation method and back-propagation (as is the state-of-the-art), we propose the use of non-smooth first-order optimisation methods that exploit the specific structure of the novel formulation.

Denoising

Predictive Edge Caching through Deep Mining of Sequential Patterns in User Content Retrievals

no code implementations 6 Oct 2022 Chen Li, Xiaoyu Wang, Tongyu Zong, Houwei Cao, Yong Liu

Edge caching plays an increasingly important role in boosting user content retrieval performance while reducing redundant network traffic.

Retrieval

Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization

1 code implementation 7 Oct 2022 Jihwan Jeong, Xiaoyu Wang, Michael Gimelfarb, Hyunwoo Kim, Baher Abdulhai, Scott Sanner

Offline reinforcement learning (RL) addresses the problem of learning a performant policy from a fixed batch of data collected by following some behavior policy.

Continuous Control D4RL +1

Fcaformer: Forward Cross Attention in Hybrid Vision Transformer

2 code implementations ICCV 2023 Haokui Zhang, Wenze Hu, Xiaoyu Wang

Currently, one main research line in designing a more efficient vision transformer is reducing the computational cost of self attention modules by adopting sparse attention or using local attention windows.

Image Classification Knowledge Distillation

ParCNetV2: Oversized Kernel with Enhanced Attention

1 code implementation ICCV 2023 Ruihan Xu, Haokui Zhang, Wenze Hu, Shiliang Zhang, Xiaoyu Wang

Specifically, we propose a new convolutional neural network, ParCNetV2, that extends position-aware circular convolution (ParCNet) with oversized convolutions and bifurcate gate units to enhance attention.

NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

1 code implementation CVPR 2023 Yun Yi, Haokui Zhang, Wenze Hu, Nannan Wang, Xiaoyu Wang

In this paper, we propose a neural architecture representation model that can be used to estimate these attributes holistically.

Representation Learning

A Critical Review of Traffic Signal Control and A Novel Unified View of Reinforcement Learning and Model Predictive Control Approaches for Adaptive Traffic Signal Control

no code implementations 26 Nov 2022 Xiaoyu Wang, Scott Sanner, Baher Abdulhai

Recent years have witnessed substantial growth in adaptive traffic signal control (ATSC) methodologies that improve transportation network efficiency, especially in branches leveraging artificial intelligence based optimization and control algorithms such as reinforcement learning as well as conventional model predictive control.

Model Predictive Control

Deep Active Learning for Computer Vision: Past and Future

no code implementations 27 Nov 2022 Rinyoichi Takezoe, Xu Liu, Shunan Mao, Marco Tianyu Chen, Zhanpeng Feng, Shiliang Zhang, Xiaoyu Wang

As an important data selection schema, active learning emerges as the essential component when iterating an Artificial Intelligence (AI) model.

Active Learning

All-to-key Attention for Arbitrary Style Transfer

no code implementations ICCV 2023 Mingrui Zhu, Xiao He, Nannan Wang, Xiaoyu Wang, Xinbo Gao

In this paper, we propose a novel all-to-key attention mechanism -- each position of content features is matched to stable key positions of style features -- that is more in line with the characteristics of style transfer.

Position Style Transfer

Universal Object Detection with Large Vision Model

1 code implementation 19 Dec 2022 Feng Lin, Wenze Hu, YaoWei Wang, Yonghong Tian, Guangming Lu, Fanglin Chen, Yong Xu, Xiaoyu Wang

In this study, our focus is on a specific challenge: the large-scale, multi-domain universal object detection problem, which contributes to the broader goal of achieving a universal vision system.

Object object-detection +1

Few-shot Face Image Translation via GAN Prior Distillation

no code implementations 28 Jan 2023 Ruoyu Zhao, Mingrui Zhu, Xiaoyu Wang, Nannan Wang

GPD contains two models: a teacher network with GAN Prior and a student network that fulfills end-to-end translation.

Knowledge Distillation Translation

A Lifted Bregman Formulation for the Inversion of Deep Neural Networks

no code implementations 1 Mar 2023 Xiaoyu Wang, Martin Benning

We propose a novel framework for the regularised inversion of deep neural networks.

Boosting Weakly-Supervised Temporal Action Localization with Text Information

1 code implementation CVPR 2023 Guozhang Li, De Cheng, Xinpeng Ding, Nannan Wang, Xiaoyu Wang, Xinbo Gao

For the discriminative objective, we propose a Text-Segment Mining (TSM) mechanism, which constructs a text description based on the action class label, and regards the text as the query to mine all class-related segments.

Sentence Weakly-supervised Temporal Action Localization +1

Path Planning for Air-Ground Robot Considering Modal Switching Point Optimization

no code implementations 14 May 2023 Xiaoyu Wang, Kangyao Huang, Xinyu Zhang, Honglin Sun, Wenzhuo Liu, Huaping Liu, Jun Li, Pingping Lu

A robot for field application environments is proposed, together with a lightweight global spatial planning technique for the robot, based on a graph-search algorithm that takes mode switching point optimization into account, with an emphasis on energy efficiency, search speed, and the viability of real deployment.

Effective Bilevel Optimization via Minimax Reformulation

no code implementations 22 May 2023 Xiaoyu Wang, Rui Pan, Renjie Pi, Tong Zhang

To address this issue, we propose a reformulation of bilevel optimization as a minimax problem, effectively decoupling the outer-inner dependency.

Bilevel Optimization Meta-Learning

Perimeter Control Using Deep Reinforcement Learning: A Model-free Approach towards Homogeneous Flow Rate Optimization

no code implementations 29 May 2023 Xiaocan Li, Ray Coden Mercurius, Ayal Taitler, Xiaoyu Wang, Mohammad Noaeen, Scott Sanner, Baher Abdulhai

Moreover, no existing studies have employed reinforcement learning for homogeneous flow rate optimization in microscopic simulation, where spatial characteristics, vehicle-level information, and metering realizations -- often overlooked in macroscopic simulations -- are taken into account.

reinforcement-learning

MixBCT: Towards Self-Adapting Backward-Compatible Training

1 code implementation 14 Aug 2023 Yu Liang, Shiliang Zhang, YaoWei Wang, Sheng Xiao, Kenli Li, Xiaoyu Wang

As a solution, backward-compatible training can be employed to avoid the necessity of updating old retrieval datasets.

Face Recognition Image Retrieval +1

GestureGPT: Zero-shot Interactive Gesture Understanding and Grounding with Large Language Model Agents

no code implementations 19 Oct 2023 Xin Zeng, Xiaoyu Wang, Tengxiang Zhang, Chun Yu, Shengdong Zhao, Yiqiang Chen

Current gesture recognition systems primarily focus on identifying gestures within a predefined set, leaving a gap in connecting these gestures to interactive GUI elements or system functions (e.g., linking a 'thumb-up' gesture to a 'like' button).

Gesture Recognition Language Modelling +1

Learning county from pixels: Corn yield prediction with attention-weighted multiple instance learning

no code implementations 2 Dec 2023 Xiaoyu Wang, Yuchi Ma, Qunying Huang, Zhengwei Yang, Zhou Zhang

Furthermore, through an in-depth study of the relationship between mixed pixels and attention, it is verified that our approach can capture critical feature information while filtering out noise from mixed pixels.

Multiple Instance Learning

Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise

no code implementations 22 Dec 2023 Rui Pan, Yuxing Liu, Xiaoyu Wang, Tong Zhang

This means SGD with heavy-ball momentum is useful in large-batch settings such as distributed machine learning or federated learning, where a smaller number of iterations can significantly reduce the number of communication rounds, leading to acceleration in practice.

Federated Learning

Accelerating Deep Learning with Millions of Classes

no code implementations ECCV 2020 Zhuoning Yuan, Zhishuai Guo, Xiaotian Yu, Xiaoyu Wang, Tianbao Yang

In our experiments, we demonstrate that the proposed framework is able to train deep learning models with millions of classes and achieve more than a 10× speedup compared to existing approaches.

Classification General Classification +1
