Search Results for author: Qian Zhang

Found 144 papers, 50 papers with code

OPPO’s Machine Translation Systems for WMT20

no code implementations WMT (EMNLP) 2020 Tingxun Shi, Shiyu Zhao, Xiaopu Li, Xiaoxue Wang, Qian Zhang, Di Ai, Dawei Dang, Xue Zhengshan, Jie Hao

In this paper we demonstrate our (OPPO’s) machine translation systems for the WMT20 Shared Task on News Translation for all the 22 language pairs.

Machine Translation Translation

Cross-to-merge training with class balance strategy for learning with noisy labels

1 code implementation Expert Systems with Applications 2024 Qian Zhang, Yi Zhu, Ming Yang, Ge Jin, YingWen Zhu, Qiu Chen

Although sample selection is a mainstream method in the field of learning with noisy labels, which aims to mitigate the impact of noisy labels during model training, the testing performance of these methods exhibits significant fluctuations across different noise rates and types.

Learning with noisy labels

MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning

1 code implementation13 Mar 2024 Jialv Zou, Bencheng Liao, Qian Zhang, Wenyu Liu, Xinggang Wang

Learning robust and scalable visual representations from massive multi-view video data remains a challenge in computer vision and autonomous driving.

3D Object Detection Autonomous Driving +2

VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning

no code implementations20 Feb 2024 Shaoyu Chen, Bo Jiang, Hao Gao, Bencheng Liao, Qing Xu, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang

Learning a human-like driving policy from large-scale driving demonstrations is promising, but the uncertainty and non-deterministic nature of planning make it challenging.

Autonomous Driving

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

6 code implementations17 Jan 2024 Lianghui Zhu, Bencheng Liao, Qian Zhang, Xinlong Wang, Wenyu Liu, Xinggang Wang

The results demonstrate that Vim is capable of overcoming the computation & memory constraints on performing Transformer-style understanding for high-resolution images and it has great potential to be the next-generation backbone for vision foundation models.

object-detection Object Detection +3

Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain

1 code implementation4 Jan 2024 Xuanhua He, Tao Hu, Guoli Wang, Zejin Wang, Run Wang, Qian Zhang, Keyu Yan, Ziyi Chen, Rui Li, Chenjun Xie, Jie Zhang, Man Zhou

However, current methods often ignore the difference between cell phone RAW images and DSLR camera RGB images, a difference that goes beyond the color matrix and extends to spatial structure due to resolution variations.

Image Restoration

Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery

no code implementations23 Dec 2023 Jialu Zhang, Xiaoying Yang, Wentao He, Jianfeng Ren, Qian Zhang, Titian Zhao, Ruibin Bai, Xiangjian He, Jiang Liu

A set of rewards measuring the localization accuracy, the accuracy of predicted labels, and the scale consistency among nearby patches are designed in the agent to guide the scale optimization.

Object object-detection +1

An empirical study of next-basket recommendations

no code implementations5 Dec 2023 Zhufeng Shao, Shoujin Wang, Qian Zhang, Wenpeng Lu, Zhao Li, Xueping Peng

This methodological rigor establishes a cohesive framework for the impartial evaluation of diverse NBR approaches.

Recommendation Systems

A Gronwall Inequality Based Approach to Transient Stability Assessment for Power Grids

no code implementations3 Nov 2023 Qian Zhang, Deqiang Gan

This paper proposes a novel Gronwall inequality-based method for transient stability assessment for power systems.

Affective Video Content Analysis: Decade Review and New Perspectives

no code implementations26 Oct 2023 Junxiao Xue, Jie Wang, Xuecheng Wu, Qian Zhang

In this study, we comprehensively review the development of AVCA over the past decade, particularly focusing on the most advanced methods adopted to address the three major challenges of video feature extraction, expression subjectivity, and multimodal feature fusion.

Emotional Intelligence Facial Expression Recognition +1

Circuit as Set of Points

1 code implementation NeurIPS 2023 Jialv Zou, Xinggang Wang, Jiahao Guo, Wenyu Liu, Qian Zhang, Chang Huang

In our work, we propose a novel perspective for circuit design by treating circuit components as point clouds and using Transformer-based point cloud perception methods to extract features from the circuit.

FireFly v2: Advancing Hardware Support for High-Performance Spiking Neural Network with a Spatiotemporal FPGA Accelerator

no code implementations28 Sep 2023 Jindong Li, Guobin Shen, Dongcheng Zhao, Qian Zhang, Yi Zeng

As a further step in supporting high-performance SNNs on specialized hardware, we introduce FireFly v2, an FPGA SNN accelerator that can address the issue of non-spike operation in current SOTA SNN algorithms, which presents an obstacle in the end-to-end deployment onto existing SNN hardware.

Techno-Economic Analysis of Synthetic Fuel Production from Existing Nuclear Power Plants across the United States

no code implementations21 Sep 2023 Marisol Garrouste, Michael T. Craig, Daniel Wendt, Maria Herrera Diaz, William Jenson, Qian Zhang, Brendan Kochunas

Low carbon synfuel can displace transport fossil fuels such as diesel and jet fuel and help achieve the decarbonization of the transportation sector at a global scale, but large-scale cost-effective production facilities are needed.

Artificial to Spiking Neural Networks Conversion for Scientific Machine Learning

no code implementations31 Aug 2023 Qian Zhang, Chenxi Wu, Adar Kahana, Youngeun Kim, Yuhang Li, George Em Karniadakis, Priyadarshini Panda

We introduce a method to convert Physics-Informed Neural Networks (PINNs), commonly used in scientific machine learning, to Spiking Neural Networks (SNNs), which are expected to have higher energy efficiency compared to traditional Artificial Neural Networks (ANNs).

Computational Efficiency

Improving Few-shot Image Generation by Structural Discrimination and Textural Modulation

1 code implementation30 Aug 2023 Mengping Yang, Zhe Wang, Wenyi Feng, Qian Zhang, Ting Xiao

Furthermore, the frequency awareness of the model is reinforced by encouraging the model to distinguish frequency signals.

Image Generation

DSAT-Net: Dual Spatial Attention Transformer for Building Extraction from Aerial Images

1 code implementation IEEE Geoscience and Remote Sensing Letters 2023 Renhe Zhang, Zhechun Wan, Qian Zhang, Guixu Zhang

The local attention path (LAP) uses efficient stripe convolution to generate local attention, which can alleviate the loss of information caused by down-sampling operation in the GAP and supplement the spatial details.

Extracting Buildings In Remote Sensing Images Semantic Segmentation

Conditional Perceptual Quality Preserving Image Compression

no code implementations16 Aug 2023 Tongda Xu, Qian Zhang, Yanghao Li, Dailan He, Zhe Wang, Yuanyuan Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang

We propose conditional perceptual quality, an extension of the perceptual quality defined in \citet{blau2018perception}, by conditioning it on user defined information.

Image Compression

MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction

1 code implementation10 Aug 2023 Bencheng Liao, Shaoyu Chen, Yunchi Zhang, Bo Jiang, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang

We propose a unified permutation-equivalent modeling approach, \ie, modeling map element as a point set with a group of equivalent permutations, which accurately describes the shape of map element and stabilizes the learning process.

Autonomous Driving

Universal Rates for Multiclass Learning

no code implementations5 Jul 2023 Steve Hanneke, Shay Moran, Qian Zhang

Pseudo-cubes are a structure, rooted in the work of Daniely and Shalev-Shwartz (2014), and recently shown by Brukhim, Carmon, Dinur, Moran, and Yehudayoff (2022) to characterize PAC learnability (i. e., uniform rates) for multiclass classification.

Binary Classification

ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration

1 code implementation23 Jun 2023 Jiaqi Ma, Tianheng Cheng, Guoli Wang, Qian Zhang, Xinggang Wang, Lefei Zhang

We then leverage degradation-aware visual prompts to establish a controllable and universal model for image restoration, called ProRes, which is applicable to an extensive range of image restoration tasks.

Deblurring Denoising +1

Augmenting Greybox Fuzzing with Generative AI

no code implementations11 Jun 2023 Jie Hu, Qian Zhang, Heng Yin

Large language models (LLM) pre-trained with an enormous amount of natural language corpus have proved to be effective for understanding the implicit format syntax and generating format-conforming inputs.

Vulnerability Detection

VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene

2 code implementations19 Apr 2023 Shaoyu Chen, Yunchi Zhang, Bencheng Liao, Jiafeng Xie, Tianheng Cheng, Wei Sui, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang

We design a divide-and-conquer annotation scheme to solve the spatial extensibility problem of HD map generation, and abstract map elements with a variety of geometric patterns as unified point sequence representation, which can be extended to most map elements in the driving scene.

Autonomous Driving

TinyDet: Accurate Small Object Detection in Lightweight Generic Detectors

no code implementations7 Apr 2023 Shaoyu Chen, Tianheng Cheng, Jiemin Fang, Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang

Small object detection requires the detection head to scan a large number of positions on image feature maps, which is extremely hard for computation- and energy-efficient lightweight generic detectors.

object-detection Small Object Detection

OpenInst: A Simple Query-Based Method for Open-World Instance Segmentation

no code implementations28 Mar 2023 Cheng Wang, Guoli Wang, Qian Zhang, Peng Guo, Wenyu Liu, Xinggang Wang

Fortunately, we have identified two observations that help us achieve the best of both worlds: 1) query-based methods demonstrate superiority over dense proposal-based methods in open-world instance segmentation, and 2) learning localization cues is sufficient for open world instance segmentation.

Autonomous Driving Open-World Instance Segmentation +2

Interpretable Motion Planner for Urban Driving via Hierarchical Imitation Learning

no code implementations24 Mar 2023 Bikun Wang, Zhipeng Wang, Chenhao Zhu, Zhiqiang Zhang, Zhichen Wang, Penghong Lin, Jingchu Liu, Qian Zhang

We evaluate our method both in closed-loop simulation and real world driving, and demonstrate the neural network planner has outstanding performance in complex urban autonomous driving scenarios.

Autonomous Driving Imitation Learning +1

VAD: Vectorized Scene Representation for Efficient Autonomous Driving

2 code implementations ICCV 2023 Bo Jiang, Shaoyu Chen, Qing Xu, Bencheng Liao, Jiajie Chen, Helong Zhou, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang

In this paper, we propose VAD, an end-to-end vectorized paradigm for autonomous driving, which models the driving scene as a fully vectorized representation.

Autonomous Driving Trajectory Planning

Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction

1 code implementation15 Mar 2023 Bencheng Liao, Shaoyu Chen, Bo Jiang, Tianheng Cheng, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang

We present a path-based online lane graph construction method, termed LaneGAP, which end-to-end learns the path and recovers the lane graph via a Path2Graph algorithm.

Autonomous Driving graph construction +1

Deep Learning Approach to Predict Hemorrhage in Moyamoya Disease

no code implementations1 Feb 2023 Meng Zhao, Yonggang Ma, Qian Zhang, Jizong Zhao

Objective: Reliable tools to predict moyamoya disease (MMD) patients at risk for hemorrhage could have significant value.

FireFly: A High-Throughput Hardware Accelerator for Spiking Neural Networks with Efficient DSP and Memory Optimization

no code implementations5 Jan 2023 Jindong Li, Guobin Shen, Dongcheng Zhao, Qian Zhang, Yi Zeng

To improve memory efficiency, we design a memory system to enable efficient synaptic weights and membrane voltage memory access with reasonable on-chip RAM consumption.

Towards Accurate Ground Plane Normal Estimation from Ego-Motion

1 code implementation8 Dec 2022 Jiaxin Zhang, Wei Sui, Qian Zhang, Tao Chen, Cong Yang

In this paper, we introduce a novel approach for ground plane normal estimation of wheeled vehicles.

3D Object Detection Autonomous Driving +3

Non-reversible Parallel Tempering for Deep Posterior Approximation

no code implementations20 Nov 2022 Wei Deng, Qian Zhang, Qi Feng, Faming Liang, Guang Lin

Notably, in big data scenarios, we obtain an appealing communication cost $O(P\log P)$ based on the optimal window size.

SMS: Spiking Marching Scheme for Efficient Long Time Integration of Differential Equations

no code implementations17 Nov 2022 Qian Zhang, Adar Kahana, George Em Karniadakis, Panos Stinis

We propose a Spiking Neural Network (SNN)-based explicit numerical scheme for long time integration of time-dependent Ordinary and Partial Differential Equations (ODEs, PDEs).

Multi-Camera Calibration Free BEV Representation for 3D Object Detection

no code implementations31 Oct 2022 Hongxiang Jiang, Wenming Meng, Hongmei Zhu, Qian Zhang, Jihao Yin

In advanced paradigms of autonomous driving, learning Bird's Eye View (BEV) representation from surrounding views is crucial for multi-task framework.

3D Object Detection Autonomous Driving +4

Semi-supervised Body Parsing and Pose Estimation for Enhancing Infant General Movement Assessment

2 code implementations14 Oct 2022 Haomiao Ni, Yuan Xue, Liya Ma, Qian Zhang, Xiaoye Li, Xiaolei Huang

We collected a new clinical IMV dataset with GMA annotations, and our experiments show that SPN models for body parsing and pose estimation trained on the first two datasets generalize well to the new clinical dataset and their results can significantly boost the CRNN-based GMA prediction performance.

Data Augmentation Generative Adversarial Network +1

A Systematical Evaluation for Next-Basket Recommendation Algorithms

no code implementations7 Sep 2022 Zhufeng Shao, Shoujin Wang, Qian Zhang, Wenpeng Lu, Zhao Li, Xueping Peng

Different studies often evaluate NBR approaches on different datasets, under different experimental settings, making it hard to fairly and effectively compare the performance of different NBR approaches.

Next-basket recommendation Recommendation Systems

ELMformer: Efficient Raw Image Restoration with a Locally Multiplicative Transformer

no code implementations31 Aug 2022 Jiaqi Ma, Shengyuan Yan, Lefei Zhang, Guoli Wang, Qian Zhang

In order to get raw images of high quality for downstream Image Signal Process (ISP), in this paper we present an Efficient Locally Multiplicative Transformer called ELMformer for raw image restoration.

Attribute Deblurring +2

MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

1 code implementation30 Aug 2022 Bencheng Liao, Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Qian Zhang, Wenyu Liu, Chang Huang

High-definition (HD) map provides abundant and precise environmental information of the driving scene, serving as a fundamental and indispensable component for planning in autonomous driving system.

3D Lane Detection Autonomous Driving

MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition

1 code implementation11 Aug 2022 Chuanguang Yang, Zhulin An, Helong Zhou, Linhang Cai, Xiang Zhi, Jiwen Wu, Yongjun Xu, Qian Zhang

MixSKD mutually distills feature maps and probability distributions between the random pair of original images and their mixup images in a meaningful way.

Data Augmentation Image Classification +5

BrainCog: A Spiking Neural Network based Brain-inspired Cognitive Intelligence Engine for Brain-inspired AI and Brain Simulation

no code implementations18 Jul 2022 Yi Zeng, Dongcheng Zhao, Feifei Zhao, Guobin Shen, Yiting Dong, Enmeng Lu, Qian Zhang, Yinqian Sun, Qian Liang, Yuxuan Zhao, Zhuoya Zhao, Hongjian Fang, Yuwei Wang, Yang Li, Xin Liu, Chengcheng Du, Qingqun Kong, Zizhe Ruan, Weida Bi

These brain-inspired AI models have been effectively validated on various supervised, unsupervised, and reinforcement learning tasks, and they can be used to enable AI models to be with multiple brain-inspired cognitive functions.

Decision Making

Polar Parametrization for Vision-based Surround-View 3D Detection

1 code implementation22 Jun 2022 Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Qian Zhang, Chang Huang, Wenyu Liu

Based on Polar Parametrization, we propose a surround-view 3D DEtection TRansformer, named PolarDETR.

Inductive Bias Position

Featurized Query R-CNN

1 code implementation13 Jun 2022 Wenqiang Zhang, Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Qian Zhang, Wenyu Liu

The query mechanism introduced in the DETR method is changing the paradigm of object detection and recently there are many query-based methods have obtained strong object detection performance.

Object object-detection +1

Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer

1 code implementation9 Jun 2022 Shaoyu Chen, Tianheng Cheng, Xinggang Wang, Wenming Meng, Qian Zhang, Wenyu Liu

GKT leverages the geometric priors to guide the transformer to focus on discriminative regions and unfolds kernel features to generate BEV representation.

Autonomous Driving Representation Learning

Functional Linear Regression of Cumulative Distribution Functions

1 code implementation28 May 2022 Qian Zhang, Anuran Makur, Kamyar Azizzadenesheli

In particular, given $n$ samples with $d$ basis functions, we show estimation error upper bounds of $\widetilde O(\sqrt{d/n})$ for fixed design, random design, and adversarial context cases.

Decision Making regression

Contrastive Siamese Network for Semi-supervised Speech Recognition

no code implementations27 May 2022 Soheil Khorram, Jaeyoung Kim, Anshuman Tripathi, Han Lu, Qian Zhang, Hasim Sak

This paper introduces contrastive siamese (c-siam) network, an architecture for leveraging unlabeled acoustic data in speech recognition.

speech-recognition Speech Recognition

DPSNN: A Differentially Private Spiking Neural Network with Temporal Enhanced Pooling

no code implementations24 May 2022 Jihang Wang, Dongcheng Zhao, Guobin Shen, Qian Zhang, Yi Zeng

Privacy protection is a crucial issue in machine learning algorithms, and the current privacy protection is combined with traditional artificial neural networks based on real values.

Face Recognition Image Classification +5

Graph Neural Networks Intersect Probabilistic Graphical Models: A Survey

no code implementations24 May 2022 Chenqing Hua, Sitao Luan, Qian Zhang, Jie Fu

Graph Neural Networks (GNNs) are new inference methods developed in recent years and are attracting growing attention due to their effectiveness and flexibility in solving inference and learning problems over graph-structured data.

Spiking Neural Operators for Scientific Machine Learning

no code implementations17 May 2022 Adar Kahana, Qian Zhang, Leonard Gleyzer, George Em Karniadakis

We demonstrate this new approach for classification using the SNN in the branch, achieving results comparable to the literature.

Edge-computing regression

Cross-Domain Correlation Distillation for Unsupervised Domain Adaptation in Nighttime Semantic Segmentation

no code implementations CVPR 2022 Huan Gao, Jichang Guo, Guoli Wang, Qian Zhang

The invariance of illumination or inherent difference between two images is fully explored so as to make up for the lack of labels for nighttime images.

Autonomous Driving Semantic Segmentation +1

Learning Dynamic View Synthesis With Few RGBD Cameras

no code implementations22 Apr 2022 Shengze Wang, Youngjoong Kwon, Yuan Shen, Qian Zhang, Andrei State, Jia-Bin Huang, Henry Fuchs

Experiments on the HTI dataset show that our method outperforms the baseline per-frame image fidelity and spatial-temporal consistency.

Novel View Synthesis

Cross-Image Relational Knowledge Distillation for Semantic Segmentation

1 code implementation CVPR 2022 Chuanguang Yang, Helong Zhou, Zhulin An, Xue Jiang, Yongjun Xu, Qian Zhang

Current Knowledge Distillation (KD) methods for semantic segmentation often guide the student to mimic the teacher's structured information generated from individual data samples.

Knowledge Distillation Segmentation +1

Illumination-Invariant Active Camera Relocalization for Fine-Grained Change Detection in the Wild

no code implementations13 Apr 2022 Nan Li, Wei Feng, Qian Zhang

Active camera relocalization (ACR) is a new problem in computer vision that significantly reduces the false alarm caused by image distortions due to camera pose misalignment in fine-grained change detection (FGCD).

Camera Relocalization Change Detection +1

Atomic Filter: a Weak Form of Shift Operator for Graph Signals

no code implementations1 Apr 2022 Lihua Yang, Qing Zhang, Qian Zhang, Chao Huang

In order to establish the theory of filtering, windowed Fourier transform and wavelet transform in the setting of graph signals, we need to extend the shift operation of classical signals to graph signals.

DNN-Driven Compressive Offloading for Edge-Assisted Semantic Video Segmentation

no code implementations28 Mar 2022 Xuedou Xiao, Juecheng Zhang, Wei Wang, Jianhua He, Qian Zhang

Existing compression algorithms are not fit for semantic segmentation, as the lack of obvious and concentrated regions of interest (RoIs) forces the adoption of uniform compression strategies, leading to low compression ratios or accuracy.

Optical Flow Estimation Segmentation +3

An Active Contour Model with Local Variance Force Term and Its Efficient Minimization Solver for Multi-phase Image Segmentation

no code implementations17 Mar 2022 Chaoyu Liu, Zhonghua Qiao, Qian Zhang

In this paper, we propose an active contour model with a local variance force (LVF) term that can be applied to multi-phase image segmentation problems.

Image Segmentation Segmentation +1

Modeling Complex Dependencies for Session-based Recommendations via Graph Neural Networks

no code implementations29 Jan 2022 Qian Zhang, Wenpeng Lu

Based on a strong assumption of adjacent dependency, any two adjacent items in a session are necessarily dependent in most GNN-based SBRs.

Representation Learning Session-Based Recommendations

Forgery Attack Detection in Surveillance Video Streams Using Wi-Fi Channel State Information

no code implementations24 Jan 2022 Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang

The cybersecurity breaches expose surveillance video streams to forgery attacks, under which authentic streams are falsified to hide unauthorized activities.

Time Series Time Series Analysis +1

Learning Quality-aware Representation for Multi-person Pose Regression

no code implementations4 Jan 2022 Yabo Xiao, Dongdong Yu, Xiaojuan Wang, Lei Jin, Guoli Wang, Qian Zhang

Off-the-shelf single-stage multi-person pose regression methods generally leverage the instance score (i. e., confidence of the instance localization) to indicate the pose quality for selecting the pose candidates.

regression

AdaptivePose: Human Parts as Adaptive Points

1 code implementation27 Dec 2021 Yabo Xiao, Xiaojuan Wang, Dongdong Yu, Guoli Wang, Qian Zhang, Mingshu He

Multi-person pose estimation methods generally follow top-down and bottom-up paradigms, both of which can be considered as two-stage approaches thus leading to the high computation cost and low efficiency.

Multi-Person Pose Estimation

Road-aware Monocular Structure from Motion and Homography Estimation

no code implementations16 Dec 2021 Wei Sui, Teng Chen, Jiaxin Zhang, Jiao Lu, Qian Zhang

The Depth-CNN and Pose-CNN estimate dense depth map and ego-motion respectively, solving SFM, while the Pose-CNN and Ground-CNN followed by a homography layer solve the ground plane estimation problem.

Autonomous Driving Homography Estimation +1

On Convergence of Federated Averaging Langevin Dynamics

no code implementations9 Dec 2021 Wei Deng, Qian Zhang, Yi-An Ma, Zhao Song, Guang Lin

We develop theoretical guarantees for FA-LD for strongly log-concave distributions with non-i. i. d data and study how the injected noise and the stochastic-gradient noise, the heterogeneity of data, and the varying learning rates affect the convergence.

Uncertainty Quantification

Monocular Road Planar Parallax Estimation

no code implementations22 Nov 2021 Haobo Yuan, Teng Chen, Wei Sui, Jiafeng Xie, Lefei Zhang, Yuan Li, Qian Zhang

It implies planar parallax and can be combined with the road plane serving as a reference to estimate the 3D structure by warping the consecutive frames.

3D Reconstruction Autonomous Driving

Spiking CapsNet: A Spiking Neural Network With A Biologically Plausible Routing Rule Between Capsules

no code implementations15 Nov 2021 Dongcheng Zhao, Yang Li, Yi Zeng, Jihang Wang, Qian Zhang

Our Spiking CapsNet fully combines the strengthens of SNN and CapsNet, and shows strong robustness to noise and affine transformation.

Aspect-driven User Preference and News Representation Learning for News Recommendation

no code implementations12 Oct 2021 Rongyao Wang, Wenpeng Lu, Shoujin Wang, Xueping Peng, Hao Wu, Qian Zhang

News recommender systems are essential for helping users to efficiently and effectively find out those interesting news from a large amount of news.

News Recommendation Recommendation Systems +1

Non-reversible Parallel Tempering for Uncertainty Approximation in Deep Learning

no code implementations29 Sep 2021 Wei Deng, Qian Zhang, Qi Feng, Faming Liang, Guang Lin

Parallel tempering (PT), also known as replica exchange, is the go-to workhorse for simulations of multi-modal distributions.

Adversarial Relighting Against Face Recognition

no code implementations18 Aug 2021 Qian Zhang, Qing Guo, Ruijun Gao, Felix Juefei-Xu, Hongkai Yu, Wei Feng

To this end, we first propose the physical modelbased adversarial relighting attack (ARA) denoted as albedoquotient-based adversarial relighting attack (AQ-ARA).

Adversarial Attack Face Recognition

ICAF: Iterative Contrastive Alignment Framework for Multimodal Abstractive Summarization

no code implementations11 Aug 2021 Zijian Zhang, Chang Shu, Youxin Chen, Jing Xiao, Qian Zhang, Lu Zheng

Integrating multimodal knowledge for abstractive summarization task is a work-in-progress research area, with present techniques inheriting fusion-then-generation paradigm.

Abstractive Text Summarization Sentence Summarization

Pseudo Facial Generation With Extreme Poses for Face Recognition

no code implementations CVPR 2021 Guoli Wang, Jiaqi Ma, Qian Zhang, Jiwen Lu, Jie zhou

Many of them settle it by generating fake frontal faces from extreme ones, whereas they are tough to maintain the identity information with high computational consumption and uncontrolled disturbances.

Face Recognition

THOR, Trace-based Hardware-adaptive layer-ORiented Natural Gradient Descent Computation

no code implementations AAAI Technical Track on Machine Learning 2021 Mengyun Chen, Kaixin Gao, Xiaolei Liu, Zidong Wang, Ningxi Ni, Qian Zhang, Lei Chen, Chao Ding, ZhengHai Huang, Min Wang, Shuangling Wang, Fan Yu, Xinyuan Zhao, Dachuan Xu

It is well-known that second-order optimizer can accelerate the training of deep neural networks, however, the huge computation cost of second-order optimization makes it impractical to apply in real practice.

Reducing Streaming ASR Model Delay with Self Alignment

no code implementations6 May 2021 Jaeyoung Kim, Han Lu, Anshuman Tripathi, Qian Zhang, Hasim Sak

From LibriSpeech evaluation, self alignment outperformed existing schemes: 25% and 56% less delay compared to FastEmit and constrained alignment at the similar word error rate.

Deep Online Correction for Monocular Visual Odometry

no code implementations18 Mar 2021 Jiaxin Zhang, Wei Sui, Xinggang Wang, Wenming Meng, Hongmei Zhu, Qian Zhang

Second, the poses predicted by CNNs are further improved by minimizing photometric errors via gradient updates of poses during inference phases.

Monocular Visual Odometry RTE

Take More Positives: An Empirical Study of Contrastive Learing in Unsupervised Person Re-Identification

no code implementations12 Jan 2021 Xuanyu He, Wei zhang, Ran Song, Qian Zhang, Xiangyuan Lan, Lin Ma

By studying two unsupervised person re-ID methods in a cross-method way, we point out a hard negative problem is handled implicitly by their designs of data augmentations and PK sampler respectively.

Contrastive Learning Unsupervised Person Re-Identification

Towards Cross-Modal Forgery Detection and Localization on Live Surveillance Videos

no code implementations4 Jan 2021 Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang

Traditional video forensics approaches can detect and localize forgery traces in each video frame using computationally-expensive spatial-temporal analysis, while falling short in real-time verification of live video feeds.

Time Series Analysis Video Forensics Cryptography and Security

Stacked Homography Transformations for Multi-View Pedestrian Detection

no code implementations ICCV 2021 Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan

This task is confronted with two challenges: how to establish the 3D correspondences from views to the BEV map and how to assemble occupancy information across views.

Multiview Detection Pedestrian Detection

Survey of the Detection and Classification of Pulmonary Lesions via CT and X-Ray

no code implementations31 Dec 2020 Yixuan Sun, Chengyao Li, Qian Zhang, Aimin Zhou, Guixu Zhang

In recent years, the prevalence of several pulmonary diseases, especially the coronavirus disease 2019 (COVID-19) pandemic, has attracted worldwide attention.

General Classification

Matrix optimization based Euclidean embedding with outliers

no code implementations23 Dec 2020 Qian Zhang, Xinyuan Zhao, Chao Ding

Euclidean embedding from noisy observations containing outlier errors is an important and challenging problem in statistics and machine learning.

ResizeMix: Mixing Data with Preserved Object Information and True Labels

1 code implementation21 Dec 2020 Jie Qin, Jiemin Fang, Qian Zhang, Wenyu Liu, Xingang Wang, Xinggang Wang

Especially, CutMix uses a simple but effective method to improve the classifiers by randomly cropping a patch from one image and pasting it on another image.

Data Augmentation Image Classification +3

Causality-Aware Neighborhood Methods for Recommender Systems

no code implementations17 Dec 2020 Masahiro Sato, Sho Takemori, Janmajay Singh, Qian Zhang

In this work, we unify traditional neighborhood recommendation methods with the matching estimator, and develop robust ranking methods for the causal effect of recommendations.

Causal Inference Recommendation Systems

DataVault: A Data Storage Infrastructure for the Einstein Toolkit

no code implementations11 Dec 2020 Yufeng Luo, Roland Haas, Qian Zhang, Gabrielle Allen

Data sharing is essential in the numerical simulations research.

General Relativity and Quantum Cosmology Databases

Predicting seasonal influenza using supermarket retail records

1 code implementation8 Dec 2020 Ioanna Miliou, Xinyue Xiong, Salvatore Rinzivillo, Qian Zhang, Giulio Rossetti, Fosca Giannotti, Dino Pedreschi, Alessandro Vespignani

In this paper, we propose the use of a novel data source, namely retail market data to improve seasonal influenza forecasting.

On Efficient and Robust Metrics for RANSAC Hypotheses and 3D Rigid Registration

no code implementations10 Nov 2020 Jiaqi Yang, Zhiqiang Huang, Siwen Quan, Qian Zhang, Yanning Zhang, Zhiguo Cao

This paper focuses on developing efficient and robust evaluation metrics for RANSAC hypotheses to achieve accurate 3D rigid registration.

Kernel Two-Dimensional Ridge Regression for Subspace Clustering

no code implementations3 Nov 2020 Chong Peng, Qian Zhang, Zhao Kang, Chenglizhao Chen, Qiang Cheng

It directly uses 2D data as inputs such that the learning of representations benefits from inherent structures and relationships of the data.

Clustering regression +1

Transformer Transducer: One Model Unifying Streaming and Non-streaming Speech Recognition

no code implementations7 Oct 2020 Anshuman Tripathi, Jaeyoung Kim, Qian Zhang, Han Lu, Hasim Sak

In this paper we present a Transformer-Transducer model architecture and a training technique to unify streaming and non-streaming speech recognition models into one model.

speech-recognition Speech Recognition

Deep Momentum Uncertainty Hashing

no code implementations17 Sep 2020 Chaoyou Fu, Guoli Wang, Xiang Wu, Qian Zhang, Ran He

It embodies the uncertainty of the hashing network to the corresponding input image.

Combinatorial Optimization Deep Hashing

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation

1 code implementation13 Aug 2020 Jialian Wu, Liangchen Song, Tiancai Wang, Qian Zhang, Junsong Yuan

In the classification tree, as the number of parent class nodes are significantly less, their logits are less noisy and can be utilized to suppress the wrong/noisy logits existed in the fine-grained class nodes.

Classification Few-Shot Object Detection +7

SiamParseNet: Joint Body Parsing and Label Propagation in Infant Movement Videos

1 code implementation16 Jul 2020 Haomiao Ni, Yuan Xue, Qian Zhang, Xiaolei Huang

In this paper, we propose a semi-supervised body parsing model, termed SiamParseNet (SPN), to jointly learn single frame body parsing and label propagation between frames in a semi-supervised fashion.

Meta Learning for Support Recovery in High-dimensional Precision Matrix Estimation

no code implementations22 Jun 2020 Qian Zhang, Yilin Zheng, Jean Honorio

Then for the novel task, we prove that the minimization of the $\ell_1$-regularized log-determinant Bregman divergence with the additional constraint that the support is a subset of the estimated support union could reduce the sufficient sample complexity of successful support recovery to $O(\log(|S_{\text{off}}|))$ where $|S_{\text{off}}|$ is the number of off-diagonal elements in the support union and is much less than $N$ for sparse matrices.

Meta-Learning Vocal Bursts Intensity Prediction

FNA++: Fast Network Adaptation via Parameter Remapping and Architecture Search

2 code implementations21 Jun 2020 Jiemin Fang, Yuzhu Sun, Qian Zhang, Kangjian Peng, Yuan Li, Wenyu Liu, Xinggang Wang

In this paper, we propose a Fast Network Adaptation (FNA++) method, which can adapt both the architecture and parameters of a seed network (e. g. an ImageNet pre-trained network) to become a network with different depths, widths, or kernel sizes via a parameter remapping technique, making it possible to use NAS for segmentation and detection tasks a lot more efficiently.

Image Classification Neural Architecture Search +5

Boundary Guidance Hierarchical Network for Real-Time Tongue Segmentation

no code implementations14 Mar 2020 Xinyi Zeng, Qian Zhang, Jia Chen, Guixu Zhang, Aimin Zhou, Yiqin Wang

Finally, the proposed hybrid loss in a four hierarchy-pixel, patch, map and boundary guides the network to effectively segment the tongue regions and accurate tongue boundaries.

Image Segmentation Semantic Segmentation

Active Lighting Recurrence by Parallel Lighting Analogy for Fine-Grained Change Detection

no code implementations22 Feb 2020 Qian Zhang, Wei Feng, Liang Wan, Fei-Peng Tian, Xiaowei Wang, Ping Tan

Besides, we also theoretically prove the invariance of our ALR approach to the ambiguity of normal and lighting decomposition.

Change Detection Navigate

Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss

5 code implementations7 Feb 2020 Qian Zhang, Han Lu, Hasim Sak, Anshuman Tripathi, Erik McDermott, Stephen Koo, Shankar Kumar

We present results on the LibriSpeech dataset showing that limiting the left context for self-attention in the Transformer layers makes decoding computationally tractable for streaming, with only a slight degradation in accuracy.

speech-recognition Speech Recognition

Fast Neural Network Adaptation via Parameter Remapping and Architecture Search

no code implementations ICLR 2020 Jiemin Fang, Yuzhu Sun, Kangjian Peng, Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang

In our experiments, we conduct FNA on MobileNetV2 to obtain new networks for both segmentation and detection that clearly out-perform existing networks designed both manually and by NAS.

Image Classification Neural Architecture Search +4

FasterSeg: Searching for Faster Real-time Semantic Segmentation

2 code implementations ICLR 2020 Wuyang Chen, Xinyu Gong, Xian-Ming Liu, Qian Zhang, Yuan Li, Zhangyang Wang

We present FasterSeg, an automatically designed semantic segmentation network with not only state-of-the-art performance but also faster speed than current methods.

Neural Architecture Search Real-Time Semantic Segmentation +1

AugFPN: Improving Multi-scale Feature Learning for Object Detection

2 code implementations CVPR 2020 Chaoxu Guo, Bin Fan, Qian Zhang, Shiming Xiang, Chunhong Pan

In this paper, we begin by first analyzing the design defects of feature pyramid in FPN, and then introduce a new feature pyramid architecture named AugFPN to address these problems.

Object object-detection +1

Learning Where to Focus for Efficient Video Object Detection

1 code implementation ECCV 2020 Zhengkai Jiang, Yu Liu, Ceyuan Yang, Jihao Liu, Peng Gao, Qian Zhang, Shiming Xiang, Chunhong Pan

Transferring existing image-based detectors to the video is non-trivial since the quality of frames is always deteriorated by part occlusion, rare pose, and motion blur.

Object object-detection +1

VarGFaceNet: An Efficient Variable Group Convolutional Neural Network for Lightweight Face Recognition

2 code implementations11 Oct 2019 Mengjia Yan, Mengao Zhao, Zining Xu, Qian Zhang, Guoli Wang, Zhizhong Su

To improve the discriminative and generalization ability of lightweight network for face recognition, we propose an efficient variable group convolutional network called VarGFaceNet.

Face Detection Face Identification +2

Sensor-Augmented Neural Adaptive Bitrate Video Streaming on UAVs

no code implementations23 Sep 2019 Xuedou Xiao, Wei Wang, Taobin Chen, Yang Cao, Tao Jiang, Qian Zhang

In this paper, we present SA-ABR, a new sensor-augmented system that generates ABR video streaming algorithms with the assistance of various kinds of inherent sensor data that are used to pilot UAVs.

Improved Mix-up with KL-Entropy for Learning From Noisy Labels

no code implementations15 Aug 2019 Qian Zhang, Feifei Lee, Ya-Gang Wang, Qiu Chen

On the websites, there exist a lot of image data which contains inaccurate annotations, but training on these datasets may make networks easier to over-fit the noisy labels and cause performance degradation.

Image Classification

FrameRank: A Text Processing Approach to Video Summarization

no code implementations11 Apr 2019 Zhuo Lei, Chao Zhang, Qian Zhang, Guoping Qiu

In constructing the dataset, because of the subjectivity of user-generated video summarization, we manually annotate 25 summaries for each video, which are in total 1300 summaries.

Unsupervised Video Summarization

Simultaneous Spectral-Spatial Feature Selection and Extraction for Hyperspectral Images

no code implementations8 Apr 2019 Lefei Zhang, Qian Zhang, Bo Du, Xin Huang, Yuan Yan Tang, DaCheng Tao

In a feature representation point of view, a nature approach to handle this situation is to concatenate the spectral and spatial features into a single but high dimensional vector and then apply a certain dimension reduction technique directly on that concatenated vector before feed it into the subsequent classifier.

Dimensionality Reduction feature selection +2

High Fidelity Face Manipulation with Extreme Poses and Expressions

no code implementations28 Mar 2019 Chaoyou Fu, Yibo Hu, Xiang Wu, Guoli Wang, Qian Zhang, Ran He

Furthermore, due to the lack of high-resolution face manipulation databases to verify the effectiveness of our method, we collect a new high-quality Multi-View Face (MVF-HQ) database.

Face Generation Face Recognition +1

Progressive Sparse Local Attention for Video object detection

no code implementations ICCV 2019 Chaoxu Guo, Bin Fan, Jie Gu, Qian Zhang, Shiming Xiang, Veronique Prinet, Chunhong Pan

Instead of relying on optical flow, this paper proposes a novel module called Progressive Sparse Local Attention (PSLA), which establishes the spatial correspondence between features across frames in a local region with progressively sparser stride and uses the correspondence to propagate features.

Object object-detection +2

Enhancing Remote Sensing Image Retrieval with Triplet Deep Metric Learning Network

no code implementations15 Feb 2019 Rui Cao, Qian Zhang, Jiasong Zhu, Qing Li, Qingquan Li, Bozhi Liu, Guoping Qiu

With the rapid growing of remotely sensed imagery data, there is a high demand for effective and efficient image retrieval tools to manage and exploit such data.

Image Retrieval Metric Learning +1

ECGadv: Generating Adversarial Electrocardiogram to Misguide Arrhythmia Classification System

1 code implementation12 Jan 2019 Huangxun Chen, Chenyu Huang, Qianyi Huang, Qian Zhang, Wei Wang

Deep neural networks (DNNs)-powered Electrocardiogram (ECG) diagnosis systems recently achieve promising progress to take over tedious examinations by cardiologists.

Classification General Classification

Cross-Technology Communications for Heterogeneous IoT Devices Through Artificial Doppler Shifts

no code implementations27 Nov 2018 Wei Wang, Shiyue He, Liang Sun, Tao Jiang, Qian Zhang

To this end, we propose DopplerFi, a communication framework that enables a two-way communication channel between BLE and Wi-Fi by injecting artificial Doppler shifts, which can be decoded by sensing the patterns in the Gaussian frequency shift keying (GFSK) demodulator and Channel State Information (CSI).

Networking and Internet Architecture

Joint Neural Architecture Search and Quantization

no code implementations23 Nov 2018 Yukang Chen, Gaofeng Meng, Qian Zhang, Xinbang Zhang, Liangchen Song, Shiming Xiang, Chunhong Pan

Here our goal is to automatically find a compact neural network model with high performance that is suitable for mobile devices.

Model Compression Neural Architecture Search +1

Mancs: A Multi-task Attentional Network with Curriculum Sampling for Person Re-identification

no code implementations ECCV 2018 Cheng Wang, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang

We propose a novel deep network called Mancs that solves the person re-identification problem from the following aspects: fully utilizing the attention mechanism for the person misalignment problem and properly sampling for the ranking loss to obtain more stable person representation.

Person Re-Identification

Reinforced Evolutionary Neural Architecture Search

1 code implementation1 Aug 2018 Yukang Chen, Gaofeng Meng, Qian Zhang, Shiming Xiang, Chang Huang, Lisen Mu, Xinggang Wang

To address this issue, we propose the Reinforced Evolutionary Neural Architecture Search (RE- NAS), which is an evolutionary method with the reinforced mutation for NAS.

Neural Architecture Search Semantic Segmentation

Unsupervised Domain Adaptive Re-Identification: Theory and Practice

3 code implementations30 Jul 2018 Liangchen Song, Cheng Wang, Lefei Zhang, Bo Du, Qian Zhang, Chang Huang, Xinggang Wang

We study the problem of unsupervised domain adaptive re-identification (re-ID) which is an active topic in computer vision but lacks a theoretical foundation.

General Classification Unsupervised Domain Adaptation

Quality Classified Image Analysis with Application to Face Detection and Recognition

no code implementations19 Jan 2018 Fei Yang, Qian Zhang, Miaohui Wang, Guoping Qiu

We will present experimental results to show that our quality classified framework can accurately classify images based on the type and severity of image degradations and can significantly boost the performances of state-of-the-art face detector and recognizer in dealing with image datasets containing mixed quality images.

Face Detection

Automatic Visual Theme Discovery from Joint Image and Text Corpora

no code implementations7 Sep 2016 Ke Sun, Xianxu Hou, Qian Zhang, Guoping Qiu

Furthermore, not all tags have the same descriptive power for visual contents and large vocabulary available from natural language could result in a very diverse set of keywords.

Clustering Descriptive +4

6D Dynamic Camera Relocalization From Single Reference Image

no code implementations CVPR 2016 Wei Feng, Fei-Peng Tian, Qian Zhang, Jizhou Sun

Based on inexpensive platform with unreliable absolute repositioning accuracy (ARA), we propose a hand-eye calibration free strategy to actively relocate camera into the same 6D pose that produces the input reference image, by sequentially correcting 3D relative rotation and translation.

Camera Relocalization Translation

3D Keypoint Detection Based on Deep Neural Network with Sparse Autoencoder

no code implementations30 Apr 2016 Xinyu Lin, Ce Zhu, Qian Zhang, Yipeng Liu

Researchers have proposed various methods to extract 3D keypoints from the surface of 3D mesh models over the last decades, but most of them are based on geometric methods, which lack enough flexibility to meet the requirements for various applications.

Keypoint Detection regression

Topical differences between Chinese language Twitter and Sina Weibo

no code implementations22 Dec 2015 Qian Zhang, Bruno Gonçalves

Using a large corpus of Weibo and Chinese language tweets, covering the period from January $1$ to December $31$, $2012$, we obtain a list of topics using clustered \#tags that we can then use to compare the two platforms.

Cultural Vocal Bursts Intensity Prediction

Fine-Grained Change Detection of Misaligned Scenes With Varied Illuminations

no code implementations ICCV 2015 Wei Feng, Fei-Peng Tian, Qian Zhang, Nan Zhang, Liang Wan, Jizhou Sun

To guarantee detection sensitivity and accuracy of minute changes, in an observation, we capture a group of images under multiple illuminations, which need only to be roughly aligned to the last time lighting conditions.

Change Detection

Prediction of the Yield of Enzymatic Synthesis of Betulinic Acid Ester Using Artificial Neural Networks and Support Vector Machine

no code implementations12 Nov 2015 Run Wang, Qiaoli Mo, Qian Zhang, Fudi Chen, Dazuo Yang

To simplify the number of times of optimization in experimental works, here, we use artificial neural network (ANN) and support vector machine (SVM) models for the prediction of yields of 3\b{eta}-O-phthalic ester of betulinic acid synthesized by betulinic acid and phthalic anhydride using lipase as biocatalyst.

Cannot find the paper you are looking for? You can Submit a new open access paper.