no code implementations • WMT (EMNLP) 2020 • Tingxun Shi, Shiyu Zhao, Xiaopu Li, Xiaoxue Wang, Qian Zhang, Di Ai, Dawei Dang, Xue Zhengshan, Jie Hao
In this paper we demonstrate our (OPPO’s) machine translation systems for the WMT20 Shared Task on News Translation for all 22 language pairs.
no code implementations • 30 Jun 2025 • Kaiwen Zhang, Zhenyu Tang, Xiaotao Hu, Xingang Pan, Xiaoyang Guo, YuAn Liu, Jingwei Huang, Li Yuan, Qian Zhang, Xiao-Xiao Long, Xun Cao, Wei Yin
In this work, we propose Epona, an autoregressive diffusion world model that enables localized spatiotemporal distribution modeling through two key innovations: 1) Decoupled spatiotemporal factorization that separates temporal dynamics modeling from fine-grained future world generation, and 2) Modular trajectory and video prediction that seamlessly integrate motion planning with visual modeling in an end-to-end framework.
1 code implementation • 29 May 2025 • Xinye Li, Zunwen Zheng, Qian Zhang, Dekai Zhuang, Jiabao Kang, Liyan Xu, Qingbin Liu, Xi Chen, Zhiying Tu, Dianhui Chu, Dianbo Sui
Knowledge Editing (KE) has gained increasing attention, yet current KE tasks remain relatively simple.
1 code implementation • 22 May 2025 • Huaiyuan Yao, Pengfei Li, Bu Jin, Yupeng Zheng, An Liu, Lisen Mu, Qing Su, Qian Zhang, Yilun Chen, Peng Li
Recent advances in autonomous driving research are moving towards motion planners that are robust, safe, and adaptive.
1 code implementation • 21 May 2025 • Andrew P. Berg, Qian Zhang, Mia Y. Wang
Unmanned Aerial Vehicles (UAVs) pose escalating security concerns as the market for consumer and military UAVs grows.
1 code implementation • 21 May 2025 • Andrew P. Berg, Qian Zhang, Mia Y. Wang
Unmanned aerial vehicle (UAV) usage is expected to surge in the coming decade, raising the need for heightened security measures to prevent airspace violations and security threats.
no code implementations • 20 May 2025 • Zhenyao Li, Shengwen Liao, Qian Zhang, Xuechun Zhang, Deqiang Gan
Integration of renewable resources is profoundly reshaping the dynamics of modern power systems.
1 code implementation • 8 May 2025 • Weichen Zhang, Chen Gao, Shiquan Yu, Ruiying Peng, Baining Zhao, Qian Zhang, Jinqiang Cui, Xinlei Chen, Yong Li
In this work, we propose CityNavAgent, a large language model (LLM)-empowered agent that significantly reduces the navigation complexity for urban aerial VLN.
no code implementations • 26 Mar 2025 • Xiangwen Zhang, Qian Zhang, Longfei Han, Qiang Qu, Xiaoming Chen
In this paper, we introduce AccidentSim, a novel framework that generates physically realistic vehicle collision videos by extracting and utilizing the physical clues and contextual information available in real-world vehicle accident reports.
1 code implementation • 24 Mar 2025 • Fiseha B. Tesema, Alejandro Guerra Manzanares, Tianxiang Cui, Qian Zhang, Moses Solomon, Sean He
Colorectal cancer (CRC) is a major global cause of cancer-related deaths, with early polyp detection and removal during colonoscopy being crucial for prevention.
no code implementations • 11 Mar 2025 • Miao Zhang, Zhenlong Fang, Tianyi Wang, Qian Zhang, Shuai Lu, Junfeng Jiao, Tianyu Shi
Traditional Reinforcement Learning (RL) struggles with replicating human-like behaviors, generalizing effectively in multi-agent scenarios, and overcoming inherent interpretability issues. These challenges are compounded when deep environment understanding, agent coordination, and dynamic optimization are required.
1 code implementation • 11 Mar 2025 • Jialv Zou, Bencheng Liao, Qian Zhang, Wenyu Liu, Xinggang Wang
The model fully leverages Mamba-2's high computational and memory efficiency, extending its capabilities from text generation to multimodal generation.
no code implementations • 10 Mar 2025 • Fu Rong, Meng Lan, Qian Zhang, Lefei Zhang
Although Segment Anything Model 2 (SAM 2) has shown remarkable performance in various segmentation tasks, its application to RRSIS presents several challenges, including understanding the text-described RS scenes and generating effective prompts from text descriptions.
1 code implementation • 10 Mar 2025 • Bo Jiang, Shaoyu Chen, Qian Zhang, Wenyu Liu, Xinggang Wang
Some studies integrate vision-language models (VLMs) into autonomous driving, but they typically rely on pre-trained models with simple supervised fine-tuning (SFT) on driving data, without further exploration of training strategies or optimizations specifically tailored for planning.
1 code implementation • CVPR 2025 • Zebin Xing, Xingyu Zhang, Yang Hu, Bo Jiang, Tong He, Qian Zhang, Xiaoxiao Long, Wei Yin
Furthermore, GoalFlow employs an efficient generative method, Flow Matching, to generate multimodal trajectories, and incorporates a refined scoring mechanism to select the optimal trajectory from the candidates.
Ranked #8 on NavSim on OpenScene
no code implementations • 20 Feb 2025 • Zongyou Yu, Qiang Qu, Qian Zhang, Nan Zhang, Xiaoming Chen
Recent advancements in event-based recognition have demonstrated significant promise, yet most existing approaches rely on extensive training, limiting their adaptability for efficient processing of event-driven visual content.
1 code implementation • 18 Feb 2025 • Bencheng Liao, Hongyuan Tao, Qian Zhang, Tianheng Cheng, Yingyue Li, Haoran Yin, Wenyu Liu, Xinggang Wang
We propose a seeding strategy to carve Mamba from a trained Transformer and a three-stage distillation recipe, which can effectively transfer knowledge from the Transformer to Mamba while preserving multimodal capabilities.
no code implementations • 18 Feb 2025 • Hao Gao, Shaoyu Chen, Bo Jiang, Bencheng Liao, Yiang Shi, Xiaoyang Guo, Yuechuan Pu, Haoran Yin, Xiangyu Li, Xinbang Zhang, Ying Zhang, Wenyu Liu, Qian Zhang, Xinggang Wang
By leveraging 3DGS techniques, we construct a photorealistic digital replica of the real physical world, enabling the AD policy to extensively explore the state space and learn to handle out-of-distribution scenarios through large-scale trial and error.
1 code implementation • 23 Jan 2025 • Fu Rong, Meng Lan, Qian Zhang, Lefei Zhang
Referring video object segmentation (RVOS) aims to segment objects in a video according to textual descriptions, which requires the integration of multimodal information and temporal dynamics perception.
Referring Expression Segmentation
Referring Video Object Segmentation
no code implementations • 22 Jan 2025 • Xu Zhang, Huan Zhang, Guoli Wang, Qian Zhang, Lefei Zhang, Bo Du
Existing underwater image restoration (UIR) methods generally only handle color distortion or jointly address color and haze issues, but they often overlook the more complex degradations that can occur in underwater scenes.
no code implementations • 21 Jan 2025 • Jiaxi Zhuang, Qian Zhang, Ying Qian
Retrosynthesis plays a crucial role in the fields of organic synthesis and drug development, where the goal is to identify suitable reactants that can yield a target product molecule.
no code implementations • 2 Jan 2025 • Qian Zhang, Dmitry Krotov, George Em Karniadakis
We formulate reconstruction as a mapping from incomplete observed data to full reconstructed fields.
1 code implementation • 27 Dec 2024 • Xiaotao Hu, Wei Yin, Mingkai Jia, Junyuan Deng, Xiaoyang Guo, Qian Zhang, Xiaoxiao Long, Ping Tan
However, prior works tend to produce unsatisfactory results, as the classic GPT framework is designed to handle 1D contextual information, such as text, and lacks the inherent ability to model the spatial and temporal dynamics essential for video generation.
1 code implementation • 18 Dec 2024 • Jiaqi Yang, Chu'ai Zhang, Zhengbao Wang, Xinyue Cao, Xuan Ouyang, Xiyu Zhang, Zhenxuan Zeng, Zhao Zeng, Borui Lu, Zhiyi Xia, Qian Zhang, Yulan Guo, Yanning Zhang
3D point cloud registration is a fundamental problem in computer vision, computer graphics, robotics, remote sensing, etc.
1 code implementation • Pattern Recognition 2024 • Qian Zhang, Yi Zhu, Filipe R. Cordeiro, Qiu Chen
Accordingly, their effectiveness is significantly influenced by the precision of the separated clean set, prior knowledge of noise, and the robustness of SSL.
Ranked #1 on Learning with noisy labels on CIFAR-10N-Random3
no code implementations • 17 Dec 2024 • Rixin Zhou, Honglin Pang, Qian Zhang, Ruihua Qi, Xi Yang, Chuntao Li
In real-world applications across specialized domains, addressing complex out-of-distribution (OOD) challenges is a common and significant concern.
1 code implementation • 9 Dec 2024 • Qian Zhang, Panfeng Chen, Jiali Li, Linkun Feng, Shuyu Liu, Heng Zhao, Mei Chen, Hui Li, Yanhao Wang
Through an in-depth analysis of experimental results, we offer insights into the ability of LLMs to answer pediatric questions in the Chinese context, highlighting their limitations for further improvements.
1 code implementation • 26 Nov 2024 • Junyuan Deng, Wei Yin, Xiaoyang Guo, Qian Zhang, Xiaotao Hu, Weiqiang Ren, Xiaoxiao Long, Ping Tan
Monocular camera calibration is essential for many 3D vision tasks.
1 code implementation • CVPR 2025 • Bencheng Liao, Shaoyu Chen, Haoran Yin, Bo Jiang, Cheng Wang, Sixu Yan, Xinbang Zhang, Xiangyu Li, Ying Zhang, Qian Zhang, Xinggang Wang
However, the numerous denoising steps in the robotic diffusion policy and the more dynamic, open-world nature of traffic scenes pose substantial challenges for generating diverse driving actions at a real-time speed.
Ranked #13 on NavSim on OpenScene
no code implementations • 11 Nov 2024 • Ruyin Wan, Qian Zhang, George Em Karniadakis
Spiking neural networks (SNNs) represent a promising approach in machine learning, combining the hierarchical learning capabilities of deep neural networks with the energy efficiency of spike-based computations.
1 code implementation • 29 Oct 2024 • Bo Jiang, Shaoyu Chen, Bencheng Liao, Xingyu Zhang, Wei Yin, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang
In contrast, Large Vision-Language Models (LVLMs) excel in scene understanding and reasoning.
no code implementations • 14 Oct 2024 • Songen Gu, Wei Yin, Bu Jin, Xiaoyang Guo, Junming Wang, Haodong Li, Qian Zhang, Xiaoxiao Long
The ability of this world model to capture the evolution of the environment is crucial for planning in autonomous driving.
no code implementations • 7 Oct 2024 • Junming Wang, Xingyu Zhang, Zebin Xing, Songen Gu, Xiaoyang Guo, Yang Hu, Ziying Song, Qian Zhang, Xiaoxiao Long, Wei Yin
In this paper, we propose HE-Drive: the first human-like-centric end-to-end autonomous driving system to generate trajectories that are both temporally consistent and comfortable.
no code implementations • 4 Oct 2024 • Qian Zhang
Moreover, to generate more expressive prompts, the study introduces a class-wise augmentation from the visual modality, resulting in significant robustness to a wider range of unseen classes.
no code implementations • 30 Sep 2024 • Junming Wang, Wei Yin, Xiaoxiao Long, Xingyu Zhang, Zebin Xing, Xiaoyang Guo, Qian Zhang
In this paper, we introduce OccRWKV, an efficient semantic occupancy network inspired by Receptance Weighted Key Value (RWKV).
1 code implementation • 28 Aug 2024 • Xu Zhang, Jiaqi Ma, Guoli Wang, Qian Zhang, Huan Zhang, Lefei Zhang
Existing All-in-One image restoration methods often fail to perceive degradation types and severity levels simultaneously, overlooking the importance of fine-grained quality perception.
no code implementations • 17 Aug 2024 • Qian Zhang, Le Xie
This paper studies some compression methods to accelerate the scenario-based chance-constrained security-constrained economic dispatch (SCED) problem.
no code implementations • 4 Aug 2024 • Guohang Zeng, Qian Zhang, Guangquan Zhang, Jie Lu
However, a limitation of these methods is that the mapping function is trained on overlapping users across domains, while only a small number of overlapping users are available for training.
1 code implementation • 2 Aug 2024 • Qian Zhang, Xiangzi Dai, Ninghua Yang, Xiang An, Ziyong Feng, Xingyu Ren
However, the original VAR model is constrained to class-conditioned synthesis, relying solely on textual captions for guidance.
no code implementations • 12 Jul 2024 • Qian Zhang, P. R. Kumar, Le Xie
This study addresses the transmission value of energy storage in electric grids.
no code implementations • 9 Jul 2024 • Jiajun Liang, Qian Zhang, Wei Deng, Qifan Song, Guang Lin
This work introduces a novel and efficient Bayesian federated learning algorithm, namely, the Federated Averaging stochastic Hamiltonian Monte Carlo (FA-HMC), for parameter estimation and uncertainty quantification.
no code implementations • 8 Jul 2024 • Luzhou Xu, Jaime Lien, Haiguang Li, Nicholas Gillian, Rajeev Nongpiur, Jihan Li, Qian Zhang, Jian Cui, David Jorgensen, Adam Bernstein, Lauren Bedal, Eiji Hayashi, Jin Yamanaka, Alex Lee, Jian Wang, D Shin, Ivan Poupyrev, Trausti Thormundsson, Anupam Pathak, Shwetak Patel
This study represents the first application of the noncontact HR detection technology to sleep and meditation tracking, offering a promising alternative to wearable devices for HR monitoring during sleep and meditation.
1 code implementation • 4 Jul 2024 • Yiang Shi, Tianheng Cheng, Qian Zhang, Wenyu Liu, Xinggang Wang
Owing to the inherent flexibility of the point-based representation, OSP achieves strong performance compared with existing methods and excels in terms of training and inference adaptability.
no code implementations • 25 Jun 2024 • Le Xie, Subir Majumder, Tong Huang, Qian Zhang, Ping Chang, David J. Hill, Mohammad Shahidehpour
Addressing the urgency of climate change necessitates a coordinated and inclusive effort from all relevant stakeholders.
1 code implementation • 28 May 2024 • Bencheng Liao, Xinggang Wang, Lianghui Zhu, Qian Zhang, Chang Huang
Recently, linear complexity sequence modeling networks have achieved modeling capabilities similar to Vision Transformers on a variety of computer vision tasks, while using fewer FLOPs and less memory.
no code implementations • 30 Apr 2024 • Guobin Shen, Dongcheng Zhao, Xiang He, Linghao Feng, Yiting Dong, Jihang Wang, Qian Zhang, Yi Zeng
Decoding non-invasive brain recordings is pivotal for advancing our understanding of human cognition but faces challenges due to individual differences and complex neural signal representations.
1 code implementation • Expert Systems with Applications 2024 • Qian Zhang, Yi Zhu, Ming Yang, Ge Jin, YingWen Zhu, Qiu Chen
Although sample selection is a mainstream method in the field of learning with noisy labels, which aims to mitigate the impact of noisy labels during model training, the testing performance of these methods exhibits significant fluctuations across different noise rates and types.
Ranked #3 on Learning with noisy labels on Clothing1M
1 code implementation • 13 Mar 2024 • Jialv Zou, Bencheng Liao, Qian Zhang, Wenyu Liu, Xinggang Wang
Learning robust and scalable visual representations from massive multi-view video data remains a challenge in computer vision and autonomous driving.
no code implementations • 29 Feb 2024 • Yi Zeng, Feifei Zhao, Yuxuan Zhao, Dongcheng Zhao, Enmeng Lu, Qian Zhang, Yuwei Wang, Hui Feng, Zhuoya Zhao, Jihang Wang, Qingqun Kong, Yinqian Sun, Yang Li, Guobin Shen, Bing Han, Yiting Dong, Wenxuan Pan, Xiang He, Aorigele Bao, Jin Wang
In this paper, we introduce a Brain-inspired and Self-based Artificial Intelligence (BriSe AI) paradigm.
1 code implementation • 20 Feb 2024 • Shaoyu Chen, Bo Jiang, Hao Gao, Bencheng Liao, Qing Xu, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang
Learning a human-like driving policy from large-scale driving demonstrations is promising, but the uncertainty and non-deterministic nature of planning make it challenging.
Ranked #27 on NavSim on OpenScene
1 code implementation • IEEE Transactions on Geoscience and Remote Sensing 2024 • Renhe Zhang, Qian Zhang, Guixu Zhang
Convolutional Neural Networks (CNNs) can capture local context information well but cannot model the global dependencies.
14 code implementations • 17 Jan 2024 • Lianghui Zhu, Bencheng Liao, Qian Zhang, Xinlong Wang, Wenyu Liu, Xinggang Wang
The results demonstrate that Vim is capable of overcoming the computation and memory constraints of performing Transformer-style understanding for high-resolution images, and that it has great potential to be the next-generation backbone for vision foundation models.
1 code implementation • 4 Jan 2024 • Xuanhua He, Tao Hu, Guoli Wang, Zejin Wang, Run Wang, Qian Zhang, Keyu Yan, Ziyi Chen, Rui Li, Chenjun Xie, Jie Zhang, Man Zhou
However, current methods often ignore the difference between cell phone RAW images and DSLR camera RGB images, a difference that goes beyond the color matrix and extends to spatial structure due to resolution variations.
no code implementations • 23 Dec 2023 • Jialu Zhang, Xiaoying Yang, Wentao He, Jianfeng Ren, Qian Zhang, Titian Zhao, Ruibin Bai, Xiangjian He, Jiang Liu
A set of rewards measuring the localization accuracy, the accuracy of predicted labels, and the scale consistency among nearby patches is designed in the agent to guide the scale optimization.
no code implementations • 5 Dec 2023 • Zhufeng Shao, Shoujin Wang, Qian Zhang, Wenpeng Lu, Zhao Li, Xueping Peng
This methodological rigor establishes a cohesive framework for the impartial evaluation of diverse NBR approaches.
no code implementations • 3 Nov 2023 • Qian Zhang, Deqiang Gan
This paper proposes a novel Gronwall inequality-based method for transient stability assessment for power systems.
no code implementations • 26 Oct 2023 • Junxiao Xue, Jie Wang, Xuecheng Wu, Qian Zhang
In this study, we comprehensively review the development of AVCA over the past decade, particularly focusing on the most advanced methods adopted to address the three major challenges of video feature extraction, expression subjectivity, and multimodal feature fusion.
1 code implementation • NeurIPS 2023 • Jialv Zou, Xinggang Wang, Jiahao Guo, Wenyu Liu, Qian Zhang, Chang Huang
In our work, we propose a novel perspective for circuit design by treating circuit components as point clouds and using Transformer-based point cloud perception methods to extract features from the circuit.
1 code implementation • 29 Sep 2023 • Cheng Guo, Leidong Fan, Qian Zhang, Hanyuan Liu, Kanglin Liu, Xiuhua Jiang
The latter requires more efficiency, thus the pre-calculated LUT (look-up table) has become a popular solution.
no code implementations • 28 Sep 2023 • Jindong Li, Guobin Shen, Dongcheng Zhao, Qian Zhang, Yi Zeng
As a further step in supporting high-performance SNNs on specialized hardware, we introduce FireFly v2, an FPGA SNN accelerator that can address the issue of non-spike operation in current SOTA SNN algorithms, which presents an obstacle in the end-to-end deployment onto existing SNN hardware.
1 code implementation • 21 Sep 2023 • Jiaxin Zhang, Shiyuan Chen, Haoran Yin, Ruohong Mei, Xuan Liu, Cong Yang, Qian Zhang, Wei Sui
The recent development of online static map element (a.k.a.
no code implementations • 21 Sep 2023 • Marisol Garrouste, Michael T. Craig, Daniel Wendt, Maria Herrera Diaz, William Jenson, Qian Zhang, Brendan Kochunas
Low carbon synfuel can displace transport fossil fuels such as diesel and jet fuel and help achieve the decarbonization of the transportation sector at a global scale, but large-scale cost-effective production facilities are needed.
no code implementations • 31 Aug 2023 • Qian Zhang, Chenxi Wu, Adar Kahana, Youngeun Kim, Yuhang Li, George Em Karniadakis, Priyadarshini Panda
We introduce a method to convert Physics-Informed Neural Networks (PINNs), commonly used in scientific machine learning, to Spiking Neural Networks (SNNs), which are expected to have higher energy efficiency compared to traditional Artificial Neural Networks (ANNs).
1 code implementation • 30 Aug 2023 • Mengping Yang, Zhe Wang, Wenyi Feng, Qian Zhang, Ting Xiao
Furthermore, the frequency awareness of the model is reinforced by encouraging the model to distinguish frequency signals.
1 code implementation • IEEE Geoscience and Remote Sensing Letters 2023 • Renhe Zhang, Zhechun Wan, Qian Zhang, Guixu Zhang
The local attention path (LAP) uses efficient stripe convolution to generate local attention, which can alleviate the loss of information caused by the down-sampling operation in the GAP and supplement the spatial details.
Extracting Buildings In Remote Sensing Images
Semantic Segmentation
no code implementations • 16 Aug 2023 • Tongda Xu, Qian Zhang, Yanghao Li, Dailan He, Zhe Wang, Yuanyuan Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang
We propose conditional perceptual quality, an extension of the perceptual quality defined by Blau and Michaeli (2018), by conditioning it on user-defined information.
2 code implementations • 10 Aug 2023 • Bencheng Liao, Shaoyu Chen, Yunchi Zhang, Bo Jiang, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang
We propose a unified permutation-equivalent modeling approach, i.e., modeling each map element as a point set with a group of equivalent permutations, which accurately describes the shape of the map element and stabilizes the learning process.
no code implementations • 5 Jul 2023 • Steve Hanneke, Shay Moran, Qian Zhang
Pseudo-cubes are a structure, rooted in the work of Daniely and Shalev-Shwartz (2014), and recently shown by Brukhim, Carmon, Dinur, Moran, and Yehudayoff (2022) to characterize PAC learnability (i.e., uniform rates) for multiclass classification.
1 code implementation • 23 Jun 2023 • Jiaqi Ma, Tianheng Cheng, Guoli Wang, Qian Zhang, Xinggang Wang, Lefei Zhang
We then leverage degradation-aware visual prompts to establish a controllable and universal model for image restoration, called ProRes, which is applicable to an extensive range of image restoration tasks.
no code implementations • 11 Jun 2023 • Jie Hu, Qian Zhang, Heng Yin
Large language models (LLMs) pre-trained on enormous natural language corpora have proved to be effective for understanding the implicit format syntax and generating format-conforming inputs.
1 code implementation • IEEE Geoscience and Remote Sensing Letters 2023 • Renhe Zhang, Qian Zhang, Guixu Zhang
Furthermore, unlike the previous single-skip-connection structure of U-shaped methods, we build a novel dual skip connection structure inside the model.
1 code implementation • 19 Apr 2023 • Shaoyu Chen, Yunchi Zhang, Bencheng Liao, Jiafeng Xie, Tianheng Cheng, Wei Sui, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang
We design a divide-and-conquer annotation scheme to solve the spatial extensibility problem of HD map generation, and abstract map elements with a variety of geometric patterns as a unified point sequence representation, which can be extended to most map elements in the driving scene.
no code implementations • 7 Apr 2023 • Shaoyu Chen, Tianheng Cheng, Jiemin Fang, Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang
Small object detection requires the detection head to scan a large number of positions on image feature maps, which is extremely hard for computation- and energy-efficient lightweight generic detectors.
1 code implementation • 28 Mar 2023 • Cheng Wang, Guoli Wang, Qian Zhang, Peng Guo, Wenyu Liu, Xinggang Wang
Fortunately, we have identified two observations that help us achieve the best of both worlds: 1) query-based methods demonstrate superiority over dense proposal-based methods in open-world instance segmentation, and 2) learning localization cues is sufficient for open-world instance segmentation.
1 code implementation • CVPR 2023 • Rixin Zhou, Jiafu Wei, Qian Zhang, Ruihua Qi, Xi Yang, Chuntao Li
The archaeological dating of bronze dings has played a critical role in the study of ancient Chinese history.
no code implementations • 24 Mar 2023 • Bikun Wang, Zhipeng Wang, Chenhao Zhu, Zhiqiang Zhang, Zhichen Wang, Penghong Lin, Jingchu Liu, Qian Zhang
We evaluate our method both in closed-loop simulation and real world driving, and demonstrate the neural network planner has outstanding performance in complex urban autonomous driving scenarios.
2 code implementations • ICCV 2023 • Bo Jiang, Shaoyu Chen, Qing Xu, Bencheng Liao, Jiajie Chen, Helong Zhou, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang
In this paper, we propose VAD, an end-to-end vectorized paradigm for autonomous driving, which models the driving scene as a fully vectorized representation.
Ranked #29 on Bench2Drive on Bench2Drive
1 code implementation • 15 Mar 2023 • Bencheng Liao, Shaoyu Chen, Bo Jiang, Tianheng Cheng, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang
Motivated by this, we propose to model the lane graph in a novel path-wise manner, which well preserves the continuity of the lane and encodes traffic information for planning.
no code implementations • 2 Feb 2023 • Meng Zhao, Yifan Hu, Ruixuan Jiang, Yuanli Zhao, Dong Zhang, Yan Zhang, Rong Wang, Yong Cao, Qian Zhang, Yonggang Ma, Jiaxi Li, Shaochen Yu, Wenjie Li, Ran Zhang, Yefeng Zheng, Shuo Wang, Jizong Zhao
Conclusions: The proposed deep learning algorithms can be an effective tool for early identification of hemorrhage etiologies based on NCCT scans.
no code implementations • 1 Feb 2023 • Meng Zhao, Yonggang Ma, Qian Zhang, Jizong Zhao
Objective: Reliable tools to predict moyamoya disease (MMD) patients at risk for hemorrhage could have significant value.
no code implementations • 5 Jan 2023 • Jindong Li, Guobin Shen, Dongcheng Zhao, Qian Zhang, Yi Zeng
To improve memory efficiency, we design a memory system to enable efficient synaptic weights and membrane voltage memory access with reasonable on-chip RAM consumption.
no code implementations • CVPR 2023 • Cong Pan, Yonghao He, Junran Peng, Qian Zhang, Wei Sui, Zhaoxiang Zhang
Moreover, we find that the image feature maps' resolution in the cross-attention module has a limited effect on the final performance.
Ranked #6 on Bird's-Eye View Semantic Segmentation on nuScenes
1 code implementation • 8 Dec 2022 • Jiaxin Zhang, Wei Sui, Qian Zhang, Tao Chen, Cong Yang
In this paper, we introduce a novel approach for ground plane normal estimation of wheeled vehicles.
no code implementations • 5 Dec 2022 • Bo Jiang, Shaoyu Chen, Xinggang Wang, Bencheng Liao, Tianheng Cheng, Jiajie Chen, Helong Zhou, Qian Zhang, Wenyu Liu, Chang Huang
Motion prediction is highly relevant to the perception of dynamic objects and static map elements in the scenarios of autonomous driving.
no code implementations • 20 Nov 2022 • Wei Deng, Qian Zhang, Qi Feng, Faming Liang, Guang Lin
Notably, in big data scenarios, we obtain an appealing communication cost $O(P\log P)$ based on the optimal window size.
no code implementations • 17 Nov 2022 • Qian Zhang, Adar Kahana, George Em Karniadakis, Panos Stinis
We propose a Spiking Neural Network (SNN)-based explicit numerical scheme for long time integration of time-dependent Ordinary and Partial Differential Equations (ODEs, PDEs).
no code implementations • 31 Oct 2022 • Hongxiang Jiang, Wenming Meng, Hongmei Zhu, Qian Zhang, Jihao Yin
In advanced paradigms of autonomous driving, learning Bird's Eye View (BEV) representation from surrounding views is crucial for multi-task frameworks.
2 code implementations • 14 Oct 2022 • Haomiao Ni, Yuan Xue, Liya Ma, Qian Zhang, Xiaoye Li, Xiaolei Huang
We collected a new clinical IMV dataset with GMA annotations, and our experiments show that SPN models for body parsing and pose estimation trained on the first two datasets generalize well to the new clinical dataset and their results can significantly boost the CRNN-based GMA prediction performance.
1 code implementation • CVPR 2023 • Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Qian Zhang, Wenyu Liu
Most existing methods for weakly supervised instance segmentation focus on designing heuristic losses with priors from bounding boxes.
no code implementations • 7 Sep 2022 • Zhufeng Shao, Shoujin Wang, Qian Zhang, Wenpeng Lu, Zhao Li, Xueping Peng
Different studies often evaluate NBR approaches on different datasets, under different experimental settings, making it hard to fairly and effectively compare the performance of different NBR approaches.
no code implementations • 31 Aug 2022 • Jiaqi Ma, Shengyuan Yan, Lefei Zhang, Guoli Wang, Qian Zhang
To obtain high-quality raw images for downstream Image Signal Processing (ISP), in this paper we present an Efficient Locally Multiplicative Transformer called ELMformer for raw image restoration.
1 code implementation • 30 Aug 2022 • Bencheng Liao, Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Qian Zhang, Wenyu Liu, Chang Huang
High-definition (HD) maps provide abundant and precise environmental information about the driving scene, serving as a fundamental and indispensable component for planning in autonomous driving systems.
Ranked #9 on 3D Lane Detection on OpenLane-V2 val
1 code implementation • 11 Aug 2022 • Chuanguang Yang, Zhulin An, Helong Zhou, Linhang Cai, Xiang Zhi, Jiwen Wu, Yongjun Xu, Qian Zhang
MixSKD mutually distills feature maps and probability distributions between the random pair of original images and their mixup images in a meaningful way.
no code implementations • 18 Jul 2022 • Yi Zeng, Dongcheng Zhao, Feifei Zhao, Guobin Shen, Yiting Dong, Enmeng Lu, Qian Zhang, Yinqian Sun, Qian Liang, Yuxuan Zhao, Zhuoya Zhao, Hongjian Fang, Yuwei Wang, Yang Li, Xin Liu, Chengcheng Du, Qingqun Kong, Zizhe Ruan, Weida Bi
These brain-inspired AI models have been effectively validated on various supervised, unsupervised, and reinforcement learning tasks, and they can be used to equip AI models with multiple brain-inspired cognitive functions.
1 code implementation • 5 Jul 2022 • Zhi Liu, Shaoyu Chen, Xiaojie Guo, Xinggang Wang, Tianheng Cheng, Hongmei Zhu, Qian Zhang, Wenyu Liu, Yi Zhang
In this work, we propose PolarBEV for vision-based uneven BEV representation learning.
1 code implementation • 22 Jun 2022 • Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Qian Zhang, Chang Huang, Wenyu Liu
Based on Polar Parametrization, we propose a surround-view 3D DEtection TRansformer, named PolarDETR.
1 code implementation • 13 Jun 2022 • Wenqiang Zhang, Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Qian Zhang, Wenyu Liu
The query mechanism introduced in the DETR method is changing the paradigm of object detection, and many recent query-based methods have obtained strong object detection performance.
1 code implementation • 9 Jun 2022 • Shaoyu Chen, Tianheng Cheng, Xinggang Wang, Wenming Meng, Qian Zhang, Wenyu Liu
GKT leverages the geometric priors to guide the transformer to focus on discriminative regions and unfolds kernel features to generate BEV representation.
1 code implementation • 28 May 2022 • Qian Zhang, Anuran Makur, Kamyar Azizzadenesheli
In particular, given $n$ samples with $d$ basis functions, we show estimation error upper bounds of $\widetilde O(\sqrt{d/n})$ for fixed design, random design, and adversarial context cases.
no code implementations • 27 May 2022 • Soheil Khorram, Jaeyoung Kim, Anshuman Tripathi, Han Lu, Qian Zhang, Hasim Sak
This paper introduces contrastive siamese (c-siam) network, an architecture for leveraging unlabeled acoustic data in speech recognition.
no code implementations • 24 May 2022 • Chenqing Hua, Sitao Luan, Qian Zhang, Jie Fu
Graph Neural Networks (GNNs) are new inference methods developed in recent years and are attracting growing attention due to their effectiveness and flexibility in solving inference and learning problems over graph-structured data.
no code implementations • 24 May 2022 • Jihang Wang, Dongcheng Zhao, Guobin Shen, Qian Zhang, Yi Zeng
Privacy protection is a crucial issue in machine learning algorithms, and current privacy protection methods are combined with traditional artificial neural networks based on real values.
no code implementations • 17 May 2022 • Adar Kahana, Qian Zhang, Leonard Gleyzer, George Em Karniadakis
We demonstrate this new approach for classification using the SNN in the branch, achieving results comparable to the literature.
no code implementations • CVPR 2022 • Huan Gao, Jichang Guo, Guoli Wang, Qian Zhang
The invariance of illumination or inherent difference between two images is fully explored so as to make up for the lack of labels for nighttime images.
no code implementations • 22 Apr 2022 • Shengze Wang, Youngjoong Kwon, Yuan Shen, Qian Zhang, Andrei State, Jia-Bin Huang, Henry Fuchs
Experiments on the HTI dataset show that our method outperforms the baseline in per-frame image fidelity and spatial-temporal consistency.
1 code implementation • CVPR 2022 • Chuanguang Yang, Helong Zhou, Zhulin An, Xue Jiang, Yongjun Xu, Qian Zhang
Current Knowledge Distillation (KD) methods for semantic segmentation often guide the student to mimic the teacher's structured information generated from individual data samples.
no code implementations • 13 Apr 2022 • Nan Li, Wei Feng, Qian Zhang
Active camera relocalization (ACR) is a new problem in computer vision that significantly reduces the false alarms caused by image distortions due to camera pose misalignment in fine-grained change detection (FGCD).
no code implementations • 1 Apr 2022 • Lihua Yang, Qing Zhang, Qian Zhang, Chao Huang
In order to establish the theory of filtering, windowed Fourier transform and wavelet transform in the setting of graph signals, we need to extend the shift operation of classical signals to graph signals.
no code implementations • 28 Mar 2022 • Xuedou Xiao, Juecheng Zhang, Wei Wang, Jianhua He, Qian Zhang
Existing compression algorithms are not fit for semantic segmentation, as the lack of obvious and concentrated regions of interest (RoIs) forces the adoption of uniform compression strategies, leading to low compression ratios or accuracy.
1 code implementation • CVPR 2022 • Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Wenqiang Zhang, Qian Zhang, Chang Huang, Wenyu Liu
For segmentation, we integrate AziNorm into KPConv.
2 code implementations • CVPR 2022 • Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Wenqiang Zhang, Qian Zhang, Chang Huang, Zhaoxiang Zhang, Wenyu Liu
In this paper, we propose a conceptually novel, efficient, and fully convolutional framework for real-time instance segmentation.
Ranked #7 on Real-time Instance Segmentation on MSCOCO
no code implementations • 17 Mar 2022 • Chaoyu Liu, Zhonghua Qiao, Qian Zhang
In this paper, we propose an active contour model with a local variance force (LVF) term that can be applied to multi-phase image segmentation problems.
no code implementations • 29 Jan 2022 • Qian Zhang, Wenpeng Lu
Based on a strong assumption of adjacent dependency, any two adjacent items in a session are necessarily dependent in most GNN-based SBRs.
no code implementations • 24 Jan 2022 • Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang
The cybersecurity breaches expose surveillance video streams to forgery attacks, under which authentic streams are falsified to hide unauthorized activities.
no code implementations • 4 Jan 2022 • Yabo Xiao, Dongdong Yu, Xiaojuan Wang, Lei Jin, Guoli Wang, Qian Zhang
Off-the-shelf single-stage multi-person pose regression methods generally leverage the instance score (i.e., confidence of the instance localization) to indicate the pose quality for selecting the pose candidates.
1 code implementation • 27 Dec 2021 • Yabo Xiao, Xiaojuan Wang, Dongdong Yu, Guoli Wang, Qian Zhang, Mingshu He
Multi-person pose estimation methods generally follow top-down and bottom-up paradigms, both of which can be considered two-stage approaches, thus leading to high computation cost and low efficiency.
no code implementations • 16 Dec 2021 • Wei Sui, Teng Chen, Jiaxin Zhang, Jiao Lu, Qian Zhang
The Depth-CNN and Pose-CNN estimate dense depth map and ego-motion respectively, solving SFM, while the Pose-CNN and Ground-CNN followed by a homography layer solve the ground plane estimation problem.
no code implementations • 9 Dec 2021 • Wei Deng, Qian Zhang, Yi-An Ma, Zhao Song, Guang Lin
We develop theoretical guarantees for FA-LD for strongly log-concave distributions with non-i.i.d. data and study how the injected noise and the stochastic-gradient noise, the heterogeneity of data, and the varying learning rates affect the convergence.
no code implementations • 24 Nov 2021 • Jialu Zhang, Qian Zhang, Jianfeng Ren, Yitian Zhao, Jiang Liu
Multi-label image classification is a fundamental but challenging task in computer vision.
no code implementations • 22 Nov 2021 • Haobo Yuan, Teng Chen, Wei Sui, Jiafeng Xie, Lefei Zhang, Yuan Li, Qian Zhang
It implies planar parallax and can be combined with the road plane serving as a reference to estimate the 3D structure by warping the consecutive frames.
no code implementations • 15 Nov 2021 • Dongcheng Zhao, Yang Li, Yi Zeng, Jihang Wang, Qian Zhang
Our Spiking CapsNet fully combines the strengths of SNN and CapsNet, and shows strong robustness to noise and affine transformation.
no code implementations • 12 Oct 2021 • Rongyao Wang, Wenpeng Lu, Shoujin Wang, Xueping Peng, Hao Wu, Qian Zhang
News recommender systems are essential for helping users efficiently and effectively find news of interest among a large amount of news.
no code implementations • 29 Sep 2021 • Wei Deng, Qian Zhang, Qi Feng, Faming Liang, Guang Lin
Parallel tempering (PT), also known as replica exchange, is the go-to workhorse for simulations of multi-modal distributions.
no code implementations • 18 Aug 2021 • Qian Zhang, Qing Guo, Ruijun Gao, Felix Juefei-Xu, Hongkai Yu, Wei Feng
To this end, we first propose the physical model-based adversarial relighting attack (ARA), denoted as the albedo-quotient-based adversarial relighting attack (AQ-ARA).
no code implementations • 11 Aug 2021 • Zijian Zhang, Chang Shu, Youxin Chen, Jing Xiao, Qian Zhang, Lu Zheng
Integrating multimodal knowledge for abstractive summarization task is a work-in-progress research area, with present techniques inheriting fusion-then-generation paradigm.
1 code implementation • ICCV 2021 • Shaoyu Chen, Jiemin Fang, Qian Zhang, Wenyu Liu, Xinggang Wang
Instance segmentation on point clouds is a fundamental task in 3D scene perception.
Ranked #4 on 3D Instance Segmentation on S3DIS (mCov metric, using extra training data)
no code implementations • CVPR 2021 • Guoli Wang, Jiaqi Ma, Qian Zhang, Jiwen Lu, Jie Zhou
Many of them settle it by generating fake frontal faces from extreme ones, but they struggle to maintain the identity information and suffer from high computational consumption and uncontrolled disturbances.
no code implementations • 5 Jun 2021 • Qian Zhang, Konstantina Sampani, Mengjia Xu, Shengze Cai, Yixiang Deng, He Li, Jennifer K. Sun, George Em Karniadakis
Microaneurysms (MAs) are one of the earliest signs of diabetic retinopathy (DR), a frequent complication of diabetes that can lead to visual impairment and blindness.
no code implementations • 25 May 2021 • Yang Liu, Qian Zhang, Yongyong Chen, Qiang Cheng, Chong Peng
It is a challenging task to remove heavy and mixed types of noise from Hyperspectral images (HSIs).
no code implementations • 23 May 2021 • Guoliang Hua, Hong Liu, Wenhao Li, Qian Zhang, Runwei Ding, Xin Xu
Instead, exploiting multi-view information is a practical way to achieve absolute 3D human pose estimation.
Monocular 3D Human Pose Estimation
Weakly-supervised 3D Human Pose Estimation
no code implementations • AAAI Technical Track on Machine Learning 2021 • Mengyun Chen, Kaixin Gao, Xiaolei Liu, Zidong Wang, Ningxi Ni, Qian Zhang, Lei Chen, Chao Ding, ZhengHai Huang, Min Wang, Shuangling Wang, Fan Yu, Xinyuan Zhao, Dachuan Xu
It is well known that second-order optimizers can accelerate the training of deep neural networks; however, the huge computation cost of second-order optimization makes them impractical to apply in practice.
no code implementations • 6 May 2021 • Jaeyoung Kim, Han Lu, Anshuman Tripathi, Qian Zhang, Hasim Sak
In the LibriSpeech evaluation, self-alignment outperformed existing schemes, with 25% and 56% less delay than FastEmit and constrained alignment, respectively, at a similar word error rate.
no code implementations • 18 Mar 2021 • Jiaxin Zhang, Wei Sui, Xinggang Wang, Wenming Meng, Hongmei Zhu, Qian Zhang
Second, the poses predicted by CNNs are further improved by minimizing photometric errors via gradient updates of poses during inference phases.
4 code implementations • 1 Feb 2021 • Helong Zhou, Liangchen Song, Jiajie Chen, Ye Zhou, Guoli Wang, Junsong Yuan, Qian Zhang
The outputs from the teacher network are used as soft labels for supervising the training of a new network.
Ranked #36 on Knowledge Distillation on ImageNet
no code implementations • 12 Jan 2021 • Xuanyu He, Wei Zhang, Ran Song, Qian Zhang, Xiangyuan Lan, Lin Ma
By studying two unsupervised person re-ID methods in a cross-method way, we point out that a hard negative problem is handled implicitly by their designs of data augmentation and PK sampler, respectively.
no code implementations • 4 Jan 2021 • Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang
Traditional video forensics approaches can detect and localize forgery traces in each video frame using computationally-expensive spatial-temporal analysis, while falling short in real-time verification of live video feeds.
Time Series Analysis
Video Forensics
Cryptography and Security
no code implementations • ICCV 2021 • Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan
This task is confronted with two challenges: how to establish the 3D correspondences from views to the BEV map and how to assemble occupancy information across views.
Ranked #2 on Multiview Detection on CVCS (MODA (1m) metric)
no code implementations • ICLR 2021 • Helong Zhou, Liangchen Song, Jiajie Chen, Ye Zhou, Guoli Wang, Junsong Yuan, Qian Zhang
In this paper, we investigate the bias-variance tradeoff brought by distillation with soft labels.
no code implementations • 31 Dec 2020 • Yixuan Sun, Chengyao Li, Qian Zhang, Aimin Zhou, Guixu Zhang
In recent years, the prevalence of several pulmonary diseases, especially the coronavirus disease 2019 (COVID-19) pandemic, has attracted worldwide attention.
no code implementations • 23 Dec 2020 • Qian Zhang, Xinyuan Zhao, Chao Ding
Euclidean embedding from noisy observations containing outlier errors is an important and challenging problem in statistics and machine learning.
1 code implementation • 21 Dec 2020 • Jie Qin, Jiemin Fang, Qian Zhang, Wenyu Liu, Xingang Wang, Xinggang Wang
Especially, CutMix uses a simple but effective method to improve the classifiers by randomly cropping a patch from one image and pasting it on another image.
no code implementations • 17 Dec 2020 • Masahiro Sato, Sho Takemori, Janmajay Singh, Qian Zhang
In this work, we unify traditional neighborhood recommendation methods with the matching estimator, and develop robust ranking methods for the causal effect of recommendations.
no code implementations • 11 Dec 2020 • Yufeng Luo, Roland Haas, Qian Zhang, Gabrielle Allen
Data sharing is essential in numerical simulation research.
General Relativity and Quantum Cosmology
Databases
1 code implementation • 8 Dec 2020 • Ioanna Miliou, Xinyue Xiong, Salvatore Rinzivillo, Qian Zhang, Giulio Rossetti, Fosca Giannotti, Dino Pedreschi, Alessandro Vespignani
In this paper, we propose the use of a novel data source, namely retail market data, to improve seasonal influenza forecasting.
no code implementations • 10 Nov 2020 • Jiaqi Yang, Zhiqiang Huang, Siwen Quan, Qian Zhang, Yanning Zhang, Zhiguo Cao
This paper focuses on developing efficient and robust evaluation metrics for RANSAC hypotheses to achieve accurate 3D rigid registration.
no code implementations • 3 Nov 2020 • Chong Peng, Qian Zhang, Zhao Kang, Chenglizhao Chen, Qiang Cheng
It directly uses 2D data as inputs such that the learning of representations benefits from inherent structures and relationships of the data.
no code implementations • 7 Oct 2020 • Anshuman Tripathi, Jaeyoung Kim, Qian Zhang, Han Lu, Hasim Sak
In this paper we present a Transformer-Transducer model architecture and a training technique to unify streaming and non-streaming speech recognition models into one model.
no code implementations • 17 Sep 2020 • Chaoyou Fu, Guoli Wang, Xiang Wu, Qian Zhang, Ran He
It embodies the uncertainty of the hashing network to the corresponding input image.
no code implementations • 16 Aug 2020 • Xinyu Gong, Wuyang Chen, Yifan Jiang, Ye Yuan, Xian-Ming Liu, Qian Zhang, Yuan Li, Zhangyang Wang
Such simplification limits the fusion of information at different scales and fails to maintain high-resolution representations.
1 code implementation • 13 Aug 2020 • Jialian Wu, Liangchen Song, Tiancai Wang, Qian Zhang, Junsong Yuan
In the classification tree, as the number of parent class nodes is significantly smaller, their logits are less noisy and can be utilized to suppress the wrong/noisy logits in the fine-grained class nodes.
Ranked #5 on Few-Shot Object Detection on LVIS v1.0 val
1 code implementation • 16 Jul 2020 • Haomiao Ni, Yuan Xue, Qian Zhang, Xiaolei Huang
In this paper, we propose a semi-supervised body parsing model, termed SiamParseNet (SPN), to jointly learn single frame body parsing and label propagation between frames in a semi-supervised fashion.
no code implementations • WS 2020 • Qian Zhang, Xiaopu Li, Dawei Dang, Tingxun Shi, Di Ai, Zhengshan Xue, Jie Hao
In this paper, we demonstrate our machine translation system applied for the Chinese-Japanese bidirectional translation task (aka.
no code implementations • 22 Jun 2020 • Qian Zhang, Yilin Zheng, Jean Honorio
Then for the novel task, we prove that the minimization of the $\ell_1$-regularized log-determinant Bregman divergence with the additional constraint that the support is a subset of the estimated support union could reduce the sufficient sample complexity of successful support recovery to $O(\log(|S_{\text{off}}|))$ where $|S_{\text{off}}|$ is the number of off-diagonal elements in the support union and is much less than $N$ for sparse matrices.
2 code implementations • 21 Jun 2020 • Jiemin Fang, Yuzhu Sun, Qian Zhang, Kangjian Peng, Yuan Li, Wenyu Liu, Xinggang Wang
In this paper, we propose a Fast Network Adaptation (FNA++) method, which can adapt both the architecture and parameters of a seed network (e.g., an ImageNet pre-trained network) to become a network with different depths, widths, or kernel sizes via a parameter remapping technique, making it possible to use NAS for segmentation and detection tasks a lot more efficiently.
no code implementations • 14 Mar 2020 • Xinyi Zeng, Qian Zhang, Jia Chen, Guixu Zhang, Aimin Zhou, Yiqin Wang
Finally, the proposed hybrid loss over four hierarchies (pixel, patch, map, and boundary) guides the network to effectively segment the tongue regions and produce accurate tongue boundaries.
no code implementations • 22 Feb 2020 • Qian Zhang, Wei Feng, Liang Wan, Fei-Peng Tian, Xiaowei Wang, Ping Tan
Besides, we also theoretically prove the invariance of our ALR approach to the ambiguity of normal and lighting decomposition.
5 code implementations • 7 Feb 2020 • Qian Zhang, Han Lu, Hasim Sak, Anshuman Tripathi, Erik McDermott, Stephen Koo, Shankar Kumar
We present results on the LibriSpeech dataset showing that limiting the left context for self-attention in the Transformer layers makes decoding computationally tractable for streaming, with only a slight degradation in accuracy.
no code implementations • ICLR 2020 • Jiemin Fang, Yuzhu Sun, Kangjian Peng, Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang
In our experiments, we conduct FNA on MobileNetV2 to obtain new networks for both segmentation and detection that clearly outperform existing networks designed both manually and by NAS.
1 code implementation • 6 Jan 2020 • Deyu Yin, Qian Zhang, Jingbin Liu, Xinlian Liang, Yunsheng Wang, Jyri Maanpää, Hao Ma, Juha Hyyppä, Ruizhi Chen
As an important technology in 3D mapping, autonomous driving, and robot navigation, LiDAR odometry is still a challenging task.
2 code implementations • ICLR 2020 • Wuyang Chen, Xinyu Gong, Xian-Ming Liu, Qian Zhang, Yuan Li, Zhangyang Wang
We present FasterSeg, an automatically designed semantic segmentation network with not only state-of-the-art performance but also faster speed than current methods.
Ranked #1 on Semantic Segmentation on BDD
Neural Architecture Search
Real-Time Semantic Segmentation
2 code implementations • CVPR 2020 • Chaoxu Guo, Bin Fan, Qian Zhang, Shiming Xiang, Chunhong Pan
In this paper, we begin by first analyzing the design defects of feature pyramid in FPN, and then introduce a new feature pyramid architecture named AugFPN to address these problems.
1 code implementation • ECCV 2020 • Zhengkai Jiang, Yu Liu, Ceyuan Yang, Jihao Liu, Peng Gao, Qian Zhang, Shiming Xiang, Chunhong Pan
Transferring existing image-based detectors to the video is non-trivial since the quality of frames is always deteriorated by partial occlusion, rare pose, and motion blur.
Ranked #25 on Video Object Detection on ImageNet VID
3 code implementations • 11 Oct 2019 • Mengjia Yan, Mengao Zhao, Zining Xu, Qian Zhang, Guoli Wang, Zhizhong Su
To improve the discriminative and generalization ability of lightweight networks for face recognition, we propose an efficient variable group convolutional network called VarGFaceNet.
Ranked #3 on Face Verification on CFP-FP
no code implementations • 23 Sep 2019 • Xuedou Xiao, Wei Wang, Taobin Chen, Yang Cao, Tao Jiang, Qian Zhang
In this paper, we present SA-ABR, a new sensor-augmented system that generates ABR video streaming algorithms with the assistance of various kinds of inherent sensor data that are used to pilot UAVs.
no code implementations • 15 Aug 2019 • Qian Zhang, Feifei Lee, Ya-Gang Wang, Qiu Chen
Websites host a large amount of image data with inaccurate annotations, and training on such datasets can make networks prone to over-fitting the noisy labels, causing performance degradation.
no code implementations • 15 Jul 2019 • Haisheng Fu, Feng Liang, Bo Lei, Nai Bian, Qian Zhang, Mohammad Akbari, Jie Liang, Chengjie Tu
Recently, deep learning-based methods have been applied to image compression and achieved many promising results.
1 code implementation • 12 Jul 2019 • Qian Zhang, Jianjun Li, Meng Yao, Liangchen Song, Helong Zhou, Zhichao Li, Wenming Meng, Xuezhi Zhang, Guoli Wang
In this paper, we propose a novel network design mechanism for efficient embedded computing.
Ranked #5 on Face Verification on CFP-FP
1 code implementation • CVPR 2020 • Jiemin Fang, Yuzhu Sun, Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang
We revisit the search space design in most previous NAS methods and find that the number and widths of blocks are set manually.
Ranked #91 on Neural Architecture Search on ImageNet
no code implementations • 11 Apr 2019 • Zhuo Lei, Chao Zhang, Qian Zhang, Guoping Qiu
In constructing the dataset, because of the subjectivity of user-generated video summarization, we manually annotate 25 summaries for each video, for a total of 1300 summaries.
no code implementations • 11 Apr 2019 • Qian Zhang, Li Wang, Xiaopeng Zong, Weili Lin, Gang Li, Dinggang Shen
Skull stripping for brain MR images is a basic segmentation task.
no code implementations • 8 Apr 2019 • Lefei Zhang, Qian Zhang, Bo Du, Xin Huang, Yuan Yan Tang, DaCheng Tao
From a feature representation point of view, a natural approach to handle this situation is to concatenate the spectral and spatial features into a single but high-dimensional vector and then apply a certain dimension reduction technique directly on that concatenated vector before feeding it into the subsequent classifier.
no code implementations • 28 Mar 2019 • Chaoyou Fu, Yibo Hu, Xiang Wu, Guoli Wang, Qian Zhang, Ran He
Furthermore, due to the lack of high-resolution face manipulation databases to verify the effectiveness of our method, we collect a new high-quality Multi-View Face (MVF-HQ) database.
no code implementations • ICCV 2019 • Chaoxu Guo, Bin Fan, Jie Gu, Qian Zhang, Shiming Xiang, Veronique Prinet, Chunhong Pan
Instead of relying on optical flow, this paper proposes a novel module called Progressive Sparse Local Attention (PSLA), which establishes the spatial correspondence between features across frames in a local region with progressively sparser stride and uses the correspondence to propagate features.
no code implementations • 15 Feb 2019 • Rui Cao, Qian Zhang, Jiasong Zhu, Qing Li, Qingquan Li, Bozhi Liu, Guoping Qiu
With the rapid growth of remotely sensed imagery data, there is a high demand for effective and efficient image retrieval tools to manage and exploit such data.
1 code implementation • 17 Jan 2019 • Jiemin Fang, Yukang Chen, Xinbang Zhang, Qian Zhang, Chang Huang, Gaofeng Meng, Wenyu Liu, Xinggang Wang
In our implementations, architectures are first searched on a small dataset, e.g., CIFAR-10.
1 code implementation • 12 Jan 2019 • Huangxun Chen, Chenyu Huang, Qianyi Huang, Qian Zhang, Wei Wang
Deep neural network (DNN)-powered electrocardiogram (ECG) diagnosis systems have recently made promising progress toward taking over tedious examinations by cardiologists.
no code implementations • 27 Nov 2018 • Wei Wang, Shiyue He, Liang Sun, Tao Jiang, Qian Zhang
To this end, we propose DopplerFi, a communication framework that enables a two-way communication channel between BLE and Wi-Fi by injecting artificial Doppler shifts, which can be decoded by sensing the patterns in the Gaussian frequency shift keying (GFSK) demodulator and Channel State Information (CSI).
Networking and Internet Architecture
no code implementations • 23 Nov 2018 • Yukang Chen, Gaofeng Meng, Qian Zhang, Xinbang Zhang, Liangchen Song, Shiming Xiang, Chunhong Pan
Here our goal is to automatically find a compact neural network model with high performance that is suitable for mobile devices.
no code implementations • 6 Nov 2018 • Fei Yang, Qian Zhang, Chi Zheng, Guoping Qiu
In the computer research area, facial expression recognition is a hot research problem.
no code implementations • ECCV 2018 • Cheng Wang, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang
We propose a novel deep network called Mancs that solves the person re-identification problem from the following aspects: fully utilizing the attention mechanism for the person misalignment problem and properly sampling for the ranking loss to obtain more stable person representation.
1 code implementation • 1 Aug 2018 • Yukang Chen, Gaofeng Meng, Qian Zhang, Shiming Xiang, Chang Huang, Lisen Mu, Xinggang Wang
To address this issue, we propose the Reinforced Evolutionary Neural Architecture Search (RE-NAS), which is an evolutionary method with the reinforced mutation for NAS.
3 code implementations • 30 Jul 2018 • Liangchen Song, Cheng Wang, Lefei Zhang, Bo Du, Qian Zhang, Chang Huang, Xinggang Wang
We study the problem of unsupervised domain adaptive re-identification (re-ID) which is an active topic in computer vision but lacks a theoretical foundation.
no code implementations • 19 Jan 2018 • Fei Yang, Qian Zhang, Miaohui Wang, Guoping Qiu
We will present experimental results to show that our quality classified framework can accurately classify images based on the type and severity of image degradations and can significantly boost the performances of state-of-the-art face detector and recognizer in dealing with image datasets containing mixed quality images.
no code implementations • 7 Sep 2016 • Ke Sun, Xianxu Hou, Qian Zhang, Guoping Qiu
Furthermore, not all tags have the same descriptive power for visual contents and large vocabulary available from natural language could result in a very diverse set of keywords.
no code implementations • CVPR 2016 • Wei Feng, Fei-Peng Tian, Qian Zhang, Jizhou Sun
Based on an inexpensive platform with unreliable absolute repositioning accuracy (ARA), we propose a hand-eye calibration free strategy to actively relocate the camera into the same 6D pose that produces the input reference image, by sequentially correcting 3D relative rotation and translation.
no code implementations • 30 Apr 2016 • Xinyu Lin, Ce Zhu, Qian Zhang, Yipeng Liu
Researchers have proposed various methods to extract 3D keypoints from the surface of 3D mesh models over the last decades, but most of them are based on geometric methods, which lack enough flexibility to meet the requirements for various applications.
no code implementations • 22 Dec 2015 • Qian Zhang, Bruno Gonçalves
Using a large corpus of Weibo and Chinese language tweets, covering the period from January 1 to December 31, 2012, we obtain a list of topics using clustered #tags that we can then use to compare the two platforms.
no code implementations • ICCV 2015 • Wei Feng, Fei-Peng Tian, Qian Zhang, Nan Zhang, Liang Wan, Jizhou Sun
To guarantee detection sensitivity and accuracy of minute changes, in an observation, we capture a group of images under multiple illuminations, which need only to be roughly aligned to the lighting conditions of the previous observation.
no code implementations • 12 Nov 2015 • Run Wang, Qiaoli Mo, Qian Zhang, Fudi Chen, Dazuo Yang
To reduce the number of optimization runs in experimental work, here, we use artificial neural network (ANN) and support vector machine (SVM) models for the prediction of yields of 3β-O-phthalic ester of betulinic acid synthesized from betulinic acid and phthalic anhydride using lipase as biocatalyst.