no code implementations • NLP4ConvAI (ACL) 2022 • Zhiqi Huang, Milind Rao, Anirudh Raju, Zhe Zhang, Bach Bui, Chul Lee
The proposed framework benefits from three key aspects: 1) pre-trained sub-networks of the ASR model and language model; 2) a multi-task learning objective to exploit shared knowledge from different tasks; 3) end-to-end training of ASR and the downstream NLP task based on a sequence loss.
Automatic Speech Recognition (ASR)
no code implementations • 30 May 2023 • Yanan Zhang, Weijie Cui, Yangfan Zhang, Xiaoling Bai, Zhe Zhang, Jin Ma, Xiang Chen, Tianhua Zhou
In search engines, query expansion (QE) is a crucial technique to improve search experience.
no code implementations • 13 May 2023 • Dandan Zhao, Zhe Zhang, Dongdong Lu, Jian Kang, Xiaolan Qiu, Yirong Wu
Although convolutional neural networks have been successfully employed for SAR image target recognition, surpassing traditional algorithms, most existing research concentrates on the amplitude domain and neglects the essential phase information.
1 code implementation • CVPR 2023 • Jiacheng Deng, Chuxin Wang, Jiahao Lu, Jianfeng He, Tianzhu Zhang, Jiyang Yu, Zhe Zhang
The key of our approach is to exploit an orientation estimation module with a domain adaptive discriminator to align the orientations of point cloud pairs, which significantly alleviates the mispredictions of symmetrical parts.
Ranked #1 on 3D Dense Shape Correspondence on SHREC'19 (using extra training data)
no code implementations • 10 Apr 2023 • Guoru Zhou, Zhongqiu Xu, Yizhe Fan, Zhe Zhang, Xiaolan Qiu, Bingchen Zhang, Kun Fu, Yirong Wu
High resolution is a key trend in the development of synthetic aperture radar (SAR), as it enables the capture of fine details and an accurate representation of backscattering properties.
no code implementations • 20 Mar 2023 • Yuwei Wu, Zhe Zhang, Xiaolan Qiu, Yao Zhao, Weidong Yu
repetition frequency (PRF).
no code implementations • 8 Mar 2023 • Silin Gao, Zhe Zhang, Muhan Wang, Yan Zhang, Jie Zhao, Bingchen Zhang, Yue Wang, Yirong Wu
This paper focuses on the gridless direction-of-arrival (DoA) estimation for data acquired by non-uniform linear arrays (NLAs) in automotive applications.
no code implementations • 26 Jan 2023 • Ryan Humble, Zhe Zhang, Finn O'Shea, Eric Darve, Daniel Ratner
Anomaly detection is an important task for complex systems (e.g., industrial facilities, manufacturing, large-scale science experiments), where failures in a sub-system can lead to low yield, faulty products, or even damage to components.
no code implementations • 23 Jan 2023 • Gurunath Reddy M, Zhe Zhang, Yi Yu, Florian Harscoet, Simon Canales, Suhua Tang
We propose a deep attention-based alignment network, which aims to automatically predict lyrics and melody with given incomplete lyrics as input in a way similar to the music creation of humans.
no code implementations • CVPR 2023 • Jianfeng He, Yuan Gao, Tianzhu Zhang, Zhe Zhang, Feng Wu
Second, the HKDL module can generate keypoint detectors in a hierarchical way, which is helpful for detecting keypoints with diverse levels of structures.
1 code implementation • CVPR 2023 • Zhe Zhang, Rui Peng, Yuxi Hu, Ronggang Wang
To intensify the full-scene geometry perception of our model, we present the depth distribution similarity loss based on the Gaussian-Mixture Model assumption.
Ranked #1 on Point Clouds on Tanks and Temples
no code implementations • 1 Dec 2022 • Pengyu Jiang, Zhe Zhang, Bingchen Zhang, Zhongqiu Xu
In this paper, we propose a nested TomoSAR technique, which introduces the nested array into TomoSAR as the baseline configuration.
no code implementations • 30 Nov 2022 • Muhan Wang, Zhe Zhang, Xiaolan Qiu, Silin Gao, Yue Wang
In addition, an adaptive threshold is introduced for each azimuth-range pixel, so that the threshold shrinkage varies not only by layer but also element-wise.
no code implementations • 27 Oct 2022 • Lei Kou, Chuang Liu, Guowei Cai, Zhe Zhang
Secondly, the wavelet transform is used to remove redundant data from the features, which greatly compresses the training sample data.
no code implementations • 26 Sep 2022 • Chuang Liu, Lei Kou, Guowei Cai, Zihan Zhao, Zhe Zhang
Power electronics converters are widely used in aerospace systems, DC transmission, distributed energy, smart grids, and so forth, and their reliability has become a hot topic in academia and industry.
1 code implementation • 16 Sep 2022 • Zhe Zhang, Yukun Zou, Junjie Lai, Qing Xu
Deep Q-learning Network (DQN) is a successful method that combines reinforcement learning with deep neural networks and has led to the widespread application of reinforcement learning.
no code implementations • 9 Sep 2022 • Shuoguang Yang, Zhe Zhang, Ethan X. Fang
Stochastic compositional optimization (SCO) has attracted considerable attention because of its broad applicability to important real-world problems.
1 code implementation • 13 Aug 2022 • Vincent Jeanselme, Maria De-Arteaga, Zhe Zhang, Jessica Barrett, Brian Tom
First, we provide a structured view of the relationship between clinical presence mechanisms and group-specific missingness patterns.
no code implementations • 30 Jun 2022 • Wei Duan, Zhe Zhang, Yi Yu, Keizo Oyama
Generating melody from lyrics is an interesting yet challenging task in the area of artificial intelligence and music.
no code implementations • 28 May 2022 • Kan Xie, Zhe Zhang, Bo Li, Jiawen Kang, Dusit Niyato, Shengli Xie, Yi Wu
However, machine learning-based traffic sign recognition on the Internet of Vehicles (IoV) requires gathering a large amount of traffic sign data from distributed vehicles on a centralized server for model training, which poses a serious privacy leakage risk because traffic sign data contains a great deal of location privacy information.
1 code implementation • 24 May 2022 • Jinghui Xiao, Qun Liu, Xin Jiang, Yuanfeng Xiong, Haiteng Wu, Zhe Zhang
The Pinyin-to-Character conversion (P2C) task is the key task of the Input Method Engine (IME) in commercial input software for Asian languages such as Chinese, Japanese, and Thai.
no code implementations • 5 May 2022 • Muhan Wang, Zhe Zhang, Yue Wang, Silin Gao, Xiaolan Qiu
Synthetic aperture radar (SAR) tomography (TomoSAR) has attracted remarkable interest for its ability to achieve three-dimensional reconstruction along the elevation direction from multiple observations.
no code implementations • 26 Apr 2022 • Silin Gao, Zhe Zhang, Bingchen Zhang, Yirong Wu
Resolution along the elevation direction can be treated as a line spectrum estimation problem.
no code implementations • 20 Mar 2022 • Zhe Zhang, Yaozhong Gan, Xiaoyang Tan
Advantage Learning (AL) seeks to increase the action gap between the optimal action and its competitors, so as to improve the robustness to estimation errors.
no code implementations • 20 Mar 2022 • Yaozhong Gan, Zhe Zhang, Xiaoyang Tan
Advantage learning (AL) aims to improve the robustness of value-based reinforcement learning against estimation errors with action-gap-based regularization.
no code implementations • 16 Mar 2022 • Ruizhe Shi, Zhe Zhang, Xiaolan Qiu, Chibiao Ding
Numerical simulations and real data experiments show that the proposed GDLS algorithm outperforms state-of-the-art methods, e.g., CS and ANM, in terms of estimation performance.
no code implementations • 10 Mar 2022 • Guanghui Lan, Zhe Zhang
Specifically, the DRAO method achieves the optimal communication complexity by assuming a certain saddle point subproblem can be easily solved in the server node.
no code implementations • 3 Jan 2022 • Zhe Zhang, Shiyao Ma, Zhaohui Yang, Zehui Xiong, Jiawen Kang, Yi Wu, Kejia Zhang, Dusit Niyato
This emerging technology relies on sharing ground truth labeled data between Unmanned Aerial Vehicle (UAV) swarms to train a high-quality automatic image recognition model.
no code implementations • CVPR 2022 • Jiamin Wu, Tianzhu Zhang, Zhe Zhang, Feng Wu, Yongdong Zhang
To address this issue, we propose an end-to-end Motion-modulated Temporal Fragment Alignment Network (MTFAN) by jointly exploring the task-specific motion modulation and the multi-level temporal fragment alignment for Few-Shot Action Recognition (FSAR).
no code implementations • NeurIPS 2021 • Sheng Zhang, Zhe Zhang, Siva Theja Maguluri
The focus of this paper is on sample complexity guarantees of average-reward reinforcement learning algorithms, which are known to be more challenging to study than their discounted-reward counterparts.
no code implementations • 1 Dec 2021 • Jie Zhu, Bo Peng, Wanqing Li, Haifeng Shen, Zhe Zhang, Jianjun Lei
It is built upon Transformer and is capable of extracting dense features with global context and 3D consistency, which are crucial to achieving reliable matching for MVS.
no code implementations • 26 Oct 2021 • Zhe Zhang, Shiyao Ma, Jiangtian Nie, Yi Wu, Qiang Yan, Xiaoke Xu, Dusit Niyato
In this paper, we present a robust semi-supervised FL system design, where the system aims to solve the problems of data availability and non-IID data in FL.
no code implementations • 20 Oct 2021 • Zhe Zhang, Bingchen Zhang, Chenglong Jiang, Xingdong Liang, Longyong Chen, Wen Hong, Yirong Wu
In this paper we report the first airborne experiments of sparse microwave imaging, conducted in September 2013 and May 2014, using our prototype sparse microwave imaging radar system.
no code implementations • 24 Sep 2021 • Bahareh Alizadeh, Diya Li, Zhe Zhang, Amir H. Behzadan
Water events are the most frequent and costliest climate disasters around the world.
no code implementations • 30 Jun 2021 • Anirudh Raju, Milind Rao, Gautam Tiwari, Pranav Dheram, Bryan Anderson, Zhe Zhang, Chul Lee, Bach Bui, Ariya Rastrow
Spoken language understanding (SLU) systems extract both text transcripts and semantics associated with intents and slots from input speech utterances.
Automatic Speech Recognition (ASR)
1 code implementation • 9 Jun 2021 • Jingyuan Chen, Guanchen Ding, Yuchen Yang, Wenwei Han, Kangmin Xu, Tianyi Gao, Zhe Zhang, Wanping Ouyang, Hao Cai, Zhenzhong Chen
For the vehicle detection and tracking module, we adopted YOLOv5 and multi-scale tracking to localize the anomalies.
no code implementations • 13 Apr 2021 • Zhe Zhou, Bizhao Shi, Zhe Zhang, Yijin Guan, Guangyu Sun, Guojie Luo
At the hardware design level, we propose a pipelined CirCore architecture, which supports efficient block-circulant matrix computation.
no code implementations • 1 Apr 2021 • Zhe Zhang, Linjun Zhang
In this paper, we develop a general framework to design differentially private expectation-maximization (EM) algorithms in high-dimensional latent variable models, based on the noisy iterative hard-thresholding.
no code implementations • 27 Jan 2021 • Yichao Du, Pengfei Luo, Xudong Hong, Tong Xu, Zhe Zhang, Chao Ren, Yi Zheng, Enhong Chen
Clinical diagnosis, which aims to assign diagnosis codes for a patient based on the clinical note, plays an essential role in clinical decision-making.
no code implementations • 7 Jan 2021 • Yao Chen, Jiangang Liu, Zhe Zhang, Shiping Wen, Wenjun Xiong
In this work, we propose a novel Knowledge Graph Embedding (KGE) strategy, called MöbiusE, in which the entities and relations are embedded to the surface of a Möbius ring.
no code implementations • 17 Dec 2020 • Yaozhong Gan, Zhe Zhang, Xiaoyang Tan
Learning complicated value functions in high-dimensional state spaces by function approximation is a challenging task, partly because the max-operator used in temporal difference updates can theoretically cause instability for most linear or non-linear approximation schemes.
no code implementations • 23 Nov 2020 • Zhaoyue Chen, Nick Koudas, Zhe Zhang, Xiaohui Yu
For the case of NN, we propose algorithms to train the network taking normalized data as the input.
no code implementations • 19 Nov 2020 • Zhe Zhang, Guanghui Lan
All these complexity results seem to be new in the literature and they indicate that the convex NSCO problem has the same order of oracle complexity as those without the nested composition in all but the strongly convex and outer-non-smooth problem.
2 code implementations • 26 Oct 2020 • Zhe Zhang, Chunyu Wang, Weichao Qiu, Wenhu Qin, Wenjun Zeng
To make the task truly unconstrained, we present AdaFuse, an adaptive multiview fusion method, which can enhance the features in occluded views by leveraging those in visible views.
Ranked #1 on 3D Human Pose Estimation on Total Capture
no code implementations • 21 Aug 2020 • Zhe Zhang, Meixia Tao
This approach, on the one hand, can learn the caching policy in continuous action space by using the actor-critic architecture.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Zhe Zhang, Chung-Wei Hang, Munindar P. Singh
Sentiments in opinionated text are often determined by both aspects and target words (or targets).
1 code implementation • CVPR 2020 • Zhe Zhang, Chunyu Wang, Wenhu Qin, Wen-Jun Zeng
Then we lift the multi-view 2D poses to the 3D space by an Orientation Regularized Pictorial Structure Model (ORPSM) which jointly minimizes the projection error between the 3D and 2D poses, along with the discrepancy between the 3D pose and IMU orientations.
Ranked #1 on 3D Absolute Human Pose Estimation on Total Capture
no code implementations • 20 Mar 2020 • Rui Xiang, Feng Zheng, Huapeng Su, Zhe Zhang
In this paper, we propose an end-to-end deep learning network named 3dDepthNet, which produces an accurate dense depth image from a single pair of sparse LiDAR depth and color images for robotics and autonomous driving tasks.
1 code implementation • 23 Nov 2019 • Zhe Zhang, Jie Tang, Gangshan Wu
Specifically, our LPN-50 can achieve 68.7 in AP score on the COCO test-dev set, with only 2.7M parameters and 1.0 GFLOPs, while the inference speed is 17 FPS on an Intel i7-8700K CPU machine.
no code implementations • IJCNLP 2019 • Zhe Zhang, Munindar P. Singh
Opinionated text often involves attributes such as authorship and location that influence the sentiments expressed for different aspects.
1 code implementation • 29 Jun 2019 • Xiaobiao Huang, Minghao Song, Zhe Zhang
We present a multi-objective evolutionary optimization algorithm that uses Gaussian process (GP) regression-based models to select trial solutions in a multi-generation iterative procedure.
no code implementations • 24 Mar 2019 • Anthony Hsu, Keqiu Hu, Jonathan Hung, Arun Suresh, Zhe Zhang
Training machine learning (ML) models on large datasets requires considerable computing power.
no code implementations • EMNLP 2018 • Zhe Zhang, Munindar Singh
We propose Limbic, an unsupervised probabilistic model that addresses the problem of discovering aspects and sentiments and associating them with authors of opinionated texts.
no code implementations • 6 Mar 2018 • Feng Zheng, Grace Tsai, Zhe Zhang, Shaoshan Liu, Chen-Chi Chu, Hongbing Hu
In this paper, we present the Trifo Visual Inertial Odometry (Trifo-VIO), a tightly-coupled filtering-based stereo VIO system using both points and lines.
no code implementations • 2 Oct 2017 • Zhe Zhang, Shaoshan Liu, Grace Tsai, Hongbing Hu, Chen-Chi Chu, Feng Zheng
In this paper, we present the PerceptIn Robotics Vision System (PIRVS), visual-inertial computing hardware with an embedded simultaneous localization and mapping (SLAM) algorithm.
1 code implementation • 23 Apr 2017 • Zhe Zhang, Brian Bockelman
ROOT provides a flexible format used throughout the HEP community.
Distributed, Parallel, and Cluster Computing
no code implementations • 16 Apr 2017 • Shaoshan Liu, Bolin Ding, Jie Tang, Dawei Sun, Zhe Zhang, Grace Tsai, Jean-Luc Gaudiot
The rise of robotic applications has led to the generation of a huge volume of unstructured data, whereas the current cloud infrastructure was designed to process limited amounts of structured data.
no code implementations • 24 Nov 2016 • Zhe Zhang, Daniel B. Neill
We present a novel subset scan method to detect if a probabilistic binary classifier has statistically significant bias (over- or under-predicting the risk) for some subgroup, and to identify the characteristics of this subgroup.
no code implementations • CVPR 2014 • Zhe Zhang, Kin Hong Wong
Firstly, we extend the original mean shift approach to handle orientation space and scale space, and name this new method mean transform.