Search Results for author: Zhe Zhang

Found 69 papers, 12 papers with code

MTL-SLT: Multi-Task Learning for Spoken Language Tasks

no code implementations NLP4ConvAI (ACL) 2022 Zhiqi Huang, Milind Rao, Anirudh Raju, Zhe Zhang, Bach Bui, Chul Lee

The proposed framework benefits from three key aspects: 1) pre-trained sub-networks of ASR model and language model; 2) multi-task learning objective to exploit shared knowledge from different tasks; 3) end-to-end training of ASR and downstream NLP task based on sequence loss.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

HyperGANStrument: Instrument Sound Synthesis and Editing with Pitch-Invariant Hypernetworks

no code implementations 9 Jan 2024 Zhe Zhang, Taketo Akama

GANStrument, exploiting GANs with a pitch-invariant feature extractor and instance conditioning technique, has shown remarkable capabilities in synthesizing realistic instrument sounds.

DA-STC: Domain Adaptive Video Semantic Segmentation via Spatio-Temporal Consistency

1 code implementation 22 Nov 2023 Zhe Zhang, Gaochang Wu, Jing Zhang, Chunhua Shen, DaCheng Tao, Tianyou Chai

To solve the challenge, we propose a novel DA-STC method for domain adaptive video semantic segmentation, which incorporates a bidirectional multi-level spatio-temporal fusion module and a category-aware spatio-temporal feature alignment module to facilitate consistent learning for domain-invariant features.

Representation Learning Segmentation +2

Syllable-level lyrics generation from melody exploiting character-level language model

no code implementations 2 Oct 2023 Zhe Zhang, Karol Lasocki, Yi Yu, Atsuhiro Takasu

The generation of lyrics tightly connected to accompanying melodies involves establishing a mapping between musical notes and syllables of lyrics.

Language Modelling Sentence

Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation

no code implementations 26 Jul 2023 Chaohui Yu, Qiang Zhou, Jingliang Li, Zhe Zhang, Zhibin Wang, Fan Wang

To better utilize the sparse 3D points, we propose an efficient point cloud guidance loss to adaptively drive the NeRF's geometry to align with the shape of the sparse 3D points.

Text to 3D

Controllable Lyrics-to-Melody Generation

no code implementations 5 Jun 2023 Zhe Zhang, Yi Yu, Atsuhiro Takasu

Lyrics-to-melody generation is an interesting and challenging topic in the AI music research field.

Music Generation

CVGG-Net: Ship Recognition for SAR Images Based on Complex-Valued Convolutional Neural Network

no code implementations 13 May 2023 Dandan Zhao, Zhe Zhang, Dongdong Lu, Jian Kang, Xiaolan Qiu, Yirong Wu

Although convolutional neural networks have been successfully employed for SAR image target recognition, surpassing traditional algorithms, most existing research concentrates on the amplitude domain and neglects the essential phase information.

SE-ORNet: Self-Ensembling Orientation-aware Network for Unsupervised Point Cloud Shape Correspondence

1 code implementation CVPR 2023 Jiacheng Deng, Chuxin Wang, Jiahao Lu, Jianfeng He, Tianzhu Zhang, Jiyang Yu, Zhe Zhang

The key of our approach is to exploit an orientation estimation module with a domain adaptive discriminator to align the orientations of point cloud pairs, which significantly alleviates the mispredictions of symmetrical parts.

Ranked #2 on 3D Dense Shape Correspondence on SHREC'19 (using extra training data)

3D Dense Shape Correspondence

SPHR-SAR-Net: Superpixel High-resolution SAR Imaging Network Based on Nonlocal Total Variation

no code implementations 10 Apr 2023 Guoru Zhou, Zhongqiu Xu, Yizhe Fan, Zhe Zhang, Xiaolan Qiu, Bingchen Zhang, Kun Fu, Yirong Wu

High-resolution is a key trend in the development of synthetic aperture radar (SAR), which enables the capture of fine details and accurate representation of backscattering properties.

Efficient Gridless DoA Estimation Method of Non-uniform Linear Arrays with Applications in Automotive Radars

no code implementations 8 Mar 2023 Silin Gao, Zhe Zhang, Muhan Wang, Yan Zhang, Jie Zhao, Bingchen Zhang, Yue Wang, Yirong Wu

This paper focuses on the gridless direction-of-arrival (DoA) estimation for data acquired by non-uniform linear arrays (NLAs) in automotive applications.

Coincident Learning for Unsupervised Anomaly Detection

no code implementations 26 Jan 2023 Ryan Humble, Zhe Zhang, Finn O'Shea, Eric Darve, Daniel Ratner

While complex systems often have a wealth of data, labeled anomalies are typically rare (or even nonexistent) and expensive to acquire.

Time Series Time Series Analysis +1

Deep Attention-Based Alignment Network for Melody Generation from Incomplete Lyrics

no code implementations 23 Jan 2023 Gurunath Reddy M, Zhe Zhang, Yi Yu, Florian Harscoet, Simon Canales, Suhua Tang

We propose a deep attention-based alignment network, which aims to automatically predict lyrics and melody with given incomplete lyrics as input in a way similar to the music creation of humans.

Deep Attention

D2Former: Jointly Learning Hierarchical Detectors and Contextual Descriptors via Agent-Based Transformers

no code implementations CVPR 2023 Jianfeng He, Yuan Gao, Tianzhu Zhang, Zhe Zhang, Feng Wu

Second, the HKDL module can generate keypoint detectors in a hierarchical way, which is helpful for detecting keypoints with diverse levels of structures.

GeoMVSNet: Learning Multi-View Stereo With Geometry Perception

1 code implementation CVPR 2023 Zhe Zhang, Rui Peng, Yuxi Hu, Ronggang Wang

To intensify the full-scene geometry perception of our model, we present the depth distribution similarity loss based on the Gaussian-Mixture Model assumption.

Depth Estimation Point Clouds +1

CL-MVSNet: Unsupervised Multi-View Stereo with Dual-Level Contrastive Learning

no code implementations ICCV 2023 Kaiqiang Xiong, Rui Peng, Zhe Zhang, Tianxing Feng, Jianbo Jiao, Feng Gao, Ronggang Wang

On the one hand, we present an image-level contrastive branch to guide the model to acquire more context awareness, thus leading to more complete depth estimation in indistinguishable regions.

Contrastive Learning Depth Estimation

A novel TomoSAR imaging method with few observations based on nested array

no code implementations 1 Dec 2022 Pengyu Jiang, Zhe Zhang, Bingchen Zhang, Zhongqiu Xu

In this paper, we propose a nested TomoSAR technique, which introduces the nested array into TomoSAR as the baseline configuration.

ATASI-Net: An Efficient Sparse Reconstruction Network for Tomographic SAR Imaging with Adaptive Threshold

no code implementations 30 Nov 2022 Muhan Wang, Zhe Zhang, Xiaolan Qiu, Silin Gao, Yue Wang

In addition, an adaptive threshold is introduced for each azimuth-range pixel, enabling the threshold shrinkage to be not only layer-varied but also element-wise.

Super-Resolution

Fault Diagnosis for Power Electronics Converters based on Deep Feedforward Network and Wavelet Compression

no code implementations 27 Oct 2022 Lei Kou, Chuang Liu, Guowei Cai, Zhe Zhang

Secondly, the wavelet transform is used to remove the redundant data of the features, and then the training sample data is greatly compressed.

Review for AI-based Open-Circuit Faults Diagnosis Methods in Power Electronics Converters

no code implementations 26 Sep 2022 Chuang Liu, Lei Kou, Guowei Cai, Zihan Zhao, Zhe Zhang

Power electronics converters have been widely used in aerospace systems, DC transmission, distributed energy, smart grids, and so forth, and their reliability has been a hotspot in academia and industry.

M$^2$DQN: A Robust Method for Accelerating Deep Q-learning Network

1 code implementation 16 Sep 2022 Zhe Zhang, Yukun Zou, Junjie Lai, Qing Xu

Deep Q-learning Network (DQN) is a successful method that combines reinforcement learning with deep neural networks and has led to the widespread application of reinforcement learning.

Q-Learning reinforcement-learning +1

Stochastic Compositional Optimization with Compositional Constraints

no code implementations 9 Sep 2022 Shuoguang Yang, Zhe Zhang, Ethan X. Fang

Stochastic compositional optimization (SCO) has attracted considerable attention because of its broad applicability to important real-world problems.

Management

Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness

1 code implementation 13 Aug 2022 Vincent Jeanselme, Maria De-Arteaga, Zhe Zhang, Jessica Barrett, Brian Tom

Machine learning risks reinforcing biases present in data, and, as we argue in this work, in what is absent from data.

Fairness Imputation

Interpretable Melody Generation from Lyrics with Discrete-Valued Adversarial Training

no code implementations 30 Jun 2022 Wei Duan, Zhe Zhang, Yi Yu, Keizo Oyama

Generating melody from lyrics is an interesting yet challenging task in the area of artificial intelligence and music.

Efficient Federated Learning with Spike Neural Networks for Traffic Sign Recognition

no code implementations 28 May 2022 Kan Xie, Zhe Zhang, Bo Li, Jiawen Kang, Dusit Niyato, Shengli Xie, Yi Wu

However, machine learning-based traffic sign recognition on the Internet of Vehicles (IoV) requires gathering a large amount of traffic sign data from distributed vehicles on a centralized server for model training, which poses a serious privacy leakage risk because traffic sign data contains a great deal of private location information.

Federated Learning Privacy Preserving +1

PERT: A New Solution to Pinyin to Character Conversion Task

1 code implementation 24 May 2022 Jinghui Xiao, Qun Liu, Xin Jiang, Yuanfeng Xiong, Haiteng Wu, Zhe Zhang

The Pinyin to Character conversion (P2C) task is the key task of the Input Method Engine (IME) in commercial input software for Asian languages such as Chinese, Japanese, and Thai.

Language Modelling

TomoSAR-ALISTA: Efficient TomoSAR Imaging via Deep Unfolded Network

no code implementations 5 May 2022 Muhan Wang, Zhe Zhang, Yue Wang, Silin Gao, Xiaolan Qiu

Synthetic aperture radar (SAR) tomography (TomoSAR) has attracted remarkable interest for its ability to achieve three-dimensional reconstruction along the elevation direction from multiple observations.

3D Reconstruction Super-Resolution

Smoothing Advantage Learning

no code implementations 20 Mar 2022 Yaozhong Gan, Zhe Zhang, Xiaoyang Tan

Advantage learning (AL) aims to improve the robustness of value-based reinforcement learning against estimation errors with action-gap-based regularization.

Robust Action Gap Increasing with Clipped Advantage Learning

no code implementations 20 Mar 2022 Zhe Zhang, Yaozhong Gan, Xiaoyang Tan

Advantage Learning (AL) seeks to increase the action gap between the optimal action and its competitors, so as to improve the robustness to estimation errors.

A Novel Gradient Descent Least Squares (GDLS) Algorithm for Efficient SMV Gridless Line Spectrum Estimation with Applications in Tomographic SAR Imaging

no code implementations 16 Mar 2022 Ruizhe Shi, Zhe Zhang, Xiaolan Qiu, Chibiao Ding

Numerical simulations and real data experiments show that the proposed GDLS algorithm outperforms the state-of-the-art methods, e.g., CS and ANM, in terms of estimation performance.

Optimal Methods for Convex Risk Averse Distributed Optimization

no code implementations 10 Mar 2022 Guanghui Lan, Zhe Zhang

Specifically, the DRAO method achieves the optimal communication complexity by assuming a certain saddle point subproblem can be easily solved in the server node.

Distributed Optimization

Robust Semi-supervised Federated Learning for Images Automatic Recognition in Internet of Drones

no code implementations 3 Jan 2022 Zhe Zhang, Shiyao Ma, Zhaohui Yang, Zehui Xiong, Jiawen Kang, Yi Wu, Kejia Zhang, Dusit Niyato

This emerging technology relies on sharing ground truth labeled data between Unmanned Aerial Vehicle (UAV) swarms to train a high-quality automatic image recognition model.

Federated Learning Privacy Preserving

Motion-Modulated Temporal Fragment Alignment Network for Few-Shot Action Recognition

no code implementations CVPR 2022 Jiamin Wu, Tianzhu Zhang, Zhe Zhang, Feng Wu, Yongdong Zhang

To address this issue, we propose an end-to-end Motion-modulated Temporal Fragment Alignment Network (MTFAN) by jointly exploring the task-specific motion modulation and the multi-level temporal fragment alignment for Few-Shot Action Recognition (FSAR).

Few-Shot action recognition Few Shot Action Recognition +1

Multi-View Stereo with Transformer

no code implementations 1 Dec 2021 Jie Zhu, Bo Peng, Wanqing Li, Haifeng Shen, Zhe Zhang, Jianjun Lei

It is built upon Transformer and is capable of extracting dense features with global context and 3D consistency, which are crucial to achieving reliable matching for MVS.

Finite Sample Analysis of Average-Reward TD Learning and $Q$-Learning

no code implementations NeurIPS 2021 Sheng Zhang, Zhe Zhang, Siva Theja Maguluri

The focus of this paper is on sample complexity guarantees of average-reward reinforcement learning algorithms, which are known to be more challenging to study than their discounted-reward counterparts.

Q-Learning

Semi-Supervised Federated Learning with non-IID Data: Algorithm and System Design

no code implementations 26 Oct 2021 Zhe Zhang, Shiyao Ma, Jiangtian Nie, Yi Wu, Qiang Yan, Xiaoke Xu, Dusit Niyato

In this paper, we present a robust semi-supervised FL system design, where the system aims to solve the problem of data availability and non-IID in FL.

Federated Learning

The First Airborne Experiment of Sparse Microwave Imaging: Prototype System Design and Result Analysis

no code implementations 20 Oct 2021 Zhe Zhang, Bingchen Zhang, Chenglong Jiang, Xingdong Liang, Longyong Chen, Wen Hong, Yirong Wu

In this paper we report the first airborne experiments of sparse microwave imaging, conducted in September 2013 and May 2014, using our prototype sparse microwave imaging radar system.

On joint training with interfaces for spoken language understanding

no code implementations 30 Jun 2021 Anirudh Raju, Milind Rao, Gautam Tiwari, Pranav Dheram, Bryan Anderson, Zhe Zhang, Chul Lee, Bach Bui, Ariya Rastrow

Spoken language understanding (SLU) systems extract both text transcripts and semantics associated with intents and slots from input speech utterances.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

BlockGNN: Towards Efficient GNN Acceleration Using Block-Circulant Weight Matrices

no code implementations 13 Apr 2021 Zhe Zhou, Bizhao Shi, Zhe Zhang, Yijin Guan, Guangyu Sun, Guojie Luo

At the hardware design level, we propose a pipelined CirCore architecture, which supports efficient block-circulant matrices computation.

Edge-computing

High-Dimensional Differentially-Private EM Algorithm: Methods and Near-Optimal Statistical Guarantees

no code implementations 1 Apr 2021 Zhe Zhang, Linjun Zhang

In this paper, we develop a general framework to design differentially private expectation-maximization (EM) algorithms in high-dimensional latent variable models, based on the noisy iterative hard-thresholding.

regression

Inheritance-guided Hierarchical Assignment for Clinical Automatic Diagnosis

no code implementations 27 Jan 2021 Yichao Du, Pengfei Luo, Xudong Hong, Tong Xu, Zhe Zhang, Chao Ren, Yi Zheng, Enhong Chen

Clinical diagnosis, which aims to assign diagnosis codes for a patient based on the clinical note, plays an essential role in clinical decision-making.

Decision Making

MöbiusE: Knowledge Graph Embedding on Möbius Ring

no code implementations 7 Jan 2021 Yao Chen, Jiangang Liu, Zhe Zhang, Shiping Wen, Wenjun Xiong

In this work, we propose a novel Knowledge Graph Embedding (KGE) strategy, called MöbiusE, in which the entities and relations are embedded to the surface of a Möbius ring.

Knowledge Graph Embedding

Stabilizing Q Learning Via Soft Mellowmax Operator

no code implementations 17 Dec 2020 Yaozhong Gan, Zhe Zhang, Xiaoyang Tan

Learning complicated value functions in high-dimensional state space by function approximation is a challenging task, partly because the max-operator used in temporal difference updates can theoretically cause instability for most linear or non-linear approximation schemes.

Multi-agent Reinforcement Learning Q-Learning

Efficient Construction of Nonlinear Models over Normalized Data

no code implementations 23 Nov 2020 Zhaoyue Chen, Nick Koudas, Zhe Zhang, Xiaohui Yu

For the case of NN, we propose algorithms to train the network taking normalized data as the input.

Optimal Algorithms for Convex Nested Stochastic Composite Optimization

no code implementations 19 Nov 2020 Zhe Zhang, Guanghui Lan

All these complexity results seem to be new in the literature and they indicate that the convex NSCO problem has the same order of oracle complexity as those without the nested composition in all but the strongly convex and outer-non-smooth problem.

Stochastic Optimization

AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild

2 code implementations 26 Oct 2020 Zhe Zhang, Chunyu Wang, Weichao Qiu, Wenhu Qin, Wenjun Zeng

To make the task truly unconstrained, we present AdaFuse, an adaptive multiview fusion method, which can enhance the features in occluded views by leveraging those in visible views.

3D Human Pose Estimation

Deep Learning for Wireless Coded Caching with Unknown and Time-Variant Content Popularity

no code implementations 21 Aug 2020 Zhe Zhang, Meixia Tao

This approach, on one hand, can learn the caching policy in continuous action space by using the actor-critic architecture.

Clustering

Fusing Wearable IMUs with Multi-View Images for Human Pose Estimation: A Geometric Approach

1 code implementation CVPR 2020 Zhe Zhang, Chunyu Wang, Wenhu Qin, Wen-Jun Zeng

Then we lift the multi-view 2D poses to the 3D space by an Orientation Regularized Pictorial Structure Model (ORPSM) which jointly minimizes the projection error between the 3D and 2D poses, along with the discrepancy between the 3D pose and IMU orientations.

2D Pose Estimation 3D Absolute Human Pose Estimation

3dDepthNet: Point Cloud Guided Depth Completion Network for Sparse Depth and Single Color Image

no code implementations 20 Mar 2020 Rui Xiang, Feng Zheng, Huapeng Su, Zhe Zhang

In this paper, we propose an end-to-end deep learning network named 3dDepthNet, which produces an accurate dense depth image from a single pair of sparse LiDAR depth and color image for robotics and autonomous driving tasks.

Autonomous Driving Depth Completion +1

Simple and Lightweight Human Pose Estimation

1 code implementation 23 Nov 2019 Zhe Zhang, Jie Tang, Gangshan Wu

Specifically, our LPN-50 can achieve 68.7 in AP score on the COCO test-dev set, with only 2.7M parameters and 1.0 GFLOPs, while the inference speed is 17 FPS on an Intel i7-8700K CPU machine.

Keypoint Detection Novel Concepts

Leveraging Structural and Semantic Correspondence for Attribute-Oriented Aspect Sentiment Discovery

no code implementations IJCNLP 2019 Zhe Zhang, Munindar P. Singh

Opinionated text often involves attributes such as authorship and location that influence the sentiments expressed for different aspects.

Attribute Semantic correspondence

Multi-objective multi-generation Gaussian process optimizer for design optimization

1 code implementation 29 Jun 2019 Xiaobiao Huang, Minghao Song, Zhe Zhang

We present a multi-objective evolutionary optimization algorithm that uses Gaussian process (GP) regression-based models to select trial solutions in a multi-generation iterative procedure.

regression

TonY: An Orchestrator for Distributed Machine Learning Jobs

no code implementations 24 Mar 2019 Anthony Hsu, Keqiu Hu, Jonathan Hung, Arun Suresh, Zhe Zhang

Training machine learning (ML) models on large datasets requires considerable computing power.

BIG-bench Machine Learning

Limbic: Author-Based Sentiment Aspect Modeling Regularized with Word Embeddings and Discourse Relations

no code implementations EMNLP 2018 Zhe Zhang, Munindar Singh

We propose Limbic, an unsupervised probabilistic model that addresses the problem of discovering aspects and sentiments and associating them with authors of opinionated texts.

General Classification Semantic Similarity +5

Trifo-VIO: Robust and Efficient Stereo Visual Inertial Odometry using Points and Lines

no code implementations 6 Mar 2018 Feng Zheng, Grace Tsai, Zhe Zhang, Shaoshan Liu, Chen-Chi Chu, Hongbing Hu

In this paper, we present the Trifo Visual Inertial Odometry (Trifo-VIO), a tightly-coupled filtering-based stereo VIO system using both points and lines.

PIRVS: An Advanced Visual-Inertial SLAM System with Flexible Sensor Fusion and Hardware Co-Design

no code implementations 2 Oct 2017 Zhe Zhang, Shaoshan Liu, Grace Tsai, Hongbing Hu, Chen-Chi Chu, Feng Zheng

In this paper, we present the PerceptIn Robotics Vision System (PIRVS), visual-inertial computing hardware with an embedded simultaneous localization and mapping (SLAM) algorithm.

Sensor Fusion Simultaneous Localization and Mapping

Exploring compression techniques for ROOT IO

1 code implementation 23 Apr 2017 Zhe Zhang, Brian Bockelman

ROOT provides a flexible format used throughout the HEP community.

Distributed, Parallel, and Cluster Computing

Learn-Memorize-Recall-Reduce: A Robotic Cloud Computing Paradigm

no code implementations 16 Apr 2017 Shaoshan Liu, Bolin Ding, Jie Tang, Dawei Sun, Zhe Zhang, Grace Tsai, Jean-Luc Gaudiot

The rise of robotic applications has led to the generation of a huge volume of unstructured data, whereas the current cloud infrastructure was designed to process limited amounts of structured data.

Cloud Computing Memorization

Identifying Significant Predictive Bias in Classifiers

no code implementations 24 Nov 2016 Zhe Zhang, Daniel B. Neill

We present a novel subset scan method to detect if a probabilistic binary classifier has statistically significant bias -- over or under predicting the risk -- for some subgroup, and identify the characteristics of this subgroup.

Pyramid-based Visual Tracking Using Sparsity Represented Mean Transform

no code implementations CVPR 2014 Zhe Zhang, Kin Hong Wong

Firstly, we extend the original mean shift approach to handle orientation space and scale space, and name this new method mean transform.

Visual Tracking
