Search Results for author: Jianwei Zhang

Found 54 papers, 21 papers with code

Multi-fingered Robotic Hand Grasping in Cluttered Environments through Hand-object Contact Semantic Mapping

no code implementations • 12 Apr 2024 • Lei Zhang, Kaixin Bai, Guowen Huang, Zhaopeng Chen, Jianwei Zhang

The integration of optimization method and generative models has significantly advanced dexterous manipulation techniques for five-fingered hand grasping.

Grasp Generation

Paper
Add Code

Equivariant Local Reference Frames for Unsupervised Non-rigid Point Cloud Shape Correspondence

no code implementations • 1 Apr 2024 • Ling Wang, Runfa Chen, Yikai Wang, Fuchun Sun, Xinzhou Wang, Sun Kai, Guangyuan Fu, Jianwei Zhang, Wenbing Huang

Based on the assumption of local rigidity, one solution for reducing complexity is to decompose the overall shape into independent local regions using Local Reference Frames (LRFs) that are invariant to SE(3) transformations.

Paper
Add Code

Unified Source-Free Domain Adaptation

1 code implementation • 12 Mar 2024 • Song Tang, Wenxin Su, Mao Ye, Jianwei Zhang, Xiatian Zhu

To tackle this unified SFDA problem, we propose a novel approach called Latent Causal Factors Discovery (LCFD).

Language Modelling Source-Free Domain Adaptation +1

Paper
Code

Anatomy-Guided Surface Diffusion Model for Alzheimer's Disease Normative Modeling

no code implementations • 7 Mar 2024 • Jianwei Zhang, Yonggang Shi

Normative modeling has emerged as a pivotal approach for characterizing heterogeneity and individual variance in neurodegenerative diseases, notably Alzheimer's disease(AD).

Anatomy

Paper
Add Code

Identity information based on human magnetocardiography signals

no code implementations • 2 Mar 2024 • Pengju Zhang, Chenxi Sun, Jianwei Zhang, Hong Guo

We have developed an individual identification system based on magnetocardiography (MCG) signals captured using optically pumped magnetometers (OPMs).

Management

Paper
Add Code

A Collision-Aware Cable Grasping Method in Cluttered Environment

no code implementations • 22 Feb 2024 • Lei Zhang, Kaixin Bai, Qiang Li, Zhaopeng Chen, Jianwei Zhang

We introduce a Cable Grasping-Convolutional Neural Network designed to facilitate robust cable grasping in cluttered environments.

Paper
Add Code

A Closed-Loop Multi-perspective Visual Servoing Approach with Reinforcement Learning

no code implementations • 25 Dec 2023 • Lei Zhang, Jiacheng Pei, Kaixin Bai, Zhaopeng Chen, Jianwei Zhang

Traditional visual servoing methods suffer from serving between scenes from multiple perspectives, which humans can complete with visual signals alone.

OpenAI Gym reinforcement-learning

Paper
Add Code

Search Optimization with Query Likelihood Boosting and Two-Level Approximate Search for Edge Devices

no code implementations • 12 Dec 2023 • Jianwei Zhang, Helian Feng, Xin He, Grant P. Strimel, Farhad Ghassemi, Ali Kebarighotbi

We present a novel search optimization solution for approximate nearest neighbor (ANN) search on resource-constrained edge devices.

Retrieval

Paper
Add Code

ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning

no code implementations • 11 Dec 2023 • Xincheng Yu, Dongyue Guo, Jianwei Zhang, Yi Lin

Radio speech echo is a specific phenomenon in the air traffic control (ATC) domain, which degrades speech quality and further impacts automatic speech recognition (ASR) accuracy.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

JarviX: A LLM No code Platform for Tabular Data Analysis and Optimization

no code implementations • 3 Dec 2023 • Shang-Ching Liu, Shengkun Wang, Wenqi Lin, Chung-Wei Hsiung, Yi-Chen Hsieh, Yu-Ping Cheng, Sian-Hong Luo, Tsungyao Chang, Jianwei Zhang

In this study, we introduce JarviX, a sophisticated data analytics framework.

AutoML

Paper
Add Code

Learning Repeatable Speech Embeddings Using An Intra-class Correlation Regularizer

1 code implementation • NeurIPS 2023 • Jianwei Zhang, Suren Jayasuriya, Visar Berisha

A good supervised embedding for a specific machine learning task is only sensitive to changes in the label of interest and is invariant to other confounding factors.

Speaker Verification

Paper
Code

Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition

1 code implementation • 23 Oct 2023 • Peng Fan, Changhao Shan, Sining Sun, Qing Yang, Jianwei Zhang

Following the initial encoder, we introduce an intermediate CTC loss function to compute the label frame, enabling us to extract the key frames and blank frames for KFSA.

Automatic Speech Recognition speech-recognition +1

Paper
Code

Qwen Technical Report

2 code implementations • 28 Sep 2023 • Jinze Bai, Shuai Bai, Yunfei Chu, Zeyu Cui, Kai Dang, Xiaodong Deng, Yang Fan, Wenbin Ge, Yu Han, Fei Huang, Binyuan Hui, Luo Ji, Mei Li, Junyang Lin, Runji Lin, Dayiheng Liu, Gao Liu, Chengqiang Lu, Keming Lu, Jianxin Ma, Rui Men, Xingzhang Ren, Xuancheng Ren, Chuanqi Tan, Sinan Tan, Jianhong Tu, Peng Wang, Shijie Wang, Wei Wang, Shengguang Wu, Benfeng Xu, Jin Xu, An Yang, Hao Yang, Jian Yang, Shusheng Yang, Yang Yao, Bowen Yu, Hongyi Yuan, Zheng Yuan, Jianwei Zhang, Xingxuan Zhang, Yichang Zhang, Zhenru Zhang, Chang Zhou, Jingren Zhou, Xiaohuan Zhou, Tianhang Zhu

Large language models (LLMs) have revolutionized the field of artificial intelligence, enabling natural language processing tasks that were previously thought to be exclusive to humans.

Ranked #3 on Multi-Label Text Classification on CC3M-TagMask

Language Modelling Large Language Model +2

10,892

Paper
Code

3D Semantic Subspace Traverser: Empowering 3D Generative Model with Shape Editing Capability

1 code implementation • ICCV 2023 • Ruowei Wang, Yu Liu, Pei Su, Jianwei Zhang, Qijun Zhao

Our method utilizes implicit functions as the 3D shape representation and combines a novel latent-space GAN with a linear subspace model to discover semantic dimensions in the local latent space of 3D shapes.

3D Shape Generation 3D Shape Representation +1

Paper
Code

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

no code implementations • 5 Jun 2023 • Tianying Ji, Yu Luo, Fuchun Sun, Xianyuan Zhan, Jianwei Zhang, Huazhe Xu

Learning high-quality Q-value functions plays a key role in the success of many modern off-policy deep reinforcement learning (RL) algorithms.

Continuous Control Reinforcement Learning (RL)

Paper
Add Code

Deep Active Learning with Structured Neural Depth Search

no code implementations • 5 Jun 2023 • Xiaoyun Zhang, Xieyi Ping, Jianwei Zhang

Previous work optimizes traditional active learning (AL) processes with incremental neural network architecture search (Active-iNAS) based on data complexity change, which improves the accuracy and learning efficiency.

Active Learning Variational Inference

Paper
Add Code

FlightBERT++: A Non-autoregressive Multi-Horizon Flight Trajectory Prediction Framework

no code implementations • 2 May 2023 • Dongyue Guo, Zheng Zhang, Zhen Yan, Jianwei Zhang, Yi Lin

Flight Trajectory Prediction (FTP) is an essential task in Air Traffic Control (ATC), which can assist air traffic controllers in managing airspace more safely and efficiently.

Computational Efficiency Trajectory Prediction

Paper
Add Code

SIA-FTP: A Spoken Instruction Aware Flight Trajectory Prediction Framework

no code implementations • 2 May 2023 • Dongyue Guo, Jianwei Zhang, Yi Lin

A major reason is that spoken instructions and flight trajectories are presented in different modalities in the current air traffic control (ATC) system, bringing great challenges to considering the maneuvering instruction in the FTP tasks.

Trajectory Prediction

Paper
Add Code

Towards Accurate Acne Detection via Decoupled Sequential Detection Head

no code implementations • 28 Jan 2023 • Xin Wei, Lei Zhang, Jianwei Zhang, Junyou Wang, Wenjie Liu, Jiaqi Li, Xian Jiang

In addition, we build a high-quality acne detection dataset named ACNE-DET to verify the effectiveness of DSDH.

Paper
Add Code

Towards Precise Model-free Robotic Grasping with Sim-to-Real Transfer Learning

no code implementations • 28 Jan 2023 • Lei Zhang, Kaixin Bai, Zhaopeng Chen, Yunlei Shi, Jianwei Zhang

In physical robotic experiments, our grasping framework grasped single known objects and novel complex-shaped household objects with a success rate of 90. 91%.

Data Augmentation Robotic Grasping +1

Paper
Add Code

Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection

1 code implementation • 17 Nov 2022 • Jianwei Zhang, Julie Liss, Suren Jayasuriya, Visar Berisha

In this paper, we propose a deep learning framework for generating acoustic feature embeddings sensitive to vocal quality and robust across different corpora.

Cross-corpus

Paper
Code

Embedded Silicon-Organic Integrated Neuromorphic System

no code implementations • 18 Oct 2022 • Shengjie Zheng, Ling Liu, Junjie Yang, Jianwei Zhang, Tao Su, Bin Yue, Xiaojian Li

The development of artificial intelligence (AI) and robotics are both based on the tenet of "science and technology are people-oriented", and both need to achieve efficient communication with the human brain.

Paper
Add Code

Auto Machine Learning for Medical Image Analysis by Unifying the Search on Data Augmentation and Neural Architecture

no code implementations • 21 Jul 2022 • Jianwei Zhang, Dong Li, Lituan Wang, Lei Zhang

To address the problem, an improved augmentation search strategy, named Augmented Density Matching, was proposed by randomly sampling policies from a prior distribution for training.

AutoML Data Augmentation

Paper
Add Code

Learning High-quality Proposals for Acne Detection

1 code implementation • 8 Jul 2022 • Jianwei Zhang, Lei Zhang, Junyou Wang, Xin Wei, Jiaqi Li, Xian Jiang, Dan Du

Acne detection is crucial for interpretative diagnosis and precise treatment of skin disease.

Classification Region Proposal +2

Paper
Code

Knowledge Distillation of Transformer-based Language Models Revisited

no code implementations • 29 Jun 2022 • Chengqiang Lu, Jianwei Zhang, Yunfei Chu, Zhengyu Chen, Jingren Zhou, Fei Wu, Haiqing Chen, Hongxia Yang

In the past few years, transformer-based pre-trained language models have achieved astounding success in both industry and academia.

Knowledge Distillation Language Modelling

Paper
Add Code

Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI

1 code implementation • 11 Nov 2021 • Jiangchao Yao, Shengyu Zhang, Yang Yao, Feng Wang, Jianxin Ma, Jianwei Zhang, Yunfei Chu, Luo Ji, Kunyang Jia, Tao Shen, Anpeng Wu, Fengda Zhang, Ziqi Tan, Kun Kuang, Chao Wu, Fei Wu, Jingren Zhou, Hongxia Yang

However, edge computing, especially edge and cloud collaborative computing, are still in its infancy to announce their success due to the resource-constrained IoT scenarios with very limited algorithms deployed.

Cloud Computing Edge-computing

Paper
Code

Speech recognition for air traffic control via feature learning and end-to-end training

no code implementations • 4 Nov 2021 • Peng Fan, Dongyue Guo, Yi Lin, Bo Yang, Jianwei Zhang

In this work, we propose a new automatic speech recognition (ASR) system based on feature learning and an end-to-end training procedure for air traffic control (ATC) systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

A Comparative Study of Speaker Role Identification in Air Traffic Communication Using Deep Learning Approaches

no code implementations • 3 Nov 2021 • Dongyue Guo, Jianwei Zhang, Bo Yang, Yi Lin

Most importantly, a multi-modal speaker role identification network (MMSRINet) is designed to achieve the SRI task by considering both the speech and textual modality features.

Binary Classification

Paper
Add Code

Nearest Neighborhood-Based Deep Clustering for Source Data-absent Unsupervised Domain Adaptation

1 code implementation • 27 Jul 2021 • Song Tang, Yan Yang, Zhiyuan Ma, Norman Hendrich, Fanyu Zeng, Shuzhi Sam Ge, ChangShui Zhang, Jianwei Zhang

To reach this goal, we construct the nearest neighborhood for every target data and take it as the fundamental clustering unit by building our objective on the geometry.

Clustering Deep Clustering +1

Paper
Code

Restoring degraded speech via a modified diffusion model

no code implementations • 22 Apr 2021 • Jianwei Zhang, Suren Jayasuriya, Visar Berisha

We replace the mel-spectrum upsampler in DiffWave with a deep CNN upsampler, which is trained to alter the degraded speech mel-spectrum to match that of the original speech.

Paper
Add Code

CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds

1 code implementation • 2 Mar 2021 • Ge Gao, Mikko Lauri, Xiaolin Hu, Jianwei Zhang, Simone Frintrop

In contrast, this domain gap is considerably smaller and easier to fill for depth information.

6D Pose Estimation Object +1

Paper
Code

M6: A Chinese Multimodal Pretrainer

no code implementations • 1 Mar 2021 • Junyang Lin, Rui Men, An Yang, Chang Zhou, Ming Ding, Yichang Zhang, Peng Wang, Ang Wang, Le Jiang, Xianyan Jia, Jie Zhang, Jianwei Zhang, Xu Zou, Zhikang Li, Xiaodong Deng, Jie Liu, Jinbao Xue, Huiling Zhou, Jianxin Ma, Jin Yu, Yong Li, Wei Lin, Jingren Zhou, Jie Tang, Hongxia Yang

In this work, we construct the largest dataset for multimodal pretraining in Chinese, which consists of over 1. 9TB images and 292GB texts that cover a wide range of domains.

Image Generation

Paper
Add Code

Dynamic Memory based Attention Network for Sequential Recommendation

1 code implementation • 18 Feb 2021 • Qiaoyu Tan, Jianwei Zhang, Ninghao Liu, Xiao Huang, Hongxia Yang, Jingren Zhou, Xia Hu

It segments the overall long behavior sequence into a series of sub-sequences, then trains the model and maintains a set of memory blocks to preserve long-term interests of users.

Sequential Recommendation

Paper
Code

Sparse-Interest Network for Sequential Recommendation

1 code implementation • 18 Feb 2021 • Qiaoyu Tan, Jianwei Zhang, Jiangchao Yao, Ninghao Liu, Jingren Zhou, Hongxia Yang, Xia Hu

Our sparse-interest module can adaptively infer a sparse set of concepts for each user from the large concept pool and output multiple embeddings accordingly.

Sequential Recommendation

Paper
Code

ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems

no code implementations • 17 Feb 2021 • Yi Lin, Bo Yang, Linchao Li, Dongyue Guo, Jianwei Zhang, Hu Chen, Yi Zhang

Finally, by integrating the SRL with ASR, an end-to-end multilingual ASR framework is formulated in a supervised manner, which is able to translate the raw wave into text in one model, i. e., wave-to-text.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Q-SR: An Extensible Optimization Framework for Segment Routing

no code implementations • 24 Dec 2020 • Jianwei Zhang

For the offline setting, we develop a fully polynomial time approximation scheme (FPTAS) which can finds a $(1+\omega)$-approximation solution for any specified $\omega>0$ in time that is a polynomial function of the network size.

Networking and Internet Architecture

Paper
Add Code

A Survey on Machine Learning from Few Samples

no code implementations • 6 Sep 2020 • Jiang Lu, Pinghua Gong, Jieping Ye, Jianwei Zhang, ChangShui Zhang

The capability of learning and generalizing from very few samples successfully is a noticeable demarcation separating artificial intelligence and human intelligence since humans can readily establish their cognition to novelty from just a single or a handful of examples whereas machine learning algorithms typically entail hundreds or thousands of supervised samples to guarantee generalization ability.

BIG-bench Machine Learning Meta-Learning

Paper
Add Code

Cascade Convolutional Neural Network for Image Super-Resolution

no code implementations • 24 Aug 2020 • Jianwei Zhang, zhenxing Wang, yuhui Zheng, Guoqing Zhang

With the development of the super-resolution convolutional neural network (SRCNN), deep learning technique has been widely applied in the field of image super-resolution.

Ranked #1 on Image Super-Resolution on BSD200 - 2x upscaling

Image Super-Resolution

Paper
Add Code

Self-Adapting Recurrent Models for Object Pushing from Learning in Simulation

no code implementations • 27 Jul 2020 • Lin Cong, Michael Görner, Philipp Ruppel, Hongzhuo Liang, Norman Hendrich, Jianwei Zhang

In this paper, we collect all training data in a physics simulator and build an LSTM-based model to fit the pushing dynamics.

Robotics

Paper
Add Code

Continuous Learning and Inference of Individual Probability of SARS-CoV-2 Infection Based on Interaction Data

no code implementations • 8 Jun 2020 • Shangching Liu, Koyun Liu, Hwaihai Chiang, Jianwei Zhang, Tsungyao Chang

This study presents a new approach to determine the likelihood of asymptomatic carriers of the SARS-CoV-2 virus by using interaction-based continuous learning and inference of individual probability (CLIIP) for contagious ranking.

Paper
Add Code

Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems

no code implementations • 20 May 2020 • Chang Zhou, Jianxin Ma, Jianwei Zhang, Jingren Zhou, Hongxia Yang

Deep candidate generation (DCG) that narrows down the collection of relevant items from billions to hundreds via representation learning has become prevalent in industrial recommender systems.

Contrastive Learning Fairness +3

Paper
Add Code

Controllable Multi-Interest Framework for Recommendation

2 code implementations • 19 May 2020 • Yukuo Cen, Jianwei Zhang, Xu Zou, Chang Zhou, Hongxia Yang, Jie Tang

Recent works usually give an overall embedding from a user's behavior sequence.

Sequential Recommendation

2,146

Paper
Code

A Mobile Robot Hand-Arm Teleoperation System by Vision and IMU

1 code implementation • 11 Mar 2020 • Shuang Li, Jiaxi Jiang, Philipp Ruppel, Hongzhuo Liang, Xiaojian Ma, Norman Hendrich, Fuchun Sun, Jianwei Zhang

In this paper, we present a multimodal mobile teleoperation system that consists of a novel vision-based hand pose regression network (Transteleop) and an IMU-based arm tracking method.

Anatomy Image-to-Image Translation +1

Paper
Code

Robust Robotic Pouring using Audition and Haptics

1 code implementation • 29 Feb 2020 • Hongzhuo Liang, Chuangchuang Zhou, Shuang Li, Xiaojian Ma, Norman Hendrich, Timo Gerkmann, Fuchun Sun, Marcus Stoffel, Jianwei Zhang

Both network training results and robot experiments demonstrate that MP-Net is robust against noise and changes to the task and environment.

Paper
Code

6D Object Pose Regression via Supervised Learning on Point Clouds

1 code implementation • 24 Jan 2020 • Ge Gao, Mikko Lauri, Yulong Wang, Xiaolin Hu, Jianwei Zhang, Simone Frintrop

We use depth information represented by point clouds as the input to both deep networks and geometry-based pose refinement and use separate networks for rotation and translation regression.

Object regression +1

Paper
Code

Dimensional Reweighting Graph Convolution Networks

no code implementations • 25 Sep 2019 • Xu Zou, Qiuye Jia, Jianwei Zhang, Chang Zhou, Zijun Yao, Hongxia Yang, Jie Tang

In this paper, we propose a method named Dimensional reweighting Graph Convolutional Networks (DrGCNs), to tackle the problem of variance between dimensional information in the node representations of GCNs.

Node Classification

Paper
Add Code

Dimensional Reweighting Graph Convolutional Networks

2 code implementations • 4 Jul 2019 • Xu Zou, Qiuye Jia, Jianwei Zhang, Chang Zhou, Hongxia Yang, Jie Tang

Graph Convolution Networks (GCNs) are becoming more and more popular for learning node representations on graphs.

Node Classification

Paper
Code

Representation Learning for Attributed Multiplex Heterogeneous Network

4 code implementations • 5 May 2019 • Yukuo Cen, Xu Zou, Jianwei Zhang, Hongxia Yang, Jingren Zhou, Jie Tang

Network embedding (or graph embedding) has been widely used in many real-world applications.

Ranked #1 on Link Prediction on Alibaba

Graph Embedding Link Prediction +2

12,994

Paper
Code

Making Sense of Audio Vibration for Liquid Height Estimation in Robotic Pouring

1 code implementation • 2 Mar 2019 • Hongzhuo Liang, Shuang Li, Xiaojian Ma, Norman Hendrich, Timo Gerkmann, Jianwei Zhang

PouringNet is trained on our collected real-world pouring dataset with multimodal sensing data, which contains more than 3000 recordings of audio, force feedback, video and trajectory data of the human hand that performs the pouring task.

Robotics Sound Audio and Speech Processing

Paper
Code

Vision-based Teleoperation of Shadow Dexterous Hand using End-to-End Deep Neural Network

4 code implementations • 17 Sep 2018 • Shuang Li, Xiaojian Ma, Hongzhuo Liang, Michael Görner, Philipp Ruppel, Bing Fang, Fuchun Sun, Jianwei Zhang

In this paper, we present TeachNet, a novel neural network architecture for intuitive and markerless vision-based teleoperation of dexterous robotic hands.

Robotics

Paper
Code

PointNetGPD: Detecting Grasp Configurations from Point Sets

4 code implementations • 17 Sep 2018 • Hongzhuo Liang, Xiaojian Ma, Shuang Li, Michael Görner, Song Tang, Bin Fang, Fuchun Sun, Jianwei Zhang

In this paper, we propose an end-to-end grasp evaluation model to address the challenging problem of localizing robot grasp configurations directly from the point cloud.

Robotics

301

Paper
Code