Search Results for author: Zhiyong Wang

Found 42 papers, 14 papers with code

Variance-Dependent Regret Bounds for Non-stationary Linear Bandits

no code implementations • 15 Mar 2024 • Zhiyong Wang, Jize Xie, Yi Chen, John C. S. Lui, Dongruo Zhou

We investigate the non-stationary stochastic linear bandit problem where the reward distribution evolves each round.

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

no code implementations • 26 Feb 2024 • Hantao Yang, Xutong Liu, Zhiyong Wang, Hong Xie, John C. S. Lui, Defu Lian, Enhong Chen

We study the problem of federated contextual combinatorial cascading bandits, where $|\mathcal{U}|$ agents collaborate under the coordination of a central server to provide tailored recommendations to the $|\mathcal{U}|$ corresponding users.

Design Your Own Universe: A Physics-Informed Agnostic Method for Enhancing Graph Neural Networks

no code implementations • 26 Jan 2024 • Dai Shi, Andi Han, Lequan Lin, Yi Guo, Zhiyong Wang, Junbin Gao

Physics-informed Graph Neural Networks have achieved remarkable performance in learning from graph-structured data by mitigating common GNN challenges such as over-smoothing, over-squashing, and heterophily adaptation.

XAI for In-hospital Mortality Prediction via Multimodal ICU Data

1 code implementation • 29 Dec 2023 • Xingqiao Li, Jindong Gu, Zhiyong Wang, Yancheng Yuan, Bo Du, Fengxiang He

To address this issue, this paper proposes an eXplainable Multimodal Mortality Predictor (X-MMP), an efficient, explainable AI solution for predicting in-hospital mortality via multimodal ICU data.

Decision Making Mortality Prediction

SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation

2 code implementations • 22 Dec 2023 • Wenxi Yue, Jing Zhang, Kun Hu, Qiuxia Wu, ZongYuan Ge, Yong Xia, Jiebo Luo, Zhiyong Wang

Specifically, we achieve this by proposing (1) Collaborative Prompts that describe instrument structures via collaborating category-level and part-level texts; (2) Cross-Modal Prompt Encoder that encodes text prompts jointly with visual embeddings into discriminative part-level representations; and (3) Part-to-Whole Adaptive Fusion and Hierarchical Decoding that adaptively fuse the part-level representations into a whole for accurate instrument segmentation in surgical scenarios.

Segmentation Semantic Segmentation

The FruitShell French synthesis system at the Blizzard 2023 Challenge

no code implementations • 1 Sep 2023 • Xin Qi, Xiaopeng Wang, Zhiyong Wang, Wang Liu, Mingming Ding, Shuchen Shi

The evaluation results of our system showed a quality MOS score of 3.6 for the Hub task and 3.4 for the Spoke task, placing our system at an average level among all participating teams.

Data Augmentation Speech Synthesis +1

Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance

no code implementations • 31 Aug 2023 • Zexin Hu, Kun Hu, Clinton Mo, Lei Pan, Zhiyong Wang

Sketch-based terrain generation seeks to create realistic landscapes for virtual environments in various applications such as computer games, animation and virtual reality.

Denoising

Bridging the Gap: Fine-to-Coarse Sketch Interpolation Network for High-Quality Animation Sketch Inbetweening

no code implementations • 25 Aug 2023 • Jiaming Shen, Kun Hu, Wei Bao, Chang Wen Chen, Zhiyong Wang

The 2D animation workflow is typically initiated with the creation of keyframes using sketch-based drawing.

Robust Audio Anti-Spoofing with Fusion-Reconstruction Learning on Multi-Order Spectrograms

1 code implementation • 18 Aug 2023 • Penghui Wen, Kun Hu, Wenxi Yue, Sen Zhang, Wanlei Zhou, Zhiyong Wang

Robust audio anti-spoofing has become increasingly challenging due to recent advancements in deepfake techniques.

Face Swapping

SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation

1 code implementation • 17 Aug 2023 • Wenxi Yue, Jing Zhang, Kun Hu, Yong Xia, Jiebo Luo, Zhiyong Wang

However, we observe two problems with this naive pipeline: (1) the domain gap between natural objects and surgical instruments leads to inferior generalisation of SAM; and (2) SAM relies on precise point or box locations for accurate segmentation, requiring either extensive manual guidance or a well-performing specialist detector for prompt preparation, which leads to a complex multi-stage pipeline.

Image Segmentation Segmentation +1

When Deep Learning Meets Multi-Task Learning in SAR ATR: Simultaneous Target Recognition and Segmentation

no code implementations • 14 Aug 2023 • Chenwei Wang, Jifang Pei, Zhiyong Wang, Yulin Huang, Junjie Wu, Haiguang Yang, Jianyu Yang

In this paper, we propose a new multi-task learning approach for SAR ATR that obtains the accurate category and precise shape of the targets simultaneously.

Learning Theory Multi-Task Learning +1

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark

1 code implementation • NeurIPS 2023 • Zhenfei Yin, Jiong Wang, JianJian Cao, Zhelun Shi, Dingning Liu, Mukai Li, Lu Sheng, Lei Bai, Xiaoshui Huang, Zhiyong Wang, Jing Shao, Wanli Ouyang

To the best of our knowledge, we present one of the very first open-source endeavors in the field, LAMM, encompassing a Language-Assisted Multi-Modal instruction tuning dataset, framework, and benchmark.

Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning

1 code implementation • 6 Jun 2023 • Peggy Tang, Junbin Gao, Lei Zhang, Zhiyong Wang

Compressive text summarisation has recently emerged as a way to balance the conciseness issue of extractive summarisation against the factual hallucination issue of abstractive summarisation.

Hallucination Reinforcement Learning

Full Resolution Repetition Counting

no code implementations • 23 May 2023 • Jianing Li, Bowen Chen, Zhiyong Wang, Honghai Liu

Given an untrimmed video, repetitive action counting aims to estimate the number of repetitions of class-agnostic actions.

Continuous Intermediate Token Learning with Implicit Motion Manifold for Keyframe Based Motion Interpolation

1 code implementation • CVPR 2023 • Clinton Ansun Mo, Kun Hu, Chengjiang Long, Zhiyong Wang

Deriving sophisticated 3D motions from sparse keyframes is a particularly challenging problem, due to the continuity and exceptional skeletal precision required.

Motion Interpolation Motion Synthesis

Multi-Scale Control Signal-Aware Transformer for Motion Synthesis without Phase

no code implementations • 3 Mar 2023 • Lintao Wang, Kun Hu, Lei Bai, Yu Ding, Wanli Ouyang, Zhiyong Wang

As past poses often contain useful auxiliary hints, in this paper we propose a task-agnostic deep learning method, namely the Multi-scale Control Signal-aware Transformer (MCS-T), with an attention-based encoder-decoder architecture that discovers the auxiliary information implicitly, synthesizing controllable motion without explicitly requiring auxiliary information such as phase.

Feature Engineering Motion Synthesis

Efficient Explorative Key-term Selection Strategies for Conversational Contextual Bandits

1 code implementation • 1 Mar 2023 • Zhiyong Wang, Xutong Liu, Shuai Li, John C. S. Lui

To tackle these issues, we first propose "ConLinUCB", a general framework for conversational bandits with better information incorporation, combining arm-level and key-term-level feedback to estimate user preference in one step at each time.

Computational Efficiency Multi-Armed Bandits +1

Robust Knowledge Adaptation for Federated Unsupervised Person ReID

no code implementations • 18 Jan 2023 • Jianfeng Weng, Kun Hu, Tingting Yao, Jingya Wang, Zhiyong Wang

Thus, in this work, a federated unsupervised cluster-contrastive (FedUCC) learning method is proposed for Person ReID.

Federated Learning Person Re-Identification

VAPCNet: Viewpoint-Aware 3D Point Cloud Completion

no code implementations • ICCV 2023 • Zhiheng Fu, Longguang Wang, Lian Xu, Zhiyong Wang, Hamid Laga, Yulan Guo, Farid Boussaid, Mohammed Bennamoun

In this paper, we thus propose an unsupervised viewpoint representation learning scheme for 3D point cloud completion without explicit viewpoint estimation.

Point Cloud Completion Representation Learning +1

Towards Efficient Visual Simplification of Computational Graphs in Deep Neural Networks

no code implementations • 21 Dec 2022 • Rusheng Pan, Zhiyong Wang, Yating Wei, Han Gao, Gongchang Ou, Caleb Chen Cao, Jingli Xu, Tong Xu, Wei Chen

A computational graph in a deep neural network (DNN) denotes a specific data flow diagram (DFD) composed of many tensors and operators.

TLDW: Extreme Multimodal Summarisation of News Videos

no code implementations • 16 Oct 2022 • Peggy Tang, Kun Hu, Lei Zhang, Jiebo Luo, Zhiyong Wang

Multimodal summarisation with multimodal output is drawing increasing attention due to the rapid growth of multimedia data.

Sentence

Multi-level Adversarial Spatio-temporal Learning for Footstep Pressure based FoG Detection

no code implementations • 22 Sep 2022 • Kun Hu, Shaohui Mei, Wei Wang, Kaylena A. Ehgoetz Martens, Liang Wang, Simon J. G. Lewis, David D. Feng, Zhiyong Wang

The proposed scheme also sheds light on improving subject-level clinical studies from other scenarios as it can be integrated with many existing deep architectures.

Skin Lesion Recognition with Class-Hierarchy Regularized Hyperbolic Embeddings

no code implementations • 13 Sep 2022 • Zhen Yu, Toan Nguyen, Yaniv Gal, Lie Ju, Shekhar S. Chandra, Lei Zhang, Paul Bonnington, Victoria Mar, Zhiyong Wang, ZongYuan Ge

Accordingly, the learned prototypes preserve the semantic class relations in the embedding space and we can predict the label of an image by assigning its feature to the nearest hyperbolic class prototype.

Deep Laparoscopic Stereo Matching with Transformers

1 code implementation • 25 Jul 2022 • Xuelian Cheng, Yiran Zhong, Mehrtash Harandi, Tom Drummond, Zhiyong Wang, ZongYuan Ge

The self-attention mechanism, successfully employed with the transformer structure, has shown promise in many computer vision tasks, including image recognition and object detection.

Object Detection +2

Action Recognition With Motion Diversification and Dynamic Selection

no code implementations • TIP 2022 • Peiqin Zhuang, Yu Guo, Zhipeng Yu, Luping Zhou, Lei Bai, Ding Liang, Zhiyong Wang, Yali Wang, Wanli Ouyang

To address this issue, we introduce a Motion Diversification and Selection (MoDS) module to generate diversified spatio-temporal motion features and then select the suitable motion representation dynamically for categorizing the input video.

Action Recognition Computational Efficiency

OTExtSum: Extractive Text Summarisation with Optimal Transport

1 code implementation • Findings (NAACL) 2022 • Peggy Tang, Kun Hu, Rui Yan, Lei Zhang, Junbin Gao, Zhiyong Wang

Optimal sentence extraction is conceptualised as obtaining an optimal summary that minimises the transportation cost to a given document regarding their semantic distributions.

Sentence

Sign Language Translation with Hierarchical Spatio-Temporal Graph Neural Network

no code implementations • 14 Nov 2021 • Jichao Kan, Kun Hu, Markus Hagenbuchner, Ah Chung Tsoi, Mohammed Bennamoun, Zhiyong Wang

Therefore, in this paper, these unique characteristics of sign languages are formulated as hierarchical spatio-temporal graph representations, including high-level and fine-level graphs, in which a vertex characterizes a specified body part and an edge represents the interaction between body parts.

Machine Translation NMT +2

Coupling Matrix Manifolds and Their Applications in Optimal Transport

no code implementations • 15 Nov 2019 • Dai Shi, Junbin Gao, Xia Hong, S. T. Boris Choy, Zhiyong Wang

These geometrical features of CMM have paved the way for developing numerical Riemannian optimization algorithms such as Riemannian gradient descent and Riemannian trust-region algorithms, forming a uniform optimization method for all types of OT problems.

Riemannian optimization

IntersectGAN: Learning Domain Intersection for Generating Images with Multiple Attributes

no code implementations • 21 Sep 2019 • Zehui Yao, Boyan Zhang, Zhiyong Wang, Wanli Ouyang, Dong Xu, Dagan Feng

For example, given two image domains $X_1$ and $X_2$ with certain attributes, the intersection $X_1 \cap X_2$ denotes a new domain where images possess the attributes of both $X_1$ and $X_2$.

Attribute

MRFalign: Protein Homology Detection through Alignment of Markov Random Fields

no code implementations • 12 Jan 2014 • Jianzhu Ma, Sheng Wang, Zhiyong Wang, Jinbo Xu

A sequence profile is usually represented as a position-specific scoring matrix (PSSM) or a Hidden Markov Model (HMM), and accordingly PSSM-PSSM or HMM-HMM comparison is used for homolog detection.

Multiple Sequence Alignment

Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning

no code implementations • 10 Dec 2013 • Jianzhu Ma, Sheng Wang, Zhiyong Wang, Jinbo Xu

To further improve the accuracy of the estimated precision matrices, we employ a supervised learning method to predict contact probability from a variety of evolutionary and non-evolutionary information and then incorporate the predicted probability as a prior into our GGL framework.

Predicting protein contact map using evolutionary and physical constraints by integer programming (extended version)

no code implementations • 8 Aug 2013 • Zhiyong Wang, Jinbo Xu

Most existing methods predict the contact map matrix element by element, ignoring the correlation among contacts and the physical feasibility of the whole contact map.
