Search Results for author: Ziheng Wang

Found 39 papers, 10 papers with code

MARMOT: Masked Autoencoder for Modeling Transient Imaging

no code implementations10 Jun 2025 Siyuan Shen, Ziheng Wang, Xingyue Peng, Suan Xia, Ruiqian Li, Shiying Li, Jingyi Yu

Our MARMOT is a self-supervised model pretrianed on massive and diverse NLOS transient datasets.

Decoder

A New Segment Routing method with Swap Node Selection Strategy Based on Deep Reinforcement Learning for Software Defined Network

no code implementations21 Mar 2025 Miao Ye, Jihao Zheng, Qiuxiang Jiang, Yuan Huang, Ziheng Wang, Yong Wang

The existing segment routing (SR) methods need to determine the routing first and then use path segmentation approaches to select swap nodes to form a segment routing path (SRP).

Deep Reinforcement Learning

TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM

1 code implementation17 Mar 2025 Ye Wang, Boshen Xu, Zihao Yue, Zihan Xiao, Ziheng Wang, Liang Zhang, Dingyi Yang, Wenxuan Wang, Qin Jin

We introduce TimeZero, a reasoning-guided LVLM designed for the temporal video grounding (TVG) task.

Video Grounding

TransiT: Transient Transformer for Non-line-of-sight Videography

no code implementations14 Mar 2025 Ruiqian Li, Siyuan Shen, Suan Xia, Ziheng Wang, Xingyue Peng, Chengxuan Song, Yingsheng Zhu, Tao Wu, Shiying Li, Jingyi Yu

High frame rates, for example, can be achieved by reducing either per-point scanning time or scanning density, but at the cost of lowering the information density at individual frames.

Autonomous Navigation Transfer Learning

Solving the long-tailed distribution problem by exploiting the synergies and balance of different techniques

no code implementations23 Jan 2025 Ziheng Wang, Toni Lassila, Sharib Ali

While many studies have sought to improve long tail recognition by altering the data distribution in the feature space and adjusting model decision boundaries, research on the synergy and corrective approach among various methods is limited.

Contrastive Learning

Surgical Visual Understanding (SurgVU) Dataset

no code implementations16 Jan 2025 Aneeq Zia, Max Berniker, Rogerio Nespolo, Conor Perreault, Ziheng Wang, Benjamin Mueller, Ryan Schmidt, Kiran Bhattacharyya, Xi Liu, Anthony Jarc

Owing to recent advances in machine learning and the ability to harvest large amounts of data during robotic-assisted surgeries, surgical data science is ripe for foundational work.

Synthetic Data Generation for Residential Load Patterns via Recurrent GAN and Ensemble Method

no code implementations20 Oct 2024 Xinyu Liang, Ziheng Wang, Hao Wang

Our developed ERGAN can capture diverse load patterns across various households, thereby enhancing the realism and diversity of the synthetic data generated.

Diversity Generative Adversarial Network +1

Adaptive Resolution Inference (ARI): Energy-Efficient Machine Learning for Internet of Things

no code implementations26 Aug 2024 Ziheng Wang, Pedro Reviriego, Farzad Niknia, Javier Conde, Shanshan Liu, Fabrizio Lombardi

This enables most inferences to run with the reduced precision model and only a small fraction requires the full model, so significantly reducing computation and energy while not affecting model performance.

Quantization

MiranDa: Mimicking the Learning Processes of Human Doctors to Achieve Causal Inference for Medication Recommendation

1 code implementation23 Jul 2024 Ziheng Wang, Xinhe Li, Haruki Momma, Ryoichi Nagatomi

To enhance therapeutic outcomes from a pharmacological perspective, we propose MiranDa, designed for medication recommendation, which is the first actionable model capable of providing the estimated length of stay in hospitals (ELOS) as counterfactual outcomes that guide clinical practice and model training.

Causal Inference counterfactual

EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions?

1 code implementation28 May 2024 Boshen Xu, Ziheng Wang, Yang Du, Zhinan Song, Sipeng Zheng, Qin Jin

Due to the occurrence of diverse EgoHOIs in the real world, we propose an open-vocabulary benchmark named EgoHOIBench to reveal the diminished performance of current egocentric video-language models (EgoVLM) on fined-grained concepts, indicating that these models still lack a full spectrum of egocentric understanding.

Action Recognition Attribute +2

Movie101v2: Improved Movie Narration Benchmark

1 code implementation20 Apr 2024 Zihao Yue, Yepeng Zhang, Ziheng Wang, Qin Jin

Automatic movie narration aims to generate video-aligned plot descriptions to assist visually impaired audiences.

Video Captioning

Weak Convergence Analysis of Online Neural Actor-Critic Algorithms

no code implementations25 Mar 2024 Samuel Chun-Hei Lam, Justin Sirignano, Ziheng Wang

Then, using a Poisson equation, we prove that the fluctuations of the model updates around the limit distribution due to the randomly-arriving data samples vanish as the number of parameter updates $\rightarrow \infty$.

An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs

1 code implementation28 Jun 2023 Haihao Shen, Hengyu Meng, Bo Dong, Zhe Wang, Ofir Zafrir, Yi Ding, Yu Luo, Hanwen Chang, Qun Gao, Ziheng Wang, Guy Boudoukh, Moshe Wasserblat

We apply our sparse accelerator on widely-used Transformer-based language models including Bert-Mini, DistilBERT, Bert-Base, and BERT-Large.

Model Compression

Concurrent Classifier Error Detection (CCED) in Large Scale Machine Learning Systems

no code implementations2 Jun 2023 Pedro Reviriego, Ziheng Wang, Alvaro Alonso, Zhen Gao, Farzad Niknia, Shanshan Liu, Fabrizio Lombardi

In this paper, we introduce Concurrent Classifier Error Detection (CCED), a scheme to implement CED in ML systems using a concurrent ML classifier to detect errors.

image-classification Image Classification

Movie101: A New Movie Understanding Benchmark

1 code implementation20 May 2023 Zihao Yue, Qi Zhang, Anwen Hu, Liang Zhang, Ziheng Wang, Qin Jin

Closer to real scenarios, the Movie Clip Narrating (MCN) task in our benchmark asks models to generate role-aware narration paragraphs for complete movie clips where no actors are speaking.

Video Captioning

Edit As You Wish: Video Caption Editing with Multi-grained User Control

1 code implementation15 May 2023 Linli Yao, Yuanmeng Zhang, Ziheng Wang, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Xu sun, Qin Jin

In this paper, we propose a novel \textbf{V}ideo \textbf{C}aption \textbf{E}diting \textbf{(VCE)} task to automatically revise an existing video description guided by multi-grained user requests.

Attribute Position +4

Uncertainty-aware Self-supervised Learning for Cross-domain Technical Skill Assessment in Robot-assisted Surgery

no code implementations28 Apr 2023 Ziheng Wang, Andrea Mariani, Arianna Menciassi, Elena De Momi, Ann Majewicz Fey

In this paper, we propose a novel approach for skill assessment by transferring domain knowledge from labeled kinematic data to unlabeled data.

Self-Supervised Learning

Automatic Detection of Out-of-body Frames in Surgical Videos for Privacy Protection Using Self-supervised Learning and Minimal Labels

no code implementations31 Mar 2023 Ziheng Wang, Conor Perreault, Xi Liu, Anthony Jarc

Endoscopic video recordings are widely used in minimally invasive robot-assisted surgery, but when the endoscope is outside the patient's body, it can capture irrelevant segments that may contain sensitive information.

Self-Supervised Learning

A Forward Propagation Algorithm for Online Optimization of Nonlinear Stochastic Differential Equations

no code implementations10 Jul 2022 Ziheng Wang, Justin Sirignano

We then re-write the algorithm using the PDE solution, which allows us to characterize the parameter evolution around the direction of steepest descent.

Continuous-time stochastic gradient descent for optimizing over the stationary distribution of stochastic differential equations

no code implementations14 Feb 2022 Ziheng Wang, Justin Sirignano

The gradient estimate is simultaneously updated using forward propagation of the SDE state derivatives, asymptotically converging to the direction of steepest descent.

SparseDNN: Fast Sparse Deep Learning Inference on CPUs

1 code implementation20 Jan 2021 Ziheng Wang

While we find mature support for quantized neural networks in production frameworks such as OpenVINO and MNN, support for pruned sparse neural networks is still lacking.

Deep Learning Quantization

SparseRT: Accelerating Unstructured Sparsity on GPUs for Deep Learning Inference

no code implementations26 Aug 2020 Ziheng Wang

In recent years, there has been a flurry of research in deep neural network pruning and compression.

Deep Learning Network Pruning

Structured Pruning of Large Language Models

2 code implementations EMNLP 2020 Ziheng Wang, Jeremy Wohlwend, Tao Lei

Large language models have recently achieved state of the art performance across a wide variety of natural language tasks.

Language Modeling Language Modelling +2

Accelerated CNN Training Through Gradient Approximation

no code implementations15 Aug 2019 Ziheng Wang, Sree Harsha Nelaturu

In this work, we explore three alternative methods to approximate gradients, with an efficient GPU kernel implementation for one of them.

Transferrable Operative Difficulty Assessment in Robot-assisted Teleoperation: A Domain Adaptation Approach

no code implementations12 Jun 2019 Ziheng Wang, Cong Feng, Jie Zhang, Ann Majewicz Fey

Providing an accurate and efficient assessment of operative difficulty is important for designing robot-assisted teleoperation interfaces that are easy and natural for human operators to use.

Steering Control Unsupervised Domain Adaptation

SATR-DL: Improving Surgical Skill Assessment and Task Recognition in Robot-assisted Surgery with Deep Neural Networks

no code implementations15 Jun 2018 Ziheng Wang, Ann Majewicz Fey

Purpose: This paper focuses on an automated analysis of surgical motion profiles for objective skill assessment and task recognition in robot-assisted surgery.

Representation Learning

A Hierarchical Probabilistic Model for Facial Feature Detection

no code implementations CVPR 2014 Yue Wu, Ziheng Wang, Qiang Ji

Facial feature detection from facial images has attracted great attention in the field of computer vision.

model parameter estimation

Structured Feature Selection

no code implementations ICCV 2015 Tian Gao, Ziheng Wang, Qiang Ji

Then we apply structured feature selection to two applications: 1) We introduce a new method that enables STMB to scale up and show the competitive performance of our algorithms on large-scale image classification tasks.

Dimensionality Reduction feature selection +3

Classifier Learning With Hidden Information

no code implementations CVPR 2015 Ziheng Wang, Qiang Ji

Experimental results on different applications demonstrate the effectiveness of the proposed methods for exploiting hidden information and their superior performance to existing methods.

Cannot find the paper you are looking for? You can Submit a new open access paper.