Search Results for author: Hao Ding

Found 34 papers, 15 papers with code

dARt Vinci: Egocentric Data Collection for Surgical Robot Learning at Scale

no code implementations7 Mar 2025 Yihao Liu, Yu-Chun Ku, Jiaming Zhang, Hao Ding, Peter Kazanzides, Mehran Armand

The system uses Augmented Reality (AR) hand tracking and a high-fidelity physics engine to capture subtle maneuvers in primitive surgical tasks: By eliminating the need for a physical robot setup and providing flexibility in terms of time, space, and hardware resources-such as multiview sensors and actuators-specialized simulation is a viable alternative.

Imitation Learning

MoSFormer: Augmenting Temporal Context with Memory of Surgery for Surgical Phase Recognition

no code implementations2 Mar 2025 Hao Ding, Xu Lian, Mathias Unberath

We further introduce step filtering to refine history representation and develop a memory caching pipeline to improve training and inference stability, mitigating shortcut learning and overfitting.

counterfactual Counterfactual Inference +1

Kimi k1.5: Scaling Reinforcement Learning with LLMs

2 code implementations22 Jan 2025 Kimi Team, Angang Du, Bofei Gao, Bowei Xing, Changjiu Jiang, Cheng Chen, Cheng Li, Chenjun Xiao, Chenzhuang Du, Chonghua Liao, Chuning Tang, Congcong Wang, Dehao Zhang, Enming Yuan, Enzhe Lu, Fengxiang Tang, Flood Sung, Guangda Wei, Guokun Lai, Haiqing Guo, Han Zhu, Hao Ding, Hao Hu, Hao Yang, Hao Zhang, Haotian Yao, Haotian Zhao, Haoyu Lu, Haoze Li, Haozhen Yu, Hongcheng Gao, Huabin Zheng, Huan Yuan, Jia Chen, Jianhang Guo, Jianlin Su, Jianzhou Wang, Jie Zhao, Jin Zhang, Jingyuan Liu, Junjie Yan, Junyan Wu, Lidong Shi, Ling Ye, Longhui Yu, Mengnan Dong, Neo Zhang, Ningchen Ma, Qiwei Pan, Qucheng Gong, Shaowei Liu, Shengling Ma, Shupeng Wei, Sihan Cao, Siying Huang, Tao Jiang, Weihao Gao, Weimin Xiong, Weiran He, Weixiao Huang, Wenhao Wu, Wenyang He, Xianghui Wei, Xianqing Jia, Xingzhe Wu, Xinran Xu, Xinxing Zu, Xinyu Zhou, Xuehai Pan, Y. Charles, Yang Li, Yangyang Hu, Yangyang Liu, Yanru Chen, Yejie Wang, Yibo Liu, Yidao Qin, Yifeng Liu, Ying Yang, Yiping Bao, Yulun Du, Yuxin Wu, Yuzhi Wang, Zaida Zhou, Zhaoji Wang, Zhaowei Li, Zhen Zhu, Zheng Zhang, Zhexu Wang, Zhilin Yang, Zhiqi Huang, Zihao Huang, Ziyao Xu, Zonghan Yang

Moreover, we present effective long2short methods that use long-CoT techniques to improve short-CoT models, yielding state-of-the-art short-CoT reasoning results -- e. g., 60. 8 on AIME, 94. 6 on MATH500, 47. 3 on LiveCodeBench -- outperforming existing short-CoT models such as GPT-4o and Claude Sonnet 3. 5 by a large margin (up to +550%).

Math reinforcement-learning +2

Neural Finite-State Machines for Surgical Phase Recognition

no code implementations27 Nov 2024 Hao Ding, Zhongpai Gao, Benjamin Planche, Tianyu Luan, Abhishek Sharma, Meng Zheng, Ange Lou, Terrence Chen, Mathias Unberath, Ziyan Wu

Surgical phase recognition (SPR) is crucial for applications in workflow optimization, performance evaluation, and real-time intervention guidance.

Surgical phase recognition

Language-Model Prior Overcomes Cold-Start Items

1 code implementation13 Nov 2024 Shiyu Wang, Hao Ding, Yupeng Gu, Sergul Aydore, Kousha Kalantari, Branislav Kveton

Existing solutions for the cold-start problem, such as content-based recommenders and hybrid methods, leverage item metadata to determine item similarities.

Collaborative Filtering Language Modeling +3

Towards Robust Algorithms for Surgical Phase Recognition via Digital Twin-based Scene Representation

no code implementations26 Oct 2024 Hao Ding, Yuqian Zhang, Hongchao Shu, Xu Lian, Ji Woong Kim, Axel Krieger, Mathias Unberath

This approach takes advantage of the recent vision foundation models that ensure reliable low-level scene understanding to craft DT-based scene representations that support various high-level tasks.

Informativeness Scene Understanding +1

DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction

no code implementations18 Oct 2024 Ange Lou, Benjamin Planche, Zhongpai Gao, Yamin Li, Tianyu Luan, Hao Ding, Meng Zheng, Terrence Chen, Ziyan Wu, Jack Noble

Numerous recent approaches to modeling and re-rendering dynamic scenes leverage plane-based explicit representations, addressing slow training times associated with models like neural radiance fields (NeRF) and Gaussian splatting (GS).

NeRF Novel View Synthesis

Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data

1 code implementation10 Oct 2024 Can Wang, Dianbo Sui, Hongliang Sun, Hao Ding, Bolin Zhang, Zhiying Tu

This paper introduces a novel method to estimate the performance of LLM services across different tasks and contexts, which can be "plug-and-play" utilizing only a few unlabeled samples like ICL.

In-Context Learning Language Modeling +2

Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation

no code implementations7 Aug 2024 Yiqing Shen, Hao Ding, Xinyuan Shao, Mathias Unberath

Fully supervised deep learning (DL) models for surgical video segmentation have been shown to struggle with non-adversarial, real-world corruptions of image quality including smoke, bleeding, and low illumination.

Adversarial Robustness Image Segmentation +7

SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions -- An EndoVis'24 Challenge

1 code implementation16 Jul 2024 Hao Ding, Tuxun Lu, Yuqian Zhang, Ruixing Liang, Hongchao Shu, Lalithkumar Seenivasan, Yonghao Long, Qi Dou, Cong Gao, Mathias Unberath

To address this limitation, we introduce the SegSTRONG-C challenge that aims to promote the development of algorithms robust to unforeseen but plausible image corruptions of surgery, like smoke, bleeding, and low brightness.

Data Augmentation Segmentation

Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images

no code implementations12 Jul 2024 Tianyu Luan, Zhongpai Gao, Luyuan Xie, Abhishek Sharma, Hao Ding, Benjamin Planche, Meng Zheng, Ange Lou, Terrence Chen, Junsong Yuan, Ziyan Wu

Traditional top-down methods, relying on whole-body parametric models like SMPL, falter when only a small part of the human is visible, as they require visibility of most of the human body for accurate mesh reconstruction.

DaReNeRF: Direction-aware Representation for Dynamic Scenes

no code implementations CVPR 2024 Ange Lou, Benjamin Planche, Zhongpai Gao, Yamin Li, Tianyu Luan, Hao Ding, Terrence Chen, Jack Noble, Ziyan Wu

However, the straightforward decomposition of 4D dynamic scenes into multiple 2D plane-based representations proves insufficient for re-rendering high-fidelity scenes with complex motions.

NeRF Novel View Synthesis

Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs

no code implementations22 Dec 2023 Behnam Rahdari, Hao Ding, Ziwei Fan, Yifei Ma, Zhuotong Chen, Anoop Deoras, Branislav Kveton

The unique capabilities of Large Language Models (LLMs), such as the natural language text generation ability, position them as strong candidates for providing explanation for recommendations.

Explanation Generation Position +1

Pre-trained Recommender Systems: A Causal Debiasing Perspective

1 code implementation30 Oct 2023 Ziqian Lin, Hao Ding, Nghia Trong Hoang, Branislav Kveton, Anoop Deoras, Hao Wang

In particular, we propose to develop a generic recommender that captures universal interaction patterns by training on generic user-item interaction data extracted from different domains, which can then be fast adapted to improve few-shot learning performance in unseen new domains (with limited data).

Few-Shot Learning Recommendation Systems

Personalized Federated Domain Adaptation for Item-to-Item Recommendation

no code implementations5 Jun 2023 Ziwei Fan, Hao Ding, Anoop Deoras, Trong Nghia Hoang

To mitigate this data bottleneck, we postulate that recommendation patterns learned from existing mature market segments (with private data) could be adapted to build effective warm-start models for emerging ones.

Domain Adaptation Personalized Federated Learning +1

GCNSLIM: Graph Convolutional Network with Sparse Linear Methods for E-government Service Recommendation

1 code implementation15 May 2023 Lingyuan Kong, Hao Ding, Guangwei Hu

However, existing GCN-based CF methods are mainly based on matrix factorization and incorporate some optimization tech-niques to enhance performance, which are not enough to handle the complexities of diverse real-world recommendation scenarios.

Music Recommendation

Few-shot Medical Image Segmentation with Cycle-resemblance Attention

no code implementations7 Dec 2022 Hao Ding, Changchang Sun, Hao Tang, Dawen Cai, Yan Yan

Recently, due to the increasing requirements of medical imaging applications and the professional requirements of annotating medical images, few-shot learning has gained increasing attention in the medical image semantic segmentation field.

Few-Shot Learning Image Segmentation +4

Rethinking Causality-driven Robot Tool Segmentation with Temporal Constraints

1 code implementation30 Nov 2022 Hao Ding, Jie Ying Wu, Zhaoshuo Li, Mathias Unberath

Method: To address the above limitations, we take temporal relation into consideration and propose a temporal causal model for robot tool segmentation on video sequences.

counterfactual Segmentation

CaRTS: Causality-driven Robot Tool Segmentation from Vision and Kinematics Data

1 code implementation15 Mar 2022 Hao Ding, Jintan Zhang, Peter Kazanzides, Jie Ying Wu, Mathias Unberath

Vision-based segmentation of the robotic tool during robot-assisted surgery enables downstream applications, such as augmented reality feedback, while allowing for inaccuracies in robot kinematics.

counterfactual Segmentation

Language Models as Recommender Systems: Evaluations and Limitations

no code implementations NeurIPS Workshop ICBINB 2021 Yuhui Zhang, Hao Ding, Zeren Shui, Yifei Ma, James Zou, Anoop Deoras, Hao Wang

Pre-trained language models (PLMs) such as BERT and GPT learn general text representations and encode extensive world knowledge; thus, they can be efficiently and accurately adapted to various downstream tasks.

Movie Recommendation Session-Based Recommendations +1

On the Sins of Image Synthesis Loss for Self-supervised Depth Estimation

no code implementations13 Sep 2021 Zhaoshuo Li, Nathan Drenkow, Hao Ding, Andy S. Ding, Alexander Lu, Francis X. Creighton, Russell H. Taylor, Mathias Unberath

It is based on the idea that observed frames can be synthesized from neighboring frames if accurate depth of the scene is known - or in this case, estimated.

Attribute Depth Estimation +3

Influence Selection for Active Learning

1 code implementation ICCV 2021 Zhuoming Liu, Hao Ding, Huaping Zhong, Weijia Li, Jifeng Dai, Conghui He

To obtain the Influence of the unlabeled sample in the active learning scenario, we design the Untrained Unlabeled sample Influence Calculation(UUIC) to estimate the unlabeled sample's expected gradient with which we calculate its Influence.

Active Learning Diversity

Zero-Shot Recommender Systems

no code implementations18 May 2021 Hao Ding, Yifei Ma, Anoop Deoras, Yuyang Wang, Hao Wang

This poses a chicken-and-egg problem for early-stage products, whose amount of data, in turn, relies on the performance of their RS.

Recommendation Systems Zero-Shot Learning

Recurrent Exploration Networks for Recommender Systems

no code implementations1 Jan 2021 Hao Wang, Yifei Ma, Hao Ding, Bernie Wang

Recurrent neural networks have proven effective in modeling sequential user feedbacks for recommender systems.

Recommendation Systems Representation Learning

Deeply Shape-guided Cascade for Instance Segmentation

1 code implementation CVPR 2021 Hao Ding, Siyuan Qiao, Alan Yuille, Wei Shen

The key to a successful cascade architecture for precise instance segmentation is to fully leverage the relationship between bounding box detection and mask segmentation across multiple stages.

Instance Segmentation Region Proposal +2

Unsupervised Inductive Graph-Level Representation Learning via Graph-Graph Proximity

1 code implementation1 Apr 2019 Yunsheng Bai, Hao Ding, Yang Qiao, Agustin Marinovic, Ken Gu, Ting Chen, Yizhou Sun, Wei Wang

We introduce a novel approach to graph-level representation learning, which is to embed an entire graph into a vector space where the embeddings of two graphs preserve their graph-graph proximity.

General Classification Graph Classification +3

Convolutional Set Matching for Graph Similarity

1 code implementation23 Oct 2018 Yunsheng Bai, Hao Ding, Yizhou Sun, Wei Wang

We introduce GSimCNN (Graph Similarity Computation via Convolutional Neural Networks) for predicting the similarity score between two graphs.

Graph Similarity set matching

Learning-based Efficient Graph Similarity Computation via Multi-Scale Convolutional Set Matching

1 code implementation10 Sep 2018 Yunsheng Bai, Hao Ding, Yizhou Sun, Wei Wang

Since computing the exact distance/similarity between two graphs is typically NP-hard, a series of approximate methods have been proposed with a trade-off between accuracy and speed.

Clustering Combinatorial Optimization +5

Multi-Way Multi-Level Kernel Modeling for Neuroimaging Classification

no code implementations CVPR 2017 Lifang He, Chun-Ta Lu, Hao Ding, Shen Wang, Linlin Shen, Philip S. Yu, Ann B. Ragin

Owing to prominence as a diagnostic tool for probing the neural correlates of cognition, neuroimaging tensor data has been the focus of intense investigation.

Classification Diagnostic +1

Learning from Multi-View Multi-Way Data via Structural Factorization Machines

no code implementations10 Apr 2017 Chun-Ta Lu, Lifang He, Hao Ding, Bokai Cao, Philip S. Yu

Real-world relations among entities can often be observed and determined by different perspectives/views.

Cannot find the paper you are looking for? You can Submit a new open access paper.