Search Results for author: Ping Hu

Found 24 papers, 5 papers with code

Many-to-many Splatting for Efficient Video Frame Interpolation

1 code implementation CVPR 2022 Ping Hu, Simon Niklaus, Stan Sclaroff, Kate Saenko

Motion-based video frame interpolation commonly relies on optical flow to warp pixels from the inputs to the desired interpolation instant.

Motion Estimation Optical Flow Estimation +1

Real-time Semantic Segmentation with Fast Attention

1 code implementation7 Jul 2020 Ping Hu, Federico Perazzi, Fabian Caba Heilbron, Oliver Wang, Zhe Lin, Kate Saenko, Stan Sclaroff

The proposed architecture relies on our fast spatial attention, which is a simple yet efficient modification of the popular self-attention mechanism and captures the same rich spatial context at a small fraction of the computational cost, by changing the order of operations.

Real-Time Semantic Segmentation Segmentation

ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes

1 code implementation CVPR 2022 Dina Bashkirova, Mohamed Abdelfattah, Ziliang Zhu, James Akl, Fadi Alladkani, Ping Hu, Vitaly Ablavsky, Berk Calli, Sarah Adel Bargal, Kate Saenko

Recyclable waste detection poses a unique computer vision challenge as it requires detection of highly deformable and often translucent objects in cluttered scenes without the kind of context information usually present in human-centric datasets.

Object object-detection +5

Motion-Guided Cascaded Refinement Network for Video Object Segmentation

no code implementations CVPR 2018 Ping Hu, Gang Wang, Xiangfei Kong, Jason Kuen, Yap-Peng Tan

Then, the proposed Cascaded Refinement Network(CRN) takes the coarse segmentation as guidance to generate an accurate segmentation of full resolution.

Object Optical Flow Estimation +4

Global Context-Aware Attention LSTM Networks for 3D Action Recognition

no code implementations CVPR 2017 Jun Liu, Gang Wang, Ping Hu, Ling-Yu Duan, Alex C. Kot

Hence we propose a new class of LSTM network, Global Context-Aware Attention LSTM (GCA-LSTM), for 3D action recognition, which is able to selectively focus on the informative joints in the action sequence with the assistance of global contextual information.

Action Analysis One-Shot 3D Action Recognition +1

Deep Level Sets for Salient Object Detection

no code implementations CVPR 2017 Ping Hu, Bing Shuai, Jun Liu, Gang Wang

Our method drives the network to learn a Level Set function for salient objects so it can output more accurate boundaries and compact saliency.

Object object-detection +3

Uncertainty-Aware Learning for Zero-Shot Semantic Segmentation

no code implementations NeurIPS 2020 Ping Hu, Stan Sclaroff, Kate Saenko

Recently, most ZSS methods focus on learning the visual-semantic correspondence to transfer knowledge from seen classes to unseen classes at the pixel level.

Semantic correspondence Semantic Segmentation +1

Tilings in graphons

no code implementations9 Jun 2016 Jan Hladky, Ping Hu, Diana Piguet

We introduce a counterpart to the notion of vertex disjoint tilings by copy of a fixed graph F to the setting of graphons.

Combinatorics

Conterfactual Generative Zero-Shot Semantic Segmentation

no code implementations11 Jun 2021 Feihong Shen, Jun Liu, Ping Hu

In this work, we consider counterfactual methods to avoid the confounder in the original model.

Causal Inference counterfactual +4

Learning to Detect Every Thing in an Open World

no code implementations3 Dec 2021 Kuniaki Saito, Ping Hu, Trevor Darrell, Kate Saenko

LDET leads to significant improvements on many datasets in the open-world instance segmentation task, outperforming baselines on cross-category generalization on COCO, as well as cross-dataset evaluation on UVO and Cityscapes.

Data Augmentation object-detection +3

Splatting-based Synthesis for Video Frame Interpolation

no code implementations25 Jan 2022 Simon Niklaus, Ping Hu, Jiawen Chen

Frame interpolation is an essential video processing technique that adjusts the temporal resolution of an image sequence.

Optical Flow Estimation Video Frame Interpolation

Optimal Aggregation Strategies for Social Learning over Graphs

no code implementations14 Mar 2022 Ping Hu, Virginia Bordignon, Stefan Vlaski, Ali H. Sayed

Adaptive social learning is a useful tool for studying distributed decision-making problems over graphs.

Decision Making

Token Boosting for Robust Self-Supervised Visual Transformer Pre-training

no code implementations CVPR 2023 Tianjiao Li, Lin Geng Foo, Ping Hu, Xindi Shang, Hossein Rahmani, Zehuan Yuan, Jun Liu

Pre-training VTs on such corrupted data can be challenging, especially when we pre-train via the masked autoencoding approach, where both the inputs and masked ``ground truth" targets can potentially be unreliable in this case.

Non-Asymptotic Performance of Social Machine Learning Under Limited Data

no code implementations15 Jun 2023 Ping Hu, Virginia Bordignon, Mert Kayaalp, Ali H. Sayed

This paper studies the probability of error associated with the social machine learning framework, which involves an independent training phase followed by a cooperative decision-making phase over a graph.

Classification Decision Making

DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition with Limited Annotations

no code implementations3 Aug 2023 Ping Hu, Ximeng Sun, Stan Sclaroff, Kate Saenko

Previous works have focused on learning the alignment between textual and visual spaces to compensate for limited image labels, yet may suffer from reduced accuracy due to the scarcity of high-quality multi-label annotations.

Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation

no code implementations ICCV 2023 Duo Peng, Ping Hu, Qiuhong Ke, Jun Liu

Translating images from a source domain to a target domain for learning target models is one of the most common strategies in domain adaptive semantic segmentation (DASS).

Denoising Semantic Segmentation +1

GAIT: Generating Aesthetic Indoor Tours with Deep Reinforcement Learning

no code implementations ICCV 2023 Desai Xie, Ping Hu, Xin Sun, Soren Pirk, Jianming Zhang, Radomir Mech, Arie E. Kaufman

Placing and orienting a camera to compose aesthetically meaningful shots of a scene is not only a key objective in real-world photography and cinematography but also for virtual content creation.

Mixed Reality reinforcement-learning

Video Frame Interpolation with Many-to-many Splatting and Spatial Selective Refinement

no code implementations29 Oct 2023 Ping Hu, Simon Niklaus, Lu Zhang, Stan Sclaroff, Kate Saenko

In this work, we first propose a fully differentiable Many-to-Many (M2M) splatting framework to interpolate frames efficiently.

Computational Efficiency Motion Estimation +1

Adaptive Multi-Modality Prompt Learning

no code implementations30 Nov 2023 Zongqian Wu, Yujing Liu, Mengmeng Zhan, Jialie Shen, Ping Hu, Xiaofeng Zhu

Although current prompt learning methods have successfully been designed to effectively reuse the large pre-trained models without fine-tuning their large number of parameters, they still have limitations to be addressed, i. e., without considering the adverse impact of meaningless patches in every image and without simultaneously considering in-sample generalization and out-of-sample generalization.

Deep Generative Modeling for Financial Time Series with Application in VaR: A Comparative Review

no code implementations18 Jan 2024 Lars Ericson, Xuejun Zhu, Xusi Han, Rao Fu, Shuang Li, Steve Guo, Ping Hu

The objectives for financial time series generation are to generate synthetic data paths with good variety, and similar distribution and dynamics to the original historical data.

Time Series Time Series Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.