Search Results for author: Hao Fang

Found 46 papers, 20 papers with code

Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track

no code implementations19 Aug 2024 Feiyu Pan, Hao Fang, Runmin Cong, Wei zhang, Xiankai Lu

Video Object Segmentation (VOS) task aims to segmenting a particular object instance throughout the entire video sequence given only the object mask of the first frame.

Object Segmentation +4

UNINEXT-Cutie: The 1st Solution for LSVOS Challenge RVOS Track

no code implementations19 Aug 2024 Hao Fang, Feiyu Pan, Xiankai Lu, Wei zhang, Runmin Cong

Referring video object segmentation (RVOS) relies on natural language expressions to segment target objects in video.

Referring Video Object Segmentation Semantic Segmentation +1

A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks

2 code implementations18 Jul 2024 Yixiang Qiu, Hao Fang, Hongyao Yu, Bin Chen, Meikang Qiu, Shu-Tao Xia

Model Inversion (MI) attacks aim to reconstruct privacy-sensitive training data from released models by utilizing output information, raising extensive concerns about the security of Deep Neural Networks (DNNs).

CLIP-Guided Networks for Transferable Targeted Attacks

1 code implementation14 Jul 2024 Hao Fang, Jiawei Kong, Bin Chen, Tao Dai, Hao Wu, Shu-Tao Xia

Transferable targeted adversarial attacks aim to mislead models into outputting adversary-specified predictions in black-box scenarios.

Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation

1 code implementation10 Jul 2024 Hao Fang, Peng Wu, Yawei Li, Xinxin Zhang, Xiankai Lu

We discover that the domain gap between the VLM features (e. g., CLIP) and the instance queries and the underutilization of temporal consistency are two central causes.

Instance Segmentation Semantic Segmentation +2

Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

no code implementations9 Jun 2024 Xinhao Zhong, Hao Fang, Bin Chen, Xulin Gu, Tao Dai, Meikang Qiu, Shu-Tao Xia

Dataset distillation is an emerging dataset reduction method, which condenses large-scale datasets while maintaining task accuracy.

Dataset Distillation

One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models

no code implementations8 Jun 2024 Hao Fang, Jiawei Kong, Wenbo Yu, Bin Chen, Jiawei Li, Shutao Xia, Ke Xu

To this end, we revisit the multimodal alignments in VLP model training and propose the Contrastive-training Perturbation Generator with Cross-modal conditions (C-PGC).

Contrastive Learning

3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation

no code implementations7 Jun 2024 Feiyu Pan, Hao Fang, Xiankai Lu

The current RVOS methods typically use independently pre-trained vision and language models as backbones, resulting in a significant domain gap between video and text.

Referring Video Object Segmentation Semantic Segmentation +2

GI-NAS: Boosting Gradient Inversion Attacks through Adaptive Neural Architecture Search

no code implementations31 May 2024 Wenbo Yu, Hao Fang, Bin Chen, Xiaohang Sui, Chuan Chen, Hao Wu, Shu-Tao Xia, Ke Xu

In this paper, we further exploit such implicit prior knowledge by proposing Gradient Inversion via Neural Architecture Search (GI-NAS), which adaptively searches the network and captures the implicit priors behind neural architectures.

Federated Learning Neural Architecture Search

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

1 code implementation7 Mar 2024 Boshi Wang, Hao Fang, Jason Eisner, Benjamin Van Durme, Yu Su

We find that existing LLMs, including GPT-4 and open-source LLMs specifically fine-tuned for tool use, only reach a correctness rate in the range of 30% to 60%, far from reliable use in practice.

Continual Learning In-Context Learning

Inverse Optimal Control for Linear Quadratic Tracking with Unknown Target States

no code implementations27 Feb 2024 Yao Li, Chengpu Yu, Hao Fang, Jie Chen

A computationally efficient and numerically reliable parameter identification algorithm is proposed by equating optimal control strategies with a system of linear equations, and the associated relative error upper bound is derived in terms of data volume and signal-to-noise ratio.

Privacy Leakage on DNNs: A Survey of Model Inversion Attacks and Defenses

1 code implementation6 Feb 2024 Hao Fang, Yixiang Qiu, Hongyao Yu, Wenbo Yu, Jiawei Kong, Baoli Chong, Bin Chen, Xuan Wang, Shu-Tao Xia, Ke Xu

However, Model Inversion (MI) attacks, which disclose private information about the training dataset by abusing access to the trained models, have emerged as a formidable privacy threat.

Few-Shot Adaptation for Parsing Contextual Utterances with LLMs

1 code implementation18 Sep 2023 Kevin Lin, Patrick Xia, Hao Fang

We evaluate the ability of semantic parsers based on large language models (LLMs) to handle contextual utterances.

In-Context Learning Semantic Parsing

GIFD: A Generative Gradient Inversion Method with Feature Domain Optimization

1 code implementation ICCV 2023 Hao Fang, Bin Chen, Xuan Wang, Zhi Wang, Shu-Tao Xia

Federated Learning (FL) has recently emerged as a promising distributed machine learning framework to preserve clients' privacy, by allowing multiple clients to upload the gradients calculated from their local data to a central server.

Federated Learning Image Generation

Natural Language Decomposition and Interpretation of Complex Utterances

no code implementations15 May 2023 Harsh Jhamtani, Hao Fang, Patrick Xia, Eran Levy, Jacob Andreas, Ben Van Durme

Designing natural language interfaces has historically required collecting supervised data to translate user requests into carefully designed intent representations.

Language Modelling

Surveillance Face Presentation Attack Detection Challenge

no code implementations15 Apr 2023 Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Zhen Lei

Based on this dataset and protocol-$3$ for evaluating the robustness of the algorithm under quality changes, we organized a face presentation attack detection challenge in surveillance scenarios.

Face Anti-Spoofing Face Presentation Attack Detection +1

Surveillance Face Anti-spoofing

no code implementations3 Jan 2023 Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Chenxu Zhao, Xu Zhang, Stan Z. Li, Zhen Lei

In order to promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks.

Contrastive Learning Face Anti-Spoofing +2

Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question Answering

1 code implementation11 Oct 2022 Hao Cheng, Hao Fang, Xiaodong Liu, Jianfeng Gao

Given its effectiveness on knowledge-intensive natural language processing tasks, dense retrieval models have become increasingly popular.

Open-Domain Question Answering Retrieval

Unified Control Framework for Real-Time Interception and Obstacle Avoidance of Fast-Moving Objects with Diffusion Variational Autoencoder

no code implementations27 Sep 2022 Apan Dastider, Hao Fang, Mingjie Lin

Real-time interception of fast-moving objects by robotic arms in dynamic environments poses a formidable challenge due to the need for rapid reaction times, often within milliseconds, amidst dynamic obstacles.

Motion Planning Navigate

When More Data Hurts: A Troubling Quirk in Developing Broad-Coverage Natural Language Understanding Systems

1 code implementation24 May 2022 Elias Stengel-Eskin, Emmanouil Antonios Platanios, Adam Pauls, Sam Thomson, Hao Fang, Benjamin Van Durme, Jason Eisner, Yu Su

Rejecting class imbalance as the sole culprit, we reveal that the trend is closely associated with an effect we call source signal dilution, where strong lexical cues for the new symbol become diluted as the training dataset grows.

Intent Recognition Natural Language Understanding +1

Salient Object Ranking with Position-Preserved Attention

1 code implementation ICCV 2021 Hao Fang, Daoxin Zhang, Yi Zhang, Minghao Chen, Jiawei Li, Yao Hu, Deng Cai, Xiaofei He

In this paper, we study the Salient Object Ranking (SOR) task, which manages to assign a ranking order of each detected object according to its visual saliency.

Image Cropping Instance Segmentation +7

Connect-and-Slice: An Hybrid Approach for Reconstructing 3D Objects

no code implementations CVPR 2020 Hao Fang, Florent Lafarge

Converting point clouds generated by Laser scanning, multiview stereo imagery or depth cameras into compact polygon meshes is a challenging problem in vision.

Building A User-Centric and Content-Driven Socialbot

no code implementations6 May 2020 Hao Fang

Additionally, we construct a new knowledge base to power the socialbot by collecting social chat content from a variety of sources.

Management Reading Comprehension +2

A Dynamic Speaker Model for Conversational Interactions

1 code implementation NAACL 2019 Hao Cheng, Hao Fang, Mari Ostendorf

Characterizing these differences can be useful in human-computer interaction, as well as analysis of human-human conversations.

Text Generation

Planar Shape Detection at Structural Scales

no code implementations CVPR 2018 Hao Fang, Florent Lafarge, Mathieu Desbrun

Interpreting 3D data such as point clouds or surface meshes depends heavily on the scale of observation.

A Factored Neural Network Model for Characterizing Online Discussions in Vector Space

1 code implementation EMNLP 2017 Hao Cheng, Hao Fang, Mari Ostendorf

We develop a novel factored neural model that learns comment embeddings in an unsupervised way leveraging the structure of distributional context in online discussion forums.

Feature Engineering

Learning Latent Local Conversation Modes for Predicting Community Endorsement in Online Discussions

no code implementations16 Aug 2016 Hao Fang, Hao Cheng, Mari Ostendorf

Many social media platforms offer a mechanism for readers to react to comments, both positively and negatively, which in aggregate can be thought of as community endorsement.

Bi-directional Attention with Agreement for Dependency Parsing

1 code implementation EMNLP 2016 Hao Cheng, Hao Fang, Xiaodong He, Jianfeng Gao, Li Deng

We develop a novel bi-directional attention model for dependency parsing, which learns to agree on headword predictions from the forward and backward parsing directions.

Dependency Parsing

Talking to the crowd: What do people react to in online discussions?

no code implementations EMNLP 2015 Aaron Jaech, Victoria Zayats, Hao Fang, Mari Ostendorf, Hannaneh Hajishirzi

This paper addresses the question of how language use affects community reaction to comments in online discussion forums, and the relative importance of the message vs. the messenger.

Cannot find the paper you are looking for? You can Submit a new open access paper.