Search Results for author: Xing Di

Found 18 papers, 5 papers with code

ProS: Facial Omni-Representation Learning via Prototype-based Self-Distillation

no code implementations3 Nov 2023 Xing Di, Yiyu Zheng, Xiaoming Liu, Yu Cheng

This paper presents a novel approach, called Prototype-based Self-Distillation (ProS), for unsupervised face representation learning.

Attribute Representation Learning

Hypotheses Tree Building for One-Shot Temporal Sentence Localization

no code implementations5 Jan 2023 Daizong Liu, Xiang Fang, Pan Zhou, Xing Di, Weining Lu, Yu Cheng

Given an untrimmed video, temporal sentence localization (TSL) aims to localize a specific segment according to a given sentence query.

Sentence

Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding

no code implementations2 Jan 2023 Jiahao Zhu, Daizong Liu, Pan Zhou, Xing Di, Yu Cheng, Song Yang, Wenzheng Xu, Zichuan Xu, Yao Wan, Lichao Sun, Zeyu Xiong

All existing works first utilize a sparse sampling strategy to extract a fixed number of video frames and then conduct multi-modal interactions with query sentence for reasoning.

Sentence Temporal Sentence Grounding

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs

1 code implementation1 Jan 2023 Huaizheng Zhang, Yuanming Li, Wencong Xiao, Yizheng Huang, Xing Di, Jianxiong Yin, Simon See, Yong Luo, Chiew Tong Lau, Yang You

The vision of this paper is to provide a more comprehensive and practical benchmark study for MIG in order to eliminate the need for tedious manual benchmarking and tuning efforts.

Benchmarking

Backdoor Attacks on Crowd Counting

1 code implementation12 Jul 2022 Yuhua Sun, Tailai Zhang, Xingjun Ma, Pan Zhou, Jian Lou, Zichuan Xu, Xing Di, Yu Cheng, Lichao

In this paper, we propose two novel Density Manipulation Backdoor Attacks (DMBA$^{-}$ and DMBA$^{+}$) to attack the model to produce arbitrarily large or small density estimations.

Backdoor Attack Crowd Counting +3

Unsupervised Temporal Video Grounding with Deep Semantic Clustering

no code implementations14 Jan 2022 Daizong Liu, Xiaoye Qu, Yinzhen Wang, Xing Di, Kai Zou, Yu Cheng, Zichuan Xu, Pan Zhou

Temporal video grounding (TVG) aims to localize a target segment in a video according to a given sentence query.

Clustering Sentence +1

Memory-Guided Semantic Learning Network for Temporal Sentence Grounding

no code implementations3 Jan 2022 Daizong Liu, Xiaoye Qu, Xing Di, Yu Cheng, Zichuan Xu, Pan Zhou

To tackle this issue, we propose a memory-augmented network, called Memory-Guided Semantic Learning Network (MGSL-Net), that learns and memorizes the rarely appeared content in TSG tasks.

Sentence Temporal Sentence Grounding

Heterogeneous Face Frontalization via Domain Agnostic Learning

no code implementations17 Jul 2021 Xing Di, Shuowen Hu, Vishal M. Patel

We propose a domain agnostic learning-based generative adversarial network (DAL-GAN) which can synthesize frontal views in the visible domain from thermal faces with pose variations.

Face Generation Generative Adversarial Network

Multimodal Face Synthesis from Visual Attributes

1 code implementation9 Apr 2021 Xing Di, Vishal M. Patel

Extensive experiments and comparisons with several state-of-the-art methods are performed to verify the effectiveness of the proposed attribute-based multimodal synthesis method.

Attribute Face Generation +1

A Large-Scale, Time-Synchronized Visible and Thermal Face Dataset

no code implementations7 Jan 2021 Domenick Poster, Matthew Thielke, Robert Nguyen, Srinivasan Rajaraman, Xing Di, Cedric Nimpa Fondje, Vishal M. Patel, Nathaniel J. Short, Benjamin S. Riggan, Nasser M. Nasrabadi, Shuowen Hu

Thermal face imagery, which captures the naturally emitted heat from the face, is limited in availability compared to face imagery in the visible spectrum.

Face Verification

Multi-Scale Thermal to Visible Face Verification via Attribute Guided Synthesis

no code implementations20 Apr 2020 Xing Di, Benjamin S. Riggan, Shuowen Hu, Nathaniel J. Short, Vishal M. Patel

Finally, a pre-trained VGG-Face network is leveraged to extract features from the synthesized image and the input visible image for verification.

Attribute Face Verification

Facial Synthesis from Visual Attributes via Sketch using Multi-Scale Generators

no code implementations17 Dec 2019 Xing Di, Vishal M. Patel

In this paper, we take a different approach, where we formulate the original problem as a stage-wise learning problem.

Attribute Face Generation

Face Synthesis from Visual Attributes via Sketch using Conditional VAEs and GANs

1 code implementation30 Dec 2017 Xing Di, Vishal M. Patel

In this paper, we take a different approach, where we formulate the original problem as a stage-wise learning problem.

Attribute Face Generation

GP-GAN: Gender Preserving GAN for Synthesizing Faces from Landmarks

2 code implementations3 Oct 2017 Xing Di, Vishwanath A. Sindagi, Vishal M. Patel

The primary aim of this work is to demonstrate that information preserved by landmarks (gender in particular) can be further accentuated by leveraging generative models to synthesize corresponding faces.

Face Generation Generative Adversarial Network

Cannot find the paper you are looking for? You can Submit a new open access paper.