Search Results for author: Zhang Zhang

Found 38 papers, 22 papers with code

Debiasing Large Visual Language Models

1 code implementation8 Mar 2024 Yi-Fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

In the realms of computer vision and natural language processing, Large Vision-Language Models (LVLMs) have become indispensable tools, proficient in generating textual descriptions based on visual inputs.

Fairness Question Answering

Variational Continual Test-Time Adaptation

no code implementations13 Feb 2024 Fan Lyu, Kaile Du, Yuyang Li, Hanyu Zhao, Zhang Zhang, Guangcan Liu, Liang Wang

At the source stage, we transform a pre-trained deterministic model into a Bayesian Neural Network (BNN) via a variational warm-up strategy, injecting uncertainties into the model.

Test-time Adaptation Variational Inference

Assaying on the Robustness of Zero-Shot Machine-Generated Text Detectors

1 code implementation20 Dec 2023 Yi-Fan Zhang, Zhang Zhang, Liang Wang, Tieniu Tan, Rong Jin

In an effort to address these issues, we delve into the realm of zero-shot machine-generated text detection.

Binary Classification Text Detection +1

Model-free Test Time Adaptation for Out-Of-Distribution Detection

no code implementations28 Nov 2023 Yifan Zhang, Xue Wang, Tian Zhou, Kun Yuan, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

We demonstrate the effectiveness of \abbr through comprehensive experiments on multiple OOD detection benchmarks, extensive empirical studies show that \abbr significantly improves the performance of OOD detection over state-of-the-art methods.

Decision Making Out-of-Distribution Detection +2

Neural Network Pruning by Gradient Descent

1 code implementation21 Nov 2023 Zhang Zhang, Ruyi Tao, Jiang Zhang

The rapid increase in the parameters of deep learning models has led to significant costs, challenging computational efficiency and model interpretability.

Computational Efficiency Feature Importance +2

OneNet: Enhancing Time Series Forecasting Models under Concept Drift by Online Ensembling

1 code implementation NeurIPS 2023 Yi-Fan Zhang, Qingsong Wen, Xue Wang, Weiqi Chen, Liang Sun, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

Online updating of time series forecasting models aims to address the concept drifting problem by efficiently updating forecasting models based on streaming data.

Time Series Time Series Forecasting

Multi-Semantic Fusion Model for Generalized Zero-Shot Skeleton-Based Action Recognition

1 code implementation18 Sep 2023 Ming-Zhe Li, Zhen Jia, Zhang Zhang, Zhanyu Ma, Liang Wang

In order to solve this dilemma, we propose a multi-semantic fusion (MSF) model for improving the performance of GZSSAR, where two kinds of class-level textual descriptions (i. e., action descriptions and motion descriptions), are collected as auxiliary semantic information to enhance the learning efficacy of generalizable skeleton features.

Action Recognition Generalized Zero Shot skeletal action recognition +1

Illumination Distillation Framework for Nighttime Person Re-Identification and A New Benchmark

1 code implementation31 Aug 2023 Andong Lu, Zhang Zhang, Yan Huang, Yifan Zhang, Chenglong Li, Jin Tang, Liang Wang

The illumination enhancement branch first estimates an enhanced image from the nighttime image using a nonlinear curve mapping method and then extracts the enhanced features.

Person Re-Identification

Adaptive Fusion of Radiomics and Deep Features for Lung Adenocarcinoma Subtype Recognition

no code implementations27 Aug 2023 Jing Zhou, Xiaotong Fu, Xirong Li, Wei Feng, Zhang Zhang, Ying Ji

The most common type of lung cancer, lung adenocarcinoma (LUAD), has been increasingly detected since the advent of low-dose computed tomography screening technology.

AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation

1 code implementation25 Apr 2023 Yi-Fan Zhang, Xue Wang, Kexin Jin, Kun Yuan, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

In particular, when the adaptation target is a series of domains, the adaptation accuracy of AdaNPC is 50% higher than advanced TTA methods.

Domain Generalization Test-time Adaptation

Semantic Prompt for Few-Shot Image Recognition

1 code implementation CVPR 2023 Wentao Chen, Chenyang Si, Zhang Zhang, Liang Wang, Zilei Wang, Tieniu Tan

Instead of the naive exploitation of semantic information for remedying classifiers, we explore leveraging semantic information as prompts to tune the visual feature extraction network adaptively.

Few-Shot Learning

Human Image Generation: A Comprehensive Survey

no code implementations17 Dec 2022 Zhen Jia, Zhang Zhang, Liang Wang, Tieniu Tan

Image and video synthesis has become a blooming topic in computer vision and machine learning communities along with the developments of deep generative models, due to its great academic and application value.

Data Augmentation Image Generation +2

Domain-Specific Risk Minimization for Out-of-Distribution Generalization

1 code implementation18 Aug 2022 Yi-Fan Zhang, Jindong Wang, Jian Liang, Zhang Zhang, Baosheng Yu, Liang Wang, DaCheng Tao, Xing Xie

Our bound motivates two strategies to reduce the gap: the first one is ensembling multiple classifiers to enrich the hypothesis space, then we propose effective gap estimation methods for guiding the selection of a better hypothesis for the target.

Domain Generalization Out-of-Distribution Generalization

Cross-Domain Cross-Set Few-Shot Learning via Learning Compact and Aligned Representations

1 code implementation16 Jul 2022 Wentao Chen, Zhang Zhang, Wei Wang, Liang Wang, Zilei Wang, Tieniu Tan

Different from previous cross-domain FSL work (CD-FSL) that considers the domain shift between base and novel classes, the new problem, termed cross-domain cross-set FSL (CDSC-FSL), requires few-shot learners not only to adapt to the new domain, but also to be consistent between different domains within each novel class.

Few-Shot Learning

Completing Networks by Learning Local Connection Patterns

1 code implementation25 Apr 2022 Zhang Zhang, Ruyi Tao, Yongzai Tao, Mingze Qi, Jiang Zhang

And experiments show that our model perform better on a network with higher Reachable CC.

Link Prediction

Delving into Sample Loss Curve to Embrace Noisy and Imbalanced Data

1 code implementation30 Dec 2021 Shenwang Jiang, Jianan Li, Ying Wang, Bo Huang, Zhang Zhang, Tingfa Xu

In practice, however, biased samples with corrupted labels and of tailed classes commonly co-exist in training data.

Attribute Meta-Learning

Intention Recognition for Multiple Agents

no code implementations5 Dec 2021 Zhang Zhang, Yifeng Zeng, Yinghui Pan

Then, we transform the intention recognition into an un-supervised learning problem and adapt a clustering algorithm to group intentions of multiple agents through comparing their behavioural models.

Clustering Descriptive +1

Generalizable Person Re-identification Without Demographics

no code implementations29 Sep 2021 Yifan Zhang, Feng Li, Zhang Zhang, Liang Wang, DaCheng Tao, Tieniu Tan

However, the convex condition of KL DRO may not hold for overparameterized neural networks, such that applying KL DRO often fails to generalize under distribution shifts in real scenarios.

Generalizable Person Re-identification

Focal and Efficient IOU Loss for Accurate Bounding Box Regression

no code implementations20 Jan 2021 Yi-Fan Zhang, Weiqiang Ren, Zhang Zhang, Zhen Jia, Liang Wang, Tieniu Tan

(ii) Most of the loss functions ignore the imbalance problem in BBR that the large number of anchor boxes which have small overlaps with the target boxes contribute most to the optimization of BBR.

object-detection Object Detection +2

Stronger, Faster and More Explainable: A Graph Convolutional Baseline for Skeleton-based Action Recognition

1 code implementation20 Oct 2020 Yi-Fan Song, Zhang Zhang, Caifeng Shan, Liang Wang

However, the complexity of the State-Of-The-Art (SOTA) models of this task tends to be exceedingly sophisticated and over-parameterized, where the low efficiency in model training and inference has obstructed the development in the field, especially for large-scale action datasets.

Action Recognition Skeleton Based Action Recognition

Richly Activated Graph Convolutional Network for Robust Skeleton-based Action Recognition

3 code implementations9 Aug 2020 Yi-Fan Song, Zhang Zhang, Caifeng Shan, Liang Wang

More crucially, on the synthetic occlusion and jittering datasets, the performance deterioration due to the occluded and disturbed joints can be significantly alleviated by utilizing the proposed RA-GCN.

Action Recognition Skeleton Based Action Recognition +1

Deep Fusion Feature Representation Learning with Hard Mining Center-Triplet Loss for Person Re-identification

1 code implementation IEEE Transactions on Multimedia 2020 Cairong Zhao, Xinbi Lv, Zhang Zhang, WangMeng Zuo, Jun Wu, Duoqian Miao

The extraction of robust feature representations from pedestrian images through CNNs with a single deterministic pooling operation is problematic as the features in real pedestrian images are complex and diverse.

Person Re-Identification Representation Learning

Slow Feature Analysis for Human Action Recognition

no code implementations15 Jul 2019 Zhang Zhang, DaCheng Tao

In this paper, we introduce the SFA framework to the problem of human action recognition by incorporating the discriminative information with SFA learning and considering the spatial relationship of body parts.

Action Recognition Temporal Action Localization

Richly Activated Graph Convolutional Network for Action Recognition with Incomplete Skeletons

3 code implementations16 May 2019 Yi-Fan Song, Zhang Zhang, Liang Wang

To enhance the robustness of action recognition models to incomplete skeletons, we propose a multi-stream graph convolutional network (GCN) for exploring sufficient discriminative features distributed over all skeleton joints.

Action Recognition Skeleton Based Action Recognition +1

Large-Scale Pedestrian Retrieval Competition

no code implementations6 Mar 2019 Da Li, Zhang Zhang

The Large-Scale Pedestrian Retrieval Competition (LSPRC) mainly focuses on person retrieval which is an important end application in intelligent vision system of surveillance.

Attribute Pedestrian Detection +2

A General Deep Learning Framework for Network Reconstruction and Dynamics Learning

1 code implementation30 Dec 2018 Zhang Zhang, Yi Zhao, Jing Liu, Shuo Wang, Ruyi Tao, Ruyue Xin, Jiang Zhang

We exhibit the universality of our framework on different kinds of time-series data: with the same structure, our model can be trained to accurately recover the network structure and predict future states on continuous, discrete, and binary dynamics, and outperforms competing network reconstruction methods.

Time Series Time Series Analysis

The Cinderella Complex: Word Embeddings Reveal ender Stereotypes in Movies and Books

1 code implementation12 Nov 2018 Huimin Xu, Zhang Zhang, Lingfei Wu, Cheng-Jun Wang

Our analysis of thousands of movies and books reveals how these cultural products weave stereotypical gender roles into morality tales and perpetuate gender inequality through storytelling.

Word Embeddings

Learning Deep Context-aware Features over Body and Latent Parts for Person Re-identification

no code implementations CVPR 2017 Dangwei Li, Xiaotang Chen, Zhang Zhang, Kaiqi Huang

It is a challenging task due to the large variations in person pose, occlusion, background clutter, etc How to extract powerful features is a fundamental problem in ReID and is still an open problem today.

Person Identification Person Re-Identification +1

A Large-scale Distributed Video Parsing and Evaluation Platform

no code implementations29 Nov 2016 Kai Yu, Yang Zhou, Da Li, Zhang Zhang, Kaiqi Huang

Visual surveillance systems have become one of the largest data sources of Big Visual Data in real world.

Weakly-supervised Learning of Mid-level Features for Pedestrian Attribute Recognition and Localization

no code implementations17 Nov 2016 Kai Yu, Biao Leng, Zhang Zhang, Dangwei Li, Kaiqi Huang

Based on GoogLeNet, firstly, a set of mid-level attribute features are discovered by novelly designed detection layers, where a max-pooling based weakly-supervised object detection technique is used to train these layers with only image-level labels without the need of bounding box annotations of pedestrian attributes.

Attribute Clustering +5

ReD-SFA: Relation Discovery Based Slow Feature Analysis for Trajectory Clustering

no code implementations CVPR 2016 Zhang Zhang, Kaiqi Huang, Tieniu Tan, Peipei Yang, Jun Li

For spectral embedding/clustering, it is still an open problem on how to construct an relation graph to reflect the intrinsic structures in data.

Clustering graph construction +5

A Richly Annotated Dataset for Pedestrian Attribute Recognition

2 code implementations23 Mar 2016 Dangwei Li, Zhang Zhang, Xiaotang Chen, Haibin Ling, Kaiqi Huang

RAP has in total 41, 585 pedestrian samples, each of which is annotated with 72 attributes as well as viewpoints, occlusions, body parts information.

Attribute Pedestrian Attribute Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.