Search Results for author: Ziwei Wang

Found 58 papers, 22 papers with code

Deep Hashing with Active Pairwise Supervision

no code implementations ECCV 2020 Ziwei Wang, Quan Zheng, Jiwen Lu, Jie zhou

n this paper, we propose a Deep Hashing method with Active Pairwise Supervision(DH-APS).

Deep Hashing

Q-VLM: Post-training Quantization for Large Vision-Language Models

1 code implementation10 Oct 2024 Changyuan Wang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu

On the contrary, we mine the cross-layer dependency that significantly influences discretization errors of the entire vision-language model, and embed this dependency into optimal quantization strategy searching with low search cost.

Language Modelling Quantization

PFAttack: Stealthy Attack Bypassing Group Fairness in Federated Learning

no code implementations9 Oct 2024 Jiashi Gao, Ziwei Wang, Xiangyu Zhao, Xin Yao, Xuetao Wei

However, these studies primarily focus on perturbing accuracy, leaving a critical question unexplored: Can an attacker bypass the group fairness mechanisms in FL and manipulate the global model to be biased?

Fairness Federated Learning +1

EmbodiedSAM: Online Segment Any 3D Thing in Real Time

no code implementations21 Aug 2024 Xiuwei Xu, Huangxing Chen, Linqing Zhao, Ziwei Wang, Jie zhou, Jiwen Lu

In this paper, we aim to leverage Segment Anything Model (SAM) for real-time 3D instance segmentation in an online setting.

3D Instance Segmentation Semantic Segmentation

Embodied Instruction Following in Unknown Environments

no code implementations17 Jun 2024 Zhenyu Wu, Ziwei Wang, Xiuwei Xu, Jiwen Lu, Haibin Yan

For the task planner, we generate the feasible step-by-step plans for human goal accomplishment according to the task completion process and the known visual clues.

Instruction Following

ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation

no code implementations3 Jun 2024 Guanxing Lu, Zifeng Gao, Tianxing Chen, Wenxun Dai, Ziwei Wang, Yansong Tang

To model this process, we design a consistency distillation technique to predict the action sample directly instead of predicting the noise within the vision community for fast convergence in the low-dimensional action manifold.

Denoising

ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation

no code implementations13 Mar 2024 Guanxing Lu, Shiyi Zhang, Ziwei Wang, Changliu Liu, Jiwen Lu, Yansong Tang

Performing language-conditioned robotic manipulation tasks in unstructured environments is highly demanded for general intelligent robots.

Memory-based Adapters for Online 3D Scene Perception

no code implementations CVPR 2024 Xiuwei Xu, Chong Xia, Ziwei Wang, Linqing Zhao, Yueqi Duan, Jie zhou, Jiwen Lu

To this end, we propose an adapter-based plug-and-play module for the backbone of 3D scene perception model, which constructs memory to cache and aggregate the extracted RGB-D features to empower offline models with temporal learning ability.

Online Training of Large Language Models: Learn while chatting

no code implementations4 Mar 2024 Juhao Liang, Ziwei Wang, Zhuoheng Ma, Jianquan Li, Zhiyi Zhang, Xiangbo Wu, Benyou Wang

Large Language Models(LLMs) have dramatically revolutionized the field of Natural Language Processing(NLP), offering remarkable capabilities that have garnered widespread usage.

ThinkBot: Embodied Instruction Following with Thought Chain Reasoning

no code implementations12 Dec 2023 Guanxing Lu, Ziwei Wang, Changliu Liu, Jiwen Lu, Yansong Tang

Embodied Instruction Following (EIF) requires agents to complete human instruction by interacting objects in complicated surrounding environments.

Instruction Following

Context Matters: Data-Efficient Augmentation of Large Language Models for Scientific Applications

2 code implementations12 Dec 2023 Xiang Li, Haoran Tang, Siyu Chen, Ziwei Wang, Anurag Maravi, Marcin Abram

In this paper, we explore the challenges inherent to Large Language Models (LLMs) like GPT-4, particularly their propensity for hallucinations, logic mistakes, and incorrect conclusions when tasked with answering complex questions.

Robust Adversarial Attacks Detection for Deep Learning based Relative Pose Estimation for Space Rendezvous

no code implementations10 Nov 2023 Ziwei Wang, Nabil Aouf, Jose Pizarro, Christophe Honvault

The adversarial attack detector is then built based on a Long Short Term Memory (LSTM) network which takes the explainability measure namely SHapley Value from the CNN-based pose estimator and flags the detection of adversarial attacks when acting.

Adversarial Attack Detection Pose Estimation

MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory

1 code implementation NeurIPS 2023 Yinan Liang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu

Due to the high price and heavy energy consumption of GPUs, deploying deep models on IoT devices such as microcontrollers makes significant contributions for ecological AI.

Image Classification

CopyScope: Model-level Copyright Infringement Quantification in the Diffusion Workflow

no code implementations13 Oct 2023 Junlei Zhou, Jiashi Gao, Ziwei Wang, Xuetao Wei

Previous work only focused on data attribution from the training data perspective, which is unsuitable for tracing and quantifying copyright infringement in practice because of the following reasons: (1) the training datasets are not always available in public; (2) the model provider is the responsible party, not the image.

Image Generation

Anyview: Generalizable Indoor 3D Object Detection with Variable Frames

no code implementations9 Oct 2023 Zhenyu Wu, Xiuwei Xu, Ziwei Wang, Chong Xia, Linqing Zhao, Jiwen Lu, Haibin Yan

Existing methods only consider fixed frames of input data for a single detector, such as monocular RGB-D images or point clouds reconstructed from dense multi-view RGB-D images.

3D Object Detection Object +2

Dynamic Hand Gesture-Featured Human Motor Adaptation in Tool Delivery using Voice Recognition

no code implementations20 Sep 2023 Haolin Fei, Stefano Tedeschi, Yanpei Huang, Andrew Kennedy, Ziwei Wang

In response to these challenges, this paper introduces an innovative human-robot collaborative framework that seamlessly integrates hand gesture and dynamic movement recognition, voice recognition, and a switchable control adaptation strategy.

Hand Gesture Recognition Hand-Gesture Recognition

An Asynchronous Linear Filter Architecture for Hybrid Event-Frame Cameras

1 code implementation3 Sep 2023 Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert Mahony

We also demonstrate the integration of image convolution with linear spatial kernels Gaussian, Sobel, and Laplacian as an application of our architecture.

Video Reconstruction

Self-Enforced Job Matching

no code implementations26 Aug 2023 Ce Liu, Ziwei Wang, HanZhe Zhang

The classic two-sided many-to-one job matching model assumes that firms treat workers as substitutes and workers ignore colleagues when choosing where to work.

Asynchronous Blob Tracker for Event Cameras

2 code implementations20 Jul 2023 Ziwei Wang, Timothy Molloy, Pieter van Goor, Robert Mahony

Event-based cameras are popular for tracking fast-moving objects due to their high temporal resolution, low latency, and high dynamic range.

Autonomous Driving Collision Avoidance

Embodied Task Planning with Large Language Models

1 code implementation4 Jul 2023 Zhenyu Wu, Ziwei Wang, Xiuwei Xu, Jiwen Lu, Haibin Yan

Equipping embodied agents with commonsense is important for robots to successfully complete complex human instructions in general environments.

Towards Accurate Post-training Quantization for Diffusion Models

1 code implementation CVPR 2024 Changyuan Wang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu

On the contrary, we design group-wise quantization functions for activation discretization in different timesteps and sample the optimal timestep for informative calibration image generation, so that our quantized diffusion model can reduce the discretization errors with negligible computational overhead.

Data Free Quantization Image Generation

3D Small Object Detection with Dynamic Spatial Pruning

1 code implementation5 May 2023 Xiuwei Xu, Zhihao Sun, Ziwei Wang, Hongmin Liu, Jie zhou, Jiwen Lu

Specifically, we theoretically derive a dynamic spatial pruning (DSP) strategy to prune the redundant spatial representation of 3D scene in a cascade manner according to the distribution of objects.

3D Object Detection Decoder +3

Partner Choice and Morality: Preference Evolution under Stable Matching

no code implementations23 Apr 2023 Ziwei Wang, Jiabin Wu

We present a model that investigates preference evolution with endogenous matching.

Friction

Learning Accurate Performance Predictors for Ultrafast Automated Model Compression

1 code implementation13 Apr 2023 Ziwei Wang, Jiwen Lu, Han Xiao, Shengyu Liu, Jie zhou

On the contrary, we obtain the optimal efficient networks by directly optimizing the compression policy with an accurate performance predictor, where the ultrafast automated model compression for various computational cost constraint is achieved without complex compression policy search and evaluation.

Image Classification Model Compression +3

Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis

no code implementations CVPR 2023 Xiuwei Xu, Ziwei Wang, Jie zhou, Jiwen Lu

In this paper, we propose binary sparse convolutional networks called BSC-Net for efficient point cloud analysis.

Binarization Quantization

Category-level Shape Estimation for Densely Cluttered Objects

no code implementations23 Feb 2023 Zhenyu Wu, Ziwei Wang, Jiwen Lu, Haibin Yan

Then we fuse the feature maps representing the visual information of multi-view RGB images and the pixel affinity learned from the clutter point cloud, where the acquired instance segmentation masks of multi-view RGB images are projected to partition the clutter point cloud.

Instance Segmentation Object +3

Planning Irregular Object Packing via Hierarchical Reinforcement Learning

no code implementations17 Nov 2022 Sichao Huang, Ziwei Wang, Jie zhou, Jiwen Lu

We compare our approach with existing robotic packing methods for irregular objects in a physics simulator.

Hierarchical Reinforcement Learning Object +4

Insight into cloud processes from unsupervised classification with a rotationally invariant autoencoder

1 code implementation2 Nov 2022 Takuya Kurihana, James Franke, Ian Foster, Ziwei Wang, Elisabeth Moyer

Clouds play a critical role in the Earth's energy budget and their potential changes are one of the largest uncertainties in future climate projections.

InvisibiliTee: Angle-agnostic Cloaking from Person-Tracking Systems with a Tee

1 code implementation15 Aug 2022 Yaxian Li, Bingqing Zhang, Guoping Zhao, Mingyu Zhang, Jiajun Liu, Ziwei Wang, JiRong Wen

After a survey for person-tracking system-induced privacy concerns, we propose a black-box adversarial attack method on state-of-the-art human detection models called InvisibiliTee.

Adversarial Attack Human Detection

Shap-CAM: Visual Explanations for Convolutional Neural Networks based on Shapley Value

no code implementations7 Aug 2022 Quan Zheng, Ziwei Wang, Jie zhou, Jiwen Lu

Explaining deep convolutional neural networks has been recently drawing increasing attention since it helps to understand the networks' internal operations and why they make certain decisions.

Decision Making Fairness

Multimodal sensor data fusion for in-situ classification of animal behavior using accelerometry and GNSS data

1 code implementation24 Jun 2022 Reza Arablouei, Ziwei Wang, Greg J. Bishop-Hurley, Jiajun Liu

However, the multimodal animal behavior classification algorithm based on posterior probability fusion is preferable to the one based on feature concatenation as it delivers better classification accuracy, has less computational and memory complexity, is more robust to sensor data failure, and enjoys better modularity.

Classification

Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search

1 code implementation CVPR 2022 Han Xiao, Ziwei Wang, Zheng Zhu, Jie zhou, Jiwen Lu

Differentiable architecture search (DARTS) acquires the optimal architectures by optimizing the architecture parameters with gradient descent, which significantly reduces the search cost.

Neural Architecture Search

A Linear Comb Filter for Event Flicker Removal

1 code implementation17 May 2022 Ziwei Wang, Dingran Yuan, Yonhon Ng, Robert Mahony

Event cameras are bio-inspired sensors that capture per-pixel asynchronous intensity change rather than the synchronous absolute intensity frames captured by a classical camera sensor.

OpenQA: Hybrid QA System Relying on Structured Knowledge Base as well as Non-structured Data

no code implementations31 Dec 2021 Gaochen Wu, Bin Xu, Yuxin Qin, Yang Liu, Lingyu Liu, Ziwei Wang

Search engines based on keyword retrieval can no longer adapt to the way of information acquisition in the era of intelligent Internet of Things due to the return of keyword related Internet pages.

Answer Selection Machine Reading Comprehension +3

Stereo Hybrid Event-Frame (SHEF) Cameras for 3D Perception

1 code implementation11 Oct 2021 Ziwei Wang, Liyuan Pan, Yonhon Ng, Zheyu Zhuang, Robert Mahony

We provide a SHEF dataset targeted at evaluating disparity estimation algorithms and introduce a stereo disparity estimation algorithm that uses edge information extracted from the event stream correlated with the edge detected in the frame data.

Disparity Estimation Stereo Depth Estimation +1

Generalizable Mixed-Precision Quantization via Attribution Rank Preservation

1 code implementation ICCV 2021 Ziwei Wang, Han Xiao, Jiwen Lu, Jie zhou

On the contrary, our GMPQ searches the mixed-quantization policy that can be generalized to largescale datasets with only a small amount of data, so that the search cost is significantly reduced without performance degradation.

Quantization

Multi-objective Evolutionary Approach for Efficient Kernel Size and Shape for CNN

no code implementations28 Jun 2021 Ziwei Wang, Martin A. Trefzer, Simon J. Bale, Andy M. Tyrrell

Therefore, this paper considers optimising the computational resource consumption by reducing the size and number of kernels in convolutional layers.

Evolutionary Algorithms

Enhanced Modality Transition for Image Captioning

no code implementations23 Feb 2021 Ziwei Wang, Yadan Luo, Zi Huang

In this work, we explicitly build a Modality Transition Module (MTM) to transfer visual features into semantic representations before forwarding them to the language model.

Decoder Image Captioning +3

Reanalyses and a high-resolution model fail to capture the `high tail' of CAPE distributions

no code implementations24 Dec 2020 Ziwei Wang, James A. Franke, Zhenqi Luo, Elisabeth J. Moyer

Both reanalyses and model consistently show too-narrow distributions of CAPE, with the high tail ($>$ 95th percentile) systematically biased low by up to 10% in surface-based CAPE and 20% at the most unstable layer.

Atmospheric and Oceanic Physics

Event Camera Calibration of Per-pixel Biased Contrast Threshold

1 code implementation17 Dec 2020 Ziwei Wang, Yonhon Ng, Pieter van Goor, Robert Mahony

Currently, most of the existing works use a single contrast threshold to estimate the intensity change of all pixels.

Camera Calibration

An Asynchronous Kalman Filter for Hybrid Event Cameras

1 code implementation ICCV 2021 Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert Mahony

Conversely, conventional image sensors measure absolute intensity of slowly changing scenes effectively but do poorly on high dynamic range or quickly changing scenes.

Event-Based Video Reconstruction Video Reconstruction

ORD: Object Relationship Discovery for Visual Dialogue Generation

no code implementations15 Jun 2020 Ziwei Wang, Zi Huang, Yadan Luo, Huimin Lu

With the rapid advancement of image captioning and visual question answering at single-round level, the question of how to generate multi-round dialogue about visual content has not yet been well explored. Existing visual dialogue methods encode the image into a fixed feature vector directly, concatenated with the question and history embeddings to predict the response. Some recent methods tackle the co-reference resolution problem using co-attention mechanism to cross-refer relevant elements from the image, history, and the target question. However, it remains challenging to reason visual relationships, since the fine-grained object-level information is omitted before co-attentive reasoning.

Dialogue Generation Graph Attention +5

BiDet: An Efficient Binarized Object Detector

2 code implementations CVPR 2020 Ziwei Wang, Ziyi Wu, Jiwen Lu, Jie zhou

Conventional network binarization methods directly quantize the weights and activations in one-stage or two-stage detectors with constrained representational capacity, so that the information redundancy in the networks causes numerous false positives and degrades the performance significantly.

Binarization Object +2

Learning from the Past: Continual Meta-Learning via Bayesian Graph Modeling

no code implementations12 Nov 2019 Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Mahsa Baktashmotlagh, Yang Yang

Meta-learning for few-shot learning allows a machine to leverage previously acquired knowledge as a prior, thus improving the performance on novel tasks with only small amounts of data.

Continual Learning Few-Shot Learning

Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation

no code implementations1 Aug 2019 Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Jingjing Li, Yang Yang

Visual paragraph generation aims to automatically describe a given image from different perspectives and organize sentences in a coherent way.

Imitation Learning reinforcement-learning +2

Learning Channel-Wise Interactions for Binary Convolutional Neural Networks

no code implementations CVPR 2019 Ziwei Wang, Jiwen Lu, Chenxin Tao, Jie Zhou, Qi Tian

In this paper, we propose a channel-wise interaction based binary convolutional neural network learning method (CI-BCNN) for efficient inference.

Quantization Reinforcement Learning

Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval

no code implementations5 Apr 2019 Yadan Luo, Ziwei Wang, Zi Huang, Yang Yang, Huimin Lu

With the increasing number of online stores, there is a pressing need for intelligent search systems to understand the item photos snapped by customers and search against large-scale product databases to find their desired items.

Attribute Image Retrieval +1

Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration

3 code implementations CVPR 2019 Yang He, Ping Liu, Ziwei Wang, Zhilan Hu, Yi Yang

In this paper, we analyze this norm-based criterion and point out that its effectiveness depends on two requirements that are not always met: (1) the norm deviation of the filters should be large; (2) the minimum norm of the filters should be small.

Image Classification

Look Deeper See Richer: Depth-aware Image Paragraph Captioning

no code implementations ACM International Conference on Multimedia 2018 Ziwei Wang, Yadan Luo, Yang Li, Zi Huang, Hongzhi Yin

Existing image paragraph captioning methods give a series of sentences to represent the objects and regions of interests, where the descriptions are essentially generated by feeding the image fragments containing objects and regions into conventional image single-sentence captioning models.

Decoder Image Captioning +2

Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning

no code implementations22 Aug 2018 Yadan Luo, Ziwei Wang, Zi Huang, Yang Yang, Cong Zhao

Rich high-quality annotated data is critical for semantic segmentation learning, yet acquiring dense and pixel-wise ground-truth is both labor- and time-consuming.

Segmentation Semantic Segmentation +1

GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning

no code implementations CVPR 2018 Yueqi Duan, Ziwei Wang, Jiwen Lu, Xudong Lin, Jie zhou

Specifically, we design a deep reinforcement learning model to learn the structure of the graph for bitwise interaction mining, reducing the uncertainty of binary codes by maximizing the mutual information with inputs and related bits, so that the ambiguous bits receive additional instruction from the graph for confident binarization.

Binarization reinforcement-learning +3

Learning Deep Binary Descriptor With Multi-Quantization

no code implementations CVPR 2017 Yueqi Duan, Jiwen Lu, Ziwei Wang, Jianjiang Feng, Jie zhou

In this paper, we propose an unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual matching.

Binarization Image Retrieval +2

Cannot find the paper you are looking for? You can Submit a new open access paper.