Search Results for author: Ziwei Wang

Found 50 papers, 19 papers with code

Deep Hashing with Active Pairwise Supervision

no code implementations • ECCV 2020 • Ziwei Wang, Quan Zheng, Jiwen Lu, Jie zhou

n this paper, we propose a Deep Hashing method with Active Pairwise Supervision(DH-APS).

Paper
Add Code

ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation

no code implementations • 13 Mar 2024 • Guanxing Lu, Shiyi Zhang, Ziwei Wang, Changliu Liu, Jiwen Lu, Yansong Tang

Then, we build a Gaussian world model to parameterize the distribution in our dynamic Gaussian Splatting framework, which provides informative supervision in the interactive environment via future scene reconstruction.

Paper
Add Code

Memory-based Adapters for Online 3D Scene Perception

no code implementations • 11 Mar 2024 • Xiuwei Xu, Chong Xia, Ziwei Wang, Linqing Zhao, Yueqi Duan, Jie zhou, Jiwen Lu

To this end, we propose an adapter-based plug-and-play module for the backbone of 3D scene perception model, which constructs memory to cache and aggregate the extracted RGB-D features to empower offline models with temporal learning ability.

Paper
Add Code

Online Training of Large Language Models: Learn while chatting

no code implementations • 4 Mar 2024 • Juhao Liang, Ziwei Wang, Zhuoheng Ma, Jianquan Li, Zhiyi Zhang, Xiangbo Wu, Benyou Wang

Large Language Models(LLMs) have dramatically revolutionized the field of Natural Language Processing(NLP), offering remarkable capabilities that have garnered widespread usage.

Paper
Add Code

ThinkBot: Embodied Instruction Following with Thought Chain Reasoning

no code implementations • 12 Dec 2023 • Guanxing Lu, Ziwei Wang, Changliu Liu, Jiwen Lu, Yansong Tang

Embodied Instruction Following (EIF) requires agents to complete human instruction by interacting objects in complicated surrounding environments.

Instruction Following

Paper
Add Code

Context Matters: Data-Efficient Augmentation of Large Language Models for Scientific Applications

1 code implementation • 12 Dec 2023 • Xiang Li, Haoran Tang, Siyu Chen, Ziwei Wang, Anurag Maravi, Marcin Abram

In this paper, we explore the challenges inherent to Large Language Models (LLMs) like GPT-4, particularly their propensity for hallucinations, logic mistakes, and incorrect conclusions when tasked with answering complex questions.

Paper
Code

Robust Adversarial Attacks Detection for Deep Learning based Relative Pose Estimation for Space Rendezvous

no code implementations • 10 Nov 2023 • Ziwei Wang, Nabil Aouf, Jose Pizarro, Christophe Honvault

The adversarial attack detector is then built based on a Long Short Term Memory (LSTM) network which takes the explainability measure namely SHapley Value from the CNN-based pose estimator and flags the detection of adversarial attacks when acting.

Adversarial Attack Detection Pose Estimation

Paper
Add Code

MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory

1 code implementation • NeurIPS 2023 • Yinan Liang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu

Due to the high price and heavy energy consumption of GPUs, deploying deep models on IoT devices such as microcontrollers makes significant contributions for ecological AI.

Image Classification

Paper
Code

CopyScope: Model-level Copyright Infringement Quantification in the Diffusion Workflow

no code implementations • 13 Oct 2023 • Junlei Zhou, Jiashi Gao, Ziwei Wang, Xuetao Wei

Previous work only focused on data attribution from the training data perspective, which is unsuitable for tracing and quantifying copyright infringement in practice because of the following reasons: (1) the training datasets are not always available in public; (2) the model provider is the responsible party, not the image.

Image Generation

Paper
Add Code

Anyview: Generalizable Indoor 3D Object Detection with Variable Frames

no code implementations • 9 Oct 2023 • Zhenyu Wu, Xiuwei Xu, Ziwei Wang, Chong Xia, Linqing Zhao, Jiwen Lu, Haibin Yan

Existing methods only consider fixed frames of input data for a single detector, such as monocular RGB-D images or point clouds reconstructed from dense multi-view RGB-D images.

3D Object Detection Object +2

Paper
Add Code

Real-time Multi-modal Object Detection and Tracking on Edge for Regulatory Compliance Monitoring

no code implementations • 5 Oct 2023 • Jia Syuen Lim, Ziwei Wang, Jiajun Liu, Abdelwahed Khamis, Reza Arablouei, Robert Barlow, Ryan McAllister

Regulatory compliance auditing across diverse industrial domains requires heightened quality assurance and traceability.

object-detection Object Detection +1

Paper
Add Code

Dynamic Hand Gesture-Featured Human Motor Adaptation in Tool Delivery using Voice Recognition

no code implementations • 20 Sep 2023 • Haolin Fei, Stefano Tedeschi, Yanpei Huang, Andrew Kennedy, Ziwei Wang

In response to these challenges, this paper introduces an innovative human-robot collaborative framework that seamlessly integrates hand gesture and dynamic movement recognition, voice recognition, and a switchable control adaptation strategy.

Hand Gesture Recognition Hand-Gesture Recognition

Paper
Add Code

An Asynchronous Linear Filter Architecture for Hybrid Event-Frame Cameras

1 code implementation • 3 Sep 2023 • Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert Mahony

We also demonstrate the integration of image convolution with linear spatial kernels Gaussian, Sobel, and Laplacian as an application of our architecture.

Video Reconstruction

Paper
Code

Self-Enforced Job Matching

no code implementations • 26 Aug 2023 • Ce Liu, Ziwei Wang, HanZhe Zhang

The classic two-sided many-to-one job matching model assumes that firms treat workers as substitutes and workers ignore colleagues when choosing where to work.

Paper
Add Code

Event Blob Tracking: An Asynchronous Real-Time Algorithm

1 code implementation • 20 Jul 2023 • Ziwei Wang, Timothy Molloy, Pieter van Goor, Robert Mahony

Event-based cameras have become increasingly popular for tracking fast-moving objects due to their high temporal resolution, low latency, and high dynamic range.

Autonomous Driving Collision Avoidance

Paper
Code

Embodied Task Planning with Large Language Models

1 code implementation • 4 Jul 2023 • Zhenyu Wu, Ziwei Wang, Xiuwei Xu, Jiwen Lu, Haibin Yan

Equipping embodied agents with commonsense is important for robots to successfully complete complex human instructions in general environments.

112

Paper
Code

Towards Accurate Data-free Quantization for Diffusion Models

no code implementations • 30 May 2023 • Changyuan Wang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu

On the contrary, we design group-wise quantization functions for activation discretization in different timesteps and sample the optimal timestep for informative calibration image generation, so that our quantized diffusion model can reduce the discretization errors with negligible computational overhead.

Data Free Quantization Image Generation

Paper
Add Code

3D Small Object Detection with Dynamic Spatial Pruning

1 code implementation • 5 May 2023 • Xiuwei Xu, Zhihao Sun, Ziwei Wang, Hongmin Liu, Jie zhou, Jiwen Lu

Specifically, we theoretically derive a dynamic spatial pruning (DSP) strategy to prune the redundant spatial representation of 3D scene in a cascade manner according to the distribution of objects.

3D Object Detection Object +2

Paper
Code

Partner Choice and Morality: Preference Evolution under Stable Matching

no code implementations • 23 Apr 2023 • Ziwei Wang, Jiabin Wu

We present a model that investigates preference evolution with endogenous matching.

Friction

Paper
Add Code

Learning Accurate Performance Predictors for Ultrafast Automated Model Compression

1 code implementation • 13 Apr 2023 • Ziwei Wang, Jiwen Lu, Han Xiao, Shengyu Liu, Jie zhou

On the contrary, we obtain the optimal efficient networks by directly optimizing the compression policy with an accurate performance predictor, where the ultrafast automated model compression for various computational cost constraint is achieved without complex compression policy search and evaluation.

Image Classification Model Compression +3

Paper
Code

Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis

no code implementations • CVPR 2023 • Xiuwei Xu, Ziwei Wang, Jie zhou, Jiwen Lu

In this paper, we propose binary sparse convolutional networks called BSC-Net for efficient point cloud analysis.

Binarization Quantization

Paper
Add Code

Category-level Shape Estimation for Densely Cluttered Objects

no code implementations • 23 Feb 2023 • Zhenyu Wu, Ziwei Wang, Jiwen Lu, Haibin Yan

Then we fuse the feature maps representing the visual information of multi-view RGB images and the pixel affinity learned from the clutter point cloud, where the acquired instance segmentation masks of multi-view RGB images are projected to partition the clutter point cloud.

Instance Segmentation Object +3

Paper
Add Code

Planning Irregular Object Packing via Hierarchical Reinforcement Learning

no code implementations • 17 Nov 2022 • Sichao Huang, Ziwei Wang, Jie zhou, Jiwen Lu

We compare our approach with existing robotic packing methods for irregular objects in a physics simulator.

Hierarchical Reinforcement Learning Object +3

Paper
Add Code

Insight into cloud processes from unsupervised classification with a rotationally invariant autoencoder

1 code implementation • 2 Nov 2022 • Takuya Kurihana, James Franke, Ian Foster, Ziwei Wang, Elisabeth Moyer

Clouds play a critical role in the Earth's energy budget and their potential changes are one of the largest uncertainties in future climate projections.

Paper
Code

Point-Syn2Real: Semi-Supervised Synthetic-to-Real Cross-Domain Learning for Object Classification in 3D Point Clouds

no code implementations • 31 Oct 2022 • Ziwei Wang, Reza Arablouei, Jiajun Liu, Paulo Borges, Greg Bishop-hurley, Nicholas Heaney

Object classification using LiDAR 3D point cloud data is critical for modern applications such as autonomous driving.

Autonomous Driving domain classification +1

Paper
Add Code

InvisibiliTee: Angle-agnostic Cloaking from Person-Tracking Systems with a Tee

1 code implementation • 15 Aug 2022 • Yaxian Li, Bingqing Zhang, Guoping Zhao, Mingyu Zhang, Jiajun Liu, Ziwei Wang, JiRong Wen

After a survey for person-tracking system-induced privacy concerns, we propose a black-box adversarial attack method on state-of-the-art human detection models called InvisibiliTee.

Adversarial Attack Human Detection

Paper
Code

Shap-CAM: Visual Explanations for Convolutional Neural Networks based on Shapley Value

no code implementations • 7 Aug 2022 • Quan Zheng, Ziwei Wang, Jie zhou, Jiwen Lu

Explaining deep convolutional neural networks has been recently drawing increasing attention since it helps to understand the networks' internal operations and why they make certain decisions.

Decision Making Fairness

Paper
Add Code

Multimodal sensor data fusion for in-situ classification of animal behavior using accelerometry and GNSS data

1 code implementation • 24 Jun 2022 • Reza Arablouei, Ziwei Wang, Greg J. Bishop-Hurley, Jiajun Liu

However, the multimodal animal behavior classification algorithm based on posterior probability fusion is preferable to the one based on feature concatenation as it delivers better classification accuracy, has less computational and memory complexity, is more robust to sensor data failure, and enjoys better modularity.

Classification

Paper
Code

Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search

1 code implementation • CVPR 2022 • Han Xiao, Ziwei Wang, Zheng Zhu, Jie zhou, Jiwen Lu

Differentiable architecture search (DARTS) acquires the optimal architectures by optimizing the architecture parameters with gradient descent, which significantly reduces the search cost.

Ranked #1 on Neural Architecture Search on NAS-Bench-201, CIFAR-100

Neural Architecture Search

Paper
Code

A Linear Comb Filter for Event Flicker Removal

1 code implementation • 17 May 2022 • Ziwei Wang, Dingran Yuan, Yonhon Ng, Robert Mahony

Event cameras are bio-inspired sensors that capture per-pixel asynchronous intensity change rather than the synchronous absolute intensity frames captured by a classical camera sensor.

Paper
Code

OpenQA: Hybrid QA System Relying on Structured Knowledge Base as well as Non-structured Data

no code implementations • 31 Dec 2021 • Gaochen Wu, Bin Xu, Yuxin Qin, Yang Liu, Lingyu Liu, Ziwei Wang

Search engines based on keyword retrieval can no longer adapt to the way of information acquisition in the era of intelligent Internet of Things due to the return of keyword related Internet pages.

Answer Selection Machine Reading Comprehension +3

Paper
Add Code

Stereo Hybrid Event-Frame (SHEF) Cameras for 3D Perception

1 code implementation • 11 Oct 2021 • Ziwei Wang, Liyuan Pan, Yonhon Ng, Zheyu Zhuang, Robert Mahony

We provide a SHEF dataset targeted at evaluating disparity estimation algorithms and introduce a stereo disparity estimation algorithm that uses edge information extracted from the event stream correlated with the edge detected in the frame data.

Disparity Estimation Stereo Depth Estimation +2

Paper
Code

Instance Similarity Learning for Unsupervised Feature Representation

1 code implementation • ICCV 2021 • Ziwei Wang, Yunsong Wang, Ziyi Wu, Jiwen Lu, Jie zhou

In this paper, we propose an instance similarity learning (ISL) method for unsupervised feature representation.

Image Classification Semantic Similarity +1

Paper
Code

Generalizable Mixed-Precision Quantization via Attribution Rank Preservation

1 code implementation • ICCV 2021 • Ziwei Wang, Han Xiao, Jiwen Lu, Jie zhou

On the contrary, our GMPQ searches the mixed-quantization policy that can be generalized to largescale datasets with only a small amount of data, so that the search cost is significantly reduced without performance degradation.

Quantization

Paper
Code

Multi-objective Evolutionary Approach for Efficient Kernel Size and Shape for CNN

no code implementations • 28 Jun 2021 • Ziwei Wang, Martin A. Trefzer, Simon J. Bale, Andy M. Tyrrell

Therefore, this paper considers optimising the computational resource consumption by reducing the size and number of kernels in convolutional layers.

Evolutionary Algorithms

Paper
Add Code

Enhanced Modality Transition for Image Captioning

no code implementations • 23 Feb 2021 • Ziwei Wang, Yadan Luo, Zi Huang

In this work, we explicitly build a Modality Transition Module (MTM) to transfer visual features into semantic representations before forwarding them to the language model.

Image Captioning Language Modelling +2

Paper
Add Code

Reanalyses and a high-resolution model fail to capture the `high tail' of CAPE distributions

no code implementations • 24 Dec 2020 • Ziwei Wang, James A. Franke, Zhenqi Luo, Elisabeth J. Moyer

Both reanalyses and model consistently show too-narrow distributions of CAPE, with the high tail ($>$ 95th percentile) systematically biased low by up to 10% in surface-based CAPE and 20% at the most unstable layer.

Atmospheric and Oceanic Physics

Paper
Add Code

Event Camera Calibration of Per-pixel Biased Contrast Threshold

1 code implementation • 17 Dec 2020 • Ziwei Wang, Yonhon Ng, Pieter van Goor, Robert Mahony

Currently, most of the existing works use a single contrast threshold to estimate the intensity change of all pixels.

Camera Calibration

Paper
Code

An Asynchronous Kalman Filter for Hybrid Event Cameras

1 code implementation • ICCV 2021 • Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert Mahony

Conversely, conventional image sensors measure absolute intensity of slowly changing scenes effectively but do poorly on high dynamic range or quickly changing scenes.

Event-Based Video Reconstruction Video Reconstruction

Paper
Code

ORD: Object Relationship Discovery for Visual Dialogue Generation

no code implementations • 15 Jun 2020 • Ziwei Wang, Zi Huang, Yadan Luo, Huimin Lu

With the rapid advancement of image captioning and visual question answering at single-round level, the question of how to generate multi-round dialogue about visual content has not yet been well explored. Existing visual dialogue methods encode the image into a fixed feature vector directly, concatenated with the question and history embeddings to predict the response. Some recent methods tackle the co-reference resolution problem using co-attention mechanism to cross-refer relevant elements from the image, history, and the target question. However, it remains challenging to reason visual relationships, since the fine-grained object-level information is omitted before co-attentive reasoning.

Dialogue Generation Graph Attention +5

Paper
Add Code

BiDet: An Efficient Binarized Object Detector

2 code implementations • CVPR 2020 • Ziwei Wang, Ziyi Wu, Jiwen Lu, Jie zhou

Conventional network binarization methods directly quantize the weights and activations in one-stage or two-stage detectors with constrained representational capacity, so that the information redundancy in the networks causes numerous false positives and degrades the performance significantly.

Binarization Object +2

173

Paper
Code

Learning from the Past: Continual Meta-Learning via Bayesian Graph Modeling

no code implementations • 12 Nov 2019 • Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Mahsa Baktashmotlagh, Yang Yang

Meta-learning for few-shot learning allows a machine to leverage previously acquired knowledge as a prior, thus improving the performance on novel tasks with only small amounts of data.

Continual Learning Few-Shot Learning

Paper
Add Code

Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation

no code implementations • 1 Aug 2019 • Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Jingjing Li, Yang Yang

Visual paragraph generation aims to automatically describe a given image from different perspectives and organize sentences in a coherent way.

Imitation Learning reinforcement-learning +1

Paper
Add Code

Learning Channel-Wise Interactions for Binary Convolutional Neural Networks

no code implementations • CVPR 2019 • Ziwei Wang, Jiwen Lu, Chenxin Tao, Jie Zhou, Qi Tian

In this paper, we propose a channel-wise interaction based binary convolutional neural network learning method (CI-BCNN) for efficient inference.

Quantization

Paper
Add Code

Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval

no code implementations • 5 Apr 2019 • Yadan Luo, Ziwei Wang, Zi Huang, Yang Yang, Huimin Lu

With the increasing number of online stores, there is a pressing need for intelligent search systems to understand the item photos snapped by customers and search against large-scale product databases to find their desired items.

Attribute Image Retrieval +1

Paper
Add Code

Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration

3 code implementations • CVPR 2019 • Yang He, Ping Liu, Ziwei Wang, Zhilan Hu, Yi Yang

In this paper, we analyze this norm-based criterion and point out that its effectiveness depends on two requirements that are not always met: (1) the norm deviation of the filters should be large; (2) the minimum norm of the filters should be small.

Image Classification

38,418

Paper
Code

Look Deeper See Richer: Depth-aware Image Paragraph Captioning

no code implementations • ACM International Conference on Multimedia 2018 • Ziwei Wang, Yadan Luo, Yang Li, Zi Huang, Hongzhi Yin

Existing image paragraph captioning methods give a series of sentences to represent the objects and regions of interests, where the descriptions are essentially generated by feeding the image fragments containing objects and regions into conventional image single-sentence captioning models.

Ranked #9 on Image Paragraph Captioning on Image Paragraph Captioning

Image Captioning Image Paragraph Captioning +1

Paper
Add Code

Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning

no code implementations • 22 Aug 2018 • Yadan Luo, Ziwei Wang, Zi Huang, Yang Yang, Cong Zhao

Rich high-quality annotated data is critical for semantic segmentation learning, yet acquiring dense and pixel-wise ground-truth is both labor- and time-consuming.

Segmentation Semantic Segmentation +1

Paper
Add Code

GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning

no code implementations • CVPR 2018 • Yueqi Duan, Ziwei Wang, Jiwen Lu, Xudong Lin, Jie zhou

Specifically, we design a deep reinforcement learning model to learn the structure of the graph for bitwise interaction mining, reducing the uncertainty of binary codes by maximizing the mutual information with inputs and related bits, so that the ambiguous bits receive additional instruction from the graph for confident binarization.

Binarization reinforcement-learning +2

Paper
Add Code

Learning Deep Binary Descriptor With Multi-Quantization

no code implementations • CVPR 2017 • Yueqi Duan, Jiwen Lu, Ziwei Wang, Jianjiang Feng, Jie zhou

In this paper, we propose an unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual matching.

Binarization Image Retrieval +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.