no code implementations • ECCV 2020 • Ziwei Wang, Quan Zheng, Jiwen Lu, Jie zhou
n this paper, we propose a Deep Hashing method with Active Pairwise Supervision(DH-APS).
1 code implementation • 10 Oct 2024 • Changyuan Wang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu
On the contrary, we mine the cross-layer dependency that significantly influences discretization errors of the entire vision-language model, and embed this dependency into optimal quantization strategy searching with low search cost.
no code implementations • 9 Oct 2024 • Jiashi Gao, Ziwei Wang, Xiangyu Zhao, Xin Yao, Xuetao Wei
However, these studies primarily focus on perturbing accuracy, leaving a critical question unexplored: Can an attacker bypass the group fairness mechanisms in FL and manipulate the global model to be biased?
no code implementations • 21 Aug 2024 • Xiuwei Xu, Huangxing Chen, Linqing Zhao, Ziwei Wang, Jie zhou, Jiwen Lu
In this paper, we aim to leverage Segment Anything Model (SAM) for real-time 3D instance segmentation in an online setting.
no code implementations • 31 Jul 2024 • Xusheng Luo, Tianhao Wei, Simin Liu, Ziwei Wang, Luis Mattei-Mendez, Taylor Loper, Joshua Neighbor, Casidhe Hutchison, Changliu Liu
This work addresses the certification of the local robustness of vision-based two-stage 6D object pose estimation.
1 code implementation • 2 Jul 2024 • Xiang Li, Haoran Tang, Siyu Chen, Ziwei Wang, Ryan Chen, Marcin Abram
This effect is especially visible for open questions and questions of high difficulty or novelty.
no code implementations • 17 Jun 2024 • Zhenyu Wu, Ziwei Wang, Xiuwei Xu, Jiwen Lu, Haibin Yan
For the task planner, we generate the feasible step-by-step plans for human goal accomplishment according to the task completion process and the known visual clues.
no code implementations • 3 Jun 2024 • Guanxing Lu, Zifeng Gao, Tianxing Chen, Wenxun Dai, Ziwei Wang, Yansong Tang
To model this process, we design a consistency distillation technique to predict the action sample directly instead of predicting the noise within the vision community for fast convergence in the low-dimensional action manifold.
no code implementations • 13 Mar 2024 • Guanxing Lu, Shiyi Zhang, Ziwei Wang, Changliu Liu, Jiwen Lu, Yansong Tang
Performing language-conditioned robotic manipulation tasks in unstructured environments is highly demanded for general intelligent robots.
no code implementations • CVPR 2024 • Xiuwei Xu, Chong Xia, Ziwei Wang, Linqing Zhao, Yueqi Duan, Jie zhou, Jiwen Lu
To this end, we propose an adapter-based plug-and-play module for the backbone of 3D scene perception model, which constructs memory to cache and aggregate the extracted RGB-D features to empower offline models with temporal learning ability.
no code implementations • 4 Mar 2024 • Juhao Liang, Ziwei Wang, Zhuoheng Ma, Jianquan Li, Zhiyi Zhang, Xiangbo Wu, Benyou Wang
Large Language Models(LLMs) have dramatically revolutionized the field of Natural Language Processing(NLP), offering remarkable capabilities that have garnered widespread usage.
no code implementations • CVPR 2024 • Linqing Zhao, Xiuwei Xu, Ziwei Wang, Yunpeng Zhang, Borui Zhang, Wenzhao Zheng, Dalong Du, Jie zhou, Jiwen Lu
In this paper we present a tensor decomposition and low-rank recovery approach (LowRankOcc) for vision-based 3D semantic occupancy prediction.
no code implementations • 12 Dec 2023 • Guanxing Lu, Ziwei Wang, Changliu Liu, Jiwen Lu, Yansong Tang
Embodied Instruction Following (EIF) requires agents to complete human instruction by interacting objects in complicated surrounding environments.
2 code implementations • 12 Dec 2023 • Xiang Li, Haoran Tang, Siyu Chen, Ziwei Wang, Anurag Maravi, Marcin Abram
In this paper, we explore the challenges inherent to Large Language Models (LLMs) like GPT-4, particularly their propensity for hallucinations, logic mistakes, and incorrect conclusions when tasked with answering complex questions.
no code implementations • 10 Nov 2023 • Ziwei Wang, Nabil Aouf, Jose Pizarro, Christophe Honvault
The adversarial attack detector is then built based on a Long Short Term Memory (LSTM) network which takes the explainability measure namely SHapley Value from the CNN-based pose estimator and flags the detection of adversarial attacks when acting.
1 code implementation • NeurIPS 2023 • Yinan Liang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu
Due to the high price and heavy energy consumption of GPUs, deploying deep models on IoT devices such as microcontrollers makes significant contributions for ecological AI.
no code implementations • 13 Oct 2023 • Junlei Zhou, Jiashi Gao, Ziwei Wang, Xuetao Wei
Previous work only focused on data attribution from the training data perspective, which is unsuitable for tracing and quantifying copyright infringement in practice because of the following reasons: (1) the training datasets are not always available in public; (2) the model provider is the responsible party, not the image.
no code implementations • 9 Oct 2023 • Zhenyu Wu, Xiuwei Xu, Ziwei Wang, Chong Xia, Linqing Zhao, Jiwen Lu, Haibin Yan
Existing methods only consider fixed frames of input data for a single detector, such as monocular RGB-D images or point clouds reconstructed from dense multi-view RGB-D images.
no code implementations • 5 Oct 2023 • Jia Syuen Lim, Ziwei Wang, Jiajun Liu, Abdelwahed Khamis, Reza Arablouei, Robert Barlow, Ryan McAllister
Regulatory compliance auditing across diverse industrial domains requires heightened quality assurance and traceability.
no code implementations • 20 Sep 2023 • Haolin Fei, Stefano Tedeschi, Yanpei Huang, Andrew Kennedy, Ziwei Wang
In response to these challenges, this paper introduces an innovative human-robot collaborative framework that seamlessly integrates hand gesture and dynamic movement recognition, voice recognition, and a switchable control adaptation strategy.
1 code implementation • 3 Sep 2023 • Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert Mahony
We also demonstrate the integration of image convolution with linear spatial kernels Gaussian, Sobel, and Laplacian as an application of our architecture.
no code implementations • 26 Aug 2023 • Ce Liu, Ziwei Wang, HanZhe Zhang
The classic two-sided many-to-one job matching model assumes that firms treat workers as substitutes and workers ignore colleagues when choosing where to work.
2 code implementations • 20 Jul 2023 • Ziwei Wang, Timothy Molloy, Pieter van Goor, Robert Mahony
Event-based cameras are popular for tracking fast-moving objects due to their high temporal resolution, low latency, and high dynamic range.
1 code implementation • 4 Jul 2023 • Zhenyu Wu, Ziwei Wang, Xiuwei Xu, Jiwen Lu, Haibin Yan
Equipping embodied agents with commonsense is important for robots to successfully complete complex human instructions in general environments.
1 code implementation • CVPR 2024 • Changyuan Wang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu
On the contrary, we design group-wise quantization functions for activation discretization in different timesteps and sample the optimal timestep for informative calibration image generation, so that our quantized diffusion model can reduce the discretization errors with negligible computational overhead.
1 code implementation • 5 May 2023 • Xiuwei Xu, Zhihao Sun, Ziwei Wang, Hongmin Liu, Jie zhou, Jiwen Lu
Specifically, we theoretically derive a dynamic spatial pruning (DSP) strategy to prune the redundant spatial representation of 3D scene in a cascade manner according to the distribution of objects.
no code implementations • 23 Apr 2023 • Ziwei Wang, Jiabin Wu
We present a model that investigates preference evolution with endogenous matching.
1 code implementation • 13 Apr 2023 • Ziwei Wang, Jiwen Lu, Han Xiao, Shengyu Liu, Jie zhou
On the contrary, we obtain the optimal efficient networks by directly optimizing the compression policy with an accurate performance predictor, where the ultrafast automated model compression for various computational cost constraint is achieved without complex compression policy search and evaluation.
no code implementations • CVPR 2023 • Xiuwei Xu, Ziwei Wang, Jie zhou, Jiwen Lu
In this paper, we propose binary sparse convolutional networks called BSC-Net for efficient point cloud analysis.
no code implementations • 23 Feb 2023 • Zhenyu Wu, Ziwei Wang, Jiwen Lu, Haibin Yan
Then we fuse the feature maps representing the visual information of multi-view RGB images and the pixel affinity learned from the clutter point cloud, where the acquired instance segmentation masks of multi-view RGB images are projected to partition the clutter point cloud.
no code implementations • 17 Nov 2022 • Sichao Huang, Ziwei Wang, Jie zhou, Jiwen Lu
We compare our approach with existing robotic packing methods for irregular objects in a physics simulator.
1 code implementation • 2 Nov 2022 • Takuya Kurihana, James Franke, Ian Foster, Ziwei Wang, Elisabeth Moyer
Clouds play a critical role in the Earth's energy budget and their potential changes are one of the largest uncertainties in future climate projections.
no code implementations • 31 Oct 2022 • Ziwei Wang, Reza Arablouei, Jiajun Liu, Paulo Borges, Greg Bishop-hurley, Nicholas Heaney
Object classification using LiDAR 3D point cloud data is critical for modern applications such as autonomous driving.
1 code implementation • 15 Aug 2022 • Yaxian Li, Bingqing Zhang, Guoping Zhao, Mingyu Zhang, Jiajun Liu, Ziwei Wang, JiRong Wen
After a survey for person-tracking system-induced privacy concerns, we propose a black-box adversarial attack method on state-of-the-art human detection models called InvisibiliTee.
no code implementations • 7 Aug 2022 • Quan Zheng, Ziwei Wang, Jie zhou, Jiwen Lu
Explaining deep convolutional neural networks has been recently drawing increasing attention since it helps to understand the networks' internal operations and why they make certain decisions.
1 code implementation • 24 Jun 2022 • Reza Arablouei, Ziwei Wang, Greg J. Bishop-Hurley, Jiajun Liu
However, the multimodal animal behavior classification algorithm based on posterior probability fusion is preferable to the one based on feature concatenation as it delivers better classification accuracy, has less computational and memory complexity, is more robust to sensor data failure, and enjoys better modularity.
1 code implementation • CVPR 2022 • Han Xiao, Ziwei Wang, Zheng Zhu, Jie zhou, Jiwen Lu
Differentiable architecture search (DARTS) acquires the optimal architectures by optimizing the architecture parameters with gradient descent, which significantly reduces the search cost.
1 code implementation • 17 May 2022 • Ziwei Wang, Dingran Yuan, Yonhon Ng, Robert Mahony
Event cameras are bio-inspired sensors that capture per-pixel asynchronous intensity change rather than the synchronous absolute intensity frames captured by a classical camera sensor.
no code implementations • 31 Dec 2021 • Gaochen Wu, Bin Xu, Yuxin Qin, Yang Liu, Lingyu Liu, Ziwei Wang
Search engines based on keyword retrieval can no longer adapt to the way of information acquisition in the era of intelligent Internet of Things due to the return of keyword related Internet pages.
1 code implementation • 11 Oct 2021 • Ziwei Wang, Liyuan Pan, Yonhon Ng, Zheyu Zhuang, Robert Mahony
We provide a SHEF dataset targeted at evaluating disparity estimation algorithms and introduce a stereo disparity estimation algorithm that uses edge information extracted from the event stream correlated with the edge detected in the frame data.
1 code implementation • ICCV 2021 • Ziwei Wang, Han Xiao, Jiwen Lu, Jie zhou
On the contrary, our GMPQ searches the mixed-quantization policy that can be generalized to largescale datasets with only a small amount of data, so that the search cost is significantly reduced without performance degradation.
1 code implementation • ICCV 2021 • Ziwei Wang, Yunsong Wang, Ziyi Wu, Jiwen Lu, Jie zhou
In this paper, we propose an instance similarity learning (ISL) method for unsupervised feature representation.
no code implementations • 28 Jun 2021 • Ziwei Wang, Martin A. Trefzer, Simon J. Bale, Andy M. Tyrrell
Therefore, this paper considers optimising the computational resource consumption by reducing the size and number of kernels in convolutional layers.
no code implementations • 23 Feb 2021 • Ziwei Wang, Yadan Luo, Zi Huang
In this work, we explicitly build a Modality Transition Module (MTM) to transfer visual features into semantic representations before forwarding them to the language model.
no code implementations • 24 Dec 2020 • Ziwei Wang, James A. Franke, Zhenqi Luo, Elisabeth J. Moyer
Both reanalyses and model consistently show too-narrow distributions of CAPE, with the high tail ($>$ 95th percentile) systematically biased low by up to 10% in surface-based CAPE and 20% at the most unstable layer.
Atmospheric and Oceanic Physics
1 code implementation • 17 Dec 2020 • Ziwei Wang, Yonhon Ng, Pieter van Goor, Robert Mahony
Currently, most of the existing works use a single contrast threshold to estimate the intensity change of all pixels.
1 code implementation • ICCV 2021 • Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert Mahony
Conversely, conventional image sensors measure absolute intensity of slowly changing scenes effectively but do poorly on high dynamic range or quickly changing scenes.
no code implementations • 15 Jun 2020 • Ziwei Wang, Zi Huang, Yadan Luo, Huimin Lu
With the rapid advancement of image captioning and visual question answering at single-round level, the question of how to generate multi-round dialogue about visual content has not yet been well explored. Existing visual dialogue methods encode the image into a fixed feature vector directly, concatenated with the question and history embeddings to predict the response. Some recent methods tackle the co-reference resolution problem using co-attention mechanism to cross-refer relevant elements from the image, history, and the target question. However, it remains challenging to reason visual relationships, since the fine-grained object-level information is omitted before co-attentive reasoning.
2 code implementations • CVPR 2020 • Ziwei Wang, Ziyi Wu, Jiwen Lu, Jie zhou
Conventional network binarization methods directly quantize the weights and activations in one-stage or two-stage detectors with constrained representational capacity, so that the information redundancy in the networks causes numerous false positives and degrades the performance significantly.
no code implementations • 12 Nov 2019 • Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Mahsa Baktashmotlagh, Yang Yang
Meta-learning for few-shot learning allows a machine to leverage previously acquired knowledge as a prior, thus improving the performance on novel tasks with only small amounts of data.
no code implementations • 1 Aug 2019 • Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Jingjing Li, Yang Yang
Visual paragraph generation aims to automatically describe a given image from different perspectives and organize sentences in a coherent way.
no code implementations • CVPR 2019 • Ziwei Wang, Jiwen Lu, Chenxin Tao, Jie Zhou, Qi Tian
In this paper, we propose a channel-wise interaction based binary convolutional neural network learning method (CI-BCNN) for efficient inference.
no code implementations • 5 Apr 2019 • Yadan Luo, Ziwei Wang, Zi Huang, Yang Yang, Huimin Lu
With the increasing number of online stores, there is a pressing need for intelligent search systems to understand the item photos snapped by customers and search against large-scale product databases to find their desired items.
3 code implementations • CVPR 2019 • Yang He, Ping Liu, Ziwei Wang, Zhilan Hu, Yi Yang
In this paper, we analyze this norm-based criterion and point out that its effectiveness depends on two requirements that are not always met: (1) the norm deviation of the filters should be large; (2) the minimum norm of the filters should be small.
no code implementations • ACM International Conference on Multimedia 2018 • Ziwei Wang, Yadan Luo, Yang Li, Zi Huang, Hongzhi Yin
Existing image paragraph captioning methods give a series of sentences to represent the objects and regions of interests, where the descriptions are essentially generated by feeding the image fragments containing objects and regions into conventional image single-sentence captioning models.
no code implementations • 22 Aug 2018 • Yadan Luo, Ziwei Wang, Zi Huang, Yang Yang, Cong Zhao
Rich high-quality annotated data is critical for semantic segmentation learning, yet acquiring dense and pixel-wise ground-truth is both labor- and time-consuming.
no code implementations • CVPR 2018 • Yueqi Duan, Ziwei Wang, Jiwen Lu, Xudong Lin, Jie zhou
Specifically, we design a deep reinforcement learning model to learn the structure of the graph for bitwise interaction mining, reducing the uncertainty of binary codes by maximizing the mutual information with inputs and related bits, so that the ambiguous bits receive additional instruction from the graph for confident binarization.
no code implementations • CVPR 2017 • Yueqi Duan, Jiwen Lu, Ziwei Wang, Jianjiang Feng, Jie zhou
In this paper, we propose an unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual matching.