no code implementations • COLING 2022 • BoWen Zhang, Xu Huang, Zhichao Huang, Hu Huang, Baoquan Zhang, Xianghua Fu, Liwen Jing
SILTN is interpretable because it is a neurosymbolic formalism and a computational model that supports learning and reasoning about data with a differentiable first-order logic language (FOL).
no code implementations • COLING 2022 • Bo Liu, Wandi Xu, Yuejia Xiang, XiaoJun Wu, Lejian He, BoWen Zhang, Li Zhu
However, we find that noise learning in text classification is relatively underdeveloped: 1. many methods that have been proven effective in the image domain are not explored in text classification, 2. it is difficult to conduct a fair comparison between previous studies as they do experiments in different noise settings.
1 code implementation • 6 Mar 2023 • BoWen Zhang, Harold Soh
In this work, we explore the potential of large-language models (LLMs) -- which have consumed vast amounts of human-generated text data -- to act as zero-shot human models for HRI.
no code implementations • 17 Feb 2023 • BoWen Zhang, Zhijin Qin, Geoffrey Ye Li
Wireless extended reality (XR) has attracted wide attentions as a promising technology to improve users' mobility and quality of experience.
no code implementations • 2 Feb 2023 • Eivind Meyer, Maurice Brenner, BoWen Zhang, Max Schickert, Bilal Musani, Matthias Althoff
Heterogeneous graphs offer powerful data representations for traffic, given their ability to model the complex interaction effects among a varying number of traffic participants and the underlying road infrastructure.
no code implementations • 30 Jan 2023 • Chen Chen, BoWen Zhang, Liangliang Cao, Jiguang Shen, Tom Gunter, Albin Madappally Jose, Alexander Toshev, Jonathon Shlens, Ruoming Pang, Yinfei Yang
We extend the CLIP model and build a sparse text and image representation (STAIR), where the image and text are mapped to a sparse token space.
no code implementations • 19 Jan 2023 • BoWen Zhang, Xiaojie Jin, Weibo Gong, Kai Xu, Zhao Zhang, Peng Wang, Xiaohui Shen, Jiashi Feng
State-of-the-art video-text retrieval (VTR) methods usually fully fine-tune the pre-trained model (e. g.
no code implementations • 16 Jan 2023 • Tianyue Cao, BoWen Zhang, Zhao Jin, Yongzhi Cao, Hanpin Wang
To deal with properties on variable-length sequences and multilevel data structures, we propose sequence-heap separation logic which integrates sequences into logical reasoning on heap-manipulated programs.
1 code implementation • 30 Dec 2022 • BoWen Zhang, Daijun Ding, Liwen Jing
ChatGPT has the potential to be the best AI model for stance detection tasks in NLP, or at least change the research paradigm of this field.
no code implementations • 16 Dec 2022 • BoWen Zhang, Zhijin Qin, Yiyu Guo, Geoffrey Ye Li
In particular, semantic sensing is used to improve the sensing efficiency by exploring the spatial-temporal distributions of semantic information.
1 code implementation • 15 Dec 2022 • BoWen Zhang, Chenyang Qi, Pan Zhang, Bo Zhang, HsiangTao Wu, Dong Chen, Qifeng Chen, Yong Wang, Fang Wen
In this work, we propose an ID-preserving talking head generation framework, which advances previous methods in two aspects.
1 code implementation • 7 Dec 2022 • Ziqin Zhou, BoWen Zhang, Yinjie Lei, Lingqiao Liu, Yifan Liu
Recently, CLIP has been applied to pixel-level zero-shot learning tasks via a two-stage scheme.
no code implementations • 15 Nov 2022 • R. Austin McEver, BoWen Zhang, B. S. Manjunath
However, in many scenarios, it can be difficult to collect images for training, not to mention the costs associated with collecting annotations suitable for training these object detectors.
no code implementations • 10 Nov 2022 • Zeyu Feng, BoWen Zhang, Jianxin Bi, Harold Soh
In this work, we focus on the problem of safe policy transfer in reinforcement learning: we seek to leverage existing policies when learning a new task with specified constraints.
1 code implementation • 12 Oct 2022 • BoWen Zhang, Zhi Tian, Quan Tang, Xiangxiang Chu, Xiaolin Wei, Chunhua Shen, Yifan Liu
We explore the capability of plain Vision Transformers (ViTs) for semantic segmentation and propose the SegVit.
Ranked #6 on
Semantic Segmentation
on PASCAL Context
1 code implementation • 17 Sep 2022 • BoWen Zhang, Xi Zhao, He Wang, Ruizhen Hu
The core challenge is to generate plausible geometries to fill the unobserved part of the object based on a partial scan, which is under-constrained and suffers from a huge solution space.
no code implementations • 1 Jun 2022 • R. Austin McEver, BoWen Zhang, Connor Levenson, A S M Iftekhar, B. S. Manjunath
Each video includes annotations indicating the start and end times of substrates across the video in addition to counts of species of interest.
no code implementations • 11 May 2022 • BoWen Zhang, Houssem Sifaou, Geoffrey Ye Li
On the other hand, considering the generality of a tracking system, we decouple the tracking system from the CSI environments so that one tracking system for all environments becomes possible.
no code implementations • 10 Mar 2022 • Lucas Relic, BoWen Zhang, Yi-Lin Tuan, Michael Beyeler
Retinal implants have the potential to treat incurable blindness, yet the quality of the artificial vision they produce is still rudimentary.
1 code implementation • CVPR 2022 • BoWen Zhang, Shuyang Gu, Bo Zhang, Jianmin Bao, Dong Chen, Fang Wen, Yong Wang, Baining Guo
To this end, we believe that local attention is crucial to strike the balance between computational efficiency and modeling capacity.
Ranked #1 on
Image Generation
on CelebA 256x256
(FID metric)
2 code implementations • 14 Dec 2021 • Yidong Wang, BoWen Zhang, Wenxin Hou, Zhen Wu, Jindong Wang, Takahiro Shinozaki
The long-tailed class distribution in visual recognition tasks poses great challenges for neural networks on how to handle the biased predictions between head and tail classes, i. e., the model tends to classify tail classes as head classes.
no code implementations • 14 Dec 2021 • BoWen Zhang, Jiahui Yu, Christopher Fifty, Wei Han, Andrew M. Dai, Ruoming Pang, Fei Sha
We term this approach as Co-training Videos and Images for Action Recognition (CoVeR).
Ranked #5 on
Action Classification
on Moments in Time
(using extra training data)
1 code implementation • NeurIPS 2021 • BoWen Zhang, Yidong Wang, Wenxin Hou, Hao Wu, Jindong Wang, Manabu Okumura, Takahiro Shinozaki
However, like other modern SSL algorithms, FixMatch uses a pre-defined constant threshold for all classes to select unlabeled data that contribute to the training, thus failing to consider different learning status and learning difficulties of different classes.
no code implementations • Findings (EMNLP) 2021 • BoWen Zhang, Hexiang Hu, Linlu Qiu, Peter Shaw, Fei Sha
We investigate ways to compose complex concepts in texts from primitive ones while grounding them in images.
2 code implementations • EMNLP 2021 • Linlu Qiu, Hexiang Hu, BoWen Zhang, Peter Shaw, Fei Sha
We analyze the grounded SCAN (gSCAN) benchmark, which was recently proposed to study systematic generalization for grounded language understanding.
1 code implementation • NeurIPS 2021 • BoWen Zhang, Yifan Liu, Zhi Tian, Chunhua Shen
This neural representation enables our decoder to leverage the smoothness prior in the semantic label space, and thus makes our decoder more efficient.
1 code implementation • 13 Jun 2021 • Marija Vella, BoWen Zhang, Wei Chen, João F. C. Mota
Such methods, however, cannot guarantee that the input measurements are satisfied in the recovered image, since the learned parameters by the network are applied to every test image.
no code implementations • 28 Feb 2021 • Yichao Zhou, Wei-Ting Chen, BoWen Zhang, David Lee, J. Harry Caufield, Kai-Wei Chang, Yizhou Sun, Peipei Ping, Wei Wang
Clinical case reports are written descriptions of the unique aspects of a particular clinical case, playing an essential role in sharing clinical experiences about atypical disease phenotypes and new therapies.
no code implementations • 5 Feb 2021 • Zhi Tian, BoWen Zhang, Hao Chen, Chunhua Shen
In the literature, top-performing instance segmentation methods typically follow the paradigm of Mask R-CNN and rely on ROI operations (typically ROIAlign) to attend to each instance.
no code implementations • 18 Nov 2020 • BoWen Zhang, Hexiang Hu, Joonseok Lee, Ming Zhao, Sheide Chammas, Vihan Jain, Eugene Ie, Fei Sha
Identifying a short segment in a long video that semantically matches a text query is a challenging task that has important application potentials in language-based video search, browsing, and navigation.
no code implementations • 29 Oct 2020 • Wei Chen, BoWen Zhang, Shi Jin, Bo Ai, Zhangdui Zhong
Sparse signal recovery problems from noisy linear measurements appear in many areas of wireless communications.
no code implementations • EMNLP 2020 • BoWen Zhang, Hexiang Hu, Vihan Jain, Eugene Ie, Fei Sha
Recent progresses have leveraged the ideas of pre-training (from language modeling) and attention layers in Transformers to learn representation from datasets containing images aligned with linguistic expressions that describe the images.
no code implementations • 6 Oct 2020 • BoWen Zhang, Hao Chen, Meng Wang, Yuanjun Xiong
We formulate the problem of online temporal action detection in live streaming videos, acknowledging one important property of live streaming videos that there is normally a broadcast delay between the latest captured frame and the actual frame viewed by the audience.
no code implementations • ACL 2020 • Bowen Zhang, Min Yang, Xutao Li, Yunming Ye, Xiaofei Xu, Kuai Dai
Specifically, a semantic-emotion heterogeneous graph is constructed from external semantic and emotion lexicons, which is then fed into a graph convolutional network to learn multi-hop semantic connections between words and emotion tags.
no code implementations • 26 Mar 2020 • Bowen Zhang, Benedetta Tondi, Xixiang Lv, Mauro Barni
The existence of adversarial examples and the easiness with which they can be generated raise several security concerns with regard to deep learning systems, pushing researchers to develop suitable defense mechanisms.
no code implementations • 13 Jan 2020 • Bowen Zhang, Hexiang Hu, Fei Sha
To narrate a sequence of images, we use the predicted anchor word embeddings and the image features as the joint input to a seq2seq model.
no code implementations • 1 Oct 2019 • Bowen Zhang, Benedetta Tondi, Mauro Barni
In this paper, we study the vulnerability of anti-spoofing methods based on deep learning against adversarial perturbations.
Cryptography and Security
1 code implementation • ECCV 2018 • Bowen Zhang, Hexiang Hu, Fei Sha
Similarly, a paragraph may contain sentences with different topics, which collectively conveys a coherent message or story.
no code implementations • 27 Jul 2018 • Bowen Zhang, Xifan Zhang, Fan Cheng, Deli Zhao
During testing, combined with the test sample and the points in the class, a new simplex is formed.
1 code implementation • CVPR 2016 • Bowen Zhang, Li-Min Wang, Zhe Wang, Yu Qiao, Hanli Wang
The deep two-stream architecture exhibited excellent performance on video based action recognition.
Ranked #70 on
Action Recognition
on UCF101