1 code implementation • ECCV 2020 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao
The proposed framework is gradient-based and model-agnostic.
1 code implementation • 14 Nov 2024 • Zheng Zhou, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiaowei Huang, Qi Zhao
Dataset Distillation (DD) is an emerging technique that compresses large-scale datasets into significantly smaller synthesized datasets while preserving high test performance and enabling the efficient training of large models.
1 code implementation • 8 Nov 2024 • Shuchang Lyu, Qi Zhao, Guangliang Cheng, Yiwei He, Zheng Zhou, Guangbiao Wang, Zhenwei Shi
This task presents two principal challenges: (1) severe inconsistencies in feature representation across different remote sensing domains, and (2) a domain gap that emerges due to the representation bias of source domain patterns when translating features to predictive logits.
no code implementations • 16 Oct 2024 • Yao Shen, Ziwei Wei, Chunmeng Liu, Shuming Wei, Qi Zhao, Kaiyang Zeng, Guangyao Li
To address these challenges, we propose an Adaptive Prompt Learning with SAM (APL-SAM) framework tailored for few-shot SPM image segmentation.
1 code implementation • 28 Aug 2024 • Qi Zhao, Haotian Fu, Chen Sun, George Konidaris
Long-horizon decision-making tasks present significant challenges for LLM-based agents due to the need for extensive planning over multiple steps.
no code implementations • 20 Aug 2024 • Yilun Kong, Hangyu Mao, Qi Zhao, Bin Zhang, Jingqing Ruan, Li Shen, Yongzhe Chang, Xueqian Wang, Rui Zhao, DaCheng Tao
We derive insights from offline prompting demonstration data, which already exists in large quantities as a by-product of benchmarking diverse prompts on open-sourced tasks, thereby circumventing the expenses of online interactions.
no code implementations • 5 Aug 2024 • Xianyu Chen, Ming Jiang, Qi Zhao
While exploring visual scenes, humans' scanpaths are driven by their underlying attention processes.
1 code implementation • 3 Jun 2024 • Zheng Zhou, Hongbo Zhao, Guangliang Cheng, Xiangtai Li, Shuchang Lyu, Wenquan Feng, Qi Zhao
Our extensive experiments confirm the effectiveness of BACON and its seamless integration with existing methods, thereby enhancing their performance for the DD task.
no code implementations • 28 May 2024 • Shaoxuan Cui, Qi Zhao, Guofeng Zhang, Hildeberto Jardón-Kojakhmetov, Ming Cao
To address this challenge, we introduce HOIs (Higher-Order Interactions) which are able to capture, for example, the indirect effect of one species on a second one correlating to a third species.
no code implementations • 6 May 2024 • Qi Zhao, Tengfei Liu, Bai Yan, Qiqi Duan, Jian Yang, Yuhui Shi
To bridge the gap, this paper proposes an autoregressive learning-based designer for automated design of metaheuristic algorithms.
no code implementations • CVPR 2024 • Xianyu Chen, Ming Jiang, Qi Zhao
Understanding how attention varies across individuals has significant scientific and societal impacts.
no code implementations • 14 Apr 2024 • Bai Yan, Qi Zhao, Jin Zhang, J. Andrew Zhang
This paper tackles the deployment challenges of Simultaneous Transmitting and Reflecting Reconfigurable Intelligent Surface (STAR-RIS) in communication systems.
no code implementations • CVPR 2024 • Qi Zhao, M. Salman Asif, Zhan Ma
To address this issue, we introduce the Pyramidal Neural Representation for Videos (PNeRV), which is built on a multi-scale information connection and comprises a lightweight rescaling operator, Kronecker Fully-connected layer (KFc), and a Benign Selective Memory (BSM) mechanism.
1 code implementation • 26 Mar 2024 • Jue Wang, Yuxiang Lin, Qi Zhao, Dong Luo, Shuaibao Chen, Wei Chen, Xiaojiang Peng
The widespread use of various chemical gases in industrial processes necessitates effective measures to prevent their leakage during transportation and storage, given their high toxicity.
1 code implementation • 4 Jan 2024 • Jing Wu, Suiyao Chen, Qi Zhao, Renat Sergazinov, Chen Li, ShengJie Liu, Chongchao Zhao, Tianpei Xie, Hanqing Guo, Cheng Ji, Daniel Cociorva, Hakan Brunzel
Self-supervised representation learning methods have achieved significant success in computer vision and natural language processing, where data samples exhibit explicit spatial or semantic dependencies.
1 code implementation • 22 Nov 2023 • Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun
To interpret the important text evidence for question answering, we generalize the concept bottleneck model to work with tokens and nonlinear models, which uses hard attention to select a small subset of tokens from the free-form text as inputs to the LLM reasoner.
Ranked #9 on Video Question Answering on NExT-QA
1 code implementation • 22 Oct 2023 • Chunlei Wang, Wenquan Feng, Xiangtai Li, Guangliang Cheng, Shuchang Lyu, Binghao Liu, Lijiang Chen, Qi Zhao
While current foundational models excel at various visual language tasks, there's a noticeable absence of models specifically tailored for open-vocabulary visual grounding.
1 code implementation • NeurIPS 2023 • Shi Chen, Ming Jiang, Qi Zhao
In recent years, deep saliency models have made significant progress in predicting human visual attention.
no code implementations • 9 Oct 2023 • Qiqi Duan, Chang Shao, Guochen Zhou, Minghan Zhang, Qi Zhao, Yuhui Shi
In the post-Moore era, main performance gains of black-box optimizers are increasingly depending on parallelism, especially for large-scale optimization (LSO).
no code implementations • 7 Sep 2023 • Zhuoyuan Ma, Qi Zhao, Bai Yan, Jin Zhang
The paper constructs a STAR-RIS assisted multi-user multiple-input single-output (MU-MISO) mobile wireless network and jointly optimizes the dynamic deployment strategy of STAR-RIS and the hybrid beamforming strategy to maximize the long-term total communication rate of users.
1 code implementation • 31 Jul 2023 • Qi Zhao, Shijie Wang, Ce Zhang, Changcheng Fu, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun
We propose to formulate the LTA task from two perspectives: a bottom-up approach that predicts the next actions autoregressively by modeling temporal dynamics; and a top-down approach that infers the goal of the actor and plans the needed procedure to accomplish the goal.
Ranked #2 on Long Term Action Anticipation on Ego4D
1 code implementation • 23 Jul 2023 • Menghao Li, Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao
The proposed framework is evaluated on five regular VG datasets and two newly constructed robust VG datasets.
no code implementations • 9 May 2023 • Guangliang Cheng, Yunmeng Huang, Xiangtai Li, Shuchang Lyu, Zhaoyang Xu, Qi Zhao, Shiming Xiang
We first introduce some preliminary knowledge for the change detection task, such as problem definition, datasets, evaluation metrics, and transformer basics, as well as provide a detailed taxonomy of existing algorithms from three different perspectives: algorithm granularity, supervision modes, and learning frameworks in the methodology section.
Change Detection Change detection for remote sensing images +1
no code implementations • CVPR 2023 • Qi Zhao, M. Salman Asif, Zhan Ma
DNeRV achieves competitive results against the state-of-the-art neural compression approaches and outperforms existing implicit methods on downstream inpainting and interpolation for $960 \times 1920$ videos.
1 code implementation • 11 Apr 2023 • Qiqi Duan, Chang Shao, Guochen Zhou, Haobin Yang, Qi Zhao, Yuhui Shi
Given the ubiquity of non-separable optimization problems in real worlds, in this paper we analyze and extend the large-scale version of the well-known cooperative coevolution (CC), a divide-and-conquer black-box optimization framework, on non-separable functions.
1 code implementation • CVPR 2023 • Shi Chen, Qi Zhao
They have yet to develop the capability to address novel objects or spurious biases in real-world scenarios, and also fall short of interpreting the rationales behind their decisions.
1 code implementation • 12 Mar 2023 • Qi Zhao, Bai Yan, Taiwei Hu, Xianglong Chen, Qiqi Duan, Jian Yang, Yuhui Shi
In response, this paper proposes AutoOptLib, the first platform for accessible automated design of metaheuristic optimizers.
no code implementations • 12 Mar 2023 • Qi Zhao, Qiqi Duan, Bai Yan, Shi Cheng, Yuhui Shi
Metaheuristics have gained great success in academia and practice because their search logic can be applied to any problem with available solution representation, solution quality evaluation, and certain notions of locality.
1 code implementation • 13 Jan 2023 • Qi Zhao, Shuchang Lyu, Binghao Liu, Lijiang Chen, Hongbo Zhao
We first propose source student backbone and target student backbone to respectively extract the source-style and target-style feature for both source and target images.
1 code implementation • ICCV 2023 • Yifeng Zhang, Shi Chen, Qi Zhao
Answering visual questions requires the ability to parse visual observations and correlate them with a variety of knowledge.
1 code implementation • 12 Dec 2022 • Qiqi Duan, Guochen Zhou, Chang Shao, Zhuowei Wang, Mingyang Feng, Yuwei Huang, Yajing Tan, Yijun Yang, Qi Zhao, Yuhui Shi
In this paper, we present an open-source pure-Python library called PyPop7 for black-box optimization (BBO).
no code implementations • 26 Aug 2022 • Ye Wang, Qi Zhao, Wenyan Wu, Ailsa Willis, Angus R. Simpson, Erik Weyer
This paper presents a case study of the operational management of the Robinvale high-pressure piped irrigation water delivery system (RVHPS) in Australia.
no code implementations • 17 Aug 2022 • Menghao Li, Wenquan Feng, Shuchang Lyu, Lijiang Chen, Qi Zhao
On the DSB2018 and CA2. 5, our network surpasses previous methods by 1. 2% (AP50).
1 code implementation • 14 Jul 2022 • Qi Zhao, Shuchang Lyu, Wenpei Bai, Linghan Cai, Binghao Liu, Guangliang Cheng, Meijing Wu, Xiubo Sang, Min Yang, Lijiang Chen
To solve this problem, we propose a Multi-Modality Ovarian Tumor Ultrasound (MMOTU) image dataset containing 1469 2d ultrasound images and 170 contrast enhanced ultrasonography (CEUS) images with pixel-wise and global-wise annotations.
1 code implementation • 20 Apr 2022 • Shi Chen, Ming Jiang, Jinhui Yang, Qi Zhao
In this work, we propose an Attention with Reasoning capability (AiR) framework that uses attention to understand and improve the process leading to task outcomes.
no code implementations • 14 Apr 2022 • Qiuhao Chen, Yuxuan Du, Qi Zhao, Yuling Jiao, Xiliang Lu, Xingyao Wu
We systematically evaluate the performance of our proposal in compiling quantum operators with both inverse-closed and inverse-free universal basis sets.
1 code implementation • 3 Apr 2022 • Qi Zhao, Bai Yan, Xianglong Chen, Taiwei Hu, Shi Cheng, Yuhui Shi
However, the specific algorithm prototype and linear algorithm representation in the current automated design pipeline restrict the design within a fixed algorithm structure, which hinders discovering novelties and diversity across the metaheuristic family.
no code implementations • 16 Mar 2022 • Diu Khue Luu, Anh Tuan Nguyen, Ming Jiang, Markus W. Drealan, Jian Xu, Tong Wu, Wing-kin Tam, Wenfeng Zhao, Brian Z. H. Lim, Cynthia K. Overstreet, Qi Zhao, Jonathan Cheng, Edward W. Keefer, Zhi Yang
Objective: The next generation prosthetic hand that moves and feels like a real hand requires a robust neural interconnection between the human minds and machines.
1 code implementation • CVPR 2022 • Shi Chen, Qi Zhao
Finally, with our new data and method, we perform extensive analyses to study the effectiveness of our explanation under different settings, including multi-task learning and transfer learning.
Ranked #2 on Explanatory Visual Question Answering on GQA-REX
no code implementations • 26 Jan 2022 • Qi Zhao, Huanhao Li, Zhipeng Yu, Chi Man Woo, Tianting Zhong, Shengfu Cheng, Yuanjin Zheng, Honglin Liu, Jie Tian, Puxiang Lai
A scattering ground glass is exploited to generate physical secret keys of gigabit length and encrypt face images via seemingly random optical speckles at light speed.
1 code implementation • 23 Jan 2022 • Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao
To explore these issues, we formulate a new semi-supervised continual learning method, which can be generically applied to existing continual learning models.
1 code implementation • 23 Jan 2022 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao
To this end, we propose a new learning approach, namely gradient adjustment learning (GAL), to leverage the knowledge learned from the past training iterations to adjust vanilla gradients, such that the remainders are minimized and the approximations are improved.
1 code implementation • CVPR 2022 • Jinhui Yang, Xianyu Chen, Ming Jiang, Shi Chen, Louis Wang, Qi Zhao
With an overarching goal of developing intelligent systems to assist humans in various daily activities, we propose VisualHow, a free-form and open-ended research that focuses on understanding a real-life problem and deriving its solution by incorporating key components across multiple modalities.
1 code implementation • CVPR 2022 • Yifeng Zhang, Ming Jiang, Qi Zhao
Explainable visual question answering (VQA) models have been developed with neural modules and query-based knowledge incorporation to answer knowledge-requiring questions.
no code implementations • NeurIPS 2021 • Evan McCarty, Qi Zhao, Anastasios Sidiropoulos, Yusu Wang
This leads to a mixed algorithmic-ML framework, which we call NN-Baker that has the capacity to approximately solve a family of graph optimization problems (e. g, maximum independent set and minimum vertex cover) in time linear to input graph size, and only polynomial to approximation parameter.
no code implementations • 29 Nov 2021 • Qi Zhao, YuFei Wang, Shuchang Lyu, Lijiang Chen
In this paper, we propose attention-based feature decomposition-reconstruction network for scene text detection, which utilizes contextual information and low-level feature to enhance the performance of segmentation-based text detector.
1 code implementation • 27 Oct 2021 • Shixiang Wang, Xue-Song Liu, Jianfeng Li, Qi Zhao
Cox analysis is a common clinical data analysis technique to link valuable variables to clinical outcomes including dead and relapse.
no code implementations • 9 Oct 2021 • Qi Zhao, Xu Wang, Shuchang Lyu, Binghao Liu, Yifan Yang
To handle these two issues, we propose a feature consistency driven attention erasing network (FCAENet) for fine-grained image retrieval.
1 code implementation • NeurIPS 2021 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao
Secondly, due to the data complexity, it is challenging to differentiate the incorrect predictions from the correct ones on real-world large-scale datasets.
no code implementations • 19 Sep 2021 • Bai Yan, Qi Zhao, Jin Zhang, J. Andrew Zhang, Xin Yao
To investigate the number and distribution of local optima, we conduct a fitness landscape analysis on the sum rate maximization problems.
no code implementations • 3 Sep 2021 • Xin Chen, Qi Zhao, Xinyang Liu
And the result shows that the performance of NER will be improved if the word specificity is incorporated into existing NER methods.
2 code implementations • 26 Aug 2021 • Xingkui Zhu, Shuchang Lyu, Xu Wang, Qi Zhao
Object detection on drone-captured scenarios is a recent popular task.
1 code implementation • Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence 2021 • Xianyu Chen, Ming Jiang, Qi Zhao
Image captioning models depend on training with paired image-text corpora, which poses various challenges in describing images containing novel objects absent from the training data.
1 code implementation • 14 Aug 2021 • Qi Zhao, Binghao Liu, Shuchang Lyu, Huojin Chen
To deal with the above two issues, we propose self-distillation embedded supervised affinity attention model to improve the performance of few-shot segmentation task.
no code implementations • CVPR 2021 • Yifeng Zhang, Ming Jiang, Qi Zhao
Existing explainable and explicit visual reasoning methods only perform reasoning based on visual evidence but do not take into account knowledge beyond what is in the visual scene.
1 code implementation • CVPR 2021 • Xianyu Chen, Ming Jiang, Qi Zhao
Conditioned on a task guidance map, the proposed model learns question-specific attention patterns to generate scanpaths.
no code implementations • 14 Jun 2021 • Qi Zhao, Bai Yan, Yuhui Shi
In many clustering scenes, data samples' attribute values change over time.
no code implementations • 14 Jun 2021 • Bai Yan, Qi Zhao, Jin Zhang, J. Andrew Zhang, Xin Yao
To overcome the above shortcomings of relaxation, we propose a novel idea of simultaneously estimating the frequencies and model order by means of the atomic $l_0$ norm.
no code implementations • 14 Jun 2021 • Bai Yan, Qi Zhao, Jin Zhang, J. Andrew Zhang, Xin Yao
We formulate a multiobjective off-grid DOA estimation model to realize this idea, by which the source number can be automatically identified together with DOA estimation.
no code implementations • 1 Apr 2021 • Qi Zhao, Yujing Ma, Shuchang Lyu, Lijiang Chen
On this issue, we embed self-distillation (SD) method to transfer knowledge from ensemble network to main-branch in it.
no code implementations • 24 Mar 2021 • Anh Tuan Nguyen, Markus W. Drealan, Diu Khue Luu, Ming Jiang, Jian Xu, Jonathan Cheng, Qi Zhao, Edward W. Keefer, Zhi Yang
This enables the implementation of the neuroprosthetic hand as a portable and self-contained unit with real-time control of individual finger movements.
no code implementations • 14 Mar 2021 • Manoj Rohit Vemparala, Alexander Frickenstein, Nael Fasfous, Lukas Frickenstein, Qi Zhao, Sabine Kuhn, Daniel Ehrhardt, Yuankai Wu, Christian Unger, Naveen Shankar Nagaraja, Walter Stechele
The distilled models exhibit their strength against all white box attacks with an exception of C&W.
no code implementations • 1 Mar 2021 • Qi Zhao, Shuchang Lyu, Zhiwei Zhang, Ting-Bing Xu, Guangliang Cheng
In real applications, different computation-resource devices need different-depth networks (e. g., ResNet-18/34/50) with high-accuracy.
1 code implementation • IEEE Winter Conference on Applications of Computer Vision 2021 • Xianyu Chen, Ming Jiang, Qi Zhao
We propose an ensemble-based self-distillation method that allows image captioning models to be trained with unpaired images and captions.
no code implementations • 30 Dec 2020 • Yuewen Li, Wenquan Feng, Shuchang Lyu, Qi Zhao, Xuliang Li
In this paper, we present an effective object detection framework (MM-FSOD) that integrates metric learning and meta-learning to tackle the few-shot object detection task.
no code implementations • 29 Dec 2020 • Qi Zhao, Shuchang Lyu, Yuewen Li, Yujing Ma, Lijiang Chen
To avoid the interference from confusing information, we propose Multi-granularity Multi-Level Feature Ensemble Module (MGML-FEM) which can provide diverse predictions by full-channel feature generator (FC-FG).
no code implementations • 4 Dec 2020 • Xiao Yuan, Pei Zeng, Minbo Gao, Qi Zhao
Focusing on a general dynamical resource theory of quantum channels, here we consider tasks of one-shot resource distillation and dilution with a single copy of the resource.
Quantum Physics
no code implementations • 11 Oct 2020 • Qi Zhao
Moreover, this study shows that the LSTM model could extract universal features from trade-by-trade data, as the learned parameters well maintain their high performance on other cryptocurrency instruments that were not included in training data.
no code implementations • 1 Sep 2020 • Qi Zhao, Zheng Zhao, Xiaoya Fan, Zhengwei Yuan, Qian Mao, YuDong Yao
Recently, with the increasing availability of RNA structure data, new methods based on machine-learning technologies, especially deep learning, have alleviated the issue.
1 code implementation • ECCV 2020 • Shi Chen, Ming Jiang, Jinhui Yang, Qi Zhao
In this work, we propose an Attention with Reasoning capability (AiR) framework that uses attention to understand and improve the process leading to task outcomes.
no code implementations • 27 Jul 2020 • Yifeng Zhang, Ming Jiang, Qi Zhao
At the core of the method is a new Graph Semantic Saliency Network (GraSSNet) that constructs a graph that encodes semantic relationships learned from external knowledge.
1 code implementation • 23 Jul 2020 • Xianyu Chen, Ming Jiang, Qi Zhao
Few-shot object detection aims at detecting objects with few annotated examples, which remains a challenging research problem yet to be explored.
1 code implementation • 9 Jul 2020 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao
The proposed framework is gradient-based and model-agnostic.
1 code implementation • ACL 2020 • Bodhisattwa Majumder, Navneet Potti, Sandeep Tata, James B. Wendt, Qi Zhao, Marc Najork
We propose a novel approach using representation learning for tackling the problem of extracting structured information from form-like document images.
no code implementations • 23 May 2020 • Abbas Kazerouni, Qi Zhao, Jing Xie, Sandeep Tata, Marc Najork
Furthermore, there is usually only a small amount of initial training data available when building machine-learned models to solve such problems.
no code implementations • 9 Feb 2020 • Junnan Li, Ziwei Xu, Yongkang Wong, Qi Zhao, Mohan Kankanhalli
Therefore, it is important to develop algorithms that can leverage off-the-shelf labeled dataset to learn useful knowledge for the target task.
1 code implementation • 17 Dec 2019 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao
We propose a Direction Concentration Learning (DCL) method to improve congruency in the learning process, where enhancing congruency influences the convergence path to be less circuitous.
Ranked #8 on Image Classification on Tiny ImageNet Classification (using extra training data)
no code implementations • 15 Nov 2019 • Juanyong Duan, Sim Heng Ong, Qi Zhao
Unlike previous paradigms that directly ask annotators to distinguish between real and fake data in a straightforward way, we propose and annotate a set of carefully designed attributes that encode important image information at various levels, to understand the differences between fake and real images.
1 code implementation • 30 Apr 2019 • Meghan Thommes, Taiyao Wang, Qi Zhao, Ioannis C. Paschalidis, Daniel Segrè
Specifically, we searched for communities able to survive under constraints (such as a limited number of reactions) that would not be sustainable by individual species.
1 code implementation • NeurIPS 2019 • Qi Zhao, Yusu Wang
However often in practice, the choice of the weight function should depend on the nature of the specific type of data one considers, and it is thus highly desirable to learn a best weight function (and thus metric for persistence diagrams) from labelled data.
Ranked #1 on Graph Classification on NCI109
Graph Classification Computational Geometry
1 code implementation • 8 Apr 2019 • Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao
In addition, analysis of the intra-class compactness and inter-class separability demonstrates the advantages of the proposed function over the softmax function, which is consistent with the performance improvement.
no code implementations • ECCV 2018 • Shi Chen, Qi Zhao
Visual attention has shown usefulness in image captioning, with the goal of enabling a caption model to selectively focus on regions of interest.
no code implementations • 20 Dec 2018 • Xiao Yuan, Suguru Endo, Qi Zhao, Ying Li, Simon Benjamin
In this work, we introduce variational quantum simulation of mixed states under general stochastic evolution.
Quantum Physics
1 code implementation • CVPR 2019 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli
Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect.
Ranked #26 on Image Classification on Clothing1M (using extra training data)
no code implementations • 13 Dec 2018 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli
Social relationships form the basis of social structure of humans.
no code implementations • 4 Dec 2018 • Xiaosi Xu, Qi Zhao, Xiao Yuan, Simon C. Benjamin
We consider an approach to fault tolerant quantum computing based on a simple error detecting code operating as the substrate for a conventional surface code.
Quantum Physics
1 code implementation • NeurIPS 2018 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli
Different from previous works in video representation learning, our unsupervised learning task is to predict 3D motion in multiple target views using video representation from a source view.
no code implementations • 29 Aug 2018 • Bingjie Xu, Junnan Li, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao
The recent advances in instance-level detection tasks lay strong foundation for genuine comprehension of the visual scenes.
1 code implementation • 31 Jul 2018 • Mengmi Zhang, Keng Teck Ma, Shih-Cheng Yen, Joo Hwee Lim, Qi Zhao, Jiashi Feng
Egocentric spatial memory (ESM) defines a memory system with encoding, storing, recognizing and recalling the spatial information about the environment from an egocentric perspective.
no code implementations • 25 Jul 2018 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli
Video storytelling introduces new challenges, mainly due to the diversity of the story and the length and complexity of the video.
no code implementations • CVPR 2018 • Shaojing Fan, Zhiqi Shen, Ming Jiang, Bryan L. Koenig, Juan Xu, Mohan S. Kankanhalli, Qi Zhao
In this paper, we present the first study to focus on the relation between emotional properties of an image and visual attention.
no code implementations • 14 Feb 2018 • Anh Tuan Nguyen, Jian Xu, Diu Khue Luu, Qi Zhao, Zhi Yang
We envision that our theory would provide a framework for the future development of bio-inspired redundant artificial systems as well as assist the studies of the fundamental mechanisms governing various biological processes.
no code implementations • ICLR 2018 • Mengmi Zhang, Keng Teck Ma, Joo Hwee Lim, Shih-Cheng Yen, Qi Zhao, Jiashi Feng
During the exploration, our proposed ESM network model updates belief of the global map based on local observations using a recurrent neural network.
no code implementations • ICCV 2017 • Ming Jiang, Qi Zhao
This paper presents a novel method for quantitative and objective diagnoses of Autism Spectrum Disorder (ASD) using eye tracking and deep neural networks.
no code implementations • 3 Aug 2017 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli
However, due to the domain shift problem, the performance of Web images trained deep classifiers tend to degrade when directly deployed to videos.
1 code implementation • ICCV 2017 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli
Since the beginning of early civilizations, social relationships derived from each individual fundamentally form the basis of social structure in our daily life.
Ranked #4 on Visual Social Relationship Recognition on PIPA
1 code implementation • CVPR 2017 • Mengmi Zhang, Keng Teck Ma, Joo Hwee Lim, Qi Zhao, Jiashi Feng
Through competition with discriminator, the generator progressively improves quality of the future frames and thus anticipates future gaze better.
1 code implementation • 19 Jun 2017 • Nick Erickson, Qi Zhao
This paper introduces Dex, a reinforcement learning environment toolkit specialized for training and evaluation of continual learning methods as well as general reinforcement learning problems.
no code implementations • CVPR 2016 • Shaojing Fan, Tian-Tsong Ng, Bryan L. Koenig, Ming Jiang, Qi Zhao
(3) It can guide the design of a generalized computational algorithm for multi-dimensional visual perception.
no code implementations • ICCV 2015 • Xun Huang, Chengyao Shen, Xavier Boix, Qi Zhao
Saliency in Context (SALICON) is an ongoing effort that aims at understanding and predicting visual attention.
no code implementations • 19 Nov 2015 • Yan Luo, Xavier Boix, Gemma Roig, Tomaso Poggio, Qi Zhao
To see this, first, we report results in ImageNet that lead to a revision of the hypothesis that adversarial perturbations are a consequence of CNNs acting as a linear classifier: CNNs act locally linearly to changes in the image regions with objects recognized by the CNN, and in other regions the CNN may act non-linearly.
no code implementations • CVPR 2015 • Ming Jiang, Shengsheng Huang, Juanyong Duan, Qi Zhao
Saliency in Context (SALICON) is an ongoing effort that aims at understanding and predicting visual attention.
no code implementations • CVPR 2015 • Yan Luo, Yongkang Wong, Qi Zhao
In addition, since new datasets are built and shared in the community from time to time, it would be good not to retrain the entire model when new data are added.
no code implementations • 23 Dec 2014 • Chengyao Shen, Xun Huang, Qi Zhao
Visualizations also show that these features are selective to potential objects in the scene and the responses of these features work well in predicting eye fixations on the images when combined with learned weights.
no code implementations • NeurIPS 2009 • Zhi Yang, Qi Zhao, Edward Keefer, Wentai Liu
Multiple noise sources have been studied through analytical models as well as empirical measurements.