Search Results for author: Tiancheng Zhao

Found 36 papers, 20 papers with code

Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head

1 code implementation 11 Mar 2024 Tiancheng Zhao, Peng Liu, Xuan He, Lu Zhang, Kyusong Lee

End-to-end transformer-based detectors (DETRs) have shown exceptional performance in both closed-set and open-vocabulary object detection (OVD) tasks through the integration of language modalities.

Object object-detection +2

GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection

1 code implementation 22 Dec 2023 Haozhan Shen, Tiancheng Zhao, Mingwei Zhu, Jianwei Yin

Visual grounding, a crucial vision-language task involving the understanding of the visual context based on the query expression, necessitates the model to capture the interactions between objects, as well as various spatial and attribute information.

Attribute object-detection +2

Benchmarking Sequential Visual Input Reasoning and Prediction in Multimodal Large Language Models

1 code implementation 20 Oct 2023 Mingwei Zhu, Leigang Sha, Yu Shu, Kangjia Zhao, Tiancheng Zhao, Jianwei Yin

Multimodal large language models (MLLMs) have shown great potential in perception and interpretation tasks, but their capabilities in predictive reasoning remain under-explored.

Activity Prediction Benchmarking +2

How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection

2 code implementations 25 Aug 2023 Yiyang Yao, Peng Liu, Tiancheng Zhao, Qianqian Zhang, Jiajia Liao, Chunxin Fang, Kyusong Lee, Qing Wang

Extensive experimental results show that existing top OVD models all fail on the new tasks except for simple object types, demonstrating the value of the proposed dataset in pinpointing the weakness of current OVD models and guiding future research.

Object Detection

RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing

1 code implementation 20 Jun 2023 Zilun Zhang, Tiancheng Zhao, Yulong Guo, Jianwei Yin

Moreover, we present an image-text paired dataset in the field of remote sensing (RS), RS5M, which has 5 million RS images with English descriptions.

Ranked #1 on Cross-Modal Retrieval on RSITMD (using extra training data)

Cross-Modal Retrieval Image Retrieval +5

OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network

1 code implementation 10 Sep 2022 Tiancheng Zhao, Peng Liu, Kyusong Lee

The advancement of object detection (OD) in open-vocabulary and open-world scenarios is a critical challenge in computer vision.

Continual Learning Object +2

Data Augmentation is a Hyperparameter: Cherry-picked Self-Supervision for Unsupervised Anomaly Detection is Creating the Illusion of Success

1 code implementation 16 Aug 2022 Jaemin Yoo, Tiancheng Zhao, Leman Akoglu

Self-supervised learning (SSL) has emerged as a promising alternative to create supervisory signals to real-world problems, avoiding the extensive cost of manual labeling.

Data Augmentation Self-Supervised Anomaly Detection +2

VL-CheckList: Evaluating Pre-trained Vision-Language Models with Objects, Attributes and Relations

1 code implementation 1 Jul 2022 Tiancheng Zhao, Tianqi Zhang, Mingwei Zhu, Haozhan Shen, Kyusong Lee, Xiaopeng Lu, Jianwei Yin

Inspired by the CheckList for testing natural language processing, we exploit VL-CheckList, a novel framework to understand the capabilities of VLP models.

SF-QA: Simple and Fair Evaluation Library for Open-domain Question Answering

1 code implementation EACL 2021 Xiaopeng Lu, Kyusong Lee, Tiancheng Zhao

Although open-domain question answering (QA) draws great attention in recent years, it requires large amounts of resources for building the full system and is often difficult to reproduce previous results due to complex configurations.

Open-Domain Question Answering

VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words

1 code implementation ACL 2021 Xiaopeng Lu, Tiancheng Zhao, Kyusong Lee

To the best of our knowledge, VisualSparta is the first transformer-based text-to-image retrieval model that can achieve real-time searching for large-scale datasets, with significant accuracy improvement compared to previous state-of-the-art methods.

Cross-Modal Retrieval Image Retrieval +2

"None of the Above": Measure Uncertainty in Dialog Response Retrieval

no code implementations ACL 2020 Yulan Feng, Shikib Mehri, Maxine Eskenazi, Tiancheng Zhao

This paper discusses the importance of uncovering uncertainty in end-to-end dialog tasks and presents our experimental results on uncertainty classification on the processed Ubuntu Dialog Corpus.

General Classification Retrieval

Report from the NSF Future Directions Workshop, Toward User-Oriented Agents: Research Directions and Challenges

no code implementations 10 Jun 2020 Maxine Eskenazi, Tiancheng Zhao

This USER Workshop was convened with the goal of defining future research directions for the burgeoning intelligent agent research community and to communicate them to the National Science Foundation.

"None of the Above":Measure Uncertainty in Dialog Response Retrieval

no code implementations 4 Apr 2020 Yulan Feng, Shikib Mehri, Maxine Eskenazi, Tiancheng Zhao

This paper discusses the importance of uncovering uncertainty in end-to-end dialog tasks, and presents our experimental results on uncertainty classification on the Ubuntu Dialog Corpus.

General Classification Object Detection +1

Investigating Evaluation of Open-Domain Dialogue Systems With Human Generated Multiple References

2 code implementations WS 2019 Prakhar Gupta, Shikib Mehri, Tiancheng Zhao, Amy Pavel, Maxine Eskenazi, Jeffrey P. Bigham

The aim of this paper is to mitigate the shortcomings of automatic evaluation of open-domain dialog systems through multi-reference evaluation.

Dialogue Evaluation

Unsupervised Dialog Structure Learning

1 code implementation NAACL 2019 Weiyan Shi, Tiancheng Zhao, Zhou Yu

The learned dialog structure can shed light on how to analyze human dialogs, and more importantly contribute to the design and evaluation of dialog systems.

Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models

3 code implementations NAACL 2019 Tiancheng Zhao, Kaige Xie, Maxine Eskenazi

Defining action spaces for conversational agents and optimizing their decision-making process with reinforcement learning is an enduring challenge.

Decision Making Dialogue Generation +4

Dirichlet Variational Autoencoder for Text Modeling

no code implementations 31 Oct 2018 Yijun Xiao, Tiancheng Zhao, William Yang Wang

We introduce an improved variational autoencoder (VAE) for text modeling with topic information explicitly modeled as a Dirichlet latent variable.

DialCrowd: A toolkit for easy dialog system assessment

no code implementations WS 2018 Kyusong Lee, Tiancheng Zhao, Alan W. Black, Maxine Eskenazi

When creating a dialog system, developers need to test each version to ensure that it is performing correctly.


Zero-Shot Dialog Generation with Cross-Domain Latent Actions

2 code implementations WS 2018 Tiancheng Zhao, Maxine Eskenazi

This paper introduces zero-shot dialog generation (ZSDG), as a step towards neural dialog systems that can instantly generalize to new situations with minimal data.

Dialogue Generation Goal-Oriented Dialog

Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog

no code implementations WS 2018 Jiaping Zhang, Tiancheng Zhao, Zhou Yu

We propose a multimodal hierarchical reinforcement learning framework that dynamically integrates vision and language for task-oriented visual dialog.

Hierarchical Reinforcement Learning reinforcement-learning +3

Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders

1 code implementation ACL 2017 Tiancheng Zhao, Ran Zhao, Maxine Eskenazi

While recent neural encoder-decoder models have shown great promise in modeling open-domain conversations, they often generate dull and generic responses.

Decision Making Decoder +1

DialPort: Connecting the Spoken Dialog Research Community to Real User Data

no code implementations 8 Jun 2016 Tiancheng Zhao, Kyusong Lee, Maxine Eskenazi

This paper describes a new spoken dialog portal that connects systems produced by the spoken dialog academic research community and gives them access to real users.

Algorithms for Batch Hierarchical Reinforcement Learning

no code implementations 29 Mar 2016 Tiancheng Zhao, Mohammad Gowayyed

We show that it is possible to effectively learn recursive optimal policies for any valid hierarchical decomposition of the original MDP, given a fixed dataset collected from a flat stochastic behavioral policy.

Hierarchical Reinforcement Learning reinforcement-learning +2
