Search Results for author: Jing Bi

Found 11 papers, 4 papers with code

AVicuna: Audio-Visual LLM with Interleaver and Context-Boundary Alignment for Temporal Referential Dialogue

no code implementations 24 Mar 2024 Yunlong Tang, Daiki Shimada, Jing Bi, Chenliang Xu

In everyday communication, humans frequently use speech and gestures to refer to specific areas or objects, a process known as Referential Dialogue (RD).

Video Understanding

OSCaR: Object State Captioning and State Change Representation

1 code implementation 27 Feb 2024 Nguyen Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu

To address these challenges, in this paper, we introduce the Object State Captioning and State Change Representation (OSCaR) dataset and benchmark.

Change Detection, Object

Video Understanding with Large Language Models: A Survey

1 code implementation 29 Dec 2023 Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, JianGuo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu

With the burgeoning growth of online video platforms and the escalating volume of video content, the demand for proficient video understanding tools has intensified markedly.

Video Understanding

MISAR: A Multimodal Instructional System with Augmented Reality

1 code implementation 18 Oct 2023 Jing Bi, Nguyen Manh Nguyen, Ali Vosoughi, Chenliang Xu

Augmented reality (AR) requires the seamless integration of visual, auditory, and linguistic channels for optimized human-computer interaction.

Multi-omics Prediction from High-content Cellular Imaging with Deep Learning

1 code implementation 15 Jun 2023 Rahil Mehrizi, Arash Mehrjou, Maryana Alegro, Yi Zhao, Benedetta Carbone, Carl Fishwick, Johanna Vappiani, Jing Bi, Siobhan Sanford, Hakan Keles, Marcus Bantscheff, Cuong Nguyen, Patrick Schwab

High-content cellular imaging, transcriptomics, and proteomics data provide rich and complementary views on the molecular layers of biology that influence cellular states and function.

Performances of Symmetric Loss for Private Data from Exponential Mechanism

no code implementations 9 Oct 2022 Jing Bi, Vorapong Suppakitpaisarn

This study explores the robustness of learning by symmetric loss on private data.

Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning

no code implementations ICCV 2021 Jing Bi, Jiebo Luo, Chenliang Xu

In this work, we leverage instructional videos to study humans' decision-making processes, focusing on learning a model to plan goal-directed actions in real-life videos.

Action Recognition, Bayesian Inference

rQdia: Regularizing Q-Value Distributions With Image Augmentation

no code implementations 29 Sep 2021 Samuel Lerman, Jing Bi, Chenliang Xu

rQdia (pronounced “Arcadia”) regularizes Q-value distributions with augmented images in pixel-based deep reinforcement learning.

Continuous Control, Image Augmentation

Cubic Spline Smoothing Compensation for Irregularly Sampled Sequences

no code implementations 3 Oct 2020 Jing Shi, Jing Bi, Yingru Liu, Chenliang Xu

The marriage of recurrent neural networks and neural ordinary differential equations (ODE-RNN) is effective in modeling irregularly observed sequences.

Learning from Interventions using Hierarchical Policies for Safe Learning

no code implementations 4 Dec 2019 Jing Bi, Vikas Dhiman, Tianyou Xiao, Chenliang Xu

The recently proposed Learning from Interventions (LfI) overcomes this limitation by using an expert overseer.

Navigation by Imitation in a Pedestrian-Rich Environment

no code implementations 1 Nov 2018 Jing Bi, Tianyou Xiao, Qiuyue Sun, Chenliang Xu

Deep neural networks trained on demonstrations of human actions give robots the ability to perform self-driving on the road.

Imitation Learning, Navigate
