Search Results for author: Xiaofeng Gao

Found 32 papers, 11 papers with code

GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

no code implementations26 Feb 2024 Yichi Zhang, Ziqiao Ma, Xiaofeng Gao, Suhaila Shakiah, Qiaozi Gao, Joyce Chai

Most multimodal large language models (MLLMs) learn language-to-object grounding through causal language modeling where grounded objects are captured by bounding boxes as sequences of location tokens.

 Ranked #1 on Generalized Referring Expression Segmentation on gRefCOCO (using extra training data)

Causal Language Modeling Generalized Referring Expression Segmentation +2

Mixture of Link Predictors

no code implementations13 Feb 2024 Li Ma, Haoyu Han, Juanhui Li, Harry Shomer, Hui Liu, Xiaofeng Gao, Jiliang Tang

Link prediction, which aims to forecast unseen connections in graphs, is a fundamental task in graph machine learning.

Link Prediction

Pareto-based Multi-Objective Recommender System with Forgetting Curve

no code implementations28 Dec 2023 Jipeng Jin, Zhaoxiang Zhang, Zhiheng Li, Xiaofeng Gao, Xiongwen Yang, Lei Xiao, Jie Jiang

Considering recency effect in memories, we propose a forgetting model based on Ebbinghaus Forgetting Curve to cope with negative feedback.

Recommendation Systems

Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty

1 code implementation2 Dec 2023 Cheng-Fu Yang, Haoyang Xu, Te-Lin Wu, Xiaofeng Gao, Kai-Wei Chang, Feng Gao

In this paper, we aim to tackle this problem with a unified framework consisting of an end-to-end trainable method and a planning algorithm.

Denoising Vision-Language Navigation

DSCom: A Data-Driven Self-Adaptive Community-Based Framework for Influence Maximization in Social Networks

no code implementations18 Nov 2023 Yuxin Zuo, Haojia Sun, Yongyi Hu, Jianxiong Guo, Xiaofeng Gao

Several previous works have addressed this topic in a statistical way and provided efficient algorithms with theoretical guarantee.

Temporal Interest Network for Click-Through Rate Prediction

1 code implementation15 Aug 2023 Haolin Zhou, Junwei Pan, Xinyi Zhou, Xihua Chen, Jie Jiang, Xiaofeng Gao, Guihai Chen

Our proposed TIN outperforms the best-performing baselines by 0. 43\% and 0. 29\% on two datasets, respectively.

Click-Through Rate Prediction

LEMMA: Learning Language-Conditioned Multi-Robot Manipulation

no code implementations2 Aug 2023 Ran Gong, Xiaofeng Gao, Qiaozi Gao, Suhaila Shakiah, Govind Thattai, Gaurav S. Sukhatme

We introduce a benchmark for LanguagE-Conditioned Multi-robot MAnipulation (LEMMA) focused on task allocation and long-horizon object manipulation based on human language instructions in a tabletop setting.

LEMMA Robot Manipulation

Alioth: A Machine Learning Based Interference-Aware Performance Monitor for Multi-Tenancy Applications in Public Cloud

1 code implementation18 Jul 2023 Tianyao Shi, Yingxuan Yang, Yunlong Cheng, Xiaofeng Gao, Zhen Fang, Yongqiang Yang

Multi-tenancy in public clouds may lead to co-location interference on shared resources, which possibly results in performance degradation of cloud applications.

Decision Making Denoising +3

AutoAttention: Automatic Field Pair Selection for Attention in User Behavior Modeling

no code implementations27 Oct 2022 Zuowu Zheng, Xiaofeng Gao, Junwei Pan, Qi Luo, Guihai Chen, Dapeng Liu, Jie Jiang

In this paper, we propose a novel model named AutoAttention, which includes all item/user/context side fields as the query, and assigns a learnable weight for each field pair between behavior fields and query fields.

Click-Through Rate Prediction

VRKitchen2.0-IndoorKit: A Tutorial for Augmented Indoor Scene Building in Omniverse

no code implementations23 Jun 2022 Yizhou Zhao, Steven Gong, Xiaofeng Gao, Wensi Ai, Song-Chun Zhu

With the recent progress of simulations by 3D modeling software and game engines, many researchers have focused on Embodied AI tasks in the virtual environment.

Benchmarking Indoor Scene Synthesis

Effects of Augmented-Reality-Based Assisting Interfaces on Drivers' Object-wise Situational Awareness in Highly Autonomous Vehicles

no code implementations6 Jun 2022 Xiaofeng Gao, Xingwei Wu, Samson Ho, Teruhisa Misu, Kumar Akash

To understand the effect of highlighting on drivers' SA for objects with different types and locations under various traffic densities, we conducted an in-person experiment with 20 participants on a driving simulator.

Autonomous Driving Object

HIEN: Hierarchical Intention Embedding Network for Click-Through Rate Prediction

no code implementations1 Jun 2022 Zuowu Zheng, Changwang Zhang, Xiaofeng Gao, Guihai Chen

Based on this observation, in this paper, we propose a novel approach Hierarchical Intention Embedding Network (HIEN), which considers dependencies of attributes based on bottom-up tree aggregation in the constructed attribute graph.

Attribute Click-Through Rate Prediction +1

Trading Hard Negatives and True Negatives: A Debiased Contrastive Collaborative Filtering Approach

no code implementations25 Apr 2022 Chenxiao Yang, Qitian Wu, Jipeng Jin, Xiaofeng Gao, Junwei Pan, Guihai Chen

To circumvent false negatives, we develop a principled approach to improve the reliability of negative instances and prove that the objective is an unbiased estimation of sampling from the true negative distribution.

Collaborative Filtering

DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following

2 code implementations27 Feb 2022 Xiaofeng Gao, Qiaozi Gao, Ran Gong, Kaixiang Lin, Govind Thattai, Gaurav S. Sukhatme

Language-guided Embodied AI benchmarks requiring an agent to navigate an environment and manipulate objects typically allow one-way communication: the human user gives a natural language command to the agent, and the agent can only follow the command passively.

Instruction Following Navigate

Cross-Task Knowledge Distillation in Multi-Task Recommendation

no code implementations20 Feb 2022 Chenxiao Yang, Junwei Pan, Xiaofeng Gao, Tingyu Jiang, Dapeng Liu, Guihai Chen

Multi-task learning (MTL) has been widely used in recommender systems, wherein predicting each type of user feedback on items (e. g, click, purchase) are treated as individual tasks and jointly trained with a unified model.

Knowledge Distillation Multi-Task Learning +1

Show Me What You Can Do: Capability Calibration on Reachable Workspace for Human-Robot Collaboration

no code implementations6 Mar 2021 Xiaofeng Gao, Luyao Yuan, Tianmin Shu, Hongjing Lu, Song-Chun Zhu

Our experiments with human participants demonstrate that a short calibration using REMP can effectively bridge the gap between what a non-expert user thinks a robot can reach and the ground truth.

Motion Planning

Inductive Collaborative Filtering via Relation Graph Learning

no code implementations1 Jan 2021 Qitian Wu, Hengrui Zhang, Xiaofeng Gao, Hongyuan Zha

In this paper, we propose an inductive collaborative filtering framework that learns a hidden relational graph among users from the rating matrix.

Collaborative Filtering Graph Learning +2

Joint Mind Modeling for Explanation Generation in Complex Human-Robot Collaborative Tasks

no code implementations24 Jul 2020 Xiaofeng Gao, Ran Gong, Yizhou Zhao, Shu Wang, Tianmin Shu, Song-Chun Zhu

Thus, in this paper, we propose a novel explainable AI (XAI) framework for achieving human-like communication in human-robot collaborations, where the robot builds a hierarchical mind model of the human user and generates explanations of its own mind as a form of communications based on its online Bayesian inference of the user's mental state.

Bayesian Inference Explainable Artificial Intelligence (XAI) +1

Towards Open-World Recommendation: An Inductive Model-based Collaborative Filtering Approach

1 code implementation9 Jul 2020 Qitian Wu, Hengrui Zhang, Xiaofeng Gao, Junchi Yan, Hongyuan Zha

The first model follows conventional matrix factorization which factorizes a group of key users' rating matrix to obtain meta latents.

Collaborative Filtering Matrix Completion +2

MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding

no code implementations18 Feb 2020 Haolin Zhou, Chaoqi Yang, Xiaofeng Gao, Qiong Chen, Gongshen Liu, Guihai Chen

Online Real-Time Bidding (RTB) is a complex auction game among which advertisers struggle to bid for ad impressions when a user request occurs.

Reinforcement Learning (RL)

A Hierarchical Optimizer for Recommendation System Based on Shortest Path Algorithm

no code implementations7 Nov 2019 Jiacheng Dai, Zhifeng Jia, Xiaofeng Gao, Guihai Chen

Top-k Nearest Geosocial Keyword (T-kNGK) query on geosocial network is defined to give users k recommendations based on some keywords and designated spatial range, and can be realized by shortest path algorithms.

NETR-Tree: An Eifficient Framework for Social-Based Time-Aware Spatial Keyword Query

no code implementations26 Aug 2019 Xiuqi Huang, Yuanning Gao, Xiaofeng Gao, Guihai Chen

In the user layer, we exploit the network embedding strategy to measure the relationship effect in users' relationship network.

Network Embedding

VRKitchen: an Interactive 3D Virtual Environment for Task-oriented Learning

1 code implementation13 Mar 2019 Xiaofeng Gao, Ran Gong, Tianmin Shu, Xu Xie, Shu Wang, Song-Chun Zhu

One of the main challenges of advancing task-oriented learning such as visual task planning and reinforcement learning is the lack of realistic and standardized environments for training and testing AI agents.

reinforcement-learning Reinforcement Learning (RL)

Accelerate RNN-based Training with Importance Sampling

no code implementations31 Oct 2017 Fei Wang, Xiaofeng Gao, Guihai Chen, Jun Ye

Unfortunately, the calculation of the sampling probability distribution $P$ causes a major limitation of IS: it requires the input data to be well-structured, i. e., the feature vector is properly defined.

Stochastic Optimization

Learning Social Affordance Grammar from Videos: Transferring Human Interactions to Human-Robot Interactions

no code implementations1 Mar 2017 Tianmin Shu, Xiaofeng Gao, Michael S. Ryoo, Song-Chun Zhu

In this paper, we present a general framework for learning social affordance grammar as a spatiotemporal AND-OR graph (ST-AOG) from RGB-D videos of human interactions, and transfer the grammar to humanoids to enable a real-time motion inference for human-robot interaction (HRI).


Cannot find the paper you are looking for? You can Submit a new open access paper.