no code implementations • WMT (EMNLP) 2021 • Hangcheng Guo, wenbin liu, yanqing he, Tian Lan, Hongjiao Xu, zhenfeng wu, you pan
This paper describes the ISTIC’s submission to the Triangular Machine Translation Task of Russian-to-Chinese machine translation for WMT’ 2021.
1 code implementation • 5 Sep 2024 • JianGuo Zhang, Tian Lan, Ming Zhu, Zuxin Liu, Thai Hoang, Shirley Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu, Yihao Feng, Tulika Awalgaonkar, Rithesh Murthy, Eric Hu, Zeyuan Chen, ran Xu, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong
By releasing the xLAM series, we aim to advance the performance of open-source LLMs for autonomous AI agents, potentially accelerating progress and democratizing access to high-performance models for agent tasks.
no code implementations • 18 Aug 2024 • Omar Ghazal, Tian Lan, Shalman Ojukwu, Komal Krishnamurthy, Alex Yakovlev, Rishad Shafik
The modern implementation of machine learning architectures faces significant challenges due to frequent data transfer between memory and processing units.
no code implementations • 13 Aug 2024 • Kexun Zhang, Weiran Yao, Zuxin Liu, Yihao Feng, Zhiwei Liu, Rithesh Murthy, Tian Lan, Lei LI, Renze Lou, Jiacheng Xu, Bo Pang, Yingbo Zhou, Shelby Heinecke, Silvio Savarese, Huan Wang, Caiming Xiong
For instance, a group of open-source SWE agents, with a maximum individual resolve rate of 27. 3% on SWE-Bench Lite, can achieve a 34. 3% resolve rate with DEI, making a 25% improvement and beating most closed-source solutions.
no code implementations • 1 Aug 2024 • Tian Lan, Huan Wang, Caiming Xiong, Silvio Savarese
We introduce WarpSci, a domain agnostic framework designed to overcome crucial system bottlenecks encountered in the application of reinforcement learning to intricate environments with vast datasets featuring high-dimensional observation or action spaces.
no code implementations • 16 Jul 2024 • Xiaochuan Gou, Ziyue Li, Tian Lan, Junpeng Lin, Zhishuai Li, Bingyu Zhao, Chen Zhang, Di Wang, Xiangliang Zhang
Our data can revolutionalize traditional traffic-related tasks towards higher interpretability and practice: instead of traditional prediction or classification tasks, we conduct: (1) post-incident traffic forecasting to quantify the impact of different incidents on traffic indexes; (2) incident classification using traffic indexes to determine the incidents types for precautions measures; (3) global causal analysis among the traffic indexes, meta-attributes, and incidents to give high-level guidance of the interrelations of various factors; (4) local causal analysis within road nodes to examine how different incidents affect the road segments' relations.
no code implementations • 26 Jun 2024 • Zuxin Liu, Thai Hoang, JianGuo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu, Yihao Feng, Rithesh Murthy, Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong
The advancement of function-calling agent models requires diverse, reliable, and high-quality datasets.
no code implementations • 30 May 2024 • Zeyu Fang, Tian Lan
It iteratively leverages a guided diffusion world model to directly evaluate the offline target policy with actions drawn from it, and then performs an importance-sampled world model update to adaptively align the world model with the updated policy.
no code implementations • 26 May 2024 • Jiayu Chen, Bhargav Ganguly, Tian Lan, Vaneet Aggarwal
Skills are effective temporal abstractions established for sequential decision making tasks, which enable efficient hierarchical learning for long-horizon tasks and facilitate multi-task learning through their transferability.
no code implementations • 22 May 2024 • Tian Lan, Qinwei Lin, Haoqian Wang
Further, an additional language-extended loop closure module which is based on CLIP feature is designed to continually perform global optimization to correct drift errors accumulated as the system runs.
1 code implementation • 7 May 2024 • Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani
Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets.
no code implementations • 22 Mar 2024 • Zuyuan Zhang, Hanhan Zhou, Mahdi Imani, Taeyoung Lee, Tian Lan
With the advancements of artificial intelligence (AI), we're seeing more scenarios that require AI to work closely with other agents, whose goals and strategies might not be known beforehand.
no code implementations • 4 Mar 2024 • Chen Xu, Tian Lan, Changlong Yu, Wei Wang, Jun Gao, Yu Ji, Qunxi Dong, Kun Qian, Piji Li, Wei Bi, Bin Hu
Constrained decoding approaches aim to control the meaning or style of text generated by a Pre-trained Language Model (PLM) using specific target words during inference.
1 code implementation • 23 Feb 2024 • Zhiwei Liu, Weiran Yao, JianGuo Zhang, Liangwei Yang, Zuxin Liu, Juntao Tan, Prafulla K. Choubey, Tian Lan, Jason Wu, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese
Thus, we open-source a new AI agent library, AgentLite, which simplifies this process by offering a lightweight, user-friendly platform for innovating LLM agent reasoning, architectures, and applications with ease.
2 code implementations • 23 Feb 2024 • JianGuo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Tulika Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong
It meticulously standardizes and unifies these trajectories into a consistent format, streamlining the creation of a generic data loader optimized for agent training.
1 code implementation • 21 Feb 2024 • Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, Vaneet Aggarwal
In this paper, we provide the first systematic review on the applications of deep generative models for offline policy learning.
1 code implementation • 21 Feb 2024 • Tian Lan, Wenwei Zhang, Chen Xu, Heyan Huang, Dahua Lin, Kai Chen, Xian-Ling Mao
To ensure the reliability, a large number of critiques are annotated to serve as references, enabling GPT-4 to evaluate textual critiques reliably.
no code implementations • 25 Jan 2024 • Yongsheng Mei, Mahdi Imani, Tian Lan
Bayesian optimization (BO) has established itself as a leading strategy for efficiently optimizing expensive-to-evaluate functions.
no code implementations • 12 Dec 2023 • Jingdi Chen, Hanhan Zhou, Yongsheng Mei, Gina Adam, Nathaniel D. Bastian, Tian Lan
Many cybersecurity problems that require real-time decision-making based on temporal observations can be abstracted as a sequence modeling problem, e. g., network intrusion detection from a sequence of arriving packets.
no code implementations • 27 Nov 2023 • Jingdi Chen, Lei Zhang, Joseph Riem, Gina Adam, Nathaniel D. Bastian, Tian Lan
Deep Learning (DL) based methods have shown great promise in network intrusion detection by identifying malicious network traffic behavior patterns with high accuracy, but their applications to real-time, packet-level detections in high-speed communication networks are challenging due to the high computation time and resource requirements of Deep Neural Networks (DNNs), as well as lack of explainability.
no code implementations • 9 Sep 2023 • Muzhe Guo, Feixu Yu, Tian Lan, Fang Jin
Reinforcement learning (RL) is a powerful tool for solving complex decision-making problems, but its lack of transparency and interpretability has been a major challenge in domains where decisions have significant real-world consequences.
no code implementations • 28 Aug 2023 • Hanhan Zhou, Tian Lan, Vaneet Aggarwal
Offline reinforcement learning aims to utilize datasets of previously gathered environment-action interaction records to learn a policy without access to the real environment.
1 code implementation • 7 Aug 2023 • Jingdi Chen, Tian Lan, Carlee Joe-Wong
This result enables us to recast multi-agent communication into a novel online clustering problem over the local observations at each agent, with messages as cluster labels and the upper bound on the return gap as clustering loss.
no code implementations • 1 Aug 2023 • Zihao Zhao, Yuzhu Mao, Zhenpeng Shi, Yang Liu, Tian Lan, Wenbo Ding, Xiao-Ping Zhang
In response, this paper introduces AQUILA (adaptive quantization in device selection strategy), a novel adaptive framework devised to effectively handle these issues, enhancing the efficiency and robustness of FL.
no code implementations • 21 Jul 2023 • Jiayu Chen, Jingdi Chen, Tian Lan, Vaneet Aggarwal
Our key idea is to approximate the joint state space as a Kronecker graph, based on which we can directly estimate its Fiedler vector using the Laplacian spectrum of individual agents' transition graphs.
1 code implementation • 13 Jul 2023 • Tian Lan, Deng Cai, Yan Wang, Heyan Huang, Xian-Ling Mao
The dominant text generation models compose the output by sequentially selecting words from a fixed vocabulary.
1 code implementation • 5 Jun 2023 • Tian Lan, Ziyue Li, Zhishuai Li, Lei Bai, Man Li, Fugee Tsung, Wolfgang Ketter, Rui Zhao, Chen Zhang
This encourages the multi-task design: with each DAG as a task, the MM-DAG tries to learn the multiple DAGs jointly so that their consensus and consistency are maximized.
no code implementations • 1 Jun 2023 • Jiachen Li, Xinwei Shi, Feiyu Chen, Jonathan Stroud, Zhishuai Zhang, Tian Lan, Junhua Mao, Jeonhyung Kang, Khaled S. Refaat, Weilong Yang, Eugene Ie, CongCong Li
Accurate understanding and prediction of human behaviors are critical prerequisites for autonomous vehicles, especially in highly dynamic and interactive scenarios such as intersections in dense urban areas.
no code implementations • 30 May 2023 • Chenyi Liu, Vaneet Aggarwal, Tian Lan, Nan Geng, Yuan Yang, Mingwei Xu, Qing Li
By providing a neural network function approximation of this common kernel using graph attention networks, we develop a unified learning-based framework, FERN, for scalable Failure Evaluation and Robust Network design.
1 code implementation • 27 May 2023 • Jiayu Chen, Tian Lan, Vaneet Aggarwal
Imperfect Information Games (IIGs) offer robust models for scenarios where decision-makers face uncertainty or lack complete information.
1 code implementation • 25 May 2023 • Yixuan Su, Tian Lan, Huayang Li, Jialu Xu, Yan Wang, Deng Cai
To do so, PandaGPT combines the multimodal encoders from ImageBind and the large language models from Vicuna.
1 code implementation • 22 May 2023 • Jiayu Chen, Dipesh Tamboli, Tian Lan, Vaneet Aggarwal
Multi-task Imitation Learning (MIL) aims to train a policy capable of performing a distribution of tasks based on multi-task expert demonstrations, which is essential for general-purpose robots.
1 code implementation • 21 Feb 2023 • Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei
To this end, we propose MAC-PO, which formulates optimal prioritized experience replay for multi-agent problems as a regret minimization over the sampling weights of transitions.
no code implementations • 11 Feb 2023 • Yongsheng Mei, Hanhan Zhou, Tian Lan
Such an optimization problem can be relaxed and solved using the Lagrangian multiplier method to obtain the close-form optimal projection weights.
1 code implementation • 6 Feb 2023 • Yongsheng Mei, Guru Venkataramani, Tian Lan
Our experimental results on the Multi-modal Brain Tumor Segmentation Challenge (BraTS) datasets outperform those of state-of-the-art segmentation baselines, with validation Dice similarity coefficients of 0. 920, 0. 897, 0. 837 for the whole tumor, tumor core, and enhancing tumor on BraTS-2020.
no code implementations • 6 Jan 2023 • Weicheng Gao, Xiaopeng Yang, Xiaodong Qu, Tian Lan
The data preprocessing module achieves wall clutter, human motion features, and noise subspaces separation.
1 code implementation • 5 Dec 2022 • Tian Lan, Yixuan Su, Shuhang Liu, Heyan Huang, Xian-Ling Mao
In this study, we formulate open-ended text generation from a new perspective, i. e., we view it as an exploration process within a directed graph.
1 code implementation • 13 Oct 2022 • Yongsheng Mei, Tian Lan, Mahdi Imani, Suresh Subramaniam
This joint distribution is used in the body of the BO acquisition functions to search for local optima during the optimization process.
no code implementations • 7 Oct 2022 • Jiayu Chen, Marina Haliem, Tian Lan, Vaneet Aggarwal
In this case, we propose Multi-agent Deep Covering Option Discovery, which constructs the multi-agent options through minimizing the expected cover time of the multiple agents' joint state space.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 5 Oct 2022 • Jiayu Chen, Tian Lan, Vaneet Aggarwal
In this work, we develop a novel HIL algorithm based on Adversarial Inverse Reinforcement Learning and adapt it with the Expectation-Maximization algorithm in order to directly recover a hierarchical policy from the unannotated demonstrations.
1 code implementation • 22 Jun 2022 • Hanhan Zhou, Tian Lan, Vaneet Aggarwal
Multi-agent reinforcement learning (MARL) has witnessed significant progress with the development of value function factorization methods.
1 code implementation • 5 May 2022 • Yixuan Su, Tian Lan, Yahui Liu, Fangyu Liu, Dani Yogatama, Yan Wang, Lingpeng Kong, Nigel Collier
MAGIC is a flexible framework and is theoretically compatible with any text generation tasks that incorporate image grounding.
1 code implementation • ACL 2022 • Heqi Zheng, Xiao Zhang, Zewen Chi, Heyan Huang, Tan Yan, Tian Lan, Wei Wei, Xian-Ling Mao
In this paper, we propose XPR, a cross-lingual phrase retriever that extracts phrase representations from unlabeled example sentences.
no code implementations • 5 Apr 2022 • Yuzhu Mao, Zihao Zhao, Meilin Yang, Le Liang, Yang Liu, Wenbo Ding, Tian Lan, Xiao-Ping Zhang
It is demonstrated that SAFARI under unreliable communications is guaranteed to converge at the same rate as the standard FedAvg with perfect communications.
no code implementations • CVPR 2022 • Jialian Wu, Sudhir Yarram, Hui Liang, Tian Lan, Junsong Yuan, Jayan Eledath, Gerard Medioni
In addition, VisTR is not fully end-to-end learnable in multiple video clips as it requires a hand-crafted data association to link instance tracklets between successive clips.
2 code implementations • 13 Feb 2022 • Yixuan Su, Tian Lan, Yan Wang, Dani Yogatama, Lingpeng Kong, Nigel Collier
Text generation is of great importance to many natural language processing applications.
1 code implementation • 27 Jan 2022 • Hanhan Zhou, Tian Lan, Guru Venkataramani, Wenbo Ding
In this paper, we present a unifying framework for heterogeneous FL algorithms with {\em arbitrary} adaptive online model pruning and provide a general convergence analysis.
no code implementations • 20 Jan 2022 • Jiayu Chen, Jingdi Chen, Tian Lan, Vaneet Aggarwal
Covering skill (a. k. a., option) discovery has been developed to improve the exploration of reinforcement learning in single-agent scenarios with sparse reward signals, through connecting the most distant states in the embedding space provided by the Fiedler vector of the state transition graph.
1 code implementation • 4 Jan 2022 • Hanhan Zhou, Tian Lan, Vaneet Aggarwal
To this end, we present LSF-SAC, a novel framework that features a variational inference-based information-sharing mechanism as extra state information to assist individual agents in the value function factorization.
no code implementations • 23 Nov 2021 • Hanhan Zhou, Tian Lan, Guru Venkataramani
The virtual try-on system has gained great attention due to its potential to give customers a realistic, personalized product presentation in virtualized settings.
2 code implementations • Findings (NAACL) 2022 • Yixuan Su, Fangyu Liu, Zaiqiao Meng, Tian Lan, Lei Shu, Ehsan Shareghi, Nigel Collier
Masked language models (MLMs) such as BERT and RoBERTa have revolutionized the field of Natural Language Understanding in the past few years.
1 code implementation • 13 Oct 2021 • Tian Lan, Deng Cai, Yan Wang, Yixuan Su, Heyan Huang, Xian-Ling Mao
In this study, we present a solution to directly select proper responses from a large corpus or even a nonparallel corpus that only consists of unpaired sentences, using a dense retrieval model.
2 code implementations • 20 Sep 2021 • Aadyot Bhatnagar, Paul Kassianik, Chenghao Liu, Tian Lan, Wenzhuo Yang, Rowan Cassius, Doyen Sahoo, Devansh Arpit, Sri Subramanian, Gerald Woo, Amrita Saha, Arun Kumar Jagota, Gokulakrishnan Gopalakrishnan, Manpreet Singh, K C Krithika, Sukumar Maddineni, Daeki Cho, Bo Zong, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Steven Hoi, Huan Wang
We introduce Merlion, an open-source machine learning library for time series.
3 code implementations • 13 Sep 2021 • Jian Xu, Shao-Lun Huang, Linqi Song, Tian Lan
To this end, previous work either makes use of auxiliary data at parameter server to verify the received gradients (e. g., by computing validation error rate) or leverages statistic-based methods (e. g. median and Krum) to identify and remove malicious gradients from Byzantine clients.
3 code implementations • 31 Aug 2021 • Tian Lan, Sunil Srinivasa, Huan Wang, Stephan Zheng
We present WarpDrive, a flexible, lightweight, and easy-to-use open-source RL framework that implements end-to-end deep multi-agent RL on a single GPU (Graphics Processing Unit), built on PyCUDA and PyTorch.
no code implementations • 30 Jul 2021 • Guangfeng Yan, Shao-Lun Huang, Tian Lan, Linqi Song
Gradient quantization is an emerging technique in reducing communication costs in distributed learning.
1 code implementation • 5 Mar 2021 • Jiayu Chen, Abhishek K. Umrawal, Tian Lan, Vaneet Aggarwal
Then an efficient multi-transfer matching algorithm is executed to assign the delivery requests to the trucks.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 1 Jan 2021 • Guangfeng Yan, Shao-Lun Huang, Tian Lan, Linqi Song
This paper addresses this issue by proposing a novel dynamic quantized SGD (DQSGD) framework, which enables us to optimize the quantization strategy for each gradient descent step by exploring the trade-off between communication cost and modeling error.
no code implementations • 21 Dec 2020 • Tian Lan, Xian-Ling Mao, Zhipeng Zhao, Wei Wei, Heyan Huang
Since the pre-trained language models are widely used, retrieval-based open-domain dialog systems, have attracted considerable attention from researchers recently.
1 code implementation • 17 Dec 2020 • Tian Lan, Xian-Ling Mao, Xiaoyan Gao, Wei Wei, Heyan Huang
Specifically, in our proposed DSHC model, a hashing optimizing module that consists of two autoencoder models is stacked on a trained dense representation model, and three loss functions are designed to optimize it.
4 code implementations • 19 Aug 2020 • Hang Zhao, Jiyang Gao, Tian Lan, Chen Sun, Benjamin Sapp, Balakrishnan Varadarajan, Yue Shen, Yi Shen, Yuning Chai, Cordelia Schmid, Cong-Cong Li, Dragomir Anguelov
Our key insight is that for prediction within a moderate time horizon, the future modes can be effectively captured by a set of target states.
no code implementations • 7 Aug 2020 • Tian Lan, Xian-Ling Mao, Wei Wei, He-Yan Huang
Thus, in this paper, we will measure systematically nearly all representative hierarchical and non-hierarchical models over the same experimental settings to check which kind is better.
1 code implementation • 6 Apr 2020 • Tian Lan, Xian-Ling Mao, Wei Wei, Xiaoyan Gao, He-Yan Huang
Through extensive experiments, the learning-based metrics are demonstrated that they are the most effective evaluation metrics for open-domain generative dialogue systems.
no code implementations • 18 Feb 2020 • Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A. Plummer, Stan Sclaroff, Jayan Eledath, Gerard Medioni
We embed the attention module in a ``slow-fast'' architecture, where the slower network runs on sparsely sampled keyframes and the light-weight shallow network runs on non-keyframes at a high frame rate.
no code implementations • 20 Dec 2019 • Tian Lan, Xian-Ling Mao, He-Yan Huang, Wei Wei
Intuitively, a dialogue model that can control the timing of talking autonomously based on the conversation context can chat with humans more naturally.
no code implementations • 1 Nov 2019 • Hongfa Xue, Yongsheng Mei, Kailash Gogineni, Guru Venkataramani, Tian Lan
Detecting code clones is crucial in various software engineering tasks.
no code implementations • 17 Sep 2019 • Tian Lan, Xian-Ling Mao, He-Yan Huang
As far as we know, the existing task-oriented dialogue systems obtain the dialogue policy through classification, which can assign either a dialogue act and its corresponding parameters or multiple dialogue acts without their corresponding parameters for a dialogue action.
1 code implementation • 25 Aug 2019 • Yong Hu, He-Yan Huang, Tian Lan, Xiaochi Wei, Yuxiang Nie, Jiarui Qi, Liner Yang, Xian-Ling Mao
Second language acquisition (SLA) modeling is to predict whether second language learners could correctly answer the questions according to what they have learned.
no code implementations • 26 Jun 2019 • Siheng Chen, Sufeng. Niu, Tian Lan, Baoan Liu
We present a novel graph-neural-network-based system to effectively represent large-scale 3D point clouds with the applications to autonomous driving.
no code implementations • 30 May 2018 • Tian Lan, Yuanyuan Li, Jonah Kimani Murugi, Yi Ding, Zhiguang Qin
The early detection and early diagnosis of lung cancer are crucial to improve the survival rate of lung cancer patients.
no code implementations • 27 Aug 2016 • Jianhong Wang, Tian Lan, Xu Zhang, Limin Luo
This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR).
no code implementations • ICCV 2015 • Tian Lan, Yuke Zhu, Amir Roshan Zamir, Silvio Savarese
Realistic videos of human actions exhibit rich spatiotemporal structures at multiple levels of granularity: an action can always be decomposed into multiple finer-grained elements in both space and time.
no code implementations • NeurIPS 2013 • Guang-Tong Zhou, Tian Lan, Arash Vahdat, Greg Mori
We present a maximum margin framework that clusters data using latent variables.
no code implementations • CVPR 2013 • Guang-Tong Zhou, Tian Lan, Weilong Yang, Greg Mori
We conduct image classification by learning a class-toimage distance function that matches objects.
no code implementations • CVPR 2013 • Tian Lan, Greg Mori
We propose Max-Margin Riffled Independence Model (MMRIM), a new method for image tag ranking modeling the structured preferences among tags.
no code implementations • NeurIPS 2010 • Tian Lan, Yang Wang, Weilong Yang, Greg Mori
We propose a discriminative model for recognizing group activities.