Search Results for author: Tianyu Cao

Found 15 papers, 6 papers with code

MetaTS: Meta Teacher-Student Network for Multilingual Sequence Labeling with Minimal Supervision

no code implementations EMNLP 2021 Zheng Li, Danqing Zhang, Tianyu Cao, Ying WEI, Yiwei Song, Bing Yin

In this work, we explore multilingual sequence labeling with minimal supervision using a single unified model for multiple languages.

Meta-Learning

Out of Style: RAG's Fragility to Linguistic Variation

1 code implementation11 Apr 2025 Tianyu Cao, Neel Bhandari, Akhila Yerukola, Akari Asai, Maarten Sap

Despite the impressive performance of Retrieval-augmented Generation (RAG) systems across various NLP benchmarks, their robustness in handling real-world user-LLM interaction queries remains largely underexplored.

Question Answering RAG +1

DCatalyst: A Unified Accelerated Framework for Decentralized Optimization

no code implementations30 Jan 2025 Tianyu Cao, Xiaokai Chen, Gesualdo Scutari

We introduce DCatalyst, a unified black-box framework that integrates Nesterov acceleration into decentralized optimization algorithms.

Enhancing Convergence of Decentralized Gradient Tracking under the KL Property

no code implementations12 Dec 2024 Xiaokai Chen, Tianyu Cao, Gesualdo Scutari

We study decentralized multiagent optimization over networks, modeled as undirected graphs.

Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models

1 code implementation28 Oct 2024 Yilun Jin, Zheng Li, Chenwei Zhang, Tianyu Cao, Yifan Gao, Pratik Jayarao, Mao Li, Xin Liu, Ritesh Sarkhel, Xianfeng Tang, Haodong Wang, Zhengyang Wang, Wenju Xu, Jingfeng Yang, Qingyu Yin, Xian Li, Priyanka Nigam, Yi Xu, Kai Chen, Qiang Yang, Meng Jiang, Bing Yin

Shopping MMLU consists of 57 tasks covering 4 major shopping skills: concept understanding, knowledge reasoning, user behavior alignment, and multi-linguality, and can thus comprehensively evaluate the abilities of LLMs as general shop assistants.

Few-Shot Learning MMLU

Con4m: Context-aware Consistency Learning Framework for Segmented Time Series Classification

no code implementations31 Jul 2024 Junru Chen, Tianyu Cao, Jing Xu, Jiahe Li, Zhilong Chen, Tao Xiao, Yang Yang

Leveraging the contextual priors of MVD at both the data and label levels, we propose a novel consistency learning framework Con4m, which effectively utilizes contextual information more conducive to discriminating consecutive segments in segmented TSC tasks, while harmonizing inconsistent boundary labels for training.

Time Series Time Series Classification

Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports

no code implementations9 Apr 2024 Tianyu Cao, Natraj Raman, Danial Dervovic, Chenhao Tan

In this paper, we use financial report summarization as a case study because financial reports are not only long but also use numbers and tables extensively.

Form Hallucination +2

Enhancing User Intent Capture in Session-Based Recommendation with Attribute Patterns

1 code implementation NeurIPS 2023 Xin Liu, Zheng Li, Yifan Gao, Jingfeng Yang, Tianyu Cao, Zhengyang Wang, Bing Yin, Yangqiu Song

The goal of session-based recommendation in E-commerce is to predict the next item that an anonymous user will purchase based on the browsing and purchase history.

Attribute Session-Based Recommendations

Mutually-paced Knowledge Distillation for Cross-lingual Temporal Knowledge Graph Reasoning

no code implementations27 Mar 2023 Ruijie Wang, Zheng Li, Jingfeng Yang, Tianyu Cao, Chao Zhang, Bing Yin, Tarek Abdelzaher

This paper investigates cross-lingual temporal knowledge graph reasoning problem, which aims to facilitate reasoning on Temporal Knowledge Graphs (TKGs) in low-resource languages by transfering knowledge from TKGs in high-resource ones.

Knowledge Distillation Knowledge Graphs +1

Multilingual Knowledge Graph Completion with Self-Supervised Adaptive Graph Alignment

1 code implementation ACL 2022 Zijie Huang, Zheng Li, Haoming Jiang, Tianyu Cao, Hanqing Lu, Bing Yin, Karthik Subbian, Yizhou Sun, Wei Wang

In this paper, we explore multilingual KG completion, which leverages limited seed alignment as a bridge, to embrace the collective knowledge from multiple languages.

Knowledge Graph Completion

Acceleration in Distributed Optimization under Similarity

no code implementations24 Oct 2021 Ye Tian, Gesualdo Scutari, Tianyu Cao, Alexander Gasnikov

In order to reduce the number of communications to reach a solution accuracy, we proposed a {\it preconditioned, accelerated} distributed method.

Distributed Optimization

QUEACO: Borrowing Treasures from Weakly-labeled Behavior Data for Query Attribute Value Extraction

no code implementations19 Aug 2021 Danqing Zhang, Zheng Li, Tianyu Cao, Chen Luo, Tony Wu, Hanqing Lu, Yiwei Song, Bing Yin, Tuo Zhao, Qiang Yang

We study the problem of query attribute value extraction, which aims to identify named entities from user queries as diverse surface form attribute values and afterward transform them into formally canonical forms.

Attribute Attribute Value Extraction +3

Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data

1 code implementation ACL 2021 Haoming Jiang, Danqing Zhang, Tianyu Cao, Bing Yin, Tuo Zhao

Unfortunately, we observe that weakly labeled data does not necessarily improve, or even deteriorate the model performance (due to the extensive noise in the weak labels) when we train deep NER models over a simple or weighted combination of the strongly labeled and weakly labeled data.

named-entity-recognition Named Entity Recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.