Search Results for author: Yunjun Gao

Found 25 papers, 18 papers with code

FIT-RAG: Black-Box RAG with Factual Information and Token Reduction

no code implementations21 Mar 2024 YUREN MAO, XueMei Dong, Wenyi Xu, Yunjun Gao, Bin Wei, Ying Zhang

Simply concatenating all the retrieved documents brings large amounts of unnecessary tokens for LLMs, which degenerates the efficiency of black-box RAG.

Open-Domain Question Answering Retrieval +2

FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis

no code implementations19 Jan 2024 Chao Zhang, YUREN MAO, Yijiang Fan, Yu Mi, Yunjun Gao, Lu Chen, Dongfang Lou, Jinshu Lin

Text-to-SQL, which provides zero-code interface for operating relational databases, has gained much attention in financial analysis; because, financial professionals may not well-skilled in SQL programming.

Language Modelling Large Language Model +1

Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data Segment

1 code implementation4 Jan 2024 Mengzhao Wang, Weizhi Xu, Xiaomeng Yi, Songlin Wu, Zhangyang Peng, Xiangyu Ke, Yunjun Gao, Xiaoliang Xu, Rentong Guo, Charles Xie

In this paper, we present Starling, an I/O-efficient disk-resident graph index framework that optimizes data layout and search strategy within the segment.

View-based Explanations for Graph Neural Networks

1 code implementation4 Jan 2024 Tingyang Chen, Dazhuo Qiu, Yinghui Wu, Arijit Khan, Xiangyu Ke, Yunjun Gao

Existing approaches aim to understand the overall results of GNNs rather than providing explanations for specific class labels of interest, and may return explanation structures that are hard to access, nor directly queryable. We propose GVEX, a novel paradigm that generates Graph Views for EXplanation.

Graph Classification

MUST: An Effective and Scalable Framework for Multimodal Search of Target Modality

1 code implementation11 Dec 2023 Mengzhao Wang, Xiangyu Ke, Xiaoliang Xu, Lu Chen, Yunjun Gao, Pinpin Huang, Runkai Zhu

We investigate the problem of multimodal search of target modality, where the task involves enhancing a query in a specific target modality by integrating information from auxiliary modalities.

Information Retrieval

MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching

1 code implementation2 Aug 2023 Xiaocan Zeng, Pengfei Wang, YUREN MAO, Lu Chen, Xiaoze Liu, Yunjun Gao

Traditional unsupervised EM assumes that all entities come from two tables; however, it is more common to match entities from multiple tables in practical applications, that is, multi-table entity matching (multi-table EM).

Management

C3: Zero-shot Text-to-SQL with ChatGPT

1 code implementation14 Jul 2023 XueMei Dong, Chao Zhang, Yuhang Ge, YUREN MAO, Yunjun Gao, Lu Chen, Jinshu Lin, Dongfang Lou

This paper proposes a ChatGPT-based zero-shot Text-to-SQL method, dubbed C3, which achieves 82. 3\% in terms of execution accuracy on the holdout test set of Spider and becomes the state-of-the-art zero-shot Text-to-SQL method on the Spider Challenge.

Text-To-SQL

Real-time Workload Pattern Analysis for Large-scale Cloud Databases

no code implementations5 Jul 2023 Jiaqi Wang, Tianyi Li, Anni Wang, Xiaoze Liu, Lu Chen, Jie Chen, Jianye Liu, Junyang Wu, Feifei Li, Yunjun Gao

This has led to the increasing volume of database workloads, which provides the opportunity for pattern analysis.

Knowledge-refined Denoising Network for Robust Recommendation

1 code implementation28 Apr 2023 Xinjun Zhu, Yuntao Du, YUREN MAO, Lu Chen, Yujia Hu, Yunjun Gao

Knowledge graph (KG), which contains rich side information, becomes an essential part to boost the recommendation performance and improve its explainability.

Denoising Knowledge-Aware Recommendation +1

Towards Explainable Collaborative Filtering with Taste Clusters Learning

1 code implementation27 Apr 2023 Yuntao Du, Jianxun Lian, Jing Yao, Xiting Wang, Mingqi Wu, Lu Chen, Yunjun Gao, Xing Xie

In recent decades, there have been significant advancements in latent embedding-based CF methods for improved accuracy, such as matrix factorization, neural collaborative filtering, and LightGCN.

Collaborative Filtering Decision Making +3

SEA: A Scalable Entity Alignment System

1 code implementation14 Apr 2023 Junyang Wu, Tianyi Li, Lu Chen, Yunjun Gao, Ziheng Wei

To enhance the usability of GNN-based EA models in real-world applications, we present SEA, a scalable entity alignment system that enables to (i) train large-scale GNNs for EA, (ii) speed up the normalization and the evaluation process, and (iii) report clear results for users to estimate different models and parameter settings.

Entity Alignment Knowledge Graphs

Unsupervised Entity Alignment for Temporal Knowledge Graphs

1 code implementation1 Feb 2023 Xiaoze Liu, Junyang Wu, Tianyi Li, Lu Chen, Yunjun Gao

State-of-the-art time-aware EA studies have suggested that the temporal information of TKGs facilitates the performance of EA.

Entity Alignment Graph Matching +1

Estimator: An Effective and Scalable Framework for Transportation Mode Classification over Trajectories

no code implementations11 Dec 2022 Danlei Hu, Ziquan Fang, Hanxi Fang, Tianyi Li, Chunhui Shen, Lu Chen, Yunjun Gao

Transportation mode classification, the process of predicting the class labels of moving objects transportation modes, has been widely applied to a variety of real world applications, such as traffic management, urban computing, and behavior study.

Classification Management

ClusterEA: Scalable Entity Alignment with Stochastic Training and Normalized Mini-batch Similarities

2 code implementations20 May 2022 Yunjun Gao, Xiaoze Liu, Junyang Wu, Tianyi Li, Pengfei Wang, Lu Chen

To tackle this challenge, we present ClusterEA, a general framework that is capable of scaling up EA models and enhancing their results by leveraging normalization methods on mini-batches with a high entity equivalent rate.

Entity Alignment Entity Embeddings +1

Self-Guided Learning to Denoise for Robust Recommendation

2 code implementations14 Apr 2022 Yunjun Gao, Yuntao Du, Yujia Hu, Lu Chen, Xinjun Zhu, Ziquan Fang, Baihua Zheng

Besides, our method can automatically switch its learning phase at the memorization point from memorization to self-guided learning, and select clean and informative memorized data via a novel adaptive denoising scheduler to improve the robustness.

Denoising Memorization +2

HAKG: Hierarchy-Aware Knowledge Gated Network for Recommendation

1 code implementation11 Apr 2022 Yuntao Du, Xinjun Zhu, Lu Chen, Baihua Zheng, Yunjun Gao

Furthermore, we propose a dual item embeddings design to represent and propagate collaborative signals and knowledge associations separately, and leverage the gated aggregation to distill discriminative information for better capturing user behavior patterns.

Knowledge-Aware Recommendation

Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation

1 code implementation CVPR 2022 Zhenguang Liu, Runyang Feng, Haoming Chen, Shuang Wu, Yixing Gao, Yunjun Gao, Xiang Wang

State-of-the-art methods strive to incorporate additional visual evidences from neighboring frames (supporting frames) to facilitate the pose estimation of the current frame (key frame).

Pose Estimation

MetaKG: Meta-learning on Knowledge Graph for Cold-start Recommendation

1 code implementation8 Feb 2022 Yuntao Du, Xinjun Zhu, Lu Chen, Ziquan Fang, Yunjun Gao

Inspired by the success of meta-learning on scarce training samples, we propose a novel meta-learning based framework called MetaKG, which encompasses a collaborative-aware meta learner and a knowledge-aware meta learner, to capture meta users' preference and entities' knowledge for cold-start recommendations.

Meta-Learning

Deep Spatially and Temporally Aware Similarity Computation for Road Network Constrained Trajectories

1 code implementation17 Dec 2021 Ziquan Fang, Yuntao Du, Xinjun Zhu, Lu Chen, Yunjun Gao, Christian S. Jensen

Trajectory similarity computation has drawn massive attention, as it is core functionality in a wide range of applications such as ride-sharing, traffic analysis, and social recommendation.

Representation Learning

FastSGD: A Fast Compressed SGD Framework for Distributed Machine Learning

no code implementations8 Dec 2021 Keyu Yang, Lu Chen, Zhihao Zeng, Yunjun Gao

Distributed ML models trained by SGD involve large amounts of gradient communication, which limits the scalability of distributed ML.

BIG-bench Machine Learning Quantization

Finding Materialized Models for Model Reuse

1 code implementation13 Oct 2021 Minjun Zhao, Lu Chen, Keyu Yang, Yuntao Du, Yunjun Gao

It uses a Gaussian mixture-based metric called separation degree to rank materialized models.

Model Selection Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.