Search Results for author: Jinyang Gao

Found 28 papers, 15 papers with code

XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQL

2 code implementations13 Nov 2024 Yingqi Gao, Yifu Liu, Xiaoxia Li, Xiaorong Shi, Yin Zhu, Yiming Wang, Shiqi Li, Wei Li, Yuntao Hong, Zhiling Luo, Jinyang Gao, Liyu Mou, Yu Li

On the other hand, we implement the ICL approach with an example selection method based on named entity recognition to prevent overemphasis on entities.

Diversity In-Context Learning +3

What is Wrong with Perplexity for Long-context Language Modeling?

1 code implementation31 Oct 2024 Lizhe Fang, Yifei Wang, Zhaoyang Liu, Chenheng Zhang, Stefanie Jegelka, Jinyang Gao, Bolin Ding, Yisen Wang

To address this, we propose \textbf{LongPPL}, a novel metric that focuses on key tokens by employing a long-short context contrastive method to identify them.

Document Summarization In-Context Learning +2

MoMQ: Mixture-of-Experts Enhances Multi-Dialect Query Generation across Relational and Non-Relational Databases

no code implementations24 Oct 2024 Zhisheng Lin, Yifu Liu, Zhiling Luo, Jinyang Gao, Yu Li

Additionally, a shared expert group is introduced to address data imbalance, facilitating the transfer of common knowledge from high-resource dialects to low-resource ones.

$α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs

1 code implementation14 Oct 2024 Junkang Wu, Xue Wang, Zhengyi Yang, Jiancan Wu, Jinyang Gao, Bolin Ding, Xiang Wang, Xiangnan He

Aligning large language models (LLMs) with human values and intentions is crucial for their utility, honesty, and safety.

Computational Efficiency

Semantic Alignment for Multimodal Large Language Models

no code implementations23 Aug 2024 Tao Wu, Mengze Li, Jingyuan Chen, Wei Ji, Wang Lin, Jinyang Gao, Kun Kuang, Zhou Zhao, Fei Wu

By involving the bidirectional semantic guidance between different images in the visual-token extraction process, SAM aims to enhance the preservation of linking information for coherent analysis and align the semantics of different images before feeding them into LLM.

Large Language Model Visual Storytelling

$β$-DPO: Direct Preference Optimization with Dynamic $β$

1 code implementation11 Jul 2024 Junkang Wu, Yuexiang Xie, Zhengyi Yang, Jiancan Wu, Jinyang Gao, Bolin Ding, Xiang Wang, Xiangnan He

Direct Preference Optimization (DPO) has emerged as a compelling approach for training Large Language Models (LLMs) to adhere to human preferences.

Informativeness

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

1 code implementation10 Jul 2024 Junkang Wu, Yuexiang Xie, Zhengyi Yang, Jiancan Wu, Jiawei Chen, Jinyang Gao, Bolin Ding, Xiang Wang, Xiangnan He

We categorize noise into pointwise noise, which includes low-quality data points, and pairwise noise, which encompasses erroneous data pair associations that affect preference rankings.

When to Trust LLMs: Aligning Confidence with Response Quality

1 code implementation26 Apr 2024 Shuchang Tao, Liuyi Yao, Hanxing Ding, Yuexiang Xie, Qi Cao, Fei Sun, Jinyang Gao, HuaWei Shen, Bolin Ding

Specifically, the order-preserving reward incentivizes the model to verbalize greater confidence for responses of higher quality to align the order of confidence and quality.

Text Generation

Double-I Watermark: Protecting Model Copyright for LLM Fine-tuning

no code implementations22 Feb 2024 Shen Li, Liuyi Yao, Jinyang Gao, Lan Zhang, Yaliang Li

To support various applications, a prevalent and efficient approach for business owners is leveraging their valuable datasets to fine-tune a pre-trained LLM through the API provided by LLM owners or cloud servers.

DiffsFormer: A Diffusion Transformer on Stock Factor Augmentation

no code implementations5 Feb 2024 Yuan Gao, Haokun Chen, Xiang Wang, Zhicai Wang, Xue Wang, Jinyang Gao, Bolin Ding

Our research demonstrates the efficacy of leveraging AIGS and the DiffsFormer architecture to mitigate data scarcity in stock forecasting tasks.

CARD: Channel Aligned Robust Blend Transformer for Time Series Forecasting

1 code implementation20 May 2023 Wang Xue, Tian Zhou, Qingsong Wen, Jinyang Gao, Bolin Ding, Rong Jin

In this work, we design a special Transformer, i. e., Channel Aligned Robust Blend Transformer (CARD for short), that addresses key shortcomings of CI type Transformer in time series forecasting.

Time Series Time Series Forecasting

Studying the Impact of Data Disclosure Mechanism in Recommender Systems via Simulation

no code implementations1 Apr 2022 Ziqian Chen, Fei Sun, Yifan Tang, Haokun Chen, Jinyang Gao, Bolin Ding

Then we study users' privacy decision making under different data disclosure mechanisms and recommendation models, and how their data disclosure decisions affect the recommender system's performance.

Decision Making Federated Learning +2

OpenBox: A Generalized Black-box Optimization Service

6 code implementations1 Jun 2021 Yang Li, Yu Shen, Wentao Zhang, Yuanwei Chen, Huaijun Jiang, Mingchao Liu, Jiawei Jiang, Jinyang Gao, Wentao Wu, Zhi Yang, Ce Zhang, Bin Cui

Black-box optimization (BBO) has a broad range of applications, including automatic machine learning, engineering, physics, and experimental design.

Experimental Design Transfer Learning

CausCF: Causal Collaborative Filtering for RecommendationEffect Estimation

no code implementations28 May 2021 Xu Xie, Zhaoyang Liu, Shiwen Wu, Fei Sun, Cihang Liu, Jiawei Chen, Jinyang Gao, Bin Cui, Bolin Ding

It is based on the idea that similar users not only have a similar taste on items, but also have similar treatment effect under recommendations.

Collaborative Filtering Recommendation Systems

Explore User Neighborhood for Real-time E-commerce Recommendation

no code implementations28 Feb 2021 Xu Xie, Fei Sun, Xiaoyong Yang, Zhao Yang, Jinyang Gao, Wenwu Ou, Bin Cui

On the one hand, it utilizes UI relations and user neighborhood to capture both global and local information.

Collaborative Filtering Recommendation Systems

PURE: An Uncertainty-aware Recommendation Framework for Maximizing Expected Posterior Utility of Platform

no code implementations1 Jan 2021 Haokun Chen, Zhaoyang Liu, Chen Xu, Ziqian Chen, Jinyang Gao, Bolin Ding

In this paper, we propose a novel recommendation framework which effectively utilizes the information of user uncertainty over different item dimensions and explicitly takes into consideration the impact of display policy on user in order to achieve maximal expected posterior utility for the platform.

Efficient Automatic CASH via Rising Bandits

no code implementations8 Dec 2020 Yang Li, Jiawei Jiang, Jinyang Gao, Yingxia Shao, Ce Zhang, Bin Cui

In this framework, the BO methods are used to solve the HPO problem for each ML algorithm separately, incorporating a much smaller hyperparameter space for BO methods.

Bayesian Optimization BIG-bench Machine Learning +2

MFES-HB: Efficient Hyperband with Multi-Fidelity Quality Measurements

5 code implementations5 Dec 2020 Yang Li, Yu Shen, Jiawei Jiang, Jinyang Gao, Ce Zhang, Bin Cui

Instead of sampling configurations randomly in HB, BOHB samples configurations based on a BO surrogate model, which is constructed with the high-fidelity measurements only.

Bayesian Optimization Hyperparameter Optimization

Contrastive Learning for Sequential Recommendation

1 code implementation27 Oct 2020 Xu Xie, Fei Sun, Zhaoyang Liu, Shiwen Wu, Jinyang Gao, Bolin Ding, Bin Cui

Sequential recommendation methods play a crucial role in modern recommender systems because of their ability to capture a user's dynamic interest from her/his historical interactions.

Contrastive Learning Data Augmentation +1

Modeling Personalized Item Frequency Information for Next-basket Recommendation

2 code implementations31 May 2020 Haoji Hu, Xiangnan He, Jinyang Gao, Zhi-Li Zhang

NBR is in general more complex than the widely studied sequential (session-based) recommendation which recommends the next item based on a sequence of items.

Next-basket recommendation Session-Based Recommendations

Privileged Features Distillation at Taobao Recommendations

no code implementations11 Jul 2019 Chen Xu, Quan Li, Junfeng Ge, Jinyang Gao, Xiaoyong Yang, Changhua Pei, Fei Sun, Jian Wu, Hanxiao Sun, Wenwu Ou

To guarantee the consistency of off-line training and on-line serving, we usually utilize the same features that are both available.

Model Slicing for Supporting Complex Analytics with Elastic Inference Cost and Resource Constraints

1 code implementation3 Apr 2019 Shaofeng Cai, Gang Chen, Beng Chin Ooi, Jinyang Gao

Model slicing could be viewed as an elastic computation solution without requiring more computational resources.

Model Compression

Medical Concept Embedding with Time-Aware Attention

1 code implementation6 Jun 2018 Xiangrui Cai, Jinyang Gao, Kee Yuan Ngiam, Beng Chin Ooi, Ying Zhang, Xiaojie Yuan

Embeddings of medical concepts such as medication, procedure and diagnosis codes in Electronic Medical Records (EMRs) are central to healthcare analytics.

Clustering

PANDA: Facilitating Usable AI Development

no code implementations26 Apr 2018 Jinyang Gao, Wei Wang, Meihui Zhang, Gang Chen, H. V. Jagadish, Guoliang Li, Teck Khim Ng, Beng Chin Ooi, Sheng Wang, Jingren Zhou

In many complex applications such as healthcare, subject matter experts (e. g. Clinicians) are the ones who appreciate the importance of features that affect health, and their knowledge together with existing knowledge bases are critical to the end results.

Autonomous Driving

Rafiki: Machine Learning as an Analytics Service System

1 code implementation PVLDB (The Proceedings of the VLDB Endowment) 2018 Wei Wang, Sheng Wang, Jinyang Gao, Meihui Zhang, Gang Chen, Teck Khim Ng, Beng Chin Ooi

Second, expertise knowledge is required to optimize the training and inference procedures in terms of efficiency and effectiveness, which imposes heavy burden on the system users.

BIG-bench Machine Learning Hyperparameter Optimization +2

Deep Learning At Scale and At Ease

no code implementations25 Mar 2016 Wei Wang, Gang Chen, Haibo Chen, Tien Tuan Anh Dinh, Jinyang Gao, Beng Chin Ooi, Kian-Lee Tan, Sheng Wang

The other is scalability, that is the deep learning system must be able to provision for a huge demand of computing resources for training large models with massive datasets.

Deep Learning Image Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.