Search Results for author: Qingyang Li

Found 17 papers, 5 papers with code

Synthetic Dialogue Dataset Generation using LLM Agents

1 code implementation • 30 Jan 2024 • Yelaman Abdullin, Diego Molla-Aliod, Bahadorreza Ofoghi, John Yearwood, Qingyang Li

We conduct human and automatic evaluations, including an evaluation approach that uses GPT-4 to mimic the human evaluation metrics.

Prompt Engineering

Paper
Code

Ask One More Time: Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios

no code implementations • 14 Nov 2023 • Lei Lin, Jiayi Fu, Pengli Liu, Qingyang Li, Yan Gong, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai

Although chain-of-thought (CoT) prompting combined with language models has achieved encouraging results on complex reasoning tasks, the naive greedy decoding used in CoT prompting usually causes the repetitiveness and local optimality.

Language Modelling

Paper
Add Code

Comparative study of microgrid optimal scheduling under multi-optimization algorithm fusion

no code implementations • 3 Oct 2023 • Hongyi Duan, Qingyang Li, Yuchen Li, Jianan Zhang, Yuming Xie

As global attention on renewable and clean energy grows, the research and implementation of microgrids become paramount.

Scheduling

Paper
Add Code

Improvement and Enhancement of YOLOv5 Small Target Recognition Based on Multi-module Optimization

no code implementations • 3 Oct 2023 • Qingyang Li, Yuchen Li, Hongyi Duan, JiaLiang Kang, Jianan Zhang, Xueqian Gan, Ruotong Xu

In this paper, the limitations of YOLOv5s model on small target detection task are deeply studied and improved.

Paper
Add Code

Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems

1 code implementation • 3 May 2023 • Xiong-Hui Chen, Bowei He, Yang Yu, Qingyang Li, Zhiwei Qin, Wenjie Shang, Jieping Ye, Chen Ma

However, building a user simulator with no reality-gap, i. e., can predict user's feedback exactly, is unrealistic because the users' reaction patterns are complex and historical logs for each user are limited, which might mislead the simulator-based recommendation policy.

Decision Making Recommendation Systems +1

Paper
Code

Fewer is More: Efficient Object Detection in Large Aerial Images

1 code implementation • 26 Dec 2022 • Xingxing Xie, Gong Cheng, Qingyang Li, Shicheng Miao, Ke Li, Junwei Han

Current mainstream object detection methods for large aerial images usually divide large images into patches and then exhaustively detect the objects of interest on all patches, no matter whether there exist objects or not.

4k Object +2

Paper
Code

Spatio-temporal Incentives Optimization for Ride-hailing Services with Offline Deep Reinforcement Learning

no code implementations • 6 Nov 2022 • Yanqiu Wu, Qingyang Li, Zhiwei Qin

Motivated by this observation, we make an attempt to optimize the distribution of demand to handle this problem by learning the long-term spatio-temporal values as a guideline for pricing strategy.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Offline Model-based Adaptable Policy Learning

1 code implementation • NeurIPS 2021 • Xiong-Hui Chen, Yang Yu, Qingyang Li, Fan-Ming Luo, Zhiwei Qin, Wenjie Shang, Jieping Ye

Current offline reinforcement learning methods commonly learn in the policy space constrained to in-support regions by the offline dataset, in order to ensure the robustness of the outcome policies.

Decision Making reinforcement-learning +1

Paper
Code

Offline Adaptive Policy Leaning in Real-World Sequential Recommendation Systems

no code implementations • 1 Jan 2021 • Xiong-Hui Chen, Yang Yu, Qingyang Li, Zhiwei Tony Qin, Wenjie Shang, Yiping Meng, Jieping Ye

Instead of increasing the fidelity of models for policy learning, we handle the distortion issue via learning to adapt to diverse simulators generated by the offline dataset.

Sequential Recommendation

Paper
Add Code

Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation

1 code implementation • 2 Apr 2020 • Mengyue Yang, Qingyang Li, Zhiwei Qin, Jieping Ye

In this paper, we propose a hierarchical adaptive contextual bandit method (HATCH) to conduct the policy learning of contextual bandits with a budget constraint.

Multi-Armed Bandits

Paper
Code

Weak Edge Identification Nets for Ocean Front Detection

no code implementations • 17 Sep 2019 • Qingyang Li, Guoqiang Zhong, Cui Xie

The method uses the stochastic gradient descent and the correlation loss function to obtain a good ocean front image output.

Edge Detection

Paper
Add Code

Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation

no code implementations • 12 Jul 2019 • Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei Qin, Yiping Meng, Jieping Ye

DEMER also derives a recommendation policy with a significantly improved performance in the test phase of the real application.

Imitation Learning reinforcement-learning +1

Paper
Add Code

Long Short-Term Attention

no code implementations • 30 Oct 2018 • Guoqiang Zhong, Xin Lin, Kang Chen, Qingyang Li, Kai-Zhu Huang

Attention is an important cognition process of humans, which helps humans concentrate on critical information during their perception and learning.

Paper
Add Code

Multi-task Dictionary Learning based Convolutional Neural Network for Computer aided Diagnosis with Longitudinal Images

no code implementations • 31 Aug 2017 • Jie Zhang, Qingyang Li, Richard J. Caselli, Jieping Ye, Yalin Wang

Firstly, we pre-train CNN on the ImageNet dataset and transfer the knowledge from the pre-trained model to the medical imaging progression representation, generating the features for different tasks.

Dictionary Learning Image Classification +1

Paper
Add Code

Large-scale Feature Selection of Risk Genetic Factors for Alzheimer's Disease via Distributed Group Lasso Regression

no code implementations • 27 Apr 2017 • Qingyang Li, Dajiang Zhu, Jie Zhang, Derrek Paul Hibar, Neda Jahanshad, Yalin Wang, Jieping Ye, Paul M. Thompson, Jie Wang

Then we select the relevant group features by performing the group Lasso feature selection process in a sequence of parameters.

feature selection

Paper
Add Code

Large-scale Collaborative Imaging Genetics Studies of Risk Genetic Factors for Alzheimer's Disease Across Multiple Institutions

no code implementations • 19 Aug 2016 • Qingyang Li, Tao Yang, Liang Zhan, Derrek Paul Hibar, Neda Jahanshad, Yalin Wang, Jieping Ye, Paul M. Thompson, Jie Wang

To the best of our knowledge, this is the first successful run of the computationally intensive model selection procedure to learn a consistent model across different institutions without compromising their privacy while ranking the SNPs that may collectively affect AD.

Model Selection

Paper
Add Code

Stochastic Coordinate Coding and Its Application for Drosophila Gene Expression Pattern Annotation

no code implementations • 30 Jul 2014 • Binbin Lin, Qingyang Li, Qian Sun, Ming-Jun Lai, Ian Davidson, Wei Fan, Jieping Ye

The effectiveness of gene expression pattern annotation relies on the quality of feature representation.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.