Search Results for author: YuQi Yang

Found 14 papers, 7 papers with code

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

1 code implementation22 Dec 2024 Yuxiang Zhang, YuQi Yang, Jiangming Shu, Yuhang Wang, Jinlin Xiao, Jitao Sang

OpenAI's recent introduction of Reinforcement Fine-Tuning (RFT) showcases the potential of reasoning foundation model and offers a new paradigm for fine-tuning beyond simple pattern imitation.

o1-Coder: an o1 Replication for Coding

1 code implementation29 Nov 2024 Yuxiang Zhang, Shangxi Wu, YuQi Yang, Jiangming Shu, Jinlin Xiao, Chao Kong, Jitao Sang

The technical report introduces O1-CODER, an attempt to replicate OpenAI's o1 model with a focus on coding tasks.

Reinforcement Learning (RL)

Night-to-Day Translation via Illumination Degradation Disentanglement

no code implementations21 Nov 2024 Guanzhou Lan, YuQi Yang, Zhigang Wang, Dong Wang, Bin Zhao, Xuelong Li

Specifically, our method comprises a degradation disentanglement module and a degradation-aware contrastive learning module.

Contrastive Learning Disentanglement +1

Efficient Diffusion as Low Light Enhancer

no code implementations16 Oct 2024 Guanzhou Lan, Qianli Ma, YuQi Yang, Zhigang Wang, Dong Wang, Xuelong Li, Bin Zhao

In this paper, we identify two primary factors contributing to performance degradation: fitting errors and the inference gap.

Low-Light Image Enhancement

Exploring the Privacy Protection Capabilities of Chinese Large Language Models

no code implementations27 Mar 2024 YuQi Yang, Xiaowen Huang, Jitao Sang

Large language models (LLMs), renowned for their impressive capabilities in various tasks, have significantly advanced artificial intelligence.

Multi-Task Dense Prediction via Mixture of Low-Rank Experts

1 code implementation CVPR 2024 YuQi Yang, Peng-Tao Jiang, Qibin Hou, Hao Zhang, Jinwei Chen, Bo Li

Furthermore, to control the parameters and computational cost brought by the increase in the number of experts, we take inspiration from LoRA and propose to leverage the low-rank format of a vanilla convolution in the expert network.

Decoder

Empowering Segmentation Ability to Multi-modal Large Language Models

no code implementations21 Mar 2024 YuQi Yang, Peng-Tao Jiang, Jing Wang, Hao Zhang, Kai Zhao, Jinwei Chen, Bo Li

Multi-modal large language models (MLLMs) can understand image-language prompts and demonstrate impressive reasoning ability.

Dialogue Generation Reasoning Segmentation +2

3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining

1 code implementation14 Apr 2023 Siming Yan, YuQi Yang, YuXiao Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu, QiXing Huang

Masked autoencoders (MAE) have recently been introduced to 3D self-supervised pretraining for point clouds due to their great success in NLP and computer vision.

Decoder

Multi-head Uncertainty Inference for Adversarial Attack Detection

no code implementations20 Dec 2022 YuQi Yang, Songyun Yang, Jiyang Xie. Zhongwei Si, Kai Guo, Ke Zhang, Kongming Liang

We adopt a multi-head architecture with multiple prediction heads (i. e., classifiers) to obtain predictions from different depths in the DNNs and introduce shallow information for the UI.

Adversarial Attack Detection Adversarial Defense

L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation

1 code implementation CVPR 2022 Peng-Tao Jiang, YuQi Yang, Qibin Hou, Yunchao Wei

Our framework conducts the global network to learn the captured rich object detail knowledge from a global view and thereby produces high-quality attention maps that can be directly used as pseudo annotations for semantic segmentation networks.

Object Transfer Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.