Search Results for author: Yinghui Xu

Found 45 papers, 15 papers with code

AI2Agent: An End-to-End Framework for Deploying AI Projects as Autonomous Agents

1 code implementation31 Mar 2025 Jiaxiang Chen, Jingwei Shi, Lei Gan, Jiale Zhang, Qingyu Zhang, Dongqian Zhang, Xin Pang, Zhucong Li, Yinghui Xu

As AI technology advances, it is driving innovation across industries, increasing the demand for scalable AI project deployment.

Text-to-Image Generation

Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents

no code implementations11 Mar 2025 Rui Xu, Mingyu Wang, Xintao Wang, Dakuan Lu, Xiaoyu Tan, Wei Chu, Yinghui Xu

To address this challenge, we propose MIRROR, a chain-of-thought approach that generates character thoughts by retrieving memories, predicting character reactions, and synthesizing motivations.

AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification

no code implementations17 Feb 2025 Xiaoyu Tan, Tianchu Yao, Chao Qu, Bin Li, Minghao Yang, Dakuan Lu, Haozhe Wang, Xihe Qiu, Wei Chu, Yinghui Xu, Yuan Qi

In this paper, we present AURORA, a novel automated framework for training universal process reward models (PRMs) using ensemble prompting and reverse verification.

SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain

1 code implementation26 Jan 2025 Dakuan Lu, Xiaoyu Tan, Rui Xu, Tianchu Yao, Chao Qu, Wei Chu, Yinghui Xu, Yuan Qi

Recent breakthroughs in large language models (LLMs) exemplified by the impressive mathematical and scientific reasoning capabilities of the o1 model have spotlighted the critical importance of high-quality training data in advancing LLM performance across STEM disciplines.

An Attentive Dual-Encoder Framework Leveraging Multimodal Visual and Semantic Information for Automatic OSAHS Diagnosis

1 code implementation25 Dec 2024 Yingchen Wei, Xihe Qiu, Xiaoyu Tan, Jingjing Huang, Wei Chu, Yinghui Xu, Yuan Qi

Cross-attention combines image and text data for better feature extraction, and ordered regression loss ensures stable learning.

Diagnostic

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

no code implementations7 Nov 2024 Siming Huang, Tianhao Cheng, J. K. Liu, Jiaran Hao, Liuyihan Song, Yang Xu, J. Yang, Jiaheng Liu, Chenchen Zhang, Linzheng Chai, Ruifeng Yuan, Zhaoxiang Zhang, Jie Fu, Qian Liu, Ge Zhang, Zili Wang, Yuan Qi, Yinghui Xu, Wei Chu

To address the gap, we introduce OpenCoder, a top-tier code LLM that not only achieves performance comparable to leading models but also serves as an "open cookbook" for the research community.

Code Generation

LMGT: Optimizing Exploration-Exploitation Balance in Reinforcement Learning through Language Model Guided Trade-offs

no code implementations7 Sep 2024 Yongxin Deng, Xihe Qiu, Xiaoyu Tan, Wei Chu, Yinghui Xu

The uncertainty inherent in the environmental transition model of Reinforcement Learning (RL) necessitates a careful balance between exploration and exploitation to optimize the use of computational resources for accurately estimating an agent's expected reward.

Language Modeling Language Modelling +2

Interpreting and Mitigating Hallucination in MLLMs through Multi-agent Debate

no code implementations30 Jul 2024 Zheng Lin, Zhenxing Niu, Zhibin Wang, Yinghui Xu

MLLMs often generate outputs that are inconsistent with the visual content, a challenge known as hallucination.

Hallucination

Robust Deep Hawkes Process under Label Noise of Both Event and Occurrence

1 code implementation24 Jul 2024 Xiaoyu Tan, Bin Li, Xihe Qiu, Jingjing Huang, Yinghui Xu, Wei Chu

To the best of our knowledge, this is the first study to successfully address both event and time label noise in deep Hawkes process models, offering a promising solution for medical applications, specifically in diagnosing OSAHS.

Thought-Like-Pro: Enhancing Reasoning of Large Language Models through Self-Driven Prolog-based Chain-of-Thought

no code implementations18 Jul 2024 Xiaoyu Tan, Yongxin Deng, Xihe Qiu, Weidi Xu, Chao Qu, Wei Chu, Yinghui Xu, Yuan Qi

To address these challenges, we introduce a novel learning framework, THOUGHT-LIKE-PRO In this framework, we utilize imitation learning to imitate the Chain-of-Thought (CoT) process which is verified and translated from reasoning trajectories generated by a symbolic Prolog logic engine.

Imitation Learning

Struct-X: Enhancing Large Language Models Reasoning with Structured Data

no code implementations17 Jul 2024 Xiaoyu Tan, Haoyu Wang, Xihe Qiu, Yuan Cheng, Yinghui Xu, Wei Chu, Yuan Qi

Structured data, rich in logical and relational information, has the potential to enhance the reasoning abilities of large language models (LLMs).

Data Augmentation Reading Comprehension

MINDECHO: Role-Playing Language Agents for Key Opinion Leaders

no code implementations7 Jul 2024 Rui Xu, Dakuan Lu, Xiaoyu Tan, Xintao Wang, Siyu Yuan, Jiangjie Chen, Wei Chu, Yinghui Xu

Large language models~(LLMs) have demonstrated impressive performance in various applications, among which role-playing language agents (RPLAs) have engaged a broad user base.

Retrieval

AI2Apps: A Visual IDE for Building LLM-based AI Agent Applications

1 code implementation7 Apr 2024 Xin Pang, Zhucong Li, Jiaxiang Chen, Yuan Cheng, Yinghui Xu, Yuan Qi

We introduce AI2Apps, a Visual Integrated Development Environment (Visual IDE) with full-cycle capabilities that accelerates developers to build deployable LLM-based AI agent Applications.

AI Agent Management

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

1 code implementation21 Mar 2024 Shenhao Zhu, Junming Leo Chen, Zuozhuo Dai, Qingkun Su, Yinghui Xu, Xun Cao, Yao Yao, Hao Zhu, Siyu Zhu

In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in curernt human generative techniques.

Animated GIF Generation Image Animation +1

PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching

no code implementations9 Dec 2023 Zhenting Qi, Xiaoyu Tan, Shaojie Shi, Chao Qu, Yinghui Xu, Yuan Qi

Instruction fine-tuning has conventionally been employed to adapt Large Language Models (LLMs) to a variety of tasks.

In-Context Learning

InfMLLM: A Unified Framework for Visual-Language Tasks

1 code implementation12 Nov 2023 Qiang Zhou, Zhibin Wang, Wei Chu, Yinghui Xu, Hao Li, Yuan Qi

Our experiments demonstrate that preserving the positional information of visual embeddings through the pool-adapter is particularly beneficial for tasks like visual grounding.

Image Captioning Instruction Following +3

FuXi: A cascade machine learning forecasting system for 15-day global weather forecast

2 code implementations22 Jun 2023 Lei Chen, Xiaohui Zhong, Feng Zhang, Yuan Cheng, Yinghui Xu, Yuan Qi, Hao Li

Over the past few years, due to the rapid development of machine learning (ML) models for weather forecasting, state-of-the-art ML models have shown superior performance compared to the European Centre for Medium-Range Weather Forecasts (ECMWF)'s high-resolution forecast (HRES) in 10-day forecasts at a spatial resolution of 0. 25 degree.

Weather Forecasting

Communication Efficient SGD via Gradient Sampling With Bayes Prior

no code implementations CVPR 2021 Liuyihan Song, Kang Zhao, Pan Pan, Yu Liu, Yingya Zhang, Yinghui Xu, Rong Jin

Different from all of them, we regard large and small gradients selection as the exploitation and exploration of gradient information, respectively.

Image Classification object-detection +2

DecentLaM: Decentralized Momentum SGD for Large-batch Deep Training

1 code implementation ICCV 2021 Kun Yuan, Yiming Chen, Xinmeng Huang, Yingya Zhang, Pan Pan, Yinghui Xu, Wotao Yin

Experimental results on a variety of computer vision tasks and models demonstrate that DecentLaM promises both efficient and high-quality training.

Multiple Object Tracking with Correlation Learning

no code implementations CVPR 2021 Qiang Wang, Yun Zheng, Pan Pan, Yinghui Xu

Recent works have shown that convolutional networks have substantially improved the performance of multiple object tracking by simultaneously learning detection and appearance features.

Multiple Object Tracking Object +1

Few-Shot Incremental Learning with Continually Evolved Classifiers

1 code implementation CVPR 2021 Chi Zhang, Nan Song, Guosheng Lin, Yun Zheng, Pan Pan, Yinghui Xu

First, we adopt a simple but effective decoupled learning strategy of representations and classifiers that only the classifiers are updated in each incremental session, which avoids knowledge forgetting in the representations.

class-incremental learning Few-Shot Class-Incremental Learning +1

Self-supervised Video Representation Learning by Context and Motion Decoupling

1 code implementation CVPR 2021 Lianghua Huang, Yu Liu, Bin Wang, Pan Pan, Yinghui Xu, Rong Jin

A key challenge in self-supervised video representation learning is how to effectively capture motion information besides context bias.

Action Recognition Decoder +4

Practical Relative Order Attack in Deep Ranking

2 code implementations ICCV 2021 Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Yinghui Xu, Nanning Zheng, Gang Hua

In this paper, we formulate a new adversarial attack against deep ranking systems, i. e., the Order Attack, which covertly alters the relative order among a selected set of candidates according to an attacker-specified permutation, with limited interference to other unrelated candidates.

Adversarial Attack Triplet

Large-Scale Visual Search with Binary Distributed Graph at Alibaba

no code implementations9 Feb 2021 Kang Zhao, Pan Pan, Yun Zheng, Yanhao Zhang, Changxu Wang, Yingya Zhang, Yinghui Xu, Rong Jin

For a deployed visual search system with several billions of online images in total, building a billion-scale offline graph in hours is essential, which is almost unachievable by most existing methods.

graph construction

Distribution Adaptive INT8 Quantization for Training CNNs

no code implementations9 Feb 2021 Kang Zhao, Sida Huang, Pan Pan, Yinghan Li, Yingya Zhang, Zhenyu Gu, Yinghui Xu

Researches have demonstrated that low bit-width (e. g., INT8) quantization can be employed to accelerate the inference process.

Image Classification object-detection +3

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

no code implementations9 Feb 2021 Yu Liu, Lianghua Huang, Pan Pan, Bin Wang, Yinghui Xu, Rong Jin

However, scaling up the classification task from thousands of semantic labels to millions of instance labels brings specific challenges including 1) the large-scale softmax computation; 2) the slow convergence due to the infrequent visiting of instance samples; and 3) the massive number of negative classes that can be noisy.

Classification General Classification +2

Large Scale Long-tailed Product Recognition System at Alibaba

no code implementations9 Feb 2021 Xiangzeng Zhou, Pan Pan, Yun Zheng, Yinghui Xu, Rong Jin

In this paper, we present a novel side information based large scale visual recognition co-training~(SICoT) system to deal with the long tail problem by leveraging the image related side information.

Object Recognition

Virtual ID Discovery from E-commerce Media at Alibaba: Exploiting Richness of User Click Behavior for Visual Search Relevance

no code implementations9 Feb 2021 Yanhao Zhang, Pan Pan, Yun Zheng, Kang Zhao, Jianmin Wu, Yinghui Xu, Rong Jin

Benefiting from exploration of user click data, our networks are more effective to encode richer supervision and better distinguish real-shot images in terms of category and feature.

Balanced Order Batching with Task-Oriented Graph Clustering

no code implementations19 Aug 2020 Lu Duan, Haoyuan Hu, Zili Wu, Guozheng Li, Xinhang Zhang, Yu Gong, Yinghui Xu

In this paper, rather than designing heuristics, we propose an end-to-end learning and optimization framework named Balanced Task-orientated Graph Clustering Network (BTOGCN) to solve the BOBP by reducing it to balanced graph clustering optimization problem.

Clustering Deep Clustering +1

Accelerating Primal Solution Findings for Mixed Integer Programs Based on Solution Prediction

no code implementations23 Jun 2019 Jian-Ya Ding, Chao Zhang, Lei Shen, Shengyin Li, Bing Wang, Yinghui Xu, Le Song

In many applications, a similar MIP model is solved on a regular basis, maintaining remarkable similarities in model structures and solution appearances but differing in formulation coefficients.

Combinatorial Optimization

Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application

1 code implementation2 Mar 2018 Yujing Hu, Qing Da, An-Xiang Zeng, Yang Yu, Yinghui Xu

For better utilizing the correlation between different ranking steps, in this paper, we propose to use reinforcement learning (RL) to learn an optimal ranking policy which maximizes the expected accumulative rewards in a search session.

Decision Making Learning-To-Rank +2

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models

3 code implementations30 May 2017 Jun Wang, Lantao Yu, Wei-Nan Zhang, Yu Gong, Yinghui Xu, Benyou Wang, Peng Zhang, Dell Zhang

This paper provides a unified account of two schools of thinking in information retrieval modelling: the generative retrieval focusing on predicting relevant documents given a query, and the discriminative retrieval focusing on predicting relevancy given a query-document pair.

Document Ranking Information Retrieval +2

Cannot find the paper you are looking for? You can Submit a new open access paper.