Search Results for author: Ming Yin

Found 37 papers, 4 papers with code

基于自动识别的委婉语历时性发展变化与社会共变研究(A Study on the Diachronic Development and Social Covariance of Euphemism Based on Automatic Recognition)

no code implementations • CCL 2021 • Chenlin Zhang, Mingwen Wang, Yiming Tan, Ming Yin, Xinyi Zhang

“本文主要以汉语委婉语作为研究对象, 基于大量人工标注, 借助机器学习有监督分类方法, 实现了较高精度的委婉语自动识别, 并基于此对1946年-2017年的《人民日报》中的委婉语历时变化发展情况进行量化统计分析。从大规模数据的角度探讨委婉语历时性发展变化、委婉语与社会之间的共变关系, 验证了语言的格雷什姆规律与更新规律。”

Paper
Add Code

Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making

no code implementations • 25 Mar 2024 • Shuai Ma, Qiaoyi Chen, Xinru Wang, Chengbo Zheng, Zhenhui Peng, Ming Yin, Xiaojuan Ma

In AI-assisted decision-making, humans often passively review AI's suggestion and decide whether to accept or reject it as a whole.

Decision Making

Paper
Add Code

Offline Multitask Representation Learning for Reinforcement Learning

no code implementations • 18 Mar 2024 • Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin, Doina Precup

We study offline multitask representation learning in reinforcement learning (RL), where a learner is provided with an offline dataset from different tasks that share a common representation and is asked to learn the shared representation.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Robust Federated Learning Mitigates Client-side Training Data Distribution Inference Attacks

no code implementations • 5 Mar 2024 • Yichang Xu, Ming Yin, Minghong Fang, Neil Zhenqiang Gong

Recent studies have revealed that federated learning (FL), once considered secure due to clients not sharing their private data with the server, is vulnerable to attacks such as client-side training data distribution inference, where a malicious client can recreate the victim's data.

Federated Learning

Paper
Add Code

Beyond Recommender: An Exploratory Study of the Effects of Different AI Roles in AI-Assisted Decision Making

no code implementations • 4 Mar 2024 • Shuai Ma, Chenyi Zhang, Xinru Wang, Xiaojuan Ma, Ming Yin

Artificial Intelligence (AI) is increasingly employed in various decision-making tasks, typically as a Recommender, providing recommendations that the AI deems correct.

Decision Making

Paper
Add Code

Poisoning Federated Recommender Systems with Fake Users

no code implementations • 18 Feb 2024 • Ming Yin, Yichang Xu, Minghong Fang, Neil Zhenqiang Gong

Current poisoning attacks on federated recommender systems often rely on additional information, such as the local training data of genuine users or item popularity.

Federated Learning Recommendation Systems

Paper
Add Code

Does More Advice Help? The Effects of Second Opinions in AI-Assisted Decision Making

no code implementations • 13 Jan 2024 • Zhuoran Lu, Dakuo Wang, Ming Yin

AI assistance in decision-making has become popular, yet people's inappropriate reliance on AI often leads to unsatisfactory human-AI collaboration performance.

Decision Making

Paper
Add Code

Decoding AI's Nudge: A Unified Framework to Predict Human Behavior in AI-assisted Decision Making

no code implementations • 11 Jan 2024 • Zhuoyan Li, Zhuoran Lu, Ming Yin

In this paper, we propose a computational framework that can provide an interpretable characterization of the influence of different forms of AI assistance on decision makers in AI-assisted decision making.

Decision Making

Paper
Add Code

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

2 code implementations • 27 Nov 2023 • Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen

We introduce MMMU: a new benchmark designed to evaluate multimodal models on massive multi-discipline tasks demanding college-level subject knowledge and deliberate reasoning.

Complex Query Answering Logical Reasoning +1

7,086

Paper
Code

Synthetic Data Generation with Large Language Models for Text Classification: Potential and Limitations

no code implementations • 11 Oct 2023 • Zhuoyan Li, Hangxiao Zhu, Zhuoran Lu, Ming Yin

The collection and curation of high-quality training data is crucial for developing text classification models with superior performance, but it is often associated with significant costs and time investment.

Synthetic Data Generation text-classification +1

Paper
Add Code

Model-Free Algorithm with Improved Sample Efficiency for Zero-Sum Markov Games

no code implementations • 17 Aug 2023 • Songtao Feng, Ming Yin, Yu-Xiang Wang, Jing Yang, Yingbin Liang

In this work, we propose a model-free stage-based Q-learning algorithm and show that it achieves the same sample complexity as the best model-based algorithm, and hence for the first time demonstrate that model-free algorithms can enjoy the same optimality in the $H$ dependence as model-based algorithms.

Multi-agent Reinforcement Learning Q-Learning +1

Paper
Add Code

Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data

no code implementations • 24 Jun 2023 • Sunil Madhow, Dan Xiao, Ming Yin, Yu-Xiang Wang

Developing theoretical guarantees on the sample complexity of offline RL methods is an important step towards making data-hungry RL algorithms practically viable.

Offline RL reinforcement-learning

Paper
Add Code

Non-stationary Reinforcement Learning under General Function Approximation

no code implementations • 1 Jun 2023 • Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin Liang

To the best of our knowledge, this is the first dynamic regret analysis in non-stationary MDPs with general function approximation.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

TheoremQA: A Theorem-driven Question Answering dataset

1 code implementation • 21 May 2023 • Wenhu Chen, Ming Yin, Max Ku, Pan Lu, Yixin Wan, Xueguang Ma, Jianyu Xu, Xinyi Wang, Tony Xia

We evaluate a wide spectrum of 16 large language and code models with different prompting strategies like Chain-of-Thoughts and Program-of-Thoughts.

Ranked #1 on Natural Questions on TheoremQA

Math Question Answering

152

Paper
Code

Interactive Concept Learning for Uncovering Latent Themes in Large Text Collections

no code implementations • 8 May 2023 • Maria Leonor Pacheco, Tunazzina Islam, Lyle Ungar, Ming Yin, Dan Goldwasser

Experts across diverse disciplines are often interested in making sense of large text collections.

Topic Models

Paper
Add Code

No-Regret Linear Bandits beyond Realizability

no code implementations • 26 Feb 2023 • Chong Liu, Ming Yin, Yu-Xiang Wang

It achieves a near-optimal $\sqrt{T}$ regret for problems that the best-known regret is almost linear in time horizon $T$.

Paper
Add Code

Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs

no code implementations • 24 Feb 2023 • Dan Qiao, Ming Yin, Yu-Xiang Wang

In many real-life reinforcement learning (RL) problems, deploying new policies is costly.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Who Should I Trust: AI or Myself? Leveraging Human and AI Correctness Likelihood to Promote Appropriate Trust in AI-Assisted Decision-Making

no code implementations • 14 Jan 2023 • Shuai Ma, Ying Lei, Xinru Wang, Chengbo Zheng, Chuhan Shi, Ming Yin, Xiaojuan Ma

To mitigate this gap, we proposed to promote humans' appropriate trust based on the CL of both sides at a task-instance level.

Decision Making

Paper
Add Code

Offline Reinforcement Learning with Closed-Form Policy Improvement Operators

no code implementations • 29 Nov 2022 • Jiachen Li, Edwin Zhang, Ming Yin, Qinxun Bai, Yu-Xiang Wang, William Yang Wang

Behavior constrained policy optimization has been demonstrated to be a successful paradigm for tackling Offline Reinforcement Learning.

D4RL Offline RL +2

Paper
Add Code

On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation

no code implementations • 23 Nov 2022 • Thanh Nguyen-Tang, Ming Yin, Sunil Gupta, Svetha Venkatesh, Raman Arora

To the best of our knowledge, these are the first $\tilde{\mathcal{O}}(\frac{1}{K})$ bound and absolute zero sub-optimality bound respectively for offline RL with linear function approximation from adaptive data with partial coverage.

Offline RL reinforcement-learning +1

Paper
Add Code

Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient

no code implementations • 3 Oct 2022 • Ming Yin, Mengdi Wang, Yu-Xiang Wang

Offline reinforcement learning, which aims at optimizing sequential decision-making strategies with historical data, has been extensively applied in real-life applications.

Decision Making Offline RL +3

Paper
Add Code

Why Quantization Improves Generalization: NTK of Binary Weight Neural Networks

no code implementations • 13 Jun 2022 • Kaiqi Zhang, Ming Yin, Yu-Xiang Wang

We propose a quasi neural network to approximate the distribution propagation, which is a neural network with continuous parameters and smooth activation function.

Quantization

Paper
Add Code

Offline Stochastic Shortest Path: Learning, Evaluation and Towards Optimality

no code implementations • 10 Jun 2022 • Ming Yin, Wenjing Chen, Mengdi Wang, Yu-Xiang Wang

Goal-oriented Reinforcement Learning, where the agent needs to reach the goal state while simultaneously minimizing the cost, has received significant attention in real-world applications.

Paper
Add Code

A Holistic Framework for Analyzing the COVID-19 Vaccine Debate

1 code implementation • NAACL 2022 • Maria Leonor Pacheco, Tunazzina Islam, Monal Mahajan, Andrey Shor, Ming Yin, Lyle Ungar, Dan Goldwasser

The Covid-19 pandemic has led to infodemic of low quality information leading to poor health decisions.

Sentiment Analysis

Paper
Code

Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism

no code implementations • 11 Mar 2022 • Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang

However, a precise understanding of the statistical limits with function representations, remains elusive, even when such a representation is linear.

Decision Making reinforcement-learning +1

Paper
Add Code

Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost

no code implementations • 13 Feb 2022 • Dan Qiao, Ming Yin, Ming Min, Yu-Xiang Wang

In this paper, we propose a new algorithm based on stage-wise exploration and adaptive policy elimination that achieves a regret of $\widetilde{O}(\sqrt{H^4S^2AT})$ while requiring a switching cost of $O(HSA \log\log T)$.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Towards Instance-Optimal Offline Reinforcement Learning with Pessimism

no code implementations • NeurIPS 2021 • Ming Yin, Yu-Xiang Wang

We study the offline reinforcement learning (offline RL) problem, where the goal is to learn a reward-maximizing policy in an unknown Markov Decision Process (MDP) using the data coming from a policy $\mu$.

Offline RL reinforcement-learning +1

Paper
Add Code

Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings

no code implementations • NeurIPS 2021 • Ming Yin, Yu-Xiang Wang

This work studies the statistical limits of uniform convergence for offline policy evaluation (OPE) problems with model-based methods (for episodic MDP) and provides a unified framework towards optimal learning for several well-motivated offline tasks.

Offline RL

Paper
Add Code

Near-Optimal Offline Reinforcement Learning via Double Variance Reduction

no code implementations • NeurIPS 2021 • Ming Yin, Yu Bai, Yu-Xiang Wang

Our main result shows that OPDVR provably identifies an $\epsilon$-optimal policy with $\widetilde{O}(H^2/d_m\epsilon^2)$ episodes of offline data in the finite-horizon stationary transition setting, where $H$ is the horizon length and $d_m$ is the minimal marginal state-action distribution induced by the behavior policy.

Offline RL reinforcement-learning +1

Paper
Add Code

Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning

no code implementations • 7 Jul 2020 • Ming Yin, Yu Bai, Yu-Xiang Wang

The problem of Offline Policy Evaluation (OPE) in Reinforcement Learning (RL) is a critical step towards applying RL in real-life applications.

Offline RL reinforcement-learning +1

Paper
Add Code

Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning

no code implementations • 29 Jan 2020 • Ming Yin, Yu-Xiang Wang

We consider the problem of off-policy evaluation for reinforcement learning, where the goal is to estimate the expected reward of a target policy $\pi$ using offline data collected by running a logging policy $\mu$.

Off-policy evaluation reinforcement-learning

Paper
Add Code

Shared Generative Latent Representation Learning for Multi-view Clustering

1 code implementation • 23 Jul 2019 • Ming Yin, Weitian Huang, Junbin Gao

Clustering multi-view data has been a fundamental research topic in the computer vision community.

Clustering Representation Learning

Paper
Code

Low-rank Multi-view Clustering in Third-Order Tensor Space

no code implementations • 30 Aug 2016 • Ming Yin, Junbin Gao, Shengli Xie, Yi Guo

Multi-view subspace clustering is based on the fact that the multi-view data are generated from a latent subspace.

Clustering Multi-view Subspace Clustering

Paper
Add Code

Neighborhood Preserved Sparse Representation for Robust Classification on Symmetric Positive Definite Matrices

no code implementations • 27 Jan 2016 • Ming Yin, Shengli Xie, Yi Guo, Junbin Gao, Yun Zhang

Due to its promising classification performance, sparse representation based classification(SRC) algorithm has attracted great attention in the past few years.

Classification General Classification +2

Paper
Add Code

Kernel Sparse Subspace Clustering on Symmetric Positive Definite Manifolds

no code implementations • CVPR 2016 • Ming Yin, Yi Guo, Junbin Gao, Zhaoshui He, Shengli Xie

Sparse subspace clustering (SSC), as one of the most successful subspace clustering methods, has achieved notable clustering accuracy in computer vision tasks.

Clustering

Paper
Add Code

Supervised learning of sparse context reconstruction coefficients for data representation and classification

no code implementations • 18 Aug 2015 • Xuejie Liu, Jingbin Wang, Ming Yin, Benjamin Edwards, Peijuan Xu

Context of data points, which is usually defined as the other data points in a data set, has been found to play important roles in data representation and classification.

Classification General Classification

Paper
Add Code

Representing data by sparse combination of contextual data points for classification

no code implementations • 30 Jun 2015 • Jing-Yan Wang, Yihua Zhou, Ming Yin, Shaochang Chen, Benjamin Edwards

In this objective, the reconstruction error is minimized and the coefficient spar- sity is encouraged.

General Classification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.