Search Results for author: Yutong Wang

Found 37 papers, 17 papers with code

MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation

no code implementations18 Feb 2025 Yutong Wang, Pengliang Ji, Chaoqun Yang, Kaixin Li, Ming Hu, Jiaoyang Li, Guillaume Sartoretti

The LLM-as-a-Judge paradigm shows promise for evaluating generative content but lacks reliability in reasoning-intensive scenarios, such as programming.

global-optimization Large Language Model

Make-A-Character 2: Animatable 3D Character Generation From a Single Image

no code implementations14 Jan 2025 Lin Liu, Yutong Wang, Jiahao Chen, Jianfang Li, Tangli Xue, Longlong Li, Jianqiang Ren, Liefeng Bo

This report introduces Make-A-Character 2, an advanced system for generating high-quality 3D characters from single portrait photographs, ideal for game development and digital human applications.

Image to 3D

UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility

1 code implementation4 Jan 2025 Yonglin Tian, Fei Lin, Yiduo Li, Tengchao Zhang, Qiyao Zhang, Xuan Fu, Jun Huang, Xingyuan Dai, Yutong Wang, Chunwei Tian, Bai Li, Yisheng Lv, Levente Kovács, Fei-Yue Wang

Low-altitude mobility, exemplified by unmanned aerial vehicles (UAVs), has introduced transformative advancements across various domains, like transportation, logistics, and agriculture.

The Implicit Bias of Gradient Descent on Separable Multiclass Data

no code implementations2 Nov 2024 Hrithik Ravi, Clayton Scott, Daniel Soudry, Yutong Wang

Implicit bias describes the phenomenon where optimization-based training algorithms, without explicit regularization, show a preference for simple estimators even when more complex estimators have equal objective values.

Binary Classification Classification

Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding

no code implementations28 Oct 2024 He Jiang, Yutong Wang, Rishi Veerapaneni, Tanishq Duhan, Guillaume Sartoretti, Jiaoyang Li

Lifelong Multi-Agent Path Finding (LMAPF) is a variant of MAPF where agents are continually assigned new goals, necessitating frequent re-planning to accommodate these dynamic changes.

Imitation Learning Multi-Agent Path Finding

Herald: A Natural Language Annotated Lean 4 Dataset

no code implementations9 Oct 2024 Guoxiong Gao, Yutong Wang, Jiedong Jiang, Qi Gao, Zihan Qin, Tianyi Xu, Bin Dong

A significant challenge in training LLMs for these formal languages is the lack of parallel datasets that align natural language with formal language proofs.

Math Mathematical Reasoning

GOReloc: Graph-based Object-Level Relocalization for Visual SLAM

1 code implementation15 Aug 2024 Yutong Wang, Chaoyang Jiang, Xieyuanli Chen

It determines the pose of a camera sensor by robustly associating the object detections in the current frame with 3D objects in a lightweight object-level map.

Object object-detection +1

TasTe: Teaching Large Language Models to Translate through Self-Reflection

1 code implementation12 Jun 2024 Yutong Wang, Jiali Zeng, Xuebo Liu, Fandong Meng, Jie zhou, Min Zhang

The evaluation results in four language directions on the WMT22 benchmark reveal the effectiveness of our approach compared to existing methods.

Instruction Following Machine Translation +2

AMCEN: An Attention Masking-based Contrastive Event Network for Two-stage Temporal Knowledge Graph Reasoning

no code implementations16 May 2024 Jing Yang, Xiao Wang, Yutong Wang, Jiawei Wang, Fei-Yue Wang

To achieve more accurate TKG reasoning, we propose an attention masking-based contrastive event network (AMCEN) with local-global temporal patterns for the two-stage prediction of future events.

Contrastive Learning Knowledge Graphs +1

Sim2Real in Reconstructive Spectroscopy: Deep Learning with Augmented Device-Informed Data Simulation

1 code implementation19 Mar 2024 Jiyi Chen, Pengyu Li, Yutong Wang, Pei-Cheng Ku, Qing Qu

This work proposes a deep learning (DL)-based framework, namely Sim2Real, for spectral signal reconstruction in reconstructive spectroscopy, focusing on efficient data sampling and fast inference time.

Data Augmentation

Near-Interpolators: Rapid Norm Growth and the Trade-Off between Interpolation and Generalization

1 code implementation12 Mar 2024 Yutong Wang, Rishi Sonthalia, Wei Hu

Under a random matrix theoretic assumption on the data distribution and an eigendecay assumption on the data covariance matrix $\boldsymbol{\Sigma}$, we demonstrate that any near-interpolator exhibits rapid norm growth: for $\tau$ fixed, $\boldsymbol{\beta}$ has squared $\ell_2$-norm $\mathbb{E}[\|{\boldsymbol{\beta}}\|_{2}^{2}] = \Omega(n^{\alpha})$ where $n$ is the number of samples and $\alpha >1$ is the exponent of the eigendecay, i. e., $\lambda_i(\boldsymbol{\Sigma}) \sim i^{-\alpha}$.

VOOM: Robust Visual Object Odometry and Mapping using Hierarchical Landmarks

1 code implementation21 Feb 2024 Yutong Wang, Chaoyang Jiang, Xieyuanli Chen

Meanwhile, local bundle adjustment is performed utilizing the objects and points-based covisibility graphs in our visual object mapping process.

Computational Efficiency Object +1

Make-A-Character: High Quality Text-to-3D Character Generation within Minutes

no code implementations24 Dec 2023 Jianqiang Ren, Chao He, Lin Liu, Jiahao Chen, Yutong Wang, Yafei Song, Jianfang Li, Tangli Xue, Siqi Hu, Tao Chen, Kunkun Zheng, Jianjing Xiang, Liefeng Bo

There is a growing demand for customized and expressive 3D characters with the emergence of AI agents and Metaverse, but creating 3D characters using traditional computer graphics tools is a complex and time-consuming task.

3D Generation Text to 3D

Unified Binary and Multiclass Margin-Based Classification

no code implementations29 Nov 2023 Yutong Wang, Clayton Scott

The notion of margin loss has been central to the development and analysis of algorithms for binary classification.

Binary Classification Classification +1

Neural Collapse in Multi-label Learning with Pick-all-label Loss

1 code implementation24 Oct 2023 Pengyu Li, Xiao Li, Yutong Wang, Qing Qu

We study deep neural networks for the multi-label classification (MLab) task through the lens of neural collapse (NC).

All Multi-class Classification +4

Diversity from Human Feedback

no code implementations10 Oct 2023 Ren-Jian Wang, Ke Xue, Yutong Wang, Peng Yang, Haobo Fu, Qiang Fu, Chao Qian

DivHF learns a behavior descriptor consistent with human preference by querying human feedback.

Combinatorial Optimization Diversity +1

Benign Overfitting and Grokking in ReLU Networks for XOR Cluster Data

no code implementations4 Oct 2023 Zhiwei Xu, Yutong Wang, Spencer Frei, Gal Vardi, Wei Hu

Second, they can undergo a period of classical, harmful overfitting -- achieving a perfect fit to training data with near-random performance on test data -- before transitioning ("grokking") to near-optimal generalization later in training.

On Classification-Calibration of Gamma-Phi Losses

no code implementations14 Feb 2023 Yutong Wang, Clayton D. Scott

Gamma-Phi losses constitute a family of multiclass classification loss functions that generalize the logistic and other common losses, and have found application in the boosting literature.

Classification

Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution

1 code implementation9 Aug 2022 Ke Xue, Yutong Wang, Cong Guan, Lei Yuan, Haobo Fu, Qiang Fu, Chao Qian, Yang Yu

Generating agents that can achieve zero-shot coordination (ZSC) with unseen partners is a new challenge in cooperative multi-agent reinforcement learning (MARL).

Multi-agent Reinforcement Learning

MORE: A Metric Learning Based Framework for Open-domain Relation Extraction

1 code implementation1 Jun 2022 Yutong Wang, Renze Lou, Kai Zhang, MaoYan Chen, Yujiu Yang

To address these problems, in this work, we propose a novel learning framework named MORE (Metric learning-based Open Relation Extraction).

Clustering Metric Learning +2

Consistent Interpolating Ensembles via the Manifold-Hilbert Kernel

no code implementations19 May 2022 Yutong Wang, Clayton D. Scott

Recent research in the theory of overparametrized learning has sought to establish generalization guarantees in the interpolating regime.

Learning from Label Proportions by Learning with Label Noise

1 code implementation4 Mar 2022 Jianxin Zhang, Yutong Wang, Clayton Scott

Learning from label proportions (LLP) is a weakly supervised classification problem where data points are grouped into bags, and the label proportions within each bag are observed instead of the instance-level labels.

Weakly Supervised Classification

FCMNet: Full Communication Memory Net for Team-Level Cooperation in Multi-Agent Systems

1 code implementation28 Jan 2022 Yutong Wang, Guillaume Sartoretti

There, our comparison results show that FCMNet outperforms state-of-the-art communication-based reinforcement learning methods in all StarCraft II micromanagement tasks, and value decomposition methods in certain tasks.

Decision Making reinforcement-learning +3

VC dimension of partially quantized neural networks in the overparametrized regime

1 code implementation ICLR 2022 Yutong Wang, Clayton D. Scott

Indeed, existing applications of VC theory to large networks obtain upper bounds on VC dimension that are proportional to the number of weights, and for a large class of networks, these upper bound are known to be tight.

An Exact Solver for the Weston-Watkins SVM Subproblem

1 code implementation10 Feb 2021 Yutong Wang, Clayton D. Scott

Recent empirical evidence suggests that the Weston-Watkins support vector machine is among the best performing multiclass extensions of the binary SVM.

Weston-Watkins Hinge Loss and Ordered Partitions

no code implementations NeurIPS 2020 Yutong Wang, Clayton D. Scott

A recent empirical comparison of nine such formulations [Do\v{g}an et al. 2016] recommends the variant proposed by Weston and Watkins (WW), despite the fact that the WW-hinge loss is not calibrated with respect to the 0-1 loss.

Weak Supervision Enhanced Generative Network for Question Generation

no code implementations1 Jul 2019 Yutong Wang, Jiyuan Zheng, Qijiong Liu, Zhou Zhao, Jun Xiao, Yueting Zhuang

More specifically, we devise a discriminator, Relation Guider, to capture the relations between the whole passage and the associated answer and then the Multi-Interaction mechanism is deployed to transfer the knowledge dynamically for our question generation system.

Decoder Question Answering +2

Cannot find the paper you are looking for? You can Submit a new open access paper.