Search Results for author: Junzhe Wang

Found 10 papers, 8 papers with code

VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training

1 code implementation12 Mar 2024 Mohammad Nazeri, Junzhe Wang, Amirreza Payandeh, Xuesu Xiao

However, most robotic visual navigation methods rely on deep learning models pre-trained on vision tasks, which prioritize salient objects -- not necessarily relevant to navigation and potentially misleading.

Self-Supervised Learning Visual Navigation

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

1 code implementation8 Feb 2024 Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, wei he, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

In this paper, we propose R$^3$: Learning Reasoning through Reverse Curriculum Reinforcement Learning (RL), a novel method that employs only outcome supervision to achieve the benefits of process supervision for large language models.

GSM8K reinforcement-learning +1

Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER

no code implementations10 Oct 2022 Ruotian Ma, Xuanting Chen, Lin Zhang, Xin Zhou, Junzhe Wang, Tao Gui, Qi Zhang, Xiang Gao, Yunwen Chen

In this work, we conduct an empirical study on the "Unlabeled Entity Problem" and find that it leads to severe confusion between "O" and entities, decreasing class discrimination of old classes and declining the model's ability to learn new classes.

class-incremental learning Class Incremental Learning +4

Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents

1 code implementation Findings (ACL) 2022 Yicheng Zou, Hongwei Liu, Tao Gui, Junzhe Wang, Qi Zhang, Meng Tang, Haixiang Li, Daniel Wang

Text semantic matching is a fundamental task that has been widely used in various scenarios, such as community question answering, information retrieval, and recommendation.

Community Question Answering Information Retrieval +2

ATPFL: Automatic Trajectory Prediction Model Design Under Federated Learning Framework

no code implementations CVPR 2022 Chunnan Wang, Xiang Chen, Junzhe Wang, Hongzhi Wang

Although the Trajectory Prediction (TP) model has achieved great success in computer vision and robotics fields, its architecture and training scheme design rely on heavy manual work and domain knowledge, which is not friendly to common users.

Federated Learning Trajectory Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.