Search Results for author: Jiaqi Chen

Found 29 papers, 18 papers with code

S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

1 code implementation18 Feb 2025 Ruotian Ma, Peisong Wang, Cheng Liu, Xingyan Liu, Jiaqi Chen, Bang Zhang, Xin Zhou, Nan Du, Jia Li

In this work, we introduce S$^2$R, an efficient framework that enhances LLM reasoning by teaching models to self-verify and self-correct during inference.

Math

CRoF: CLIP-based Robust Few-shot Learning on Noisy Labels

no code implementations17 Dec 2024 Shizhuo Deng, Bowen Han, Jiaqi Chen, Hao Wang, Dongyue Chen, Tong Jia

Noisy labels threaten the robustness of few-shot learning (FSL) due to the inexact features in a new domain.

Domain Generalization Few-Shot Learning +1

SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition

1 code implementation22 Oct 2024 Jiaqi Chen, Yan Yang, Shizhuo Deng, Da Teng, Liyuan Pan

To address these issues, we propose the SpikMamba framework, which combines the energy efficiency of spiking neural networks and the long sequence modeling capability of Mamba to efficiently capture global features from spatially sparse and high a temporal resolution event data.

Action Recognition Autonomous Driving +2

AFlow: Automating Agentic Workflow Generation

2 code implementations14 Oct 2024 Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu, Fengwei Teng, Xionghui Chen, Jiaqi Chen, Mingchen Zhuge, Xin Cheng, Sirui Hong, Jinlin Wang, Bingnan Zheng, Bang Liu, Yuyu Luo, Chenglin Wu

Large language models (LLMs) have demonstrated remarkable potential in solving complex tasks across diverse domains, typically by employing agentic workflows that follow detailed instructions and operational sequences.

Code Generation

NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild

1 code implementation CVPR 2024 Weining Ren, Zihan Zhu, Boyang Sun, Jiaqi Chen, Marc Pollefeys, Songyou Peng

Neural Radiance Fields (NeRFs) have shown remarkable success in synthesizing photorealistic views from multi-view images of static scenes, but face challenges in dynamic, real-world environments with distractors like moving objects, shadows, and lighting changes.

NeRF

"Where am I?" Scene Retrieval with Language

no code implementations22 Apr 2024 Jiaqi Chen, Daniel Barath, Iro Armeni, Marc Pollefeys, Hermann Blum

We define this task as "language-based scene-retrieval" and it is closely related to "coarse-localization," but we are instead searching for a match from a collection of disjoint scenes and not necessarily a large-scale continuous map.

Retrieval

NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

1 code implementation12 Mar 2024 Bingqian Lin, Yunshuang Nie, Ziming Wei, Jiaqi Chen, Shikui Ma, Jianhua Han, Hang Xu, Xiaojun Chang, Xiaodan Liang

Vision-and-Language Navigation (VLN), as a crucial research problem of Embodied AI, requires an embodied agent to navigate through complex 3D environments following natural language instructions.

Navigate Vision and Language Navigation

S-Agents: Self-organizing Agents in Open-ended Environments

1 code implementation7 Feb 2024 Jiaqi Chen, Yuxian Jiang, Jiachen Lu, Li Zhang

Leveraging large language models (LLMs), autonomous agents have significantly improved, gaining the ability to handle a variety of tasks.

Minecraft

MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation

no code implementations14 Jan 2024 Jiaqi Chen, Bingqian Lin, ran Xu, Zhenhua Chai, Xiaodan Liang, Kwan-Yee K. Wong

Embodied agents equipped with GPT as their brains have exhibited extraordinary decision-making and generalization abilities across various tasks.

Decision Making Vision and Language Navigation

A Data-Driven Integrated Framework for Fast-Charging Facility Planning using Multi-Period Bi-Objective Optimization

no code implementations21 Nov 2023 Mingjia He, Panchamy Krishnakumari, Ding Luo, Jiaqi Chen

The case study on the Amsterdam highway network shows how existing traffic data can be used to generate more realistic charging demand scenarios and how it can be integrated and evaluated within the optimization framework for facility planning.

Generative Semantic Segmentation

2 code implementations CVPR 2023 Jiaqi Chen, Jiachen Lu, Xiatian Zhu, Li Zhang

To that end, the segmentation mask is expressed with a special type of image (dubbed as maskige).

Segmentation Semantic Segmentation

UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression

2 code implementations6 Dec 2022 Jiaqi Chen, Tong Li, Jinghui Qin, Pan Lu, Liang Lin, Chongyu Chen, Xiaodan Liang

Naturally, we also present a unified multi-task Geometric Transformer framework, Geoformer, to tackle calculation and proving problems simultaneously in the form of sequence generation, which finally shows the reasoning ability can be improved on both two tasks by unifying formulation.

Geometry Problem Solving Logical Reasoning +1

DeepInteraction: 3D Object Detection via Modality Interaction

2 code implementations23 Aug 2022 Zeyu Yang, Jiaqi Chen, Zhenwei Miao, Wei Li, Xiatian Zhu, Li Zhang

Existing top-performance 3D object detectors typically rely on the multi-modal fusion strategy.

3D Object Detection Decoder +3

Unbiased Math Word Problems Benchmark for Mitigating Solving Bias

2 code implementations Findings (NAACL) 2022 Zhicheng Yang, Jinghui Qin, Jiaqi Chen, Xiaodan Liang

However, current solvers exist solving bias which consists of data bias and learning bias due to biased dataset and improper training strategy.

Math

LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning

2 code implementations17 May 2022 Zhicheng Yang, Jinghui Qin, Jiaqi Chen, Liang Lin, Xiaodan Liang

To address this issue and make a step towards interpretable MWP solving, we first construct a high-quality MWP dataset named InterMWP which consists of 11, 495 MWPs and annotates interpretable logical formulas based on algebraic knowledge as the grounded linguistic logic of each solution equation.

Math Math Word Problem Solving

UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model

1 code implementation ICCV 2021 Haonan Yan, Jiaqi Chen, Xujie Zhang, Shengkai Zhang, Nianhong Jiao, Xiaodan Liang, Tianxiang Zheng

However, the popular DensePose-COCO dataset relies on a sophisticated manual annotation system, leading to severe limitations in acquiring the denser and more accurate annotated pose resources.

3D Reconstruction

IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning

1 code implementation25 Oct 2021 Pan Lu, Liang Qiu, Jiaqi Chen, Tony Xia, Yizhou Zhao, Wei zhang, Zhou Yu, Xiaodan Liang, Song-Chun Zhu

Also, we develop a strong IconQA baseline Patch-TRM that applies a pyramid cross-modal Transformer with input diagram embeddings pre-trained on the icon dataset.

Arithmetic Reasoning Math Word Problem Solving +2

GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning

1 code implementation Findings (ACL) 2021 Jiaqi Chen, Jianheng Tang, Jinghui Qin, Xiaodan Liang, Lingbo Liu, Eric P. Xing, Liang Lin

Therefore, we propose a Geometric Question Answering dataset GeoQA, containing 4, 998 geometric problems with corresponding annotated programs, which illustrate the solving process of the given problems.

Math Mathematical Reasoning +1

A Guide to Reducing Carbon Emissions through Data Center Geographical Load Shifting

no code implementations19 May 2021 Julia Lindberg, Yasmine Abdennadher, Jiaqi Chen, Bernard C. Lesieutre, Line Roald

This paper focuses on how data centers can use their geographic load flexibility to reduce carbon emissions through clever interactions with electricity markets.

Constructing Canonical Feynman Integrals with Intersection Theory

no code implementations7 Aug 2020 Jiaqi Chen, Xuhang Jiang, Xiaofeng Xu, Li Lin Yang

Canonical Feynman integrals are of great interest in the study of scattering amplitudes at the multi-loop level.

High Energy Physics - Theory High Energy Physics - Phenomenology

Efficient Crowd Counting via Structured Knowledge Transfer

2 code implementations23 Mar 2020 Lingbo Liu, Jiaqi Chen, Hefeng Wu, Tianshui Chen, Guanbin Li, Liang Lin

Crowd counting is an application-oriented task and its inference efficiency is crucial for real-world applications.

Crowd Counting Transfer Learning

Robot Calligraphy using Pseudospectral Optimal Control in Conjunction with a Simulated Brush Model

no code implementations18 Nov 2019 Sen Wang, Jiaqi Chen, Xuanliang Deng, Seth Hutchinson, Frank Dellaert

Chinese calligraphy is a unique form of art that has great artistic value but is difficult to master.

Robotics

Cannot find the paper you are looking for? You can Submit a new open access paper.