Search Results for author: Xinyu Hu

Found 15 papers, 5 papers with code

Themis: Towards Flexible and Interpretable NLG Evaluation

1 code implementation26 Jun 2024 Xinyu Hu, Li Lin, Mingqi Gao, Xunjian Yin, Xiaojun Wan

The evaluation of natural language generation (NLG) tasks is a significant and longstanding research issue.

nlg evaluation Text Generation

Task Oriented In-Domain Data Augmentation

1 code implementation24 Jun 2024 Xiao Liang, Xinyu Hu, Simiao Zuo, Yeyun Gong, Qiang Lou, Yi Liu, Shao-Lun Huang, Jian Jiao

On average, TRAIT improves LLM performance by 8% in the advertisement domain and 7. 5% in the math domain.

Data Augmentation Math

MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency

no code implementations19 Jun 2024 Junzhe Zhang, Huixuan Zhang, Xunjian Yin, Baizhou Huang, Xu Zhang, Xinyu Hu, Xiaojun Wan

Our benchmark facilitates independent correction of misreading and misrecognition errors by editing the corresponding knowledge component.

knowledge editing

Are LLM-based Evaluators Confusing NLG Quality Criteria?

2 code implementations19 Feb 2024 Xinyu Hu, Mingqi Gao, Sen Hu, Yang Zhang, Yicheng Chen, Teng Xu, Xiaojun Wan

Some prior work has shown that LLMs perform well in NLG evaluation for different tasks.

nlg evaluation

LLM-based NLG Evaluation: Current Status and Challenges

no code implementations2 Feb 2024 Mingqi Gao, Xinyu Hu, Jie Ruan, Xiao Pu, Xiaojun Wan

Evaluating natural language generation (NLG) is a vital but challenging problem in artificial intelligence.

nlg evaluation Text Generation

Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing

no code implementations20 Oct 2023 Xinyu Hu, Pengfei Tang, Simiao Zuo, Zihan Wang, Bowen Song, Qiang Lou, Jian Jiao, Denis Charles

In Evoke, there are two instances of a same LLM: one as a reviewer (LLM-Reviewer), it scores the current prompt; the other as an author (LLM-Author), it edits the prompt by considering the edit history and the reviewer's feedback.

Logical Fallacy Detection

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency

1 code implementation16 Jul 2023 Bowen Song, Soo Min Kwon, Zecheng Zhang, Xinyu Hu, Qing Qu, Liyue Shen

However, training diffusion models in the pixel space are both data-intensive and computationally demanding, which restricts their applicability as priors for high-dimensional real-world data such as medical images.

Decoder

DeepTagger: Knowledge Enhanced Named Entity Recognition for Web-Based Ads Queries

no code implementations30 Jun 2023 Simiao Zuo, Pengfei Tang, Xinyu Hu, Qiang Lou, Jian Jiao, Denis Charles

For model-free enhancement, we collect unlabeled web queries to augment domain knowledge; and we collect web search results to enrich the information of ads queries.

Data Augmentation named-entity-recognition +2

Error-Robust Retrieval for Chinese Spelling Check

1 code implementation15 Nov 2022 Xunjian Yin, Xinyu Hu, Jin Jiang, Xiaojun Wan

Chinese Spelling Check (CSC) aims to detect and correct error tokens in Chinese contexts, which has a wide range of applications.

Retrieval

Implementation of improved RGBD 3D target detection model based on FPGA heterogeneous computing architecture

no code implementations 2022/08/15 2022 Yu Wang, Wenbin, FENG Chongchong YU, Xinyu Hu, Yuqiu ZHANG4

In order to solve the problems of low model accuracy, poor computing power, poor parallel ability and excessive power consumption in the deployment of RGBD based 3 D target detection model at the embedded end, this paper first proposes an improved RGBD 3 D target detection model based on ENet semantic segmentation model, which takes ENet as the semantic segmentation network, RGB image and depth information are fused to realize 3 D target detection. Secondly, in order to apply the model at the edge, this paper constructs a lightweight network and cuts the network in the down-sampling stage of ENet model. Finally, this paper uses Xilinx ZCU104 as the hardware development kit, which takes FPGA as the auxiliary parallel operation unit and ARM as the main operation unit. It is a heterogeneous computing architecture with the ability to deal with complex operations. The architecture uses FPGA to accelerate the depth model in parallel, which improves the operation speed and reduces the power consumption. The test results of the model on ZCU104 are compared with other hardware. The results show that while ensuring the accuracy, the power consumption of the heterogeneous computing architecture used in this paper is 93% lowerthan that of Intel Xeon e5-2620 v4 CPU, the speed is 12 times higher, and the speed is more than 180 times higher than that of ARM Cortex-A53 commonly used at the edge.

Semantic Segmentation

DeeprETA: An ETA Post-processing System at Scale

no code implementations5 Jun 2022 Xinyu Hu, Tanmay Binaykiya, Eric Frank, Olcay Cirit

Estimated Time of Arrival (ETA) plays an important role in delivery and ride-hailing platforms.

regression

5th Place Solution for VSPW 2021 Challenge

no code implementations13 Dec 2021 Jiafan Zhuang, Yixin Zhang, Xinyu Hu, Junjie Li, Zilei Wang

In this article, we introduce the solution we used in the VSPW 2021 Challenge.

Semantic Segmentation

A Survey of Knowledge Enhanced Pre-trained Models

no code implementations1 Oct 2021 Jian Yang, Xinyu Hu, Gang Xiao, Yulong Shen

Pre-trained language models learn informative word representations on a large-scale text corpus through self-supervised learning, which has achieved promising performance in fields of natural language processing (NLP) after fine-tuning.

Logical Reasoning Representation Learning +1

When does loss-based prioritization fail?

no code implementations16 Jul 2021 Niel Teng Hu, Xinyu Hu, Rosanne Liu, Sara Hooker, Jason Yosinski

Each example is propagated forward and backward through the network the same amount of times, independent of how much the example contributes to the learning protocol.

Applying SVGD to Bayesian Neural Networks for Cyclical Time-Series Prediction and Inference

no code implementations17 Jan 2019 Xinyu Hu, Paul Szerlip, Theofanis Karaletsos, Rohit Singh

A regression-based BNN model is proposed to predict spatiotemporal quantities like hourly rider demand with calibrated uncertainties.

regression Time Series +1

Cannot find the paper you are looking for? You can Submit a new open access paper.