Search Results for author: Yifei Yang

Found 25 papers, 6 papers with code

Aspect-based Sentiment Analysis as Machine Reading Comprehension

no code implementations COLING 2022 Yifei Yang, Hai Zhao

Existing studies typically handle aspect-based sentiment analysis by stacking multiple neural modules, which inevitably result in severe error propagation.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Nested Named Entity Recognition as Corpus Aware Holistic Structure Parsing

no code implementations COLING 2022 Yifei Yang, Zuchao Li, Hai Zhao

Thus in order to address this mismatch, this work models the full nested NEs in a sentence as a holistic structure, then we propose a holistic structure parsing algorithm to disclose the entire NEs once for all.

Domain Adaptation named-entity-recognition +4

Hypertext Entity Extraction in Webpage

no code implementations4 Mar 2024 Yifei Yang, Tianqiao Liu, Bo Shao, Hai Zhao, Linjun Shou, Ming Gong, Daxin Jiang

Webpage entity extraction is a fundamental natural language processing task in both research and applications.

Head-wise Shareable Attention for Large Language Models

1 code implementation19 Feb 2024 Zouying Cao, Yifei Yang, Hai Zhao

In this paper, we present a perspective on $\textit{$\textbf{head-wise shareable attention for large language models}$}$.

LaCo: Large Language Model Pruning via Layer Collapse

1 code implementation17 Feb 2024 Yifei Yang, Zouying Cao, Hai Zhao

Large language models (LLMs) based on transformer are witnessing a notable trend of size expansion, which brings considerable costs to both model training and inference.

Knowledge Distillation Language Modelling +2

FreDF: Learning to Forecast in Frequency Domain

no code implementations4 Feb 2024 Hao Wang, Licheng Pan, Zhichao Chen, Degui Yang, Sen Zhang, Yifei Yang, Xinggao Liu, Haoxuan Li, DaCheng Tao

Time series modeling is uniquely challenged by the presence of autocorrelation in both historical and label sequences.

Time Series

Monotone Generative Modeling via a Gromov-Monge Embedding

no code implementations2 Nov 2023 Wonjun Lee, Yifei Yang, Dongmian Zou, Gilad Lerman

Generative Adversarial Networks (GANs) are powerful tools for creating new content, but they face challenges such as sensitivity to starting conditions and mode collapse.

AutoHall: Automated Hallucination Dataset Generation for Large Language Models

no code implementations30 Sep 2023 Zouying Cao, Yifei Yang, Hai Zhao

While Large language models (LLMs) have garnered widespread applications across various domains due to their powerful language understanding and generation capabilities, the detection of non-factual or hallucinatory content generated by LLMs remains scarce.

Fact Checking Hallucination

BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer

1 code implementation1 Jul 2023 Zuchao Li, Shitou Zhang, Hai Zhao, Yifei Yang, Dongjie Yang

BatGPT is a large-scale language model designed and trained jointly by Wuhan University and Shanghai Jiao Tong University.

Language Modelling Question Answering +1

CMMLU: Measuring massive multitask language understanding in Chinese

1 code implementation15 Jun 2023 Haonan Li, Yixuan Zhang, Fajri Koto, Yifei Yang, Hai Zhao, Yeyun Gong, Nan Duan, Timothy Baldwin

As the capabilities of large language models (LLMs) continue to advance, evaluating their performance becomes increasingly crucial and challenging.

Large Language Model

RefGPT: Dialogue Generation of GPT, by GPT, and for GPT

1 code implementation24 May 2023 Dongjie Yang, Ruifeng Yuan, Yuantao Fan, Yifei Yang, Zili Wang, Shusen Wang, Hai Zhao

Therefore, we propose a method called RefGPT to generate enormous truthful and customized dialogues without worrying about factual errors caused by the model hallucination.

Dialogue Generation Hallucination

Attack Named Entity Recognition by Entity Boundary Interference

no code implementations9 May 2023 Yifei Yang, Hongqiu Wu, Hai Zhao

This is due to the fine-grained nature of NER, as even minor word changes in the sentence can result in the emergence or mutation of any entities, resulting in invalid adversarial examples.

named-entity-recognition Named Entity Recognition +3

UrbanGIRAFFE: Representing Urban Scenes as Compositional Generative Neural Feature Fields

no code implementations ICCV 2023 Yuanbo Yang, Yifei Yang, Hanlei Guo, Rong Xiong, Yue Wang, Yiyi Liao

Generating photorealistic images with controllable camera pose and scene contents is essential for many applications including AR/VR and simulation.

3D-Aware Image Synthesis Object

Learning Transformation-Predictive Representations for Detection and Description of Local Features

no code implementations CVPR 2023 ZiHao Wang, Chunxu Wu, Yifei Yang, Zhen Li

The task of key-points detection and description is to estimate the stable location and discriminative representation of local features, which is essential for image matching.

Contrastive Learning

Open-Set Object Detection Using Classification-free Object Proposal and Instance-level Contrastive Learning

no code implementations21 Nov 2022 Zhongxiang Zhou, Yifei Yang, Yue Wang, Rong Xiong

To disambiguate unknown objects and background in the first subtask, we propose to use classification-free region proposal network (CF-RPN) which estimates the objectness score of each region purely using cues from object's location and shape preventing overfitting to the training categories.

Contrastive Learning Object +4

Graph Neural Network Based Node Deployment for Throughput Enhancement

no code implementations19 Aug 2022 Yifei Yang, Dongmian Zou, Xiaofan He

Besides, we show that an expressive GNN has the capacity to approximate both the function value and the gradients of a multivariate permutation-invariant function, as a theoretic support to the proposed method.

Nested Named Entity Recognition as Holistic Structure Parsing

no code implementations17 Apr 2022 Yifei Yang, Zuchao Li, Hai Zhao

Thus in order to address this mismatch, this work models the full nested NEs in a sentence as a holistic structure, then we propose a holistic structure parsing algorithm to disclose the entire NEs once for all.

Domain Adaptation named-entity-recognition +4

Rethinking Controllable Variational Autoencoders

no code implementations CVPR 2022 Huajie Shao, Yifei Yang, Haohong Lin, Longzhong Lin, Yizhuo Chen, Qinmin Yang, Han Zhao

It has shown success in a variety of applications, such as image generation, disentangled representation learning, and language modeling.

Disentanglement Image Generation +1

Improving unsupervised anomaly localization by applying multi-scale memories to autoencoders

no code implementations21 Dec 2020 Yifei Yang, Shibing Xiang, Ruixiang Zhang

Autoencoder and its variants have been widely applicated in anomaly detection. The previous work memory-augmented deep autoencoder proposed memorizing normality to detect anomaly, however it neglects the feature discrepancy between different resolution scales, therefore we introduce multi-scale memories to record scale-specific features and multi-scale attention fuser between the encoding and decoding module of the autoencoder for anomaly detection, namely MMAE. MMAE updates slots at corresponding resolution scale as prototype features during unsupervised learning.

Anomaly Detection

Person Re-identification in Aerial Imagery

1 code implementation14 Aug 2019 Shizhou Zhang, Qi Zhang, Yifei Yang, Xing Wei, Peng Wang, Bingliang Jiao, Yanning Zhang

Our method can learn a discriminative and compact feature representation for ReID in aerial imagery and can be trained in an end-to-end fashion efficiently.

object-detection Object Detection +1

Vehicle Re-identification in Aerial Imagery: Dataset and Approach

no code implementations ICCV 2019 Peng Wang, Bingliang Jiao, Lu Yang, Yifei Yang, Shizhou Zhang, Wei Wei, Yanning Zhang

It is capable of explicitly detecting discriminative parts for each specific vehicle and significantly outperforms the evaluated baselines and state-of-the-art vehicle ReID approaches.

Vehicle Re-Identification

Cannot find the paper you are looking for? You can Submit a new open access paper.