Search Results for author: Liangyu Chen

Found 22 papers, 11 papers with code

Points as Queries: Weakly Semi-supervised Object Detection by Points

1 code implementation CVPR 2021 Liangyu Chen, Tong Yang, Xiangyu Zhang, Wei zhang, Jian Sun

We propose a novel point annotated setting for the weakly semi-supervised object detection task, in which the dataset comprises small fully annotated images and large weakly annotated images by points.

object-detection Object Detection +1

HINet: Half Instance Normalization Network for Image Restoration

2 code implementations13 May 2021 Liangyu Chen, Xin Lu, Jie Zhang, Xiaojie Chu, Chengpeng Chen

Specifically, we present a novel block: Half Instance Normalization Block (HIN Block), to boost the performance of image restoration networks.

Deblurring Image Deblurring +3

Multi-task Graph Convolutional Neural Network for Calcification Morphology and Distribution Analysis in Mammograms

no code implementations14 May 2021 Hao Du, Melissa Min-Szu Yao, Liangyu Chen, Wing P. Chan, Mengling Feng

In this study, we proposed a multi-task deep graph convolutional network (GCN) method for the automatic characterization of morphology and distribution of microcalcifications in mammograms.

Graph Classification Graph Learning

Improving Image Restoration by Revisiting Global Information Aggregation

2 code implementations8 Dec 2021 Xiaojie Chu, Liangyu Chen, Chengpeng Chen, Xin Lu

Our TLC converts global operations to local ones only during inference so that they aggregate features within local spatial regions rather than the entire large images.

Color Image Denoising Deblurring +7

Simple Baselines for Image Restoration

9 code implementations10 Apr 2022 Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, Jian Sun

Although there have been significant advances in the field of image restoration recently, the system complexity of the state-of-the-art (SOTA) methods is increasing as well, which may hinder the convenient analysis and comparison of methods.

Deblurring Image Deblurring +2

NAFSSR: Stereo Image Super-Resolution Using NAFNet

4 code implementations19 Apr 2022 Xiaojie Chu, Liangyu Chen, Wenqing Yu

This paper inherits a strong and simple image restoration model, NAFNet, for single-view feature extraction and extends it by adding cross attention modules to fuse features between views to adapt to binocular scenarios.

Image Restoration Stereo Image Super-Resolution

BEIKE NLP at SemEval-2022 Task 4: Prompt-Based Paragraph Classification for Patronizing and Condescending Language Detection

no code implementations SemEval (NAACL) 2022 Yong Deng, Chenxiao Dou, Liangyu Chen, Deqiang Miao, Xianghui Sun, Baochang Ma, Xiangang Li

PCL detection task is aimed at identifying and categorizing language that is patronizing or condescending towards vulnerable communities in the general media. Compared to other NLP tasks of paragraph classification, the negative language presented in the PCL detection task is usually more implicit and subtle to be recognized, making the performance of common text-classification approaches disappointed.

Binary Condescension Detection Multi-Label Classification +1

Automatic tagging of knowledge points for K12 math problems

no code implementations21 Aug 2022 Xiaolu Wang, Ziqi Ding, Liangyu Chen

In this paper, K12 math problems taken as the research object, the LABS model based on label-semantic attention and multi-label smoothing combining textual features is proposed to improve the automatic tagging of knowledge points for math problems.

Math text-classification +1

Making Your First Choice: To Address Cold Start Problem in Vision Active Learning

1 code implementation5 Oct 2022 Liangyu Chen, Yutong Bai, Siyu Huang, Yongyi Lu, Bihan Wen, Alan L. Yuille, Zongwei Zhou

However, we uncover a striking contradiction to this promise: active learning fails to select data as efficiently as random selection at the first few choices.

Active Learning Contrastive Learning

Otter: A Multi-Modal Model with In-Context Instruction Tuning

1 code implementation5 May 2023 Bo Li, Yuanhan Zhang, Liangyu Chen, Jinghao Wang, Jingkang Yang, Ziwei Liu

Large language models (LLMs) have demonstrated significant universal capabilities as few/zero-shot learners in various tasks due to their pre-training on vast amounts of text data, as exemplified by GPT-3, which boosted to InstrctGPT and ChatGPT, effectively following natural language instructions to accomplish real-world tasks.

In-Context Learning Instruction Following +2

Benchmarking and Analyzing Generative Data for Visual Recognition

no code implementations25 Jul 2023 Bo Li, Haotian Liu, Liangyu Chen, Yong Jae Lee, Chunyuan Li, Ziwei Liu

Advancements in large pre-trained generative models have expanded their potential as effective data generators in visual recognition.

Benchmarking Retrieval

ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation

1 code implementation28 Jul 2023 Cheng Wen, Xianghui Sun, Shuaijiang Zhao, Xiaoquan Fang, Liangyu Chen, Wei Zou

This paper presents the development and evaluation of ChatHome, a domain-specific language model (DSLM) designed for the intricate field of home renovation.

Language Modelling

LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation

1 code implementation16 Oct 2023 Ruiqi Wu, Liangyu Chen, Tong Yang, Chunle Guo, Chongyi Li, Xiangyu Zhang

Specifically, we design a first-frame-conditioned pipeline that uses an off-the-shelf text-to-image model for content generation so that our tuned video diffusion model mainly focuses on motion learning.

Image Animation Text-to-Image Generation +2

DUMA: a Dual-Mind Conversational Agent with Fast and Slow Thinking

no code implementations27 Oct 2023 Xiaoyu Tian, Liangyu Chen, Na Liu, Yaxuan Liu, Wei Zou, Kaijiang Chen, Ming Cui

The fast thinking model serves as the primary interface for external interactions and initial response generation, evaluating the necessity for engaging the slow thinking model based on the complexity of the complete response.

Response Generation

Panoptic Video Scene Graph Generation

3 code implementations CVPR 2023 Jingkang Yang, Wenxuan Peng, Xiangtai Li, Zujin Guo, Liangyu Chen, Bo Li, Zheng Ma, Kaiyang Zhou, Wayne Zhang, Chen Change Loy, Ziwei Liu

PVSG relates to the existing video scene graph generation (VidSGG) problem, which focuses on temporal interactions between humans and objects grounded with bounding boxes in videos.

Graph Generation Panoptic Scene Graph Generation +5

FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability

no code implementations6 Dec 2023 Linze Li, Sunqi Fan, Hengjun Pu, Zhaodong Bing, Yao Tang, Tianzhu Ye, Tong Yang, Liangyu Chen, Jiajun Liang

Our method's efficacy has been validated on multiple representative DreamBooth and LoRA models, delivering substantial improvements over the original outcomes in terms of facial fidelity, text-to-image editability, and video motion.

Face Model Video Generation

From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models

no code implementations5 Jan 2024 Na Liu, Liangyu Chen, Xiaoyu Tian, Wei Zou, Kaijiang Chen, Ming Cui

This paper introduces RAISE (Reasoning and Acting through Scratchpad and Examples), an advanced architecture enhancing the integration of Large Language Models (LLMs) like GPT-4 into conversational agents.

MMInA: Benchmarking Multihop Multimodal Internet Agents

no code implementations15 Apr 2024 Ziniu Zhang, Shulin Tian, Liangyu Chen, Ziwei Liu

To answer this question, we present MMInA, a multihop and multimodal benchmark to evaluate the embodied agents for compositional Internet tasks, with several appealing properties: 1) Evolving real-world multimodal websites.

Benchmarking

Cannot find the paper you are looking for? You can Submit a new open access paper.