Search Results for author: Pingjian Zhang

Found 8 papers, 0 papers with code

Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with Large Vision-Language Model Support

no code implementations • 26 Jan 2024 • XiaoJun Wu, Dixiang Zhang, Ruyi Gan, Junyu Lu, Ziwei Wu, Renliang Sun, Jiaxing Zhang, Pingjian Zhang, Yan Song

Recent advancements in text-to-image models have significantly enhanced image generation capabilities, yet a notable gap of open-source models persists in bilingual or Chinese language support.

Language Modelling, Text-to-Image Generation

Unified Lattice Graph Fusion for Chinese Named Entity Recognition

no code implementations • 28 Dec 2023 • Dixiang Zhang, Junyu Lu, Pingjian Zhang

To solve this issue, we propose a Unified Lattice Graph Fusion (ULGF) approach for Chinese NER.

Chinese Named Entity Recognition, named-entity-recognition, +2

Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects

no code implementations • 8 Dec 2023 • Junyu Lu, Dixiang Zhang, Songxin Zhang, Zejian Xie, Zhuoyang Song, Cong Lin, Jiaxing Zhang, BingYi Jing, Pingjian Zhang

During the instruction fine-tuning stage, we introduce semantic-aware visual feature extraction, a crucial method that enables the model to extract informative features from concrete visual objects.

Ranked #1 on Image Captioning on nocaps entire

Image Captioning, object-detection, +5

iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image Diffusion Model for Interior Design

no code implementations • 7 Dec 2023 • Ruyi Gan, XiaoJun Wu, Junyu Lu, Yuanhe Tian, Dixiang Zhang, Ziwei Wu, Renliang Sun, Chang Liu, Jiaxing Zhang, Pingjian Zhang, Yan Song

However, there are few specialized models in certain domains, such as interior design, which is attributed to the complex textual descriptions and detailed visual elements inherent in design, alongside the necessity for adaptable resolution.

Image Generation

Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning

no code implementations • 12 Oct 2023 • Junyu Lu, Dixiang Zhang, XiaoJun Wu, Xinyu Gao, Ruyi Gan, Jiaxing Zhang, Yan Song, Pingjian Zhang

Recent advancements enlarge the capabilities of large language models (LLMs) in zero-shot image-to-text generation and understanding by integrating multi-modal inputs.

Image Captioning, In-Context Learning, +5

UniEX: An Effective and Efficient Framework for Unified Information Extraction via a Span-extractive Perspective

no code implementations • 17 May 2023 • Ping Yang, Junyu Lu, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Jiaxing Zhang, Pingjian Zhang

We propose a new paradigm for universal information extraction (IE) that is compatible with any schema format and applicable to a list of IE tasks, such as named entity recognition, relation extraction, event extraction and sentiment analysis.

Event Extraction, named-entity-recognition, +3

Flat Multi-modal Interaction Transformer for Named Entity Recognition

no code implementations • COLING 2022 • Junyu Lu, Dixiang Zhang, Pingjian Zhang

Then, we transform the fine-grained semantic representation of the vision and text into a unified lattice structure and design a novel relative position encoding to match different modalities in Transformer.

Boundary Detection, Multi-modal Named Entity Recognition, +2

Entity Candidate Network for Whole-Aware Named Entity Recognition

no code implementations • 29 Apr 2020 • Wendong He, Yizhen Shao, Pingjian Zhang

ECNet identifies the full span of a named entity and its type at each position based on Entity Loss.

coreference-resolution, named-entity-recognition, +6
