Search Results for author: Junyu Lu

Found 27 papers, 10 papers with code

Astrea: A MOE-based Visual Understanding Model with Progressive Alignment

no code implementations12 Mar 2025 Xiaoda Yang, Junyu Lu, Hongshun Qiu, Sijing Li, Hao Li, Shengpeng Ji, Xudong Tang, Jiayang Xu, Jiaqi Duan, Ziyue Jiang, Cong Lin, Sihang Cai, Zejian Xie, Zhuoyang Song, Songxin Zhang

Vision-Language Models (VLMs) based on Mixture-of-Experts (MoE) architectures have emerged as a pivotal paradigm in multimodal understanding, offering a powerful framework for integrating visual and linguistic information.

Contrastive Learning Cross-Modal Retrieval +2

Unveiling the Capabilities of Large Language Models in Detecting Offensive Language with Annotation Disagreement

no code implementations10 Feb 2025 Junyu Lu, Kai Ma, Kaichun Wang, Kelaiti Xiao, Roy Ka-Wei Lee, Bo Xu, Liang Yang, Hongfei Lin

Large Language Models (LLMs) have become essential for offensive language detection, yet their ability to handle annotation disagreement remains underexplored.

Binary Classification Decision Making +1

Towards Comprehensive Detection of Chinese Harmful Memes

1 code implementation3 Oct 2024 Junyu Lu, Bo Xu, Xiaokun Zhang, Hongbo Wang, Haohao Zhu, Dongyu Zhang, Liang Yang, Hongfei Lin

Harmful memes have proliferated on the Chinese Internet, while research on detecting Chinese harmful memes significantly lags behind due to the absence of reliable datasets and effective detectors.

Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector

1 code implementation8 Sep 2024 Hongbo Wang, Junyu Lu, Yan Han, Kai Ma, Liang Yang, Hongfei Lin

Patronizing and Condescending Language (PCL) is a form of discriminatory toxic speech targeting vulnerable groups, threatening both online and offline safety.

Form

Integrating Multi-view Analysis: Multi-view Mixture-of-Expert for Textual Personality Detection

1 code implementation16 Aug 2024 Haohao Zhu, Xiaokun Zhang, Junyu Lu, Liang Yang, Hongfei Lin

Experimental results on two widely-used personality detection datasets demonstrate the effectiveness of the MvP model and the benefits of automatically analyzing user posts from diverse perspectives for textual personality detection.

Take its Essence, Discard its Dross! Debiasing for Toxic Language Detection via Counterfactual Causal Effect

1 code implementation3 Jun 2024 Junyu Lu, Bo Xu, Xiaokun Zhang, Kaiyuan Liu, Dongyu Zhang, Liang Yang, Hongfei Lin

Current methods of toxic language detection (TLD) typically rely on specific tokens to conduct decisions, which makes them suffer from lexical bias, leading to inferior performance and generalization.

counterfactual Counterfactual Inference +2

Enhancing Textual Personality Detection toward Social Media: Integrating Long-term and Short-term Perspectives

no code implementations23 Apr 2024 Haohao Zhu, Xiaokun Zhang, Junyu Lu, Youlin Wu, Zewen Bai, Changrong Min, Liang Yang, Bo Xu, Dongyu Zhang, Hongfei Lin

This limitation hinders a comprehensive understanding of individuals' personalities, as both stable traits and dynamic states are vital.

Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with Large Vision-Language Model Support

1 code implementation26 Jan 2024 XiaoJun Wu, Dixiang Zhang, Ruyi Gan, Junyu Lu, Ziwei Wu, Renliang Sun, Jiaxing Zhang, Pingjian Zhang, Yan Song

Recent advancements in text-to-image models have significantly enhanced image generation capabilities, yet a notable gap of open-source models persists in bilingual or Chinese language support.

Language Modeling Language Modelling +1

Unlocking the Potential of Large Language Models for Explainable Recommendations

1 code implementation25 Dec 2023 Yucong Luo, Mingyue Cheng, Hao Zhang, Junyu Lu, Qi Liu, Enhong Chen

In this study, we propose LLMXRec, a simple yet effective two-stage explainable recommendation framework aimed at further boosting the explanation quality by employing LLMs.

Decision Making Explainable Recommendation +2

Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects

no code implementations8 Dec 2023 Junyu Lu, Dixiang Zhang, Songxin Zhang, Zejian Xie, Zhuoyang Song, Cong Lin, Jiaxing Zhang, BingYi Jing, Pingjian Zhang

During the instruction fine-tuning stage, we introduce semantic-aware visual feature extraction, a crucial method that enables the model to extract informative features from concrete visual objects.

Image Captioning object-detection +5

iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image Diffusion Model for Interior Design

no code implementations7 Dec 2023 Ruyi Gan, XiaoJun Wu, Junyu Lu, Yuanhe Tian, Dixiang Zhang, Ziwei Wu, Renliang Sun, Chang Liu, Jiaxing Zhang, Pingjian Zhang, Yan Song

However, there are few specialized models in certain domains, such as interior design, which is attributed to the complex textual descriptions and detailed visual elements inherent in design, alongside the necessity for adaptable resolution.

Image Generation

Ziya2: Data-centric Learning is All LLMs Need

no code implementations6 Nov 2023 Ruyi Gan, Ziwei Wu, Renliang Sun, Junyu Lu, XiaoJun Wu, Dixiang Zhang, Kunhao Pan, Junqing He, Yuanhe Tian, Ping Yang, Qi Yang, Hao Wang, Jiaxing Zhang, Yan Song

Although many such issues are addressed along the line of research on LLMs, an important yet practical limitation is that many studies overly pursue enlarging model sizes without comprehensively analyzing and optimizing the use of pre-training data in their learning process, as well as appropriate organization and leveraging of such data in training LLMs under cost-effective settings.

All

Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning

no code implementations12 Oct 2023 Junyu Lu, Dixiang Zhang, XiaoJun Wu, Xinyu Gao, Ruyi Gan, Jiaxing Zhang, Yan Song, Pingjian Zhang

Recent advancements enlarge the capabilities of large language models (LLMs) in zero-shot image-to-text generation and understanding by integrating multi-modal inputs.

Image Captioning Image-text Retrieval +8

Hate Speech Detection via Dual Contrastive Learning

no code implementations10 Jul 2023 Junyu Lu, Hongfei Lin, Xiaokun Zhang, Zhaoqing Li, Tongyue Zhang, Linlin Zong, Fenglong Ma, Bo Xu

Our framework jointly optimizes the self-supervised and the supervised contrastive learning loss for capturing span-level information beyond the token-level emotional semantics used in existing models, particularly detecting speech containing abusive and insulting words.

Contrastive Learning Hate Speech Detection

UniEX: An Effective and Efficient Framework for Unified Information Extraction via a Span-extractive Perspective

no code implementations17 May 2023 Ping Yang, Junyu Lu, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Jiaxing Zhang, Pingjian Zhang

We propose a new paradigm for universal information extraction (IE) that is compatible with any schema format and applicable to a list of IE tasks, such as named entity recognition, relation extraction, event extraction and sentiment analysis.

Event Extraction named-entity-recognition +3

Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmarks

1 code implementation8 May 2023 Junyu Lu, Bo Xu, Xiaokun Zhang, Changrong Min, Liang Yang, Hongfei Lin

In addition, it is crucial to introduce lexical knowledge to detect the toxicity of posts, which has been a challenge for researchers.

Hate Speech Detection

Flat Multi-modal Interaction Transformer for Named Entity Recognition

no code implementations COLING 2022 Junyu Lu, Dixiang Zhang, Pingjian Zhang

Then, we transform the fine-grained semantic representation of the vision and text into a unified lattice structure and design a novel relative position encoding to match different modalities in Transformer.

Boundary Detection Multi-modal Named Entity Recognition +2

Unified BERT for Few-shot Natural Language Understanding

no code implementations24 Jun 2022 Junyu Lu, Ping Yang, Ruyi Gan, Jing Yang, Jiaxing Zhang

Even as pre-trained language models share a semantic encoder, natural language understanding suffers from a diversity of output schemas.

Diversity Natural Language Understanding

SiNER: A Large Dataset for Sindhi Named Entity Recognition

no code implementations LREC 2020 Wazir Ali, Junyu Lu, Zenglin Xu

We introduce the SiNER: a named entity recognition (NER) dataset for low-resourced Sindhi language with quality baselines.

named-entity-recognition Named Entity Recognition +1

Word Embedding based New Corpus for Low-resourced Language: Sindhi

no code implementations28 Nov 2019 Wazir Ali, Jay Kumar, Junyu Lu, Zenglin Xu

Our intrinsic evaluation results demonstrate the high quality of our generated Sindhi word embeddings using SG, CBoW, and GloVe as compare to SdfastText word representations.

Word Embeddings

Cannot find the paper you are looking for? You can Submit a new open access paper.