Search Results for author: Jiajun Zhang

Found 112 papers, 34 papers with code

Entity-level Cross-modal Learning Improves Multi-modal Machine Translation

no code implementations • Findings (EMNLP) 2021 • Xin Huang, Jiajun Zhang, Chengqing Zong

Inspired by the findings of (CITATION) that entities are most informative in the image, we propose an explicit entity-level cross-modal learning approach that aims to augment the entity representation.

Machine Translation Representation Learning +1

Cross-Modal Cloze Task: A New Task to Brain-to-Word Decoding

1 code implementation • Findings (ACL) 2022 • Shuxian Zou, Shaonan Wang, Jiajun Zhang, Chengqing Zong

More importantly, it demonstrates that it is feasible to decode a certain word within a large vocabulary from its neural brain activity.

Binary Classification Language Modelling

Addressing Asymmetry in Multilingual Neural Machine Translation with Fuzzy Task Clustering

no code implementations • COLING 2022 • Qian Wang, Jiajun Zhang

However, the existing clustering methods based on language similarity cannot handle the asymmetric problem in multilingual NMT, i.e., one translation task A can benefit from another translation task B but task B will be harmed by task A.

Clustering Machine Translation +3

Bridging the Gap between Different Vocabularies for LLM Ensemble

1 code implementation • 15 Apr 2024 • Yangyifan Xu, Jinliang Lu, Jiajun Zhang

Ensembling different large language models (LLMs) to unleash their complementary potential and harness their individual strengths is highly valuable.

Arithmetic Reasoning Data-to-Text Generation +1

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning

no code implementations • 26 Mar 2024 • Yuelin Bai, Xinrun Du, Yiming Liang, Yonggang Jin, Ziqiang Liu, Junting Zhou, Tianyu Zheng, Xincheng Zhang, Nuo Ma, Zekun Wang, Ruibin Yuan, Haihong Wu, Hongquan Lin, Wenhao Huang, Jiajun Zhang, Wenhu Chen, Chenghua Lin, Jie Fu, Min Yang, Shiwen Ni, Ge Zhang

To bridge this gap, we introduce COIG-CQIA, a high-quality Chinese instruction tuning dataset.

DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning

no code implementations • 7 Mar 2024 • Xingwei Qu, Yiming Liang, Yucheng Wang, Tianyu Zheng, Tommy Yue, Lei Ma, Stephen W. Huang, Jiajun Zhang, Wenhu Chen, Chenghua Lin, Jie Fu, Ge Zhang

It has long been assumed that the sheer number of parameters in large language models (LLMs) drives in-context learning (ICL) capabilities, enabling remarkable performance improvements by leveraging task-specific demonstrations.

Few-Shot Learning In-Context Learning +1

DPPA: Pruning Method for Large Language Model to Model Merging

1 code implementation • 5 Mar 2024 • Yaochen Zhu, Rui Xia, Jiajun Zhang

In this paper, we introduce a dual-stage method termed Dynamic Pruning Partition Amplification (DPPA), devised to tackle the challenge of merging complex fine-tuned models.

Language Modelling Large Language Model

Evolving to the Future: Unseen Event Adaptive Fake News Detection on Social Media

no code implementations • 29 Feb 2024 • Jiajun Zhang, ZHIXUN LI, Qiang Liu, Shu Wu, Liang Wang

With the rapid development of social media, the wide dissemination of fake news on social media is increasingly threatening both individuals and society.

Contrastive Learning Fake News Detection

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models

no code implementations • 20 Feb 2024 • Yizhi Li, Ge Zhang, Xingwei Qu, Jiali Li, Zhaoqun Li, Zekun Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Stephen W. Huang, Chenghua Lin, Wenhu Chen, Jie Fu

The advancement of large language models (LLMs) has enhanced the ability to generalize across a wide range of unseen natural language processing (NLP) tasks through instruction-following.

Instruction Following

A Survey on Data Selection for LLM Instruction Tuning

1 code implementation • 4 Feb 2024 • Jiahao Wang, Bolin Zhang, Qianlong Du, Jiajun Zhang, Dianhui Chu

Instruction tuning is a vital step in training large language models (LLMs), so how to enhance the effect of instruction tuning has received increased attention.

Instruction Following

Ins-HOI: Instance Aware Human-Object Interactions Recovery

1 code implementation • 15 Dec 2023 • Jiajun Zhang, Yuxiang Zhang, Hongwen Zhang, Xiao Zhou, Boyao Zhou, Ruizhi Shao, Zonghai Hu, Yebin Liu

To address this, we further propose a complementary training strategy that leverages synthetic data to introduce instance-level shape priors, enabling the disentanglement of occupancy fields for different instances.

Descriptive Disentanglement +3

BiPFT: Binary Pre-trained Foundation Transformer with Low-rank Estimation of Binarization Residual Polynomials

1 code implementation • 14 Dec 2023 • Xingrun Xing, Li Du, Xinyuan Wang, Xianlin Zeng, Yequan Wang, Zheng Zhang, Jiajun Zhang

Specifically, we first analyze the binarization error in self-attention operations and derive the polynomials of binarization error.

Binarization Natural Language Understanding

Toward Real World Stereo Image Super-Resolution via Hybrid Degradation Model and Discriminator for Implied Stereo Image Information

1 code implementation • 13 Dec 2023 • Yuanbo Zhou, Yuyang Xue, Jiang Bi, Wenlin He, Xinlin Zhang, Jiajun Zhang, Wei Deng, Ruofeng Nie, Junlin Lan, Qinquan Gao, Tong Tong

Real-world stereo image super-resolution has a significant influence on enhancing the performance of computer vision systems.

Stereo Image Super-Resolution

MoDS: Model-oriented Data Selection for Instruction Tuning

1 code implementation • 27 Nov 2023 • Qianlong Du, Chengqing Zong, Jiajun Zhang

First, our approach utilizes a quality evaluation model to filter out the high-quality subset from the original instruction dataset, and then designs an algorithm to further select from the high-quality subset a seed instruction dataset with good coverage.

Instruction Following

Align after Pre-train: Improving Multilingual Generative Models with Cross-lingual Alignment

no code implementations • 14 Nov 2023 • Chong Li, Shaonan Wang, Jiajun Zhang, Chengqing Zong

It aligns the internal sentence representations across different languages via multilingual contrastive learning and aligns model outputs by answering prompts in different languages.

Contrastive Learning Sentence

ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model

1 code implementation • 2 Nov 2023 • Jianghao Chen, Pu Jian, Tengxiao Xi, Dongyi Yi, Qianlong Du, Chenglin Ding, Guibo Zhu, Chengqing Zong, Jinqiao Wang, Jiajun Zhang

Using our proposed approach, we release the largest and latest large-scale high-quality Chinese web text dataset, ChineseWebText, which consists of 1.42 TB of data in which each text is associated with a quality score, allowing LLM researchers to choose data according to their desired quality thresholds.

Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning

1 code implementation • 16 Oct 2023 • Chong Li, Shaonan Wang, Yunhao Zhang, Jiajun Zhang, Chengqing Zong

We further propose a simple multi-task training method to increase functional specialization and mitigate negative information transfer in multi-task learning.

Multi-Task Learning

Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets

no code implementations • 10 Oct 2023 • Jiajun Zhang, Georgina Cosma, Sarah Bugby, Jason Watkins

Recently, much attention has been directed towards the retrieval of irregular patterns within industrial or medical images by extracting features from the images, such as deep features, colour-based features, shape-based features and local features.

Image Retrieval Retrieval

MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023

no code implementations • 6 Sep 2023 • Zhihang Xu, Shaofei Zhang, Xi Wang, Jiajun Zhang, Wenning Wei, Lei He, Sheng Zhao

In this paper, we present MuLanTTS, the Microsoft end-to-end neural text-to-speech (TTS) system designed for the Blizzard Challenge 2023.

Speech Synthesis

BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing

1 code implementation • 2 Sep 2023 • Chen Wang, Minpeng Liao, Zhongqiang Huang, Jinliang Lu, Junhong Wu, Yuchen Liu, Chengqing Zong, Jiajun Zhang

One is a cascaded approach where outputs (tokens or states) of a separately trained speech recognition system are used as inputs for LLMs, which limits their potential in modeling alignment between speech and text.

speech-recognition Speech Recognition +1

ForestMonkey: Toolkit for Reasoning with AI-based Defect Detection and Classification Models

no code implementations • 25 Jul 2023 • Jiajun Zhang, Georgina Cosma, Sarah Bugby, Jason Watkins

Additionally, this paper investigates the time performance of the FM toolkit when applied to four AI models with different datasets.

Defect Detection Explainable Artificial Intelligence (XAI)

Morphological Image Analysis and Feature Extraction for Reasoning with AI-based Defect Detection and Classification Models

no code implementations • 21 Jul 2023 • Jiajun Zhang, Georgina Cosma, Sarah Bugby, Axel Finke, Jason Watkins

As the use of artificial intelligence (AI) models becomes more prevalent in industries such as engineering and manufacturing, it is essential that these models provide transparent reasoning behind their predictions.

Defect Detection

ProxyCap: Real-time Monocular Full-body Capture in World Space via Human-Centric Proxy-to-Motion Learning

no code implementations • 3 Jul 2023 • Yuxiang Zhang, Hongwen Zhang, Liangxiao Hu, Jiajun Zhang, Hongwei Yi, Shengping Zhang, Yebin Liu

For more accurate and physically plausible predictions in world space, our network is designed to learn human motions from a human-centric perspective, which enables the understanding of the same motion captured with different camera trajectories.

Ranked #208 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation

BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages

2 code implementations • 29 May 2023 • Wen Yang, Chong Li, Jiajun Zhang, Chengqing Zong

Second, we continue training the model with a large-scale parallel dataset that covers 102 natural languages.

Translation

Language Cognition and Language Computation -- Human and Machine Language Understanding

no code implementations • 12 Jan 2023 • Shaonan Wang, Nai Ding, Nan Lin, Jiajun Zhang, Chengqing Zong

Language understanding is a key scientific issue in the fields of cognitive and computer science.

Life-long Learning for Multilingual Neural Machine Translation with Knowledge Distillation

no code implementations • 6 Dec 2022 • Yang Zhao, Junnan Zhu, Lu Xiang, Jiajun Zhang, Yu Zhou, FeiFei Zhai, Chengqing Zong

To alleviate the CF, we investigate knowledge distillation based life-long learning methods.

Knowledge Distillation Machine Translation +1

Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation

1 code implementation • 18 Oct 2022 • Chen Wang, Yuchen Liu, Boxing Chen, Jiajun Zhang, Wei Luo, Zhongqiang Huang, Chengqing Zong

Existing zero-shot methods fail to align the two modalities of speech and text into a shared semantic space, resulting in much worse performance compared to the supervised ST methods.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

1st Place Solutions for the UVO Challenge 2022

no code implementations • 18 Oct 2022 • Jiajun Zhang, BoYu Chen, Zhilong Ji, Jinfeng Bai, Zonghai Hu

This paper describes the approach we have taken in the challenge.

object-detection Object Detection +1

Other Roles Matter! Enhancing Role-Oriented Dialogue Summarization via Role Interactions

2 code implementations • ACL 2022 • Haitao Lin, Junnan Zhu, Lu Xiang, Yu Zhou, Jiajun Zhang, Chengqing Zong

Therefore, we propose a novel role interaction enhanced method for role-oriented dialogue summarization.

A Roadmap for Big Model

no code implementations • 26 Mar 2022 • Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, HuaWei Shen, HUI ZHANG, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan YAO, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Chujie Zheng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, LiWei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang

With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm.

Language Modelling Machine Translation +1

Learning Confidence for Transformer-based Neural Machine Translation

1 code implementation • ACL 2022 • Yu Lu, Jiali Zeng, Jiajun Zhang, Shuangzhi Wu, Mu Li

Confidence estimation aims to quantify the confidence of the model prediction, providing an expectation of success.

Machine Translation NMT +2

Instance-aware Prompt Learning for Language Understanding and Generation

1 code implementation • 18 Jan 2022 • Feihu Jin, Jinliang Lu, Jiajun Zhang, Chengqing Zong

Specifically, we suppose that each learnable prompt token has a different contribution to different instances, and we learn the contribution by calculating the relevance score between an instance and each prompt token.

Few-Shot Learning

Parameter Differentiation based Multilingual Neural Machine Translation

2 code implementations • 27 Dec 2021 • Qian Wang, Jiajun Zhang

Further analyses reveal that the parameter sharing configuration obtained by our method correlates well with the linguistic proximities.

Machine Translation Open-Ended Question Answering +2

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark

no code implementations • 27 Dec 2021 • Yuan YAO, Qingxiu Dong, Jian Guan, Boxi Cao, Zhengyan Zhang, Chaojun Xiao, Xiaozhi Wang, Fanchao Qi, Junwei Bao, Jinran Nie, Zheni Zeng, Yuxian Gu, Kun Zhou, Xuancheng Huang, Wenhao Li, Shuhuai Ren, Jinliang Lu, Chengqiang Xu, Huadong Wang, Guoyang Zeng, Zile Zhou, Jiajun Zhang, Juanzi Li, Minlie Huang, Rui Yan, Xiaodong He, Xiaojun Wan, Xin Zhao, Xu sun, Yang Liu, Zhiyuan Liu, Xianpei Han, Erhong Yang, Zhifang Sui, Maosong Sun

We argue that for general-purpose language intelligence evaluation, the benchmark itself needs to be comprehensive and systematic.

Deep Learning-based Segmentation of Cerebral Aneurysms in 3D TOF-MRA using Coarse-to-Fine Framework

no code implementations • 26 Oct 2021 • Meng Chen, Chen Geng, Dongdong Wang, Jiajun Zhang, Ruoyu Di, Fengmei Li, Zhiyong Zhou, Sirong Piao, Yuxin Li, Yaikang Dai

The segmentation metrics we used include DSC, HD, and VS.

Segmentation

Towards Brain-to-Text Generation: Neural Decoding with Pre-trained Encoder-Decoder Models

no code implementations • NeurIPS Workshop AI4Scien 2021 • Shuxian Zou, Shaonan Wang, Jiajun Zhang, Chengqing Zong

However, most of the existing studies have focused on discriminating which one in two stimuli corresponds to the given brain image, which is far from directly generating text from neural activities.

Text Generation

Exploiting Curriculum Learning in Unsupervised Neural Machine Translation

1 code implementation • Findings (EMNLP) 2021 • Jinliang Lu, Jiajun Zhang

Back-translation (BT) has become one of the de facto components in unsupervised neural machine translation (UNMT), and it explicitly makes UNMT have translation ability.

Machine Translation Translation

CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization

2 code implementations • EMNLP 2021 • Haitao Lin, Liqun Ma, Junnan Zhu, Lu Xiang, Yu Zhou, Jiajun Zhang, Chengqing Zong

Therefore, in this paper, we introduce a novel Chinese dataset for Customer Service Dialogue Summarization (CSDS).

Augmenting Slot Values and Contexts for Spoken Language Understanding with Pretrained Models

1 code implementation • 19 Aug 2021 • Haitao Lin, Lu Xiang, Yu Zhou, Jiajun Zhang, Chengqing Zong

We propose two strategies for the finetuning process: value-based and context-based augmentation.

Data Augmentation slot-filling +2

Attention Calibration for Transformer in Neural Machine Translation

no code implementations • ACL 2021 • Yu Lu, Jiali Zeng, Jiajun Zhang, Shuangzhi Wu, Mu Li

Attention mechanisms have achieved substantial improvements in neural machine translation by dynamically selecting relevant inputs for different predictions.

Machine Translation Translation

Gaze Estimation with an Ensemble of Four Architectures

1 code implementation • 5 Jul 2021 • Xin Cai, BoYu Chen, Jiabei Zeng, Jiajun Zhang, Yunjia Sun, Xiao Wang, Zhilong Ji, Xiao Liu, Xilin Chen, Shiguang Shan

This paper presents a method for gaze estimation according to face images.

Gaze Estimation

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation

2 code implementations • 1 Jul 2021 • Jing Liu, Xinxin Zhu, Fei Liu, Longteng Guo, Zijia Zhao, Mingzhen Sun, Weining Wang, Hanqing Lu, Shiyu Zhou, Jiajun Zhang, Jinqiao Wang

In this paper, we propose an Omni-perception Pre-Trainer (OPT) for cross-modal understanding and generation, by jointly modeling visual, text and audio resources.

Ranked #1 on Image Retrieval on Localized Narratives

Audio to Text Retrieval Cross-Modal Retrieval +3

Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation

1 code implementation • ACL 2021 • Yangyifan Xu, Yijin Liu, Fandong Meng, Jiajun Zhang, Jinan Xu, Jie zhou

Recently, token-level adaptive training has achieved promising improvement in machine translation, where the cross-entropy loss function is adjusted by assigning different training weights to different tokens, in order to alleviate the token imbalance problem.

Machine Translation Translation

Pre-Training on Dynamic Graph Neural Networks

1 code implementation • 24 Feb 2021 • Ke-Jia Chen, Jiajun Zhang, Linpu Jiang, Yunyun Wang, Yuxuan Dai

This paper proposes a pre-training method on dynamic graph neural networks (PT-DGNN), which uses dynamic attributed graph generation tasks to simultaneously learn the structure, semantics, and evolution features of the graph.

Graph Generation Graph Sampling +1

Touch Editing: A Flexible One-Time Interaction Approach for Translation

no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Qian Wang, Jiajun Zhang, Lemao Liu, Guoping Huang, Chengqing Zong

We propose a touch-based editing method for translation, which is more flexible than traditional keyboard-mouse-based translation postediting.

Translation

Knowledge Graph Enhanced Neural Machine Translation via Multi-task Learning on Sub-entity Granularity

no code implementations • COLING 2020 • Yang Zhao, Lu Xiang, Junnan Zhu, Jiajun Zhang, Yu Zhou, Chengqing Zong

Previous studies combining knowledge graph (KG) with neural machine translation (NMT) have two problems: i) Knowledge under-utilization: they only focus on the entities that appear in both KG and training sentence pairs, making much knowledge in KG unable to be fully utilized.

Machine Translation Multi-Task Learning +3

Multimodal Sentence Summarization via Multimodal Selective Encoding

no code implementations • COLING 2020 • Haoran Li, Junnan Zhu, Jiajun Zhang, Xiaodong He, Chengqing Zong

Thus, we propose a multimodal selective gate network that considers reciprocal relationships between textual and multi-level visual features, including global image descriptor, activation grids, and object proposals, to select highlights of the event when encoding the source sentence.

Sentence Sentence Summarization

Distill and Replay for Continual Language Learning

no code implementations • COLING 2020 • Jingyuan Sun, Shaonan Wang, Jiajun Zhang, Chengqing Zong

The framework is based on language models and can be smoothly built with different language model architectures.

Language Modelling Natural Language Understanding

Deep Template Matching for Pedestrian Attribute Recognition with the Auxiliary Supervision of Attribute-wise Keypoints

no code implementations • 13 Nov 2020 • Jiajun Zhang, Pengyuan Ren, Jianmin Li

Pedestrian Attribute Recognition (PAR) has aroused extensive attention due to its important role in video surveillance scenarios.

Attribute Pedestrian Attribute Recognition +1

Bridging the Modality Gap for Speech-to-Text Translation

no code implementations • 28 Oct 2020 • Yuchen Liu, Junnan Zhu, Jiajun Zhang, Chengqing Zong

End-to-end speech translation aims to translate speech in one language into text in another language via an end-to-end way.

Speech-to-Text Translation Translation

Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

no code implementations • EMNLP 2020 • Xiaomian Kang, Yang Zhao, Jiajun Zhang, Chengqing Zong

Specifically, we introduce a selection module that is independent of the translation module to score each candidate context sentence.

Machine Translation reinforcement-learning +3

CASIA's System for IWSLT 2020 Open Domain Translation

no code implementations • WS 2020 • Qian Wang, Yuchen Liu, Cong Ma, Yu Lu, Yining Wang, Long Zhou, Yang Zhao, Jiajun Zhang, Cheng-qing Zong

This paper describes CASIA's system for the IWSLT 2020 open domain translation task.

Knowledge Distillation Machine Translation +1

Attend, Translate and Summarize: An Efficient Method for Neural Cross-Lingual Summarization

no code implementations • ACL 2020 • Junnan Zhu, Yu Zhou, Jiajun Zhang, Cheng-qing Zong

Cross-lingual summarization aims at summarizing a document in one language (e.g., Chinese) into another language (e.g., English).

Translation

Improving Autoregressive NMT with Non-Autoregressive Model

no code implementations • WS 2020 • Long Zhou, Jiajun Zhang, Cheng-qing Zong

In this work, we propose a novel Encoder-NAD-AD framework for NMT, aiming at boosting AT with global information produced by NAT model.

Knowledge Distillation Machine Translation +2

Neural Machine Translation: Challenges, Progress and Future

1 code implementation • 13 Apr 2020 • Jiajun Zhang, Cheng-qing Zong

Machine translation (MT) is a technique that leverages computers to translate human languages automatically.

Machine Translation NMT +1

Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding

1 code implementation • 16 Dec 2019 • Yuchen Liu, Jiajun Zhang, Hao Xiong, Long Zhou, Zhongjun He, Hua Wu, Haifeng Wang, Cheng-qing Zong

Speech-to-text translation (ST), which translates source language speech into target language text, has attracted intensive attention in recent years.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Modelling cosmic ray electron physics in cosmological smoothed particle hydrodynamics simulation

no code implementations • 1 Dec 2019 • Dongchao Zheng, Weitian Li, Zhenghao Zhu, Chenxi Shan, Jiajun Zhang, Linfeng Xiao, Xiaoli Lian, Dan Hu

Cosmic ray electron (CRE) acceleration and cooling are important physical processes in astrophysics.

High Energy Astrophysical Phenomena Cosmology and Nongalactic Astrophysics

Chinese Spelling Error Detection Using a Fusion Lattice LSTM

no code implementations • 25 Nov 2019 • Hao Wang, Bing Wang, Jianyong Duan, Jiajun Zhang

Spelling error detection serves as a crucial preprocessing step in many natural language processing applications.

Synchronously Generating Two Languages with Interactive Decoding

no code implementations • IJCNLP 2019 • Yining Wang, Jiajun Zhang, Long Zhou, Yuchen Liu, Cheng-qing Zong

In this paper, we introduce a novel interactive approach to translate a source language into two different languages simultaneously and interactively.

Machine Translation NMT +2

NCLS: Neural Cross-Lingual Summarization

1 code implementation • IJCNLP 2019 • Junnan Zhu, Qian Wang, Yining Wang, Yu Zhou, Jiajun Zhang, Shaonan Wang, Cheng-qing Zong

Moreover, we propose to further improve NCLS by incorporating two related tasks, monolingual summarization and machine translation, into the training process of CLS under multi-task learning.

Machine Translation Multi-Task Learning +1

Are You for Real? Detecting Identity Fraud via Dialogue Interactions

1 code implementation • IJCNLP 2019 • Weikang Wang, Jiajun Zhang, Qian Li, Cheng-qing Zong, Zhifei Li

In this paper, we focus on identity fraud detection in loan applications and propose to solve this problem with a novel interactive dialogue system which consists of two modules.

Dialogue Management Fraud Detection +1

A Compact and Language-Sensitive Multilingual Translation Method

no code implementations • ACL 2019 • Yining Wang, Long Zhou, Jiajun Zhang, FeiFei Zhai, Jingfang Xu, Cheng-qing Zong

We verify our methods on various translation scenarios, including one-to-many, many-to-many and zero-shot.

Machine Translation NMT +1

Understanding Memory Modules on Learning Simple Algorithms

no code implementations • 1 Jul 2019 • Kexin Wang, Yu Zhou, Shaonan Wang, Jiajun Zhang, Cheng-qing Zong

Recent work has shown that memory modules are crucial for the generalization ability of neural networks on learning simple algorithms.

Dimensionality Reduction

Sequence Generation: From Both Sides to the Middle

no code implementations • 23 Jun 2019 • Long Zhou, Jiajun Zhang, Cheng-qing Zong, Heng Yu

The encoder-decoder framework has achieved promising progress for many sequence generation tasks, such as neural machine translation and text summarization.

Machine Translation Sentence +2

Incremental Learning from Scratch for Task-Oriented Dialogue Systems

1 code implementation • ACL 2019 • Weikang Wang, Jiajun Zhang, Qian Li, Mei-Yuh Hwang, Cheng-qing Zong, Zhifei Li

Clarifying user needs is essential for existing task-oriented dialogue systems.

Incremental Learning Task-Oriented Dialogue Systems

Memory Consolidation for Contextual Spoken Language Understanding with Dialogue Logistic Inference

no code implementations • ACL 2019 • He Bai, Yu Zhou, Jiajun Zhang, Cheng-qing Zong

Dialogue contexts are proven helpful in the spoken language understanding (SLU) system and they are typically encoded with explicit memory representations.

Retrieval slot-filling +2

Synchronous Bidirectional Neural Machine Translation

2 code implementations • TACL 2019 • Long Zhou, Jiajun Zhang, Cheng-qing Zong

In this paper, we introduce a synchronous bidirectional neural machine translation (SB-NMT) that predicts its outputs using left-to-right and right-to-left decoding simultaneously and interactively, in order to leverage both of the history and future information at the same time.

Ranked #28 on Machine Translation on WMT2014 English-German

Machine Translation NMT +1

End-to-End Speech Translation with Knowledge Distillation

no code implementations • 17 Apr 2019 • Yuchen Liu, Hao Xiong, Zhongjun He, Jiajun Zhang, Hua Wu, Haifeng Wang, Cheng-qing Zong

End-to-end speech translation (ST), which directly translates from source language speech into target language text, has attracted intensive attention in recent years.

Knowledge Distillation speech-recognition +2

Synchronous Bidirectional Inference for Neural Sequence Generation

1 code implementation • 24 Feb 2019 • Jiajun Zhang, Long Zhou, Yang Zhao, Cheng-qing Zong

In this work, we propose a synchronous bidirectional inference model to generate outputs using both left-to-right and right-to-left decoding simultaneously and interactively.

Abstractive Text Summarization Machine Translation +1

Language-Independent Representor for Neural Machine Translation

no code implementations • 1 Nov 2018 • Long Zhou, Yuchen Liu, Jiajun Zhang, Cheng-qing Zong, Guoping Huang

Current Neural Machine Translation (NMT) employs a language-specific encoder to represent the source sentence and adopts a language-specific decoder to generate target translation.

Machine Translation Multi-Task Learning +3

Three Strategies to Improve One-to-Many Multilingual Translation

no code implementations • EMNLP 2018 • Yining Wang, Jiajun Zhang, FeiFei Zhai, Jingfang Xu, Cheng-qing Zong

However, previous studies show that one-to-many translation based on this framework cannot perform on par with the individually trained models.

Machine Translation Multi-Task Learning +1

Addressing Troublesome Words in Neural Machine Translation

no code implementations • EMNLP 2018 • Yang Zhao, Jiajun Zhang, Zhongjun He, Cheng-qing Zong, Hua Wu

One of the weaknesses of Neural Machine Translation (NMT) is in handling low-frequency and ambiguous words, which we refer to as troublesome words.

Machine Translation NMT +1

MSMO: Multimodal Summarization with Multimodal Output

no code implementations • EMNLP 2018 • Junnan Zhu, Haoran Li, Tianshang Liu, Yu Zhou, Jiajun Zhang, Cheng-qing Zong

In this paper, we propose a novel task, multimodal summarization with multimodal output (MSMO).

Informativeness Text Summarization

A Teacher-Student Framework for Maintainable Dialog Manager

no code implementations • EMNLP 2018 • Weikang Wang, Jiajun Zhang, Han Zhang, Mei-Yuh Hwang, Cheng-qing Zong, Zhifei Li

Specifically, the "student" is an extended dialog manager based on a new ontology, and the "teacher" is existing resources used for guiding the learning process of the "student".

Reinforcement Learning (RL)

Associative Multichannel Autoencoder for Multimodal Word Representation

1 code implementation • EMNLP 2018 • Shaonan Wang, Jiajun Zhang, Cheng-qing Zong

In this paper we address the problem of learning multimodal word representations by integrating textual, visual and auditory inputs.

Source-Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language

no code implementations • 19 Aug 2018 • He Bai, Yu Zhou, Jiajun Zhang, Liang Zhao, Mei-Yuh Hwang, Cheng-qing Zong

This paper focuses on the language transferring task given a tiny in-domain parallel SLU corpus.

Cultural Vocal Bursts Intensity Prediction domain classification +6

Paper
Add Code

Source Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language

no code implementations • COLING 2018 • He Bai, Yu Zhou, Jiajun Zhang, Liang Zhao, Mei-Yuh Hwang, Cheng-qing Zong

An SLU corpus is a monolingual corpus with domain/intent/slot labels.

Cultural Vocal Bursts Intensity Prediction domain classification +8

Paper
Add Code

Ensure the Correctness of the Summary: Incorporate Entailment Knowledge into Abstractive Sentence Summarization

no code implementations • COLING 2018 • Haoran Li, Junnan Zhu, Jiajun Zhang, Cheng-qing Zong

In this paper, we investigate the sentence summarization task that produces a summary from a source sentence.

Ranked #7 on Text Summarization on DUC 2004 Task 1

Abstractive Text Summarization Informativeness +3

Paper
Add Code

Phrase Table as Recommendation Memory for Neural Machine Translation

no code implementations • 25 May 2018 • Yang Zhao, Yining Wang, Jiajun Zhang, Cheng-qing Zong

Neural Machine Translation (NMT) has drawn much attention due to its promising translation performance recently.

Machine Translation NMT +2

Paper
Add Code

Exploiting Pre-Ordering for Neural Machine Translation

no code implementations • LREC 2018 • Yang Zhao, Jiajun Zhang, Cheng-qing Zong

Machine Translation Translation

Paper
Add Code

Learning Multimodal Word Representation via Dynamic Fusion Methods

no code implementations • 2 Jan 2018 • Shaonan Wang, Jiajun Zhang, Cheng-qing Zong

Multimodal models have been proven to outperform text-based models on learning semantic word representations.

Paper
Add Code

Learning from Parenthetical Sentences for Term Translation in Machine Translation

no code implementations • WS 2017 • Guoping Huang, Jiajun Zhang, Yu Zhou, Cheng-qing Zong

Terms extensively exist in specific domains, and term translation plays a critical role in domain-specific machine translation (MT) tasks.

Machine Translation Sentence +1

Paper
Add Code

Investigating Inner Properties of Multimodal Representation and Semantic Compositionality with Brain-based Componential Semantics

no code implementations • 15 Nov 2017 • Shaonan Wang, Jiajun Zhang, Nan Lin, Cheng-qing Zong

Considering that multimodal models are originally motivated by human concept representations, we assume that correlating multimodal representations with brain-based semantics would interpret their inner properties to answer the above questions.

Learning Semantic Representations Natural Language Understanding

Paper
Add Code

Word, Subword or Character? An Empirical Study of Granularity in Chinese-English NMT

1 code implementation • 13 Nov 2017 • Yining Wang, Long Zhou, Jiajun Zhang, Cheng-qing Zong

Our experiments show that the subword model performs best for Chinese-to-English translation when the vocabulary is not too large, while the hybrid word-character model is most suitable for English-to-Chinese translation.

Machine Translation NMT +1

Paper
Code

Doppler-Radar Based Hand Gesture Recognition System Using Convolutional Neural Networks

no code implementations • 7 Nov 2017 • Jiajun Zhang, Jinkun Tao, Jiangtao Huangfu, Zhiguo Shi

In this paper, a Doppler Radar based hand gesture recognition system using convolutional neural networks is proposed.

Hand Gesture Recognition Hand-Gesture Recognition

Paper
Add Code

Deformable Deep Convolutional Generative Adversarial Network in Microwave Based Hand Gesture Recognition System

no code implementations • 6 Nov 2017 • Jiajun Zhang, Zhiguo Shi

Traditional vision-based hand gesture recognition systems are limited under dark circumstances.

BIG-bench Machine Learning Generative Adversarial Network +2

Paper
Add Code

Towards Neural Machine Translation with Partially Aligned Corpora

no code implementations • IJCNLP 2017 • Yining Wang, Yang Zhao, Jiajun Zhang, Cheng-qing Zong, Zhengshan Xue

While neural machine translation (NMT) has become the new paradigm, the parameter optimization requires large-scale parallel data which is scarce in many domains and language pairs.

Machine Translation NMT +2

Paper
Add Code

Exploiting Word Internal Structures for Generic Chinese Sentence Representation

no code implementations • EMNLP 2017 • Shaonan Wang, Jiajun Zhang, Cheng-qing Zong

We introduce a novel mixed character-word architecture to improve Chinese sentence representations, by utilizing rich semantic information of word internal structures.

Sentence Sentence Similarity

Paper
Add Code

Multi-modal Summarization for Asynchronous Collection of Text, Image, Audio and Video

no code implementations • EMNLP 2017 • Haoran Li, Junnan Zhu, Cong Ma, Jiajun Zhang, Cheng-qing Zong

In this work, we propose an extractive Multi-modal Summarization (MMS) method which can automatically generate a textual summary given a set of documents, images, audios and videos related to a specific topic.

Automatic Speech Recognition (ASR) Document Summarization +1

Paper
Add Code

Look-ahead Attention for Generation in Neural Machine Translation

no code implementations • 30 Aug 2017 • Long Zhou, Jiajun Zhang, Cheng-qing Zong

The attention model has become a standard component in neural machine translation (NMT) and it guides translation process by selectively focusing on parts of the source sentence when predicting each target word.

Machine Translation NMT +2

Paper
Add Code

Neural System Combination for Machine Translation

no code implementations • ACL 2017 • Long Zhou, Wenpeng Hu, Jiajun Zhang, Cheng-qing Zong

Neural machine translation (NMT) becomes a new approach to machine translation and generates much more fluent results compared to statistical machine translation (SMT).

Machine Translation NMT +1

Paper
Add Code

Shortcut Sequence Tagging

no code implementations • 3 Jan 2017 • Huijia Wu, Jiajun Zhang, Cheng-qing Zong

To simplify the stacked architecture, we propose a framework called shortcut block, which is a marriage of the gating mechanism and shortcuts, while discarding the self-connected part in LSTM cell.

POS POS Tagging

Paper
Add Code

Different Contexts Lead to Different Word Embeddings

no code implementations • COLING 2016 • Wenpeng Hu, Jiajun Zhang, Nan Zheng

Recent work on learning word representations has been applied successfully to many NLP applications, such as sentiment analysis and question answering.

Clustering Information Retrieval +3

Paper
Add Code

Ultra-Light Axion Dark Matter and its impacts on dark halo structure in $N$-body simulation

1 code implementation • 3 Nov 2016 • Jiajun Zhang, Yue-Lin Sming Tsai, Jui-Lin Kuo, Kingman Cheung, Ming-Chung Chu

The existence of the solitonic core reveals the non-linear effect of quantum pressure and impacts the structure formation in the FDM model.

Cosmology and Nongalactic Astrophysics Astrophysics of Galaxies High Energy Physics - Phenomenology

Paper
Code

Exploiting Source-side Monolingual Data in Neural Machine Translation

no code implementations • EMNLP 2016 • Jiajun Zhang, Cheng-qing Zong

Machine Translation Multi-Task Learning +1

Paper
Add Code

Bridging Neural Machine Translation and Bilingual Dictionaries

no code implementations • 24 Oct 2016 • Jiajun Zhang, Cheng-qing Zong

Neural Machine Translation (NMT) has become the new state-of-the-art in several language pairs.

Machine Translation NMT +2

Paper
Add Code

An Empirical Exploration of Skip Connections for Sequential Tagging

no code implementations • COLING 2016 • Huijia Wu, Jiajun Zhang, Cheng-qing Zong

In this paper, we empirically explore the effects of various kinds of skip connections in stacked bidirectional LSTMs for sequential tagging.

CCG Supertagging POS +1

Paper
Add Code

A Dynamic Window Neural Network for CCG Supertagging

no code implementations • 10 Oct 2016 • Huijia Wu, Jiajun Zhang, Cheng-qing Zong

These motivate us to build a supertagger with a dynamic window approach, which can be treated as an attention mechanism on the local contexts.

CCG Supertagging Sentence +1

Paper
Add Code

Learning Sentence Representation with Guidance of Human Attention

no code implementations • 29 Sep 2016 • Shaonan Wang, Jiajun Zhang, Cheng-qing Zong

Recently, much progress has been made in learning general-purpose sentence representations that can be used across domains.

POS Sentence

Paper
Add Code

One Sentence One Model for Neural Machine Translation

no code implementations • LREC 2018 • Xiao-Qing Li, Jiajun Zhang, Cheng-qing Zong

Neural machine translation (NMT) becomes a new state-of-the-art and achieves promising translation results using a simple encoder-decoder neural network.

Machine Translation NMT +2

Paper
Add Code

An End-to-End Chinese Discourse Parser with Adaptation to Explicit and Non-explicit Relation Recognition

no code implementations • CONLL 2016 • Xiaomian Kang, Haoran Li, Long Zhou, Jiajun Zhang, Cheng-qing Zong

General Classification Machine Translation +3

Paper
Add Code

Neural Name Translation Improves Neural Machine Translation

no code implementations • 7 Jul 2016 • Xiao-Qing Li, Jiajun Zhang, Cheng-qing Zong

In order to control computational complexity, neural machine translation (NMT) systems convert all rare words outside the vocabulary into a single unk symbol.

Machine Translation NMT +2

Paper
Add Code

A Bilingual Discourse Corpus and Its Applications

no code implementations • LREC 2016 • Yang Liu, Jiajun Zhang, Cheng-qing Zong, Yating Yang, Xi Zhou

Existing discourse research focuses only on monolingual settings, and the inconsistency between languages limits the power of discourse theory in multilingual applications such as machine translation.

Machine Translation Translation

Paper
Add Code

Local Translation Prediction with Global Sentence Representation

no code implementations • 27 Feb 2015 • Jiajun Zhang

With the sentence-level feature representation, we further design a feed-forward neural network to better predict translations using both local and global information.

Machine Translation Sentence +1

Paper
Add Code

Beyond Word-based Language Model in Statistical Machine Translation

no code implementations • 5 Feb 2015 • Jiajun Zhang, Shujie Liu, Mu Li, Ming Zhou, Cheng-qing Zong

The language model is one of the most important modules in statistical machine translation, and currently the word-based language model dominates this community.

Language Modelling Machine Translation +1