Search Results for author: Liwen Zhang

Found 28 papers, 16 papers with code

Generalized Supervised Attention for Text Generation

1 code implementation • Findings (ACL) 2021 • Yixian Liu, Liwen Zhang, Xinyu Zhang, Yong Jiang, Yue Zhang, Kewei Tu

Paper
Code

SHARP: Search-Based Adversarial Attack for Structured Prediction

no code implementations • Findings (NAACL) 2022 • Liwen Zhang, Zixia Jia, Wenjuan Han, Zilong Zheng, Kewei Tu

Adversarial attack of structured prediction models faces various challenges such as the difficulty of perturbing discrete words, the sentence quality issue, and the sensitivity of outputs to small perturbations.

Adversarial Attack Dependency Parsing +4

Paper
Add Code

TripleSurv: Triplet Time-adaptive Coordinate Loss for Survival Analysis

no code implementations • 5 Jan 2024 • Liwen Zhang, Lianzhen Zhong, Fan Yang, Di Dong, Hui Hui, Jie Tian

However, ranking loss only focus on the ranking of survival time and does not consider potential effect of samples for exact survival time values.

Survival Analysis

Paper
Add Code

YAYI 2: Multilingual Open-Source Large Language Models

no code implementations • 22 Dec 2023 • Yin Luo, Qingchao Kong, Nan Xu, Jia Cao, Bao Hao, Baoyu Qu, Bo Chen, Chao Zhu, Chenyang Zhao, Donglei Zhang, Fan Feng, Feifei Zhao, Hailong Sun, Hanxuan Yang, Haojun Pan, Hongyu Liu, Jianbin Guo, Jiangtao Du, Jingyi Wang, Junfeng Li, Lei Sun, Liduo Liu, Lifeng Dong, Lili Liu, Lin Wang, Liwen Zhang, Minzheng Wang, Pin Wang, Ping Yu, Qingxiao Li, Rui Yan, Rui Zou, Ruiqun Li, Taiwen Huang, Xiaodong Wang, Xiaofei Wu, Xin Peng, Xina Zhang, Xing Fang, Xinglin Xiao, Yanni Hao, Yao Dong, Yigang Wang, Ying Liu, Yongyu Jiang, Yungan Wang, Yuqi Wang, Zhangsheng Wang, Zhaoxin Yu, Zhen Luo, Wenji Mao, Lei Wang, Dajun Zeng

As the latest advancements in natural language processing, large language models (LLMs) have achieved human-level language understanding and generation abilities in many real-world tasks, and even have been regarded as a potential path to the artificial general intelligence.

Paper
Add Code

FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models

1 code implementation • 19 Aug 2023 • Liwen Zhang, Weige Cai, Zhaowei Liu, Zhi Yang, Wei Dai, Yujie Liao, Qianru Qin, Yifei Li, Xingyu Liu, Zhiqiang Liu, Zhoufan Zhu, Anbo Wu, Xin Guo, Yun Chen

Our work offers a more comprehensive financial knowledge evaluation benchmark, utilizing data of mock exams and covering a wide range of evaluated LLMs.

Multiple-choice

119

Paper
Code

Membrane Potential Batch Normalization for Spiking Neural Networks

1 code implementation • ICCV 2023 • Yufei Guo, Yuhan Zhang, Yuanpei Chen, Weihang Peng, Xiaode Liu, Liwen Zhang, Xuhui Huang, Zhe Ma

All these BNs are suggested to be used after the convolution layer as usually doing in CNNs.

Paper
Code

RMP-Loss: Regularizing Membrane Potential Distribution for Spiking Neural Networks

2 code implementations • ICCV 2023 • Yufei Guo, Xiaode Liu, Yuanpei Chen, Liwen Zhang, Weihang Peng, Yuhan Zhang, Xuhui Huang, Zhe Ma

Spiking Neural Networks (SNNs) as one of the biology-inspired models have received much attention recently.

Quantization

Paper
Code

Variance Control for Distributional Reinforcement Learning

1 code implementation • 30 Jul 2023 • Qi Kuang, Zhoufan Zhu, Liwen Zhang, Fan Zhou

Although distributional reinforcement learning (DRL) has been widely examined in the past few years, very few studies investigate the validity of the obtained Q-function estimator in the distributional setting.

Distributional Reinforcement Learning reinforcement-learning

Paper
Code

InfLoR-SNN: Reducing Information Loss for Spiking Neural Networks

no code implementations • 10 Jul 2023 • Yufei Guo, Yuanpei Chen, Liwen Zhang, Xiaode Liu, Xinyi Tong, Yuanyuan Ou, Xuhui Huang, Zhe Ma

The Spiking Neural Network (SNN) has attracted more and more attention recently.

Quantization

Paper
Add Code

Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization

1 code implementation • 12 Jun 2023 • Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu, Haifeng Wang

Multilingual sentence representations are the foundation for similarity-based bitext mining, which is crucial for scaling multilingual neural machine translation (NMT) system to more languages.

Machine Translation NMT +2

Paper
Code

Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization

1 code implementation • 12 May 2023 • Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu, Haifeng Wang

The experimental analysis also proves that CrossConST could close the sentence representation gap and better align the representation space.

Machine Translation NMT +2

Paper
Code

Joint A-SNN: Joint Training of Artificial and Spiking Neural Networks via Self-Distillation and Weight Factorization

no code implementations • 3 May 2023 • Yufei Guo, Weihang Peng, Yuanpei Chen, Liwen Zhang, Xiaode Liu, Xuhui Huang, Zhe Ma

In this paper, we propose a joint training framework of ANN and SNN, in which the ANN can guide the SNN's optimization.

Paper
Add Code

PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation

2 code implementations • CVPR 2023 • Liwen Zhang, Xinyan Zhang, Youcheng Zhang, Yufei Guo, Yuanpei Chen, Xuhui Huang, Zhe Ma

However, neither the regular convolution operation nor the modified ones are specific to interpret radar signals.

Object Scene Understanding +1

Paper
Code

IM-Loss: Information Maximization Loss for Spiking Neural Networks

no code implementations • NeurIPS 2022 • Yufei Guo, Yuanpei Chen, Liwen Zhang, Xiaode Liu, YingLei Wang, Xuhui Huang, Zhe Ma

To deal with this problem, the Information maximization loss (IM-Loss) that aims at maximizing the information flow in the SNN is proposed in the paper.

Ranked #6 on Event data classification on CIFAR10-DVS

Event data classification Image Classification +1

Paper
Add Code

Real Spike: Learning Real-valued Spikes for Spiking Neural Networks

1 code implementation • 13 Oct 2022 • Yufei Guo, Liwen Zhang, Yuanpei Chen, Xinyi Tong, Xiaode Liu, YingLei Wang, Xuhui Huang, Zhe Ma

Motivated by this assumption, a training-inference decoupling method for SNNs named as Real Spike is proposed, which not only enjoys both unshared convolution kernels and binary spikes in inference-time but also maintains both shared convolution kernels and Real-valued Spikes during training.

Paper
Code

Pseudo-Label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object Detection

1 code implementation • Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops 2022 • Jun Yu, Liwen Zhang, Shenshen Du, Hao Chang, Keda Lu, Zhong Zhang, Ye Yu, Lei Wang, Qiang Ling

To overcome these difficulties, this paper first select fewer but suitable data augmentation methods to improve the accuracy of the supervised model based on the labeled training set, which is suitable for the characteristics of hyperspectral images.

Data Augmentation object-detection +3

Paper
Code

Bag of Tricks and a Strong Baseline for FGVC

1 code implementation • Conference and Labs of the Evaluation Forum 2022 • Jun Yu, Hao Chang, Keda Lu, Guochen Xie, Liwen Zhang, Zhongpeng Cai, Shenshen Du, Zhihong Wei, Zepeng Liu, Fang Gao, Feng Shuang

This motivates us to explore the impact of different methods and components in fine-grained classification on FungiCLEF 2022.

Classification Fine-Grained Image Classification

Paper
Code

Scene Clustering Based Pseudo-labeling Strategy for Multi-modal Aerial View Object Classification

2 code implementations • 4 May 2022 • Jun Yu, Hao Chang, Keda Lu, Liwen Zhang, Shenshen Du, Zhong Zhang

Multi-modal aerial view object classification (MAVOC) in Automatic target recognition (ATR), although an important and challenging problem, has been under studied.

Clustering Image Classification

Paper
Code

RecDis-SNN: Rectifying Membrane Potential Distribution for Directly Training Spiking Neural Networks

no code implementations • CVPR 2022 • Yufei Guo, Xinyi Tong, Yuanpei Chen, Liwen Zhang, Xiaode Liu, Zhe Ma, Xuhui Huang

Unfortunately, with the propagation of binary spikes, the distribution of membrane potential will shift, leading to degeneration, saturation, and gradient mismatch problems, which would be disadvantageous to the network optimization and convergence.

Quantization

Paper
Add Code

Adapting Unsupervised Syntactic Parsing Methodology for Discourse Dependency Parsing

no code implementations • ACL 2021 • Liwen Zhang, Ge Wang, Wenjuan Han, Kewei Tu

In this paper, we propose a simple yet effective method to adapt unsupervised syntactic dependency parsing methodology for unsupervised discourse dependency parsing.

Dependency Parsing Discourse Parsing

Paper
Add Code

Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning

no code implementations • 14 May 2021 • Fan Zhou, Zhoufan Zhu, Qi Kuang, Liwen Zhang

Although distributional reinforcement learning (DRL) has been widely examined in the past few years, there are two open questions people are still trying to address.

Atari Games Distributional Reinforcement Learning +3

Paper
Add Code

Constrained Text Generation with Global Guidance -- Case Study on CommonGen

no code implementations • 12 Mar 2021 • Yixian Liu, Liwen Zhang, Wenjuan Han, Yue Zhang, Kewei Tu

We focus on CommonGen, the task of generating text based on a set of concepts, as a representative task of constrained text generation.

Common Sense Reasoning reinforcement-learning +3

Paper
Add Code

Adversarial Attack and Defense of Structured Prediction Models

1 code implementation • EMNLP 2020 • Wenjuan Han, Liwen Zhang, Yong Jiang, Kewei Tu

To address these problems, we propose a novel and unified framework that learns to attack a structured prediction model using a sequence-to-sequence model with feedbacks from multiple reference models of the same structured prediction task.

Adversarial Attack Dependency Parsing +3

Paper
Code

Latent Variable Sentiment Grammar

1 code implementation • ACL 2019 • Liwen Zhang, Kewei Tu, Yue Zhang

Neural models have been investigated for sentiment classification over constituent trees.

General Classification Sentiment Analysis +1

Paper
Code

Tropical Geometry of Deep Neural Networks

1 code implementation • ICML 2018 • Liwen Zhang, Gregory Naitzat, Lek-Heng Lim

Among other things, we deduce that feedforward ReLU neural networks with one hidden layer can be characterized by zonotopes, which serve as building blocks for deeper networks; we relate decision boundaries of such neural networks to tropical hypersurfaces, a major object of study in tropical geometry; and we prove that linear regions of such neural networks correspond to vertices of polytopes associated with tropical rational functions.

Paper
Code

Gaussian Mixture Latent Vector Grammars

1 code implementation • ACL 2018 • Yanpeng Zhao, Liwen Zhang, Kewei Tu

We introduce Latent Vector Grammars (LVeGs), a new framework that extends latent variable grammars such that each nonterminal symbol is associated with a continuous vector space representing the set of (infinitely many) subtypes of the nonterminal.

Constituency Parsing Part-Of-Speech Tagging