Search Results for author: Liwen Zhang

Found 28 papers, 16 papers with code

SHARP: Search-Based Adversarial Attack for Structured Prediction

no code implementations Findings (NAACL) 2022 Liwen Zhang, Zixia Jia, Wenjuan Han, Zilong Zheng, Kewei Tu

Adversarial attack of structured prediction models faces various challenges such as the difficulty of perturbing discrete words, the sentence quality issue, and the sensitivity of outputs to small perturbations.

Adversarial Attack Dependency Parsing +4

TripleSurv: Triplet Time-adaptive Coordinate Loss for Survival Analysis

no code implementations5 Jan 2024 Liwen Zhang, Lianzhen Zhong, Fan Yang, Di Dong, Hui Hui, Jie Tian

However, ranking loss only focus on the ranking of survival time and does not consider potential effect of samples for exact survival time values.

Survival Analysis

FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models

1 code implementation19 Aug 2023 Liwen Zhang, Weige Cai, Zhaowei Liu, Zhi Yang, Wei Dai, Yujie Liao, Qianru Qin, Yifei Li, Xingyu Liu, Zhiqiang Liu, Zhoufan Zhu, Anbo Wu, Xin Guo, Yun Chen

Our work offers a more comprehensive financial knowledge evaluation benchmark, utilizing data of mock exams and covering a wide range of evaluated LLMs.

Multiple-choice

Variance Control for Distributional Reinforcement Learning

1 code implementation30 Jul 2023 Qi Kuang, Zhoufan Zhu, Liwen Zhang, Fan Zhou

Although distributional reinforcement learning (DRL) has been widely examined in the past few years, very few studies investigate the validity of the obtained Q-function estimator in the distributional setting.

Distributional Reinforcement Learning reinforcement-learning

Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization

1 code implementation12 Jun 2023 Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu, Haifeng Wang

Multilingual sentence representations are the foundation for similarity-based bitext mining, which is crucial for scaling multilingual neural machine translation (NMT) system to more languages.

Machine Translation NMT +2

Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization

1 code implementation12 May 2023 Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu, Haifeng Wang

The experimental analysis also proves that CrossConST could close the sentence representation gap and better align the representation space.

Machine Translation NMT +2

Joint A-SNN: Joint Training of Artificial and Spiking Neural Networks via Self-Distillation and Weight Factorization

no code implementations3 May 2023 Yufei Guo, Weihang Peng, Yuanpei Chen, Liwen Zhang, Xiaode Liu, Xuhui Huang, Zhe Ma

In this paper, we propose a joint training framework of ANN and SNN, in which the ANN can guide the SNN's optimization.

Real Spike: Learning Real-valued Spikes for Spiking Neural Networks

1 code implementation13 Oct 2022 Yufei Guo, Liwen Zhang, Yuanpei Chen, Xinyi Tong, Xiaode Liu, YingLei Wang, Xuhui Huang, Zhe Ma

Motivated by this assumption, a training-inference decoupling method for SNNs named as Real Spike is proposed, which not only enjoys both unshared convolution kernels and binary spikes in inference-time but also maintains both shared convolution kernels and Real-valued Spikes during training.

Pseudo-Label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object Detection

1 code implementation Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops 2022 Jun Yu, Liwen Zhang, Shenshen Du, Hao Chang, Keda Lu, Zhong Zhang, Ye Yu, Lei Wang, Qiang Ling

To overcome these difficulties, this paper first select fewer but suitable data augmentation methods to improve the accuracy of the supervised model based on the labeled training set, which is suitable for the characteristics of hyperspectral images.

Data Augmentation object-detection +3

Scene Clustering Based Pseudo-labeling Strategy for Multi-modal Aerial View Object Classification

2 code implementations4 May 2022 Jun Yu, Hao Chang, Keda Lu, Liwen Zhang, Shenshen Du, Zhong Zhang

Multi-modal aerial view object classification (MAVOC) in Automatic target recognition (ATR), although an important and challenging problem, has been under studied.

Clustering Image Classification

RecDis-SNN: Rectifying Membrane Potential Distribution for Directly Training Spiking Neural Networks

no code implementations CVPR 2022 Yufei Guo, Xinyi Tong, Yuanpei Chen, Liwen Zhang, Xiaode Liu, Zhe Ma, Xuhui Huang

Unfortunately, with the propagation of binary spikes, the distribution of membrane potential will shift, leading to degeneration, saturation, and gradient mismatch problems, which would be disadvantageous to the network optimization and convergence.

Quantization

Adapting Unsupervised Syntactic Parsing Methodology for Discourse Dependency Parsing

no code implementations ACL 2021 Liwen Zhang, Ge Wang, Wenjuan Han, Kewei Tu

In this paper, we propose a simple yet effective method to adapt unsupervised syntactic dependency parsing methodology for unsupervised discourse dependency parsing.

Dependency Parsing Discourse Parsing

Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning

no code implementations14 May 2021 Fan Zhou, Zhoufan Zhu, Qi Kuang, Liwen Zhang

Although distributional reinforcement learning (DRL) has been widely examined in the past few years, there are two open questions people are still trying to address.

Atari Games Distributional Reinforcement Learning +3

Constrained Text Generation with Global Guidance -- Case Study on CommonGen

no code implementations12 Mar 2021 Yixian Liu, Liwen Zhang, Wenjuan Han, Yue Zhang, Kewei Tu

We focus on CommonGen, the task of generating text based on a set of concepts, as a representative task of constrained text generation.

Common Sense Reasoning reinforcement-learning +3

Adversarial Attack and Defense of Structured Prediction Models

1 code implementation EMNLP 2020 Wenjuan Han, Liwen Zhang, Yong Jiang, Kewei Tu

To address these problems, we propose a novel and unified framework that learns to attack a structured prediction model using a sequence-to-sequence model with feedbacks from multiple reference models of the same structured prediction task.

Adversarial Attack Dependency Parsing +3

Tropical Geometry of Deep Neural Networks

1 code implementation ICML 2018 Liwen Zhang, Gregory Naitzat, Lek-Heng Lim

Among other things, we deduce that feedforward ReLU neural networks with one hidden layer can be characterized by zonotopes, which serve as building blocks for deeper networks; we relate decision boundaries of such neural networks to tropical hypersurfaces, a major object of study in tropical geometry; and we prove that linear regions of such neural networks correspond to vertices of polytopes associated with tropical rational functions.

Gaussian Mixture Latent Vector Grammars

1 code implementation ACL 2018 Yanpeng Zhao, Liwen Zhang, Kewei Tu

We introduce Latent Vector Grammars (LVeGs), a new framework that extends latent variable grammars such that each nonterminal symbol is associated with a continuous vector space representing the set of (infinitely many) subtypes of the nonterminal.

Constituency Parsing Part-Of-Speech Tagging

Jointly Learning Multiple Measures of Similarities from Triplet Comparisons

no code implementations5 Mar 2015 Liwen Zhang, Subhransu Maji, Ryota Tomioka

Similarity between objects is multi-faceted and it can be easier for human annotators to measure it when the focus is on a specific aspect.

Metric Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.