Search Results for author: Zhe Dong

Found 7 papers, 0 papers with code

SKILL: Structured Knowledge Infusion for Large Language Models

no code implementations 17 May 2022 Fedor Moiseev, Zhe Dong, Enrique Alfonseca, Martin Jaggi

The models pre-trained on factual triples compare competitively with the ones on natural language sentences that contain the same knowledge.

Knowledge Graphs

Exploring Dual Encoder Architectures for Question Answering

no code implementations 14 Apr 2022 Zhe Dong, Jianmo Ni, Dan Bikel, Enrique Alfonseca, Yuan Wang, Chen Qu, Imed Zitouni

Dual encoders have been used for question-answering (QA) and information retrieval (IR) tasks with good results.

Information Retrieval, Question Answering

Coupled Gradient Estimators for Discrete Latent Variables

no code implementations NeurIPS 2021 Zhe Dong, Andriy Mnih, George Tucker

Training models with discrete latent variables is challenging due to the high variance of unbiased gradient estimators.

DisARM: An Antithetic Gradient Estimator for Binary Latent Variables

no code implementations NeurIPS 2020 Zhe Dong, Andriy Mnih, George Tucker

Applying antithetic sampling over the augmenting variables yields a relatively low-variance and unbiased estimator applicable to any model with binary latent variables.

On Predictive Information in RNNs

no code implementations 21 Oct 2019 Zhe Dong, Deniz Oktay, Ben Poole, Alexander A. Alemi

Certain biological neurons demonstrate a remarkable capability to optimally compress the history of sensory inputs while being maximally informative about the future.

Information Plane

On Predictive Information Sub-optimality of RNNs

no code implementations 25 Sep 2019 Zhe Dong, Deniz Oktay, Ben Poole, Alexander A. Alemi

Certain biological neurons demonstrate a remarkable capability to optimally compress the history of sensory inputs while being maximally informative about the future.

Information Plane
