Search Results for author: Yannic Kilcher

Found 17 papers, 3 papers with code

How does BERT capture semantics? A closer look at polysemous words

1 code implementation • EMNLP (BlackboxNLP) 2020 • David Yenicelik, Florian Schmidt, Yannic Kilcher

The recent paradigm shift to contextual word embeddings has seen tremendous success across a wide range of down-stream tasks.

Semanticity prediction Semantic Similarity +3

Paper
Code

OpenAssistant Conversations -- Democratizing Large Language Model Alignment

no code implementations • 14 Apr 2023 • Andreas Köpf, Yannic Kilcher, Dimitri von Rütte, Sotiris Anagnostidis, Zhi-Rui Tam, Keith Stevens, Abdullah Barhoum, Nguyen Minh Duc, Oliver Stanley, Richárd Nagyfi, Shahul ES, Sameer Suri, David Glushkov, Arnav Dantuluri, Andrew Maguire, Christoph Schuhmann, Huu Nguyen, Alexander Mattick

In an effort to democratize research on large-scale alignment, we release OpenAssistant Conversations, a human-generated, human-annotated assistant-style conversation corpus consisting of 161, 443 messages in 35 different languages, annotated with 461, 292 quality ratings, resulting in over 10, 000 complete and fully annotated conversation trees.

Language Modelling Large Language Model

Paper
Add Code

FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

2 code implementations • 26 Jan 2022 • Dimitri von Rütte, Luca Biggio, Yannic Kilcher, Thomas Hofmann

Generating music with deep neural networks has been an area of active research in recent years.

Inductive Bias Music Generation

583

Paper
Code

Boosting Search Engines with Interactive Agents

no code implementations • 1 Sep 2021 • Leonard Adolphs, Benjamin Boerschinger, Christian Buck, Michelle Chen Huebscher, Massimiliano Ciaramita, Lasse Espeholt, Thomas Hofmann, Yannic Kilcher, Sascha Rothe, Pier Giuseppe Sessa, Lierni Sestorain Saralegui

This paper presents first successful steps in designing search agents that learn meta-strategies for iterative query refinement in information-seeking tasks.

Information Retrieval Reading Comprehension +3

Paper
Add Code

Generative Minimization Networks: Training GANs Without Competition

no code implementations • 23 Mar 2021 • Paulina Grnarova, Yannic Kilcher, Kfir Y. Levy, Aurelien Lucchi, Thomas Hofmann

Among known problems experienced by practitioners is the lack of convergence guarantees or convergence to a non-optimum cycle.

Paper
Add Code

Rethinking Neural Networks With Benford's Law

no code implementations • 5 Feb 2021 • Surya Kant Sahu, Abhinav Java, Arshad Shaikh, Yannic Kilcher

To that end, we first define a metric, MLH (Model Enthalpy), that measures the closeness of a set of numbers to Benford's Law and we show empirically that it is a strong predictor of Validation Accuracy.

Fraud Detection Total Energy

Paper
Add Code

Meta Answering for Machine Reading

no code implementations • 11 Nov 2019 • Benjamin Borschinger, Jordan Boyd-Graber, Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Michelle Chen Huebscher, Wojciech Gajewski, Yannic Kilcher, Rodrigo Nogueira, Lierni Sestorain Saralegu

We investigate a framework for machine reading, inspired by real world information-seeking problems, where a meta question answering system interacts with a black box environment.

Natural Questions Question Answering +1

Paper
Add Code

Adversarial Training Generalizes Data-dependent Spectral Norm Regularization

no code implementations • 25 Sep 2019 • Kevin Roth, Yannic Kilcher, Thomas Hofmann

We establish a theoretical link between adversarial training and operator norm regularization for deep neural networks.

Paper
Add Code

Adversarial Training is a Form of Data-dependent Operator Norm Regularization

no code implementations • NeurIPS 2020 • Kevin Roth, Yannic Kilcher, Thomas Hofmann

We establish a theoretical link between adversarial training and operator norm regularization for deep neural networks.

Paper
Add Code

Escaping Flat Areas via Function-Preserving Structural Network Modifications

no code implementations • ICLR 2019 • Yannic Kilcher, Gary Bécigneul, Thomas Hofmann

We develop our method for fully-connected as well as convolutional layers.

Paper
Add Code

The Odds are Odd: A Statistical Test for Detecting Adversarial Examples

1 code implementation • 13 Feb 2019 • Kevin Roth, Yannic Kilcher, Thomas Hofmann

We investigate conditions under which test statistics exist that can reliably detect examples, which have been adversarially manipulated in a white-box attack.

Paper
Code

The best defense is a good offense: Countering black box attacks by predicting slightly wrong labels

no code implementations • 15 Nov 2017 • Yannic Kilcher, Thomas Hofmann

Black-Box attacks on machine learning models occur when an attacker, despite having no access to the inner workings of a model, can successfully craft an attack by means of model theft.

Paper
Add Code

Parametrizing filters of a CNN with a GAN

no code implementations • ICLR 2018 • Yannic Kilcher, Gary Becigneul, Thomas Hofmann

It is commonly agreed that the use of relevant invariances as a good statistical bias is important in machine-learning.

Generative Adversarial Network

Paper
Add Code

Flexible Prior Distributions for Deep Generative Models

no code implementations • ICLR 2018 • Yannic Kilcher, Aurelien Lucchi, Thomas Hofmann

We consider the problem of training generative models with deep neural networks as generators, i. e. to map latent codes to data points.

Paper
Add Code

Semantic Interpolation in Implicit Models

no code implementations • ICLR 2018 • Yannic Kilcher, Aurelien Lucchi, Thomas Hofmann

In implicit models, one often interpolates between sampled points in latent space.

Paper
Add Code

Generator Reversal

no code implementations • 28 Jul 2017 • Yannic Kilcher, Aurélien Lucchi, Thomas Hofmann

We consider the problem of training generative models with deep neural networks as generators, i. e. to map latent codes to data points.

Paper
Add Code

Scalable Adaptive Stochastic Optimization Using Random Projections

no code implementations • NeurIPS 2016 • Gabriel Krummenacher, Brian McWilliams, Yannic Kilcher, Joachim M. Buhmann, Nicolai Meinshausen

We show that the regret of Ada-LR is close to the regret of full-matrix AdaGrad which can have an up-to exponentially smaller dependence on the dimension than the diagonal variant.

Dimensionality Reduction Stochastic Optimization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.