Inference Optimization

18 papers with code • 0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

Input Convex Neural Networks

locuslab/icnn ICML 2017

We show that many existing neural network architectures can be made input-convex with a minor modification, and develop specialized optimization algorithms tailored to this setting.

Enhanced graph-learning schemes driven by similar distributions of motifs

reysam93/adaptive_agg_gcn 11 Jul 2022

Guided by this, we first assume that we have a reference graph that is related to the sought graph (in the sense of having similar motif densities) and then, we exploit this relation by incorporating a similarity constraint and a regularization term in the network topology inference optimization problem.

Representing Edge Flows on Graphs via Sparse Cell Complexes

josefhoppe/edge-flow-cell-complexes 4 Sep 2023

In this paper, we generalize this approach to cellular complexes and introduce the flow representation learning problem, i. e., the problem of augmenting the observed graph by a set of cells, such that the eigenvectors of the associated Hodge Laplacian provide a sparse, interpretable representation of the observed edge flows on the graph.

Patched MOA: optimizing inference for diverse software development tasks

codelion/optillm 26 Jul 2024

This paper introduces Patched MOA (Mixture of Agents), an inference optimization technique that significantly enhances the performance of large language models (LLMs) across diverse software development tasks.

CycleBNN: Cyclic Precision Training in Binary Neural Networks

fedeloper/binary_deepfake_detection 28 Sep 2024

This paper works on Binary Neural Networks (BNNs), a promising avenue for efficient deep learning, offering significant reductions in computational overhead and memory footprint to full precision networks.

Iterative Amortized Inference

joelouismarino/iterative_inference ICML 2018

The failure of these models to reach fully optimized approximate posterior estimates results in an amortization gap.

A General Method for Amortizing Variational Filtering

joelouismarino/amortized-variational-filtering NeurIPS 2018

We introduce the variational filtering EM algorithm, a simple, general-purpose method for performing variational inference in dynamical latent variable models using information from only past and present variables, i. e. filtering.

Easy and Efficient Transformer : Scalable Inference Solution For large NLP model

NetEase-FuXi/EET 26 Apr 2021

To fill such a gap, we introduce a scalable inference solution: Easy and Efficient Transformer (EET), including a series of transformer inference optimization at the algorithm and implementation levels.

A Novel 1D State Space for Efficient Music Rhythmic Analysis

mjhydri/1d-statespace 1 Nov 2021

Inferring music time structures has a broad range of applications in music production, processing and analysis.

ADJUST: A Dictionary-Based Joint Reconstruction and Unmixing Method for Spectral Tomography

mzeegers/ADJUST 21 Dec 2021

However, these methods inherently suffer from the ill-posedness of the joint reconstruction problem.