Galactica: A Large Language Model for Science

paperswithcode/galai 16 Nov 2022

We believe these results demonstrate the potential for language models as a new interface for science.

Anachronisms Bias Detection +13

KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation

eleutherai/gpt-neox 20 May 2022

Relative positional embeddings (RPE) have received considerable attention since RPEs effectively model the relative distance among tokens and enable length extrapolation.

Language Modelling

One is All: Bridging the Gap Between Neural Radiance Fields Architectures with Progressive Volume Distillation

megvii-research/AAAI2023-PVD 29 Nov 2022

In this paper, we propose Progressive Volume Distillation (PVD), a systematic distillation method that allows any-to-any conversions between different architectures, including MLP, sparse or low-rank tensors, hashtables and their compositions.

Ranked #1 on Novel View Synthesis on NeRF (Average PSNR metric)

3D Reconstruction Neural Rendering +1

Mixed Neural Voxels for Fast Multi-view Video Synthesis

fengres/mixvoxels 1 Dec 2022

In this paper, we present a novel method named MixVoxels to better represent the dynamic scenes with fast training speed and competitive rendering qualities.

MetaFormer Baselines for Vision

sail-sg/metaformer 24 Oct 2022

By simply applying depthwise separable convolutions as token mixer in the bottom stages and vanilla self-attention in the top stages, the resulting model CAFormer sets a new record on ImageNet-1K: it achieves an accuracy of 85.5% at 224x224 resolution, under normal supervised training without external data or distillation.

Image Classification

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

XavierXiao/Dreambooth-Stable-Diffusion 25 Aug 2022

Once the subject is embedded in the output domain of the model, the unique identifier can then be used to synthesize fully-novel photorealistic images of the subject contextualized in different scenes.

Image Generation

UIU-Net: U-Net in U-Net for Infrared Small Object Detection

danfenghong/ieee_tip_uiu-net 2 Dec 2022

RM-DS integrates Residual U-blocks into a deep supervision network to generate deep multi-scale resolution-maintenance features while learning global context information.

object-detection Representation Learning +1

LiT: Zero-Shot Transfer with Locked-image text Tuning

mlfoundations/open_clip CVPR 2022

This paper presents contrastive-tuning, a simple method employing contrastive training to align image and text models while still taking advantage of their pre-training.

Image Classification Retrieval +2

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model

huggingface/diffusers 15 Nov 2022

Through our experiments, we demonstrate that VD and its underlying framework have the following merits: a) VD handles all subtasks with competitive quality; b) VD initiates novel extensions and applications such as disentanglement of style and semantic, image-text dual-guided generation, etc.

Disentanglement Image Captioning +4

