Swarm-SLAM: Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems

mistlab/swarm-slam 16 Jan 2023

Collaborative Simultaneous Localization And Mapping (C-SLAM) is a vital component for successful multi-robot operations in environments without an external positioning system, such as indoors, underground or underwater.

Simultaneous Localization and Mapping

Cut and Learn for Unsupervised Object Detection and Instance Segmentation

facebookresearch/cutler 26 Jan 2023

We propose Cut-and-LEaRn (CutLER), a simple approach for training unsupervised object detection and segmentation models.

Instance Segmentation object-detection +2

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

showlab/Tune-A-Video 22 Dec 2022

To reproduce the success of text-to-image (T2I) generation, recent works in text-to-video (T2V) generation employ large-scale text-video datasets for fine-tuning.

Style Transfer Text-to-Video Generation +1

What Makes Good Examples for Visual In-Context Learning?

zhangyuanhan-ai/visual_prompt_retrieval 31 Jan 2023

To overcome the problem, we propose a prompt retrieval framework to automate the selection of in-context examples.


Disentangling Random and Cyclic Effects in Time-Lapse Sequences

harskish/tlgan 4 Jul 2022

We introduce the problem of disentangling time-lapse sequences in a way that allows separate, after-the-fact control of overall trends, cyclic effects, and random effects in the images, and describe a technique based on data-driven generative models that achieves this goal.

DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature

BurhanUlTayyab/DetectGPT 26 Jan 2023

The fluency and factual knowledge of large language models (LLMs) heighten the need for corresponding systems to detect whether a piece of text is machine-written.

Language Modelling

Interpretation of 3D CNNs for Brain MRI Data Classification

maxs-kan/InterpretableNeuroDL 20 Jun 2020

Deep learning shows high potential for many medical image analysis tasks.

Classification General Classification

AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities

automatic1111/stable-diffusion-webui 12 Nov 2022

In this work, we present a conceptually simple and effective method to train a strong bilingual/multilingual multimodal representation model.

Contrastive Learning Cross-Modal Retrieval +10

Deep Amended Gradient Descent for Efficient Spectral Reconstruction from Single RGB Images

zbzhzhy/gd-net 12 Aug 2021

Then, we design a lightweight neural network with a multi-stage architecture to mimic the formed amended gradient descent process, in which efficient convolution and novel spectral zero-mean normalization are proposed to effectively extract spatial-spectral features for regressing an initialization, a basic gradient, and an incremental gradient.

Spectral Reconstruction

Fine-Tuning Language Models from Human Preferences

lvwerra/trl 18 Sep 2019

Most work on reward learning has used simulated environments, but complex information about values is often expressed in natural language, and we believe reward learning for language is a key to making RL practical and safe for real-world tasks.

Language Modelling

