DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

dreamgaussian/dreamgaussian 28 Sep 2023

In contrast to the occupancy pruning used in Neural Radiance Fields, we demonstrate that the progressive densification of 3D Gaussians converges significantly faster for 3D generative tasks.

855
5.83 stars / hour

ProPainter: Improving Propagation and Transformer for Video Inpainting

sczhou/propainter ICCV 2023

We also propose a mask-guided sparse video Transformer, which achieves high efficiency by discarding unnecessary and redundant tokens.

Optical Flow Estimation, Video Inpainting

2,344
3.69 stars / hour

Demystifying CLIP Data

facebookresearch/metaclip 28 Sep 2023

We believe that the main ingredient to the success of CLIP is its data and not the model architecture or pre-training objective.

167
2.54 stars / hour

Text-to-3D using Gaussian Splatting

gsgen3d/gsgen 28 Sep 2023

In this stage, we increase the number of Gaussians by compactness-based densification to enhance continuity and improve fidelity.

Text to 3D

178
2.03 stars / hour

Deep Geometrized Cartoon Line Inbetweening

lisiyao21/animeinbet ICCV 2023

To preserve the precision and detail of the line drawings, we propose a new approach, AnimeInbet, which geometrizes raster line drawings into graphs of endpoints and reframes the inbetweening task as a graph fusion problem with vertex repositioning.

110
1.64 stars / hour

Qwen Technical Report

QwenLM/Qwen-7B 28 Sep 2023

Large language models (LLMs) have revolutionized the field of artificial intelligence, enabling natural language processing tasks that were previously thought to be exclusive to humans.

Language Modelling, Large Language Model

5,116
1.16 stars / hour

3D Gaussian Splatting for Real-Time Radiance Field Rendering

graphdeco-inria/gaussian-splatting 8 Aug 2023

Radiance Field methods have recently revolutionized novel-view synthesis of scenes captured with multiple photos or videos.

Camera Calibration, Novel View Synthesis

4,763
1.05 stars / hour

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition

internlm/internlm-xcomposer 26 Sep 2023

We propose InternLM-XComposer, a vision-language large model that enables advanced image-text comprehension and composition.

Image Comprehension, Reading Comprehension

118
1.02 stars / hour

Communicative Agents for Software Development

openbmb/chatdev 16 Jul 2023

At the core of this paradigm lies ChatDev, a virtual chat-powered software development company that mirrors the established waterfall model, meticulously dividing the development process into four distinct chronological stages: designing, coding, testing, and documenting.

Decision Making

9,852
1.02 stars / hour

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

dvlab-research/longlora 21 Sep 2023

LongLoRA extends LLaMA2 7B from 4k context to 100k, or LLaMA2 70B to 32k, on a single 8x A100 machine.

1,099
0.91 stars / hour