3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation

wang-zidu/3ddfa-v3 1 Dec 2023

In this paper, we fully utilize the facial part segmentation geometry by introducing Part Re-projection Distance Loss (PRDL).

3D Face Reconstruction Segmentation

90
0.34 stars / hour

SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

modelscope/swift 18 Dec 2023

Image diffusion models have been utilized in various tasks, such as text-to-image generation and controllable image synthesis.

Decoder Text-to-Image Generation

1,781
0.34 stars / hour

Grokfast: Accelerated Grokking by Amplifying Slow Gradients

ironjr/grokfast 30 May 2024

One puzzling artifact in machine learning, dubbed grokking, is where delayed generalization is achieved tenfolds of iterations after near-perfect overfitting to the training data.

33
0.33 stars / hour

Scalable MatMul-free Language Modeling

ridgerchu/matmulfreellm 4 Jun 2024

Our experiments show that our proposed MatMul-free models achieve performance on-par with state-of-the-art Transformers that require far more memory during inference at a scale up to at least 2.7B parameters.

Language Modelling

15
0.33 stars / hour

GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning

cmavro/gnn-rag 30 May 2024

In our GNN-RAG framework, the GNN acts as a dense subgraph reasoner to extract useful graph information, while the LLM leverages its natural language processing ability for ultimate KGQA.

Graph Question Answering Knowledge Graphs +4

54
0.32 stars / hour

U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers

yuchuantian/u-dit 4 May 2024

Diffusion Transformers (DiTs) introduce the transformer architecture to diffusion tasks for latent-space image generation.

Image Generation Inductive Bias

33
0.31 stars / hour

CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner

wyysf-98/craftsman 23 May 2024

We present a novel generative 3D modeling system, coined CraftsMan, which can generate high-fidelity 3D geometries with highly varied shapes, regular mesh topologies, and detailed surfaces, and, notably, allows for refining the geometry in an interactive manner.

3D Generation

236
0.31 stars / hour

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

zhangzc21/dyntet 27 Feb 2024

Recent works in implicit representations, such as Neural Radiance Fields (NeRF), have advanced the generation of realistic and animatable head avatars from video sequences.

175
0.31 stars / hour

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

openbmb/minicpm 9 Apr 2024

For data scaling, we introduce a Warmup-Stable-Decay (WSD) learning rate scheduler (LRS), conducive to continuous training and domain adaptation.

Domain Adaptation

4,180
0.29 stars / hour
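
The Warmup-Stable-Decay (WSD) scheduler named in the MiniCPM entry above can be illustrated with a minimal sketch. The listing gives only the three phase names, so everything else here is an assumption for illustration: the function name `wsd_lr`, its parameters, the linear warmup, and the linear decay shape are hypothetical and may differ from what the paper or the openbmb/minicpm repository actually uses.

```python
def wsd_lr(step, peak_lr, warmup_steps, stable_steps, decay_steps, min_lr=0.0):
    """Return the learning rate at `step` for a three-phase schedule.

    Phases (shapes are assumptions, not taken from the MiniCPM paper):
      1. warmup: linear ramp up to peak_lr
      2. stable: constant at peak_lr
      3. decay:  linear ramp from peak_lr down to min_lr
    """
    if decay_steps <= 0:
        raise ValueError("decay_steps must be positive")
    if step < warmup_steps:
        # Linear warmup; step 0 already gets a non-zero rate.
        return peak_lr * (step + 1) / warmup_steps
    if step < warmup_steps + stable_steps:
        # Stable phase: hold the peak rate.
        return peak_lr
    # Decay phase: fraction of the decay window completed, clamped to 1.
    t = min(step - warmup_steps - stable_steps, decay_steps) / decay_steps
    return peak_lr + (min_lr - peak_lr) * t


if __name__ == "__main__":
    # Print a few sample points across the three phases.
    for s in (0, 9, 50, 120, 130):
        print(s, wsd_lr(s, peak_lr=1.0, warmup_steps=10,
                        stable_steps=100, decay_steps=20))
```

One practical property of a schedule with this shape is that the stable phase does not depend on the total step count, so a checkpoint taken during it can be resumed for longer training or branched into a short decay run, which fits the entry's mention of continuous training and domain adaptation.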

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

tencent/hunyuandit 14 May 2024

For fine-grained language understanding, we train a Multimodal Large Language Model to refine the captions of the images.

Image Generation Language Modelling +2

2,088
0.28 stars / hour