SSL4EO-L: Datasets and Foundation Models for Landsat Imagery

microsoft/torchgeo NeurIPS 2023

The Landsat program is the longest-running Earth observation program in history, with 50+ years of data acquisition by 8 satellites.

Cloud Detection Earth Observation +2

2,563
0.49 stars / hour

ClimaX: A foundation model for weather and climate

microsoft/ClimaX 24 Jan 2023

We develop and demonstrate ClimaX, a flexible and generalizable deep learning model for weather and climate science that can be trained using heterogeneous datasets spanning different variables, spatio-temporal coverage, and physical groundings.

Self-Supervised Learning Weather Forecasting

593
0.46 stars / hour

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

zhenye234/xcodec 30 Aug 2024

By enhancing the semantic ability of the codec, X-Codec significantly reduces WER in speech synthesis tasks and extends these benefits to non-speech applications, including music and sound generation.

Audio Compression Audio Generation +3

60
0.45 stars / hour

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

thudm/longwriter 13 Aug 2024

By incorporating this dataset into model training, we successfully scale the output length of existing models to over 10, 000 words while maintaining output quality.

962
0.45 stars / hour

Law of Vision Representation in MLLMs

bronyayang/law_of_vision_representation_in_mllms 29 Aug 2024

We present the "Law of Vision Representation" in multimodal large language models (MLLMs).

cross-modal alignment Language Modelling

80
0.45 stars / hour

Bilateral Reference for High-Resolution Dichotomous Image Segmentation

zhengpeng7/birefnet 7 Jan 2024

It comprises two essential components: the localization module (LM) and the reconstruction module (RM) with our proposed bilateral reference (BiRef).

Camouflaged Object Segmentation Dichotomous Image Segmentation +3

907
0.45 stars / hour

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

openbmb/minicpm 9 Apr 2024

For data scaling, we introduce a Warmup-Stable-Decay (WSD) learning rate scheduler (LRS), conducive to continuous training and domain adaptation.

Domain Adaptation

5,241
0.44 stars / hour

AnyGraph: Graph Foundation Model in the Wild

hkuds/anygraph 20 Aug 2024

Furthermore, we have validated the model's fast adaptation ability and scaling law emergence, showcasing its versatility.

Graph Learning Zero-Shot Learning

82
0.44 stars / hour

LinFusion: 1 GPU, 1 Minute, 16K Image

huage001/linfusion 3 Sep 2024

We find that the distilled model, termed LinFusion, achieves performance on par with or superior to the original SD after only modest training, while significantly reducing time and memory complexity.

16k Causal Inference +1

104
0.42 stars / hour

Cradle: Empowering Foundation Agents Towards General Computer Control

baai-agents/cradle 5 Mar 2024

To handle this issue, we propose the General Computer Control (GCC) setting to restrict foundation agents to interact with software through the most unified and standardized interface, i. e., using screenshots as input and keyboard and mouse actions as output.

Efficient Exploration

1,677
0.41 stars / hour