1 code implementation • 20 Mar 2024 • Yumeng Li, William Beluch, Margret Keuper, Dan Zhang, Anna Khoreva
Despite tremendous progress in the field of text-to-video (T2V) synthesis, open-sourced T2V diffusion models struggle to generate longer videos with dynamically varying and evolving content.
1 code implementation • 16 Jan 2024 • Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva
Current L2I models either suffer from poor editability via text or weak alignment between the generated image and the input layout.
no code implementations • 2 Dec 2023 • Yumeng Li, YaoXiang Ding, Zhong Ren, Kun Zhou
Explicit pose prior models compress human poses into latent representations for using in pose-related downstream tasks.
no code implementations • 19 Aug 2023 • Dan Zhang, Kaspar Sakmann, William Beluch, Robin Hutmacher, Yumeng Li
Within the context of autonomous driving, encountering unknown objects becomes inevitable during deployment in the open world.
1 code implementation • 20 Jul 2023 • Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva
To address the challenges posed by complex prompts or scenarios involving multiple entities and to achieve improved attribute binding, we propose Divide & Bind.
1 code implementation • 2 Jul 2023 • Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva
Using the proposed masked noise encoder to randomize style and content combinations in the training set, i. e., intra-source style augmentation (ISSA) effectively increases the diversity of training data and reduces spurious correlation.
no code implementations • 12 Jun 2023 • Akash Singh, Yumeng Li
Lattice parameter, coefficient of thermal expansion (CTE), Young's modulus and yield strength are estimated using machine learning accelerated MD simulations (MLMD), which are compared to experimental/first principle calculations from previous literatures.
1 code implementation • 18 Oct 2022 • Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva
Using the proposed masked noise encoder to randomize style and content combinations in the training set, ISSA effectively increases the diversity of training data and reduces spurious correlation.
no code implementations • 14 Jun 2022 • Yumeng Li, Ning Gao, Hanna Ziesche, Gerhard Neumann
We present a novel meta-learning approach for 6D pose estimation on unknown objects.
no code implementations • 13 Jun 2022 • Zhiyu Yao, Xinyang Chen, Sinan Wang, Qinyan Dai, Yumeng Li, Tanchao Zhu, Mingsheng Long
We conclude this characteristic for sequential behaviors of each user as the Behavior Pathway.
1 code implementation • 25 Feb 2022 • Linhao Luo, Yumeng Li, Buyu Gao, Shuai Tang, Sinan Wang, Jiancheng Li, Tanchao Zhu, Jiancai Liu, Zhao Li, Shirui Pan
We integrate these components into a unified framework and present MAMDR, which can be applied to any model structure to perform multi-domain recommendation.
no code implementations • 21 May 2021 • Ze Meng, Jinnian Zhang, Yumeng Li, Jiancheng Li, Tanchao Zhu, Lifeng Sun
Modeling powerful interactions is a critical challenge in Click-through rate (CTR) prediction, which is one of the most typical machine learning tasks in personalized advertising and recommender systems.
3 code implementations • 11 Nov 2020 • Shuai Zhang, Huoyu Liu, Aston Zhang, Yue Hu, Ce Zhang, Yumeng Li, Tanchao Zhu, Shaojian He, Wenwu Ou
Furthermore, we present two variants of hypercuboids to enhance the capability in capturing the diversities of user interests.