Search Results for author: Zhenzhi Wang

Found 6 papers, 4 papers with code

Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

2 code implementations • 10 Sep 2021 • Zhenzhi Wang, LiMin Wang, Tao Wu, TianHao Li, Gangshan Wu

Treating temporal grounding as a metric-learning problem, we present a Mutual Matching Network (MMN) that directly models the similarity between language queries and video moments in a joint embedding space.

Metric Learning, Representation Learning, +2

Overview of Tencent Multi-modal Ads Video Understanding Challenge

no code implementations • 16 Sep 2021 • Zhenzhi Wang, Liyu Wu, Zhimin Li, Jiangfeng Xiong, Qinglin Lu

Our challenge includes two tasks: video structuring in the temporal dimension and multi-modal video classification.

Multi-Label Classification, Video Classification, +1

MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond

no code implementations • ICCV 2023 • Yixuan Li, Lihan Jiang, Linning Xu, Yuanbo Xiangli, Zhenzhi Wang, Dahua Lin, Bo Dai

While most recent neural rendering work focuses on objects and small-scale scenes, developing neural rendering methods for city-scale scenes has great potential for many real-world applications.

Neural Rendering

InterControl: Generate Human Motion Interactions by Controlling Every Joint

1 code implementation • 27 Nov 2023 • Zhenzhi Wang, Jingbo Wang, Yixuan Li, Dahua Lin, Bo Dai

We demonstrate that the distances between joint pairs for human-wise interactions can be generated using an off-the-shelf Large Language Model (LLM).

Language Modelling, Large Language Model, +1
