4 code implementations • 17 Sep 2018 • Hongzhuo Liang, Xiaojian Ma, Shuang Li, Michael Görner, Song Tang, Bin Fang, Fuchun Sun, Jianwei Zhang
In this paper, we propose an end-to-end grasp evaluation model to address the challenging problem of localizing robot grasp configurations directly from the point cloud.
Robotics
1 code implementation • 27 Jul 2021 • Song Tang, Yan Yang, Zhiyuan Ma, Norman Hendrich, Fanyu Zeng, Shuzhi Sam Ge, ChangShui Zhang, Jianwei Zhang
To reach this goal, we construct the nearest neighborhood for every target data and take it as the fundamental clustering unit by building our objective on the geometry.
1 code implementation • ICCV 2023 • Song Tang, Chuang Li, Pu Zhang, RongNian Tang
In this paper, we propose a new recurrent cell, SwinLSTM, which integrates Swin Transformer blocks and the simplified LSTM, an extension that replaces the convolutional structure in ConvLSTM with the self-attention mechanism.
Ranked #7 on Video Prediction on Moving MNIST
1 code implementation • 19 Aug 2023 • Song Tang, Chuang Li, Pu Zhang, RongNian Tang
In this paper, we propose a new recurrent cell, SwinLSTM, which integrates Swin Transformer blocks and the simplified LSTM, an extension that replaces the convolutional structure in ConvLSTM with the self-attention mechanism.
1 code implementation • 27 Nov 2023 • Song Tang, Wenxin Su, Mao Ye, Xiatian Zhu
We find that directly applying the ViL model to the target domain in a zero-shot fashion is unsatisfactory, as it is not specialized for this particular task but largely generic.
1 code implementation • 12 Mar 2024 • Song Tang, Wenxin Su, Mao Ye, Jianwei Zhang, Xiatian Zhu
To tackle this unified SFDA problem, we propose a novel approach called Latent Causal Factors Discovery (LCFD).