no code implementations • 14 Mar 2024 • Haoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan, Yilun Du, Yining Hong, Chuang Gan
Recent vision-language-action (VLA) models rely on 2D inputs, lacking integration with the broader realm of the 3D physical world.
no code implementations • 9 Feb 2024 • Zhicheng Zheng, Xin Yan, Zhenfang Chen, Jingzhou Wang, Qin Zhi Eddie Lim, Joshua B. Tenenbaum, Chuang Gan
We evaluated a range of AI models and found that they still struggle to achieve satisfactory performance on ContPhy, which shows that the current AI models still lack physical commonsense for the continuum, especially soft-bodies, and illustrates the value of the proposed dataset.
no code implementations • 24 Aug 2023 • Shijie Zhang, Xin Yan, Xuejiao Yang, Binfeng Jia, Shuangyang Wang
In ExpLTV, we first innovatively design a deep neural network-based game whale detector that can not only infer the intrinsic order in accordance with monetary value, but also precisely identify high spenders (i. e., game whales) and low spenders.
no code implementations • 8 Mar 2023 • Xin Yan, Zuchao Li, Lefei Zhang, Bo Du, DaCheng Tao
Our proposed approach, \textbf{CCViT}, leverages k-means clustering to obtain centroids for image modeling without supervised training of tokenizer model.
1 code implementation • IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2021 • Xin Yan, Li Shen, Jicheng Wang, Xu Deng, Zhilin Li
The MSG module is proposed to use global semantic information to guide the learning of multiple features across different levels, and then respectively to utilize multi-level features for generating multi-scale CAMs.
Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation
no code implementations • NeurIPS 2021 • Yang Bai, Xin Yan, Yong Jiang, Shu-Tao Xia, Yisen Wang
Adversarial robustness has received increasing attention along with the study of adversarial examples.
1 code implementation • 25 Nov 2021 • Yang Bai, Xin Yan, Yong Jiang, Shu-Tao Xia, Yisen Wang
Adversarial robustness has received increasing attention along with the study of adversarial examples.
2 code implementations • CVPR 2020 • Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, ShiLiang Pu, Yueting Zhuang
To reduce the language biases, several recent works introduce an auxiliary question-only model to regularize the training of targeted VQA model, and achieve dominating performance on VQA-CP.
Ranked #1 on Visual Question Answering (VQA) on VQA-CP (using extra training data)