Search Results for author: Xin Yan

Found 8 papers, 3 papers with code

3D-VLA: A 3D Vision-Language-Action Generative World Model

no code implementations • 14 Mar 2024 • Haoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan, Yilun Du, Yining Hong, Chuang Gan

Recent vision-language-action (VLA) models rely on 2D inputs, lacking integration with the broader realm of the 3D physical world.

Language Modelling Large Language Model +1

Paper
Add Code

ContPhy: Continuum Physical Concept Learning and Reasoning from Videos

no code implementations • 9 Feb 2024 • Zhicheng Zheng, Xin Yan, Zhenfang Chen, Jingzhou Wang, Qin Zhi Eddie Lim, Joshua B. Tenenbaum, Chuang Gan

We evaluated a range of AI models and found that they still struggle to achieve satisfactory performance on ContPhy, which shows that the current AI models still lack physical commonsense for the continuum, especially soft-bodies, and illustrates the value of the proposed dataset.

Paper
Add Code

Out of the Box Thinking: Improving Customer Lifetime Value Modelling via Expert Routing and Game Whale Detection

no code implementations • 24 Aug 2023 • Shijie Zhang, Xin Yan, Xuejiao Yang, Binfeng Jia, Shuangyang Wang

In ExpLTV, we first innovatively design a deep neural network-based game whale detector that can not only infer the intrinsic order in accordance with monetary value, but also precisely identify high spenders (i. e., game whales) and low spenders.

Paper
Add Code

Centroid-centered Modeling for Efficient Vision Transformer Pre-training

no code implementations • 8 Mar 2023 • Xin Yan, Zuchao Li, Lefei Zhang, Bo Du, DaCheng Tao

Our proposed approach, \textbf{CCViT}, leverages k-means clustering to obtain centroids for image modeling without supervised training of tokenizer model.

Semantic Segmentation

Paper
Add Code

MSG-SR-Net: A Weakly Supervised Network Integrating Multi-Scale Generation and Super-Pixel Refinement for Building Extraction from High-Resolution Remotely Sensed Imageries

1 code implementation • IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2021 • Xin Yan, Li Shen, Jicheng Wang, Xu Deng, Zhilin Li

The MSG module is proposed to use global semantic information to guide the learning of multiple features across different levels, and then respectively to utilize multi-level features for generating multi-scale CAMs.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

Paper
Code

Clustering Effect of Adversarial Robust Models

no code implementations • NeurIPS 2021 • Yang Bai, Xin Yan, Yong Jiang, Shu-Tao Xia, Yisen Wang

Adversarial robustness has received increasing attention along with the study of adversarial examples.

Adversarial Robustness Clustering +1

Paper
Add Code

Clustering Effect of (Linearized) Adversarial Robust Models

1 code implementation • 25 Nov 2021 • Yang Bai, Xin Yan, Yong Jiang, Shu-Tao Xia, Yisen Wang

Adversarial robustness has received increasing attention along with the study of adversarial examples.

Adversarial Robustness Clustering +1

Paper
Code

Counterfactual Samples Synthesizing for Robust Visual Question Answering

2 code implementations • CVPR 2020 • Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, ShiLiang Pu, Yueting Zhuang

To reduce the language biases, several recent works introduce an auxiliary question-only model to regularize the training of targeted VQA model, and achieve dominating performance on VQA-CP.

Ranked #1 on Visual Question Answering (VQA) on VQA-CP (using extra training data)

counterfactual Question Answering +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.