1 code implementation • ECCV 2020 • Wenxuan Wu, Zhi Yuan Wang, Zhuwen Li, Wei Liu, Li Fuxin
We propose a novel end-to-end deep scene flow model, called PointPWC-Net, that directly processes 3D point cloud scenes with large motions in a coarse-to-fine fashion.
1 code implementation • 18 Mar 2025 • Xiaoying Xing, Chia-Wen Kuo, Li Fuxin, Yulei Niu, Fan Chen, Ming Li, Ying Wu, Longyin Wen, Sijie Zhu
Large Vision-Language Models (LVLMs) have shown promising performance in vision-language understanding and reasoning tasks.
no code implementations • 30 Oct 2024 • Chen Ziwen, Zexiang Xu, Li Fuxin
Our model maintains a global point cloud representation of the scene, continuously updating the features and 3D locations of points as new images are observed.
no code implementations • 16 Oct 2024 • Chen Ziwen, Hao Tan, Kai Zhang, Sai Bi, Fujun Luan, Yicong Hong, Li Fuxin, Zexiang Xu
Unlike previous feed-forward models that are limited to processing 1~4 input images and can only reconstruct a small portion of a large scene, Long-LRM reconstructs the entire scene in a single feed-forward step.
no code implementations • 18 Jul 2024 • Hung Nguyen, Chanho Kim, Rigved Naukarkar, Li Fuxin
Long-term point tracking is essential to understand non-rigid motion in the physical world better.
1 code implementation • IEEE International Conference on Robotics and Automation (ICRA) 2024 • Amin Ullah, Taiqing Yan, Li Fuxin
Deep learning has brought transformative advancements to object segmentation, especially in marine robotics contexts such as waste management and subaquatic infrastructure oversight.
no code implementations • 29 Apr 2024 • Skand Peri, Iain Lee, Chanho Kim, Li Fuxin, Tucker Hermans, Stefan Lee
In this work, we examine robustness to a suite of these types of visual changes for RGB-D and point cloud based visual control policies.
no code implementations • CVPR 2024 • Chanho Kim, Li Fuxin
Modeling object dynamics with a neural network is an important problem with numerous applications.
1 code implementation • CVPR 2024 • Saeed Khorram, Mingqi Jiang, Mohamad Shahbazi, Mohamad H. Danesh, Li Fuxin
In the presence of imbalanced multi-class training data, GANs tend to favor classes with more samples, leading to the generation of low-quality and less diverse samples in tail classes.
no code implementations • 26 Sep 2023 • Yixuan Huang, Jialin Yuan, Chanho Kim, Pupul Pradhan, Bryan Chen, Li Fuxin, Tucker Hermans
Robots need to have a memory of previously observed, but currently occluded objects to work reliably in realistic environments.
1 code implementation • CVPR 2023 • Chen Ziwen, Kaushik Patnaik, Shuangfei Zhai, Alvin Wan, Zhile Ren, Alex Schwing, Alex Colburn, Li Fuxin
To achieve this, we propose AutoFocusFormer (AFF), a local-attention transformer image recognition backbone, which performs adaptive downsampling by learning to retain the most important pixels for the task.
Ranked #3 on
Instance Segmentation
on Cityscapes val
no code implementations • 29 Jan 2023 • Jialin Yuan, Jay Patravali, Hung Nguyen, Chanho Kim, Li Fuxin
On the related problem of video instance segmentation, our method shows competitive performance with the previous best algorithm that requires joint training with the VOS algorithm.
no code implementations • CVPR 2024 • Mingqi Jiang, Saeed Khorram, Li Fuxin
In order to gain insights about the decision-making of different visual recognition backbones, we propose two methodologies, sub-explanation counting and cross-testing, that systematically applies deep explanation algorithms on a dataset-wide basis, and compares the statistics generated from the amount and nature of the explanations.
no code implementations • CVPR 2023 • Wenxuan Wu, Li Fuxin, Qi Shan
Hence, we preserved the invariances from point convolution, whereas attention helps to select relevant points in the neighborhood for convolution.
no code implementations • 1 Aug 2022 • Ye Yu, Jialin Yuan, Gaurav Mittal, Li Fuxin, Mei Chen
It captures object motion in the video via a novel optical flow calibration module that fuses the segmentation mask with optical flow estimation to improve within-object optical flow smoothness and reduce noise at object boundaries.
Ranked #1 on
Video Object Segmentation
on DAVIS 2017 (test-dev)
(using extra training data)
no code implementations • CVPR 2022 • Saeed Khorram, Li Fuxin
CounterFactual (CF) visual explanations try to find images similar to the query image that change the decision of a vision system to a specified outcome.
no code implementations • 13 Sep 2021 • Li Fuxin, Zhongang Qi, Saeed Khorram, Vivswan Shitole, Prasad Tadepalli, Minsuk Kahng, Alan Fern
This paper summarizes our endeavors in the past few years in terms of explaining image classifiers, with the aim of including negative results and insights we have gained.
no code implementations • 1 May 2021 • Erich Merrill, Stefan Lee, Li Fuxin, Thomas G. Dietterich, Alan Fern
We consider the problem of modeling the dynamics of continuous spatial-temporal processes represented by irregular samples through both space and time.
no code implementations • 1 May 2021 • Saeed Khorram, Xiao Fu, Mohamad H. Danesh, Zhongang Qi, Li Fuxin
We prove the convergence of our proposed method and justify its capabilities through experiments in supervised and weakly-supervised settings.
no code implementations • ICLR 2021 • Xiaoling Hu, Yusu Wang, Li Fuxin, Dimitris Samaras, Chao Chen
In the segmentation of fine-scale structures from natural and biomedical images, per-pixel accuracy is not the only metric of concern.
no code implementations • 1 Mar 2021 • Neale Ratzlaff, Qinxun Bai, Li Fuxin, Wei Xu
Recently, particle-based variational inference (ParVI) methods have gained interest because they can avoid arbitrary parametric assumptions that are common in variational inference.
1 code implementation • CVPR 2021 • Chanho Kim, Li Fuxin, Mazen Alotaibi, James M. Rehg
Many approaches model each target in isolation and lack the ability to use all the targets in the scene to jointly update the memory.
no code implementations • 19 Jan 2021 • Xingyi Li, Wenxuan Wu, Xiaoli Z. Fern, Li Fuxin
This paper investigates different variants of PointConv, a convolution network on point clouds, to examine their robustness to input scale and rotation changes.
1 code implementation • NeurIPS 2021 • Vivswan Shitole, Li Fuxin, Minsuk Kahng, Prasad Tadepalli, Alan Fern
Attention maps are a popular way of explaining the decisions of convolutional networks for image classification.
1 code implementation • NeurIPS 2020 • Jialin Yuan, Chao Chen, Li Fuxin
Specifically, we propose a variational relaxation of instance segmentation as minimizing an optimization functional for a piecewise-constant segmentation problem, which can be used to train an FCN end-to-end.
no code implementations • ICLR 2020 • Jun Li, Li Fuxin, Sinisa Todorovic
We specify two new optimization algorithms: Cayley SGD with momentum, and Cayley ADAM on the Stiefel manifold.
2 code implementations • 27 Nov 2019 • Wenxuan Wu, Zhiyuan Wang, Zhuwen Li, Wei Liu, Li Fuxin
We propose a novel end-to-end deep scene flow model, called PointPWC-Net, on 3D point clouds in a coarse-to-fine fashion.
1 code implementation • 23 Nov 2019 • Chen Ziwen, Wenxuan Wu, Zhongang Qi, Li Fuxin
In this paper, we propose a novel approach to visualize features important to the point cloud classifiers.
no code implementations • ICML 2020 • Neale Ratzlaff, Qinxun Bai, Li Fuxin, Wei Xu
Each random draw from our generative model is a neural network that instantiates the dynamic function, hence multiple draws would approximate the posterior, and the variance in the future prediction based on this posterior is used as an intrinsic reward for exploration.
no code implementations • 25 Sep 2019 • Lawrence Neal, Li Fuxin, Xiaoli Fern
In sequential tasks, planning-based agents have a number of advantages over model-free agents, including sample efficiency and interpretability.
4 code implementations • NeurIPS 2019 • Xiaoling Hu, Li Fuxin, Dimitris Samaras, Chao Chen
Segmentation algorithms are prone to make topological errors on fine-scale structures, e. g., broken connections.
1 code implementation • 2 May 2019 • Zhongang Qi, Saeed Khorram, Li Fuxin
Understanding and interpreting the decisions made by deep learning models is valuable in many domains.
7 code implementations • ICCV 2019 • Xinyao Wang, Liefeng Bo, Li Fuxin
Then we propose a novel loss function, named Adaptive Wing loss, that is able to adapt its shape to different types of ground truth heatmap pixels.
Ranked #8 on
Face Alignment
on WFW (Extra Data)
1 code implementation • 30 Jan 2019 • Neale Ratzlaff, Li Fuxin
We introduce HyperGAN, a new generative model for learning a distribution of neural network parameters.
9 code implementations • CVPR 2019 • Wenxuan Wu, Zhongang Qi, Li Fuxin
Besides, our experiments converting CIFAR-10 into a point cloud showed that networks built on PointConv can match the performance of convolutional networks in 2D images of a similar structure.
Ranked #2 on
3D Part Segmentation
on IntrA
no code implementations • 5 Apr 2018 • Neale Ratzlaff, Li Fuxin
To evaluate against an adversary with complete knowledge of our defense, we adapt the bilateral filter as a trainable layer in a neural network and show that adding this layer makes ImageNet images significantly more robust to attacks.