Search Results for author: Wei-Lun Chao

Found 88 papers, 57 papers with code

TaxaDiffusion: Progressively Trained Diffusion Model for Fine-Grained Species Generation

1 code implementation2 Jun 2025 Amin Karimi Monsefi, Mridul Khurana, Rajiv Ramnath, Anuj Karpatne, Wei-Lun Chao, Cheng Zhang

We propose TaxaDiffusion, a taxonomy-informed training framework for diffusion models to generate fine-grained animal images with high morphological and identity accuracy.

Image Generation Transfer Learning

Revisiting semi-supervised learning in the era of foundation models

1 code implementation12 Mar 2025 Ping Zhang, Zheda Mai, Quang-Huy Nguyen, Wei-Lun Chao

We make a surprising observation: parameter-efficient fine-tuning (PEFT) using only labeled data often matches SSL performance, even without leveraging unlabeled data.

parameter-efficient fine-tuning Pseudo Label

Federated Inverse Probability Treatment Weighting for Individual Treatment Effect Estimation

no code implementations6 Mar 2025 Changchang Yin, Hong-You Chen, Wei-Lun Chao, Ping Zhang

To address this, we propose FED-IPTW, a novel algorithm to extend IPTW into a federated setting that enforces both global (over all the data) and local (within each hospital) decorrelation between covariates and treatments.

Building Machine Learning Challenges for Anomaly Detection in Science

no code implementations3 Mar 2025 Elizabeth G. Campolongo, Yuan-Tang Chou, Ekaterina Govorkova, Wahid Bhimji, Wei-Lun Chao, Chris Harris, Shih-Chieh Hsu, Hilmar Lapp, Mark S. Neubauer, Josephine Namayanja, Aneesh Subramanian, Philip Harris, Advaith Anand, David E. Carlyn, Subhankar Ghosh, Christopher Lawrence, Eric Moreno, Ryan Raikman, Jiaman Wu, Ziheng Zhang, Bayu Adhi, Mohammad Ahmadi Gharehtoragh, Saúl Alonso Monsalve, Marta Babicz, Furqan Baig, Namrata Banerji, William Bardon, Tyler Barna, Tanya Berger-Wolf, Adji Bousso Dieng, Micah Brachman, Quentin Buat, David C. Y. Hui, Phuong Cao, Franco Cerino, Yi-Chun Chang, Shivaji Chaulagain, An-Kai Chen, Deming Chen, Eric Chen, Chia-Jui Chou, Zih-Chen Ciou, Miles Cochran-Branson, Artur Cordeiro Oudot Choi, Michael Coughlin, Matteo Cremonesi, Maria Dadarlat, Peter Darch, Malina Desai, Daniel Diaz, Steven Dillmann, Javier Duarte, Isla Duporge, Urbas Ekka, Saba Entezari Heravi, Hao Fang, Rian Flynn, Geoffrey Fox, Emily Freed, Hang Gao, Jing Gao, Julia Gonski, Matthew Graham, Abolfazl Hashemi, Scott Hauck, James Hazelden, Joshua Henry Peterson, Duc Hoang, Wei Hu, Mirco Huennefeld, David Hyde, Vandana Janeja, Nattapon Jaroenchai, Haoyi Jia, Yunfan Kang, Maksim Kholiavchenko, Elham E. Khoda, Sangin Kim, Aditya Kumar, Bo-Cheng Lai, Trung Le, Chi-Wei Lee, Janghyeon Lee, Shaocheng Lee, Suzan van der Lee, Charles Lewis, Haitong Li, Haoyang Li, Henry Liao, Mia Liu, Xiaolin Liu, Xiulong Liu, Vladimir Loncar, Fangzheng Lyu, Ilya Makarov, Abhishikth Mallampalli Chen-Yu Mao, Alexander Michels, Alexander Migala, Farouk Mokhtar, Mathieu Morlighem, Min Namgung, Andrzej Novak, Andrew Novick, Amy Orsborn, Anand Padmanabhan, Jia-Cheng Pan, Sneh Pandya, Zhiyuan Pei, Ana Peixoto, George Percivall, Alex Po Leung, Sanjay Purushotham, Zhiqiang Que, Melissa Quinnan, Arghya Ranjan, Dylan Rankin, Christina Reissel, Benedikt Riedel, Dan Rubenstein, Argyro Sasli, Eli Shlizerman, Arushi Singh, Kim Singh, Eric R. Sokol, Arturo Sorensen, Yu Su, Mitra Taheri, Vaibhav Thakkar, Ann Mariam Thomas, Eric Toberer, Chenghan Tsai, Rebecca Vandewalle, Arjun Verma, Ricco C. Venterea, He Wang, Jianwu Wang, Sam Wang, Shaowen Wang, Gordon Watts, Jason Weitz, Andrew Wildridge, Rebecca Williams, Scott Wolf, Yue Xu, Jianqi Yan, Jai Yu, Yulei Zhang, Haoran Zhao, Ying Zhao, Yibo Zhong

We present the different datasets along with a scheme to make machine learning challenges around the three datasets findable, accessible, interoperable, and reusable (FAIR).

Anomaly Detection scientific discovery

A Closer Look at TabPFN v2: Strength, Limitation, and Extension

no code implementations24 Feb 2025 Han-Jia Ye, Si-Yang Liu, Wei-Lun Chao

Tabular datasets are inherently heterogeneous, posing significant challenges for developing pre-trained foundation models.

In-Context Learning

Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation

1 code implementation CVPR 2025 Ziheng Zhang, Jianyang Gu, Arpita Chowdhury, Zheda Mai, David Carlyn, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao

Despite its simplicity and computational efficiency, CAM often struggles to identify discriminative regions that distinguish visually similar fine-grained classes.

Computational Efficiency

Prompt-CAM: A Simpler Interpretable Transformer for Fine-Grained Analysis

1 code implementation16 Jan 2025 Arpita Chowdhury, Dipanjyoti Paul, Zheda Mai, Jianyang Gu, Ziheng Zhang, Kazi Sajeed Mehrab, Elizabeth G. Campolongo, Daniel Rubenstein, Charles V. Stewart, Anuj Karpatne, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao

We present a simple usage of pre-trained Vision Transformers (ViTs) for fine-grained analysis, aiming to identify and localize the traits that distinguish visually similar categories, such as different bird species or dog breeds.

Explainable Artificial Intelligence (XAI) Explainable Models +3

Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis

1 code implementation CVPR 2025 Arpita Chowdhury, Dipanjyoti Paul, Zheda Mai, Jianyang Gu, Ziheng Zhang, Kazi Sajeed Mehrab, Elizabeth G. Campolongo, Daniel Rubenstein, Charles V. Stewart, Anuj Karpatne, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao

We present a simple approach to make pre-trained Vision Transformers (ViTs) interpretable for fine-grained analysis, aiming to identify and localize the traits that distinguish visually similar categories, such as bird species.

Visual Prompt Tuning

Learning 3D Perception from Others' Predictions

no code implementations3 Oct 2024 Jinsu Yoo, Zhenyang Feng, Tai-Yu Pan, Yihong Sun, Cheng Perng Phoo, Xiangyu Chen, Mark Campbell, Kilian Q. Weinberger, Bharath Hariharan, Wei-Lun Chao

We investigate a new scenario to construct 3D object detectors: learning from the predictions of a nearby unit that is equipped with an accurate detector.

3D Object Detection object-detection +1

KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models

1 code implementation2 Oct 2024 Pouyan Navard, Amin Karimi Monsefi, Mengxi Zhou, Wei-Lun Chao, Alper Yilmaz, Rajiv Ramnath

Recent advances in diffusion models have significantly improved text-to-image (T2I) generation, but they often struggle to balance fine-grained precision with high-level control.

Image Generation

Fine-Tuning is Fine, if Calibrated

1 code implementation24 Sep 2024 Zheda Mai, Arpita Chowdhury, Ping Zhang, Cheng-Hao Tu, Hong-You Chen, Vardaan Pahuja, Tanya Berger-Wolf, Song Gao, Charles Stewart, Yu Su, Wei-Lun Chao

For example, fine-tuning a pre-trained classifier capable of recognizing a large number of classes to master a subset of classes at hand is shown to drastically degrade the model's accuracy in the other classes it had previously learned.

Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition

2 code implementations CVPR 2025 Zheda Mai, Ping Zhang, Cheng-Hao Tu, Hong-You Chen, Li Zhang, Wei-Lun Chao

Second, despite similar accuracy, we find that PEFT methods make different mistakes and high-confidence predictions, likely due to their different inductive biases.

parameter-efficient fine-tuning Transfer Learning

FedNE: Surrogate-Assisted Federated Neighbor Embedding for Dimensionality Reduction

no code implementations17 Sep 2024 Ziwei Li, Xiaoqi Wang, Hong-You Chen, Han-Wei Shen, Wei-Lun Chao

Federated learning (FL) has rapidly evolved as a promising paradigm that enables collaborative model training across distributed participants without exchanging their local data.

Dimensionality Reduction Federated Learning +1

Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning

1 code implementation16 Sep 2024 Amin Karimi Monsefi, Mengxi Zhou, Nastaran Karimi Monsefi, Ser-Nam Lim, Wei-Lun Chao, Rajiv Ramnath

Second, we employ a two-branch framework empowered by knowledge distillation, enabling the model to take both the filtered and original images as input, largely reducing the burden of downstream tasks.

Few-Shot Learning image-classification +5

VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images

1 code implementation28 Aug 2024 M. Maruf, Arka Daw, Kazi Sajeed Mehrab, Harish Babu Manogaran, Abhilash Neog, Medha Sawhney, Mridul Khurana, James P. Balhoff, Yasin Bakis, Bahadir Altintas, Matthew J. Thompson, Elizabeth G. Campolongo, Josef C. Uyeda, Hilmar Lapp, Henry L. Bart, Paula M. Mabee, Yu Su, Wei-Lun Chao, Charles Stewart, Tanya Berger-Wolf, Wasila Dahdul, Anuj Karpatne

Images are increasingly becoming the currency for documenting biodiversity on the planet, providing novel opportunities for accelerating scientific discoveries in the field of organismal biology, especially with the advent of large vision-language models (VLMs).

Hallucination

MLLM-CompBench: A Comparative Reasoning Benchmark for Multimodal LLMs

1 code implementation23 Jul 2024 Jihyung Kil, Zheda Mai, Justin Lee, Zihe Wang, Kerrie Cheng, Lemeng Wang, Ye Liu, Arpita Chowdhury, Wei-Lun Chao

In this paper, we introduce MLLM-CompBench, a benchmark designed to evaluate the comparative reasoning capability of multimodal large language models (MLLMs).

Attribute

Jigsaw Game: Federated Clustering

no code implementations17 Jul 2024 Jinxuan Xu, Hong-You Chen, Wei-Lun Chao, Yuqian Zhang

Federated learning has recently garnered significant attention, especially within the domain of supervised learning.

Clustering Federated Learning +1

Dual-View Visual Contextualization for Web Navigation

no code implementations CVPR 2024 Jihyung Kil, Chan Hee Song, Boyuan Zheng, Xiang Deng, Yu Su, Wei-Lun Chao

Automatic web navigation aims to build a web agent that can follow language instructions to execute complex and diverse tasks on real-world websites.

Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs

1 code implementation31 Dec 2023 Vardaan Pahuja, Weidi Luo, Yu Gu, Cheng-Hao Tu, Hong-You Chen, Tanya Berger-Wolf, Charles Stewart, Song Gao, Wei-Lun Chao, Yu Su

In this work, we exploit the structured context linked to camera trap images to boost out-of-distribution generalization for species classification tasks in camera traps.

 Ranked #1 on Image Classification on iWildCam2020-WILDS (using extra training data)

Image Classification Knowledge Graphs +2

BioCLIP: A Vision Foundation Model for the Tree of Life

3 code implementations CVPR 2024 Samuel Stevens, Jiaman Wu, Matthew J Thompson, Elizabeth G Campolongo, Chan Hee Song, David Edward Carlyn, Li Dong, Wasila M Dahdul, Charles Stewart, Tanya Berger-Wolf, Wei-Lun Chao, Yu Su

We then develop BioCLIP, a foundation model for the tree of life, leveraging the unique properties of biology captured by TreeOfLife-10M, namely the abundance and variety of images of plants, animals, and fungi, together with the availability of rich structured biological knowledge.

Pre-Training LiDAR-Based 3D Object Detectors Through Colorization

1 code implementation23 Oct 2023 Tai-Yu Pan, Chenyang Ma, Tianle Chen, Cheng Perng Phoo, Katie Z Luo, Yurong You, Mark Campbell, Kilian Q. Weinberger, Bharath Hariharan, Wei-Lun Chao

Accurate 3D object detection and understanding for self-driving cars heavily relies on LiDAR point clouds, necessitating large amounts of labeled data to train.

3D Object Detection Colorization +4

FLEE-GNN: A Federated Learning System for Edge-Enhanced Graph Neural Network in Analyzing Geospatial Resilience of Multicommodity Food Flows

1 code implementation20 Oct 2023 Yuxiao Qu, Jinmeng Rao, Song Gao, Qianheng Zhang, Wei-Lun Chao, Yu Su, Michelle Miller, Alfonso Morales, Patrick Huber

This paper proposes FLEE-GNN, a novel Federated Learning System for Edge-Enhanced Graph Neural Network, designed to overcome these challenges and enhance the analysis of geospatial resilience of multicommodity food flow network, which is one type of spatial networks.

Federated Learning Graph Neural Network

Towards Open-World Segmentation of Parts

1 code implementation CVPR 2023 Tai-Yu Pan, Qing Liu, Wei-Lun Chao, Brian Price

Second, we introduce a novel approach to improve part segmentation on unseen objects, inspired by an interesting finding -- for unseen objects, the pixel-wise features extracted by the model often reveal high-quality part segments.

Contrastive Learning Segmentation

Segment Anything Model (SAM) Enhanced Pseudo Labels for Weakly Supervised Semantic Segmentation

1 code implementation9 May 2023 Tianle Chen, Zheda Mai, Ruiwen Li, Wei-Lun Chao

Weakly supervised semantic segmentation (WSSS) aims to bypass the need for laborious pixel-level annotation by using only image-level annotation.

Object Pseudo Label +2

Unified Out-Of-Distribution Detection: A Model-Specific Perspective

no code implementations ICCV 2023 Reza Averly, Wei-Lun Chao

We show that this framework unifies the detection of OOD examples caused by semantic shift and covariate shift, and closely addresses the concern of applying a machine learning model to uncontrolled environments.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

Unsupervised Adaptation from Repeated Traversals for Autonomous Driving

1 code implementation27 Mar 2023 Yurong You, Cheng Perng Phoo, Katie Z Luo, Travis Zhang, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

For a self-driving car to operate reliably, its perceptual system must generalize to the end-user's environment -- ideally without additional annotation efforts.

3D Object Detection Autonomous Driving +2

Learning Fractals by Gradient Descent

1 code implementation14 Mar 2023 Cheng-Hao Tu, Hong-You Chen, David Carlyn, Wei-Lun Chao

Fractals are geometric shapes that can display complex and self-similar patterns found in nature (e. g., clouds and plants).

Making Batch Normalization Great in Federated Deep Learning

no code implementations12 Mar 2023 Jike Zhong, Hong-You Chen, Wei-Lun Chao

We reinvestigate factors that are believed to cause this problem, including the mismatch of BN statistics across clients and the deviation of gradients during local training.

Deep Learning Federated Learning

Train-Once-for-All Personalization

no code implementations CVPR 2023 Hong-You Chen, Yandong Li, Yin Cui, Mingda Zhang, Wei-Lun Chao, Li Zhang

We study the problem of how to train a "personalization-friendly" model such that given only the task descriptions, the model can be adapted to different end-users' needs, e. g., for accurately classifying different subsets of objects.

All

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

1 code implementation ICCV 2023 Chan Hee Song, Jiaman Wu, Clayton Washington, Brian M. Sadler, Wei-Lun Chao, Yu Su

In this work, we propose a novel method, LLM-Planner, that harnesses the power of large language models to do few-shot planning for embodied agents.

Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning

2 code implementations CVPR 2023 Cheng-Hao Tu, Zheda Mai, Wei-Lun Chao

Through introducing a handful of learnable ``query'' tokens to each layer, VQT leverages the inner workings of Transformers to ``summarize'' rich intermediate features of each layer, which can then be used to train the prediction heads of downstream tasks.

parameter-efficient fine-tuning Transfer Learning

Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs

no code implementations23 Sep 2022 Youya Xia, Josephine Monica, Wei-Lun Chao, Bharath Hariharan, Kilian Q Weinberger, Mark Campbell

In this paper, we investigate the idea of turning sensor inputs (i. e., images) captured in an adverse condition into a benign one (i. e., sunny), upon which the downstream tasks (e. g., semantic segmentation) can attain high accuracy.

Autonomous Driving Image-to-Image Translation +4

PreSTU: Pre-Training for Scene-Text Understanding

no code implementations ICCV 2023 Jihyung Kil, Soravit Changpinyo, Xi Chen, Hexiang Hu, Sebastian Goodman, Wei-Lun Chao, Radu Soricut

The ability to recognize and reason about text embedded in visual inputs is often lacking in vision-and-language (V&L) models, perhaps because V&L pre-training methods have often failed to include such an ability in their training objective.

Decoder Image Captioning +3

Gradual Domain Adaptation without Indexed Intermediate Domains

1 code implementation NeurIPS 2021 Hong-You Chen, Wei-Lun Chao

This coarse domain sequence then undergoes a fine indexing step via a novel cycle-consistency loss, which encourages the next intermediate domain to preserve sufficient discriminative knowledge of the current intermediate domain.

Unsupervised Domain Adaptation

On the Importance and Applicability of Pre-Training for Federated Learning

1 code implementation23 Jun 2022 Hong-You Chen, Cheng-Hao Tu, Ziwei Li, Han-Wei Shen, Wei-Lun Chao

To make our findings applicable to situations where pre-trained models are not directly available, we explore pre-training with synthetic data or even with clients' data in a decentralized manner, and found that they can already improve FL notably.

Federated Learning

Learning with Free Object Segments for Long-Tailed Instance Segmentation

no code implementations22 Feb 2022 Cheng Zhang, Tai-Yu Pan, Tianle Chen, Jike Zhong, WenJin Fu, Wei-Lun Chao

One fundamental challenge in building an instance segmentation model for a large number of classes in complex scenes is the lack of training examples, especially for rare objects.

Instance Segmentation Object +1

One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones

1 code implementation CVPR 2022 Chan Hee Song, Jihyung Kil, Tai-Yu Pan, Brian M. Sadler, Wei-Lun Chao, Yu Su

We study the problem of developing autonomous agents that can follow human instructions to infer and perform a sequence of actions to complete the underlying task.

Vision and Language Navigation

Two-Stage Mesh Deep Learning for Automated Tooth Segmentation and Landmark Localization on 3D Intraoral Scans

no code implementations24 Sep 2021 Tai-Hsien Wu, Chunfeng Lian, Sanghee Lee, Matthew Pastewait, Christian Piers, Jie Liu, Fang Wang, Li Wang, Chiung-Ying Chiu, Wenchi Wang, Christina Jackson, Wei-Lun Chao, Dinggang Shen, Ching-Chang Ko

Our TS-MDL first adopts an end-to-end \emph{i}MeshSegNet method (i. e., a variant of the existing MeshSegNet with both improved accuracy and efficiency) to label each tooth on the downsampled scan.

Code Generation

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation

1 code implementation NeurIPS 2021 Tai-Yu Pan, Cheng Zhang, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao

We propose NorCal, Normalized Calibration for long-tailed object detection and instance segmentation, a simple and straightforward recipe that reweighs the predicted scores of each class by its training sample size.

Instance Segmentation Long-tailed Object Detection +4

On Bridging Generic and Personalized Federated Learning for Image Classification

3 code implementations ICLR 2022 Hong-You Chen, Wei-Lun Chao

On the one hand, we introduce a family of losses that are robust to non-identical class distributions, enabling clients to train a generic predictor with a consistent objective across them.

Classification image-classification +2

Few-Shot Learning with a Strong Teacher

1 code implementation1 Jul 2021 Han-Jia Ye, Lu Ming, De-Chuan Zhan, Wei-Lun Chao

Many existing works take the meta-learning approach, constructing a few-shot learner that can learn from few-shot examples to generate a classifier.

Few-Shot Learning

How to Train Your MAML to Excel in Few-Shot Classification

1 code implementation ICLR 2022 Han-Jia Ye, Wei-Lun Chao

We find that these permutations lead to a huge variance of accuracy, making MAML unstable in few-shot classification.

Classification Meta-Learning

Procrustean Training for Imbalanced Deep Learning

no code implementations ICCV 2021 Han-Jia Ye, De-Chuan Zhan, Wei-Lun Chao

To correct these wrong predictions, the neural network then must focus on pushing features of minor class data across the decision boundaries between major and minor classes, leading to much larger gradients for features of minor classes.

Attribute Deep Learning

MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection

1 code implementation ICCV 2021 Cheng Zhang, Tai-Yu Pan, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao

Many objects do not appear frequently enough in complex scenes (e. g., certain handbags in living rooms) for training an accurate object detector, but are often found frequently by themselves (e. g., in product images).

Imputation Instance Segmentation +5

FedBE: Making Bayesian Model Ensemble Applicable to Federated Learning

2 code implementations ICLR 2021 Hong-You Chen, Wei-Lun Chao

Federated learning aims to collaboratively train a strong global model by accessing users' locally trained models but not their own data.

Bayesian Inference Federated Learning

Deep Co-Training with Task Decomposition for Semi-Supervised Domain Adaptation

1 code implementation ICCV 2021 Luyu Yang, Yan Wang, Mingfei Gao, Abhinav Shrivastava, Kilian Q. Weinberger, Wei-Lun Chao, Ser-Nam Lim

To integrate the strengths of the two classifiers, we apply the well-established co-training framework, in which the two classifiers exchange their high confident predictions to iteratively "teach each other" so that both classifiers can excel in the target domain.

Semi-supervised Domain Adaptation Unsupervised Domain Adaptation

Interactive Natural Language-based Person Search

1 code implementation19 Feb 2020 Vikram Shree, Wei-Lun Chao, Mark Campbell

In this work, we consider the problem of searching people in an unconstrained environment, with natural language descriptions.

Person Search Question Answering

An Empirical Study of Person Re-Identification with Attributes

1 code implementation25 Jan 2020 Vikram Shree, Wei-Lun Chao, Mark Campbell

Person re-identification aims to identify a person from an image collection, given one image of that person as the query.

Attribute Person Re-Identification

Visual Question Answering on 360° Images

no code implementations10 Jan 2020 Shih-Han Chou, Wei-Lun Chao, Wei-Sheng Lai, Min Sun, Ming-Hsuan Yang

We then study two different VQA models on VQA 360, including one conventional model that takes an equirectangular image (with intrinsic distortion) as input and one dedicated model that first projects a 360 image onto cubemaps and subsequently aggregates the information from multiple spatial resolutions.

Question Answering Visual Question Answering

Identifying and Compensating for Feature Deviation in Imbalanced Deep Learning

1 code implementation6 Jan 2020 Han-Jia Ye, Hong-You Chen, De-Chuan Zhan, Wei-Lun Chao

Classifiers trained with class-imbalanced data are known to perform poorly on test data of the "minor" classes, of which we have insufficient training data.

Deep Learning

LDLS: 3-D Object Segmentation Through Label Diffusion From 2-D Images

1 code implementation30 Oct 2019 Brian H. Wang, Wei-Lun Chao, Yan Wang, Bharath Hariharan, Kilian Q. Weinberger, Mark Campbell

We obtain 2-D segmentation predictions by applying Mask-RCNN to the RGB image, and then link this image to a 3-D lidar point cloud by building a graph of connections among 3-D points and 2-D pixels.

Image Segmentation Point Cloud Segmentation +2

A New Defense Against Adversarial Images: Turning a Weakness into a Strength

1 code implementation NeurIPS 2019 Tao Yu, Shengyuan Hu, Chuan Guo, Wei-Lun Chao, Kilian Q. Weinberger

Natural images are virtually surrounded by low-density misclassified regions that can be efficiently discovered by gradient-guided search --- enabling the generation of adversarial images.

Adversarial Defense

An Empirical Study on Leveraging Scene Graphs for Visual Question Answering

no code implementations28 Jul 2019 Cheng Zhang, Wei-Lun Chao, Dong Xuan

Specifically, we investigate the use of scene graphs derived from images for Visual QA: an image is abstractly represented by a graph with nodes corresponding to object entities and edges to object relationships.

Knowledge Graphs Question Answering +1

Classifier and Exemplar Synthesis for Zero-Shot Learning

1 code implementation16 Dec 2018 Soravit Changpinyo, Wei-Lun Chao, Boqing Gong, Fei Sha

Zero-shot learning (ZSL) enables solving a task without the need to see its examples.

Denoising Zero-Shot Learning

Learning Answer Embeddings for Visual Question Answering

no code implementations CVPR 2018 Hexiang Hu, Wei-Lun Chao, Fei Sha

These properties make the approach particularly appealing for transfer learning for open-ended Visual QA, where the source dataset on which the model is learned has limited overlapping with the target dataset in the space of answers.

Question Answering Transfer Learning +1

Cross-Dataset Adaptation for Visual Question Answering

no code implementations CVPR 2018 Wei-Lun Chao, Hexiang Hu, Fei Sha

Analogous to domain adaptation for visual recognition, this setting is appealing when the target dataset does not have a sufficient amount of labeled data to learn an "in-domain" model.

Domain Adaptation Question Answering +1

Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets

no code implementations NAACL 2018 Wei-Lun Chao, Hexiang Hu, Fei Sha

We apply the procedures to re-construct decoy answers for two popular Visual QA datasets as well as to create a new Visual QA dataset from the Visual Genome project, resulting in the largest dataset for this task.

Multiple-choice Question Answering +1

Video Summarization with Long Short-term Memory

1 code implementation26 May 2016 Ke Zhang, Wei-Lun Chao, Fei Sha, Kristen Grauman

We propose a novel supervised learning technique for summarizing videos by automatically selecting keyframes or key subshots.

Domain Adaptation Structured Prediction +1

Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning

no code implementations ICCV 2017 Soravit Changpinyo, Wei-Lun Chao, Fei Sha

Leveraging class semantic descriptions and examples of known objects, zero-shot learning makes it possible to train a recognition model for an object class whose examples are not available.

Clustering Object +1

An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild

1 code implementation13 May 2016 Wei-Lun Chao, Soravit Changpinyo, Boqing Gong, Fei Sha

Zero-shot learning (ZSL) methods have been studied in the unrealistic setting where test data are assumed to come from unseen classes only.

Few-Shot Learning Generalized Zero-Shot Learning +1

Summary Transfer: Exemplar-based Subset Selection for Video Summarization

no code implementations CVPR 2016 Ke Zhang, Wei-Lun Chao, Fei Sha, Kristen Grauman

Video summarization has unprecedented importance to help us digest, browse, and search today's ever-growing video collections.

Video Summarization

Synthesized Classifiers for Zero-Shot Learning

2 code implementations CVPR 2016 Soravit Changpinyo, Wei-Lun Chao, Boqing Gong, Fei Sha

Given semantic descriptions of object classes, zero-shot learning aims to accurately recognize objects of the unseen classes, from which no examples are available at the training stage, by associating them to the seen classes, from which labeled examples are provided.

Object Zero-Shot Learning

Large-Margin Determinantal Point Processes

no code implementations6 Nov 2014 Boqing Gong, Wei-Lun Chao, Kristen Grauman, Fei Sha

Extensive empirical studies validate our contributions, including applications on challenging document and video summarization, where flexibility in modeling the kernel matrix and balancing different errors is indispensable.

Diversity parameter estimation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.