no code implementations • ECCV 2020 • Runtao Liu, Qian Yu, Stella X. Yu
Humans can envision a realistic photo given a free-hand sketch that is not only spatially imprecise and geometrically distorted but also without colors and visual details.
1 code implementation • 25 Apr 2022 • Tsung-Wei Ke, Jyh-Jing Hwang, Yunhui Guo, Xudong Wang, Stella X. Yu
We enforce spatial consistency of grouping and bootstrap feature learning with co-segmentation among multiple views of the same image, and enforce semantic consistency across the grouping hierarchy with clustering transformers between coarse- and fine-grained features.
1 code implementation • 5 Jan 2022 • Xudong Wang, Zhirong Wu, Long Lian, Stella X. Yu
Our key insight is that pseudo-labels are naturally imbalanced due to intrinsic data similarity, even when a model is trained on balanced source data and evaluated on balanced target data.
Ranked #1 on
Semi-Supervised Image Classification
on ImageNet - 0.2% labeled data
(using extra training data)
no code implementations • 2 Dec 2021 • Utkarsh Singhal, Yifei Xing, Stella X. Yu
We study complex-valued scaling as a type of symmetry natural and unique to complex-valued measurements and representations.
1 code implementation • NeurIPS 2021 • Runtao Liu, Zhirong Wu, Stella X. Yu, Stephen Lin
Our model starts with two separate pathways: an appearance pathway that outputs feature-based region segmentation for a single image, and a motion pathway that outputs motion features for a pair of images.
Ranked #14 on
Video Polyp Segmentation
on SUN-SEG-Easy
no code implementations • 6 Oct 2021 • Xudong Wang, Long Lian, Stella X. Yu
Existing SSL methods focus on learning a model that effectively integrates information from given small labeled data and large unlabeled data, whereas we focus on selecting the right data to annotate for SSL without requiring any label or task information.
no code implementations • 23 Jul 2021 • Yunhui Guo, Xudong Wang, Yubei Chen, Stella X. Yu
Hyperbolic space can naturally embed hierarchies, unlike Euclidean space.
no code implementations • 15 Jul 2021 • Jiayun Wang, Yubei Chen, Stella X. Yu, Brian Cheung, Yann Lecun
Specifically, for a network, we create a recurrent parameter generator (RPG), from which the parameters of each convolution layer are generated.
no code implementations • 19 May 2021 • Sascha Hornauer, Ke Li, Stella X. Yu, Shabnam Ghaffarzadegan, Liu Ren
Recent progress in network-based audio event classification has shown the benefit of pre-training models on visual data such as ImageNet.
1 code implementation • 5 May 2021 • Zhongqi Miao, Ziwei Liu, Kaitlyn M. Gaynor, Meredith S. Palmer, Stella X. Yu, Wayne M. Getz
Camera trapping is increasingly used to monitor wildlife, but this technology typically requires extensive data annotation.
1 code implementation • ICLR 2021 • Tsung-Wei Ke, Jyh-Jing Hwang, Stella X. Yu
Weakly supervised segmentation requires assigning a label to every pixel based on training instances with partial annotations such as image-level tags, object bounding boxes, labeled points and scribbles.
no code implementations • CVPR 2021 • Xudong Wang, Long Lian, Stella X. Yu
Existing methods focus on training an RL policy that is universal to changing visual domains, whereas we focus on extracting visual foreground that is universal, feeding clean invariant vision to the RL policy learner.
no code implementations • 6 Mar 2021 • Ke Wang, Michael Kellman, Christopher M. Sandino, Kevin Zhang, Shreyas S. Vasanawala, Jonathan I. Tamir, Stella X. Yu, Michael Lustig
Deep learning (DL) based unrolled reconstructions have shown state-of-the-art performance for under-sampled magnetic resonance imaging (MRI).
2 code implementations • ICLR 2021 • Xudong Wang, Long Lian, Zhongqi Miao, Ziwei Liu, Stella X. Yu
We take a dynamic view of the training data and provide a principled model bias and variance analysis as the training data fluctuates: Existing long-tail classifiers invariably increase the model variance and the head-tail model bias gap remains large, due to more and larger confusion with hard negatives for the tail.
Ranked #4 on
Long-tail Learning
on CIFAR-100-LT (ρ=100)
1 code implementation • 25 Sep 2020 • Xudong Wang, Stella X. Yu
The concept of TBC can also be extended to group convolution and fully connected layers, and can be applied to various backbone networks and attention modules.
2 code implementations • CVPR 2021 • Xudong Wang, Ziwei Liu, Stella X. Yu
Unsupervised feature learning has made great strides with contrastive learning based on instance discrimination and invariant mapping, as benchmarked on curated class-balanced datasets.
Contrastive Learning
Semi-Supervised Image Classification
+2
1 code implementation • 17 Jun 2020 • Jiayun Wang, Jierui Lin, Qian Yu, Runtao Liu, Yubei Chen, Stella X. Yu
Additionally, we propose a sketch standardization module to handle different sketch distortions and styles.
1 code implementation • CVPR 2020 • Jiayun Wang, Yubei Chen, Rudrasis Chakraborty, Stella X. Yu
We develop an efficient approach to impose filter orthogonality on a convolutional layer based on the doubly block-Toeplitz matrix representation of the convolutional kernel instead of using the common kernel orthogonality approach, which we show is only necessary but not sufficient for ensuring orthogonal convolutions.
no code implementations • 29 Oct 2019 • Liu Yang, Rudrasis Chakraborty, Stella X. Yu
Our proposed model is rotationally invariant and can preserve geometric shape of a 3D point-cloud.
1 code implementation • ICCV 2019 • Jyh-Jing Hwang, Stella X. Yu, Jianbo Shi, Maxwell D. Collins, Tien-Ju Yang, Xiao Zhang, Liang-Chieh Chen
The proposed SegSort further produces an interpretable result, as each choice of label can be easily understood from the retrieved nearest segments.
Ranked #3 on
Unsupervised Semantic Segmentation
on PASCAL VOC 2012 val
(using extra training data)
no code implementations • 14 Oct 2019 • Qian Yu, Chaofeng Wang, Barbaros Cetiner, Stella X. Yu, Frank Mckenna, Ertugrul Taciroglu, Kincho H. Law
In the first case, a machine learning-assisted framework, BRAILS, is proposed for city-scale building information modeling.
no code implementations • CVPR 2020 • Ziwei Liu, Zhongqi Miao, Xingang Pan, Xiaohang Zhan, Dahua Lin, Stella X. Yu, Boqing Gong
A typical domain adaptation approach is to adapt models trained on the annotated data in a source domain (e. g., sunny weather) for achieving high performance on the test data in a target domain (e. g., rainy weather).
1 code implementation • 26 Jun 2019 • Jiayun Wang, Rudrasis Chakraborty, Stella X. Yu
We propose a novel end-to-end approach to learn different non-rigid transformations of the input point cloud so that optimal local neighborhoods can be adopted at each layer.
no code implementations • 24 Jun 2019 • Rudrasis Chakraborty, Jiayun Wang, Stella X. Yu
On RadioML, our model achieves comparable RF modulation classification accuracy at 10% of the baseline model size.
1 code implementation • CVPR 2019 • Ziwei Liu, Zhongqi Miao, Xiaohang Zhan, Jiayun Wang, Boqing Gong, Stella X. Yu
We define Open Long-Tailed Recognition (OLTR) as learning from such naturally distributed data and optimizing the classification accuracy over a balanced test set which include head, tail, and open classes.
2 code implementations • ECCV 2018 • Zhirong Wu, Alexei A. Efros, Stella X. Yu
Current major approaches to visual recognition follow an end-to-end formulation that classifies an input image into one of the pre-determined set of semantic categories.
2 code implementations • CVPR 2018 • Zhirong Wu, Yuanjun Xiong, Stella X. Yu, Dahua Lin
Neural net classifiers trained on data with annotated class labels can also capture apparent visual similarity among categories without being directed to do so.
Ranked #37 on
Semi-Supervised Image Classification
on ImageNet - 1% labeled data
(Top 5 Accuracy metric)
1 code implementation • CVPR 2019 • Jyh-Jing Hwang, Tsung-Wei Ke, Jianbo Shi, Stella X. Yu
The structure analyzer is trained to maximize the ASM loss, or to emphasize recurring multi-scale hard negative structural mistakes among co-occurring patterns.
no code implementations • 30 Mar 2018 • Jyh-Jing Hwang, Sergei Azernikov, Alexei A. Efros, Stella X. Yu
In the dental industry, it takes a technician years of training to design synthetic crowns that restore the function and integrity of missing teeth.
1 code implementation • ECCV 2018 • Tsung-Wei Ke, Jyh-Jing Hwang, Ziwei Liu, Stella X. Yu
Semantic segmentation has made much progress with increasingly powerful pixel-wise classifiers and incorporating structural priors via Conditional Random Fields (CRF) or Generative Adversarial Networks (GAN).
Ranked #48 on
Semantic Segmentation
on Cityscapes test
1 code implementation • 5 Dec 2017 • Jiayun Wang, Patrick Virtue, Stella X. Yu
To address the overfitting problem in aerial image classification, we consider the neural network as successive transformations of an input image into embedded feature representations and ultimately into a semantic class label, and train neural networks to optimize image representations in the embedded space in addition to optimizing the final classification score.
no code implementations • 1 Jul 2017 • Patrick Virtue, Stella X. Yu, Michael Lustig
The task of MRI fingerprinting is to identify tissue parameters from complex-valued MRI signals.
1 code implementation • CVPR 2017 • Jian Shi, Yue Dong, Hao Su, Stella X. Yu
Rendered with realistic environment maps, millions of synthetic images of objects and their corresponding albedo, shading, and specular ground-truth images are used to train an encoder-decoder CNN.
1 code implementation • CVPR 2017 • Tsung-Wei Ke, Michael Maire, Stella X. Yu
Most critically, multigrid structure enables networks to learn internal attention and dynamic routing mechanisms, and use them to accomplish tasks on which modern CNNs fail.
1 code implementation • ICCV 2017 • Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi
In this work, we show that we can detect important objects in first-person images without the supervision by the camera wearer or even third-person labelers.
no code implementations • ICCV 2017 • Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi
Finally, we use this feature to learn a basketball assessment model from pairs of labeled first-person basketball videos, for which a basketball expert indicates, which of the two players is better.
no code implementations • CVPR 2017 • Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi
It combines these two objectives via a novel random walk layer that enforces consistent spatial grouping in the deep layers of the network.
no code implementations • 21 May 2016 • Xingchao Peng, Judy Hoffman, Stella X. Yu, Kate Saenko
We address the difficult problem of distinguishing fine-grained object categories in low resolution images.
no code implementations • 15 Mar 2016 • Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi
Unlike traditional third-person cameras mounted on robots, a first-person camera, captures a person's visual sensorimotor object interactions from up close.
no code implementations • CVPR 2016 • Michael Maire, Takuya Narihira, Stella X. Yu
Spectral embedding provides a framework for solving perceptual organization problems, including image segmentation and figure/ground organization.
no code implementations • ICCV 2015 • Takuya Narihira, Michael Maire, Stella X. Yu
We demonstrate results on both the synthetic images of Sintel and the real images of the classic MIT intrinsic image dataset.
no code implementations • 23 Nov 2015 • Deepak Pathak, Philipp Krähenbühl, Stella X. Yu, Trevor Darrell
We present a regression framework which models the output distribution of neural networks.
no code implementations • 21 Nov 2015 • Takuya Narihira, Damian Borth, Stella X. Yu, Karl Ni, Trevor Darrell
We consider the visual sentiment task of mapping an image to an adjective noun pair (ANP) such as "cute baby".
no code implementations • CVPR 2015 • Takuya Narihira, Michael Maire, Stella X. Yu
We develop a new approach to inferring lightness, the perceived reflectance of surfaces, from a single image.
no code implementations • CVPR 2015 • Tinghui Zhou, Yong Jae Lee, Stella X. Yu, Alyosha A. Efros
Given a set of poorly aligned images of the same visual concept without any annotations, we propose an algorithm to jointly bring them into pixel-wise correspondence by estimating a FlowWeb representation of the image set.
no code implementations • 16 Oct 2014 • Michael Maire, Stella X. Yu, Pietro Perona
We frame the task of predicting a semantic labeling as a sparse reconstruction procedure that applies a target-specific learned transfer function to a generic deep sparse code representation of an image.
no code implementations • NeurIPS 2010 • Stella X. Yu
Size, color, and orientation have long been considered elementary features whose attributes are extracted in parallel and available to guide the deployment of attention.