Search Results for author: Jia-Bin Huang

Found 97 papers, 47 papers with code

Shuffle and Attend: Video Domain Adaptation

no code implementations ECCV 2020 Jinwoo Choi, Gaurav Sharma, Samuel Schulter, Jia-Bin Huang

As the first novelty, we propose an attention mechanism which focuses on more discriminative clips and directly optimizes for video-level (cf.

Action Recognition Temporal Action Localization +1

CTGAN: Semantic-guided Conditional Texture Generator for 3D Shapes

no code implementations 8 Feb 2024 Yi-Ting Pan, Chai-Rong Lee, Shu-Ho Fan, Jheng-Wei Su, Jia-Bin Huang, Yung-Yu Chuang, Hung-Kuo Chu

The entertainment industry relies on 3D visual content to create immersive experiences, but traditional methods for creating textured 3D models can be time-consuming and subjective.

Image Generation Texture Synthesis

IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images

no code implementations 23 Jan 2024 Zhi-Hao Lin, Jia-Bin Huang, Zhengqin Li, Zhao Dong, Christian Richardt, Tuotuo Li, Michael Zollhöfer, Johannes Kopf, Shenlong Wang, Changil Kim

While numerous 3D reconstruction and novel-view synthesis methods allow for photorealistic rendering of a scene from multi-view images easily captured with consumer cameras, they bake illumination in their representations and fall short of supporting advanced applications like material editing, relighting, and virtual object insertion.

3D Reconstruction Inverse Rendering +1

TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion

no code implementations 17 Jan 2024 Yu-Ying Yeh, Jia-Bin Huang, Changil Kim, Lei Xiao, Thu Nguyen-Phuoc, Numair Khan, Cheng Zhang, Manmohan Chandraker, Carl S Marshall, Zhao Dong, Zhengqin Li

In contrast, TextureDreamer can transfer highly detailed, intricate textures from real-world environments to arbitrary objects with only a few casually captured images, potentially significantly democratizing texture creation.

Texture Synthesis

Fast View Synthesis of Casual Videos

no code implementations 4 Dec 2023 Yao-Chih Lee, Zhoutong Zhang, Kevin Blackburn-Matzen, Simon Niklaus, Jianming Zhang, Jia-Bin Huang, Feng Liu

Specifically, we build a global static scene model using an extended plane-based scene representation to synthesize temporally coherent novel video.

Novel View Synthesis

Single-Image 3D Human Digitization with Shape-Guided Diffusion

no code implementations 15 Nov 2023 Badour AlBahar, Shunsuke Saito, Hung-Yu Tseng, Changil Kim, Johannes Kopf, Jia-Bin Huang

We present an approach to generate a 360-degree view of a person with a consistent, high-resolution appearance from a single input image.

Image Generation Inverse Rendering

OmnimatteRF: Robust Omnimatte with 3D Background Modeling

1 code implementation ICCV 2023 Geng Lin, Chen Gao, Jia-Bin Huang, Changil Kim, Yipeng Wang, Matthias Zwicker, Ayush Saraf

Video matting has broad applications, from adding interesting effects to casually captured movies to assisting video production professionals.

Image Matting Video Matting

Dynamic Mesh-Aware Radiance Fields

1 code implementation ICCV 2023 Yi-Ling Qiao, Alexander Gao, Yiran Xu, Yue Feng, Jia-Bin Huang, Ming C. Lin

Embedding polygonal mesh assets within photorealistic Neural Radiance Fields (NeRF) volumes, such that they can be rendered and their dynamics simulated in a physically consistent manner with the NeRF, is under-explored from the system perspective of integrating NeRF into the traditional graphics pipeline.

Seeing the World through Your Eyes

no code implementations 15 Jun 2023 Hadi AlZayer, Kevin Zhang, Brandon Feng, Christopher Metzler, Jia-Bin Huang

The reflective nature of the human eye is an underappreciated source of information about what the world around us looks like.

Grounded Text-to-Image Synthesis with Attention Refocusing

no code implementations 8 Jun 2023 Quynh Phung, Songwei Ge, Jia-Bin Huang

Driven by the scalable diffusion models trained on large-scale datasets, text-to-image synthesis methods have shown compelling results.

Image Generation

Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models

no code implementations ICCV 2023 Songwei Ge, Seungjun Nah, Guilin Liu, Tyler Poon, Andrew Tao, Bryan Catanzaro, David Jacobs, Jia-Bin Huang, Ming-Yu Liu, Yogesh Balaji

Despite tremendous progress in generating high-quality images using diffusion models, synthesizing a sequence of animated frames that are both photorealistic and temporally coherent is still in its infancy.

Image Generation Text-to-Video Generation +1

Neural-PBIR Reconstruction of Shape, Material, and Illumination

no code implementations ICCV 2023 Cheng Sun, Guangyan Cai, Zhengqin Li, Kai Yan, Cheng Zhang, Carl Marshall, Jia-Bin Huang, Shuang Zhao, Zhao Dong

In the last stage, initialized by the neural predictions, we perform PBIR to refine the initial results and obtain the final high-quality reconstruction of object shape, material, and illumination.

Depth Prediction Image Relighting +5

Expressive Text-to-Image Generation with Rich Text

no code implementations ICCV 2023 Songwei Ge, Taesung Park, Jun-Yan Zhu, Jia-Bin Huang

For each region, we enforce its text attributes by creating region-specific detailed prompts and applying region-specific guidance, and maintain its fidelity against plain-text generation through region-based injections.

Text Generation Text-to-Image Generation

$\text{DC}^2$: Dual-Camera Defocus Control by Learning to Refocus

no code implementations 6 Apr 2023 Hadi AlZayer, Abdullah Abuolaim, Leung Chun Chan, Yang Yang, Ying Chen Lou, Jia-Bin Huang, Abhishek Kar

Smartphone cameras today are increasingly approaching the versatility and quality of professional cameras through a combination of hardware and software advancements.

Deblurring

Consistent View Synthesis with Pose-Guided Diffusion Models

no code implementations CVPR 2023 Hung-Yu Tseng, Qinbo Li, Changil Kim, Suhib Alsisan, Jia-Bin Huang, Johannes Kopf

In this work, we propose a pose-guided diffusion model to generate a consistent long-term video of novel views from a single image.

Novel View Synthesis

DisCO: Portrait Distortion Correction with Perspective-Aware 3D GANs

no code implementations 23 Feb 2023 Zhixiang Wang, Yu-Lun Liu, Jia-Bin Huang, Shin'ichi Satoh, Sizhuo Ma, Gurunandan Krishnan, Jian Wang

Close-up facial images captured at short distances often suffer from perspective distortion, resulting in exaggerated facial features and unnatural/unattractive appearances.

Scheduling

Text-driven Visual Synthesis with Latent Diffusion Prior

no code implementations 16 Feb 2023 Ting-Hsuan Liao, Songwei Ge, Yiran Xu, Yao-Chih Lee, Badour AlBahar, Jia-Bin Huang

There has been tremendous progress in large-scale text-to-image synthesis driven by diffusion models enabling versatile downstream applications such as 3D object synthesis from texts, image editing, and customized generation.

Image Generation Text to 3D

In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing

no code implementations 9 Feb 2023 Yiran Xu, Zhixin Shu, Cameron Smith, Seoung Wug Oh, Jia-Bin Huang

3D-aware GANs offer new capabilities for view synthesis while preserving the editing functionalities of their 2D counterparts.

Robust Dynamic Radiance Fields

1 code implementation CVPR 2023 Yu-Lun Liu, Chen Gao, Andreas Meuleman, Hung-Yu Tseng, Ayush Saraf, Changil Kim, Yung-Yu Chuang, Johannes Kopf, Jia-Bin Huang

Dynamic radiance field reconstruction methods aim to model the time-varying structure and appearance of a dynamic scene.

DC2: Dual-Camera Defocus Control by Learning To Refocus

no code implementations CVPR 2023 Hadi AlZayer, Abdullah Abuolaim, Leung Chun Chan, Yang Yang, Ying Chen Lou, Jia-Bin Huang, Abhishek Kar

Smartphone cameras today are increasingly approaching the versatility and quality of professional cameras through a combination of hardware and software advancements.

Deblurring

AMICO: Amodal Instance Composition

no code implementations 11 Oct 2022 Peiye Zhuang, Jia-Bin Huang, Ayush Saraf, Xuejian Rong, Changil Kim, Denis Demandolx

Image composition aims to blend multiple objects to form a harmonized image.

Object

Temporally Consistent Semantic Video Editing

no code implementations 21 Jun 2022 Yiran Xu, Badour AlBahar, Jia-Bin Huang

Generative adversarial networks (GANs) have demonstrated impressive image generation quality and semantic editing capability of real images, e.g., changing object classes, modifying attributes, or transferring styles.

Image Generation Video Editing

Learning Dynamic View Synthesis With Few RGBD Cameras

no code implementations 22 Apr 2022 Shengze Wang, Youngjoong Kwon, Yuan Shen, Qian Zhang, Andrei State, Jia-Bin Huang, Henry Fuchs

Experiments on the HTI dataset show that our method outperforms the baseline in per-frame image fidelity and spatial-temporal consistency.

Novel View Synthesis

Boosting View Synthesis With Residual Transfer

no code implementations CVPR 2022 Xuejian Rong, Jia-Bin Huang, Ayush Saraf, Changil Kim, Johannes Kopf

We present a simple but effective technique to boost the rendering quality, which can be easily integrated with most view synthesis methods.

Novel View Synthesis

Learning Neural Light Fields With Ray-Space Embedding

no code implementations CVPR 2022 Benjamin Attal, Jia-Bin Huang, Michael Zollhöfer, Johannes Kopf, Changil Kim

Our method supports rendering with a single network evaluation per pixel for small baseline light fields and with only a few evaluations per pixel for light fields with larger baselines.

Learning Neural Light Fields with Ray-Space Embedding Networks

1 code implementation 2 Dec 2021 Benjamin Attal, Jia-Bin Huang, Michael Zollhoefer, Johannes Kopf, Changil Kim

Our method supports rendering with a single network evaluation per pixel for small baseline light field datasets and can also be applied to larger baselines with only a few evaluations per pixel.

Dynamic View Synthesis from Dynamic Monocular Video

1 code implementation ICCV 2021 Chen Gao, Ayush Saraf, Johannes Kopf, Jia-Bin Huang

We present an algorithm for generating novel views at arbitrary viewpoints and any input time step given a monocular video of a dynamic scene.

DropLoss for Long-Tail Instance Segmentation

1 code implementation 13 Apr 2021 Ting-I Hsieh, Esther Robb, Hwann-Tzong Chen, Jia-Bin Huang

Based on this insight, we develop DropLoss -- a novel adaptive loss to compensate for this imbalance without a trade-off between rare and frequent categories.

Instance Segmentation object-detection +3

Learning Representational Invariances for Data-Efficient Action Recognition

1 code implementation 30 Mar 2021 Yuliang Zou, Jinwoo Choi, Qitong Wang, Jia-Bin Huang

Data augmentation is a ubiquitous technique for improving image classification when labeled data is scarce.

Action Recognition Data Augmentation +1

Hybrid Neural Fusion for Full-frame Video Stabilization

2 code implementations ICCV 2021 Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang

Existing video stabilization methods often generate visible distortion or require aggressive cropping of frame boundaries, resulting in a smaller field of view.

Video Stabilization

Portrait Neural Radiance Fields from a Single Image

no code implementations 10 Dec 2020 Chen Gao, YiChang Shih, Wei-Sheng Lai, Chia-Kai Liang, Jia-Bin Huang

We present a method for estimating Neural Radiance Fields (NeRF) from a single headshot portrait.

Meta-Learning

Robust Consistent Video Depth Estimation

1 code implementation CVPR 2021 Johannes Kopf, Xuejian Rong, Jia-Bin Huang

We present an algorithm for estimating consistent dense depth maps and camera poses from a monocular video.

Depth Estimation

Space-time Neural Irradiance Fields for Free-Viewpoint Video

no code implementations CVPR 2021 Wenqi Xian, Jia-Bin Huang, Johannes Kopf, Changil Kim

We present a method that learns a spatiotemporal neural irradiance field for dynamic scenes from a single video.

Depth Estimation

Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors

1 code implementation 2 Nov 2020 Qi Mao, Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang, Siwei Ma, Ming-Hsuan Yang

Generating a smooth sequence of intermediate results bridges the gap between two different domains, facilitating the morphing effect across domains.

Attribute Image-to-Image Translation +1

Few-Shot Adaptation of Generative Adversarial Networks

1 code implementation 22 Oct 2020 Esther Robb, Wen-Sheng Chu, Abhishek Kumar, Jia-Bin Huang

We validate our method in a challenging few-shot setting of 5-100 images in the target domain.

Image Generation

NAS-DIP: Learning Deep Image Prior with Neural Architecture Search

1 code implementation ECCV 2020 Yun-Chun Chen, Chen Gao, Esther Robb, Jia-Bin Huang

Recent work has shown that the structure of deep convolutional neural networks can be used as a structured image prior for solving various inverse image restoration tasks.

Image Restoration Image-to-Image Translation +2

Semantic View Synthesis

1 code implementation ECCV 2020 Hsin-Ping Huang, Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang

We tackle a new problem of semantic view synthesis -- generating free-viewpoint rendering of a synthesized scene using a semantic label map as input.

Image Generation

Learning to See Through Obstructions with Layered Decomposition

1 code implementation 11 Aug 2020 Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang

We present a learning-based approach for removing unwanted obstructions, such as window reflections, fence occlusions, or adherent raindrops, from a short sequence of images captured by a moving camera.

Optical Flow Estimation

FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning

2 code implementations ECCV 2020 Chia-Wen Kuo, Chih-Yao Ma, Jia-Bin Huang, Zsolt Kira

Recent state-of-the-art semi-supervised learning (SSL) methods use a combination of image-based transformations and consistency regularization as core components.

Clustering Data Augmentation +1

Instance-aware Image Colorization

2 code implementations CVPR 2020 Jheng-Wei Su, Hung-Kuo Chu, Jia-Bin Huang

Previous methods leverage the deep neural network to map input grayscale images to plausible color outputs directly.

Ranked #2 on Point-interactive Image Colorization on CUB-200-2011 (using extra training data)

Image Colorization Object +1

Consistent Video Depth Estimation

3 code implementations 30 Apr 2020 Xuan Luo, Jia-Bin Huang, Richard Szeliski, Kevin Matzen, Johannes Kopf

We present an algorithm for reconstructing dense, geometrically consistent depth for all pixels in a monocular video.

Depth Estimation Monocular Reconstruction

3D Photography using Context-aware Layered Depth Inpainting

1 code implementation CVPR 2020 Meng-Li Shih, Shih-Yang Su, Johannes Kopf, Jia-Bin Huang

We propose a method for converting a single RGB-D input image into a 3D photo - a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view.

Novel View Synthesis

Learning to See Through Obstructions

1 code implementation CVPR 2020 Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang

We present a learning-based approach for removing unwanted obstructions, such as window reflections, fence occlusions or raindrops, from a short sequence of images captured by a moving camera.

Optical Flow Estimation Reflection Removal

Deep Semantic Matching with Foreground Detection and Cycle-Consistency

no code implementations 31 Mar 2020 Yun-Chun Chen, Po-Hsiang Huang, Li-Yu Yu, Jia-Bin Huang, Ming-Hsuan Yang, Yen-Yu Lin

Establishing dense semantic correspondences between object instances remains a challenging problem due to background clutter, significant scale and pose differences, and large intra-class variations.

CrDoCo: Pixel-level Domain Transfer with Cross-Domain Consistency

no code implementations CVPR 2019 Yun-Chun Chen, Yen-Yu Lin, Ming-Hsuan Yang, Jia-Bin Huang

Unsupervised domain adaptation algorithms aim to transfer the knowledge learned from one domain to another (e.g., synthetic to real images).

Data Augmentation Image-to-Image Translation +3

Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition

1 code implementation NeurIPS 2019 Jinwoo Choi, Chen Gao, Joseph C. E. Messou, Jia-Bin Huang

We validate the effectiveness of our method by transferring our pre-trained model to three different tasks, including action classification, temporal localization, and spatio-temporal action detection.

Action Classification Action Detection +4

Guided Image-to-Image Translation with Bi-Directional Feature Transformation

1 code implementation ICCV 2019 Badour AlBahar, Jia-Bin Huang

We address the problem of guided image-to-image translation where we translate an input image into another while respecting the constraints provided by an external, user-provided guidance image.

Image-to-Image Translation Pose Transfer +1

Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

1 code implementation 13 Jun 2019 Yun-Chun Chen, Yen-Yu Lin, Ming-Hsuan Yang, Jia-Bin Huang

In contrast to existing algorithms that tackle the tasks of semantic matching and object co-segmentation in isolation, our method exploits the complementary nature of the two tasks.

Object Segmentation +1

Manifold Graph with Learned Prototypes for Semi-Supervised Image Classification

no code implementations 12 Jun 2019 Chia-Wen Kuo, Chih-Yao Ma, Jia-Bin Huang, Zsolt Kira

We then show that when combined with these regularizers, the proposed method facilitates the propagation of information from generated prototypes to image data to further improve results.

Classification General Classification +1

DRIT++: Diverse Image-to-Image Translation via Disentangled Representations

4 code implementations 2 May 2019 Hsin-Ying Lee, Hung-Yu Tseng, Qi Mao, Jia-Bin Huang, Yu-Ding Lu, Maneesh Singh, Ming-Hsuan Yang

In this work, we present an approach based on disentangled representation for generating diverse outputs without paired training images.

Attribute Image-to-Image Translation +2

Deep Paper Gestalt

2 code implementations 20 Dec 2018 Jia-Bin Huang

Recent years have witnessed a significant increase in the number of paper submissions to computer vision conferences.

DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency

1 code implementation ECCV 2018 Yuliang Zou, Zelun Luo, Jia-Bin Huang

We present an unsupervised learning framework for simultaneously training single-view depth prediction and optical flow estimation models using unlabeled video sequences.

Depth And Camera Motion Depth Prediction +1

VideoMatch: Matching based Video Object Segmentation

no code implementations ECCV 2018 Yuan-Ting Hu, Jia-Bin Huang, Alexander G. Schwing

Due to the formulation as a prediction task, most of these methods require fine-tuning during test time, such that the deep nets memorize the appearance of the objects of interest in the given video.

Memorization Object +4

iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection

4 code implementations 30 Aug 2018 Chen Gao, Yuliang Zou, Jia-Bin Huang

Our core idea is that the appearance of a person or an object instance contains informative cues on which relevant parts of an image to attend to for facilitating interaction prediction.

Human-Object Interaction Detection Object

Diverse Image-to-Image Translation via Disentangled Representations

7 code implementations ECCV 2018 Hsin-Ying Lee, Hung-Yu Tseng, Jia-Bin Huang, Maneesh Kumar Singh, Ming-Hsuan Yang

Our model takes the encoded content features extracted from a given input and the attribute vectors sampled from the attribute space to produce diverse outputs at test time.

Attribute Domain Adaptation +4

Learning Blind Video Temporal Consistency

1 code implementation ECCV 2018 Wei-Sheng Lai, Jia-Bin Huang, Oliver Wang, Eli Shechtman, Ersin Yumer, Ming-Hsuan Yang

Our method takes the original unprocessed and per-frame processed videos as inputs to produce a temporally consistent video.

Colorization Image-to-Image Translation +4

DeepMVS: Learning Multi-view Stereopsis

1 code implementation CVPR 2018 Po-Han Huang, Kevin Matzen, Johannes Kopf, Narendra Ahuja, Jia-Bin Huang

We present DeepMVS, a deep convolutional neural network (ConvNet) for multi-view stereo reconstruction.

Progressive Representation Adaptation for Weakly Supervised Object Localization

1 code implementation 12 Oct 2017 Dong Li, Jia-Bin Huang, Ya-Li Li, Shengjin Wang, Ming-Hsuan Yang

In classification adaptation, we transfer a pre-trained network to a multi-label classification task for recognizing the presence of a certain object in an image.

Classification General Classification +4

Joint Image Filtering with Deep Convolutional Networks

no code implementations 11 Oct 2017 Yijun Li, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang

In contrast to existing methods that consider only the guidance image, the proposed algorithm can selectively transfer salient structures that are consistent with both guidance and target images.

Tracking Persons-of-Interest via Unsupervised Representation Adaptation

2 code implementations 5 Oct 2017 Shun Zhang, Jia-Bin Huang, Jongwoo Lim, Yihong Gong, Jinjun Wang, Narendra Ahuja, Ming-Hsuan Yang

Multi-face tracking in unconstrained videos is a challenging problem as faces of one person often appear drastically different in multiple shots due to significant variations in scale, pose, expression, illumination, and make-up.

Clustering

Fast and Accurate Image Super-Resolution with Deep Laplacian Pyramid Networks

7 code implementations 4 Oct 2017 Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang

However, existing methods often require a large number of network parameters and entail heavy computational loads at runtime for generating high-accuracy super-resolution results.

Image Reconstruction Image Super-Resolution

Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight

2 code implementations 2 Oct 2017 Yen-Chen Lin, Ming-Yu Liu, Min Sun, Jia-Bin Huang

Our core idea is that adversarial examples targeting a neural network-based policy are not effective against the frame prediction model.

Autonomous Vehicles Decision Making +2

Robust Visual Tracking via Hierarchical Convolutional Features

1 code implementation 12 Jul 2017 Chao Ma, Jia-Bin Huang, Xiaokang Yang, Ming-Hsuan Yang

Specifically, we learn adaptive correlation filters on the outputs from each convolutional layer to encode the target appearance.

Object Recognition Visual Tracking

Adaptive Correlation Filters with Long-Term and Short-Term Memory for Object Tracking

1 code implementation 7 Jul 2017 Chao Ma, Jia-Bin Huang, Xiaokang Yang, Ming-Hsuan Yang

Second, we learn a correlation filter over a feature pyramid centered at the estimated target position for predicting scale changes.

Object Tracking Position

Removing Rain From Single Images via a Deep Detail Network

no code implementations CVPR 2017 Xueyang Fu, Jia-Bin Huang, Delu Zeng, Yue Huang, Xinghao Ding, John Paisley

We propose a new deep network architecture for removing rain streaks from individual images based on the deep convolutional neural network (CNN).

Denoising Rain Removal

Learning Structured Semantic Embeddings for Visual Recognition

no code implementations 5 Jun 2017 Dong Li, Hsin-Ying Lee, Jia-Bin Huang, Shengjin Wang, Ming-Hsuan Yang

First, we exploit the discriminative constraints to capture the intra- and inter-class relationships of image embeddings.

General Classification Multi-Label Classification +2

A Comparative Study for Single Image Blind Deblurring

no code implementations CVPR 2016 Wei-Sheng Lai, Jia-Bin Huang, Zhe Hu, Narendra Ahuja, Ming-Hsuan Yang

Using these datasets, we conduct a large-scale user study to quantify the performance of several representative state-of-the-art blind deblurring algorithms.

Single-Image Blind Deblurring

Detecting Migrating Birds at Night

no code implementations CVPR 2016 Jia-Bin Huang, Rich Caruana, Andrew Farnsworth, Steve Kelling, Narendra Ahuja

In this paper, we present a vision-based system for detecting migrating birds in flight at night.

Weakly Supervised Object Localization With Progressive Domain Adaptation

no code implementations CVPR 2016 Dong Li, Jia-Bin Huang, Ya-Li Li, Shengjin Wang, Ming-Hsuan Yang

In this paper, we address this problem by progressive domain adaptation with two main steps: classification adaptation and detection adaptation.

Classification Domain Adaptation +5

Hierarchical Convolutional Features for Visual Tracking

no code implementations ICCV 2015 Chao Ma, Jia-Bin Huang, Xiaokang Yang, Ming-Hsuan Yang

The outputs of the last convolutional layers encode the semantic information of targets and such representations are robust to significant appearance variations.

Object Recognition Visual Object Tracking +1

Single Image Super-Resolution From Transformed Self-Exemplars

no code implementations CVPR 2015 Jia-Bin Huang, Abhishek Singh, Narendra Ahuja

However, the internal dictionary obtained from the given image may not always be sufficiently expressive to cover the textural appearance variations in the scene.

Image Super-Resolution
