Search Results for author: Jenq-Neng Hwang

Found 73 papers, 17 papers with code

MovieChat+: Question-aware Sparse Memory for Long Video Question Answering

1 code implementation26 Apr 2024 Enxin Song, Wenhao Chai, Tian Ye, Jenq-Neng Hwang, Xi Li, Gaoang Wang

Recently, integrating video foundation models and large language models to build a video understanding system can overcome the limitations of specific pre-defined vision tasks.

2k Question Answering +2

Single-image driven 3d viewpoint training data augmentation for effective wine label recognition

no code implementations12 Apr 2024 Yueh-Cheng Huang, Hsin-Yi Chen, Cheng-Jui Hung, Jen-Hui Chuang, Jenq-Neng Hwang

Confronting the critical challenge of insufficient training data in the field of complex image recognition, this paper introduces a novel 3D viewpoint augmentation technique specifically tailored for wine label recognition.

Data Augmentation Generative Adversarial Network +1

VersaT2I: Improving Text-to-Image Models with Versatile Reward

no code implementations27 Mar 2024 Jianshu Guo, Wenhao Chai, Jie Deng, Hsiang-Wei Huang, Tian Ye, Yichen Xu, Jiawei Zhang, Jenq-Neng Hwang, Gaoang Wang

Recent text-to-image (T2I) models have benefited from large-scale and high-quality data, demonstrating impressive performance.

Exploring Learning-based Motion Models in Multi-Object Tracking

no code implementations16 Mar 2024 Hsiang-Wei Huang, Cheng-Yen Yang, Wenhao Chai, Zhongyu Jiang, Jenq-Neng Hwang

In the field of multi-object tracking (MOT), traditional methods often rely on the Kalman Filter for motion prediction, leveraging its strengths in linear motion scenarios.

motion prediction Multi-Object Tracking

Contrastive Pre-Training with Multi-View Fusion for No-Reference Point Cloud Quality Assessment

no code implementations15 Mar 2024 Ziyu Shan, Yujie Zhang, Qi Yang, Haichen Yang, Yiling Xu, Jenq-Neng Hwang, Xiaozhong Xu, Shan Liu

Furthermore, in the model fine-tuning stage, we propose a semantic-guided multi-view fusion module to effectively integrate the features of projected images from multiple perspectives.

Philosophy Point Cloud Quality Assessment

Tree Counting by Bridging 3D Point Clouds with Imagery

no code implementations4 Mar 2024 Lei LI, Tianfang Zhang, Zhongyu Jiang, Cheng-Yen Yang, Jenq-Neng Hwang, Stefan Oehmcke, Dimitri Pierre Johannes Gominski, Fabian Gieseke, Christian Igel

We leverage the fusion of three-dimensional LiDAR measurements and 2D imagery to facilitate the accurate counting of trees.


CityGen: Infinite and Controllable 3D City Layout Generation

no code implementations3 Dec 2023 Jie Deng, Wenhao Chai, Jianshu Guo, Qixuan Huang, Wenhao Hu, Jenq-Neng Hwang, Gaoang Wang

In this paper, we propose CityGen, a novel end-to-end framework for infinite, diverse and controllable 3D city layout generation. First, we propose an outpainting pipeline to extend the local layout to an infinite city layout.

See and Think: Embodied Agent in Virtual Environment

no code implementations26 Nov 2023 Zhonghan Zhao, Wenhao Chai, Xuan Wang, Li Boyi, Shengyu Hao, Shidong Cao, Tian Ye, Jenq-Neng Hwang, Gaoang Wang

Vision perception involves the interpretation of visual information in the environment, which is then integrated into the LLMs component with agent state and task instruction.

Question Answering Retrieval

UniHPE: Towards Unified Human Pose Estimation via Contrastive Learning

no code implementations24 Nov 2023 Zhongyu Jiang, Wenhao Chai, Lei LI, Zhuoran Zhou, Cheng-Yen Yang, Jenq-Neng Hwang

In this paper, we propose UniHPE, a unified Human Pose Estimation pipeline, which aligns features from all three modalities, i. e., 2D human pose estimation, lifting-based and image-based 3D human pose estimation, in the same pipeline.

2D Human Pose Estimation 3D Human Pose Estimation +3

Vision meets mmWave Radar: 3D Object Perception Benchmark for Autonomous Driving

no code implementations17 Nov 2023 Yizhou Wang, Jen-Hao Cheng, Jui-Te Huang, Sheng-Yao Kuan, Qiqian Fu, Chiming Ni, Shengyu Hao, Gaoang Wang, Guanbin Xing, Hui Liu, Jenq-Neng Hwang

This kind of radar format can enable machine learning models to generate more reliable object perception results after interacting and fusing the information or features between the camera and radar.

Autonomous Driving Sensor Fusion

Sea You Later: Metadata-Guided Long-Term Re-Identification for UAV-Based Multi-Object Tracking

no code implementations6 Nov 2023 Cheng-Yen Yang, Hsiang-Wei Huang, Zhongyu Jiang, Heng-Cheng Kuo, Jie Mei, Chung-I Huang, Jenq-Neng Hwang

Re-identification (ReID) in multi-object tracking (MOT) for UAVs in maritime computer vision has been challenging for several reasons.

Multi-Object Tracking

A Survey of Deep Learning in Sports Applications: Perception, Comprehension, and Decision

no code implementations7 Jul 2023 Zhonghan Zhao, Wenhao Chai, Shengyu Hao, Wenhao Hu, Guanhong Wang, Shidong Cao, Mingli Song, Jenq-Neng Hwang, Gaoang Wang

Deep learning has the potential to revolutionize sports performance, with applications ranging from perception and comprehension to decision.

Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation

1 code implementation7 Jul 2023 Zhongyu Jiang, Zhuoran Zhou, Lei LI, Wenhao Chai, Cheng-Yen Yang, Jenq-Neng Hwang

Learning-based methods have dominated the 3D human pose estimation (HPE) tasks with significantly better performance in most benchmarks than traditional optimization-based methods.

Ranked #10 on 3D Human Pose Estimation on 3DPW (PA-MPJPE metric)

3D Human Pose Estimation Image to 3D

Learning Dynamic Point Cloud Compression via Hierarchical Inter-frame Block Matching

no code implementations9 May 2023 Shuting Xia, Tingyu Fan, Yiling Xu, Jenq-Neng Hwang, Zhu Li

3D dynamic point cloud (DPC) compression relies on mining its temporal context, which faces significant challenges due to DPC's sparsity and non-uniform structure.

Feature Correlation Motion Compensation +2

Enhancing Multi-Camera People Tracking with Anchor-Guided Clustering and Spatio-Temporal Consistency ID Re-Assignment

2 code implementations19 Apr 2023 Hsiang-Wei Huang, Cheng-Yen Yang, Zhongyu Jiang, Pyong-Kun Kim, Kyoungoh Lee, Kwangju Kim, Samartha Ramkumar, Chaitanya Mullapudi, In-Su Jang, Chung-I Huang, Jenq-Neng Hwang

Multi-camera multiple people tracking has become an increasingly important area of research due to the growing demand for accurate and efficient indoor people tracking systems, particularly in settings such as retail, healthcare centers, and transit hubs.

Multiple People Tracking

Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation

1 code implementation ICCV 2023 Wenhao Chai, Zhongyu Jiang, Jenq-Neng Hwang, Gaoang Wang

We observe that the degradation is caused by two factors: 1) the large distribution gap over global positions of poses between the source and target datasets due to variant camera parameters and settings, and 2) the deficient diversity of local structures of poses in training.

3D Human Pose Estimation 3D Human Pose Estimation in Limited Data +3

DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes

2 code implementations15 Feb 2023 Shenghao Hao, Peiyuan Liu, Yibing Zhan, Kaixun Jin, Zuozhu Liu, Mingli Song, Jenq-Neng Hwang, Gaoang Wang

Although cross-view multi-object tracking has received increased attention in recent years, existing datasets still have several issues, including 1) missing real-world scenarios, 2) lacking diverse scenes, 3) owning a limited number of tracks, 4) comprising only static cameras, and 5) lacking standard benchmarks, which hinder the investigation and comparison of cross-view tracking methods.

Multi-Object Tracking Object +2

Multi-target multi-camera vehicle tracking using transformer-based camera link model and spatial-temporal information

no code implementations18 Jan 2023 Hsiang-Wei Huang, Cheng-Yen Yang, Jenq-Neng Hwang

Multi-target multi-camera tracking (MTMCT) of vehicles, i. e. tracking vehicles across multiple cameras, is a crucial application for the development of smart city and intelligent traffic system.

CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-wild 2D Annotations

no code implementations8 Jan 2023 Cheng-Yen Yang, Jiajia Luo, Lu Xia, Yuyin Sun, Nan Qiao, Ke Zhang, Zhongyu Jiang, Jenq-Neng Hwang

By adding a camera parameter branch, any in-the-wild 2D annotations can be fed into our pipeline to boost the training diversity and the 3D poses can be implicitly learned by reprojecting back to 2D.

Data Augmentation Monocular 3D Human Pose Estimation

1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results

no code implementations24 Nov 2022 Benjamin Kiefer, Matej Kristan, Janez Perš, Lojze Žust, Fabio Poiesi, Fabio Augusto de Alcantara Andrade, Alexandre Bernardino, Matthew Dawkins, Jenni Raitoharju, Yitong Quan, Adem Atmaca, Timon Höfer, Qiming Zhang, Yufei Xu, Jing Zhang, DaCheng Tao, Lars Sommer, Raphael Spraul, Hangyue Zhao, Hongpu Zhang, Yanyun Zhao, Jan Lukas Augustin, Eui-ik Jeon, Impyeong Lee, Luca Zedda, Andrea Loddo, Cecilia Di Ruberto, Sagar Verma, Siddharth Gupta, Shishir Muralidhara, Niharika Hegde, Daitao Xing, Nikolaos Evangeliou, Anthony Tzes, Vojtěch Bartl, Jakub Špaňhel, Adam Herout, Neelanjan Bhowmik, Toby P. Breckon, Shivanand Kundargi, Tejas Anvekar, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudengudi, Arpita Vats, Yang song, Delong Liu, Yonglin Li, Shuman Li, Chenhao Tan, Long Lan, Vladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi, Hsiang-Wei Huang, Cheng-Yen Yang, Jenq-Neng Hwang, Pyong-Kun Kim, Kwangju Kim, Kyoungoh Lee, Shuai Jiang, Haiwen Li, Zheng Ziqiang, Tuan-Anh Vu, Hai Nguyen-Truong, Sai-Kit Yeung, Zhuang Jia, Sophia Yang, Chih-Chung Hsu, Xiu-Yu Hou, Yu-An Jhang, Simon Yang, Mau-Tsuen Yang

The 1$^{\text{st}}$ Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicle (USV), and organized several subchallenges in this domain: (i) UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking, (iii) USV-based Maritime Obstacle Segmentation and (iv) USV-based Maritime Obstacle Detection.

Object object-detection +2

HuPR: A Benchmark for Human Pose Estimation Using Millimeter Wave Radar

1 code implementation22 Oct 2022 Shih-Po Lee, Niraj Prakash Kini, Wen-Hsiao Peng, Ching-Wen Ma, Jenq-Neng Hwang

In addition to the benchmark, we propose a cross-modality training framework that leverages the ground-truth 2D keypoints representing human body joints for training, which are systematically generated from the pre-trained 2D pose estimation network based on a monocular camera input image, avoiding laborious manual label annotation efforts.

2D Pose Estimation Pose Estimation

Image-Text Retrieval with Binary and Continuous Label Supervision

no code implementations20 Oct 2022 Zheng Li, Caili Guo, Zerun Feng, Jenq-Neng Hwang, Ying Jin, Yufeng Zhang

Such a binary indicator covers only a limited subset of image-text semantic relations, which is insufficient to represent relevance degrees between images and texts described by continuous labels such as image captions.

Image Captioning Retrieval +2

Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval

no code implementations28 Sep 2022 Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Jenq-Neng Hwang, Zhongtian Du

More specifically, Triplet loss with Hard Negative mining (Triplet-HN), which is widely used in existing retrieval models to improve the discriminative ability, is easy to fall into local minima in training.

Contrastive Learning Retrieval +2

GaitTAKE: Gait Recognition by Temporal Attention and Keypoint-guided Embedding

no code implementations7 Jul 2022 Hung-Min Hsu, Yizhou Wang, Cheng-Yen Yang, Jenq-Neng Hwang, Hoang Le Uyen Thuc, Kwang-Ju Kim

Gait recognition, which refers to the recognition or identification of a person based on their body shape and walking styles, derived from video data captured from a distance, is widely used in crime prevention, forensic identification, and social security.

Gait Recognition

GLIPv2: Unifying Localization and Vision-Language Understanding

1 code implementation12 Jun 2022 Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng Gao

We present GLIPv2, a grounded VL understanding model, that serves both localization tasks (e. g., object detection, instance segmentation) and Vision-Language (VL) understanding tasks (e. g., VQA, image captioning).

 Ranked #1 on Phrase Grounding on Flickr30k Entities Test (using extra training data)

Contrastive Learning Image Captioning +7

Recent Advances in Embedding Methods for Multi-Object Tracking: A Survey

no code implementations22 May 2022 Gaoang Wang, Mingli Song, Jenq-Neng Hwang

Multi-object tracking (MOT) aims to associate target objects across video frames in order to obtain entire moving trajectories.

Image Classification Multi-Object Tracking +3

Unsupervised Domain Adaptation Learning for Hierarchical Infant Pose Recognition with Synthetic Data

no code implementations4 May 2022 Cheng-Yen Yang, Zhongyu Jiang, Shih-Yu Gu, Jenq-Neng Hwang, Jang-Hee Yoo

Due to limited public infant-related datasets, many works use the SMIL-based method to generate synthetic infant images for training.

Unsupervised Domain Adaptation

TR-MOT: Multi-Object Tracking by Reference

no code implementations30 Mar 2022 Mingfei Chen, Yue Liao, Si Liu, Fei Wang, Jenq-Neng Hwang

RS takes previous detected results as references to aggregate the corresponding features from the combined features of the adjacent frames and makes a one-to-one track state prediction for each reference in parallel.

Multi-Object Tracking Object

The Overlooked Classifier in Human-Object Interaction Recognition

no code implementations10 Mar 2022 Ying Jin, Yinpeng Chen, Lijuan Wang, JianFeng Wang, Pei Yu, Lin Liang, Jenq-Neng Hwang, Zicheng Liu

Human-Object Interaction (HOI) recognition is challenging due to two factors: (1) significant imbalance across classes and (2) requiring multiple labels per image.

Classification Human-Object Interaction Detection +4

HCIL: Hierarchical Class Incremental Learning for Longline Fishing Visual Monitoring

no code implementations25 Feb 2022 Jie Mei, Suzanne Romain, Craig Rose, Kelsey Magrane, Jenq-Neng Hwang

The goal of electronic monitoring of longline fishing is to visually monitor the fish catching activities on fishing vessels based on cameras, either for regulatory compliance or catch counting.

Classification Class Incremental Learning +1

Unsupervised Severely Deformed Mesh Reconstruction (DMR) from a Single-View Image

no code implementations23 Jan 2022 Jie Mei, Jingxi Yu, Suzanne Romain, Craig Rose, Kelsey Magrane, Graeme LeeSon, Jenq-Neng Hwang

Much progress has been made in the supervised learning of 3D reconstruction of rigid objects from multi-view images or a video.

3D Reconstruction

Grounded Language-Image Pre-training

2 code implementations CVPR 2022 Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao

The unification brings two benefits: 1) it allows GLIP to learn from both detection and grounding data to improve both tasks and bootstrap a good grounding model; 2) GLIP can leverage massive image-text pairs by generating grounding boxes in a self-training fashion, making the learned representation semantic-rich.

Described Object Detection Few-Shot Object Detection +1

ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot

no code implementations ICCV 2021 Jiarui Cai, Yizhou Wang, Jenq-Neng Hwang

One-stage long-tailed recognition methods improve the overall performance in a "seesaw" manner, i. e., either sacrifice the head's accuracy for better tail classification or elevate the head's accuracy even higher but ignore the tail.

Long-tail Learning

Deep Open Snake Tracker for Vessel Tracing

no code implementations19 Jul 2021 Li Chen, Wenjin Liu, Niranjan Balu, Mahmud Mossa-Basha, Thomas S. Hatsukami, Jenq-Neng Hwang, Chun Yuan

Vessel tracing by modeling vascular structures in 3D medical images with centerlines and radii can provide useful information for vascular health.

Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking

no code implementations6 May 2021 Gaoang Wang, Yizhou Wang, Renshu Gu, Weijie Hu, Jenq-Neng Hwang

To address such common challenges in most of the existing trackers, in this paper, a tracklet booster algorithm is proposed, which can be built upon any other tracker.

Multi-Object Tracking

Multi-Target Multi-Camera Tracking of Vehicles using Metadata-Aided Re-ID and Trajectory-Based Camera Link Model

no code implementations3 May 2021 Hung-Min Hsu, Jiarui Cai, Yizhou Wang, Jenq-Neng Hwang, Kwang-Ju Kim

In this paper, we propose a novel framework for multi-target multi-camera tracking (MTMCT) of vehicles based on metadata-aided re-identification (MA-ReID) and the trajectory-based camera link model (TCLM).


Absolute 3D Pose Estimation and Length Measurement of Severely Deformed Fish from Monocular Videos in Longline Fishing

no code implementations9 Feb 2021 Jie Mei, Jenq-Neng Hwang, Suzanne Romain, Craig Rose, Braden Moore, Kelsey Magrane

Finally, with a closed-form solution, the relative 3D fish pose can help locate absolute 3D keypoints, resulting in the frame-based absolute fish length measurement, which is further refined based on the statistical temporal inference for the optimal fish length measurement from the video clip.

3D Pose Estimation

Video-based Hierarchical Species Classification for Longline Fishing Monitoring

no code implementations6 Feb 2021 Jie Mei, Jenq-Neng Hwang, Suzanne Romain, Craig Rose, Braden Moore, Kelsey Magrane

However, with a known non-overlapping hierarchical data structure provided by fisheries scientists, our method enforces the hierarchical data structure and introduces an efficient training and inference strategy for video-based fisheries data.

Classification General Classification

Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution

no code implementations31 Oct 2020 Renshu Gu, Gaoang Wang, Jenq-Neng Hwang

Videos that contain multiple potentially occluded people captured from freely moving monocular cameras are very common in real-world scenarios, while 3D HPE for such scenarios is quite challenging, partially because there is a lack of such data with accurate 3D ground truth labels in existing datasets.

3D Human Pose Estimation 3D Pose Estimation

Traffic-Aware Multi-Camera Tracking of Vehicles Based on ReID and Camera Link Model

no code implementations22 Aug 2020 Hung-Min Hsu, Yizhou Wang, Jenq-Neng Hwang

In this paper, we propose an effective and reliable MTMCT framework for vehicles, which consists of a traffic-aware single camera tracking (TSCT) algorithm, a trajectory-based camera link model (CLM) for vehicle re-identification (ReID), and a hierarchical clustering algorithm to obtain the cross camera vehicle trajectories.

Clustering Vehicle Re-Identification

Automated Intracranial Artery Labeling using a Graph Neural Network and Hierarchical Refinement

1 code implementation11 Jul 2020 Li Chen, Thomas Hatsukami, Jenq-Neng Hwang, Chun Yuan

Automatically labeling intracranial arteries (ICA) with their anatomical names is beneficial for feature extraction and detailed analysis of intracranial vascular structures.

IA-MOT: Instance-Aware Multi-Object Tracking with Motion Consistency

no code implementations24 Jun 2020 Jiarui Cai, Yizhou Wang, Haotian Zhang, Hung-Min Hsu, Chengqian Ma, Jenq-Neng Hwang

Meanwhile, the spatial attention, which focuses on the foreground within the bounding boxes, is generated from the given instance masks and applied to the extracted embedding features.

Multi-Object Tracking Multiple Object Tracking +1

OVC-Net: Object-Oriented Video Captioning with Temporal Graph and Detail Enhancement

no code implementations8 Mar 2020 Fangyi Zhu, Jenq-Neng Hwang, Zhanyu Ma, Guang Chen, Jun Guo

Thereafter, we construct a new dataset, providing consistent object-sentence pairs, to facilitate effective cross-modal learning.

Object Sentence +1

RODNet: Radar Object Detection Using Cross-Modal Supervision

1 code implementation3 Mar 2020 Yizhou Wang, Zhongyu Jiang, Xiangyu Gao, Jenq-Neng Hwang, Guanbin Xing, Hui Liu

Radar is usually more robust than the camera in severe driving scenarios, e. g., weak/strong lighting and bad weather.

Autonomous Driving Object +3

Eye in the Sky: Drone-Based Object Tracking and 3D Localization

no code implementations18 Oct 2019 Haotian Zhang, Gaoang Wang, Zhichao Lei, Jenq-Neng Hwang

Drones, or general UAVs, equipped with a single camera have been widely deployed to a broad range of applications, such as aerial photography, fast goods delivery and most importantly, surveillance.

drone-based object tracking Multi-Object Tracking +3

MOANA: An Online Learned Adaptive Appearance Model for Robust Multiple Object Tracking in 3D

no code implementations9 Jan 2019 Zheng Tang, Jenq-Neng Hwang

Multiple object tracking has been a challenging field, mainly due to noisy detection sets and identity switch caused by occlusion and similar appearance among nearby targets.

Multiple Object Tracking

Exploit the Connectivity: Multi-Object Tracking with TrackletNet

1 code implementation18 Nov 2018 Gaoang Wang, Yizhou Wang, Haotian Zhang, Renshu Gu, Jenq-Neng Hwang

Multi-object tracking (MOT) is an important and practical task related to both surveillance systems and moving camera applications, such as autonomous driving and robotic vision.

Autonomous Driving Multi-Object Tracking +1

Y-net: 3D intracranial artery segmentation using a convolutional autoencoder

no code implementations19 Dec 2017 Li Chen, Yanjun Xie, Jie Sun, Niranjan Balu, Mahmud Mossa-Basha, Kristi Pimentel, Thomas S. Hatsukami, Jenq-Neng Hwang, Chun Yuan

Automated segmentation of intracranial arteries on magnetic resonance angiography (MRA) allows for quantification of cerebrovascular features, which provides tools for understanding aging and pathophysiological adaptations of the cerebrovascular system.

Binary Classification General Classification +1

Virtual Blood Vessels in Complex Background using Stereo X-ray Images

no code implementations22 Sep 2017 Qiuyu Chen, Ryoma Bise, Lin Gu, Yinqiang Zheng, Imari Sato, Jenq-Neng Hwang, Nobuaki Imanishi, Sadakazu Aiso

We propose a fully automatic system to reconstruct and visualize 3D blood vessels in Augmented Reality (AR) system from stereo X-ray images with bones and body fat.

Stereo Matching Stereo Matching Hand

Multiple-Kernel Based Vehicle Tracking Using 3D Deformable Model and Camera Self-Calibration

no code implementations22 Aug 2017 Zheng Tang, Gaoang Wang, Tao Liu, Young-Gun Lee, Adwin Jahn, Xu Liu, Xiaodong He, Jenq-Neng Hwang

In this challenge, we propose a model-based vehicle localization method, which builds a kernel at each patch of the 3D deformable vehicle model and associates them with constraints in 3D space.

Ensemble Learning object-detection +1

DesnowNet: Context-Aware Deep Network for Snow Removal

no code implementations15 Aug 2017 Yun-Fu Liu, Da-Wei Jaw, Shih-Chia Huang, Jenq-Neng Hwang

Existing learning-based atmospheric particle-removal approaches such as those used for rainy and hazy images are designed with strong assumptions regarding spatial frequency, trajectory, and translucency.

Semantic Segmentation Snow Removal

Underwater Fish Tracking for Moving Cameras based on Deformable Multiple Kernels

no code implementations5 Mar 2016 Meng-Che Chuang, Jenq-Neng Hwang, Jian-Hui Ye, Shih-Chia Huang, Kresimir Williams

In this paper, a novel tracking algorithm based on the deformable multiple kernels (DMK) is proposed to address these challenges.

A Feature Learning and Object Recognition Framework for Underwater Fish Images

no code implementations5 Mar 2016 Meng-Che Chuang, Jenq-Neng Hwang, Kresimir Williams

Toward this end, we propose an underwater fish recognition framework that consists of a fully unsupervised feature learning technique and an error-resilient classifier.

Clustering Object Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.