1 code implementation • 16 Nov 2024 • Ryan Benkert, Mohit Prabhushankar, Ghassan AlRegib
In this paper, we discuss improving the performance of active learning algorithms both in terms of prediction accuracy and negative flips.
no code implementations • 30 Oct 2024 • Kiran Kokilepersaud, Seulgi Kim, Mohit Prabhushankar, Ghassan AlRegib
Ideally, SSL algorithms would take advantage of this hierarchical emergence to have an additional regularization term to account for this local dimensional collapse effect.
1 code implementation • 29 Oct 2024 • Jorge Quesada, Zoe Fowler, Mohammad Alotaibi, Mohit Prabhushankar, Ghassan AlRegib
Additionally, we demonstrate that performance when using automated methods can be improved by up to 68% via a finetuning approach.
no code implementations • 20 Aug 2024 • Mohit Prabhushankar, Kiran Kokilepersaud, Jorge Quesada, Yavuz Yarici, Chen Zhou, Mohammad Alotaibi, Ghassan AlRegib, Ahmad Mustafa, Yusufjon Kumakov
However, specialized applications that require expert labels lag in data availability.
no code implementations • 20 Aug 2024 • Ghassan AlRegib, Mohit Prabhushankar, Kiran Kokilepersaud, Prithwijit Chowdhury, Zoe Fowler, Stephanie Trejo Corona, Lucas Thomaz, Angshul Majumdar
Balancing personalization and generalization is an important challenge to tackle, as the variation within OCT scans of patients between visits can be minimal while the difference in manifestation of the same disease across different patients may be substantial.
no code implementations • 11 Aug 2024 • Ghazal Kaviani, Reza Marzban, Ghassan AlRegib
This paper investigates image denoising, comparing traditional non-learning-based techniques, represented by Block-Matching 3D (BM3D), with modern learning-based methods, exemplified by NBNet.
no code implementations • 12 Jun 2024 • Prithwijit Chowdhury, Mohit Prabhushankar, Ghassan AlRegib, Mohamed Deriche
Explainable AI (XAI) has revolutionized the field of deep learning by empowering users to have more trust in neural network models.
1 code implementation • 12 Jun 2024 • Efe Ozturk, Mohit Prabhushankar, Ghassan AlRegib
In this study, we introduce an intelligent Test Time Augmentation (TTA) algorithm designed to enhance the robustness and accuracy of image classification models against viewpoint variations.
1 code implementation • 11 Jun 2024 • Yavuz Yarici, Kiran Kokilepersaud, Mohit Prabhushankar, Ghassan AlRegib
In scenarios, where labels are absent, these importance maps provide more intuitive explanations as they are integral to the human visual system.
no code implementations • 10 Jun 2024 • Kiran Kokilepersaud, Yavuz Yarici, Mohit Prabhushankar, Ghassan AlRegib
In reality, the class label is only one level of a \emph{hierarchy of different semantic relationships known as a taxonomy}.
1 code implementation • 1 Jun 2024 • Mohit Prabhushankar, Ghassan AlRegib
We show that every image, network, prediction, and explanatory technique has a unique uncertainty.
no code implementations • 1 Jun 2024 • Ryan Benkert, Mohit Prabhushankar, Ghassan AlRegib
By combining this approach with active learning, a well-known machine learning paradigm for data selection, we arrive at a comprehensive and innovative framework for training set selection in seismic interpretation.
no code implementations • 25 May 2024 • Ryan Benkert, Mohit Prabhushankar, Ghassan AlRegib
We refer to the underlying preservation mechanism as transitional feature preservation.
2 code implementations • 22 May 2024 • Mohit Prabhushankar, Ghassan AlRegib
We observe the following: (i) simple methodologies like negative log likelihood and margin classifiers outperform state-of-the-art uncertainty and out-of-distribution detection techniques for misprediction rates, and (ii) the proposed GradTrust is in the Top-2 performing methods on $37$ of the considered $38$ experimental modalities.
1 code implementation • 10 Apr 2024 • Chen Zhou, Ghassan AlRegib, Armin Parchami, Kunjan Singh
We integrate multi-goal estimation and region-based relation learning to model the two stimuli, social interactions, and stochastic goals, in a prediction framework.
no code implementations • 15 Mar 2024 • Chen Zhou, Mohit Prabhushankar, Ghassan AlRegib
Annotator label uncertainty manifests in variations of labeling quality.
1 code implementation • 17 Nov 2023 • Kiran Kokilepersaud, Yash-yee Logan, Ryan Benkert, Chen Zhou, Mohit Prabhushankar, Ghassan AlRegib, Enrique Corona, Kunjan Singh, Mostafa Parchami
In this paper, we introduce the FOCAL (Ford-OLIVES Collaboration on Active Learning) dataset which enables the study of the impact of annotation-cost within a video active learning setting.
1 code implementation • 20 Jul 2023 • Zoe Fowler, Kiran Kokilepersaud, Mohit Prabhushankar, Ghassan AlRegib
There exists two types of clinical trials: retrospective and prospective.
no code implementations • 24 May 2023 • Kiran Kokilepersaud, Stephanie Trejo Corona, Mohit Prabhushankar, Ghassan AlRegib, Charles Wykoff
We exploit this relationship by using the clinical data as pseudo-labels for our data without biomarker labels in order to choose positive and negative instances for training a backbone network with a supervised contrastive loss.
no code implementations • 28 Apr 2023 • Kiran Kokilepersaud, Mohit Prabhushankar, Yavuz Yarici, Ghassan AlRegib, Armin Parchami
In this work, we present a methodology to shape a fisheye-specific representation space that reflects the interaction between distortion and semantic context present in this data modality.
no code implementations • 6 Apr 2023 • Jinsol Lee, Charlie Lehman, Mohit Prabhushankar, Ghassan AlRegib
We define purview as the additional capacity necessary to characterize inference samples that differ from the training data.
no code implementations • 24 Feb 2023 • Ryan Benkert, Oluwaseun Joseph Aribido, Ghassan AlRegib
In recent years, deep neural networks have significantly impacted the seismic interpretation process.
2 code implementations • 16 Feb 2023 • Ryan Benkert, Mohit Prabhushankar, Ghassan AlRegib, Armin Pacharmi, Enrique Corona
To alleviate this issue, we propose a grounded second-order definition of information content and sample importance within the context of active learning.
1 code implementation • 11 Feb 2023 • Mohit Prabhushankar, Ghassan AlRegib
This paper conjectures and validates a framework that allows for action during inference in supervised neural networks.
1 code implementation • 14 Jan 2023 • Chen Zhou, Ghassan AlRegib, Armin Parchami, Kunjan Singh
In smart transportation, intelligent systems avoid potential collisions by predicting the intent of traffic agents, especially pedestrians.
no code implementations • 12 Jan 2023 • Ryan Benkert, Mohit Prabhushankar, Ghassan AlRegib
However, existing strategies directly base the data selection on the data representation of the unlabeled data which is random for OOD samples by definition.
no code implementations • 10 Jan 2023 • Ryan Benkert, Oluwaseun Joseph Aribido, Ghassan AlRegib
Inspired by this phenomenon, we present a novel segmentation method that actively uses this information to alter the data representation within the model by increasing the variety of difficult regions.
no code implementations • 15 Dec 2022 • Ahmad Mustafa, Ghassan AlRegib
On a dataset of well outcomes and corresponding geophysical attribute data, we show how LIME can induce trust in model's decisions by revealing the decision-making process to be aligned to domain knowledge.
1 code implementation • 15 Dec 2022 • Ahmad Mustafa, Ghassan AlRegib
Deep learning can extract rich data representations if provided sufficient quantities of labeled training data.
1 code implementation • 10 Nov 2022 • Chen Zhou, Mohit Prabhushankar, Ghassan AlRegib
Our evaluation of existing uncertainty estimation algorithms, with the presence of HLU, indicates the limitations of existing uncertainty metrics and algorithms themselves in response to HLU.
no code implementations • 9 Nov 2022 • Kiran Kokilepersaud, Mohit Prabhushankar, Ghassan AlRegib
This is accomplished by leveraging the larger amount of clinical data as pseudo-labels for our data without biomarker labels in order to choose positive and negative instances for training a backbone network with a supervised contrastive loss.
1 code implementation • 22 Sep 2022 • Mohit Prabhushankar, Kiran Kokilepersaud, Yash-yee Logan, Stephanie Trejo Corona, Ghassan AlRegib, Charles Wykoff
The dataset consists of 1268 near-IR fundus images each with at least 49 OCT scans, and 16 biomarkers, along with 4 clinical labels and a disease diagnosis of DR or DME.
1 code implementation • 17 Sep 2022 • Mohit Prabhushankar, Ghassan AlRegib
Finally, we ground the proposed machine introspection to human introspection for the application of image quality assessment.
no code implementations • 23 Jun 2022 • Yash-yee Logan, Ryan Benkert, Ahmad Mustafa, Gukyeong Kwon, Ghassan AlRegib
For this purpose, we propose a framework that incorporates clinical insights into the sample selection process of active learning that can be incorporated with existing algorithms.
no code implementations • 21 Jun 2022 • Yash-yee Logan, Mohit Prabhushankar, Ghassan AlRegib
Hence, active learning techniques that are developed for natural images are insufficient for handling medical data.
no code implementations • 16 Jun 2022 • Jinsol Lee, Ghassan AlRegib
Neural networks for image classification tasks assume that any given image during inference belongs to one of the training classes.
no code implementations • 16 Jun 2022 • Kiran Kokilepersaud, Mohit Prabhushankar, Ghassan AlRegib
In seismic interpretation, pixel-level labels of various rock structures can be time-consuming and expensive to obtain.
no code implementations • 16 Jun 2022 • Jinsol Lee, Mohit Prabhushankar, Ghassan AlRegib
We propose to utilize gradients for detecting adversarial and out-of-distribution samples.
1 code implementation • 8 Mar 2022 • Gukyeong Kwon, Ghassan AlRegib
Also, the two-stream autoencoder works as a unified framework for the gating model and the unseen expert, which makes the proposed method computationally efficient.
2 code implementations • 24 Feb 2022 • Ghassan AlRegib, Mohit Prabhushankar
With $P$ as the prediction from a neural network, these questions are `Why P?
no code implementations • 17 Jan 2022 • Muhammad Amir Shafiq, Zhiling Long, Haibin Di, Ghassan AlRegib
Subsequently, a novel directional center-surround attention model is proposed to incorporate directional comparisons around each voxel for saliency detection within each projected dimension.
no code implementations • 22 Aug 2021 • Oluwaseun Joseph Aribido, Ghassan AlRegib, Yazeed Alaudah
Images from the selected clusters are used to train the encoder-decoder network.
no code implementations • 15 Apr 2021 • Min-Hung Chen, Baopu Li, Yingze Bao, Ghassan AlRegib
The main progress for action segmentation comes from densely-annotated data for fully-supervised learning.
Ranked #16 on Action Segmentation on Breakfast
no code implementations • 23 Mar 2021 • Mohit Prabhushankar, Ghassan AlRegib
Neural networks trained to classify images do so by identifying features that allow them to distinguish between classes.
no code implementations • 23 Mar 2021 • Mohit Prabhushankar, Ghassan AlRegib
In this paper, we formalize the structure of contrastive reasoning and propose a methodology to extract a neural network's notion of contrast.
1 code implementation • 10 Sep 2020 • Oluwaseun Joseph Aribido, Ghassan AlRegib, Mohamed Deriche
Annotating seismic data is expensive, laborious and subjective due to the number of years required for seismic interpreters to attain proficiency in interpretation.
1 code implementation • 2 Sep 2020 • Charles Lehman, Dogancan Temel, Ghassan AlRegib
Semantic segmentation is a scene understanding task at the heart of safety-critical applications where robustness to corrupted inputs is essential.
1 code implementation • 21 Aug 2020 • Shirley Liu, Charles Lehman, Ghassan AlRegib
In this paper, we examine the overfitting behavior of image classification models modified with Implicit Background Estimation (SCrIBE), which transforms them into weakly supervised segmentation models that provide spatial domain visualizations without affecting performance.
no code implementations • 18 Aug 2020 • Jinsol Lee, Ghassan AlRegib
We demonstrate the effectiveness of gradients as a measure of model uncertainty in applications of detecting unfamiliar inputs, including out-of-distribution and corrupted samples.
no code implementations • 13 Aug 2020 • Gukyeong Kwon, Mohit Prabhushankar, Dogancan Temel, Ghassan AlRegib
To articulate the significance of the model perspective in novelty detection, we utilize backpropagated gradients.
no code implementations • 4 Aug 2020 • Yutong Sun, Mohit Prabhushankar, Ghassan AlRegib
In this paper, we show that existing recognition and localization deep architectures, that have not been exposed to eye tracking data or any saliency datasets, are capable of predicting the human visual saliency.
3 code implementations • 1 Aug 2020 • Mohit Prabhushankar, Gukyeong Kwon, Dogancan Temel, Ghassan AlRegib
Current modes of visual explanations answer questions of the form $`Why \text{ } P?'$.
2 code implementations • ECCV 2020 • Gukyeong Kwon, Mohit Prabhushankar, Dogancan Temel, Ghassan AlRegib
Anomalies require more drastic model updates to fully represent them compared to normal data.
no code implementations • 28 Jun 2020 • Ahmad Mustafa, Motaz Alfarraj, Ghassan AlRegib
We empirically compare our proposed workflow with some other sequence modeling-based neural networks that model seismic data only temporally.
no code implementations • 16 Mar 2020 • Yuting Hu, Zhiling Long, Anirudha Sundaresan, Motaz Alfarraj, Ghassan AlRegib, Sungmee Park, Sundaresan Jayaraman
We formulate the problem as a very fine-grained texture classification problem, and study how deep learning-based texture representation techniques can help tackle the task.
1 code implementation • CVPR 2020 • Min-Hung Chen, Baopu Li, Yingze Bao, Ghassan AlRegib, Zsolt Kira
Despite the recent progress of fully-supervised action segmentation techniques, the performance is still not fully satisfactory.
Ranked #12 on Action Segmentation on GTEA
no code implementations • 4 Feb 2020 • Yuting Hu, Zhen Wang, Ghassan AlRegib
In this paper, we present an efficient and distinctive local descriptor, namely block intensity and gradient difference (BIGD).
no code implementations • ICLR 2020 • Gukyeong Kwon, Mohit Prabhushankar, Dogancan Temel, Ghassan AlRegib
To complement the learned information from activation-based representation, we propose utilizing a gradient-based representation that explicitly focuses on missing information.
no code implementations • 25 Sep 2019 • Mohit Prabhushankar, Gukyeong Kwon, Dogancan Temel, Ghassan AlRegib
Such a positioning scheme is based on a data point’s second-order property.
2 code implementations • 29 Aug 2019 • Dogancan Temel, Min-Hung Chen, Ghassan AlRegib
We investigate the effect of challenging conditions through spectral analysis and show that challenging conditions can lead to distinct magnitude spectrum characteristics.
2 code implementations • 27 Aug 2019 • Gukyeong Kwon, Mohit Prabhushankar, Dogancan Temel, Ghassan AlRegib
In this paper, we utilize weight gradients from backpropagation to characterize the representation space learned by deep learning algorithms.
2 code implementations • 19 Aug 2019 • Motaz Alfarraj, Ghassan AlRegib
The proposed workflow uses well-log data to guide the inversion.
no code implementations • 6 Aug 2019 • Dogancan Temel, Melvin J. Mathew, Ghassan AlRegib, Yousuf M. Khalifa
Based on the conducted experiments, proposed algorithm RAPDNet can achieve a sensitivity and a specificity of 90. 6% over 64 test cases in a balanced set, which corresponds to an AUC of 0. 929 in ROC analysis.
5 code implementations • ICCV 2019 • Min-Hung Chen, Zsolt Kira, Ghassan AlRegib, Jaekwon Yoo, Ruxin Chen, Jian Zheng
Finally, we propose Temporal Attentive Adversarial Adaptation Network (TA3N), which explicitly attends to the temporal dynamics using domain discrepancy for more effective domain alignment, achieving state-of-the-art performance on four video DA datasets (e. g. 7. 9% accuracy gain over "Source only" from 73. 9% to 81. 8% on "HMDB --> UCF", and 10. 3% gain on "Kinetics --> Gameplay").
3 code implementations • 6 Jun 2019 • Ahmad Mustafa, Motaz Alfarraj, Ghassan AlRegib
In exploration seismology, seismic inversion refers to the process of inferring physical properties of the subsurface from seismic data.
2 code implementations • 1 Jun 2019 • Chih-Yao Ma, Yannis Kalantidis, Ghassan AlRegib, Peter Vajda, Marcus Rohrbach, Zsolt Kira
When automatically generating a sentence description for an image or video, it often remains unclear how well the generated caption is grounded, that is whether the model uses the correct image regions to output particular words, or if the model is hallucinating based on priors in the dataset and/or the language model.
2 code implementations • 31 May 2019 • Motaz Alfarraj, Ghassan AlRegib
Then, a neural-network-based inversion model comprising convolutional and recurrent neural layers is used to invert seismic data for acoustic impedance.
Image and Video Processing Signal Processing Geophysics
5 code implementations • 26 May 2019 • Min-Hung Chen, Zsolt Kira, Ghassan AlRegib
Finally, we propose Temporal Attentive Adversarial Adaptation Network (TA3N), which explicitly attends to the temporal dynamics using domain discrepancy for more effective domain alignment, achieving state-of-the-art performance on three video DA datasets.
Ranked #1 on Domain Adaptation on UCF-to-Olympic
no code implementations • 24 May 2019 • Hasan Al-Marzouqi, Yuting Hu, Ghassan AlRegib
Image retrieval is an important problem in the area of multimedia processing.
1 code implementation • 23 May 2019 • Yuting Hu, Zhiling Long, Ghassan AlRegib
In this paper, we propose a multi-level texture encoding and representation network (MuLTER) for texture-related applications.
1 code implementation • 23 May 2019 • Charles Lehman, Dogancan Temel, Ghassan AlRegib
Scene understanding and semantic segmentation are at the core of many computer vision tasks, many of which, involve interacting with humans in potentially dangerous ways.
no code implementations • 21 May 2019 • Dogancan Temel, Melvin J. Mathew, Ghassan AlRegib, Yousuf M. Khalifa
In this paper, we introduce a portable eye imaging device denoted as lab-on-a-headset, which can automatically perform a swinging flashlight test.
no code implementations • 16 May 2019 • Yazeed Alaudah, Motaz Alfarraj, Ghassan AlRegib
By having an interpreter select a very small number of exemplar images for every class of subsurface structures, we use a novel similarity-based retrieval technique to extract thousands of images that contain similar subsurface structures from the seismic volume.
3 code implementations • CVPR 2019 • Chih-Yao Ma, Zuxuan Wu, Ghassan AlRegib, Caiming Xiong, Zsolt Kira
As deep learning continues to make progress for challenging perception tasks, there is increased interest in combining vision, language, and decision-making.
Ranked #115 on Vision and Language Navigation on VLN Challenge
1 code implementation • CVPR 2019 (Oral) 2019 • Chih-Yao Ma, Zuxuan Wu, Ghassan AlRegib, Caiming Xiong, Zsolt Kira
As deep learning continues to make progress for challenging perception tasks, there is increased interest in combining vision, language, and decision-making.
2 code implementations • 19 Feb 2019 • Dogancan Temel, Tariq Alshawi, Min-Hung Chen, Ghassan AlRegib
Experimental results show that benchmarked algorithms are highly sensitive to tested challenging conditions, which result in an average performance drop of 0. 17 in terms of precision and a performance drop of 0. 28 in recall under severe conditions.
1 code implementation • 18 Feb 2019 • Dogancan Temel, Jinsol Lee, Ghassan AlRegib
Experimental results show that deep learning-based image representations can estimate the recognition performance variation with a Spearman's rank-order correlation of 0. 94 under multifarious acquisition conditions.
no code implementations • 17 Feb 2019 • Mohit Prabhushankar, Gukyeong Kwon, Dogancan Temel, Ghassan AlRegib
In this paper, we generate and control semantically interpretable filters that are directly learned from natural images in an unsupervised fashion.
no code implementations • 30 Jan 2019 • Zhiling Long, Yazeed Alaudah, Muhammad Ali Qureshi, Motaz Al Farraj, Zhen Wang, Asjad Amin, Mohamed Deriche, Ghassan AlRegib
It is our hope that this comparative study will help acquaint the seismic interpretation community with the many available powerful image texture analysis techniques, providing more alternative attributes for their seismic exploration.
no code implementations • 30 Jan 2019 • Tariq Alshawi, Zhiling Long, Ghassan AlRegib
In this paper, we present an analysis of recorded eye-fixation data from human subjects viewing video sequences.
no code implementations • 30 Jan 2019 • Muhammad Amir Shafiq, Zhiling Long, Tariq Alshawi, Ghassan AlRegib
In this paper, we propose a novel approach for saliency detection for seismic applications using 3D-FFT local spectra and multi-dimensional plane projections.
1 code implementation • 24 Jan 2019 • Motaz Alfarraj, Yazeed Alaudah, Zhiling Long, Ghassan AlRegib
Moreover, directional multiresolution attributes, such as the curvelet transform, are more effective than the non-directional attributes in distinguishing different subsurface structures in large seismic datasets, and can greatly help the interpretation process.
Image and Video Processing Geophysics
no code implementations • 24 Jan 2019 • Motaz Alfarraj, Ghassan AlRegib
Recent advances in machine learning have shown promising results for recurrent neural networks (RNN) in modeling complex sequential data such as videos and speech signals.
5 code implementations • 12 Jan 2019 • Yazeed Alaudah, Patrycja Michalowicz, Motaz Alfarraj, Ghassan AlRegib
In addition to making the dataset and the code publicly available, this work helps advance research in this area by creating an objective benchmark for comparing the results of different machine learning approaches for facies classification.
2 code implementations • ICLR 2019 • Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong
The Vision-and-Language Navigation (VLN) task entails an agent following navigational instruction in photo-realistic unknown environments.
Ranked #115 on Vision and Language Navigation on VLN Challenge
Natural Language Visual Grounding Vision and Language Navigation +2
no code implementations • 9 Jan 2019 • Muhammad Amir Shafiq, Tariq Alshawi, Zhiling Long, Ghassan AlRegib
In this paper, we propose a saliency-based attribute, SalSi, to detect salt dome bodies within seismic volumes.
no code implementations • 6 Jan 2019 • Tariq Alshawi, Zhiling Long, Ghassan AlRegib
Based on the study, we then develop an algorithm that estimates a pixel-wise uncertainty map that reflects our confidence in the associated computational saliency map by relating a pixel's saliency to the saliency of its neighbors.
no code implementations • 31 Dec 2018 • Muhammad Amir Shafiq, Tariq Alshawi, Zhiling Long, Ghassan AlRegib
In this paper, we propose a workflow based on SalSi for the detection and delineation of geological structures such as salt domes.
no code implementations • 19 Dec 2018 • Zhiling Long, Yazeed Alaudah, Muhammad Ali Qureshi, Yuting Hu, Zhen Wang, Motaz Alfarraj, Ghassan AlRegib, Asjad Amin, Mohamed Deriche, Suhail Al-Dharrab, Haibin Di
We focus on spatial attributes in this study and examine them in a new application for seismic interpretation, i. e., seismic volume labeling.
2 code implementations • 12 Dec 2018 • Mohammed A. Aabed, Gukyeong Kwon, Ghassan AlRegib
This is a full-reference tempospatial approach that considers both temporal and spatial PSD characteristics.
1 code implementation • 22 Nov 2018 • Dogancan Temel, Ghassan AlRegib
In this work, we combine these approaches by extending CIEDE2000 formula with perceptual color difference to assess image quality.
1 code implementation • 21 Nov 2018 • Dogancan Temel, Ghassan AlRegib
In addition to support vector machines that are commonly used in the multi-method fusion, we propose using neural networks in the boosting.
2 code implementations • 21 Nov 2018 • Mohit Prabhushankar, Dogancan Temel, Ghassan AlRegib
We use multiple linear decoders to capture different abstraction levels of the image patches.
1 code implementation • 21 Nov 2018 • Dogancan Temel, Ghassan AlRegib
In this work, we compare the state of the art quality and content-based spatial pooling strategies and show that although features are the key in any image quality assessment, pooling also matters.
no code implementations • 21 Nov 2018 • Mohit Prabhushankar, Dogancan Temel, Ghassan AlRegib
While assessing image quality, the filters need to capture perceptual differences based on dissimilarities between a reference image and its distorted version.
no code implementations • 19 Nov 2018 • Dogancan Temel, Ghassan AlRegib
We show that generic descriptors can perform as well as state of the art hand-crafted aesthetics models that use global features.
no code implementations • 18 Nov 2018 • Dogancan Temel, Ghassan AlRegib
An average observer perceives the world in color instead of black and white.
1 code implementation • 16 Nov 2018 • Dogancan Temel, Ghassan AlRegib
Moreover, BleSS significantly enhances the performance of SR-SIM and FSIM in the full TID 2013 database.
1 code implementation • 14 Nov 2018 • Dogancan Temel, Ghassan AlRegib
In terms of the Pearson and the Spearman correlation, ReSIFT is the best performing quality estimator in the overall databases.
1 code implementation • 5 Nov 2018 • Motaz Alfarraj, Yazeed Alaudah, Ghassan AlRegib
In this paper, we introduce a non-parametric texture similarity measure based on the singular value decomposition of the curvelet coefficients followed by a content-based truncation of the singular values.
Image and Video Processing
1 code implementation • 19 Oct 2018 • Huijie Pan, Dogancan Temel, Ghassan AlRegib
In this paper, we propose an algorithm denoted as HeartBEAT that tracks heart rate from wrist-type photoplethysmography (PPG) signals and simultaneously recorded three-axis acceleration data.
1 code implementation • 18 Oct 2018 • Dogancan Temel, Jinsol Lee, Ghassan AlRegib
Moreover, we investigate the relationship between object recognition and image quality and show that objective quality algorithms can estimate recognition performance under certain photometric challenging conditions.
1 code implementation • 15 Oct 2018 • Dogancan Temel, Ghassan AlRegib
Robust and reliable traffic sign detection is necessary to bring autonomous vehicles onto our roads.
no code implementations • 15 Oct 2018 • Dogancan Temel, Ghassan AlRegib
This paper presents a full-reference image quality estimator based on color, structure, and visual system characteristics denoted as CSV.
no code implementations • 14 Oct 2018 • Dogancan Temel, Ghassan AlRegib
To overcome these shortcomings, we introduce an image quality assessment algorithm based on the Spectral Understanding of Multi-scale and Multi-channel Error Representations, denoted as SUMMER.
1 code implementation • 7 Dec 2017 • Dogancan Temel, Gukyeong Kwon, Mohit Prabhushankar, Ghassan AlRegib
We benchmark the performance of existing solutions in real-world scenarios and analyze the performance variation with respect to challenging conditions.
no code implementations • 16 Nov 2017 • Chih-Yao Ma, Asim Kadav, Iain Melvin, Zsolt Kira, Ghassan AlRegib, Hans Peter Graf
We address the problem of video captioning by grounding language generation on object interactions in the video.
no code implementations • CVPR 2018 • Chih-Yao Ma, Asim Kadav, Iain Melvin, Zsolt Kira, Ghassan AlRegib, Hans Peter Graf
Human actions often involve complex interactions across several inter-related objects in the scene.
4 code implementations • 30 Mar 2017 • Chih-Yao Ma, Min-Hung Chen, Zsolt Kira, Ghassan AlRegib
We demonstrate that using both RNNs (using LSTMs) and Temporal-ConvNets on spatiotemporal feature matrices are able to exploit spatiotemporal dynamics to improve the overall performance.
Ranked #56 on Action Recognition on HMDB-51