no code implementations • 10 Dec 2024 • Quinten McNamara, Miguel Ángel del Río Fernández, Nishchal Bhandari, Martin Ratajczak, Danny Chen, Corey Miller, Migüel Jetté
Word error rate (WER) as a metric has a variety of limitations that have plagued the field of speech recognition.
no code implementations • 18 Nov 2024 • Jinhong Wang, Jian Liu, Dongqi Tang, Weiqiang Wang, Wentong Li, Danny Chen, Jintai Chen, Jian Wu
Our idea is simple: We tackle the monocular depth estimation (MDE) task with an autoregressive prediction paradigm, based on two core designs.
no code implementations • 4 Oct 2024 • Nishchal Bhandari, Danny Chen, Miguel Ángel del Río Fernández, Natalie Delworth, Jennifer Drexler Fox, Migüel Jetté, Quinten McNamara, Corey Miller, Ondřej Novotný, Ján Profant, Nan Qin, Martin Ratajczak, Jean-Philippe Robichaud
Today, we are open-sourcing our core speech recognition and diarization models for non-commercial use.
no code implementations • 18 Apr 2024 • Hanjing Zhou, Mingze Yin, Jintai Chen, Danny Chen, Jian Wu
One-shot semantic segmentation aims to segment query images given only ONE annotated support image of the same class.
no code implementations • 11 Apr 2024 • Jinhong Wang, Yi Cheng, Jintai Chen, Hongxia Xu, Danny Chen, Jian Wu
In this paper, we tackle two challenges arisen in multi-rater annotations for medical image segmentation (called ambiguous medical image segmentation): (1) How to train a deep learning model when a group of raters produces a set of diverse but plausible annotations, and (2) how to fine-tune the model efficiently when computation resources are not available for re-training the entire model on a different dataset domain.
1 code implementation • 28 Mar 2024 • Jinhong Wang, Tingting Chen, Jintai Chen, Yixuan Wu, Yuyang Xu, Danny Chen, Haochao Ying, Jian Wu
In this paper, we present a self-supervised method via polar transformation based progressive contrastive learning, called PoCo, for ophthalmic disease diagnosis.
2 code implementations • 12 Mar 2024 • Jinhong Wang, Jintai Chen, Danny Chen, Jian Wu
In this paper, we introduce a Large Kernel Vision Mamba U-shape Network, or LKM-UNet, for medical image segmentation.
no code implementations • 14 Jan 2024 • Guangyu Meng, Ruyu Zhou, Liu Liu, Peixian Liang, Fang Liu, Danny Chen, Michael Niemier, X. Sharon Hu
Earth Mover's Distance (EMD) is an important similarity measure between two distributions, used in computer vision and many other application domains.
1 code implementation • 28 Nov 2023 • Jiahuan Yan, Haojun Gao, Zhang Kai, Weize Liu, Danny Chen, Jian Wu, Jintai Chen
Deep learning approaches exhibit promising performances on various text tasks.
1 code implementation • ICLR 2023 • Jintai Chen, Kuanlun Liao, Yanwen Fang, Danny Chen, Jian Wu
In this paper, we propose to encapsulate all feature values of a record into vectorial features and process them collectively rather than have to deal with individual ones, which directly captures the representations at the data level and benefits robust performances.
1 code implementation • ICCV 2023 • Jinhong Wang, Yi Cheng, Jintai Chen, Tingting Chen, Danny Chen, Jian Wu
In this way, we decompose an ordinal regression task into a series of recursive binary classification steps, so as to subtly distinguish adjacent categories.
1 code implementation • 7 May 2023 • Yi Cheng, Haochao Ying, Renjun Hu, Jinhong Wang, Wenhao Zheng, Xiao Zhang, Danny Chen, Jian Wu
Image ordinal regression has been mainly studied along the line of exploiting the order of categories.
1 code implementation • 12 Dec 2022 • Jinhong Wang, Jingwen Wang, Tingting Chen, Wenhao Zheng, Zhe Xu, Xingdi Wu, Wen Xu, Haochao Ying, Danny Chen, Jian Wu
Clinically, to assess the necessity of cataract surgery, accurately predicting postoperative VA before surgery by analyzing multi-view optical coherence tomography (OCT) images is crucially needed.
no code implementations • 2 Jul 2020 • Hui Xie, Zhe Pan, Leixin Zhou, Fahim A Zaman, Danny Chen, Jost B Jonas, Yaxing Wang, Xiaodong Wu
In this work, we propose to parameterize the surface cost functions in the graph model and leverage DL to learn those parameters.
no code implementations • CVPR 2018 • Xiaowei Xu, Qing Lu, Yu Hu, Lin Yang, Sharon Hu, Danny Chen, Yiyu Shi
Unlike existing litera- ture on quantization which primarily targets memory and computation complexity reduction, we apply quan- tization as a method to reduce over tting in FCNs for better accuracy.