no code implementations • 4 Jun 2025 • Liam Salass, Jerrin Bright, Amir Nazemi, Yuhao Chen, John Zelek, David Clausi
For evaluation, in addition to standard average precision, we propose Rink Space Localization Error (RSLE), a scale-invariant homography-based metric for removing perspective bias from rink space evaluation.
no code implementations • 3 Jun 2025 • Jerrin Bright, Zhibo Wang, Yuhao Chen, Sirisha Rambhatla, John Zelek, David Clausi
Lack of input data for in-the-wild activities often results in low performance across various computer vision tasks.
no code implementations • 7 May 2025 • Ervin Wang, Yuhao Chen
Accurately tracking food consumption is crucial for nutrition and health monitoring.
no code implementations • 5 May 2025 • Kevin Tan, Fan Yang, Yuhao Chen
Accurate dietary monitoring is essential for promoting healthier eating habits.
no code implementations • 1 May 2025 • Wallace Lee, Yuhao Chen
Monitoring dietary habits is crucial for preventing health risks associated with overeating and undereating, including obesity, diabetes, and cardiovascular diseases.
no code implementations • 10 Apr 2025 • Joshua Li, Fernando Jose Pena Cantu, Emily Yu, Alexander Wong, Yuchen Cui, Yuhao Chen
Then, we employ a matching algorithm to map each object in the scene graph with a SAM2-generated or SAM2-propagated mask, producing a temporally-consistent scene graph in dynamic environments.
no code implementations • 2 Apr 2025 • Junchi Zhou, Haozhou Wang, Yoichiro Kato, Tejasri Nampally, P. Rajalakshmi, M. Balram, Keisuke Katsura, Hao Lu, Yue Mu, Wanneng Yang, Yangmingrui Gao, Feng Xiao, Hongtao Chen, Yuhao Chen, Wenjuan Li, Jingwen Wang, Fenghua Yu, Jian Zhou, Wensheng Wang, Xiaochun Hu, Yuanzhu Yang, Yanfeng Ding, Wei Guo, Shouyang Liu
Developing computer vision-based rice phenotyping techniques is crucial for precision field management and accelerating breeding, thereby continuously advancing rice production.
no code implementations • 13 Mar 2025 • Xiangjie Kong, Zhenghao Chen, Weiyao Liu, Kaili Ning, Lechao Zhang, Syauqie Muhammad Marier, Yichen Liu, Yuhao Chen, Feng Xia
However, existing surveys have not provided a unified summary of the wide range of model architectures in this field, nor have they given detailed summaries of works in feature extraction and datasets.
no code implementations • 9 Mar 2025 • Zhaowei Chen, Borui Zhao, Yuchen Ge, Yuhao Chen, RenJie Song, Jiajun Liang
Building on these findings, we propose Asymmetric Decision-Making (ADM) to enhance feature consensus learning for student models while continuously promoting feature diversity in teacher models.
no code implementations • 6 Mar 2025 • Shen Zhang, Yaning Tan, Siyuan Liang, Linze Li, Ge Wu, Yuhao Chen, Shuheng Li, Zhenyu Zhao, Caihua Chen, Jiajun Liang, Yao Tang
Diffusion transformers (DiTs) struggle to generate images at resolutions higher than their training resolutions.
1 code implementation • 16 Feb 2025 • Yuanjie Lyu, Chao Zhang, Yuhao Chen, Yong Chen, Tong Xu
In Retrieval-Augmented Generation (RAG) and agent-based frameworks, the "Chain of Models" approach is widely used, where multiple specialized models work sequentially on distinct sub-tasks.
no code implementations • 26 Jan 2025 • Weixuan Chen, Qianqian Yang, Yuhao Chen, Chongwen Huang, Qian Wang, Zehui Xiong, Zhaoyang Zhang
Although significant improvements in transmission efficiency have been achieved, existing semantic communication (SemCom) methods typically use a fixed transmission rate for varying channel conditions and transmission contents, leading to performance degradation under harsh channel conditions.
no code implementations • CVPR 2025 • Fangyu Wu, Yuhao Chen
In the real world, objects reveal internal textures when sliced or cut, yet this behavior is not well-studied in 3D generation tasks today.
no code implementations • 25 Oct 2024 • E. Zhixuan Zeng, Yuhao Chen, Alexander Wong
This method allows for the automatic understanding of hidden features and supports a broader range of analysis without the need to train specific vectors.
1 code implementation • 15 Oct 2024 • Yizhe Liu, Yan Song Hu, Yuhao Chen, John Zelek
Image-based Pose-Agnostic 3D Anomaly Detection is an important task that has emerged in industrial quality control.
no code implementations • 19 Sep 2024 • Yan Song Hu, Nicolas Abboud, Muhammad Qasim Ali, Adam Srebrnjak Yang, Imad Elhajj, Daniel Asmar, Yuhao Chen, John S. Zelek
As a result, experiments show that our system generates reconstructions with a balance of quality, memory efficiency, and speed that outperforms the state-of-the-art.
no code implementations • 3 Sep 2024 • Yuhao Chen, Jiangpeng He, Gautham Vinod, Siddeshwar Raghavan, Chris Czarnecki, Jinge Ma, Talha Ibn Mahmud, Bruce Coburn, Dayou Mao, Saeejith Nair, Pengcheng Xi, Alexander Wong, Edward Delp, Fengqing Zhu
To bridge the gap between general 3D vision and food computing research, we introduce MetaFood3D.
no code implementations • 7 Aug 2024 • Yan Song Hu, Dayou Mao, Yuhao Chen, John Zelek
Initial applications of 3D Gaussian Splatting (3DGS) in Visual Simultaneous Localization and Mapping (VSLAM) demonstrate the generation of high-quality volumetric reconstructions from monocular video streams.
1 code implementation • 12 Jul 2024 • Jiangpeng He, Yuhao Chen, Gautham Vinod, Talha Ibn Mahmud, Fengqing Zhu, Edward Delp, Alexander Wong, Pengcheng Xi, Ahmad AlMughrabi, Umair Haroon, Ricardo Marques, Petia Radeva, Jiadong Tang, Dianyi Yang, Yu Gao, Zhaoxiang Liang, Yawei Jueluo, Chengyu Shi, Pengyu Wang
Participants were tasked with reconstructing 3D models for 20 selected food items of varying difficulty levels: easy, medium, and hard.
no code implementations • 27 Jun 2024 • Yuxiang Huang, Yuhao Chen, John Zelek
In contrast, traditional methods based on optical flow do not require training data; however, they often fail to capture object-level information, leading to over-segmentation or under-segmentation.
no code implementations • 5 Jun 2024 • E. Zhixuan Zeng, Yuhao Chen, Alexander Wong
Image generation techniques, particularly latent diffusion models, have exploded in popularity in recent years.
no code implementations • 25 May 2024 • Yuhao Chen, Zhimu Wang, Bo Wen, Farhana Zulkernine
Unstructured text in medical notes and dialogues contains rich information.
no code implementations • 22 May 2024 • Harish Prakash, Jia Cheng Shang, Ken M. Nsiempba, Yuhao Chen, David A. Clausi, John S. Zelek
Multi-Object Tracking (MOT) in ice hockey pursues the combined task of localizing and associating players across a given sequence to maintain their identities.
no code implementations • 16 May 2024 • Muhammed Patel, Xinwei Chen, Linlin Xu, Yuhao Chen, K Andrea Scott, David A. Clausi
Fully supervised deep learning approaches have demonstrated impressive accuracy in sea ice classification, but their dependence on high-resolution labels presents a significant challenge due to the difficulty of obtaining such data.
no code implementations • 13 May 2024 • Jerrin Bright, Bavesh Balaji, Yuhao Chen, David A Clausi, John S Zelek
In the high-stakes world of baseball, every nuance of a pitcher's mechanics holds the key to maximizing performance and minimizing runs.
no code implementations • 13 May 2024 • Matthew Keller, Chi-en Amy Tai, Yuhao Chen, Pengcheng Xi, Alexander Wong
Many aging individuals encounter challenges in effectively tracking their dietary intake, exacerbating their susceptibility to nutrition-related health complications.
no code implementations • 12 May 2024 • Aaryam Sharma, Chris Czarnecki, Yuhao Chen, Pengcheng Xi, Linlin Xu, Alexander Wong
Monitoring dietary intake is a crucial aspect of promoting healthy living.
no code implementations • 12 May 2024 • Akil Pathiranage, Chris Czarnecki, Yuhao Chen, Pengcheng Xi, Linlin Xu, Alexander Wong
Ellipse estimation is an important topic in food image processing because it can be leveraged to parameterize plates and bowls, which in turn can be used to estimate camera view angles and food portion sizes.
no code implementations • 2 May 2024 • Yuxiang Huang, Yuhao Chen, John Zelek
Detecting and segmenting moving objects from a moving monocular camera is challenging in the presence of unknown camera motion, diverse object motions and complex scene structures.
no code implementations • 7 Apr 2024 • Yuanfeng Xu, Yuhao Chen, Zhongzhan Huang, Zijian He, Guangrun Wang, Philip Torr, Liang Lin
In this paper, we present AnimateZoo, a zero-shot diffusion-based video generator to address this challenging cross-species animation issue, aiming to accurately produce animal animations while preserving the background.
no code implementations • 17 Mar 2024 • Bavesh Balaji, Jerrin Bright, Sirisha Rambhatla, Yuhao Chen, Alexander Wong, John Zelek, David A Clausi
We further introduce a new spatio-temporal network leveraging our novel d-MAE for unique player identification.
no code implementations • 14 Mar 2024 • Jerrin Bright, Bavesh Balaji, Harish Prakash, Yuhao Chen, David A Clausi, John Zelek
Precise Human Mesh Recovery (HMR) with in-the-wild data is a formidable challenge and is often hindered by depth ambiguities and reduced precision.
no code implementations • 5 Feb 2024 • Dayou Mao, Yuhao Chen, Yifan Wu, Maximilian Gilles, Alexander Wong
One of the main motivations of MTL is to develop neural networks capable of inferring multiple tasks simultaneously.
no code implementations • 22 Dec 2023 • Yuhao Chen, Chloe Wong, Hanwen Yang, Juan Aguenza, Sai Bhujangari, Benthan Vu, Xun Lei, Amisha Prasad, Manny Fluss, Eric Phuong, Minghao Liu, Raja Kumar, Vanshika Vats, James Davis
This study critically evaluates the efficacy of prompting methods in enhancing the mathematical reasoning capability of large language models (LLMs).
no code implementations • 11 Dec 2023 • Saeejith Nair, Chi-en Amy Tai, Yuhao Chen, Alexander Wong
As the largest open-source synthetic food dataset, NV-Synth highlights the value of physics-based simulations for enabling scalable and controllable generation of diverse photorealistic meal images to overcome data limitations and drive advancements in automated dietary assessment using computer vision.
no code implementations • 6 Dec 2023 • Olivia Markham, Yuhao Chen, Chi-en Amy Tai, Alexander Wong
To address these limitations, we introduce FoodFusion, a Latent Diffusion model engineered specifically for the faithful synthesis of realistic food images from textual descriptions.
no code implementations • 30 Nov 2023 • Aditya Sridhar, Chi-en Amy Tai, Hayden Gunraj, Yuhao Chen, Alexander Wong
In Canada, prostate cancer is the most common form of cancer in men and accounted for 20% of new cancer cases for this demographic in 2022.
no code implementations • 29 Nov 2023 • Shen Zhang, Zhaowei Chen, Zhenyu Zhao, Yuhao Chen, Yao Tang, Jiajun Liang
Extensive experiments demonstrate that our approach can address object duplication and heavy computation issues, achieving state-of-the-art performance on higher-resolution image synthesis tasks.
no code implementations • 22 Nov 2023 • Yuhao Chen, Yuxuan Yan, Qianqian Yang, Yuanchao Shu, Shibo He, Jiming Chen
Transformer-based large language models (LLMs) have demonstrated impressive capabilities in a variety of natural language processing (NLP) tasks.
no code implementations • 20 Nov 2023 • Chi-en Amy Tai, Saeejith Nair, Olivia Markham, Matthew Keller, Yifan Wu, Yuhao Chen, Alexander Wong
Dietary intake estimation plays a crucial role in understanding the nutritional habits of individuals and populations, aiding in the prevention and management of diet-related health issues.
no code implementations • 10 Nov 2023 • Yuhao Chen, Yuxuan Yan, Qianqian Yang, Yuanchao Shu, Shibo He, Zhiguo Shi, Jiming Chen
Moreover, we propose a bit-level computation-efficient data compression scheme to compress the data to be transmitted between devices during training.
no code implementations • 25 Sep 2023 • Saeejith Nair, Yuhao Chen, Mohammad Javad Shafiee, Alexander Wong
Thus, there is a need to dynamically optimize the neural network component of NeRFs to achieve a balance between computational complexity and specific targets for synthesis quality.
no code implementations • 14 Sep 2023 • Chi-en Amy Tai, Matthew Keller, Saeejith Nair, Yuhao Chen, Yifan Wu, Olivia Markham, Krish Parmar, Pengcheng Xi, Heather Keller, Sharon Kirkpatrick, Alexander Wong
Recent work has focused on using computer vision and machine learning to automatically estimate dietary intake from food images, but the lack of comprehensive datasets with diverse viewpoints, modalities and food annotations hinders the accuracy and realism of such methods.
no code implementations • 12 Sep 2023 • Bavesh Balaji, Jerrin Bright, Harish Prakash, Yuhao Chen, David A Clausi, John Zelek
To address these issues, we propose a robust keyframe identification module that extracts frames containing essential high-level information about the jersey number.
no code implementations • 2 Sep 2023 • Jerrin Bright, Yuhao Chen, John Zelek
The findings highlight the effectiveness of our method in mitigating the challenges posed by motion blur, thereby enhancing the overall quality of pose estimation.
no code implementations • 8 Aug 2023 • Yuhao Chen, Qianqian Yang, Zhiguo Shi, Jiming Chen
In recent years, semantic communication has been a popular research topic for its superiority in communication efficiency.
no code implementations • 15 Jun 2023 • Grant Sinha, Krish Parmar, Hilda Azimi, Amy Tai, Yuhao Chen, Alexander Wong, Pengcheng Xi
To address these issues, two models are trained and compared, one based on convolutional neural networks and the other on Bidirectional Encoder representation for Image Transformers (BEiT).
no code implementations • 5 Jun 2023 • Weixuan Chen, Yuhao Chen, Qianqian Yang, Chongwen Huang, Qian Wang, Zhaoyang Zhang
Adaptive rate control for deep joint source and channel coding (JSCC) is considered an effective approach to transmit sufficient information in scenarios with limited communication resources.
no code implementations • 21 Apr 2023 • Alexander Wong, Yifan Wu, Saad Abbasi, Saeejith Nair, Yuhao Chen, Mohammad Javad Shafiee
As such, the design of highly efficient multi-task deep neural network architectures tailored for computer vision tasks for robotic grasping on the edge is highly desired for widespread adoption in manufacturing environments.
no code implementations • 12 Apr 2023 • Chi-en Amy Tai, Matthew Keller, Mattie Kerrigan, Yuhao Chen, Saeejith Nair, Pengcheng Xi, Alexander Wong
Unlike existing datasets, a collection of 3D models with nutritional information allows for view synthesis to create an infinite number of 2D images for any given viewpoint/camera angle along with the associated nutritional information.
no code implementations • 12 Apr 2023 • Chi-en Amy Tai, Jason Li, Sriram Kumar, Saeejith Nair, Yuhao Chen, Pengcheng Xi, Alexander Wong
With the growth in capabilities of generative models, there has been growing interest in using photo-realistic renders of common 3D food items to improve downstream tasks such as food printing, nutrition prediction, or management of food wastage.
no code implementations • 10 Apr 2023 • E. Zhixuan Zeng, Yuhao Chen, Alexander Wong
To address these challenges, this paper proposes ShapeShift, a superquadric-based framework for object pose estimation that predicts the object's pose relative to a primitive shape which is fitted to the object.
1 code implementation • CVPR 2023 • Yuhao Chen, Xin Tan, Borui Zhao, Zhaowei Chen, RenJie Song, Jiajun Liang, Xuequan Lu
ANL introduces the additional negative pseudo-label for all unlabeled data to leverage low-confidence examples.
no code implementations • 19 Oct 2022 • Yuhao Chen, Hayden Gunraj, E. Zhixuan Zeng, Robbie Meyer, Maximilian Gilles, Alexander Wong
We also demonstrate that our MC score is a more reliable indicator for outputs during inference time compared to the model-generated confidence scores that are often over-confident.
no code implementations • 8 Aug 2022 • Maximilian Gilles, Yuhao Chen, Tim Robin Winter, E. Zhixuan Zeng, Alexander Wong
Autonomous bin picking poses significant challenges to vision-driven robotic systems given the complexity of the problem, ranging from various sensor modalities, to highly entangled object layouts, to diverse item properties and gripper types.
no code implementations • 21 May 2022 • Mingyao Cui, Zidong Wu, Yuhao Chen, Shenheng Xu, Fan Yang, Linglong Dai
By jointly designing the hardware and software, this prototype can realize real-time 4K video transmission with much reduced power consumption.
1 code implementation • 29 Dec 2021 • Yuhao Chen, E. Zhixuan Zeng, Maximilian Gilles, Alexander Wong
We also propose a new layout-weighted performance metric alongside the dataset for evaluating object detection and segmentation performance in a manner that is more appropriate for robotic grasp applications compared to existing general-purpose performance metrics.
no code implementations • 6 Oct 2021 • Yuhao Chen, Qianqian Yang, Shibo He, Zhiguo Shi, Jiming Chen
Our numerical results demonstrate that FTPipeHD is 6.8x faster in training than the state-of-the-art method when the computing capacity of the best device is 10x greater than the worst one.
no code implementations • 26 May 2021 • Guoqing Zhang, Yuhao Chen, Weisi Lin, Arun Chandran, Xuan Jing
As a prevailing task in the fields of video surveillance and forensics, person re-identification (re-ID) aims to match person images captured from non-overlapping cameras.
1 code implementation • 25 May 2021 • Yuhao Chen, Guoqing Zhang, Yujiang Lu, Zhenxing Wang, Yuhui Zheng, Ruili Wang
Text-based person search is a sub-task in the field of image retrieval, which aims to retrieve target person images according to a given textual description.
Ranked #10 on Text based Person Retrieval on CUHK-PEDES
no code implementations • 21 Mar 2021 • Guoqing Zhang, Yuhao Chen, Yang Dai, Yuhui Zheng, Yi Wu
Due to inaccurate person detections and pose changes, pedestrian misalignment significantly increases the difficulty of feature extraction and matching.
no code implementations • 10 Jul 2020 • Yuhao Chen, Yifan Wu, Linlin Xu, Alexander Wong
In this paper, we leverage the performance of CNNs and propose a module that uses prior knowledge of building corners to create angular and concise building polygons from CNN segmentation outputs.
no code implementations • 24 Jan 2020 • Changye Yang, Sriram Baireddy, Yuhao Chen, Enyu Cai, Denise Caldwell, Valérian Méline, Anjali S. Iyer-Pascuzzi, Edward J. Delp
Analysis of the shape of plants can potentially be used to accurately quantify the degree of wilting.
no code implementations • 20 Dec 2019 • Kennedy Ralston, Yuhao Chen, Haruna Isah, Farhana Zulkernine
The chatbot could also be adapted for use in other application areas such as student info-centers, government kiosks, and mental health support systems.
no code implementations • 2 Jul 2018 • Javier Ribera, Fangning He, Yuhao Chen, Ayman F. Habib, Edward J. Delp
Use of imagery is becoming popular for phenotyping.
6 code implementations • CVPR 2019 • Javier Ribera, David Güera, Yuhao Chen, Edward J. Delp
In these networks, the training procedure usually requires providing bounding boxes or the maximum number of expected objects.
Ranked #1 on Object Localization on Mall