1 code implementation • 10 Jul 2024 • Daizong Liu, Mingyu Yang, Xiaoye Qu, Pan Zhou, Yu Cheng, Wei Hu
Compared to traditional Large Language Models (LLMs), LVLMs present great potential and challenges due to its closer proximity to the multi-resource real-world applications and the complexity of multi-modal processing.
1 code implementation • 27 Apr 2024 • Mingyu Yang, Bowen Liu, Boyang Wang, Hun-Seok Kim
In the following diffusion step, DiffJSCC uses the derived multimodal features, together with channel state information such as the signal-to-noise ratio (SNR), as conditions to guide the denoising diffusion process, which converts the initial random noise to the final reconstruction.
1 code implementation • 14 Aug 2023 • Kejia Zhang, Mingyu Yang, Stephen D. J. Lang, Alistair M. McInnes, Richard B. Sherley, Tilo Burghardt
In this paper, we publish an animal-borne underwater video dataset of penguins and introduce a ready-to-deploy deep learning system capable of robustly detecting penguins (mAP50@98. 0%) and also instances of fish (mAP50@73. 3%).
no code implementations • 4 Jul 2023 • Sara Shoouri, Mingyu Yang, Gordy Carichner, Yuyang Li, Ehab A. Hamed, Angela Deng, Delbert A. Green II, Inhee Lee, David Blaauw, Hun-Seok Kim
A new GPS-less, daily localization method is proposed with deep learning sensor fusion that uses daylight intensity and temperature sensor data for Monarch butterfly tracking.
1 code implementation • ICCV 2023 • Sara Shoouri, Mingyu Yang, Zichen Fan, Hun-Seok Kim
Solving multiple visual tasks using individual models can be resource-intensive, while multi-task learning can conserve resources by sharing knowledge across different tasks.
2 code implementations • 17 Jun 2022 • Wentao Zhang, Zeang Sheng, Mingyu Yang, Yang Li, Yu Shen, Zhi Yang, Bin Cui
First, GNNs can learn higher-order structural information by stacking more layers but can not deal with large depth due to the over-smoothing issue.
1 code implementation • 12 May 2022 • Mingyu Yang, Yu Chen, Hun-Seok Kim
In recent years, deep learning-based approaches for visual-inertial odometry (VIO) have shown remarkable performance outperforming traditional geometric methods.
no code implementations • 7 May 2022 • Zheng Chen, Jian Zhao, Mingyu Yang, Wengang Zhou, Houqiang Li
In this work, we are dedicated to multi-target active object tracking (AOT), where there are multiple targets as well as multiple cameras in the environment.
no code implementations • 5 May 2022 • Mingyu Yang, Jian Zhao, Xunhan Hu, Wengang Zhou, Jiangcheng Zhu, Houqiang Li
In this way, agents dealing with the same subtask share their learning of specific abilities and different subtasks correspond to different specific abilities.
Multi-agent Reinforcement Learning reinforcement-learning +3
1 code implementation • 16 Mar 2022 • Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li
To the best of our knowledge, this work is the first to study the unexpected crashes in the multi-agent system.
Multi-agent Reinforcement Learning reinforcement-learning +3
1 code implementation • 16 Mar 2022 • Jian Zhao, Xunhan Hu, Mingyu Yang, Wengang Zhou, Jiangcheng Zhu, Houqiang Li
In this way, CTDS balances the full utilization of global observation during training and the feasibility of decentralized execution for online inference.
Multi-agent Reinforcement Learning reinforcement-learning +3
1 code implementation • 22 Feb 2022 • Zeyu Fang, Jian Zhao, Mingyu Yang, Wengang Zhou, Zhenbo Lu, Houqiang Li
In our approach, we regard each camera as an agent and address AMOT with a multi-agent reinforcement learning solution.
1 code implementation • 21 Feb 2022 • Jian Zhao, Mingyu Yang, Youpeng Zhao, Xunhan Hu, Wengang Zhou, Jiangcheng Zhu, Houqiang Li
Specifically, we model both individual Q-values and global Q-value with categorical distribution.
1 code implementation • NeurIPS 2021 • Wentao Zhang, Mingyu Yang, Zeang Sheng, Yang Li, Wen Ouyang, Yangyu Tao, Zhi Yang, Bin Cui
Recent works reveal that feature or label smoothing lies at the core of Graph Neural Networks (GNNs).
1 code implementation • 9 Oct 2021 • Mingyu Yang, Hun-Seok Kim
To the best of our knowledge, this is the first deep JSCC scheme that can automatically adjust its rate using a single network model.
1 code implementation • 11 Sep 2021 • Mingyu Yang, Chenghong Bian, Hun-Seok Kim
We investigate joint source channel coding (JSCC) for wireless image transmission over multipath fading channels.
2 code implementations • 5 Jan 2021 • Mingyu Yang, Chenghong Bian, Hun-Seok Kim
We present a deep learning based joint source channel coding (JSCC) scheme for wireless image transmission over multipath fading channels with non-linear signal clipping.
no code implementations • 14 Dec 2019 • Mingyu Yang, Roger Hsiao, Gordy Carichner, Katherine Ernst, Jaechan Lim, Delbert A. Green II, Inhee Lee, David Blaauw, Hun-Seok Kim
Details of Monarch butterfly migration from the U. S. to Mexico remain a mystery due to lack of a proper localization technology to accurately localize and track butterfly migration.