1 code implementation • 13 Sep 2024 • Yaojie Shen, Xinyao Wang, Yulei Niu, Ying Zhou, Lexin Tang, Libo Zhang, Fan Chen, Longyin Wen
Despite its success, our study shows that the length exploitation issue present in PO is even more severe in Iterative Preference Optimization (IPO) due to the iterative nature of the process.
1 code implementation • 15 Jun 2024 • Lu Xu, Sijie Zhu, Chunyuan Li, Chia-Wen Kuo, Fan Chen, Xinyao Wang, Guang Chen, Dawei Du, Ye Yuan, Longyin Wen
However, a large portion of videos in real-world applications are edited videos, e.g., users usually cut and add effects/modifications to the raw video before publishing it on social media platforms.
1 code implementation • 9 May 2024 • Jiachen Li, Xinyao Wang, Sijie Zhu, Chia-Wen Kuo, Lu Xu, Fan Chen, Jitesh Jain, Humphrey Shi, Longyin Wen
Recent advancements in Multimodal Large Language Models (LLMs) have focused primarily on scaling by increasing text-image pair data and enhancing LLMs to improve performance on multimodal tasks.
Ranked #1 on visual instruction following on LLaVA-Bench
no code implementations • 6 May 2024 • Chenhui Xu, Xinyao Wang, Fuxun Yu, JinJun Xiong, Xiang Chen
Machine learning is evolving towards high-order models that necessitate pre-training on extensive datasets, a process associated with significant overheads.
no code implementations • 3 May 2023 • Samin Batra, Varun Sharma, Yurou Sun, Xinyao Wang, Yinyu Wang
The final output of the project is a recommendation system that gives a list of the top 5 items that the users will possibly like.
1 code implementation • 25 Jun 2022 • Dexiang Hong, Xiaoqi Ma, Xinyao Wang, CongCong Li, YuFei Wang, Longyin Wen
This report presents the algorithm used in the submission to the Generic Event Boundary Detection (GEBD) Challenge at CVPR 2022.
no code implementations • 7 Jun 2022 • CongCong Li, Xinyao Wang, Dexiang Hong, YuFei Wang, Libo Zhang, Tiejian Luo, Longyin Wen
To capture the temporal context information of each frame, we design the structure context transformer (SC-Transformer) by re-partitioning the input frame sequence.
no code implementations • CVPR 2022 • CongCong Li, Xinyao Wang, Longyin Wen, Dexiang Hong, Tiejian Luo, Libo Zhang
Generic event boundary detection aims to localize the generic, taxonomy-free event boundaries that segment videos into chunks.
1 code implementation • 1 Jul 2021 • Dexiang Hong, CongCong Li, Longyin Wen, Xinyao Wang, Libo Zhang
In this work, we design a Cascaded Temporal Attention Network (CASTANET) for GEBD, which is formed by three parts: the backbone network, the temporal attention module, and the classification module.
Ranked #1 on Boundary Detection on Kinetics-400
no code implementations • IEEE International Conference on Acoustics, Speech and Signal Processing 2021 • Ning Lv, Xuezhi Xiang, Xinyao Wang, Jie Yang, Rokia Abdeen, Abdulmotaleb El Saddik
To handle the divergence between detection and re-identification, we introduce switchable normalization and gradient centralization to improve the stability of the model.
1 code implementation • International Conference on Computer Vision Workshops 2019 • Dawei Du, Pengfei Zhu, Longyin Wen, Xiao Bian, Haibin Lin, QinGhua Hu, Tao Peng, Jiayu Zheng, Xinyao Wang, Yue Zhang, Liefeng Bo, Hailin Shi, Rui Zhu, Aashish Kumar, Aijin Li, Almaz Zinollayev, Anuar Askergaliyev, Arne Schumann, Binjie Mao, Byeongwon Lee, Chang Liu, Changrui Chen, Chunhong Pan, Chunlei Huo, Da Yu, Dechun Cong, Dening Zeng, Dheeraj Reddy Pailla, Di Li, Dong Wang, Donghyeon Cho, Dongyu Zhang, Furui Bai, George Jose, Guangyu Gao, Guizhong Liu, Haitao Xiong, Hao Qi, Haoran Wang, Heqian Qiu, Hongliang Li, Huchuan Lu, Ildoo Kim, Jaekyum Kim, Jane Shen, Jihoon Lee, Jing Ge, Jingjing Xu, Jingkai Zhou, Jonas Meier, Jun Won Choi, Junhao Hu, Junyi Zhang, Junying Huang, Kaiqi Huang, Keyang Wang, Lars Sommer, Lei Jin, Lei Zhang
Results of 33 object detection algorithms are presented.
7 code implementations • ICCV 2019 • Xinyao Wang, Liefeng Bo, Li Fuxin
Then we propose a novel loss function, named Adaptive Wing loss, that is able to adapt its shape to different types of ground truth heatmap pixels.
Ranked #8 on Face Alignment on WFW (Extra Data)
1 code implementation • 27 Jul 2018 • Christopher Z. Eddy, Xinyao Wang, Fuxin Li, Bo Sun
As a result, cell morphodynamics is mapped into temporal evolution of morphological phenotypes.