Search Results for author: Xinyao Wang

Found 13 papers, 8 papers with code

AIPO: Improving Training Objective for Iterative Preference Optimization

1 code implementation • 13 Sep 2024 • Yaojie Shen, Xinyao Wang, Yulei Niu, Ying Zhou, Lexin Tang, Libo Zhang, Fan Chen, Longyin Wen

Despite its success, our study shows that the length exploitation issue present in PO is even more severe in Iterative Preference Optimization (IPO) due to the iterative nature of the process.

Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model

1 code implementation • 15 Jun 2024 • Lu Xu, Sijie Zhu, Chunyuan Li, Chia-Wen Kuo, Fan Chen, Xinyao Wang, Guang Chen, Dawei Du, Ye Yuan, Longyin Wen

However, a large portion of videos in real-world applications are edited videos, e.g., users usually cut and add effects/modifications to the raw video before publishing it on social media platforms.

Question Answering, Video Understanding, +1

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

1 code implementation • 9 May 2024 • Jiachen Li, Xinyao Wang, Sijie Zhu, Chia-Wen Kuo, Lu Xu, Fan Chen, Jitesh Jain, Humphrey Shi, Longyin Wen

Recent advancements in Multimodal Large Language Models (LLMs) have focused primarily on scaling by increasing text-image pair data and enhancing LLMs to improve performance on multimodal tasks.

Image Captioning, visual instruction following, +1

QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation

no code implementations • 6 May 2024 • Chenhui Xu, Xinyao Wang, Fuxun Yu, JinJun Xiong, Xiang Chen

Machine learning is evolving towards high-order models that necessitate pre-training on extensive datasets, a process associated with significant overheads.

Steam Recommendation System

no code implementations • 3 May 2023 • Samin Batra, Varun Sharma, Yurou Sun, Xinyao Wang, Yinyu Wang

The final output of the project is a recommendation system that gives a list of the top 5 items that the users will possibly like.

SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection

1 code implementation • 25 Jun 2022 • Dexiang Hong, Xiaoqi Ma, Xinyao Wang, CongCong Li, YuFei Wang, Longyin Wen

This report presents the algorithm used in the submission to the Generic Event Boundary Detection (GEBD) Challenge at CVPR 2022.

Boundary Detection, Decoder, +2

Structured Context Transformer for Generic Event Boundary Detection

no code implementations • 7 Jun 2022 • CongCong Li, Xinyao Wang, Dexiang Hong, YuFei Wang, Libo Zhang, Tiejian Luo, Longyin Wen

To capture the temporal context information of each frame, we design the structured context transformer (SC-Transformer) by re-partitioning the input frame sequence.

Boundary Detection, Generic Event Boundary Detection

Generic Event Boundary Detection Challenge at CVPR 2021 Technical Report: Cascaded Temporal Attention Network (CASTANET)

1 code implementation • 1 Jul 2021 • Dexiang Hong, CongCong Li, Longyin Wen, Xinyao Wang, Libo Zhang

In this work, we design a Cascaded Temporal Attention Network (CASTANET) for GEBD, which consists of three parts: the backbone network, the temporal attention module, and the classification module.

Boundary Detection, Generic Event Boundary Detection

Adaptive Wing Loss for Robust Face Alignment via Heatmap Regression

7 code implementations • ICCV 2019 • Xinyao Wang, Liefeng Bo, Li Fuxin

Then we propose a novel loss function, named Adaptive Wing loss, that is able to adapt its shape to different types of ground truth heatmap pixels.

Face Alignment, regression, +1

The morphodynamics of 3D migrating cancer cells

1 code implementation • 27 Jul 2018 • Christopher Z. Eddy, Xinyao Wang, Fuxin Li, Bo Sun

As a result, cell morphodynamics is mapped into temporal evolution of morphological phenotypes.
