Search Results for author: Da Pan

Found 11 papers, 7 papers with code

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

1 code implementation • 24 Feb 2025 • Tianpeng Li, Jun Liu, Tao Zhang, Yuanbo Fang, Da Pan, Mingrui Wang, Zheng Liang, zehuan li, MingAn Lin, Guosheng Dong, Jianhua Xu, Haoze Sun, Zenan Zhou, WeiPeng Chen

To mitigate the loss of intelligence during pre-training and preserve the original capabilities of the LLM, we propose a two-stage pre-training strategy that maintains language understanding while enhancing audio modeling.

Language Modeling +2

Comprehensive Subjective and Objective Evaluation Method for Text-generated Video

no code implementations • 15 Jan 2025 • Zelu Qi, Ping Shi, Shuqi Wang, Zhaoyang Zhang, Zefeng Ying, Da Pan

Recent text-to-video (T2V) technology advancements, as demonstrated by models such as Gen3, Pika, and Sora, have significantly broadened its applicability and popularity.

Video Generation

VersaTune: An Efficient Data Composition Framework for Training Multi-Capability LLMs

1 code implementation • 18 Nov 2024 • Keer Lu, Keshi Zhao, Zheng Liang, Da Pan, Shusen Zhang, Xin Wu, WeiPeng Chen, Zenan Zhou, Guosheng Dong, Bin Cui, Wentao Zhang

Despite their potential, existing work mainly focuses on domain-specific enhancements during fine-tuning, the challenge of which lies in catastrophic forgetting of knowledge across other domains.

Baichuan-Omni Technical Report

2 code implementations • 11 Oct 2024 • Yadong Li, Haoze Sun, MingAn Lin, Tianpeng Li, Guosheng Dong, Bowen Ding, Wei Song, Zhenglin Cheng, Yuqi Huo, Song Chen, Xu Li, Da Pan, Shusen Zhang, Xin Wu, Zheng Liang, Jun Liu, Tao Zhang, Keer Lu, Yaqi Zhao, Yanjun Shen, Fan Yang, Kaicheng Yu, Tao Lin, Jianhua Xu, Zenan Zhou, WeiPeng Chen

The salient multimodal capabilities and interactive experience of GPT-4o highlight its critical role in practical applications, yet it lacks a high-performing open-source counterpart.

Language Modeling +3

DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective Partitioning

3 code implementations • 2 Sep 2024 • Keer Lu, Xiaonan Nie, Zheng Liang, Da Pan, Shusen Zhang, Keshi Zhao, WeiPeng Chen, Zenan Zhou, Guosheng Dong, Bin Cui, Wentao Zhang

Through extensive experimental analysis, we identified three key challenges in designing effective data management strategies that enable the model to achieve long-context capability without sacrificing performance in other tasks: (1) a shortage of long documents across multiple domains, (2) effective construction of context windows, and (3) efficient organization of large-scale datasets.

Code Completion, Combinatorial Optimization +5

Blind Predicting Similar Quality Map for Image Quality Assessment

no code implementations • CVPR 2018 • Da Pan, Ping Shi, Ming Hou, Zefeng Ying, Sizhe Fu, Yuan Zhang

A key problem in blind image quality assessment (BIQA) is how to effectively model the properties of human visual system in a data-driven manner.

Blind Image Quality Assessment, Full Reference Image Quality Assessment +1
