Search Results for author: Bo-Wen Zhang

Found 17 papers, 9 papers with code

ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning

no code implementations18 Dec 2024 Jie-Jing Shao, Xiao-Wen Yang, Bo-Wen Zhang, Baizhi Chen, Wen-Da Wei, Guohao Cai, Zhenhua Dong, Lan-Zhe Guo, Yu-Feng Li

Recent advances in LLMs, particularly in language reasoning and tool integration, have rapidly sparked the real-world development of Language Agents.

Predictable Emergent Abilities of LLMs: Proxy Tasks Are All You Need

no code implementations10 Dec 2024 Bo-Wen Zhang, Yan Yan, Boxiang Yang, Yifei Xue, Guang Liu

While scaling laws optimize training configurations for large language models (LLMs) through experiments on smaller or early-stage models, they fail to predict emergent abilities due to the absence of such capabilities in these models.

All

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

4 code implementations24 Oct 2024 Shuhao Gu, Jialing Zhang, Siyuan Zhou, Kevin Yu, Zhaohu Xing, Liangdong Wang, Zhou Cao, Jintao Jia, Zhuoyi Zhang, YiXuan Wang, Zhenchong Hu, Bo-Wen Zhang, Jijie Li, Dong Liang, Yingli Zhao, Songjing Wang, Yulong Ao, Yiming Ju, Huanhuan Ma, Xiaotong Li, Haiwen Diao, Yufeng Cui, Xinlong Wang, Yaoqi Liu, Fangxiang Feng, Guang Liu

Despite the availability of several open-source multimodal datasets, limitations in the scale and quality of open-source instruction data hinder the performance of VLMs trained on these datasets, leading to a significant gap compared to models trained on closed-source data.

Image Generation Question Generation +2

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

no code implementations24 Oct 2024 Liangdong Wang, Bo-Wen Zhang, ChengWei Wu, Hanyu Zhao, Xiaofeng Shi, Shuhao Gu, Jijie Li, Quanyue Ma, Tengfei Pan, Guang Liu

We present CCI3. 0-HQ (https://huggingface. co/datasets/BAAI/CCI3-HQ), a high-quality 500GB subset of the Chinese Corpora Internet 3. 0 (CCI3. 0)(https://huggingface. co/datasets/BAAI/CCI3-Data), developed using a novel two-stage hybrid filtering pipeline that significantly enhances data quality.

Aquila2 Technical Report

2 code implementations14 Aug 2024 Bo-Wen Zhang, Liangdong Wang, Jijie Li, Shuhao Gu, Xinya Wu, Zhengduo Zhang, Boyan Gao, Yulong Ao, Guang Liu

This paper introduces the Aquila2 series, which comprises a wide range of bilingual models with parameter sizes of 7, 34, and 70 billion.

Management

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

1 code implementation13 Aug 2024 Bo-Wen Zhang, Liangdong Wang, Ye Yuan, Jijie Li, Shuhao Gu, Mengdi Zhao, Xinya Wu, Guang Liu, ChengWei Wu, Hanyu Zhao, Li Du, Yiming Ju, Quanyue Ma, Yulong Ao, Yingli Zhao, Songhe Zhu, Zhou Cao, Dong Liang, Yonghua Lin, Ming Zhang, Shunfei Wang, Yanxin Zhou, Min Ye, Xuekai Chen, Xinyang Yu, Xiangjun Huang, Jian Yang

In this paper, we present AquilaMoE, a cutting-edge bilingual 8*16B Mixture of Experts (MoE) language model that has 8 experts with 16 billion parameters each and is developed using an innovative training methodology called EfficientScale.

Language Modelling Transfer Learning

TACO: Topics in Algorithmic COde generation dataset

1 code implementation22 Dec 2023 Rongao Li, Jie Fu, Bo-Wen Zhang, Tao Huang, Zhihong Sun, Chen Lyu, Guang Liu, Zhi Jin, Ge Li

Moreover, each TACO problem includes several fine-grained labels such as task topics, algorithms, programming skills, and difficulty levels, providing a more precise reference for the training and evaluation of code generation models.

Code Generation

CoIE: Chain-of-Instruct Editing for Multi-Attribute Face Manipulation

no code implementations13 Dec 2023 Zhenduo Zhang, Bo-Wen Zhang, Guang Liu

Current text-to-image editing models often encounter challenges with smoothly manipulating multiple attributes using a single instruction.

Attribute Image Manipulation +3

Gaze Preserving CycleGANs for Eyeglass Removal & Persistent Gaze Estimation

1 code implementation6 Feb 2020 Akshay Rangesh, Bo-Wen Zhang, Mohan M. Trivedi

GPCycleGAN is based on the well-known CycleGAN approach - with the addition of a gaze classifier and a gaze consistency loss for additional supervision.

Gaze Estimation Image-to-Image Translation

Effectiveness of random deep feature selection for securing image manipulation detectors against adversarial examples

1 code implementation25 Oct 2019 Mauro Barni, Ehsan Nowroozi, Benedetta Tondi, Bo-Wen Zhang

We investigate if the random feature selection approach proposed in [1] to improve the robustness of forensic detectors to targeted attacks, can be extended to detectors based on deep learning features.

feature selection Image Manipulation +1

Effective Domain Knowledge Transfer with Soft Fine-tuning

no code implementations5 Sep 2019 Zhichen Zhao, Bo-Wen Zhang, Yuning Jiang, Li Xu, Lei LI, Wei-Ying Ma

However, the datasets from source domain are simply discarded in the fine-tuning process.

Transfer Learning

Topic Augmented Generator for Abstractive Summarization

no code implementations19 Aug 2019 Melissa Ailem, Bo-Wen Zhang, Fei Sha

In this paper, we propose a new decoder where the output summary is generated by conditioning on both the input text and the latent topics of the document.

Abstractive Text Summarization Decoder

A Probabilistic Model for Joint Learning of Word Embeddings from Texts and Images

no code implementations EMNLP 2018 Melissa Ailem, Bo-Wen Zhang, Aurelien Bellet, Pascal Denis, Fei Sha

Our approach learns textual and visual representations jointly: latent visual factors couple together a skip-gram model for co-occurrence in linguistic data and a generative latent variable model for visual data.

Coreference Resolution Image Classification +5

A Multi-strategy Query Processing Approach for Biomedical Question Answering: USTB\_PRIR at BioASQ 2017 Task 5B

no code implementations WS 2017 Zan-Xia Jin, Bo-Wen Zhang, Fan Fang, Le-Le Zhang, Xu-Cheng Yin

This paper describes the participation of USTB{\_}PRIR team in the 2017 BioASQ 5B on question answering, including document retrieval, snippet retrieval, and concept retrieval task.

Information Retrieval Question Answering +1

Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition

1 code implementation1 Sep 2016 Zhe Wang, Li-Min Wang, Yali Wang, Bo-Wen Zhang, Yu Qiao

In this paper, we propose a hybrid representation, which leverages the discriminative capacity of CNNs and the simplicity of descriptor encoding schema for image recognition, with a focus on scene recognition.

Scene Recognition

CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016

1 code implementation2 Aug 2016 Yuanjun Xiong, Li-Min Wang, Zhe Wang, Bo-Wen Zhang, Hang Song, Wei Li, Dahua Lin, Yu Qiao, Luc van Gool, Xiaoou Tang

This paper presents the method that underlies our submission to the untrimmed video classification task of ActivityNet Challenge 2016.

General Classification Video Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.