Search Results for author: Jianqing Zhu

Found 17 papers, 6 papers with code

Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion

no code implementations16 Dec 2024 Jianqing Zhu, Huang Huang, Zhihang Lin, Juhao Liang, Zhengyang Tang, Khalid Almubarak, Abdulmohsen Alharthik, Bang An, Juncai He, Xiangbo Wu, Fei Yu, Junying Chen, Zhuoheng Ma, Yuhao Du, He Zhang, Emad A. Alghamdi, Lian Zhang, Ruoyu Sun, Haizhou Li, Benyou Wang, Jinchao Xu

This paper addresses the critical need for democratizing large language models (LLM) in the Arab world, a region that has seen slower progress in developing models comparable to state-of-the-art offerings like GPT-4 or ChatGPT 3. 5, due to a predominant focus on mainstream languages (e. g., English and Chinese).

Alignment at Pre-training! Towards Native Alignment for Arabic LLMs

1 code implementation4 Dec 2024 Juhao Liang, Zhenyang Cai, Jianqing Zhu, Huang Huang, Kewei Zong, Bang An, Mosen Alharthi, Juncai He, Lian Zhang, Haizhou Li, Benyou Wang, Jinchao Xu

The alignment of large language models (LLMs) is critical for developing effective and safe language models.

AceGPT, Localizing Large Language Models in Arabic

1 code implementation21 Sep 2023 Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Dingjie Song, Zhihong Chen, Abdulmohsen Alharthi, Bang An, Juncai He, Ziche Liu, Zhiyi Zhang, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu

This paper is devoted to the development of a localized Large Language Model (LLM) specifically for Arabic, a language imbued with unique cultural characteristics inadequately addressed by current mainstream models.

Instruction Following Language Modeling +3

Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval

no code implementations CVPR 2023 Yi Xie, Huaidong Zhang, Xuemiao Xu, Jianqing Zhu, Shengfeng He

Specifically, the employed student model is initially a heavy model to fruitfully learn distilled knowledge in the early training epochs, and the student model is gradually compressed during the training.

Image Retrieval Knowledge Distillation +1

FV-MgNet: Fully Connected V-cycle MgNet for Interpretable Time Series Forecasting

no code implementations2 Feb 2023 Jianqing Zhu, Juncai He, Lian Zhang, Jinchao Xu

By investigating iterative methods for a constrained linear model, we propose a new class of fully connected V-cycle MgNet for long-term time series forecasting, which is one of the most difficult tasks in forecasting.

Image Classification Time Series +1

An Enhanced V-cycle MgNet Model for Operator Learning in Numerical Partial Differential Equations

no code implementations2 Feb 2023 Jianqing Zhu, Juncai He, Qiumei Huang

This study used a multigrid-based convolutional neural network architecture known as MgNet in operator learning to solve numerical partial differential equations (PDEs).

Operator learning

Learning Correction Filter via Degradation-Adaptive Regression for Blind Single Image Super-Resolution

1 code implementation ICCV 2023 Hongyang Zhou, Xiaobin Zhu, Jianqing Zhu, Zheng Han, Shi-Xue Zhang, Jingyan Qin, Xu-Cheng Yin

Instead of assuming degradation are spatially invariant across the whole image, we learn correction filters to adjust degradations to known degradations in a spatially variant way by a novel linearly-assembled pixel degradation-adaptive regression module (DARM).

Image Super-Resolution regression

TreeDRNet:A Robust Deep Model for Long Term Time Series Forecasting

no code implementations24 Jun 2022 Tian Zhou, Jianqing Zhu, Xue Wang, Ziqing Ma, Qingsong Wen, Liang Sun, Rong Jin

Various deep learning models, especially some latest Transformer-based approaches, have greatly improved the state-of-art performance for long-term time series forecasting. However, those transformer-based models suffer a severe deterioration performance with prolonged input length, which prohibits them from using extended historical info. Moreover, these methods tend to handle complex examples in long-term forecasting with increased model complexity, which often leads to a significant increase in computation and less robustness in performance(e. g., overfitting).

Computational Efficiency feature selection +2

A Challenging Benchmark of Anime Style Recognition

1 code implementation29 Apr 2022 Haotang Li, Shengtao Guo, Kailin Lyu, Xiao Yang, Tianchen Chen, Jianqing Zhu, Huanqiang Zeng

Given two images of different anime roles, anime style recognition (ASR) aims to learn abstract painting style to determine whether the two images are from the same work, which is an interesting but challenging problem.

Art Analysis Face Recognition +1

An Interpretive Constrained Linear Model for ResNet and MgNet

no code implementations14 Dec 2021 Juncai He, Jinchao Xu, Lian Zhang, Jianqing Zhu

We propose a constrained linear data-feature-mapping model as an interpretable mathematical model for image classification using a convolutional neural network (CNN).

Image Classification

GiT: Graph Interactive Transformer for Vehicle Re-identification

no code implementations12 Jul 2021 Fei Shen, Yi Xie, Jianqing Zhu, Xiaobin Zhu, Huanqiang Zeng

In the macro view, a list of GiT blocks are stacked to build a vehicle re-identification model, in where graphs are to extract discriminative local features within patches and transformers are to extract robust global features among patches.

Person Re-Identification Vehicle Re-Identification

Exploring Spatial Significance via Hybrid Pyramidal Graph Network for Vehicle Re-identification

1 code implementation29 May 2020 Fei Shen, Jianqing Zhu, Xiaobin Zhu, Yi Xie, Jingchang Huang

Secondly, a novel pyramidal graph network (PGN) is designed to comprehensively explore the spatial significance of feature maps at multiple scales.

Vehicle Re-Identification

Vehicle Re-identification Using Quadruple Directional Deep Learning Features

no code implementations13 Nov 2018 Jianqing Zhu, Huanqiang Zeng, Jingchang Huang, Shengcai Liao, Zhen Lei, Canhui Cai, Lixin Zheng

Specifically, the same basic deep learning architecture is a shortly and densely connected convolutional neural network to extract basic feature maps of an input square vehicle image in the first stage.

Deep Learning Vehicle Re-Identification

Deep Hybrid Similarity Learning for Person Re-identification

no code implementations16 Feb 2017 Jianqing Zhu, Huanqiang Zeng, Shengcai Liao, Zhen Lei, Canhui Cai, Lixin Zheng

In this paper, a deep hybrid similarity learning (DHSL) method for person Re-ID based on a convolution neural network (CNN) is proposed.

Metric Learning Person Re-Identification

Open-set Person Re-identification

no code implementations5 Aug 2014 Shengcai Liao, Zhipeng Mo, Jianqing Zhu, Yang Hu, Stan Z. Li

Person re-identification is becoming a hot research for developing both machine learning algorithms and video surveillance applications.

Metric Learning Person Re-Identification

Cannot find the paper you are looking for? You can Submit a new open access paper.