Search Results for author: Yihao Chen

Found 21 papers, 10 papers with code

An Empirical Study of Challenges in Machine Learning Asset Management

1 code implementation • 25 Feb 2024 • Zhimin Zhao, Yihao Chen, Abdul Ali Bangash, Bram Adams, Ahmed E. Hassan

In machine learning (ML), efficient asset management, including ML models, datasets, algorithms, and tools, is vital for resource optimization, consistent performance, and a streamlined development lifecycle.

Asset Management

Paper
Code

ADCNet: a unified framework for predicting the activity of antibody-drug conjugates

1 code implementation • 17 Jan 2024 • Liye Chen, Biaoshun Li, Yihao Chen, Mujie Lin, Shipeng Zhang, Chenxin Li, Yu Pang, Ling Wang

Antibody-drug conjugate (ADC) has revolutionized the field of cancer treatment in the era of precision medicine due to their ability to precisely target cancer cells and release highly effective drug.

Activity Prediction Language Modelling +1

Paper
Code

TinySAM: Pushing the Envelope for Efficient Segment Anything Model

2 code implementations • 21 Dec 2023 • Han Shu, Wenshuo Li, Yehui Tang, Yiman Zhang, Yihao Chen, Houqiang Li, Yunhe Wang, Xinghao Chen

Extensive experiments on various zero-shot transfer tasks demonstrate the significantly advantageous performance of our TinySAM against counterpart methods.

Knowledge Distillation Quantization

355

Paper
Code

AcademicGPT: Empowering Academic Research

no code implementations • 21 Nov 2023 • Shufa Wei, Xiaolong Xu, Xianbiao Qi, Xi Yin, Jun Xia, Jingyi Ren, Peijun Tang, Yuxiang Zhong, Yihao Chen, Xiaoqin Ren, Yuxin Liang, Liankai Huang, Kai Xie, Weikang Gui, Wei Tan, Shuanglong Sun, Yongquan Hu, Qinxian Liu, Nanjin Li, Chihao Dai, Lihua Wang, Xiaohui Liu, Lei Zhang, Yutao Xie

Our training corpus mainly consists of academic papers, thesis, content from some academic domain, high-quality Chinese data and others.

General Knowledge Question Answering

Paper
Add Code

Generic and Robust Root Cause Localization for Multi-Dimensional Data in Online Service Systems

1 code implementation • 5 May 2023 • Zeyan Li, Junjie Chen, Yihao Chen, Chengyang Luo, Yiwei Zhao, Yongqian Sun, Kaixin Sui, Xiping Wang, Dapeng Liu, Xing Jin, Qi Wang, Dan Pei

Such attribute combinations are substantial clues to the underlying root causes and thus are called root causes of multidimensional data.

Attribute

Paper
Code

LipsFormer: Introducing Lipschitz Continuity to Vision Transformers

1 code implementation • 19 Apr 2023 • Xianbiao Qi, Jianan Wang, Yihao Chen, Yukai Shi, Lei Zhang

In contrast to previous practical tricks that address training instability by learning rate warmup, layer normalization, attention formulation, and weight initialization, we show that Lipschitz continuity is a more essential property to ensure training stability.

Paper
Code

DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training

1 code implementation • CVPR 2023 • Yihao Chen, Xianbiao Qi, Jianan Wang, Lei Zhang

In this way, we can reduce the GPU memory consumption of contrastive loss computation from $\bigO(B^2)$ to $\bigO(\frac{B^2}{N})$, where $B$ and $N$ are the batch size and the number of GPUs used for training.

Contrastive Learning

Paper
Code

Exploring Vision Transformers as Diffusion Learners

no code implementations • 28 Dec 2022 • He Cao, Jianan Wang, Tianhe Ren, Xianbiao Qi, Yihao Chen, Yuan YAO, Lei Zhang

We further provide a hypothesis on the implication of disentangling the generative backbone as an encoder-decoder structure and show proof-of-concept experiments verifying the effectiveness of a stronger encoder for generative tasks with ASymmetriC ENcoder Decoder (ASCEND).

Paper
Add Code

The SpeakIn Speaker Verification System for Far-Field Speaker Verification Challenge 2022

no code implementations • 23 Sep 2022 • Yu Zheng, Jinghan Peng, Yihao Chen, Yajun Zhang, Jialong Wang, Min Liu, Minqiang Xu

In the pre-training stage we reserve the speaker weights, and there are no positive samples to train them in this stage.

Speaker Verification Task 2 +1

Paper
Add Code

The SpeakIn System Description for CNSRC2022

no code implementations • 22 Sep 2022 • Yu Zheng, Yihao Chen, Jinghan Peng, Yajun Zhang, Min Liu, Minqiang Xu

In the SV task fixed track, our system was a fusion of five models, and two models were fused in the SV task open track.

Retrieval Speaker Recognition +1

Paper
Add Code

VDDB: a comprehensive resource and machine learning platform for antiviral drug discovery

no code implementations • 17 Sep 2022 • Shunming Tao, Yihao Chen, Jingxing Wu, Duancheng Zhao, Hanxuan Cai, Ling Wang

Virus infection is one of the major diseases that seriously threaten human health.

Activity Prediction Drug Discovery

Paper
Add Code

3D Shuffle-Mixer: An Efficient Context-Aware Vision Learner of Transformer-MLP Paradigm for Dense Prediction in Medical Volume

no code implementations • 14 Apr 2022 • Jianye Pang, Cheng Jiang, Yihao Chen, Jianbo Chang, Ming Feng, Renzhi Wang, Jianhua Yao

Therefore, designing an elegant and efficient vision transformer learner for dense prediction in medical volume is promising and challenging.

Inductive Bias

Paper
Add Code

1st Place Solution for ICDAR 2021 Competition on Mathematical Formula Detection

1 code implementation • 12 Jul 2021 • Yuxiang Zhong, Xianbiao Qi, Shanjun Li, Dengyi Gu, Yihao Chen, Peiyang Ning, Rong Xiao

In this technical report, we present our 1st place solution for the ICDAR 2021 competition on mathematical formula detection (MFD).

118

Paper
Code

Winner Team Mia at TextVQA Challenge 2021: Vision-and-Language Representation Learning with Pre-trained Sequence-to-Sequence Model

no code implementations • 24 Jun 2021 • Yixuan Qiao, Hao Chen, Jun Wang, Yihao Chen, Xianbin Ye, Ziliang Li, Xianbiao Qi, Peng Gao, Guotong Xie

TextVQA requires models to read and reason about text in images to answer questions about them.

Language Modelling Masked Language Modeling +2

Paper
Add Code

PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML

2 code implementations • 5 May 2021 • Jiaquan Ye, Xianbiao Qi, Yelin He, Yihao Chen, Dengyi Gu, Peng Gao, Rong Xiao

In our method, we divide the table content recognition task into foursub-tasks: table structure recognition, text line detection, text line recognition, and box assignment. Our table structure recognition algorithm is customized based on MASTER [1], a robust image textrecognition algorithm.

Ranked #1 on Table Recognition on PubTabNet

Line Detection Table Recognition

38,382

Paper
Code

PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Table Image Recognition to Latex

no code implementations • 5 May 2021 • Yelin He, Xianbiao Qi, Jiaquan Ye, Peng Gao, Yihao Chen, Bingcong Li, Xin Tang, Rong Xiao

This paper presents our solution for the ICDAR 2021 Competition on Scientific Table Image Recognition to LaTeX.

Data Augmentation Scene Text Recognition

Paper
Add Code

Melody-Conditioned Lyrics Generation with SeqGANs

no code implementations • 28 Oct 2020 • Yihao Chen, Alexander Lerch

Automatic lyrics generation has received attention from both music and AI communities for years.

Paper
Add Code

Learning Graph Normalization for Graph Neural Networks

1 code implementation • 24 Sep 2020 • Yihao Chen, Xin Tang, Xianbiao Qi, Chun-Guang Li, Rong Xiao

We conduct extensive experiments on benchmark datasets for different tasks, including node classification, link prediction, graph classification and graph regression, and confirm that the learned graph normalization leads to competitive results and that the learned weights suggest the appropriate normalization techniques for the specific task.

Graph Classification Graph Regression +2

116

Paper
Code

Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition

no code implementations • 23 Sep 2020 • Bingcong Li, Xin Tang, Xianbiao Qi, Yihao Chen, Rong Xiao

Thus, we propose a lightweight scene text recognition model named Hamming OCR.

Optical Character Recognition (OCR) Scene Text Recognition

Paper
Add Code

Neural Mesh Refiner for 6-DoF Pose Estimation

no code implementations • 17 Mar 2020 • Di Wu, Yihao Chen, Xianbiao Qi, Yongjian Yu, Weixuan Chen, Rong Xiao

We utilise the overlay between the accurate mask prediction and less accurate mesh prediction to iteratively optimise the direct regressed 6D pose information with a focus on translation estimation.

Autonomous Driving Instance Segmentation +4

Paper
Add Code

MASTER: Multi-Aspect Non-local Network for Scene Text Recognition

7 code implementations • 7 Oct 2019 • Ning Lu, Wenwen Yu, Xianbiao Qi, Yihao Chen, Ping Gong, Rong Xiao, Xiang Bai

Attention-based scene text recognizers have gained huge success, which leverages a more compact intermediate representation to learn 1d- or 2d- attention by a RNN-based encoder-decoder architecture.

Scene Text Recognition

4,064

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.