no code implementations • 5 May 2019 • Faen Zhang, Xinyu Fan, Guo Ai, Jianfei Song, Yongqiang Qin, Jia-Hong Wu
Face detection has witnessed significant progress due to the advances of deep convolutional neural networks (CNNs).
Ranked #5 on Face Detection on WIDER Face (Hard)
no code implementations • 21 Jul 2020 • Jiahong Wu, Jianfei Lu, Xinxin Kang, Yiming Zhang, Yinhang Tang, Jianfei Song, Ze Huang, Shenglan Ben, Jiashui Huang, Faen Zhang
Panoramic segmentation is a scene where image segmentation tasks is more difficult.
no code implementations • 22 Apr 2022 • Lin Yao, Jianfei Song, Ruizhuo Xu, Yingfang Yang, Zijian Chen, Yafeng Deng
Basically, there are two main methods for SLU tasks: (1) Two-stage method, which uses a speech model to transfer speech to text, then uses a language model to get the results of downstream tasks; (2) One-stage method, which just fine-tunes a pre-trained speech model to fit in the downstream tasks.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
1 code implementation • 8 May 2022 • Chunyu Xie, Heng Cai, Jincheng Li, Fanjing Kong, Xiaoyu Wu, Jianfei Song, Henrique Morimitsu, Lin Yao, Dexin Wang, Xiangzheng Zhang, Dawei Leng, Baochang Zhang, Xiangyang Ji, Yafeng Deng
In this work, we build a large-scale high-quality Chinese Cross-Modal Benchmark named CCMB for the research community, which contains the currently largest public pre-training dataset Zero and five human-annotated fine-tuning datasets for downstream tasks.
Ranked #3 on Image Retrieval on Flickr30k-CN
no code implementations • 16 Apr 2024 • Lijun Liu, Jiali Yang, Jianfei Song, Xinglin Yang, Lele Niu, Zeqi Cai, Hui Shi, Tingjun Hou, Chang-Yu Hsieh, Weiran Shen, Yafeng Deng
Additionally, in the absence of AAV9 capsid data, apart from one wild-type sequence, we used the same model to directly generate a number of viable sequences with up to 9 mutations.