no code implementations • NAACL (BEA) 2022 • Bowei Zou, Pengfei Li, Liangming Pan, Ai Ti Aw
In field of teaching, true/false questioning is an important educational method for assessing students’ general understanding of learning materials.
no code implementations • WMT (EMNLP) 2021 • Meng Zhang, Minghao Wu, Pengfei Li, Liangyou Li, Qun Liu
This paper describes the NoahNMT system submitted to the WMT 2021 shared task of Very Low Resource Supervised Machine Translation.
no code implementations • NAACL (AutoSimTrans) 2022 • Xingshan Zeng, Pengfei Li, Liangyou Li, Qun Liu
This paper describes the system submitted to AutoSimTrans 2022 from Huawei Noah’s Ark Lab, which won the first place in the audio input track of the Chinese-English translation task.
no code implementations • 4 Dec 2024 • Noah Shumba, Opelo Tshekiso, Pengfei Li, Giulia Fanti, Shaolei Ren
While most attention has been paid to developed countries such as the U. S., this paper presents the first-of-its-kind dataset that combines nation-level weather and electricity generation data to estimate water usage efficiency for data centers in 41 African countries across five different climate regions.
1 code implementation • 8 Nov 2024 • Zongyuan Li, Yanan Ni, Runnan Qi, Lumin Jiang, Chang Lu, Xiaojie Xu, Xiangbei Liu, Pengfei Li, Yunzheng Guo, Zhe Ma, Xian Guo, Kuihua Huang, Xuebo Zhang
This paper introduces a new environment LLM-PySC2 (the Large Language Model StarCraft II Learning Environment), a platform derived from DeepMind's StarCraft II Learning Environment that serves to develop Large Language Models (LLMs) based decision-making methodologies.
no code implementations • 6 Nov 2024 • Jianyi Yang, Pengfei Li, Adam Wierman, Shaolei Ren
In this paper, we remove the FLM assumption and tackle the open problem of OBM with general bids.
no code implementations • 31 Oct 2024 • Jinlong He, Pengfei Li, Gang Liu, Shenjun Zhong
Multimodal Large Language Models (MLLMs) inherit the superior text understanding capabilities of LLMs and extend these capabilities to multimodal scenarios.
no code implementations • 21 Oct 2024 • Zhengming Wang, Junli Wang, Pengfei Li, Zhaohan Li, Peng Li, Yilun Chen
While the capabilities of autonomous driving have advanced rapidly, merging into dense traffic remains a significant challenge, many motion planning methods for this scenario have been proposed but it is hard to evaluate them.
no code implementations • 5 Sep 2024 • Julong Wei, Shanshuai Yuan, Pengfei Li, Qingda Hu, Zhongxue Gan, Wenchao Ding
Then, we build a unified multi-modal vocabulary for vision, language and action.
no code implementations • 26 Jul 2024 • Fei Wang, Yuewen Zheng, Qin Li, Jingyi Wu, Pengfei Li, Luxia Zhang
For the result of value extraction based on correct key extraction, the overall accuracy was 97. 2%, precision was 95. 8%, recall was 95. 8%, and F1-score was 95. 8%.
Optical Character Recognition Optical Character Recognition (OCR)
no code implementations • 13 Jul 2024 • Siyan Liu, Chi-Kuang Yeh, Xin Zhang, Qinglong Tian, Pengfei Li
This study introduces a new approach to addressing positive and unlabeled (PU) data through the double exponential tilting model (DETM).
1 code implementation • The 38th Annual AAAI Conference on Artificial Intelligence 2024 • Ke Sun, Pei Liu, Pengfei Li, Zhifang Liao
Additionally, when handling traffic data, researchers tend to manually design the model structure based on the data features, which makes the structure of traffic prediction redundant and the model generalizability limited.
no code implementations • 8 Jun 2024 • Biqing Qi, Pengfei Li, Fangyuan Li, Junqi Gao, Kaiyan Zhang, BoWen Zhou
Inspired by intraspecific competition driving species evolution, we propose a Online Fast-Slow chasing DPO (OFS-DPO) for preference alignment, simulating competition through fast and slow chasing among models to facilitate rapid adaptation.
1 code implementation • 8 Jun 2024 • Biqing Qi, Yang Luo, Junqi Gao, Pengfei Li, Kai Tian, Zhiyuan Ma, BoWen Zhou
We find that fixed-parameterized SSMs have output error bounds strictly related to their parameters, limiting their AT benefits, while input-dependent SSMs may face the problem of error explosion.
1 code implementation • 4 Jun 2024 • Yejia Liu, Jianyi Yang, Pengfei Li, Tongxin Li, Shaolei Ren
Public models offer predictions to a variety of downstream tasks and have played a crucial role in various AI applications, showcasing their proficiency in accurate predictions.
1 code implementation • 3 Jun 2024 • Tianjing Zeng, Junwei Lan, Jiahong Ma, Wenqing Wei, Rong Zhu, Pengfei Li, Bolin Ding, Defu Lian, Zhewei Wei, Jingren Zhou
It is generally applicable to any unseen new database to attain high estimation accuracy, while its preparation cost is as little as the basic one-dimensional histogram-based CardEst methods.
no code implementations • 24 May 2024 • Pranjol Sen Gupta, Md Rajib Hossen, Pengfei Li, Shaolei Ren, Mohammad A. Islam
Freshwater scarcity is a global problem that requires collective efforts across all industry sectors.
no code implementations • 23 May 2024 • Pengfei Li, Ziyue Ma, Hong Wang, Juan Deng, Yan Wang, Zhenyu Xu, Feng Yan, Wenjun Tu, Hong Sha
To abundant traditional image methods with depth information, a method in registering with depth images and traditional clinical images was investigated.
no code implementations • 18 May 2024 • Guibin Zhao, Pengfei Li, Zhibo Zhang, Fusen Guo, Xueting Huang, Wei Xu, Jinyin Wang, Jianlong Chen
Synthetic Aperture Radar has been extensively used in numerous fields and can gather a wealth of information about the area of interest.
1 code implementation • 28 Mar 2024 • Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao
However, the exploration of 3D dense captioning in outdoor scenes is hindered by two major challenges: 1) the domain gap between indoor and outdoor scenes, such as dynamics and sparse visual inputs, makes it difficult to directly adapt existing indoor methods; 2) the lack of data with comprehensive box-caption pair annotations specifically tailored for outdoor scenes.
no code implementations • 15 Mar 2024 • Zhou Jiang, Zhenxin Zhu, Pengfei Li, Huan-ang Gao, Tianyuan Yuan, Yongliang Shi, Hang Zhao, Hao Zhao
On the other hand, we exploit a masked autoencoder to capture the prior distribution of HDMap, which can serve as a refinement module to mitigate occlusions and artifacts.
1 code implementation • 14 Mar 2024 • Yuhang Zheng, Xiangyu Chen, Yupeng Zheng, Songen Gu, Runyi Yang, Bu Jin, Pengfei Li, Chengliang Zhong, Zengmao Wang, Lina Liu, Chao Yang, Dawei Wang, Zhen Chen, Xiaoxiao Long, Meiqing Wang
In particular, we propose an Efficient Feature Distillation (EFD) module that employs contrastive learning to efficiently and accurately distill language embeddings derived from foundational models.
1 code implementation • 13 Mar 2024 • Yupeng Zheng, Xiang Li, Pengfei Li, Yuhang Zheng, Bu Jin, Chengliang Zhong, Xiaoxiao Long, Hao Zhao, Qichao Zhang
However, existing methods rely on a complex cascaded framework with relatively limited information to restore 3D scenes, including a dependency on supervision solely on the whole network's output, single-frame input, and the utilization of a small backbone.
no code implementations • 12 Mar 2024 • Yanyue Zhang, Pengfei Li, Yilong Lai, Deyu Zhou, Yulan He
In specific, a small size of synthesized negative reviews is obtained by rewriting the positive text via a large language model.
1 code implementation • 25 Feb 2024 • Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, Jingcheng Wu, Chenghua Lin, Qifeng Liu, Tao Jiang, Wenhao Huang, Wenhu Chen, Emmanouil Benetos, Jie Fu, Gus Xia, Roger Dannenberg, Wei Xue, Shiyin Kang, Yike Guo
It is based on continual pre-training and finetuning LLaMA2 on a text-compatible music representation, ABC notation, and the music is treated as a second language.
1 code implementation • 5 Jan 2024 • Gang Liu, Jinlong He, Pengfei Li, Genrong He, Zhaolin Chen, Shenjun Zhong
In this paper, we propose a parameter efficient framework for fine-tuning MLLMs, specifically validated on medical visual question answering (Med-VQA) and medical report generation (MRG) tasks, using public benchmark datasets.
Ranked #1 on Medical Visual Question Answering on VQA-RAD (using extra training data)
Medical Report Generation Medical Visual Question Answering +5
no code implementations • 22 Oct 2023 • Zhibo Zhang, Pengfei Li, Ahmed Y. Al Hammadi, Fusen Guo, Ernesto Damiani, Chan Yeob Yeun
This paper presents a reputation-based threat mitigation framework that defends potential security threats in electroencephalogram (EEG) signal classification during model aggregation of Federated Learning.
no code implementations • 21 Oct 2023 • Pengfei Li, Zhibo Zhang, Ameena S. Al-Sumaiti, Naoufel Werghi, Chan Yeob Yeun
Metaverse is trending to create a digital circumstance that can transfer the real world to an online platform supported by large quantities of real-time interactions.
1 code implementation • 19 Sep 2023 • Shaocong Xu, Pengfei Li, Xinyu Liu, Qianpu Sun, Yang Li, Shihui Guo, Zhen Wang, Bo Jiang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao
We demonstrate that learning different abstaining penalties, apart from point-wise penalty, for different types of (synthesized) outliers can further improve the performance.
1 code implementation • ICCV 2023 • Chengliang Zhong, Yuhang Zheng, Yupeng Zheng, Hao Zhao, Li Yi, Xiaodong Mu, Ling Wang, Pengfei Li, Guyue Zhou, Chao Yang, Xinliang Zhang, Jian Zhao
To address this issue, the Transporter method was introduced for 2D data, which reconstructs the target frame from the source frame to incorporate both spatial and temporal information.
1 code implementation • 11 Jul 2023 • Pengfei Li, Gang Liu, Jinlong He, Zixu Zhao, Shenjun Zhong
Medical visual question answering (VQA) is a challenging task that requires answering clinical questions of a given medical image, by taking consider of both visual and language information.
Ranked #2 on Medical Visual Question Answering on PathVQA
1 code implementation • 20 Jun 2023 • Pengfei Li, Jianyi Yang, Adam Wierman, Shaolei Ren
The results demonstrate that existing GLB approaches may amplify environmental inequity while our proposed equity-aware GLB can significantly reduce the regional disparity in terms of carbon and water footprints.
no code implementations • 16 Jun 2023 • Pengfei Li, Jianyi Yang, Adam Wierman, Shaolei Ren
This paper studies decentralized online convex optimization in a networked multi-agent system and proposes a novel algorithm, Learning-Augmented Decentralized Online optimization (LADO), for individual agents to select actions only based on local online information.
1 code implementation • 31 May 2023 • Pengfei Li, Jianyi Yang, Shaolei Ren
The key novelty of LOMAR is a new online switching operation which, based on a judicious condition to hedge against future uncertainties, decides whether to follow the expert's decision or the RL decision for each online item.
no code implementations • 1 May 2023 • Pengfei Li, Jianyi Yang, Shaolei Ren
In this paper, we propose a novel expert-robustified learning (ERL) approach, achieving {both} good average performance and robustness.
1 code implementation • ICCV 2023 • Huan-ang Gao, Beiwen Tian, Pengfei Li, Hao Zhao, Guyue Zhou
While this paradigm is natural for image-level or pixel-level prediction, adapting it to the detection problem is challenged by the issue of proposal matching.
1 code implementation • 6 Apr 2023 • Pengfei Li, Jianyi Yang, Mohammad A. Islam, Shaolei Ren
To respond to the global water challenges, AI models can, and also must, take social responsibility and lead by example by addressing their own water footprint.
no code implementations • 20 Mar 2023 • Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao
In this work, we develop a system that trained a trillion-parameter language model on a cluster of Ascend 910 AI processors and MindSpore framework, and present the language model with 1. 085T parameters named PanGu-{\Sigma}.
1 code implementation • 27 Feb 2023 • Pengfei Li, Ruowen Zhao, Yongliang Shi, Hao Zhao, Jirui Yuan, Guyue Zhou, Ya-Qin Zhang
In this paper, we propose a novel Eikonal formulation that conditions the implicit representation on localized shape priors which function as dense boundary value constraints, and demonstrate it works on SemanticKITTI and SemanticPOSS.
1 code implementation • 2 Feb 2023 • Yupeng Zheng, Chengliang Zhong, Pengfei Li, Huan-ang Gao, Yuhang Zheng, Bu Jin, Ling Wang, Hao Zhao, Guyue Zhou, Qichao Zhang, Dongbin Zhao
By fitting a bridge-shaped curve to the illumination map distribution, both regions are suppressed and two tasks are bridged naturally.
1 code implementation • 1 Feb 2023 • Bu Jin, Xinyu Liu, Yupeng Zheng, Pengfei Li, Hao Zhao, Tong Zhang, Yuhang Zheng, Guyue Zhou, Jingjing Liu
To bridge the gap, we propose an end-to-end transformer-based architecture, ADAPT (Action-aware Driving cAPtion Transformer), which provides user-friendly natural language narrations and reasoning for each decision making step of autonomous vehicular control and action.
1 code implementation • 31 Jan 2023 • Huan-ang Gao, Beiwen Tian, Pengfei Li, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Yurong Chen, Hongbin Zha
But adapting this scheme to the state-of-the-art (SOTA) solution for PC-based layout estimation is not straightforward.
1 code implementation • ICCV 2023 • Zhijie Yan, Pengfei Li, Zheng Fu, Shaocong Xu, Yongliang Shi, Xiaoxue Chen, Yuhang Zheng, Yang Li, Tianyu Liu, Chuxuan Li, Nairui Luo, Xu Gao, Yilun Chen, Zuoxu Wang, Yifeng Shi, Pengfei Huang, Zhengxiao Han, Jirui Yuan, Jiangtao Gong, Guyue Zhou, Hang Zhao, Hao Zhao
One of the most challenging problems in motion forecasting is interactive trajectory prediction, whose goal is to jointly forecasts the future trajectories of interacting agents.
2 code implementations • 24 Nov 2022 • Pengfei Li, Gang Liu, Lin Tan, Jinying Liao, Shenjun Zhong
Medical image visual question answering (VQA) is a task to answer clinical questions, given a radiographic image, which is a challenging problem that requires a model to integrate both vision and language information.
Ranked #4 on Medical Visual Question Answering on PathVQA
1 code implementation • 19 Oct 2022 • Pengfei Li, Beiwen Tian, Yongliang Shi, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
As such, we study the challenging problem of task oriented detection, which aims to find objects that best afford an action indicated by verbs like sit comfortably on.
no code implementations • 28 Sep 2022 • Yongliang Shi, Runyi Yang, Pengfei Li, Zirui Wu, Hao Zhao, Guyue Zhou
Neural implicit representations are drawing a lot of attention from the robotics community recently, as they are expressive, continuous and compact.
1 code implementation • 18 Sep 2022 • Zhenxin Zhu, Yuantao Chen, Zirui Wu, Chao Hou, Yongliang Shi, Chuxuan Li, Pengfei Li, Hao Zhao, Guyue Zhou
In this paper, we present LATITUDE: Global Localization with Truncated Dynamic Low-pass Filter, which introduces a two-stage localization mechanism in city-scale NeRF.
3 code implementations • 15 Sep 2022 • Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan
Compared to previous safe RL methods, CUP enjoys the benefits of 1) CUP generalizes the surrogate functions to generalized advantage estimator (GAE), leading to strong empirical performance.
no code implementations • 14 Jul 2022 • Zhanzhan Cheng, Peng Zhang, Can Li, Qiao Liang, Yunlu Xu, Pengfei Li, ShiLiang Pu, Yi Niu, Fei Wu
Most existing methods divide this task into two subparts: the text reading part for obtaining the plain text from the original document images and the information extraction part for extracting key contents.
1 code implementation • 14 Jul 2022 • Liang Qiao, Hui Jiang, Ying Chen, Can Li, Pengfei Li, Zaisheng Li, Baorui Zou, Dashan Guo, Yingda Xu, Yunlu Xu, Zhanzhan Cheng, Yi Niu
Compared with the previous opensource OCR toolbox, DavarOCR has relatively more complete support for the sub-tasks of the cutting-edge technology of document understanding.
no code implementations • 4 Jul 2022 • Chang Liu, Yugong Luo, Pengfei Li, Chunhui Xing, Weiwei Kong
To deal with this problem, this paper introduces a two-dimensional maneuver management framework with a fault-tolerant mechanism on the basis of the proposed hierarchical architecture for the platoon control system.
no code implementations • 18 Apr 2022 • Pengfei Li, Jianyi Yang, Shaolei Ren
Nonetheless, by using the standard practice of training an ML model as a standalone optimizer and plugging it into an ML-augmented algorithm, the average cost performance can be highly unsatisfactory.
1 code implementation • ACL 2022 • Pengfei Li, Liangyou Li, Meng Zhang, Minghao Wu, Qun Liu
To the best of our knowledge, this is the first work to pre-train a unified model for fine-tuning on both NMT tasks.
1 code implementation • 15 Feb 2022 • Long Yang, Jiaming Ji, Juntao Dai, Yu Zhang, Pengfei Li, Gang Pan
Although using bounds as surrogate functions to design safe RL algorithms have appeared in some existing works, we develop them at least three aspects: (i) We provide a rigorous theoretical analysis to extend the surrogate functions to generalized advantage estimator (GAE).
1 code implementation • 29 Nov 2021 • Pengfei Li, Yongliang Shi, Tianyu Liu, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
Recent advances show that semi-supervised implicit representation learning can be achieved through physical constraints like Eikonal equations.
no code implementations • 1 Oct 2021 • Yan Xia, Linhui Jiang, Lu Wang, Xue Chen, Jianjie Ye, Tangyan Hou, Liqiang Wang, Yibo Zhang, Mengying Li, Zhen Li, Zhe Song, Yaping Jiang, Weiping Liu, Pengfei Li, Daniel Rosenfeld, John H. Seinfeld, Shaocai Yu
Our results show that the ORRS measurements, assisted by the machine-learning-based ensemble model developed here, can realize day-to-day supervision of on-road vehicle-specific emissions.
no code implementations • 8 Apr 2021 • Qiyao Wang, Pengfei Li, Li Zhu, Yi Niu
For the text spotting task, we detect the characters on integrated circuit and classify them based on yolov5 detection model.
no code implementations • 27 Jan 2021 • Pengfei Li, Federico Lelli, Stacy McGaugh, James Schombert, Kyu-Hyun Chae
The application of Bayesian techniques to astronomical data is generally non-trivial because the fitting parameters can be strongly degenerated and the formal uncertainties are themselves uncertain.
Astrophysics of Galaxies Cosmology and Nongalactic Astrophysics Instrumentation and Methods for Astrophysics
1 code implementation • 29 Dec 2020 • Shuang Xu, Lizhen Ji, Zhe Wang, Pengfei Li, Kai Sun, Chunxia Zhang, Jiangshe Zhang
According to the idea that each local region in the fused image should be similar to the sharpest one among source images, this paper presents an optimization-based approach to reduce defocus spread effects.
no code implementations • 15 Dec 2020 • Peixiang Zhong, Di Wang, Pengfei Li, Chen Zhang, Hao Wang, Chunyan Miao
Experimental results on two large-scale datasets support our hypothesis and show that our model can produce more accurate and commonsense-aware emotional responses and achieve better human ratings than state-of-the-art models that only specialize in one aspect.
no code implementations • 14 Dec 2020 • Long Yang, Gang Zheng, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan
We study the convergence of $\mathtt{Expected~Sarsa}(\lambda)$ with linear function approximation.
1 code implementation • ECCV 2020 • He Chen, Pengfei Guo, Pengfei Li, Gim Hee Lee, Gregory Chirikjian
In this paper, we depart from the multi-person 3D pose estimation formulation, and instead reformulate it as crowd pose estimation.
Ranked #13 on 3D Multi-Person Pose Estimation on Panoptic (using extra training data)
3D Multi-Person Human Pose Estimation 3D Multi-Person Pose Estimation +2
2 code implementations • 20 Mar 2020 • Zixiang Zhao, Shuang Xu, Chun-Xia Zhang, Junmin Liu, Pengfei Li, Jiangshe Zhang
Infrared and visible image fusion, a hot topic in the field of image processing, aims at obtaining fused images keeping the advantages of source images.
Ranked #8 on Semantic Segmentation on FMB Dataset
no code implementations • 9 Dec 2019 • Pengfei Li, Weichao Qiu, Michael Peven, Gregory D. Hager, Alan L. Yuille
Scene context is a powerful constraint on the geometry of objects within the scene in cases, such as surveillance, where the camera geometry is unknown and image quality may be poor.
no code implementations • IJCNLP 2019 • Pengfei Li, Kezhi Mao, Xuefeng Yang, Qi Li
While attention mechanisms have been proven to be effective in many NLP tasks, majority of them are data-driven.
no code implementations • 6 Sep 2019 • Long Yang, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan
To address above problem, we propose a GQ$(\sigma,\lambda)$ that extends tabular Q$(\sigma,\lambda)$ with linear function approximation.
no code implementations • 25 Jun 2019 • Long Yang, Yu Zhang, Gang Zheng, Qian Zheng, Pengfei Li, Jianhang Huang, Jun Wen, Gang Pan
Improving sample efficiency has been a longstanding goal in reinforcement learning.
no code implementations • 25 Jun 2019 • Long Yang, Yu Zhang, Jun Wen, Qian Zheng, Pengfei Li, Gang Pan
In this paper, for reducing the variance, we introduce control variate technique to $\mathtt{Expected}$ $\mathtt{Sarsa}$($\lambda$) and propose a tabular $\mathtt{ES}$($\lambda$)-$\mathtt{CV}$ algorithm.
no code implementations • 8 May 2019 • Pengfei Li, Yu Hua, Pengfei Zuo, Jingnan Jia
Index structures are important for efficient data access, which have been widely used to improve the performance in many in-memory systems.
no code implementations • 14 Jun 2018 • Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan
In this paper, we propose a general framework to combine DQN and most of the return-based reinforcement learning algorithms, named R-DQN.