Search Results for author: Peng Zhang

Found 255 papers, 91 papers with code

Natural Language Processing Meets Quantum Physics: A Survey and Categorization

no code implementations EMNLP 2021 Sixuan Wu, Jian Li, Peng Zhang, Yue Zhang

Recent research has investigated quantum NLP, designing algorithms that process natural language in quantum computers, and also quantum-inspired algorithms that improve NLP performance on classical computers.

Survey

ClusterFormer: Neural Clustering Attention for Efficient and Effective Transformer

no code implementations ACL 2022 Ningning Wang, Guobing Gan, Peng Zhang, Shuai Zhang, Junqiu Wei, Qun Liu, Xin Jiang

Other sparse methods use clustering patterns to select words, but the clustering process is separate from the training process of the target task, which causes a decrease in effectiveness.

Clustering Machine Translation +4

HOSMEL: A Hot-Swappable Modularized Entity Linking Toolkit for Chinese

1 code implementation ACL 2022 Daniel Zhang-li, Jing Zhang, Jifan Yu, Xiaokang Zhang, Peng Zhang, Jie Tang, Juanzi Li

We investigate the usage of entity linking (EL) in downstream tasks and present the first modularized EL toolkit for easy task adaptation.

Entity Linking Question Answering

MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation

no code implementations 3 Jun 2025 Mingyang Huang, Peng Zhang, Bang Zhang

Generating long-term, coherent, and realistic music-conditioned dance sequences remains a challenging task in human motion synthesis.

Contrastive Learning Motion Synthesis +4

Are MLMs Trapped in the Visual Room?

no code implementations 29 May 2025 Yazhou Zhang, Chunwang Zou, Qimeng Liu, Lu Rong, Ben Yao, Zheng Lian, Qiuchi Li, Peng Zhang, Jing Qin

In implementation, we introduce a two-tier evaluation framework spanning perception and cognition.

LLM-Based User Simulation for Low-Knowledge Shilling Attacks on Recommender Systems

no code implementations 18 May 2025 Shengkang Gu, Jiahao Liu, Dongsheng Li, Guangping Zhang, Mingzhe Han, Hansu Gu, Peng Zhang, Ning Gu, Li Shang, Tun Lu

Recommender systems (RS) are increasingly vulnerable to shilling attacks, where adversaries inject fake user profiles to manipulate system outputs.

Language Modeling Language Modelling +4

AdAEM: An Adaptively and Automated Extensible Measurement of LLMs' Value Difference

no code implementations 18 May 2025 Shitong Duan, Xiaoyuan Yi, Peng Zhang, Dongkuan Xu, Jing Yao, Tun Lu, Ning Gu, Xing Xie

Assessing Large Language Models (LLMs)' underlying value differences enables comprehensive comparison of their misalignment, cultural adaptability, and biases.

Informativeness

Ultrasound Report Generation with Multimodal Large Language Models for Standardized Texts

no code implementations 13 May 2025 Peixuan Ge, Tongkun Su, Faqin Lv, Baoliang Zhao, Peng Zhang, Chi Hong Wong, Liang Yao, Yu Sun, Zenan Wang, Pak Kin Wong, Ying Hu

In this study, we propose a unified framework for multi-organ and multilingual US report generation, integrating fragment-based multilingual training and leveraging the standardized nature of US reports.

Text Generation

Fast-Slow Thinking for Large Vision-Language Model Reasoning

no code implementations 25 Apr 2025 Wenyi Xiao, Leilei Gan, Weilong Dai, Wanggui He, Ziwei Huang, Haoyuan Li, Fangxun Shu, Zhelun Yu, Peng Zhang, Hao Jiang, Fei Wu

Recent advances in large vision-language models (LVLMs) have revealed an "overthinking" phenomenon, where models generate verbose reasoning across all tasks regardless of questions.

Language Modeling Language Modelling

FedCIA: Federated Collaborative Information Aggregation for Privacy-Preserving Recommendation

1 code implementation 19 Apr 2025 Mingzhe Han, Dongsheng Li, Jiafeng Xia, Jiahao Liu, Hansu Gu, Peng Zhang, Ning Gu, Tun Lu

Based on this new paradigm, we introduce the federated collaborative information aggregation (FedCIA) method for privacy-preserving recommendation.

Privacy Preserving

Spiking Neural Network for Intra-cortical Brain Signal Decoding

1 code implementation 12 Apr 2025 Song Yang, Haotian Fu, Herui Zhang, Peng Zhang, Wei Li, Dongrui Wu

Decoding brain signals accurately and efficiently is crucial for intra-cortical brain-computer interfaces.

OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

no code implementations 3 Apr 2025 Zhongjian Wang, Peng Zhang, Jinwei Qi, Guangyuan Wang, Sheng Xu, Bang Zhang, Liefeng Bo

To address these limitations, we introduce OmniTalker, an end-to-end unified framework that simultaneously generates synchronized speech and talking head videos from text and reference video in real-time zero-shot scenarios, while preserving both speech style and facial styles.

Talking Head Generation Video Synchronization

Beyond Single-Sentence Prompts: Upgrading Value Alignment Benchmarks with Dialogues and Stories

no code implementations 28 Mar 2025 Yazhou Zhang, Qimeng Liu, Qiuchi Li, Peng Zhang, Jing Qin

Evaluating the value alignment of large language models (LLMs) has traditionally relied on single-sentence adversarial prompts, which directly probe models with ethically sensitive or controversial questions.

Ethics Sentence

ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model

no code implementations 27 Mar 2025 Jinwei Qi, Chaonan Ji, Sheng Xu, Peng Zhang, Bang Zhang, Liefeng Bo

Real-time interactive video-chat portraits have been increasingly recognized as the future trend, particularly due to the remarkable progress made in text and voice chat technologies.

Video Generation

Stabilization Analysis and Mode Recognition of Kerosene Supersonic Combustion: A Deep Learning Approach Based on Res-CNN-beta-VAE

no code implementations 17 Mar 2025 Weiming Xu, Tao Yang, Chang Liu, Kun Wu, Peng Zhang

The scramjet engine is a key propulsion system for hypersonic vehicles, leveraging supersonic airflow to achieve high specific impulse, making it a promising technology for aerospace applications.

Clustering Dimensionality Reduction

Dynamical Mode Recognition of Turbulent Flames in a Swirl-stabilized Annular Combustor by a Time-series Learning Approach

no code implementations 17 Mar 2025 Tao Yang, Weiming Xu, Liangliang Xu, Peng Zhang

Thermoacoustic instability in annular combustors, essential to aero engines and modern gas turbines, can severely impair operational stability and efficiency. Accurately recognizing and understanding various combustion modes is the prerequisite for understanding and controlling combustion instabilities.

Dimensionality Reduction Time Series

Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking

no code implementations 25 Feb 2025 Peng Zhang, Xin Li, Xin Lin, Liang He

Recent advancements in 3D multi-object tracking (3D MOT) have predominantly relied on tracking-by-detection pipelines.

3D Multi-Object Tracking Autonomous Driving +2

AgentCF++: Memory-enhanced LLM-based Agents for Popularity-aware Cross-domain Recommendations

1 code implementation 19 Feb 2025 Jiahao Liu, Shengkang Gu, Dongsheng Li, Guangping Zhang, Mingzhe Han, Hansu Gu, Peng Zhang, Tun Lu, Li Shang, Ning Gu

However, the memory design in current methods causes user agents to introduce significant irrelevant information during decision-making in cross-domain scenarios and makes them unable to recognize the influence of other users' interactions, such as popularity factors.

Decision Making Language Modeling +3

Unbiased Collaborative Filtering with Fair Sampling

1 code implementation 19 Feb 2025 Jiahao Liu, Dongsheng Li, Hansu Gu, Peng Zhang, Tun Lu, Li Shang, Ning Gu

Recommender systems leverage extensive user interaction data to model preferences; however, directly modeling these data may introduce biases that disproportionately favor popular items.

Collaborative Filtering Fairness +1

Improving LLM-powered Recommendations with Personalized Information

1 code implementation 19 Feb 2025 Jiahao Liu, Xueshuo Yan, Dongsheng Li, Guangping Zhang, Hansu Gu, Peng Zhang, Tun Lu, Li Shang, Ning Gu

Due to the lack of explicit reasoning modeling, existing LLM-powered recommendations fail to leverage LLMs' reasoning capabilities effectively.

Recommendation Systems

Deep Reinforcement Learning-Based Bidding Strategies for Prosumers Trading in Double Auction-Based Transactive Energy Market

no code implementations 16 Feb 2025 Jun Jiang, Yuanliang Li, Luyang Hou, Mohsen Ghafouri, Peng Zhang, Jun Yan, Yuhong Liu

We also propose a deep reinforcement learning (DRL) model with distributed learning and execution to ensure the scalability and privacy of the market environment.

Deep Reinforcement Learning

LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search

no code implementations 12 Feb 2025 Yang Gao, Hong Yang, Yizhi Chen, Junxian Wu, Peng Zhang, Haishuai Wang

LLM4GNAS includes an algorithm library for graph neural architecture search algorithms based on LLMs, enabling the adaptation of GNAS methods to new search spaces through the modification of LLM prompts.

Feature Engineering Graph Learning +5

Leveraging Geolocation in Clinical Records to Improve Alzheimer's Disease Diagnosis Using DMV Framework

no code implementations 6 Feb 2025 Peng Zhang, Divya Chaudhary

Alzheimer's Disease (AD) early detection is critical for enabling timely intervention and improving patient outcomes.

Every Image Listens, Every Image Dances: Music-Driven Image Animation

no code implementations 30 Jan 2025 Zhikang Dong, Weituo Hao, Ju-Chiang Wang, Peng Zhang, Pawel Polak

To advance research in this field, we present a new multimodal dataset comprising 2,904 dance videos with corresponding background music and text descriptions.

Image Animation Video Generation

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

4 code implementations 22 Jan 2025 DeepSeek-AI, Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Ruoyu Zhang, Runxin Xu, Qihao Zhu, Shirong Ma, Peiyi Wang, Xiao Bi, Xiaokang Zhang, Xingkai Yu, Yu Wu, Z. F. Wu, Zhibin Gou, Zhihong Shao, Zhuoshu Li, Ziyi Gao, Aixin Liu, Bing Xue, Bingxuan Wang, Bochao Wu, Bei Feng, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, Damai Dai, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fucong Dai, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Han Bao, Hanwei Xu, Haocheng Wang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Qu, Hui Li, JianZhong Guo, Jiashi Li, Jiawei Wang, Jingchang Chen, Jingyang Yuan, Junjie Qiu, Junlong Li, J. L. Cai, Jiaqi Ni, Jian Liang, Jin Chen, Kai Dong, Kai Hu, Kaige Gao, Kang Guan, Kexin Huang, Kuai Yu, Lean Wang, Lecong Zhang, Liang Zhao, Litong Wang, Liyue Zhang, Lei Xu, Leyi Xia, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Meng Li, Miaojun Wang, Mingming Li, Ning Tian, Panpan Huang, Peng Zhang, Qiancheng Wang, Qinyu Chen, Qiushi Du, Ruiqi Ge, Ruisong Zhang, Ruizhe Pan, Runji Wang, R. J. Chen, R. L. Jin, Ruyi Chen, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shiyu Wang, Shuiping Yu, Shunfeng Zhou, Shuting Pan, S. S. Li, Shuang Zhou, Shaoqing Wu, Shengfeng Ye, Tao Yun, Tian Pei, Tianyu Sun, T. Wang, Wangding Zeng, Wanjia Zhao, Wen Liu, Wenfeng Liang, Wenjun Gao, Wenqin Yu, Wentao Zhang, W. L. Xiao, Wei An, Xiaodong Liu, Xiaohan Wang, Xiaokang Chen, Xiaotao Nie, Xin Cheng, Xin Liu, Xin Xie, Xingchao Liu, Xinyu Yang, Xinyuan Li, Xuecheng Su, Xuheng Lin, X. Q. Li, Xiangyue Jin, Xiaojin Shen, Xiaosha Chen, Xiaowen Sun, Xiaoxiang Wang, Xinnan Song, Xinyi Zhou, Xianzu Wang, Xinxia Shan, Y. K. Li, Y. Q. Wang, Y. X. Wei, Yang Zhang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Wang, Yi Yu, Yichao Zhang, Yifan Shi, Yiliang Xiong, Ying He, Yishi Piao, Yisong Wang, Yixuan Tan, Yiyang Ma, Yiyuan Liu, Yongqiang Guo, Yuan Ou, Yuduan Wang, Yue Gong, Yuheng Zou, Yujia He, Yunfan Xiong, Yuxiang Luo, Yuxiang You, Yuxuan Liu, Yuyang Zhou, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yaohui Li, Yi Zheng, Yuchen Zhu, Yunxian Ma, Ying Tang, Yukun Zha, Yuting Yan, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhean Xu, Zhenda Xie, Zhengyan Zhang, Zhewen Hao, Zhicheng Ma, Zhigang Yan, Zhiyu Wu, Zihui Gu, Zijia Zhu, Zijun Liu, Zilin Li, Ziwei Xie, Ziyang Song, Zizheng Pan, Zhen Huang, Zhipeng Xu, Zhongyu Zhang, Zhen Zhang

We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.

Mathematical Reasoning Multi-task Language Understanding +2

Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values

no code implementations 13 Jan 2025 Jing Yao, Xiaoyuan Yi, Shitong Duan, Jindong Wang, Yuzhuo Bai, Muhua Huang, Peng Zhang, Tun Lu, Zhicheng Dou, Maosong Sun, Xing Xie

As Large Language Models (LLMs) achieve remarkable breakthroughs, aligning their values with humans has become imperative for their responsible development and customized applications.

The Scaling Law for LoRA Base on Mutual Information Upper Bound

no code implementations 6 Jan 2025 Jing Zhang, Hui Gao, Peng Zhang, Shuzhen Sun, Chang Yang, Yuexian Hou

In the fine-tuning process for large models, two types of knowledge are typically involved: the frozen, general knowledge acquired by the model during pre-training and the new knowledge learned through the LoRA module from the current data.

General Knowledge

Exploring Timeline Control for Facial Motion Generation

no code implementations CVPR 2025 Yifeng Ma, Jinwei Qi, Chaonan Ji, Peng Zhang, Bang Zhang, Zhidong Deng, Liefeng Bo

Based on the annotations, we propose a diffusion-based generation model capable of generating facial motions that are natural and accurately aligned with input timelines.

Motion Generation

Leveraging SD Map to Augment HD Map-based Trajectory Prediction

no code implementations CVPR 2025 Zhiwei Dong, Ran Ding, Wei Li, Peng Zhang, Guobin Tang, Jia Guo

Latest trajectory prediction models in real-world autonomous driving systems often rely on online High-Definition (HD) maps to understand the road environment. However, online HD maps suffer from perception errors and feature redundancy, which hinder the performance of HD map-based trajectory prediction models. To address these issues, we introduce a framework, termed SD map-Augmented Trajectory Prediction (SATP), which leverages Standard-Definition (SD) maps to enhance HD map-based trajectory prediction models. First, we propose an SD-HD fusion approach to leverage SD maps across the diverse range of HD map-based trajectory prediction models.

Autonomous Driving Prediction +1

DeepSeek-V3 Technical Report

4 code implementations 27 Dec 2024 DeepSeek-AI, Aixin Liu, Bei Feng, Bing Xue, Bingxuan Wang, Bochao Wu, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fucong Dai, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Han Bao, Hanwei Xu, Haocheng Wang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, JianZhong Guo, Jiaqi Ni, Jiashi Li, Jiawei Wang, Jin Chen, Jingchang Chen, Jingyang Yuan, Junjie Qiu, Junlong Li, Junxiao Song, Kai Dong, Kai Hu, Kaige Gao, Kang Guan, Kexin Huang, Kuai Yu, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Litong Wang, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qiancheng Wang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruisong Zhang, Ruizhe Pan, Runji Wang, Runxin Xu, Ruoyu Zhang, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Shuting Pan, T. Wang, Tao Yun, Tian Pei, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wanjia Zhao, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wenqin Yu, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaokang Zhang, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Cheng, Xin Liu, Xin Xie, Xingchao Liu, Xingkai Yu, Xinnan Song, Xinxia Shan, Xinyi Zhou, Xinyu Yang, Xinyuan Li, Xuecheng Su, Xuheng Lin, Y. K. Li, Y. Q. Wang, Y. X. Wei, Y. X. Zhu, Yang Zhang, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Yu, Yi Zheng, Yichao Zhang, Yifan Shi, Yiliang Xiong, Ying He, Ying Tang, Yishi Piao, Yisong Wang, Yixuan Tan, Yiyang Ma, Yiyuan Liu, Yongqiang Guo, Yu Wu, Yuan Ou, Yuchen Zhu, Yuduan Wang, Yue Gong, Yuheng Zou, Yujia He, Yukun Zha, Yunfan Xiong, Yunxian Ma, Yuting Yan, Yuxiang Luo, Yuxiang You, Yuxuan Liu, Yuyang Zhou, Z. F. Wu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhean Xu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhengyan Zhang, Zhewen Hao, Zhibin Gou, Zhicheng Ma, Zhigang Yan, Zhihong Shao, Zhipeng Xu, Zhiyu Wu, Zhongyu Zhang, Zhuoshu Li, Zihui Gu, Zijia Zhu, Zijun Liu, Zilin Li, Ziwei Xie, Ziyang Song, Ziyi Gao, Zizheng Pan

We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

Language Modeling Language Modelling +1

Energy-Efficient RIS-Aided Cell-Free Massive MIMO Systems: Application, Opportunities, and Challenges

no code implementations 23 Dec 2024 Yu Lu, Jiayi Zhang, Enyu Shi, Peng Zhang, Derrick Wing Kwan Ng, Dusit Niyato, Bo Ai

Reconfigurable intelligent surfaces (RIS)-assisted cell-free massive multiple-input multiple-output (CF mMIMO) systems have emerged as a promising technology for sixth-generation communication systems.

SEE: Sememe Entanglement Encoding for Transformer-bases Models Compression

no code implementations 15 Dec 2024 Jing Zhang, Shuzhen Sun, Peng Zhang, Guangxing Cao, Hui Gao, Xindian Ma, Nan Xu, Yuexian Hou

Transformer-based large language models exhibit groundbreaking capabilities, but their storage and computational costs are prohibitively high, limiting their application in resource-constrained scenarios.

Word Embeddings

Oracle-guided Dynamic User Preference Modeling for Sequential Recommendation

1 code implementation 1 Dec 2024 Jiafeng Xia, Dongsheng Li, Hansu Gu, Tun Lu, Peng Zhang, Li Shang, Ning Gu

Besides past information, future information is also available during training; it contains the "oracle" user preferences in the future and will be beneficial for modeling dynamic user preferences.

Sequential Recommendation

Online Voltage Regulation of Distribution Systems with Disturbance-Action Controllers

no code implementations 1 Dec 2024 Peng Zhang, Baosen Zhang

Inverter-based distributed energy resources facilitate the advanced voltage control algorithms in the online setting with the flexibility in both active and reactive power injections.

BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?

no code implementations 19 Nov 2024 Zongmeng Zhang, Jinhua Zhu, Wengang Zhou, Xiang Qi, Peng Zhang, Houqiang Li

Dense retrieval, which aims to encode the semantic information of arbitrary text into dense vector representations or embeddings, has emerged as an effective and efficient paradigm for text retrieval, consequently becoming an essential component in various natural language processing systems.

Text Retrieval

Steam Turbine Anomaly Detection: An Unsupervised Learning Approach Using Enhanced Long Short-Term Memory Variational Autoencoder

no code implementations 16 Nov 2024 Weiming Xu, Peng Zhang

Specifically, LSTMVAE, integrating LSTM with VAE, was used to project high-dimensional time-series data to a low-dimensional phase space.

Unsupervised Anomaly Detection

DeMod: A Holistic Tool with Explainable Detection and Personalized Modification for Toxicity Censorship

no code implementations 4 Nov 2024 Yaqiong Li, Peng Zhang, Hansu Gu, Tun Lu, Siyuan Qiao, Yubo Shu, YiYang Shao, Ning Gu

Toxicity censorship is a complex process, wherein detection is just an initial task and a user can have further needs such as rationale understanding and content modification.

CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression

no code implementations 10 Oct 2024 Wenyuan Liu, Xindian Ma, Peng Zhang, Yan Wang

Through quantitative analysis of the quantization kernel, we find that these elements are crucial for maintaining the accuracy of quantized LLMs.

Language Modeling Language Modelling +3

Filtering Discomforting Recommendations with Large Language Models

no code implementations 7 Oct 2024 Jiahao Liu, YiYang Shao, Peng Zhang, Dongsheng Li, Hansu Gu, Chao Chen, Longzhi Du, Tun Lu, Ning Gu

Personalized algorithms can inadvertently expose users to discomforting recommendations, potentially triggering negative consequences.

Language Modeling Language Modelling +1

Zodiac: A Cardiologist-Level LLM Framework for Multi-Agent Diagnostics

no code implementations 2 Oct 2024 Yuan Zhou, Peng Zhang, Mengya Song, Alice Zheng, Yiwen Lu, Zhiheng Liu, Yong Chen, Zhaohan Xi

In this work, we introduce ZODIAC, an LLM-powered framework with cardiologist-level professionalism designed to engage LLMs in cardiological diagnostics.

Electrocardiography (ECG)

CogVLM2: Visual Language Models for Image and Video Understanding

3 code implementations 29 Aug 2024 Wenyi Hong, Weihan Wang, Ming Ding, Wenmeng Yu, Qingsong Lv, Yan Wang, Yean Cheng, Shiyu Huang, Junhui Ji, Zhao Xue, Lei Zhao, Zhuoyi Yang, Xiaotao Gu, Xiaohan Zhang, Guanyu Feng, Da Yin, Zihan Wang, Ji Qi, Xixuan Song, Peng Zhang, Debing Liu, Bin Xu, Juanzi Li, Yuxiao Dong, Jie Tang

Beginning with VisualGLM and CogVLM, we are continuously exploring VLMs in pursuit of enhanced vision-language fusion, efficient higher-resolution architecture, and broader modalities and applications.

MM-Vet MVBench +3

ConsistencyTrack: A Robust Multi-Object Tracker with a Generation Strategy of Consistency Model

1 code implementation 28 Aug 2024 Lifan Jiang, Zhihui Wang, Siqi Yin, Guangxiao Ma, Peng Zhang, Boxi Wu

Multi-object tracking (MOT) is a critical technology in computer vision, designed to detect multiple targets in video sequences and assign each target a unique ID per frame.

Denoising Multi-Object Tracking

CL4KGE: A Curriculum Learning Method for Knowledge Graph Embedding

no code implementations 27 Aug 2024 Yang Liu, Chuan Zhou, Peng Zhang, Yanan Cao, Yongchao Liu, Zhao Li, Hongyang Chen

Knowledge graph embedding (KGE) constitutes a foundational task, directed towards learning representations for entities and relations within knowledge graphs (KGs), with the objective of crafting representations comprehensive enough to approximate the logical and symbolic interconnections among entities.

Knowledge Graph Embedding Knowledge Graphs

AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results

1 code implementation 21 Aug 2024 Maksim Smirnov, Aleksandr Gushchin, Anastasia Antsiferova, Dmitry Vatolin, Radu Timofte, Ziheng Jia, ZiCheng Zhang, Wei Sun, Jiaying Qian, Yuqin Cao, Yinan Sun, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai, Kanjar De, Qing Luo, Ao-Xiang Zhang, Peng Zhang, Haibo Lei, Linyan Jiang, Yaqing Li, Wenhui Meng, Zhenzhong Chen, Zhengxue Cheng, Jiahao Xiao, Jun Xu, Chenlong He, Qi Zheng, Ruoxi Zhu, Min Li, Yibo Fan, Zhengzhong Tu

The challenge aimed to evaluate the performance of VQA methods on a diverse dataset of 459 videos, encoded with 14 codecs of various compression standards (AVC/H.264, HEVC/H.265, AV1, and VVC/H.266) and containing a comprehensive collection of compression artifacts.

Image Manipulation valid +3

AOTree: Aspect Order Tree-based Model for Explainable Recommendation

no code implementations 29 Jul 2024 Wenxin Zhao, Peng Zhang, Hansu Gu, Dongsheng Li, Tun Lu, Ning Gu

Therefore, in this paper, we propose the Aspect Order Tree-based (AOTree) explainable recommendation method, inspired by the Order Effects Theory from cognitive and decision psychology, in order to capture the dependency relationships among decisive factors.

Decision Making Explainable Recommendation +1

Hybrid Deep Learning Framework for Enhanced Melanoma Detection

no code implementations 16 Jul 2024 Peng Zhang, Divya Chaudhary

In this paper, we present a novel and highly efficient melanoma detection framework that synergistically combines the strengths of U-Net for segmentation and EfficientNet for the classification of skin images.

Binary Classification Classification +2

Structure-Aware Consensus Network on Graphs with Few Labeled Nodes

no code implementations 2 Jul 2024 Shuaike Xu, Xiaolin Zhang, Peng Zhang, Kun Zhan

Secondly, SACN uniquely integrates the graph's structural information to achieve strong-to-strong consensus learning, improving the utilization of unlabeled data while maintaining multiview learning.

Graph Neural Network Multiview Learning +2

Physics-Informed AI Inverter

no code implementations 25 Jun 2024 Qing Shen, Yifan Zhou, Peng Zhang, Yacov A. Shamash, Roshan Sharma, Bo Chen

This letter devises an AI-Inverter that pilots the use of a physics-informed neural network (PINN) to enable AI-based electromagnetic transient simulations (EMT) of grid-forming inverters.

3D-RPE: Enhancing Long-Context Modeling Through 3D Rotary Position Encoding

no code implementations 14 Jun 2024 Xindian Ma, Wenyuan Liu, Peng Zhang, Nan Xu

3D-RPE is an advanced version of the widely used 2D Rotary Position Encoding (RoPE), with two major advantages for modeling long contexts: controllable long-term decay and improved position resolution.

Language Modeling Language Modelling +2

Quantum Implicit Neural Representations

1 code implementation 6 Jun 2024 Jiaming Zhao, Wenbo Qiao, Peng Zhang, Hui Gao

In this paper, we propose Quantum Implicit Representation Network (QIREN), a novel quantum generalization of FNNs.

Image Generation

Decision-focused Graph Neural Networks for Combinatorial Optimization

no code implementations 5 Jun 2024 Yang Liu, Chuan Zhou, Peng Zhang, Shirui Pan, Zhao Li, Hongyang Chen

In recent years, there has been notable interest in investigating combinatorial optimization (CO) problems with neural-based frameworks.

Combinatorial Optimization

Combinatorial Optimization with Automated Graph Neural Networks

1 code implementation 5 Jun 2024 Yang Liu, Peng Zhang, Yang Gao, Chuan Zhou, Zhao Li, Hongyang Chen

The idea of AutoGNP is to use graph neural architecture search algorithms to automatically find the best GNNs for a given NP-hard combinatorial optimization problem.

Combinatorial Optimization Graph Embedding +2

Joint Precoding for RIS-Assisted Wideband THz Cell-Free Massive MIMO Systems

no code implementations 13 May 2024 Xin Su, Ruisi He, Peng Zhang, Bo Ai

After that, with knowledge of the optimal TDs of APs, we decouple the optimization problem into three subproblems of optimizing the baseband beamformers, RISs and TDs of RISs, respectively.

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

5 code implementations 7 May 2024 DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, JianZhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, T. Wang, Tian Pei, Tian Yuan, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Liu, Xin Xie, Xingkai Yu, Xinnan Song, Xinyi Zhou, Xinyu Yang, Xuan Lu, Xuecheng Su, Y. Wu, Y. K. Li, Y. X. Wei, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Zheng, Yichao Zhang, Yiliang Xiong, Yilong Zhao, Ying He, Ying Tang, Yishi Piao, Yixin Dong, Yixuan Tan, Yiyuan Liu, Yongji Wang, Yongqiang Guo, Yuchen Zhu, Yuduan Wang, Yuheng Zou, Yukun Zha, Yunxian Ma, Yuting Yan, Yuxiang You, Yuxuan Liu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhewen Hao, Zhihong Shao, Zhiniu Wen, Zhipeng Xu, Zhongyu Zhang, Zhuoshu Li, Zihan Wang, Zihui Gu, Zilin Li, Ziwei Xie

MLA guarantees efficient inference through significantly compressing the Key-Value (KV) cache into a latent vector, while DeepSeekMoE enables training strong models at an economical cost through sparse computation.

Language Modeling Language Modelling +2

Dynamical Mode Recognition of Coupled Flame Oscillators by Supervised and Unsupervised Learning Approaches

no code implementations 27 Apr 2024 Weiming Xu, Tao Yang, Peng Zhang

Combustion instability in gas turbines and rocket engines, as one of the most challenging problems in combustion research, arises from the complex interactions among flames, which are also influenced by chemical reactions, heat and mass transfer, and acoustics.

Dimensionality Reduction Dynamic Time Warping

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

1 code implementation 24 Apr 2024 Zinan Guo, Yanze Wu, Zhuowei Chen, Lang Chen, Peng Zhang, Qian He

We propose Pure and Lightning ID customization (PuLID), a novel tuning-free ID customization method for text-to-image generation.

Text to Image Generation Text-to-Image Generation

MFORT-QA: Multi-hop Few-shot Open Rich Table Question Answering

no code implementations 28 Mar 2024 Che Guan, Mengyu Huang, Peng Zhang

To address this challenge, the approach of Table Question Answering (QA) has been developed to extract the relevant information.

Few-Shot Learning Question Answering +2

InternLM2 Technical Report

3 code implementations 26 Mar 2024 Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, FuKai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Chao Xu, Ruiliang Xu, Hang Yan, Yirong Yan, Xiaogui Yang, Haochen Ye, Huaiyuan Ying, JIA YU, Jing Yu, Yuhang Zang, Chuyu Zhang, Li Zhang, Pan Zhang, Peng Zhang, Ruijie Zhang, Shuo Zhang, Songyang Zhang, Wenjian Zhang, Wenwei Zhang, Xingcheng Zhang, Xinyue Zhang, Hui Zhao, Qian Zhao, Xiaomeng Zhao, Fengzhe Zhou, Zaida Zhou, Jingming Zhuo, Yicheng Zou, Xipeng Qiu, Yu Qiao, Dahua Lin

The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI).

4k Long-Context Understanding

Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning

no code implementations 15 Mar 2024 Yongquan He, Wenyuan Zhang, Xuancheng Huang, Peng Zhang

Recent methods try to alleviate the catastrophic forgetting (CF) problem by modifying models or replaying data, which may only remember the surface-level pattern of instructions and get confused on held-out tasks.

Instruction Following

Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation

no code implementations11 Mar 2024 Peng Zhang, Ting Wu, Jinsheng Sun, Weiqing Li, Zhiyong Su

This paper concentrates on an unexplored yet meaningful task, i.e., interactive point cloud semantic segmentation, which assigns high-quality semantic labels to all points in a scene with user corrective clicks.

Point Cloud Segmentation Segmentation +1

Learning Expressive And Generalizable Motion Features For Face Forgery Detection

no code implementations8 Mar 2024 Jingyi Zhang, Peng Zhang, Jingjing Wang, Di Xie, ShiLiang Pu

However, current sequence-based face forgery detection methods use general video classification networks directly, which discard the special and discriminative motion information for face manipulation detection.

Anomaly Detection Classification +1

Negating Negatives: Alignment with Human Negative Samples via Distributional Dispreference Optimization

1 code implementation6 Mar 2024 Shitong Duan, Xiaoyuan Yi, Peng Zhang, Yan Liu, Zheng Liu, Tun Lu, Xing Xie, Ning Gu

Large language models (LLMs) have revolutionized the role of AI, yet pose potential social risks.

DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

no code implementations CVPR 2024 Junwen Xiong, Peng Zhang, Tao You, Chuanyue Li, Wei Huang, Yufei Zha

Audio-visual saliency prediction can draw support from diverse modality complements, but further performance enhancement is still challenged by customized architectures as well as task-specific loss functions.

Denoising Prediction +1

VN Network: Embedding Newly Emerging Entities with Virtual Neighbors

no code implementations21 Feb 2024 Yongquan He, Zihan Wang, Peng Zhang, Zhaopeng Tu, Zhaochun Ren

To address this issue, recent works apply the graph neural network on the existing neighbors of the unseen entities.

Graph Neural Network Knowledge Graph Completion +1

Frequency-aware Graph Signal Processing for Collaborative Filtering

no code implementations13 Feb 2024 Jiafeng Xia, Dongsheng Li, Hansu Gu, Tun Lu, Peng Zhang, Li Shang, Ning Gu

Graph Signal Processing (GSP) based recommendation algorithms have recently attracted lots of attention due to their high efficiency.

Collaborative Filtering

APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding

no code implementations12 Jan 2024 Mingdao Liu, Aohan Zeng, Bowen Wang, Peng Zhang, Jie Tang, Yuxiao Dong

The massive adoption of large language models (LLMs) demands efficient deployment strategies.

Re-evaluating the Memory-balanced Pipeline Parallelism: BPipe

no code implementations4 Jan 2024 Mincong Huang, Chao Wang, Chi Ma, Yineng Zhang, Peng Zhang, Lei Yu

Pipeline parallelism is an essential technique in the training of large-scale Transformer models.

Rapid Open-World Adaptation by Adaptation Principles Learning

no code implementations18 Dec 2023 Cheng Xue, Ekaterina Nikonova, Peng Zhang, Jochen Renz

This is an important characteristic of intelligent agents, as it allows them to continue to function effectively in novel or unexpected situations, but still stands as a critical challenge for deep reinforcement learning (DRL).

Deep Reinforcement Learning

Heterogeneous Graph Neural Architecture Search with GPT-4

1 code implementation14 Dec 2023 Haoyuan Dong, Yang Gao, Haishuai Wang, Hong Yang, Peng Zhang

The basic idea of GHGNAS is to design a set of prompts that can guide GPT-4 toward the task of generating new heterogeneous graph neural architectures.

Neural Architecture Search

MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing

1 code implementation12 Dec 2023 Kangneng Zhou, Daiheng Gao, Xuan Wang, Jie Zhang, Peng Zhang, Xusen Sun, Longhao Zhang, Shiqi Yang, Bang Zhang, Liefeng Bo, Yaxing Wang, Ming-Ming Cheng

This enhances mask-based editing in local areas; second, we present a novel distillation strategy: Conditional Distillation on Geometry and Texture (CDGT).

Resilience-Assuring Hydrogen-Powered Microgrids

no code implementations7 Dec 2023 Chaofan Lin, Peng Zhang, Xiaonan Lu

Green hydrogen has shown great potential to power microgrids as a primary source, yet the operation methodology under extreme events is still an open area.

Dimensionality Reduction and Dynamical Mode Recognition of Circular Arrays of Flame Oscillators Using Deep Neural Network

no code implementations5 Dec 2023 Weiming Xu, Tao Yang, Peng Zhang

Oscillatory combustion in aero engines and modern gas turbines often has significant adverse effects on their operation, and accurately recognizing various oscillation modes is the prerequisite for understanding and controlling combustion instability.

Dimensionality Reduction

VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior

no code implementations4 Dec 2023 Xusen Sun, Longhao Zhang, Hao Zhu, Peng Zhang, Bang Zhang, Xinya Ji, Kangneng Zhou, Daiheng Gao, Liefeng Bo, Xun Cao

Audio-driven talking head generation has drawn much attention in recent years, and many efforts have been made in lip-sync, expressive facial expressions, natural head pose generation, and high video quality.

Talking Head Generation

Human Still Wins over LLM: An Empirical Study of Active Learning on Domain-Specific Annotation Tasks

no code implementations16 Nov 2023 Yuxuan Lu, Bingsheng Yao, Shao Zhang, Yun Wang, Peng Zhang, Tun Lu, Toby Jia-Jun Li, Dakuo Wang

Large Language Models (LLMs) have demonstrated considerable advances, and several claims have been made about their exceeding human performance.

Active Learning

Software-Defined Virtual Synchronous Condenser

no code implementations15 Nov 2023 Zimin Jiang, Peng Zhang, Yifan Zhou, Łukasz Kocewiak, Divya Kurthakoti Chandrashekhara, Marie-Lou Picherit, Zefan Tang, Kenneth B. Bowes, Guangya Yang

Synchronous condensers (SCs) play important roles in integrating wind energy into relatively weak power grids.

A Comprehensive Summarization and Evaluation of Feature Refinement Modules for CTR Prediction

1 code implementation8 Nov 2023 Fangye Wang, Hansu Gu, Dongsheng Li, Tun Lu, Peng Zhang, Li Shang, Ning Gu

In addition, we present a new architecture of assigning independent FR modules to separate sub-networks for parallel CTR models, as opposed to the conventional method of inserting a shared FR module on top of the embedding layer.

Benchmarking Click-Through Rate Prediction

Mid-Long Term Daily Electricity Consumption Forecasting Based on Piecewise Linear Regression and Dilated Causal CNN

no code implementations23 Oct 2023 Zhou Lan, Ben Liu, Yi Feng, Danhuang Dong, Peng Zhang

This study decomposes the daily electricity consumption series into three components: trend, seasonal, and residual, and constructs a two-stage prediction method using piecewise linear regression as a filter and Dilated Causal CNN as a predictor.

Prediction regression

MuseChat: A Conversational Music Recommendation System for Videos

1 code implementation CVPR 2024 Zhikang Dong, Bin Chen, Xiulong Liu, Pawel Polak, Peng Zhang

The reasoning module, equipped with the power of a Large Language Model (Vicuna-7B) and extended to multi-modal inputs, is able to provide a reasonable explanation for the recommended music.

Language Modeling Language Modelling +3

Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering

1 code implementation10 Oct 2023 Xiulong Liu, Zhikang Dong, Peng Zhang

In recent years, there has been a growing emphasis on the intersection of audio, vision, and text modalities, driving forward the advancements in multimodal research.

Question Answering

Early Warning Prediction with Automatic Labeling in Epilepsy Patients

no code implementations9 Oct 2023 Peng Zhang, Ting Gao, Jin Guo, Jinqiao Duan, Sergey Nikolenko

Early warning for epilepsy patients is crucial for their safety and well-being, in particular to prevent or minimize the severity of seizures.

EEG Meta-Learning +1

Graph Neural Architecture Search with GPT-4

no code implementations30 Sep 2023 Haishuai Wang, Yang Gao, Xin Zheng, Peng Zhang, Hongyang Chen, Jiajun Bu, Philip S. Yu

In this paper, we integrate GPT-4 into GNAS and propose a new GPT-4 based Graph Neural Architecture Search method (GPT4GNAS for short).

Neural Architecture Search

Physics-Informed Induction Machine Modelling

no code implementations29 Sep 2023 Qing Shen, Yifan Zhou, Peng Zhang

This rapid communication devises a Neural Induction Machine (NeuIM) model, which pilots the use of physics-informed machine learning to enable AI-based electromagnetic transient simulations.

Physics-informed machine learning

Physics-Aware Neural Dynamic Equivalence of Power Systems

no code implementations29 Sep 2023 Qing Shen, Yifan Zhou, Qiang Zhang, Slava Maslennikov, Xiaochuan Luo, Peng Zhang

The contributions are threefold: (1) an ODE-Net-enabled NeuDyE formulation to enable a continuous-time, data-driven dynamic equivalence of power systems; (2) a physics-informed NeuDyE learning method (PI-NeuDyE) to actively control the closed-loop accuracy of NeuDyE without an additional verification module; (3) a physics-guided NeuDyE (PG-NeuDyE) to enhance the method's applicability even in the absence of analytical physics models.

Scalable Neural Dynamic Equivalence for Power Systems

no code implementations29 Sep 2023 Qing Shen, Yifan Zhou, Huanfeng Zhao, Peng Zhang, Qiang Zhang, Slava Maslennikov, Xiaochuan Luo

Traditional grid analytics are model-based, relying strongly on accurate models of power systems, especially the dynamic models of generators, controllers, loads and other dynamic components.

Physics-informed machine learning

3D Multiple Object Tracking on Autonomous Driving: A Literature Review

no code implementations27 Sep 2023 Peng Zhang, Xin Li, Liang He, Xin Lin

This paper undertakes a comprehensive examination, assessment, and synthesis of the research landscape in this domain, remaining attuned to the latest developments in 3D MOT while suggesting prospective avenues for future investigation.

3D Multi-Object Tracking Autonomous Driving +1

UniST: Towards Unifying Saliency Transformer for Video Saliency Prediction and Detection

no code implementations15 Sep 2023 Junwen Xiong, Peng Zhang, Chuanyue Li, Wei Huang, Yufei Zha, Tao You

While many approaches have crafted task-specific training paradigms for either video saliency prediction or video salient object detection tasks, little attention has been devoted to devising a generalized saliency modeling framework that seamlessly bridges these two distinct tasks.

Decoder object-detection +5

Segment Anything Model for Brain Tumor Segmentation

no code implementations15 Sep 2023 Peng Zhang, Yaping Wang

Glioma is a prevalent brain tumor that poses a significant health risk to individuals.

Brain Tumor Segmentation Image Segmentation +3

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning

no code implementations14 Aug 2023 Xugong Qin, Pengyuan Lyu, Chengquan Zhang, Yu Zhou, Kun Yao, Peng Zhang, Hailun Lin, Weiping Wang

Different from existing methods which integrate multiple-granularity features or multiple outputs, we resort to the perspective of representation learning in which auxiliary tasks are utilized to enable the encoder to jointly learn robust features with the main task of per-pixel classification during optimization.

Representation Learning Scene Text Detection +1

AutoSeqRec: Autoencoder for Efficient Sequential Recommendation

1 code implementation14 Aug 2023 Sijia Liu, Jiahao Liu, Hansu Gu, Dongsheng Li, Tun Lu, Peng Zhang, Ning Gu

Sequential recommendation demonstrates the capability to recommend items by modeling the sequential behavior of users.

Collaborative Filtering Computational Efficiency +1

Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization

1 code implementation9 Aug 2023 Tianyu Liu, Peng Zhang, Wei Huang, Yufei Zha, Tao You, Yanning Zhang

By decoupling the gradients of visual and audio modalities, the discriminative visual representations of sound sources can be learned with the designed Induction Vector in a bootstrap manner, which also enables the audio modality to be aligned with the visual modality consistently.

Contrastive Learning Sound Source Localization

Recommendation Unlearning via Matrix Correction

no code implementations29 Jul 2023 Jiahao Liu, Dongsheng Li, Hansu Gu, Tun Lu, Jiongran Wu, Peng Zhang, Li Shang, Ning Gu

We conducted comprehensive experiments to validate the effectiveness of IMCorrect and the results demonstrate that IMCorrect is superior in completeness, utility, and efficiency, and is applicable in many recommendation unlearning scenarios.

Collaborative Filtering Recommendation Systems

Improving Semi-Supervised Semantic Segmentation with Dual-Level Siamese Structure Network

1 code implementation26 Jul 2023 Zhibo Tian, Xiaolin Zhang, Peng Zhang, Kun Zhan

Semi-supervised semantic segmentation (SSS) is an important task that utilizes both labeled and unlabeled data to reduce expenses on labeling training examples.

Contrastive Learning Pseudo Label +1

Entropy Neural Estimation for Graph Contrastive Learning

1 code implementation26 Jul 2023 Yixuan Ma, Xiaolin Zhang, Peng Zhang, Kun Zhan

In this paper, we theoretically illustrate that the entropy of a dataset can be approximated by maximizing the lower bound of the mutual information across different views of a graph, i.e., entropy is estimated by a neural network.

Contrastive Learning

Learning Stochastic Dynamical Systems as an Implicit Regularization with Graph Neural Networks

no code implementations12 Jul 2023 Jin Guo, Ting Gao, Yufu Lan, Peng Zhang, Sikun Yang, Jinqiao Duan

To that end, the observed randomness and spatial correlations are captured by learning the drift and diffusion terms of the stochastic differential equation with a Gumbel matrix embedding, respectively.

Time Series

FTFDNet: Learning to Detect Talking Face Video Manipulation with Tri-Modality Interaction

no code implementations8 Jul 2023 Ganglai Wang, Peng Zhang, Junwen Xiong, Feihan Yang, Wei Huang, Yufei Zha

DeepFake-based digital facial forgery is threatening public media security, especially when lip manipulation is used in talking face generation, which further increases the difficulty of fake video detection.

Face Detection Face Swapping +2

VisKoP: Visual Knowledge oriented Programming for Interactive Knowledge Base Question Answering

no code implementations6 Jul 2023 Zijun Yao, Yuanyong Chen, Xin Lv, Shulin Cao, Amy Xin, Jifan Yu, Hailong Jin, Jianjun Xu, Peng Zhang, Lei Hou, Juanzi Li

We present the Visual Knowledge oriented Programming platform (VisKoP), a knowledge base question answering (KBQA) system that integrates humans into the loop to edit and debug the knowledge base (KB) queries.

Knowledge Base Question Answering Program induction +2

MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction

2 code implementations17 Jun 2023 Limeng Qiao, Yongchao Zheng, Peng Zhang, Wenjie Ding, Xi Qiu, Xing Wei, Chi Zhang

This report introduces the 1st place winning solution for the Autonomous Driving Challenge 2023 - Online HD-map Construction.

Autonomous Driving Decoder

Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method

1 code implementation11 Jun 2023 Shicheng Tan, Weng Lam Tam, Yuanchun Wang, Wenwen Gong, Shu Zhao, Peng Zhang, Jie Tang

To address these problems, we propose a general language model distillation (GLMD) method that performs two-stage word prediction distillation and vocabulary compression, which is simple and surprisingly shows extremely strong performance.

Knowledge Distillation Language Modeling +1

GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

1 code implementation11 Jun 2023 Shicheng Tan, Weng Lam Tam, Yuanchun Wang, Wenwen Gong, Yang Yang, Hongyin Tang, Keqing He, Jiahao Liu, Jingang Wang, Shu Zhao, Peng Zhang, Jie Tang

Currently, the reduction in the parameter scale of large-scale pre-trained language models (PLMs) through knowledge distillation has greatly facilitated their widespread deployment on various devices.

General Knowledge Knowledge Distillation +2

Revisiting Acceptability Judgements

1 code implementation23 May 2023 Hai Hu, Ziyin Zhang, Weifang Huang, Jackie Yan-Ki Lai, Aini Li, Yina Patterson, Jiahui Huang, Peng Zhang, Chien-Jer Charles Lin, Rui Wang

We introduce CoLAC - Corpus of Linguistic Acceptability in Chinese, the first large-scale acceptability dataset for a non-Indo-European language.

Cross-Lingual Transfer Linguistic Acceptability

Binary stochasticity enabled highly efficient neuromorphic deep learning achieves better-than-software accuracy

no code implementations25 Apr 2023 Yang Li, Wei Wang, Ming Wang, Chunmeng Dou, Zhengyu Ma, Huihui Zhou, Peng Zhang, Nicola Lepri, Xumeng Zhang, Qing Luo, Xiaoxin Xu, Guanhua Yang, Feng Zhang, Ling Li, Daniele Ielmini, Ming Liu

We propose a binary stochastic learning algorithm that modifies all elementary neural network operations, by introducing (i) stochastic binarization of both the forwarding signals and the activation function derivatives, (ii) signed binarization of the backpropagating errors, and (iii) step-wise weight updates.

Binarization Deep Learning

Triple Structural Information Modelling for Accurate, Explainable and Interactive Recommendation

no code implementations23 Apr 2023 Jiahao Liu, Dongsheng Li, Hansu Gu, Tun Lu, Peng Zhang, Li Shang, Ning Gu

Specifically, TriSIM4Rec consists of 1) a dynamic ideal low-pass graph filter to dynamically mine co-occurrence information in user-item interactions, which is implemented by incremental singular value decomposition (SVD); 2) a parameter-free attention module to capture sequential information of user interactions effectively and efficiently; and 3) an item transition matrix to store the transition probabilities of item pairs.

Collaborative Filtering Interactive Recommendation

A Measurement-Based Quantum-Like Language Model for Text Matching

no code implementations Conference 2023 Wantong Zhang, Guobing Gan, Hui Gao, Peng Zhang, Wenjie Hui, Zipeng Fan

We take the word density matrix in one sentence as a set of measurement operators to measure another sentence, which is consistent with the definition of measurement operators in quantum theory and has a specific semantic interpretation.

Language Modeling Language Modelling +2

CASP-Net: Rethinking Video Saliency Prediction from an Audio-Visual Consistency Perceptual Perspective

no code implementations11 Mar 2023 Junwen Xiong, Ganglai Wang, Peng Zhang, Wei Huang, Yufei Zha, Guangtao Zhai

Incorporating the audio stream enables Video Saliency Prediction (VSP) to imitate the selective attention mechanism of the human brain.

Decoder Saliency Prediction +1

NovPhy: A Testbed for Physical Reasoning in Open-world Environments

1 code implementation3 Mar 2023 Chathura Gamage, Vimukthini Pinto, Cheng Xue, Peng Zhang, Ekaterina Nikonova, Matthew Stephenson, Jochen Renz

But is it enough to only have physical reasoning capabilities to operate in a real physical environment?

A network-based biomarkers discovery of Cold/Hot ZHENG chronic gastritis and Cold/Hot herbs of formulae

no code implementations10 Feb 2023 Boyang Wang, Pan Chen, Peng Zhang, Shao Li

We collected 25 formulae (with traditional effects related to Cold/Hot ZHENG) for CG and the corresponding 89 Cold/Hot herbs (including Warm/Cool herbs) to discover features and construct target networks of Cold/Hot herbs on the basis of network target and enrichment analysis.

Personalized Graph Signal Processing for Collaborative Filtering

no code implementations4 Feb 2023 Jiahao Liu, Dongsheng Li, Hansu Gu, Tun Lu, Peng Zhang, Li Shang, Ning Gu

However, the interaction signal may not be sufficient to accurately characterize user interests and the low-pass filters may ignore the useful information contained in the high-frequency component of the observed signals, resulting in suboptimal accuracy.

Collaborative Filtering

ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image Editing

no code implementations31 Jan 2023 Bingchuan Li, Tianxiang Ma, Peng Zhang, Miao Hua, Wei Liu, Qian He, Zili Yi

Specifically, in Phase I, a W-space-oriented StyleGAN inversion network is trained and used to perform image inversion and editing, which assures the editability but sacrifices reconstruction quality.

Image Generation

Model Based Reinforcement Learning with Non-Gaussian Environment Dynamics and its Application to Portfolio Optimization

no code implementations23 Jan 2023 Huifang Huang, Ting Gao, Pengbo Li, Jin Guo, Peng Zhang, Nan Du

With the fast development of quantitative portfolio optimization in financial engineering, lots of AI-based algorithmic trading strategies have demonstrated promising results, among which reinforcement learning begins to manifest competitive advantages.

Algorithmic Trading Decision Making +5

GIPA: A General Information Propagation Algorithm for Graph Learning

1 code implementation19 Jan 2023 Houyi Li, Zhihong Chen, Zhao Li, Qinkai Zheng, Peng Zhang, Shuigeng Zhou

Specifically, the bit-wise correlation calculates the element-wise attention weight through a multi-layer perceptron (MLP) based on the dense representations of two nodes and their edge; the feature-wise correlation is based on the one-hot representations of node attribute features for feature selection.

Attribute feature selection +3

CL4CTR: A Contrastive Learning Framework for CTR Prediction

1 code implementation1 Dec 2022 Fangye Wang, Yingxu Wang, Dongsheng Li, Hansu Gu, Tun Lu, Peng Zhang, Ning Gu

Many Click-Through Rate (CTR) prediction works focused on designing advanced architectures to model complex feature interactions but neglected the importance of feature representation learning, e.g., adopting a plain embedding layer for each feature, which results in sub-optimal feature representations and thus inferior CTR prediction performance.

Click-Through Rate Prediction Contrastive Learning +4

AS-PD: An Arbitrary-Size Downsampling Framework for Point Clouds

no code implementations2 Nov 2022 Peng Zhang, Ruoyin Xie, Jinsheng Sun, Weiqing Li, Zhiyong Su

Given an input point cloud of arbitrary size, we first perform a task-agnostic pre-sampling on the input point cloud to a specified sample size.

Parameter-free Dynamic Graph Embedding for Link Prediction

1 code implementation15 Oct 2022 Jiahao Liu, Dongsheng Li, Hansu Gu, Tun Lu, Peng Zhang, Ning Gu

Dynamic interaction graphs have been widely adopted to model the evolution of user-item interactions over time.

Attribute Dynamic graph embedding +2

DART: Articulated Hand Model with Diverse Accessories and Rich Textures

1 code implementation14 Oct 2022 Daiheng Gao, Yuliang Xiu, Kailin Li, Lixin Yang, Feng Wang, Peng Zhang, Bang Zhang, Cewu Lu, Ping Tan

A Unity GUI is also provided to generate synthetic hand data with user-defined settings, e.g., pose, camera, background, lighting, textures, and accessories.

Diversity Hand Pose Estimation +1

Multi-Scale Wavelet Transformer for Face Forgery Detection

no code implementations8 Oct 2022 Jie Liu, Jingjing Wang, Peng Zhang, Chunmao Wang, Di Xie, ShiLiang Pu

To overcome these limitations, we propose a multi-scale wavelet transformer framework for face forgery detection.

Enhanced Secure Wireless Transmission Using IRS-aided Directional Modulation

no code implementations29 Sep 2022 Yeqing Lin, Rongen Dong, Peng Zhang, Feng Shu, Jiangzhou Wang

To reduce the computational complexity, a new method of maximizing receive power with zero-forcing constraint (Max-RP-ZFC) of only reflecting CM and no AN is proposed.

Multi-frequency PolSAR Image Fusion Classification Based on Semantic Interactive Information and Topological Structure

no code implementations5 Sep 2022 Yice Cao, Yan Wu, Ming Li, Mingjie Zheng, Peng Zhang, Jili Wang

Finally, an adaptive weighting fusion (AWF) strategy is proposed to merge inference from different bands, so as to make the MF joint classification decisions of SIC and TPC.

Classification image-classification +2

Neuro-Dynamic State Estimation for Networked Microgrids

no code implementations25 Aug 2022 Fei Feng, Yifan Zhou, Peng Zhang

We devise neuro-dynamic state estimation (Neuro-DSE), a learning-based dynamic state estimation (DSE) algorithm for networked microgrids (NMs) under unknown subsystems.

State Estimation

AMinerGNN: Heterogeneous Graph Neural Network for Paper Click-through Rate Prediction with Fusion Query

no code implementations15 Aug 2022 Zepeng Huai, Zhe Wang, Yifan Zhu, Peng Zhang

Paper recommendation with user-generated keywords aims to suggest papers that simultaneously meet the user's interests and are relevant to the input keyword.

Click-Through Rate Prediction Graph Neural Network +1

Subgraph Neighboring Relations Infomax for Inductive Link Prediction on Knowledge Graphs

1 code implementation28 Jul 2022 Xiaohan Xu, Peng Zhang, Yongquan He, Chengpeng Chao, Chaoyang Yan

Inductive link prediction for knowledge graphs aims at predicting missing links between unseen entities, i.e., those not seen during the training stage.

Inductive Link Prediction Knowledge Graphs

Measuring Difficulty of Novelty Reaction

no code implementations28 Jul 2022 Ekaterina Nikonova, Cheng Xue, Vimukthini Pinto, Chathura Gamage, Peng Zhang, Jochen Renz

In this paper, we propose to define the novelty reaction difficulty as a relative difficulty of performing the known task after the introduction of the novelty.

TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents

no code implementations14 Jul 2022 Zhanzhan Cheng, Peng Zhang, Can Li, Qiao Liang, Yunlu Xu, Pengfei Li, ShiLiang Pu, Yi Niu, Fei Wu

Most existing methods divide this task into two subparts: the text reading part for obtaining the plain text from the original document images and the information extraction part for extracting key contents.

global-optimization Language Modelling

A Framework Based on Generational and Environmental Response Strategies for Dynamic Multi-objective Optimization

no code implementations6 Jul 2022 Qingya Li, Xiangzhi Liu, Fuqiang Wang, Shuai Wang, Peng Zhang, Xiaoming Wu

In this paper, a novel framework based on generational and environmental response strategies (FGERS) is proposed, in which response strategies are run both in the environmental change stage and the environmental static stage to obtain population evolution information from both stages.

Resilience in Industrial Internet of Things Systems: A Communication Perspective

no code implementations1 Jun 2022 Hao Wu, Yifan Miao, Peng Zhang, Yang Tian, Hui Tian

Industrial Internet of Things is an ultra-large-scale system that is much more sophisticated and fragile than conventional industrial platforms.

Management

Measurement of carbon finance level and exploration of its influencing factors

no code implementations1 Jun 2022 Peng Zhang, Yuwei Zhang, Nuo Xu

Faced with increasingly severe environmental problems, carbon trading markets and related financial activities aiming at limiting carbon dioxide emissions are booming.

Stock Trading Optimization through Model-based Reinforcement Learning with Resistance Support Relative Strength

no code implementations30 May 2022 Huifang Huang, Ting Gao, Yi Gui, Jin Guo, Peng Zhang

Reinforcement learning (RL) is gaining attention from more and more researchers in quantitative finance, as the agent-environment interaction framework is aligned with the decision-making process in many business problems.

Decision Making Model-based Reinforcement Learning +3

AI-aided multiscale modeling of physiologically-significant blood clots

no code implementations25 May 2022 Yicong Zhu, Changnian Han, Peng Zhang, Guojing Cong, James R. Kozloski, Chih-Chieh Yang, Leili Zhang, Yuefan Deng

We have developed an AI-aided multiple time stepping (AI-MTS) algorithm and multiscale modeling framework (AI-MSM) and implemented them on the Summit-like supercomputer, AIMOS.

Rapid Phase Ambiguity Elimination Methods for DOA Estimator via Hybrid Massive MIMO Receive Array

no code implementations27 Apr 2022 Xichao Zhan, YiWen Chen, Feng Shu, Xin Cheng, Yuanyuan Wu, Qi Zhang, Yifang Li, Peng Zhang

In the proposed Max-RP-QI, a quadratic interpolation scheme is adopted to interpolate the three DOA values corresponding to the largest three receive powers of Max-RP.

Enhancing CTR Prediction with Context-Aware Feature Representation Learning

1 code implementation19 Apr 2022 Fangye Wang, Yingxu Wang, Dongsheng Li, Hansu Gu, Tun Lu, Peng Zhang, Ning Gu

However, most methods only learn a fixed representation for each feature without considering the varying importance of each feature under different contexts, resulting in inferior performance.

Click-Through Rate Prediction Representation Learning

A Multi-Metric Latent Factor Model for Analyzing High-Dimensional and Sparse data

no code implementations16 Apr 2022 Di Wu, Peng Zhang, Yi He, Xin Luo

High-dimensional and sparse (HiDS) matrices are omnipresent in a variety of big data-related applications.

Representation Learning

Personalized Image Aesthetics Assessment with Rich Attributes

no code implementations CVPR 2022 Yuzhe Yang, Liwu Xu, Leida Li, Nan Qie, Yaqian Li, Peng Zhang, Yandong Guo

To solve the dilemma, we conduct the most comprehensive subjective study of personalized image aesthetics so far and introduce a new Personalized image Aesthetics database with Rich Attributes (PARA), which consists of 31,220 images with annotations by 438 subjects.

An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection

no code implementations10 Mar 2022 Ganglai Wang, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha, Yanning Zhang

DeepFake-based digital facial forgery is threatening public media security, especially when lip manipulation is used in talking face generation, which further increases the difficulty of fake video detection.

Decision Making Face Detection +2

Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild

no code implementations8 Mar 2022 Ganglai Wang, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha

Rather than focusing on the unimportant regions of the face image, the proposed AttnWav2Lip model is able to pay more attention to the lip region reconstruction.

Talking Face Generation

Audio-visual speech separation based on joint feature representation with cross-modal attention

no code implementations5 Mar 2022 Junwen Xiong, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha, Yanning Zhang

Multi-modal based speech separation has exhibited a specific advantage in isolating the target character in multi-talker noisy environments.

Optical Flow Estimation Speech Separation

Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement

1 code implementation4 Mar 2022 Junwen Xiong, Yu Zhou, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha

Active speaker detection and speech enhancement have become two increasingly attractive topics in audio-visual scenario understanding.

Active Speaker Detection Multi-Task Learning +1

Absolute Zero-Shot Learning

1 code implementation23 Feb 2022 Rui Gao, Fan Wan, Daniel Organisciak, Jiyao Pu, Junyan Wang, Haoran Duan, Peng Zhang, Xingsong Hou, Yang Long

Considering the increasing concerns about data copyright and privacy issues, we present a novel Absolute Zero-Shot Learning (AZSL) paradigm, i.e., training a classifier with zero real data.

Transfer Learning Zero-Shot Learning

Learning Multiple Explainable and Generalizable Cues for Face Anti-spoofing

no code implementations21 Feb 2022 Ying Bian, Peng Zhang, Jingjing Wang, Chunmao Wang, ShiLiang Pu

However, many other generalizable cues are unexplored for face anti-spoofing, which limits their performance under cross-dataset testing.

Face Anti-Spoofing

AD-NEGF: An End-to-End Differentiable Quantum Transport Simulator for Sensitivity Analysis and Inverse Problems

no code implementations 10 Feb 2022 Yingzhanghao Zhou, Xiang Chen, Peng Zhang, Jun Wang, Lei Wang, Hong Guo

Since it was proposed in the 1970s, the Non-Equilibrium Green Function (NEGF) method has been recognized as a standard approach to quantum transport simulations.

Automated assessment of disease severity of COVID-19 using artificial intelligence with synthetic chest CT

no code implementations 11 Dec 2021 Mengqiu Liu, Ying Liu, Yidong Yang, Aiping Liu, Shana Li, Changbing Qu, Xiaohui Qiu, Yang Li, Weifu Lv, Peng Zhang, Jie Wen

Correlations between imaging findings and clinical lab tests suggested the value of this system as a potential tool to assess disease severity of COVID-19.

Data Augmentation Lesion Segmentation

Modeling Variable Space with Residual Tensor Networks for Multivariate Time Series

no code implementations 29 Sep 2021 Jing Zhang, Peng Zhang, Yupeng He, Siwei Rao, Jun Wang, Guangjian Tian

In this framework, we derive the mathematical representation of the variable space, and then use a tensor network based on the idea of low-rank approximation to model the variable space.

Multivariate Time Series Forecasting Tensor Networks +2

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

1 code implementation 22 Sep 2021 Bingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Qian He, Miao Hua, Zili Yi

To address these limitations, we design a Dynamic Style Manipulation Network (DyStyle) whose structure and parameters vary by input samples, to perform nonlinear and adaptive manipulation of latent codes for flexible and precise attribute control.

Attribute Contrastive Learning

Phy-Q as a measure for physical reasoning intelligence

1 code implementation 31 Aug 2021 Cheng Xue, Vimukthini Pinto, Chathura Gamage, Ekaterina Nikonova, Peng Zhang, Jochen Renz

Inspired by how human IQ is calculated, we define the physical reasoning quotient (Phy-Q score) that reflects the physical reasoning intelligence of an agent using the physical scenarios we considered.

Graph Contrastive Learning for Anomaly Detection

2 code implementations 17 Aug 2021 Bo Chen, Jing Zhang, Xiaokang Zhang, Yuxiao Dong, Jian Song, Peng Zhang, Kaibo Xu, Evgeny Kharlamov, Jie Tang

To achieve the contrastive objective, we design a graph neural network encoder that can infer and further remove suspicious links during message passing, as well as learn the global context of the input graph.

Anomaly Detection Binary Classification +3

Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking

1 code implementation 31 Jul 2021 Jingxian Sun, Lichao Zhang, Yufei Zha, Abel Gonzalez-Garcia, Peng Zhang, Wei Huang, Yanning Zhang

To solve this problem, we propose to distill representations of the TIR modality from the RGB modality with Cross-Modal Distillation (CMD) on a large amount of unlabeled paired RGB-TIR data.

Transfer Learning

Hi-Phy: A Benchmark for Hierarchical Physical Reasoning

1 code implementation 17 Jun 2021 Cheng Xue, Vimukthini Pinto, Chathura Gamage, Peng Zhang, Jochen Renz

In this paper, we propose a new benchmark for physical reasoning that allows us to test individual physical reasoning capabilities.

WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments

1 code implementation 13 Jun 2021 Yunzhe Hao, Jiaming Xu, Peng Zhang, Bo Xu

In the speaker extraction problem, additional information from the target speaker, including voiceprint, lip movement, facial expression, and spatial information, is found to contribute to tracking and extracting the target speaker.

Action Detection Activity Detection

Path-based Deep Network for Candidate Item Matching in Recommenders

no code implementations 18 May 2021 Houyi Li, Zhihong Chen, Chenliang Li, Rong Xiao, Hongbo Deng, Peng Zhang, Yongchao Liu, Haihong Tang

PDN utilizes Trigger Net to capture the user's interest in each of his/her interacted items, and Similarity Net to evaluate the similarity between each interacted item and the target item based on these items' profile and CF information.

Diversity Recommendation Systems +1

GIPA: General Information Propagation Algorithm for Graph Learning

2 code implementations 13 May 2021 Qinkai Zheng, Houyi Li, Peng Zhang, Zhixiong Yang, Guowei Zhang, Xintan Zeng, Yongchao Liu

Graph neural networks (GNNs) have been widely used in analyzing graph-structured data, showing promising results in various applications such as node classification, link prediction and network recommendation.

Graph Attention Graph Learning +2

LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment

2 code implementations 13 May 2021 Liang Qiao, Zaisheng Li, Zhanzhan Cheng, Peng Zhang, ShiLiang Pu, Yi Niu, Wenqi Ren, Wenming Tan, Fei Wu

In this paper, we aim to obtain more reliable aligned bounding boxes by fully utilizing the visual information from both text regions in proposed local features and cell relations in global features.

Table Recognition

A Feature Fusion-Net Using Deep Spatial Context Encoder and Nonstationary Joint Statistical Model for High Resolution SAR Image Classification

no code implementations 11 May 2021 Wenkai Liang, Yan Wu, Ming Li, Peng Zhang, Yice Cao, Xin Hu

To address this problem, a novel end-to-end supervised classification method is proposed for HR SAR images by considering both spatial context and statistical features.

Image Classification

GraphTheta: A Distributed Graph Neural Network Learning System With Flexible Training Strategy

1 code implementation 21 Apr 2021 Yongchao Liu, Houyi Li, Guowei Zhang, Xintan Zeng, Yongyong Li, Bin Huang, Peng Zhang, Zhao Li, Xiaowei Zhu, Changhua He, WenGuang Chen

Herein, we present GraphTheta, the first distributed and scalable graph learning system built upon vertex-centric distributed graph processing with neural network operators implemented as user-defined functions.

Graph Learning Graph Neural Network

Noise-Resilient Quantum Machine Learning for Stability Assessment of Power Systems

no code implementations 10 Apr 2021 Yifan Zhou, Peng Zhang

Transient stability assessment (TSA) is a cornerstone for resilient operations of today's interconnected power grids.

BIG-bench Machine Learning Quantum Machine Learning

A review of artificial intelligence methods combined with Raman spectroscopy to identify the composition of substances

no code implementations 5 Apr 2021 Liangrui Pan, Peng Zhang, Chalongrat Daengngam, Mitchai Chongcheawchamnan

This review summarizes the work of Raman spectroscopy in identifying the composition of substances and reviews the preprocessing process of Raman spectroscopy, the analysis methods and applications of artificial intelligence.

OAG-BERT: Towards A Unified Backbone Language Model For Academic Knowledge Services

1 code implementation 3 Mar 2021 Xiao Liu, Da Yin, Jingnan Zheng, Xingjian Zhang, Peng Zhang, Hongxia Yang, Yuxiao Dong, Jie Tang

Academic knowledge services have substantially facilitated the development of the science enterprise by providing a plenitude of efficient research tools.

Language Modeling Language Modelling +1

CogDL: A Comprehensive Library for Graph Deep Learning

1 code implementation 1 Mar 2021 Yukuo Cen, Zhenyu Hou, Yan Wang, Qibin Chen, Yizhen Luo, Zhongming Yu, Hengrui Zhang, Xingcheng Yao, Aohan Zeng, Shiguang Guo, Yuxiao Dong, Yang Yang, Peng Zhang, Guohao Dai, Yu Wang, Chang Zhou, Hongxia Yang, Jie Tang

In CogDL, we propose a unified design for the training and evaluation of GNN models for various graph tasks, making it unique among existing graph learning libraries.

Deep Learning Graph Classification +6

D2A U-Net: Automatic Segmentation of COVID-19 Lesions from CT Slices with Dilated Convolution and Dual Attention Mechanism

1 code implementation 10 Feb 2021 Xiangyu Zhao, Peng Zhang, Fan Song, Guangda Fan, Yangyang Sun, Yujia Wang, Zheyuan Tian, Luqi Zhang, Guanglei Zhang

In this paper we propose a dilated dual attention U-Net (D2A U-Net) for COVID-19 lesion segmentation in CT slices based on dilated convolution and a novel dual attention mechanism to address the issues above.

Computed Tomography (CT) Decoder +3

Neuro-Reachability of Networked Microgrids

no code implementations 13 Jan 2021 Yifan Zhou, Peng Zhang

A neural ordinary differential equations network (ODE-Net)-enabled reachability method (Neuro-Reachability) is devised for the dynamic verification of networked microgrids (NMs) with unidentified subsystems and heterogeneous uncertainties.

Model Discovery

TextTN: Probabilistic Encoding of Language on Tensor Network

no code implementations 1 Jan 2021 Peng Zhang, Jing Zhang, Xindian Ma, Siwei Rao, Guangjian Tian, Jun Wang

As a novel model that bridges machine learning and quantum theory, tensor network (TN) has recently gained increasing attention and successful applications for processing natural images.

General Classification Sentence +4

Fluctuations in crystalline plasticity

no code implementations 23 Dec 2020 Jérôme Weiss, Peng Zhang, Oguz Umut Salman, Gang Liu, Lev Truskinovsky

We link this new size effect with other related phenomena like size dependence of strength ("smaller is stronger") and the size induced switch between different hardening mechanisms.

Mesoscale and Nanoscale Physics Materials Science Statistical Mechanics Computational Physics

Residual Matrix Product State for Machine Learning

no code implementations 22 Dec 2020 Ye-Ming Meng, Jing Zhang, Peng Zhang, Chao GAO, Shi-Ju Ran

Tensor network, which originates from quantum physics, is emerging as an efficient tool for classical and quantum machine learning.

BIG-bench Machine Learning Quantum Machine Learning +1
