Search Results for author: Wei Wang

Found 704 papers, 229 papers with code

A Bayesian Topic Model for Human-Evaluated Interpretability

no code implementations LREC 2022 Justin Wood, Corey Arnold, Wei Wang

Given a nonparametric topic model, we can include weakly-supervised input using novel modifications to the nonparametric generative model.

Topic Models

PALM: Pre-training an Autoencoding\&Autoregressive Language Model for Context-conditioned Generation

no code implementations EMNLP 2020 Bin Bi, Chenliang Li, Chen Wu, Ming Yan, Wei Wang, Songfang Huang, Fei Huang, Luo Si

An extensive set of experiments show that PALM achieves new state-of-the-art results on a variety of language generation benchmarks covering generative question answering (Rank 1 on the official MARCO leaderboard), abstractive summarization on CNN/DailyMail as well as Gigaword, question generation on SQuAD, and conversational response generation on Cornell Movie Dialogues.

Abstractive Text Summarization Conversational Response Generation +8

Schrodinger's Memory: Large Language Models

no code implementations16 Sep 2024 Wei Wang, Qing Li

So, what is the underlying mechanism of this memory?

AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions

no code implementations3 Sep 2024 Chenghao Qian, Mahdi Rezaei, Saeed Anwar, Wenjing Li, Tanveer Hussain, Mohsen Azarmi, Wei Wang

AllWeather-Net effectively transforms images into normal weather and daytime scenes, demonstrating superior image enhancement results and subsequently enhancing the performance of semantic segmentation, with up to a 5. 3% improvement in mIoU in the trained domain.

Autonomous Driving Image Enhancement +2

PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View

no code implementations29 Aug 2024 Zichen Yu, Quanli Liu, Wei Wang, Liyong Zhang, Xiaoguang Zhao

In this paper, in order to adapt the image information distribution and preserve the view symmetry by regular convolution, we propose to employ the polar BEV representation to substitute the Cartesian BEV representation.

3D Object Detection Autonomous Driving +2

GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content

no code implementations29 Aug 2024 Lebin Zhou, Kun Han, Nam Ling, Wei Wang, Wei Jiang

The first is super-resolution with deferred rendering, to support the gaming solution of rendering and transferring LR images only and restoring HR images on the client side.

Image Generation Image Restoration +2

BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems

no code implementations28 Aug 2024 Wei Wang, Dan Zhang, Tao Feng, Boyan Wang, Jie Tang

Compared to single agents, multi-agent systems have higher requirements for the collaboration capabilities of language models.

Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion

1 code implementation25 Aug 2024 Xu Zhang, Zhipeng Xie, Haiyang Yu, Qitong Wang, Peng Wang, Wei Wang

Based on this observation, we introduce the Collaborative Decision Making (CDM) module, which fuses the multiple classifier heads to enhance the inference performance of adaptive deep networks.

Decision Making Image Classification

TVG: A Training-free Transition Video Generation Method with Diffusion Models

no code implementations24 Aug 2024 Rui Zhang, Yaosen Chen, Yuegen Liu, Wei Wang, Xuming Wen, Hongxia Wang

Transition videos play a crucial role in media production, enhancing the flow and coherence of visual narratives.

GPR Video Generation

HabitAction: A Video Dataset for Human Habitual Behavior Recognition

no code implementations24 Aug 2024 Hongwu Li, Zhenliang Zhang, Wei Wang

Experimental results demonstrate that our proposed method has much better performance in action recognition than the existing methods on the proposed dataset.

Action Recognition Temporal Action Localization

ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding

no code implementations21 Aug 2024 Yijia Xiao, Edward Sun, Yiqiao Jin, Qifan Wang, Wei Wang

Understanding biological processes, drug development, and biotechnological advancements requires detailed analysis of protein structures and sequences, a task in protein research that is inherently complex and time-consuming when performed manually.

Language Modelling Large Language Model +1

BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation

1 code implementation21 Aug 2024 Haotian Peng, Jiawei Liu, Jinsong Du, Jie Gao, Wei Wang

This involves adaptively sampling the vibration signals based on the sampling rate of the sensor, incorporating the frequency domain to unify input dimensions, and using a fault-free reference signal as an auxiliary input.

Management

Security Attacks on LLM-based Code Completion Tools

1 code implementation20 Aug 2024 Wen Cheng, Ke Sun, Xinyu Zhang, Wei Wang

The rapid development of large language models (LLMs) has significantly advanced code completion capabilities, giving rise to a new generation of LLM-based Code Completion Tools (LCCTs).

Code Completion

Language-Driven Interactive Shadow Detection

1 code implementation16 Aug 2024 Hongqiu Wang, Wei Wang, Haipeng Zhou, Huihui Xu, Shaozhi Wu, Lei Zhu

Based on this dataset, we propose a Referring Shadow-Track Memory Network (RSM-Net) for addressing the RVSD task.

Descriptive Shadow Detection +2

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization

no code implementations14 Aug 2024 Yuxin Jiang, Bo Huang, YuFei Wang, Xingshan Zeng, Liangyou Li, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Wei Wang

Direct preference optimization (DPO), a widely adopted offline preference optimization algorithm, aims to align large language models (LLMs) with human-desired behaviors using pairwise preference data.

Informativeness Instruction Following +1

Spectrum Prediction With Deep 3D Pyramid Vision Transformer Learning

1 code implementation13 Aug 2024 Guangliang Pan, Qihui Wu, Bo Zhou, Jie Li, Wei Wang, Guoru Ding, David K. Y. Yau

Based on the Deep- SPred, we first propose a novel 3D spectrum prediction method combining a flow processing strategy with 3D vision Transformer (ViT, i. e., Swin) and a pyramid to serve possible applications such as spectrum monitoring task, named 3D-SwinSTB.

Transfer Learning

CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization

no code implementations13 Aug 2024 Wei Peng, Junmei Ding, Wei Wang, Lei Cui, Wei Cai, Zhiyu Hao, Xiaochun Yun

Cyber Threat Intelligence (CTI) summarization task requires the system to generate concise and accurate highlights from raw intelligence data, which plays an important role in providing decision-makers with crucial information to quickly detect and respond to cyber threats in the cybersecurity domain.

Abstractive Text Summarization

Inverse designing metamaterials with programmable nonlinear functional responses in graph space

no code implementations12 Aug 2024 Marco Maurizi, Derek Xu, Yu-Tong Wang, Desheng Yao, David Hahn, Mourad Oudich, Anish Satpati, Mathieu Bauchy, Wei Wang, Yizhou Sun, Yun Jing, Xiaoyu Rayne Zheng

Material responses to static and dynamic stimuli, represented as nonlinear curves, are design targets for engineering functionalities like structural support, impact protection, and acoustic and photonic bandgaps.

Composite Learning Adaptive Control without Excitation Condition

no code implementations3 Aug 2024 Jiajun Shen, Wei Wang, Changyun Wen, Jinhu Lu

In comparison to the existing results, all spectrums of previously appeared excitation information are collected, with the matrices in linear regression equation guaranteed to be bounded.

regression

Adaptive Safety with Control BarrierFunctions and Triggered Batch Least-Squares Identifier

no code implementations3 Aug 2024 Jiajun Shen, Wei Wang, Jing Zhou, Jinhu Lu

In this paper, a triggered Batch Least-Squares Identifier (BaLSI) based adaptive safety control scheme is proposed for uncertain systems with potentially conflicting control objectives and safety constraints.

Universal Approximation Theory: Foundations for Parallelism in Neural Networks

no code implementations31 Jul 2024 Wei Wang, Qing Li

Neural networks are increasingly evolving towards training large models with big data, a method that has demonstrated superior performance across many tasks.

Deep Learning-Based Longitudinal Prediction of Childhood Myopia Progression Using Fundus Image Sequences and Baseline Refraction Data

no code implementations31 Jul 2024 Mengtian Kang, Yansong Hu, Shuo Gao, Yuanyuan Liu, Hongbei Meng, Xuemeng Li, Xuhang Chen, Hubin Zhao, Jing Fu, Guohua Hu, Wei Wang, Yanning Dai, Arokia Nathan, Peter Smielewski, Ningli Wang, Shiming Li

In this study, we introduce a novel, high-accuracy method for quantitatively predicting the myopic trajectory and myopia risk in children using only fundus images and baseline refraction data.

DFE-IANet: A Method for Polyp Image Classification Based on Dual-domain Feature Extraction and Interaction Attention

no code implementations30 Jul 2024 Wei Wang, Jixing He, Xin Wang

This challenge is mainly attributed to the fact that polyps are similar to other pathologies and have complex features influenced by texture, color, and morphology.

Image Classification

Improved Bounds for Pure Private Agnostic Learning: Item-Level and User-Level Privacy

no code implementations30 Jul 2024 Bo Li, Wei Wang, Peng Ye

Machine Learning has made remarkable progress in a wide range of fields.

Discrete Spectrum Analysis of Vector OFDM Signals

no code implementations28 Jul 2024 Xiang-Gen Xia, Wei Wang

We obtain a linear relationship between a vector of information symbols and a vector of the same size of components evenly distributed in the discrete VOFDM signal spectrum, and show that if a vector of information symbols is set to 0, then a corresponding vector of the same size of the discrete VOFDM signal spectrum is 0 as well, where the components of the 0 vector are not together but evenly distributed in the spectrum.

Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundational Models

no code implementations18 Jul 2024 Simha Sankar Baradwaj, Destiny Gilliland, Jack Rincon, Henning Hermjakob, Yu Yan, Irsyad Adam, Gwyneth Lemaster, Dean Wang, Karol Watson, Alex Bui, Wei Wang, Peipei Ping

We explore strategies that can be implemented throughout the biomedical AI pipeline to effectively tackle these challenges, ensuring that these FMs are translated responsibly into clinical and translational settings.

Decision Making Memorization +2

Explainable Biomedical Hypothesis Generation via Retrieval Augmented Generation enabled Large Language Models

1 code implementation17 Jul 2024 Alexander R. Pelletier, Joseph Ramirez, Irsyad Adam, Simha Sankar, Yu Yan, Ding Wang, Dylan Steinecke, Wei Wang, Peipei Ping

The vast amount of biomedical information available today presents a significant challenge for investigators seeking to digest, process, and understand these findings effectively.

Navigate RAG +1

Hardware-Efficient and Reliable Coherent DSCM Systems Enabled by Single-Pilot-Tone-Based Polarization Demultiplexing

no code implementations14 Jul 2024 Wei Wang, Dongdong Zou, Weihao Ni, Fan Li

Besides, a low-complexity, high-spectral-efficiency, and ultra-fast polarization demultiplexing method based on a single pilot tone (SPT) is proposed for the DSCM system in this work.

Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder

1 code implementation10 Jul 2024 Kun Wu, Zhiguo Jiang, Kunming Tang, Jun Shi, Fengying Xie, Wei Wang, Haibo Wu, Yushan Zheng

The results have demonstrated the effectiveness of PAMA in generalized and discriminative WSI representation learning and pan-cancer WSI pre-training.

Position Representation Learning +1

CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement

no code implementations9 Jul 2024 Wei Wang, Zhi Jin

Low-Light Image Enhancement (LLIE) has advanced with the surge in phone photography demand, yet many existing methods neglect compression, a crucial concern for resource-constrained phone photography.

Low-Light Image Enhancement

Solving General Natural-Language-Description Optimization Problems with Large Language Models

no code implementations9 Jul 2024 Jihai Zhang, Wei Wang, Siyan Guo, Li Wang, Fangquan Lin, Cheng Yang, Wotao Yin

Optimization problems seek to find the best solution to an objective under a set of constraints, and have been widely investigated in real-world applications.

Decision Making

InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct

1 code implementation8 Jul 2024 Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang, Lingzhe Gao, Shihao Liu, Ziyuan Nan, Kaizhao Yuan, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Yewen Pu, Dawei Yin, Xing Hu, Yunji Chen

Recent advancements in open-source code large language models (LLMs) have demonstrated remarkable coding abilities by fine-tuning on the data generated from powerful closed-source LLMs such as GPT-3. 5 and GPT-4 for instruction tuning.

Code Generation Code Summarization +1

CLIMB: A Benchmark of Clinical Bias in Large Language Models

1 code implementation7 Jul 2024 Yubo Zhang, Shudi Hou, Mingyu Derek Ma, Wei Wang, Muhao Chen, Jieyu Zhao

We introduce CLIMB (shorthand for A Benchmark of Clinical Bias in Large Language Models), a pioneering comprehensive benchmark to evaluate both intrinsic (within LLMs) and extrinsic (on downstream tasks) bias in LLMs for clinical decision tasks.

counterfactual Decision Making

LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts

1 code implementation6 Jul 2024 Yijia Xiao, Edward Sun, Tianyu Liu, Wei Wang

We propose LogicVista, an evaluation benchmark that assesses the integrated logical reasoning capabilities of multimodal large language models (MLLMs) in Visual contexts.

Logical Reasoning Mathematical Reasoning +1

Universal Approximation Theory: The Basic Theory for Deep Learning-Based Computer Vision Models

no code implementations2 Jul 2024 Wei Wang, Qing Li

What is the fundamental difference between residual-based CNNs and Transformer-based networks?

SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules

no code implementations2 Jul 2024 Suyi Li, Lingyun Yang, Xiaoxiao Jiang, Hanfeng Lu, Zhipeng Di, Weiyi Lu, Jiawei Chen, Kan Liu, YingHao Yu, Tao Lan, Guodong Yang, Lin Qu, Liping Zhang, Wei Wang

It commences with our observation that add-on modules, i. e., ControlNets and LoRAs, that augment the base stable diffusion models, are ubiquitous in generating images for commercial applications.

Image Generation

MIRAI: Evaluating LLM Agents for Event Forecasting

no code implementations1 Jul 2024 Chenchen Ye, Ziniu Hu, Yihe Deng, Zijie Huang, Mingyu Derek Ma, Yanqiao Zhu, Wei Wang

Recent advancements in Large Language Models (LLMs) have empowered LLM agents to autonomously collect world information, over which to conduct reasoning to solve complex problems.

Benchmarking

Universal Approximation Theory: The Basic Theory for Transformer-based Large Language Models

no code implementations1 Jul 2024 Wei Wang, Qing Li

Language models have emerged as a critical area of focus in artificial intelligence, particularly with the introduction of groundbreaking innovations like ChatGPT.

In-Context Learning

MVOC: a training-free multiple video object composition method with diffusion models

1 code implementation22 Jun 2024 Wei Wang, Yaosen Chen, Yuegen Liu, Qi Yuan, Shubin Yang, Yanru Zhang

Although image composition based on diffusion models has been highly successful, it is not straightforward to extend the achievement to video object composition tasks, which not only exhibit corresponding interaction effects but also ensure that the objects in the composited video maintain motion and identity consistency, which is necessary to composite a physical harmony video.

Image to Video Generation Object +1

Proving Olympiad Algebraic Inequalities without Human Demonstrations

no code implementations20 Jun 2024 Chenrui Wei, Mengzhou Sun, Wei Wang

To address these issues, we propose AIPS, an Algebraic Inequality Proving System capable of autonomously generating complex inequality theorems and effectively solving Olympiad-level inequality problems without requiring human demonstrations.

Ultra-High-Definition Restoration: New Benchmarks and A Dual Interaction Prior-Driven Solution

1 code implementation19 Jun 2024 Liyan Wang, Cong Wang, Jinshan Pan, Weixiang Zhou, Xiaoran Sun, Wei Wang, Zhixun Su

To better utilize these priors, we introduce single prior feature interaction and dual prior feature interaction, where the former respectively fuses normal and gradient priors with high-resolution features to enhance prior ones, while the latter calculates the similarity between enhanced prior ones and further exploits dual guided filtering to boost the feature interaction of dual priors.

4k Image Restoration +1

AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models

no code implementations19 Jun 2024 Ken Chen, Sachith Seneviratne, Wei Wang, Dongting Hu, Sanjay Saha, Md. Tarek Hasan, Sanka Rasnayaka, Tamasha Malepathirana, Mingming Gong, Saman Halgamuge

Face reenactment refers to the process of transferring the pose and facial expressions from a reference (driving) video onto a static facial (source) image while maintaining the original identity of the source image.

Face Reenactment Face Swapping

Teaching Large Language Models to Express Knowledge Boundary from Their Own Signals

no code implementations16 Jun 2024 Lida Chen, Zujie Liang, Xintao Wang, Jiaqing Liang, Yanghua Xiao, Feng Wei, Jinglei Chen, Zhenghong Hao, Bing Han, Wei Wang

Large language models (LLMs) have achieved great success, but their occasional content fabrication, or hallucination, limits their practical application.

Hallucination

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery

1 code implementation16 Jun 2024 Yu Zhang, Xiusi Chen, Bowen Jin, Sheng Wang, Shuiwang Ji, Wei Wang, Jiawei Han

In many scientific fields, large language models (LLMs) have revolutionized the way text and other modalities of data (e. g., molecules and proteins) are handled, achieving superior performance in various applications and augmenting the scientific discovery process.

scientific discovery

Multi-Objective Control Co-design Using Graph-Based Optimization for Offshore Wind Farm Grid Integration

no code implementations14 Jun 2024 Himanshu Sharma, Wei Wang, Bowen Huang, Thiagarajan Ramachandran, Veronica Adetola

Offshore wind farms have emerged as a popular renewable energy source that can generate substantial electric power with a low environmental impact.

CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions

1 code implementation14 Jun 2024 Mingyu Derek Ma, Chenchen Ye, Yu Yan, Xiaoxuan Wang, Peipei Ping, Timothy S Chang, Wei Wang

The integration of Artificial Intelligence (AI), especially Large Language Models (LLMs), into the clinical diagnosis process offers significant potential to improve the efficiency and accessibility of medical care.

Decision Making

Interconnected Markets: Exploring the Dynamic Relationship Between BRICS Stock Markets and Cryptocurrency

no code implementations11 Jun 2024 Wei Wang, Haibo Wang

This study uses data from the BRICS stock market index, cryptocurrencies, and investor sentiment indicators from January 6, 2015, to June 29, 2023.

MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension

no code implementations10 Jun 2024 Khiem Le, Zhichun Guo, Kaiwen Dong, Xiaobao Huang, Bozhao Nan, Roshni Iyer, Xiangliang Zhang, Olaf Wiest, Wei Wang, Nitesh V. Chawla

Large Language Models (LLMs) with their strong task-handling capabilities have shown remarkable advancements across a spectrum of fields, moving beyond natural language understanding.

Natural Language Understanding Retrosynthesis

M2NO: Multiresolution Operator Learning with Multiwavelet-based Algebraic Multigrid Method

no code implementations7 Jun 2024 Zhihao LI, Zhilu Lai, Xiaobo Wang, Wei Wang

Solving partial differential equations (PDEs) effectively necessitates a multi-scale approach, particularly critical in high-dimensional scenarios characterized by increasing grid points or resolution.

Operator learning Super-Resolution

FourierKAN-GCF: Fourier Kolmogorov-Arnold Network -- An Effective and Efficient Feature Transformation for Graph Collaborative Filtering

1 code implementation3 Jun 2024 Jinfeng Xu, Zheyu Chen, Jinze Li, Shuo Yang, Wei Wang, Xiping Hu, Edith C. -H. Ngai

We revisit these two components and discover that a part of feature transformation and nonlinear operation during message passing in GCN can improve the representation of GCF, but increase the difficulty of training.

Collaborative Filtering

Multipath Exploitation for Fluctuating Target Detection in RIS-Assisted ISAC Systems

no code implementations2 Jun 2024 Shoushuo Zhang, Zichao Xiao, Rang Liu, Ming Li, Wei Wang, Qian Liu

Integrated sensing and communication (ISAC) systems are typically deployed in multipath environments, which is usually deemed as a challenging issue for wireless communications.

Diversity

Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks

1 code implementation2 Jun 2024 Cong Wang, Wei Wang, Chengjin Yu, Jie Mu

To better model this property for image detaining, we develop a multi-scale graph network with exemplars, called MSGNN, that contains two branches: 1) internal data-based supervised branch is used to model the internal relations of similar patches from the rainy image itself and its multi-scale images and 2) external data-participated unsupervised branch is used to model the external relations of the similar patches in the rainy image and exemplar.

Single Image Deraining

Correlation Matching Transformation Transformers for UHD Image Restoration

1 code implementation2 Jun 2024 Cong Wang, Jinshan Pan, Wei Wang, Gang Fu, Siyuan Liang, Mengzhu Wang, Xiao-Ming Wu, Jun Liu

To better improve feature representation in low-resolution space, we propose to build feature transformation from the high-resolution space to the low-resolution one.

Deblurring Image Deblurring +3

Exploring Channel Estimation and Signal Detection for ODDM-based ISAC Systems

no code implementations1 Jun 2024 Dezhi Wang, Chongwen Huang, Lei Liu, Xiaoming Chen, Wei Wang, Zhaoyang Zhang, Chau Yuen, Mérouane Debbah

Inspired by providing reliable communications for high-mobility scenarios, in this letter, we investigate the channel estimation and signal detection in integrated sensing and communication~(ISAC) systems based on the orthogonal delay-Doppler multiplexing~(ODDM) modulation, which consists of a pulse-train that can achieve the orthogonality with respect to the resolution of the delay-Doppler~(DD) plane.

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

no code implementations30 May 2024 Yihe Deng, Pan Lu, Fan Yin, Ziniu Hu, Sheng Shen, James Zou, Kai-Wei Chang, Wei Wang

To further self-improve reasoning on the extracted visual information, we let the model reuse a small portion of existing instruction-tuning data and append its self-generated image descriptions to the prompts.

Image Comprehension Visual Question Answering

Cross-Context Backdoor Attacks against Graph Prompt Learning

1 code implementation28 May 2024 Xiaoting Lyu, Yufei Han, Wei Wang, Hangwei Qian, Ivor Tsang, Xiangliang Zhang

Graph Prompt Learning (GPL) bridges significant disparities between pretraining and downstream applications to alleviate the knowledge transfer bottleneck in real-world graph learning.

Backdoor Attack Computational Efficiency +3

QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models

no code implementations14 May 2024 Wei Wang, Zhaowei Li, Qi Xu, Yiqing Cai, Hang Song, Qi Qi, Ran Zhou, Zhida Huang, Tao Wang, Li Xiao

For the learning of positive knowledge, we collect positive rationales through self-consistency to denoise the LLM rationales generated by temperature sampling.

Contrastive Learning Knowledge Distillation +1

Minimum-Variance Recursive State Estimation for 2-D Systems: When Asynchronous Multi-Channel Delays meet Energy Harvesting Constraints

no code implementations13 May 2024 Yu Chen, Wei Wang

Based on the reconstructed activated observation sequence and activated probability, a novel unbiased h+1-step recursive estimator is constructed.

Fighter flight trajectory prediction based on spatio-temporal graphcial attention network

no code implementations13 May 2024 Yao Sun, Tengyu Jing, Jiapeng Wang, Wei Wang

The Transformer branch network is used to extract the temporal characteristics of historical trajectories and capture the impact of the fighter's historical state on future trajectories, while the GAT branch network is used to extract spatial features in historical trajectories and capture potential spatial correlations between fighters. Then we concatenate the outputs of the two branches into a new feature vector and input it into a decoder composed of a fully connected network to predict the future position coordinates of the blue army fighter. The computer simulation results show that the proposed network significantly improves the prediction accuracy of flight trajectories compared to the enhanced CNN-LSTM network (ECNN-LSTM), with improvements of 47% and 34% in both ADE and FDE indicators, providing strong support for subsequent autonomous combat missions.

Graph Attention Trajectory Prediction

VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context

1 code implementation8 May 2024 Yunxin Li, Baotian Hu, Haoyuan Shi, Wei Wang, Longyue Wang, Min Zhang

Large Multimodal Models (LMMs) have achieved impressive success in visual understanding and reasoning, remarkably improving the performance of mathematical reasoning in a visual context.

Math Mathematical Reasoning

Flight Trajectory Prediction Using an Enhanced CNN-LSTM Network

no code implementations30 Apr 2024 Qinzhi Hao, Jiali Zhang, Tengyu Jing, Wei Wang

Aiming at the problem of low accuracy of flight trajectory prediction caused by the high speed of fighters, the diversity of tactical maneuvers, and the transient nature of situational change in close range air combat, this paper proposes an enhanced CNN-LSTM network as a fighter flight trajectory prediction method.

Diversity Trajectory Prediction

Probing Unlearned Diffusion Models: A Transferable Adversarial Attack Perspective

1 code implementation30 Apr 2024 Xiaoxuan Han, Songlin Yang, Wei Wang, Yang Li, Jing Dong

Specifically, we employ an adversarial search strategy to search for the adversarial embedding which can transfer across different unlearned models.

Adversarial Attack

BrainODE: Dynamic Brain Signal Analysis via Graph-Aided Neural Ordinary Differential Equations

no code implementations30 Apr 2024 Kaiqiao Han, Yi Yang, Zijie Huang, Xuan Kan, Yang Yang, Ying Guo, Lifang He, Liang Zhan, Yizhou Sun, Wei Wang, Carl Yang

Brain network analysis is vital for understanding the neural interactions regarding brain structures and functions, and identifying potential biomarkers for clinical phenotypes.

Irregular Time Series Missing Values +1

No More Ambiguity in 360° Room Layout via Bi-Layout Estimation

no code implementations15 Apr 2024 Yu-Ju Tsai, Jin-Cheng Jhang, Jingjing Zheng, Wei Wang, Albert Y. C. Chen, Min Sun, Cheng-Hao Kuo, Ming-Hsuan Yang

A unique property of our Bi-Layout model is its ability to inherently detect ambiguous regions by comparing the two predictions.

Room Layout Estimation

DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness

no code implementations14 Apr 2024 Yuqi Wang, Zeqiang Wang, Wei Wang, Qi Chen, Kaizhu Huang, Anh Nguyen, Suparna De

Safe and reliable natural language inference is critical for extracting insights from clinical trial reports but poses challenges due to biases in large pre-trained language models.

Data Augmentation Diversity +3

Hierarchical Attention Models for Multi-Relational Graphs

1 code implementation14 Apr 2024 Roshni G. Iyer, Wei Wang, Yizhou Sun

BR-GCN models use bi-level attention to learn node embeddings through (1) node-level attention, and (2) relation-level attention.

Graph Attention Link Prediction +2

Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of Artifacts

1 code implementation12 Apr 2024 Yang Li, Songlin Yang, Wei Wang, Ziwen He, Bo Peng, Jing Dong

We verify the effectiveness of the proposed explanations from two aspects: (1) Counterfactual Trace Visualization: the enhanced forgery images are useful to reveal artifacts by visually contrasting the original images and two different visualization methods; (2) Transferable Adversarial Attacks: the adversarial forgery images generated by attacking the detection model are able to mislead other detection models, implying the removed artifacts are general.

Adversarial Attack counterfactual

Efficient and Scalable Chinese Vector Font Generation via Component Composition

no code implementations10 Apr 2024 Jinyu Song, Weitao You, Shuhui Shi, Shuxuan Guo, Lingyun Sun, Wei Wang

In this work, we first observe that most Chinese characters can be disassembled into frequently-reused components.

Font Generation

Image and Video Compression using Generative Sparse Representation with Fidelity Controls

no code implementations9 Apr 2024 Wei Jiang, Wei Wang

Our framework can be conveniently used for both learned image compression (LIC) and learned video compression (LVC).

Decoder Image Compression +1

Event Detection from Social Media for Epidemic Prediction

1 code implementation2 Apr 2024 Tanmay Parekh, Anh Mac, Jiarui Yu, Yuxuan Dong, Syed Shahriar, Bonnie Liu, Eric Yang, Kuan-Hao Huang, Wei Wang, Nanyun Peng, Kai-Wei Chang

In our work, we pioneer exploiting Event Detection (ED) for better preparedness and early warnings of any upcoming epidemic by developing a framework to extract and analyze epidemic-related events from social media posts.

Event Detection

IterAlign: Iterative Constitutional Alignment of Large Language Models

no code implementations27 Mar 2024 Xiusi Chen, Hongzhi Wen, Sreyashi Nag, Chen Luo, Qingyu Yin, Ruirui Li, Zheng Li, Wei Wang

Such a constitution discovery pipeline can be run iteratively and automatically to discover new constitutions that specifically target the alignment gaps in the current LLM.

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

no code implementations25 Mar 2024 Shilong Zhang, Lianghua Huang, Xi Chen, Yifei Zhang, Zhi-Fan Wu, Yutong Feng, Wei Wang, Yujun Shen, Yu Liu, Ping Luo

This work presents FlashFace, a practical tool with which users can easily personalize their own photos on the fly by providing one or a few reference face images and a text prompt.

Face Swapping Instruction Following +1

Deep Variational Incomplete Multi-View Clustering: Exploring Shared Clustering Structures

no code implementations Conference 2024 Gehui Xu, Jie Wen, Chengliang Liu, Bing Hu, Yicheng Liu, Lunke Fei, Wei Wang

Existing IMVC methods primarily suffer from two issues: 1) Imputation-based methods inevitably introduce inaccurate imputations, which in turn degrade clustering performance; 2) Imputation-free methods are susceptible to unbalanced information among views and fail to fully exploit shared information.

Clustering Imputation +1

https://paperswithcode.com/paper/negatives-make-a-positive-an-embarrassingly

no code implementations Conference 2024 Gehui Xu, Jie Wen, Chengliang Liu, Bing Hu, Yicheng Liu, Lunke Fei, Wei Wang

Existing IMVC methods primarily suffer from two issues: 1) Imputation-based methods inevitably introduce inaccurate imputations, which in turn degrade clustering performance; 2) Imputation-free methods are susceptible to unbalanced information among views and fail to fully exploit shared information.

Clustering Imputation +1

Object Detectors in the Open Environment: Challenges, Solutions, and Outlook

1 code implementation24 Mar 2024 Siyuan Liang, Wei Wang, Ruoyu Chen, Aishan Liu, Boxi Wu, Ee-Chien Chang, Xiaochun Cao, DaCheng Tao

This paper aims to bridge this gap by conducting a comprehensive review and analysis of object detectors in open environments.

Incremental Learning Object

A Parallel Workflow for Polar Sea-Ice Classification using Auto-labeling of Sentinel-2 Imagery

no code implementations19 Mar 2024 Jurdana Masuma Iqrah, Wei Wang, Hongjie Xie, Sushil Prasad

Using the Antarctic's Ross Sea region as an example, the U-Net model trained on auto-labeled data achieves a classification accuracy of 98. 97% for auto-labeled training datasets when the thin clouds and shadows from the S2 images are filtered out.

Simplified Self-homodyne Coherent System Based on Alamouti Coding and Digital Subcarrier Multiplexing

no code implementations18 Mar 2024 Wei Wang, Dongdong Zou, Zhenpeng Wu, Qi Sui, Xingwen Yi, Fan Li, Chao Lu, Zhaohui Li

Coherent technology inherent with more availabledegrees of freedom is deemed a competitive solution for nextgeneration ultra-high-speed short-reach optical interconnects. However, the fatal barriers to implementing the conventiona. coherent system in short-reach optical interconnect are the costfootprint, and power consumption.

TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model

no code implementations15 Mar 2024 Jiahao Lyu, Jin Wei, Gangyan Zeng, Zeng Li, Enze Xie, Wei Wang, Yu Zhou

Taking advantage of the fine-tuned language model on scene recognition benchmarks and the paradigm of text block detection, extensive experiments demonstrate the superior performance of our scene text spotter across multiple public benchmarks.

Language Modelling Optical Character Recognition (OCR) +3

How Powerful Potential of Attention on Image Restoration?

no code implementations15 Mar 2024 Cong Wang, Jinshan Pan, Yeying Jin, Liyan Wang, Wei Wang, Gang Fu, Wenqi Ren, Xiaochun Cao

Our designs provide a closer look at the attention mechanism and reveal that some simple operations can significantly affect the model performance.

Image Restoration

Adaptive Split Learning over Energy-Constrained Wireless Edge Networks

no code implementations8 Mar 2024 Zuguang Li, Wen Wu, Shaohua Wu, Wei Wang

Then, a two-layer optimization method is proposed to solve the MIP problem.

QAQ: Quality Adaptive Quantization for LLM KV Cache

1 code implementation7 Mar 2024 Shichen Dong, Wen Cheng, Jiayu Qin, Wei Wang

The emergence of LLMs has ignited a fresh surge of breakthroughs in NLP applications, particularly in domains such as question-answering systems and text generation.

Quantization Question Answering +1

Improving Event Definition Following For Zero-Shot Event Detection

no code implementations5 Mar 2024 Zefan Cai, Po-Nien Kung, Ashima Suvarna, Mingyu Derek Ma, Hritik Bansal, Baobao Chang, P. Jeffrey Brantingham, Wei Wang, Nanyun Peng

We hypothesize that a diverse set of event types and definitions are the key for models to learn to follow event definitions while existing event extraction datasets focus on annotating many high-quality examples for a few event types.

Event Detection Event Extraction

DECIDER: A Dual-System Rule-Controllable Decoding Framework for Language Generation

no code implementations4 Mar 2024 Chen Xu, Tian Lan, Changlong Yu, Wei Wang, Jun Gao, Yu Ji, Qunxi Dong, Kun Qian, Piji Li, Wei Bi, Bin Hu

Constrained decoding approaches aim to control the meaning or style of text generated by a Pre-trained Language Model (PLM) using specific target words during inference.

Language Modelling Text Generation

Causal Graph ODE: Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems

no code implementations29 Feb 2024 Zijie Huang, Jeehyun Hwang, Junkai Zhang, Jinwoo Baik, Weitong Zhang, Dominik Wodarz, Yizhou Sun, Quanquan Gu, Wei Wang

Real-world multi-agent systems are often dynamic and continuous, where the agents co-evolve and undergo changes in their trajectories and interactions over time.

counterfactual Decision Making +1

Learning to Edit: Aligning LLMs with Knowledge Editing

1 code implementation19 Feb 2024 Yuxin Jiang, YuFei Wang, Chuhan Wu, Wanjun Zhong, Xingshan Zeng, Jiahui Gao, Liangyou Li, Xin Jiang, Lifeng Shang, Ruiming Tang, Qun Liu, Wei Wang

Knowledge editing techniques, aiming to efficiently modify a minor proportion of knowledge in large language models (LLMs) without negatively impacting performance across other inputs, have garnered widespread attention.

knowledge editing Philosophy

EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models

no code implementations18 Feb 2024 Jun Gao, Huan Zhao, Wei Wang, Changlong Yu, Ruifeng Xu

In this study, we present EventRL, a reinforcement learning approach developed to enhance event extraction for large language models (LLMs).

Event Extraction Hallucination +1

Generalizing across Temporal Domains with Koopman Operators

no code implementations12 Feb 2024 Qiuhao Zeng, Wei Wang, Fan Zhou, Gezheng Xu, Ruizhi Pu, Changjian Shui, Christian Gagne, Shichun Yang, Boyu Wang, Charles X. Ling

By employing Koopman Operators, we effectively address the time-evolving distributions encountered in TDG using the principles of Koopman theory, where measurement functions are sought to establish linear transition relations between evolving domains.

Domain Generalization Generalization Bounds

Rocks Coding, Not Development--A Human-Centric, Experimental Evaluation of LLM-Supported SE Tasks

no code implementations8 Feb 2024 Wei Wang, Huilong Ning, Gaowei Zhang, Libo Liu, Yi Wang

Our study thus provides first-hand insights into using ChatGPT to fulfill software engineering tasks with real-world developers and motivates the need for novel interaction mechanisms that help developers effectively work with large language models to achieve desired outcomes.

TinyLLM: Learning a Small Student from Multiple Large Language Models

no code implementations7 Feb 2024 Yijun Tian, Yikun Han, Xiusi Chen, Wei Wang, Nitesh V. Chawla

To solve the problems and facilitate the learning of compact language models, we propose TinyLLM, a new knowledge distillation paradigm to learn a small student LLM from multiple large teacher LLMs.

Diversity Knowledge Distillation

Polyp-DAM: Polyp segmentation via depth anything model

no code implementations3 Feb 2024 Zhuoran Zheng, Chen Wu, Wei Wang, Yeying Jin, Xiuyi Jia

In this paper, we unfold a new perspective on polyp segmentation modeling by leveraging the Depth Anything Model (DAM) to provide depth prior to polyp segmentation models.

Segmentation

Phrase Grounding-based Style Transfer for Single-Domain Generalized Object Detection

no code implementations2 Feb 2024 Hao Li, Wei Wang, Cong Wang, Zhigang Luo, Xinwang Liu, Kenli Li, Xiaochun Cao

Single-domain generalized object detection aims to enhance a model's generalizability to multiple unseen target domains using only data from a single source domain during training.

object-detection Object Detection +2

A Survey on Self-Supervised Learning for Non-Sequential Tabular Data

1 code implementation2 Feb 2024 Wei-Yao Wang, Wei-Wei Du, Derek Xu, Wei Wang, Wen-Chih Peng

Recently, SSL has become a new trend in exploring the representation learning capability in the realm of tabular data, which is more challenging due to not having explicit relations for learning descriptive representations.

Contrastive Learning Descriptive +2

Beyond Inserting: Learning Identity Embedding for Semantic-Fidelity Personalized Diffusion Generation

no code implementations31 Jan 2024 Yang Li, Songlin Yang, Wei Wang, Jing Dong

The previous methods either failed to accurately fit the face region or lost the interactive generative ability with other existing concepts in T2I models.

Image Generation Personalized Image Generation

On the Emergence of Symmetrical Reality

no code implementations26 Jan 2024 Zhenliang Zhang, Zeyu Zhang, Ziyuan Jiao, Yao Su, Hangxin Liu, Wei Wang, Song-Chun Zhu

Artificial intelligence (AI) has revolutionized human cognitive abilities and facilitated the development of new AI entities capable of interacting with humans in both physical and virtual environments.

Mixed Reality

Emulation-based Stabilization for Networked Control Systems with Stochastic Channels

no code implementations22 Jan 2024 Wei Ren, Wei Wang, Zhuo-Rui Pan, Xi-Ming Sun, Andrew R. Teel, Dragan Nesic

Next, the proposed scheduling protocol is embedded into the closed-loop system, which leads to a stochastic hybrid model for NCSs with random packet dropouts.

Scheduling

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

1 code implementation19 Jan 2024 Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Wei Wang, Yaodong Yang, Song-Chun Zhu

Within CivRealm, we provide interfaces for two typical agent types: tensor-based agents that focus on learning, and language-based agents that emphasize reasoning.

Decision Making

ConcEPT: Concept-Enhanced Pre-Training for Language Models

no code implementations11 Jan 2024 Xintao Wang, Zhouhong Gu, Jiaqing Liang, Dakuan Lu, Yanghua Xiao, Wei Wang

In this paper, we propose ConcEPT, which stands for Concept-Enhanced Pre-Training for language models, to infuse conceptual knowledge into PLMs.

Entity Linking Entity Typing

Privacy-Preserving Sequential Recommendation with Collaborative Confusion

no code implementations9 Jan 2024 Wei Wang, Yujie Lin, Pengjie Ren, Zhumin Chen, Tsunenori Mine, Jianli Zhao, Qiang Zhao, Moyan Zhang, Xianye Ben, YuJun Li

Unlike existing research, we capture collaborative signals of neighbor interaction sequences and directly inject indistinguishable items into the target sequence before the recommendation process begins, thereby increasing the perplexity of the target sequence.

Collaborative Filtering Federated Learning +2

When To Grow? A Fitting Risk-Aware Policy for Layer Growing in Deep Neural Networks

no code implementations6 Jan 2024 Haihang Wu, Wei Wang, Tamasha Malepathirana, Damith Senanayake, Denny Oetomo, Saman Halgamuge

Neural growth is the process of growing a small neural network to a large network and has been utilized to accelerate the training of deep neural networks.