Search Results for author: Qi Wang

Found 84 papers, 19 papers with code

RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation

1 code implementation ICCV 2023 Lijun Li, Linrui Tian, Xindi Zhang, Qi Wang, Bang Zhang, Mengyuan Liu, Chen Chen

The current interacting hand (IH) datasets are relatively simplistic in terms of background and texture, with hand joints being annotated by a machine annotator, which may result in inaccuracies, and the diversity of pose distribution is limited.

3D Interacting Hand Pose Estimation Hand Pose Estimation

Learning Independent Instance Maps for Crowd Localization

1 code implementation8 Dec 2020 Junyu Gao, Tao Han, Qi Wang, Yuan Yuan, Xuelong Li

Furthermore, to improve the segmentation quality for different density regions, we present a differentiable Binarization Module (BM) to output structured instance maps.

Binarization Segmentation

ScreenAgent: A Vision Language Model-driven Computer Control Agent

1 code implementation9 Feb 2024 Runliang Niu, Jindong Li, Shiqi Wang, Yali Fu, Xiyu Hu, Xueyuan Leng, He Kong, Yi Chang, Qi Wang

Additionally, we construct the ScreenAgent Dataset, which collects screenshots and action sequences when completing a variety of daily computer tasks.

Language Modelling

Encoding physics to learn reaction-diffusion processes

2 code implementations9 Jun 2021 Chengping Rao, Pu Ren, Qi Wang, Oral Buyukozturk, Hao Sun, Yang Liu

Modeling complex spatiotemporal dynamical systems, such as the reaction-diffusion processes, have largely relied on partial differential equations (PDEs).

Epidemiology

DR.VIC: Decomposition and Reasoning for Video Individual Counting

2 code implementations CVPR 2022 Tao Han, Lei Bai, Junyu Gao, Qi Wang, Wanli Ouyang

Instead of relying on the Multiple Object Tracking (MOT) techniques, we propose to solve the problem by decomposing all pedestrians into the initial pedestrians who existed in the first frame and the new pedestrians with separate identities in each following frame.

Crowd Counting Density Estimation +2

Unsupervised Semantic Aggregation and Deformable Template Matching for Semi-Supervised Learning

1 code implementation NeurIPS 2020 Tao Han, Junyu Gao, Yuan Yuan, Qi Wang

In this paper, we combine both to propose an Unsupervised Semantic Aggregation and Deformable Template Matching (USADTM) framework for SSL, which strives to improve the classification performance with few labeled data and then reduce the cost in data annotating.

Template Matching

Window Normalization: Enhancing Point Cloud Understanding by Unifying Inconsistent Point Densities

1 code implementation5 Dec 2022 Qi Wang, Sheng Shi, Jiahui Li, Wuming Jiang, Xiangde Zhang

Existing methods are limited by the inconsistent point densities of different parts in the point cloud.

Generic and Robust Root Cause Localization for Multi-Dimensional Data in Online Service Systems

1 code implementation5 May 2023 Zeyan Li, Junjie Chen, Yihao Chen, Chengyang Luo, Yiwei Zhao, Yongqian Sun, Kaixin Sui, Xiping Wang, Dapeng Liu, Xing Jin, Qi Wang, Dan Pei

Such attribute combinations are substantial clues to the underlying root causes and thus are called root causes of multidimensional data.

Attribute

Physics-informed Deep Super-resolution for Spatiotemporal Data

1 code implementation2 Aug 2022 Pu Ren, Chengping Rao, Yang Liu, Zihan Ma, Qi Wang, Jian-Xun Wang, Hao Sun

High-fidelity simulation of complex physical systems is exorbitantly expensive and inaccessible across spatiotemporal scales.

Super-Resolution

SS-GNN: A Simple-Structured Graph Neural Network for Affinity Prediction

1 code implementation25 May 2022 Shuke Zhang, Yanzhao Jin, Tianmeng Liu, Qi Wang, Zhaohui Zhang, Shuliang Zhao, Bo Shan

We also develop an edge-based atom-pair feature aggregation method to represent complex interactions and a graph pooling-based method to predict the binding affinity of the complex.

Large Language Models can be Guided to Evade AI-Generated Text Detection

1 code implementation18 May 2023 Ning Lu, Shengcai Liu, Rui He, Qi Wang, Yew-Soon Ong, Ke Tang

Large language models (LLMs) have shown remarkable performance in various tasks and have been extensively utilized by the public.

Question Answering Text Detection

Dragon-Alpha&cu32: A Java-based Tensor Computing Framework With its High-Performance CUDA Library

1 code implementation15 May 2023 Zhiyi Zhang, Pengfei Zhang, Qi Wang

Java is very powerful, but in Deep Learning field, its capabilities probably has not been sufficiently exploited.

QMR:Q-learning based Multi-objective optimization Routing protocol for Flying Ad Hoc Networks

1 code implementation Computer Communications 2019 Jianmin Liu, Qi Wang, ChenTao He, Katia Jaffrès-Runser, Yida Xu, Zhenyu Li, Yongjun Xu

It is difficult for existing routing protocols for Mobile Ad Hoc Networks (MANETs) and Vehicular Ad Hoc Networks (VANETs) to adapt the high dynamics of FANETs.

Q-Learning

Pretraining is All You Need: A Multi-Atlas Enhanced Transformer Framework for Autism Spectrum Disorder Classification

1 code implementation4 Jul 2023 Lucas Mahler, Qi Wang, Julius Steiglechner, Florian Birk, Samuel Heczko, Klaus Scheffler, Gabriele Lohmann

Through stratified cross-validation, we evaluate the proposed framework and show that it surpasses state-of-the-art performance on the ABIDE I dataset, with an average accuracy of 83. 7% and an AUC-score of 0. 832.

DISGAN: Wavelet-informed Discriminator Guides GAN to MRI Super-resolution with Noise Cleaning

1 code implementation23 Aug 2023 Qi Wang, Lucas Mahler, Julius Steiglechner, Florian Birk, Klaus Scheffler, Gabriele Lohmann

Departing from the traditional approach of training SR and denoising tasks as separate models, our proposed DISGAN is trained only on the SR task, but also achieves exceptional performance in denoising.

Denoising Image Generation +1

Feature-aware Adaptation and Density Alignment for Crowd Counting in Video Surveillance

no code implementations8 Dec 2019 Junyu. Gao, Yuan Yuan, Qi Wang

To reduce the gap, in this paper, we propose a domain-adaptation-style crowd counting method, which can effectively adapt the model from synthetic data to the specific real-world scenes.

Crowd Counting Density Estimation +1

CM-Net: Concentric Mask based Arbitrary-Shaped Text Detection

no code implementations30 Nov 2020 Chuang Yang, Mulin Chen, Zhitong Xiong, Yuan Yuan, Qi Wang

Extensive experiments demonstrate the proposed CM is efficient and robust to fit arbitrary-shaped text instances, and also validate the effectiveness of MPF and constraints loss for discriminative text features recognition.

Text Detection

Inverse-design magnonic devices

no code implementations8 Dec 2020 Qi Wang, Andrii V. Chumak, Philipp Pirro

The field of magnonics offers a new type of low-power information processing, in which magnons, the quanta of spin waves, carry and process data instead of electrons.

Applied Physics

Dynamics of a predator-prey system in open advective heterogeneous environments

no code implementations21 Dec 2020 Qi Wang

In this paper, we investigate the effect of dispersal and advection on the dynamics of a predator-prey model.

Analysis of PDEs 35B35, 35K57, 92D25, 92D40

Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models

no code implementations16 Feb 2021 Qi Wang, Herke van Hoof

Reinforcement learning is a promising paradigm for solving sequential decision-making problems, but low data efficiency and weak generalization across tasks are bottlenecks in real-world applications.

Decision Making Meta Reinforcement Learning +3

Multi-channel Deep Supervision for Crowd Counting

no code implementations17 Mar 2021 Bo Wei, Mulin Chen, Qi Wang, Xuelong Li

To obtain the accurate supervision information of different channels, the MDSNet employs an auxiliary network called SupervisionNet (SN) to generate abundant supervision maps based on existing groundtruth.

Crowd Counting

Spatial-spectral Hyperspectral Image Classification via Multiple Random Anchor Graphs Ensemble Learning

no code implementations25 Mar 2021 Yanling Miao, Qi Wang, Mulin Chen, Xuelong Li

Graph-based semi-supervised learning methods, which deal well with the situation of limited labeled data, have shown dominant performance in practical applications.

Descriptive Ensemble Learning +2

Auto-weighted Multi-view Feature Selection with Graph Optimization

no code implementations11 Apr 2021 Qi Wang, Xu Jiang, Mulin Chen, Xuelong Li

In this paper, we focus on the unsupervised multi-view feature selection which tries to handle high dimensional data in the field of multi-view learning.

feature selection Graph Learning +1

BiP-Net: Bidirectional Perspective Strategy based Arbitrary-Shaped Text Detection Network

no code implementations11 Apr 2021 Chuang Yang, Mulin Chen, Yuan Yuan, Qi Wang

Specifically, a new text representation strategy is proposed to represent text contours from a top-down perspective, which can fit highly curved text contours effectively.

Object Detection Text Detection

Spatial-Spectral Clustering with Anchor Graph for Hyperspectral Image

no code implementations24 Apr 2021 Qi Wang, Yanling Miao, Mulin Chen, Xuelong Li

In order to better handle the high dimensionality problem and preserve the spatial structures, this paper proposes a novel unsupervised approach called spatial-spectral clustering with anchor graph (SSCAG) for HSI data clustering.

Clustering

MT: Multi-Perspective Feature Learning Network for Scene Text Detection

no code implementations12 May 2021 Chuang Yang, Mulin Chen, Yuan Yuan, Qi Wang

Text detection, the key technology for understanding scene text, has become an attractive research topic.

Scene Text Detection Text Detection

Waveform Design for Joint Sensing and Communications in Millimeter-Wave and Low Terahertz Bands

no code implementations3 Jun 2021 Tianqi Mao, Jiaxuan Chen, Qi Wang, Chong Han, Zhaocheng Wang, George K. Karagiannidis

Finally, a data-embedded MS-QP (DE-MS-QP) waveform is constructed through time-domain extension of the MS-QP sequence, generating null frequency points on each subband for data transmission.

Unsupervised Domain Adaptive Learning via Synthetic Data for Person Re-identification

no code implementations12 Sep 2021 Qi Wang, Sikai Bai, Junyu Gao, Yuan Yuan, Xuelong Li

In addition, due to domain gaps between different datasets, the performance is dramatically decreased when re-ID models pre-trained on label-rich datasets (source domain) are directly applied to other unlabeled datasets (target domain).

Person Re-Identification Unsupervised Domain Adaptation

LDC-Net: A Unified Framework for Localization, Detection and Counting in Dense Crowds

no code implementations10 Oct 2021 Qi Wang, Tao Han, Junyu Gao, Yuan Yuan, Xuelong Li

The rapid development in visual crowd analysis shows a trend to count people by positioning or even detecting, rather than simply summing a density map.

Visual Crowd Analysis

ASK: Adaptively Selecting Key Local Features for RGB-D Scene Recognition

no code implementations14 Oct 2021 Zhitong Xiong, Yuan Yuan, Qi Wang

Discriminative local theme-level and object-level representations can be selected with the DLFS module from the spatially-correlated multi-modal RGB-D features.

feature selection Scene Classification +1

Adaptive Shrink-Mask for Text Detection

no code implementations18 Nov 2021 Chuang Yang, Mulin Chen, Yuan Yuan, Qi Wang, Xuelong Li

It weakens the coupling of texts to shrink-masks, which improves the robustness of detection results.

Text Detection

Towards Integrated Sensing and Communications for 6G

no code implementations12 Jan 2022 Qi Wang, Anastasios Kakkavas, Xitao Gong, Richard A. Stirling-Gallacher

For the next generation of mobile communications systems, the integration of sensing and communications promises benefits in terms of spectrum utilization, cost, latency, area and weight.

A Deep Learning Approach to Predicting Ventilator Parameters for Mechanically Ventilated Septic Patients

no code implementations21 Feb 2022 Zhijun Zeng, Zhen Hou, Ting Li, Lei Deng, Jianguo Hou, Xinran Huang, Jun Li, Meirou Sun, Yunhan Wang, Qiyu Wu, Wenhao Zheng, Hua Jiang, Qi Wang

We develop a deep learning approach to predicting a set of ventilator parameters for a mechanically ventilated septic patient using a long and short term memory (LSTM) recurrent neural network (RNN) model.

Optimization-based Block Coordinate Gradient Coding for Mitigating Partial Stragglers in Distributed Learning

no code implementations6 Jun 2022 Qi Wang, Ying Cui, Chenglin Li, Junni Zou, Hongkai Xiong

To reduce computational complexity, we first transform each to an equivalent but much simpler discrete problem with N\llL variables representing the partition of the L coordinates into N blocks, each with identical redundancy.

Crowd Localization from Gaussian Mixture Scoped Knowledge and Scoped Teacher

no code implementations12 Jun 2022 Juncheng Wang, Junyu Gao, Yuan Yuan, Qi Wang

The core reason of intrinsic scale shift being one of the most essential issues in crowd localization is that it is ubiquitous in crowd scenes and makes scale distribution chaotic.

Holistic Transformer: A Joint Neural Network for Trajectory Prediction and Decision-Making of Autonomous Vehicles

no code implementations17 Jun 2022 Hongyu Hu, Qi Wang, Zhengguang Zhang, Zhengyi Li, Zhenhai Gao

Trajectory prediction and behavioral decision-making are two important tasks for autonomous vehicles that require good understanding of the environmental context; behavioral decisions are better made by referring to the outputs of trajectory predictions.

Autonomous Vehicles Decision Making +2

SHDM-NET: Heat Map Detail Guidance with Image Matting for Industrial Weld Semantic Segmentation Network

no code implementations9 Jul 2022 Qi Wang, Jingwu Mei

This paper proposes an industrial weld segmentation network based on a deep learning semantic segmentation algorithm fused with heatmap detail guidance and Image Matting to solve the automatic segmentation problem of weld regions.

Image Matting Segmentation +1

MAFNet: A Multi-Attention Fusion Network for RGB-T Crowd Counting

no code implementations14 Aug 2022 PengYu Chen, Junyu Gao, Yuan Yuan, Qi Wang

RGB-Thermal (RGB-T) crowd counting is a challenging task, which uses thermal images as complementary information to RGB images to deal with the decreased performance of unimodal RGB-based methods in scenes with low-illumination or similar backgrounds.

Crowd Counting

Growing Instance Mask on Leaf

no code implementations30 Nov 2022 Chuang Yang, Haozhao Ma, Qi Wang

Considering the superiorities above, we propose VeinMask to formulate the instance segmentation problem as the simulation of the vein growth process and to predict the major and minor veins in polar coordinates.

Instance Segmentation Segmentation +1

Counting Like Human: Anthropoid Crowd Counting on Modeling the Similarity of Objects

no code implementations2 Dec 2022 Qi Wang, Juncheng Wang, Junyu Gao, Yuan Yuan, Xuelong Li

The mainstream crowd counting methods regress density map and integrate it to obtain counting results.

Crowd Counting

RELIANT: Fair Knowledge Distillation for Graph Neural Networks

1 code implementation3 Jan 2023 Yushun Dong, Binchi Zhang, Yiling Yuan, Na Zou, Qi Wang, Jundong Li

Knowledge Distillation (KD) is a common solution to compress GNNs, where a light-weighted model (i. e., the student model) is encouraged to mimic the behavior of a computationally expensive GNN (i. e., the teacher GNN model).

Fairness Graph Learning +1

Less is More: Understanding Word-level Textual Adversarial Attack via n-gram Frequency Descend

no code implementations6 Feb 2023 Ning Lu, Shengcai Liu, Zhirui Zhang, Qi Wang, Haifeng Liu, Ke Tang

Intuitively, this finding suggests a natural way to improve model robustness by training the model on the $n$-FD examples.

Adversarial Attack

Anomaly Detection of UAV State Data Based on Single-class Triangular Global Alignment Kernel Extreme Learning Machine

no code implementations18 Feb 2023 Feisha Hu, Qi Wang, Haijian Shao, Shang Gao, Hualong Yu

To improve the performance of OCKELM, we choose a Triangular Global Alignment Kernel (TGAK) instead of an RBF Kernel and introduce the Fast Independent Component Analysis (FastICA) algorithm to reconstruct UAV data.

Anomaly Detection

Improving Video Retrieval by Adaptive Margin

no code implementations9 Mar 2023 Feng He, Qi Wang, Zhifan Feng, Wenbin Jiang, Yajuan Lv, Yong Zhu, Xiao Tan

While most video retrieval methods overlook that phenomenon, we propose an adaptive margin changed with the distance between positive and negative pairs to solve the aforementioned issue.

Retrieval Video Retrieval

A Spatio-temporal Decomposition Method for the Coordinated Economic Dispatch of Integrated Transmission and Distribution Grids

no code implementations17 Mar 2023 Qi Wang, Wenchuan Wu, Chenhui Lin, Bin Wang

In the spatial dimension, a multi-parametric programming projection based spatial decomposition algorithm is developed to coordinate the ED problems of TG and DNs in a distributed manner.

Computational Efficiency

A Three-Player GAN for Super-Resolution in Magnetic Resonance Imaging

no code implementations24 Mar 2023 Qi Wang, Lucas Mahler, Julius Steiglechner, Florian Birk, Klaus Scheffler, Gabriele Lohmann

Current SISR methods for 3D volumetric images are based on Generative Adversarial Networks (GANs), especially Wasserstein GANs due to their training stability.

Image Super-Resolution

Boundary-to-Solution Mapping for Groundwater Flows in a Toth Basin

no code implementations28 Mar 2023 Jingwei Sun, Jun Li, Yonghong Hao, Cuiting Qi, Chunmei Ma, Huazhi Sun, Negash Begashaw, Gurcan Comet, Yi Sun, Qi Wang

In this paper, the authors propose a new approach to solving the groundwater flow equation in the Toth basin of arbitrary top and bottom topographies using deep learning.

Regularized Shallow Image Prior for Electrical Impedance Tomography

no code implementations30 Mar 2023 Zhe Liu, Zhou Chen, Qi Wang, Sheng Zhang, Yunjie Yang

The results suggest that combining the shallow image prior and the hand-crafted regularization can achieve similar performance to the Deep Image Prior (DIP) but with less architectural dependency and complexity of the neural network.

Improving Urban Flood Prediction using LSTM-DeepLabv3+ and Bayesian Optimization with Spatiotemporal feature fusion

no code implementations19 Apr 2023 Zuxiang Situ, Qi Wang, Shuai Teng, Wanen Feng, Gongfa Chen, Qianqian Zhou, Guangtao Fu

This study presented a CNN-RNN hybrid feature fusion modelling approach for urban flood prediction, which integrated the strengths of CNNs in processing spatial features and RNNs in analyzing different dimensions of time sequences.

Bayesian Optimization

A Stochastic-Gradient-based Interior-Point Algorithm for Solving Smooth Bound-Constrained Optimization Problems

no code implementations28 Apr 2023 Frank E. Curtis, Vyacheslav Kungurtsev, Daniel P. Robinson, Qi Wang

A stochastic-gradient-based interior-point algorithm for minimizing a continuously differentiable objective function (that may be nonconvex) subject to bound constraints is presented, analyzed, and demonstrated through experimental results.

Spatial and Modal Optimal Transport for Fast Cross-Modal MRI Reconstruction

no code implementations4 May 2023 Qi Wang, Zhijie Wen, Jun Shi, Qian Wang, Dinggang Shen, Shihui Ying

Multi-modal magnetic resonance imaging (MRI) plays a crucial role in comprehensive disease diagnosis in clinical medicine.

MRI Reconstruction

Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization

no code implementations CVPR 2023 Li’an Zhuo, Jian Cao, Qi Wang, Bang Zhang, Liefeng Bo

Then the optimization-based method is introduced to reconstruct the foot pose and foot-ground contact for the general multi-view datasets including AIST++ and Human3. 6M.

Pose Estimation

Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning

no code implementations24 May 2023 Qi Wang, Junming Yang, Yunbo Wang, Xin Jin, Wenjun Zeng, Xiaokang Yang

Training offline reinforcement learning (RL) models using visual inputs poses two significant challenges, i. e., the overfitting problem in representation learning and the overestimation bias for expected future rewards.

Offline RL Reinforcement Learning (RL) +2

Reduce Computational Complexity for Convolutional Layers by Skipping Zeros

no code implementations28 Jun 2023 Zhiyi Zhang, Pengfei Zhang, Zhuopin Xu, Qi Wang

Convolutional neural networks necessitate good algorithms to reduce complexity, and sufficient utilization of parallel processors for acceleration.

Almost-sure convergence of iterates and multipliers in stochastic sequential quadratic optimization

no code implementations7 Aug 2023 Frank E. Curtis, Xin Jiang, Qi Wang

In this paper, new almost-sure convergence guarantees for the primal iterates, Lagrange multipliers, and stationarity measures generated by a stochastic SQP algorithm in this subclass of methods are proved.

Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On

no code implementations8 Aug 2023 Daiheng Gao, Xu Chen, Xindi Zhang, Qi Wang, Ke Sun, Bang Zhang, Liefeng Bo, QiXing Huang

Since traditional warping-based texture generation methods require a significant number of control points to be manually selected for each type of garment, which can be a time-consuming and tedious process.

Texture Synthesis Virtual Try-on

A Fast Minimization Algorithm for the Euler Elastica Model Based on a Bilinear Decomposition

no code implementations25 Aug 2023 Zhifang Liu, Baochen Sun, Xue-Cheng Tai, Qi Wang, Huibin Chang

A host of numerical experiments are conducted to show that the new algorithm produces good results with much-improved efficiency compared to other state-of-the-art algorithms for the EE model.

SCVCNet: Sliding cross-vector convolution network for cross-task and inter-individual-set EEG-based cognitive workload recognition

no code implementations21 Sep 2023 Qi Wang, Li Chen, Zhiyuan Zhan, Jianhua Zhang, Zhong Yin

This paper presents a generic approach for applying the cognitive workload recognizer by exploiting common electroencephalogram (EEG) patterns across different human-machine tasks and individual sets.

EEG Electroencephalogram (EEG)

RELand: Risk Estimation of Landmines via Interpretable Invariant Risk Minimization

no code implementations6 Nov 2023 Mateo Dulce Rubio, Siqi Zeng, Qi Wang, Didier Alvarado, Francisco Moreno, Hoda Heidari, Fei Fang

Landmines remain a threat to war-affected communities for years after conflicts have ended, partly due to the laborious nature of demining tasks.

Feature Engineering Humanitarian +1

Traffic Sign Interpretation in Real Road Scene

no code implementations17 Nov 2023 Chuang Yang, Kai Zhuang, Mulin Chen, Haozhao Ma, Xu Han, Tao Han, Changxing Guo, Han Han, Bingxuan Zhao, Qi Wang

Following the above issues, we propose a traffic sign interpretation (TSI) task, which aims to interpret global semantic interrelated traffic signs (e. g.,~driving instruction-related texts, symbols, and guide panels) into a natural language for providing accurate instruction support to autonomous or assistant driving.

Instruction Following Multi-Task Learning

Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation

no code implementations19 Dec 2023 Yuze He, Yushi Bai, Matthieu Lin, Jenny Sheng, Yubin Hu, Qi Wang, Yu-Hui Wen, Yong-Jin Liu

By lifting the pre-trained 2D diffusion models into Neural Radiance Fields (NeRFs), text-to-3D generation methods have made great progress.

Text to 3D

Deep Unfolding Network with Spatial Alignment for multi-modal MRI reconstruction

no code implementations28 Dec 2023 Hao Zhang, Qi Wang, Jun Shi, Shihui Ying, Zhijie Wen

In this paper, we construct a novel Deep Unfolding Network with Spatial Alignment, termed DUN-SA, to appropriately embed the spatial alignment task into the reconstruction process.

MRI Reconstruction

A multimodal gesture recognition dataset for desktop human-computer interaction

no code implementations8 Jan 2024 Qi Wang, Fengchao Zhu, Guangming Zhu, Liang Zhang, Ning li, Eryang Gao

Gesture recognition is an indispensable component of natural and efficient human-computer interaction technology, particularly in desktop-level applications, where it can significantly enhance people's productivity.

Gesture Recognition

SamLP: A Customized Segment Anything Model for License Plate Detection

no code implementations12 Jan 2024 Haoxuan Ding, Junyu Gao, Yuan Yuan, Qi Wang

Meanwhile, the proposed SamLP has great few-shot and zero-shot learning ability, which shows the potential of transferring vision foundation model.

License Plate Detection Zero-Shot Learning

A Bi-Pyramid Multimodal Fusion Method for the Diagnosis of Bipolar Disorders

no code implementations15 Jan 2024 Guoxin Wang, Sheng Shi, Shan An, Fengmei Fan, Wenshu Ge, Qi Wang, Feng Yu, Zhiren Wang

Previous research on the diagnosis of Bipolar disorder has mainly focused on resting-state functional magnetic resonance imaging.

Medical Diagnosis

Model-Free $δ$-Policy Iteration Based on Damped Newton Method for Nonlinear Continuous-Time H$\infty$ Tracking Control

no code implementations23 Jan 2024 Qi Wang

Tracking HJI equation is a nonlinear partial differential equation, traditional reinforcement learning methods for solving the tracking HJI equation are mostly based on the Newton method, which usually only satisfies local convergence and needs a good initial guess.

reinforcement-learning

A Novel Policy Iteration Algorithm for Nonlinear Continuous-Time H$\infty$ Control Problem

no code implementations23 Jan 2024 Qi Wang

In this paper, a novel reinforcement learning method which is named {\alpha}-policy iteration ({\alpha}-PI) is introduced for solving HJI equation.

reinforcement-learning

MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music

no code implementations15 Feb 2024 ZiHao Wang, Shuyu Li, Tao Zhang, Qi Wang, Pengfei Yu, Jinyang Luo, Yan Liu, Ming Xi, Kejun Zhang

To this end, we present MuChin, the first open-source music description benchmark in Chinese colloquial language, designed to evaluate the performance of multimodal LLMs in understanding and describing music.

Information Retrieval Music Information Retrieval

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

no code implementations27 Feb 2024 Linrui Tian, Qi Wang, Bang Zhang, Liefeng Bo

In this work, we tackle the challenge of enhancing the realism and expressiveness in talking head video generation by focusing on the dynamic and nuanced relationship between audio cues and facial movements.

Video Generation

Effects of diffusion and advection on predator prey dynamics in an advective patchy environment

no code implementations12 Mar 2024 Qi Wang

We study the dynamics and the asymptotic profiles of positive steady states according to the mortality rate of the specialist predators, advection and diffusion rates.

A Generalized Framework with Adaptive Weighted Soft-Margin for Imbalanced SVM Classification

no code implementations13 Mar 2024 Lu Jiang, Qi Wang, Yuhang Chang, Jianing Song, Haoyue Fu

In this paper, we present a new generalized framework with Adaptive Weight function for soft-margin Weighted SVM (AW-WSVM), which aims to enhance the issue of imbalance and outlier sensitivity in standard support vector machine (SVM) for classifying two-class data.

Emotion Classification

PeerGPT: Probing the Roles of LLM-based Peer Agents as Team Moderators and Participants in Children's Collaborative Learning

no code implementations21 Mar 2024 Jiawen Liu, Yuanyuan Yao, Pengcheng An, Qi Wang

In children's collaborative learning, effective peer conversations can significantly enhance the quality of children's collaborative interactions.

Language Modelling Large Language Model

Cannot find the paper you are looking for? You can Submit a new open access paper.