Search Results for author: Xin Zhang

The system automatically learns internal representations of the necessary processing steps such as detecting useful road features with only the human steering angle as the training signal.

Lane Detection Self-Driving Cars

1,492

Paper
Code

Blocking Collapsed Gibbs Sampler for Latent Dirichlet Allocation Models

no code implementations • 2 Aug 2016 • Xin Zhang, Scott A. Sisson

In this article, we introduce a blocking scheme to the collapsed Gibbs sampler for the LDA model which can, with a theoretical guarantee, improve chain mixing efficiency.

Blocking

Paper
Add Code

Learning Multi-level Features For Sensor-based Human Action Recognition

no code implementations • 22 Nov 2016 • Yan Xu, Zhengyang Shen, Xin Zhang, Yifan Gao, Shujian Deng, Yipei Wang, Yubo Fan, Eric I-Chao Chang

This paper proposes a multi-level feature learning framework for human action recognition using a single body-worn inertial sensor.

Action Recognition Temporal Action Localization

Paper
Add Code

FuRongWang at SemEval-2017 Task 3: Deep Neural Networks for Selecting Relevant Answers in Community Question Answering

no code implementations • SEMEVAL 2017 • Sheng Zhang, Jiajun Cheng, Hui Wang, Xin Zhang, Pei Li, Zhaoyun Ding

We describes deep neural networks frameworks in this paper to address the community question answering (cQA) ranking task (SemEval-2017 task 3).

Answer Selection Community Question Answering

Paper
Add Code

Sleep Stage Classification Based on Multi-level Feature Learning and Recurrent Neural Networks via Wearable Device

no code implementations • 2 Nov 2017 • Xin Zhang, Weixuan Kou, Eric I-Chao Chang, He Gao, Yubo Fan, Yan Xu

The feature learning framework is designed to extract low- and mid-level features.

Automatic Sleep Stage Classification General Classification +1

Paper
Add Code

Aspect-level Sentiment Classification with HEAT (HiErarchical ATtention) Network

no code implementations • CIKM 2017 • Jiajun Cheng, Shenglin Zhao, Jiani Zhang, Irwin King, Xin Zhang, Hui Wang

However, the prior work only attends to the sentiment information and ignores the aspect-related information in the text, which may cause mismatching between the sentiment words and the aspects when an unrelated sentiment word is semantically meaningful for the given aspect.

Classification Sentence +2

Paper
Add Code

Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections

no code implementations • NeurIPS 2018 • Xin Zhang, Armando Solar-Lezama, Rishabh Singh

We argue that such a correction is a useful way to provide feedback to a user when the network's output is different from a desired output.

Paper
Add Code

Land use mapping in the Three Gorges Reservoir Area based on semantic segmentation deep learning method

no code implementations • 18 Mar 2018 • Xin Zhang, Bingfang Wu, Liang Zhu, Fuyou Tian, Miao Zhang, Yuanzeng

In this paper, we first test the state of the art semantic segmentation deep learning classifiers for LUCC mapping with 7 categories in the TGRA area with rapideye 5m resolution data.

Semantic Segmentation

Paper
Add Code

Taming Convergence for Asynchronous Stochastic Gradient Descent with Unbounded Delay in Non-Convex Learning

no code implementations • 24 May 2018 • Xin Zhang, Jia Liu, Zhengyuan Zhu

Understanding the convergence performance of asynchronous stochastic gradient descent method (Async-SGD) has received increasing attention in recent years due to their foundational role in machine learning.

Paper
Add Code

Skeleton-based Gesture Recognition Using Several Fully Connected Layers with Path Signature Features and Temporal Transformer Module

1 code implementation • 17 Nov 2018 • Chenyang Li, Xin Zhang, Lufan Liao, Lianwen Jin, Weixin Yang

In this paper, we first leverage a robust feature descriptor, path signature (PS), and propose three PS features to explicitly represent the spatial and temporal motion characteristics, i. e., spatial PS (S_PS), temporal PS (T_PS) and temporal spatial PS (T_S_PS).

Ranked #1 on Gesture Recognition on ChaLearn 2013

Computational Efficiency General Classification +1

Paper
Code

Watermark Retrieval from 3D Printed Objects via Convolutional Neural Networks

no code implementations • 19 Nov 2018 • Xin Zhang, Qian Wang, Toby Breckon, Ioannis Ivrissimtzis

We present a method for reading digital data embedded in planar 3D printed surfaces.

Retrieval

Paper
Add Code

Probabilistic Verification of Fairness Properties via Concentration

1 code implementation • 2 Dec 2018 • Osbert Bastani, Xin Zhang, Armando Solar-Lezama

As machine learning systems are increasingly used to make real world legal and financial decisions, it is of paramount importance that we develop algorithms to verify that these systems do not discriminate against minorities.

BIG-bench Machine Learning Fairness

Paper
Code

On-line Search History-assisted Restart Strategy for Covariance Matrix Adaptation Evolution Strategy

no code implementations • 16 Mar 2019 • Yang Lou, Shiu Yin Yuen, Guanrong Chen, Xin Zhang

The entire on-line search history of cNrGA is stored in a binary space partitioning (BSP) tree, which is effective for performing local search.

Paper
Add Code

The Random Conditional Distribution for Higher-Order Probabilistic Inference

no code implementations • 25 Mar 2019 • Zenna Tavares, Xin Zhang, Edgar Minaysan, Javier Burroni, Rajesh Ranganath, Armando Solar Lezama

The need to condition distributional properties such as expectation, variance, and entropy arises in algorithmic fairness, model simplification, robustness and many other areas.

Fairness Probabilistic Programming

Paper
Add Code

Spatial CUSUM for Signal Region Detection

no code implementations • 5 Apr 2019 • Xin Zhang, Zhengyuan Zhu

We develop theoretical properties of the method which indicates that asymptotically SCUSUM can reach high classification accuracy.

Epidemiology

Paper
Add Code

Watermark retrieval from 3D printed objects via synthetic data training

no code implementations • 23 May 2019 • Xin Zhang, Ning Jia, Ioannis Ivrissimtzis

We conclude that in our application domain of information retrieval from 3D printed objects, where access to the exact CAD files of the printed objects can be assumed, one can use inexpensive synthetic data to enhance neural network training, reducing the need for the labour intensive process of creating large amounts of hand labelled real data or the need to generate photorealistic synthetic data.

Information Retrieval Retrieval

Paper
Add Code

Distributed Linear Model Clustering over Networks: A Tree-Based Fused-Lasso ADMM Approach

no code implementations • 28 May 2019 • Xin Zhang, Jia Liu, Zhengyuan Zhu

In this work, we consider to improve the model estimation efficiency by aggregating the neighbors' information as well as identify the subgroup membership for each node in the network.

Clustering

Paper
Add Code

Machine Reading Comprehension: a Literature Review

no code implementations • 30 Jun 2019 • Xin Zhang, An Yang, Sujian Li, Yizhong Wang

Machine reading comprehension aims to teach machines to understand a text like a human and is a new challenging direction in Artificial Intelligence.

Machine Reading Comprehension

Paper
Add Code

Neural Machine Reading Comprehension: Methods and Trends

no code implementations • 2 Jul 2019 • Shanshan Liu, Xin Zhang, Sheng Zhang, Hui Wang, Weiming Zhang

Machine reading comprehension (MRC), which requires a machine to answer questions based on a given context, has attracted increasing attention with the incorporation of various deep-learning techniques over the past few years.

Machine Reading Comprehension

Paper
Add Code

Byzantine-Resilient Stochastic Gradient Descent for Distributed Learning: A Lipschitz-Inspired Coordinate-wise Median Approach

no code implementations • 10 Sep 2019 • Haibo Yang, Xin Zhang, Minghong Fang, Jia Liu

In this work, we consider the resilience of distributed algorithms based on stochastic gradient descent (SGD) in distributed learning with potentially Byzantine attackers, who could send arbitrary information to the parameter server to disrupt the training process.

Paper
Add Code

PolSAR Image Classification Based on Dilated Convolution and Pixel-Refining Parallel Mapping network in the Complex Domain

1 code implementation • 24 Sep 2019 • Dongling Xiao, Chang Liu, Qi. Wang, Chao Wang, Xin Zhang

For general supervised deep learning classification algorithms, the pixel-by-pixel algorithm achieves precise yet inefficient classification with a small number of labeled pixels, whereas the pixel mapping algorithm achieves efficient yet edge-rough classification with more prior labels required.

Classification General Classification +1

Paper
Code

ASYNCHRONOUS MULTI-AGENT GENERATIVE ADVERSARIAL IMITATION LEARNING

no code implementations • 25 Sep 2019 • Xin Zhang, Weixiao Huang, Renjie Liao, Yanhua Li

Imitation learning aims to inversely learn a policy from expert demonstrations, which has been extensively studied in the literature for both single-agent setting with Markov decision process (MDP) model, and multi-agent setting with Markov game (MG) model.

Imitation Learning

Paper
Add Code

Air-Writing Translater: A Novel Unsupervised Domain Adaptation Method for Inertia-Trajectory Translation of In-air Handwriting

no code implementations • 1 Nov 2019 • Songbin Xu, Yang Xue, Xin Zhang, Lianwen Jin

As a new way of human-computer interaction, inertial sensor based in-air handwriting can provide a natural and unconstrained interaction to express more complex and richer information in 3D space.

Translation Unsupervised Domain Adaptation

Paper
Add Code

Spectrogram-frame linear network and continuous frame sequence for bird sound classification

1 code implementation • Ecological Informatics 2019 • Xin Zhang, Aibin Chen, Guoxiong Zhou, Zhiqiang Zhang, Xibei Huang, Xiaohu Qiang

Inspired by that bird sound has various frequency distributions and continuous time-varying properties, a novel method is proposed for the classification of bird sound based on continuous frame sequence and spectrogram-frame linear network (SFLN).

Sound Classification

Paper
Code

Blockchain for Future Smart Grid: A Comprehensive Survey

1 code implementation • 8 Nov 2019 • Muhammad Baqer Mollah, Jun Zhao, Dusit Niyato, Kwok-Yan Lam, Xin Zhang, Amer M. Y. M. Ghias, Leong Hai Koh, Lei Yang

In this paper, we aim to provide a comprehensive survey on application of blockchain in smart grid.

Cryptography and Security Distributed, Parallel, and Cluster Computing Networking and Internet Architecture Social and Information Networks Systems and Control Systems and Control

Paper
Code

Finite-Time 4-Expert Prediction Problem

no code implementations • 22 Nov 2019 • Erhan Bayraktar, Ibrahim Ekren, Xin Zhang

We explicitly solve the nonlinear PDE that is the continuous limit of dynamic programming of \emph{expert prediction problem} in finite horizon setting with $N=4$ experts.

Paper
Add Code

ResNetX: a more disordered and deeper network architecture

no code implementations • 18 Dec 2019 • Wenfeng Feng, Xin Zhang, Guangpeng Zhao

Our architecture is a natural extension to ResNet, and can be integrated with existing state-of-the-art methods with little effort.

Image Classification

Paper
Add Code

Private and Communication-Efficient Edge Learning: A Sparse Differential Gaussian-Masking Distributed SGD Approach

no code implementations • 12 Jan 2020 • Xin Zhang, Minghong Fang, Jia Liu, Zhengyuan Zhu

In this paper, we consider the problem of jointly improving data privacy and communication efficiency of distributed edge learning, both of which are critical performance metrics in wireless edge network computing.

Paper
Add Code

Complexity of tree-coloring interval graphs equitably

no code implementations • 9 Mar 2020 • Bei Niu, Bi Li, Xin Zhang

An equitable tree-$k$-coloring of a graph is a vertex $k$-coloring such that each color class induces a forest and the size of any two color classes differ by at most one.

Combinatorics Discrete Mathematics

Paper
Add Code

Malicious Experts versus the multiplicative weights algorithm in online prediction

no code implementations • 18 Mar 2020 • Erhan Bayraktar, H. Vincent Poor, Xin Zhang

We assume that one of the experts is honest and makes correct prediction with probability $\mu$ at each round.

Paper
Add Code

HF-UNet: Learning Hierarchically Inter-Task Relevance in Multi-Task U-Net for Accurate Prostate Segmentation

no code implementations • 21 May 2020 • Kelei He, Chunfeng Lian, Bing Zhang, Xin Zhang, Xiaohuan Cao, Dong Nie, Yang Gao, Junfeng Zhang, Dinggang Shen

In this paper, we tackle the challenging task of prostate segmentation in CT images by a two-stage network with 1) the first stage to fast localize, and 2) the second stage to accurately segment the prostate.

Multi-Task Learning Segmentation

Paper
Add Code

Generalization Study of Quantum Neural Network

no code implementations • 2 Jun 2020 • JinZhe Jiang, Xin Zhang, Chen Li, YaQian Zhao, RenGang Li

In this model, we mapped the feature data to a quantum state in Hilbert space firstly, and then implement unitary evolution on it, in the end, we can get the classification result by im-plement measurement on the quantum state.

Paper
Add Code

A Computer Vision Aided Beamforming Scheme with EM Exposure Control in Outdoor LOS Scenarios

no code implementations • 14 Jun 2020 • Tianqi Xiang, Huiwen Li, Boren Guo, Xin Zhang

The proposed exposure avoidance method is validated in simulations, and the results show that the finer beam management granularity can guarantee communication quality while reducing the electromagnetic exposure.

Management

Paper
Add Code

A study of the effect of the illumination model on the generation of synthetic training datasets

no code implementations • 15 Jun 2020 • Xin Zhang, Ning Jia, Ioannis Ivrissimtzis

Our results show that the effect of the illumination model is important, comparable in significance to the network architecture.

Paper
Add Code

A Computer Vision Based Beamforming Scheme for Millimeter Wave Communication in LOS Scenarios

no code implementations • 20 Jun 2020 • Tianqi Xiang, Yaxin Wang, Huiwen Li, Boren Guo, Xin Zhang

A novel location-aware beamforming scheme for millimeter wave communication is proposed for line of sight (LOS) and low mobility scenarios, in which computer vision is introduced to derive the required position or spatial angular information from the image or video captured by camera(s) co-located with mmWave antenna array at base stations.

Position

Paper
Add Code

An Explainable 3D Residual Self-Attention Deep Neural Network FOR Joint Atrophy Localization and Alzheimer's Disease Diagnosis using Structural MRI

no code implementations • 10 Aug 2020 • Xin Zhang, Liangxiu Han, Wenyong Zhu, Liang Sun, Daoqiang Zhang

Different from the existing approaches, the novelty of our approach is three-fold: 1) A Residual Self-Attention Deep Neural Network has been proposed to capture local, global and spatial information of MR images to improve diagnostic performance; 2) An explanation method using Gradient-based Localization Class Activation mapping (Grad-CAM) has been introduced to improve the explainable of the proposed method; 3) This work has provided a full end-to-end learning solution for automated disease diagnosis.

Hippocampus

Paper
Add Code

Worst-Case-Aware Curriculum Learning for Zero and Few Shot Transfer

1 code implementation • 23 Sep 2020 • Sheng Zhang, Xin Zhang, Weiming Zhang, Anders Søgaard

Multi-task transfer learning based on pre-trained language encoders achieves state-of-the-art performance across a range of tasks.

Transfer Learning

Paper
Code

$f$-GAIL: Learning $f$-Divergence for Generative Adversarial Imitation Learning

1 code implementation • 2 Oct 2020 • Xin Zhang, Yanhua Li, Ziming Zhang, Zhi-Li Zhang

This naturally gives rise to the following question: Given a set of expert demonstrations, which divergence can recover the expert policy more accurately with higher data efficiency?

Imitation Learning

Paper
Code

A Novel Semi-Supervised Data-Driven Method for Chiller Fault Diagnosis with Unlabeled Data

no code implementations • 31 Oct 2020 • Bingxu Li, Fanyong Cheng, Xin Zhang, Can Cui, Wenjian Cai

Usually, the number of labeled data is limited and most data available are unlabeled.

Generative Adversarial Network

Paper
Add Code

Prediction against a limited adversary

no code implementations • 31 Oct 2020 • Erhan Bayraktar, Ibrahim Ekren, Xin Zhang

We study the problem of prediction with expert advice with adversarial corruption where the adversary can at most corrupt one expert.

Paper
Add Code

f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning

no code implementations • NeurIPS 2020 • Xin Zhang, Yanhua Li, Ziming Zhang, Zhi-Li Zhang

This naturally gives rise to the following question: Given a set of expert demonstrations, which divergence can recover the expert policy more accurately with higher data efficiency?

Imitation Learning

Paper
Add Code

K-core in percolated dense graph sequences

no code implementations • 17 Dec 2020 • Erhan Bayraktar, Suman Chakraborty, Xin Zhang

Keeping an edge $(i, j)$ of $G_n$ with probability $\min \{ {a^n_{i, j}}/{n}, 1 \}$ independently, we obtain a sequence of random graphs $G_n(\frac{1}{n})$.

Probability Combinatorics

Paper
Add Code

Super strong paramagnetism of aromatic peptides adsorbed with monovalent cations

no code implementations • 22 Dec 2020 • Shiqi Sheng, Haijun Yang, Liuhua Mu, Zixin Wang, Jihong Wang, Peng Xiu, Jun Hu, Xin Zhang, Feng Zhang, Haiping Fang

We experimentally demonstrated that the AYFFF self-assemblies adsorbed with various monovalent cations (Na+, K+, and Li+) show unexpectedly super strong paramagnetism.

Biological Physics

Paper
Add Code

Spectroscopic and Photometric Redshift Estimation by Neural Networks For the China Space Station Optical Survey (CSS-OS)

no code implementations • 7 Jan 2021 • Xingchen Zhou, Yan Gong, Xian-Min Meng, Xin Zhang, Ye Cao, Xuelei Chen, Valeria Amaro, Zuhui Fan, Liping Fu

This indicates that the neural network method is feasible and powerful for spec-z and photo-z estimations in future cosmological surveys.

Photometric Redshift Estimation Cosmology and Nongalactic Astrophysics

Paper
Add Code

Designing 24-hour Electrical Power Generator: Thermoradiative Device for Harvesting Energy from Sun and Outer Space

no code implementations • 14 Jan 2021 • Xin Zhang, Guofeng Yang, Mengqi Yan, Lay Kee Ang, Yee Sin Ang

Energy harvesting from sun and outer space using thermoradiative devices (TRD), despite being promising renewable energy sources, are limited only to daytime and nighttime period, respectively.

Applied Physics Mesoscale and Nanoscale Physics

Paper
Add Code

Relieving the $H_0$ tension with a new interacting dark energy model

no code implementations • 26 Jan 2021 • Li-Yang Gao, She-Sheng Xue, Xin Zhang

Here we explain this model as a scenario of vacuum energy interacting with matter and radiation.

Cosmology and Nongalactic Astrophysics General Relativity and Quantum Cosmology High Energy Physics - Phenomenology

Paper
Add Code

Phantom Bethe roots in the integrable open spin $1/2$ $XXZ$ chain

no code implementations • 5 Feb 2021 • Xin Zhang, Andreas Klümper, Vladislav Popkov

Bethe eigenstates are described by two complementary sets of Bethe Ansatz equations for regular roots, one for each invariant subspace.

Statistical Mechanics Mathematical Physics Mathematical Physics Quantum Physics

Paper
Add Code

Phantom Bethe excitations and spin helix eigenstates in integrable periodic and open spin chains

no code implementations • 5 Feb 2021 • Vladislav Popkov, Xin Zhang, Andreas Klümper

The phantom Bethe roots lead to degeneracies between different magnetization sectors in the periodic case and to the appearance of spin helix states (SHS), i. e. periodically modulated states of chiral nature in both open and closed systems.

Statistical Mechanics Mathematical Physics Mathematical Physics Quantum Physics

Paper
Add Code

Constraints on interacting dark energy models from time-delay cosmography with seven lensed quasars

no code implementations • 18 Feb 2021 • Ling-Feng Wang, Dong-Ze He, Jing-Fei Zhang, Xin Zhang

When the TD data are added to the CMB$+$BAO$+$SN$+H_0$ data, we find that: (i) the coupling parameter $\beta$ in all the considered IDE models is positive within 1$\sigma$ range, implying a mild preference for the case of cold dark matter decaying into dark energy; (ii) the IDE model with $Q = \beta H_{0} \rho_{\rm c}$ slightly relieves the $S_8$ tension, but the other considered IDE models further aggravate this tension; (iii) the Akaike information criteria of the IDE models with $Q \propto \rho_{\rm c}$ are lower than that of the $\Lambda$CDM model, indicating that these IDE models are more preferred by the current mainstream data.

Cosmology and Nongalactic Astrophysics General Relativity and Quantum Cosmology High Energy Physics - Phenomenology

Paper
Add Code

Genetic Algorithm based hyper-parameters optimization for transfer Convolutional Neural Network

no code implementations • 26 Feb 2021 • Chen Li, JinZhe Jiang, YaQian Zhao, RenGang Li, EnDong Wang, Xin Zhang, Kun Zhao

Decision of transfer layers and trainable layers is a major task for design of the transfer convolutional neural networks (CNN).

Hyperparameter Optimization

Paper
Add Code

Constructing long-short stock portfolio with a new listwise learn-to-rank algorithm

1 code implementation • 26 Apr 2021 • Xin Zhang, Lan Wu, Zhixue Chen

Our loss function, motivated by the long-short strategy, is endogenously shift-invariant and can be viewed as a direct generalization of ListMLE.

Binary Classification

Paper
Code

GT-STORM: Taming Sample, Communication, and Memory Complexities in Decentralized Non-Convex Learning

no code implementations • 4 May 2021 • Xin Zhang, Jia Liu, Zhengyuan Zhu, Elizabeth S. Bentley

Decentralized nonconvex optimization has received increasing attention in recent years in machine learning due to its advantages in system robustness, data privacy, and implementation simplicity.

Paper
Add Code

Cross-Modality Brain Tumor Segmentation via Bidirectional Global-to-Local Unsupervised Domain Adaptation

1 code implementation • 17 May 2021 • Kelei He, Wen Ji, Tao Zhou, Zhuoyuan Li, Jing Huo, Xin Zhang, Yang Gao, Dinggang Shen, Bing Zhang, Junfeng Zhang

Specifically, a bidirectional image synthesis and segmentation module is proposed to segment the brain tumor using the intermediate data distributions generated for the two domains, which includes an image-to-image translator and a shared-weighted segmentation network.

Brain Tumor Segmentation Image Generation +3

Paper
Code

Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition

1 code implementation • ACL 2021 • Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang, Pengjun Xie

Crowdsourcing is regarded as one prospective solution for effective supervised learning, aiming to build large-scale annotated training data by crowd workers.

Domain Adaptation named-entity-recognition +3

Paper
Code

Towards interpreting computer vision based on transformation invariant optimization

no code implementations • 18 Jun 2021 • Chen Li, JinZhe Jiang, Xin Zhang, Tonghuan Zhang, YaQian Zhao, Dongdong Jiang, RenGang Li

Interpreting how does deep neural networks (DNNs) make predictions is a vital field in artificial intelligence, which hinders wide applications of DNNs.

Paper
Add Code

Anarchic Federated Learning

no code implementations • 23 Aug 2021 • Haibo Yang, Xin Zhang, Prashant Khanduri, Jia Liu

To satisfy the need for flexible worker participation, we consider a new FL paradigm called "Anarchic Federated Learning" (AFL) in this paper.

Federated Learning

Paper
Add Code

Stabilized Likelihood-based Imitation Learning via Denoising Continuous Normalizing Flow

no code implementations • 29 Sep 2021 • Xin Zhang, Yanhua Li, Ziming Zhang, Christopher Brinton, Zhenming Liu, Zhi-Li Zhang, Hui Lu, Zhihong Tian

State-of-the-art imitation learning (IL) approaches, e. g, GAIL, apply adversarial training to minimize the discrepancy between expert and learner behaviors, which is prone to unstable training and mode collapse.

Denoising Imitation Learning

Paper
Add Code

Cellular Network Radio Propagation Modeling with Deep Convolutional Neural Networks

no code implementations • 5 Oct 2021 • Xin Zhang, Xiujun Shu, Bingwen Zhang, Jie Ren, Lizhou Zhou, Xin Chen

Deterministic models, such as ray tracing based on physical laws of wave propagation, are more accurate and site specific.

Paper
Add Code

CXR-Net: An Encoder-Decoder-Encoder Multitask Deep Neural Network for Explainable and Accurate Diagnosis of COVID-19 pneumonia with Chest X-ray Images

no code implementations • 20 Oct 2021 • Xin Zhang, Liangxiu Han, Tam Sobeih, Lianghao Han, Nina Dempsey, Symeon Lechareas, Ascanio Tridente, Haoming Chen, Stephen White

The proposed method can provide more detailed high resolution visual explanation for the classification decision, compared to current state-of-the-art visual explanation methods and has a great potential to be used in clinical practice for COVID-19 pneumonia diagnosis.

Pneumonia Detection

Paper
Add Code

The self-supervised spectral-spatial attention-based transformer network for automated, accurate prediction of crop nitrogen status from UAV imagery

no code implementations • 12 Nov 2021 • Xin Zhang, Liangxiu Han, Tam Sobeih, Lewis Lappin, Mark Lee, Andew Howard, Aron Kisdi

In this work, we propose a novel deep learning framework: a self-supervised spectral-spatial attention-based vision transformer (SSVT).

Self-Supervised Learning

Paper
Add Code

Can depth-adaptive BERT perform better on binary classification tasks

no code implementations • 22 Nov 2021 • Jing Fan, Xin Zhang, Sheng Zhang, Yan Pan, Lixiang Guo

In light of the success of transferring language models into NLP tasks, we ask whether the full BERT model is always the best and does it exist a simple but effective method to find the winning ticket in state-of-the-art deep neural networks without complex calculations.

Binary Classification Text Classification

Paper
Add Code

CoDiM: Learning with Noisy Labels via Contrastive Semi-Supervised Learning

no code implementations • 23 Nov 2021 • Xin Zhang, Zixuan Liu, Kaiwen Xiao, Tian Shen, Junzhou Huang, Wei Yang, Dimitris Samaras, Xiao Han

Labels are costly and sometimes unreliable.

Ranked #5 on Image Classification on mini WebVision 1.0

Contrastive Learning Learning with noisy labels

Paper
Add Code

Domain Prompt Learning for Efficiently Adapting CLIP to Unseen Domains

1 code implementation • 25 Nov 2021 • Xin Zhang, Shixiang Shane Gu, Yutaka Matsuo, Yusuke Iwasawa

We propose Domain Prompt Learning (DPL) as a novel approach for domain inference in the form of conditional prompt generation.

Ranked #4 on Transfer Learning on Office-Home

Domain Generalization Image Classification +2

Paper
Code

Taming Communication and Sample Complexities in Decentralized Policy Evaluation for Cooperative Multi-Agent Reinforcement Learning

no code implementations • NeurIPS 2021 • Xin Zhang, Zhuqing Liu, Jia Liu, Zhengyuan Zhu, Songtao Lu

To our knowledge, this paper is the first work that achieves both $\mathcal{O}(\epsilon^{-2})$ sample complexity and $\mathcal{O}(\epsilon^{-2})$ communication complexity in decentralized policy evaluation for cooperative MARL.

Multi-agent Reinforcement Learning Reinforcement Learning (RL) +1

Paper
Add Code

Boundary Aware Learning for Out-of-distribution Detection

no code implementations • 22 Dec 2021 • Sen Pei, Xin Zhang, Richard Yida Xu, Gaofeng Meng

This paper focuses on the problem of detecting out-of-distribution (ood) samples with neural nets.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

Paper
Add Code

TR-MISR: Multiimage Super-Resolution Based on Feature Fusion With Transformers

1 code implementation • IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2022 • Tai An, Xin Zhang, Chunlei Huo, Bin Xue, Lingfeng Wang, Chunhong Pan

In addition, TR-MISR adopts an additional learnable embedding vector that fuses these vectors to restore the details to the greatest extent. TR-MISR has successfully applied the transformer to MISR tasks for the first time, notably reducing the difficulty of training the transformer by ignoring the spatial relations of image patches.

Ranked #1 on Multi-Frame Super-Resolution on PROBA-V

Multi-Frame Super-Resolution

Paper
Code

Fast and Robust Sparsity Learning over Networks: A Decentralized Surrogate Median Regression Approach

no code implementations • 11 Feb 2022 • Weidong Liu, Xiaojun Mao, Xin Zhang

Decentralized sparsity learning has attracted a significant amount of attention recently due to its rapidly growing applications.

regression

Paper
Add Code

Adaptive Graph Convolutional Networks for Weakly Supervised Anomaly Detection in Videos

no code implementations • 14 Feb 2022 • Congqi Cao, Xin Zhang, Shizhou Zhang, Peng Wang, Yanning Zhang

For weakly supervised anomaly detection, most existing work is limited to the problem of inadequate video representation due to the inability of modeling long-term contextual information.

Graph Learning Supervised Anomaly Detection +1

Paper
Add Code

Priming Cross-Session Motor Imagery Classification with A Universal Deep Domain Adaptation Framework

3 code implementations • 19 Feb 2022 • Zhengqing Miao, Xin Zhang, Carlo Menon, Yelong Zheng, Meirong Zhao, Dong Ming

Compared to the vanilla EEGNet and ConvNet, the proposed SDDA framework was able to boost the MI classification accuracy by 15. 2%, 10. 2% respectively in IIA dataset, and 5. 5%, 4. 2% in IIB dataset.

Domain Adaptation EEG +1

Paper
Code

Contrastive-mixup learning for improved speaker verification

no code implementations • 22 Feb 2022 • Xin Zhang, Minho Jin, Roger Cheng, Ruirui Li, Eunjung Han, Andreas Stolcke

In this work, we propose contrastive-mixup, a novel augmentation strategy that learns distinguishing representations based on a distance metric.

Data Augmentation Metric Learning +1

Paper
Add Code

Identifying Chinese Opinion Expressions with Extremely-Noisy Crowdsourcing Annotations

1 code implementation • ACL 2022 • Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang, Xiaobin Wang, Min Zhang

Recent works of opinion expression identification (OEI) rely heavily on the quality and scale of the manually-constructed training corpus, which could be extremely difficult to satisfy.

Paper
Code

Cross-Camera Trajectories Help Person Retrieval in a Camera Network

1 code implementation • 27 Apr 2022 • Xin Zhang, Xiaohua Xie, JianHuang Lai, Wei-Shi Zheng

To address this issue, we propose a pedestrian retrieval framework based on cross-camera trajectory generation, which integrates both temporal and spatial information.

Person Retrieval Re-Ranking +1

Paper
Code

A Coupling Enhancement Algorithm for ZrO2 Ceramic Bearing Ball Surface Defect Detection Based on Cartoon-texture Decomposition Model and Multi-Scale Filtering Method

no code implementations • 23 May 2022 • Wei Wang, Xin Zhang, Jiaqi Yi, Xianqi Liao, Wenjie Li, Zhenhong Li

The experimental results show that the image denoising method of ZrO2 ceramic bearing ball surface defect based on cartoon-texture decomposition model can denoise while retaining the image details.

Defect Detection Image Denoising +1

Paper
Add Code

Gigapixel Whole-Slide Images Classification using Locally Supervised Learning

1 code implementation • 17 Jul 2022 • Jingwei Zhang, Xin Zhang, Ke Ma, Rajarsi Gupta, Joel Saltz, Maria Vakalopoulou, Dimitris Samaras

Histopathology whole slide images (WSIs) play a very important role in clinical studies and serve as the gold standard for many cancer diagnoses.

Classification Multiple Instance Learning +1

Paper
Code

Deep Learning for Unsupervised Anomaly Localization in Industrial Images: A Survey

no code implementations • 21 Jul 2022 • Xian Tao, Xinyi Gong, Xin Zhang, Shaohua Yan, Chandranath Adak

This paper aims to help researchers in this field by comprehensively surveying recent achievements in unsupervised anomaly localization in industrial images using deep learning.

Paper
Add Code

Layer-Wise Partitioning and Merging for Efficient and Scalable Deep Learning

no code implementations • 22 Jul 2022 • Samson B. Akintoye, Liangxiu Han, Huw Lloyd, Xin Zhang, Darren Dancey, Haoming Chen, Daoqiang Zhang

Deep Neural Network (DNN) models are usually trained sequentially from one layer to another, which causes forward, backward and update locking's problems, leading to poor performance in terms of training time.

Paper
Add Code

INTERACT: Achieving Low Sample and Communication Complexities in Decentralized Bilevel Learning over Networks

no code implementations • 27 Jul 2022 • Zhuqing Liu, Xin Zhang, Prashant Khanduri, Songtao Lu, Jia Liu

Our main contributions in this paper are two-fold: i) We first propose a deterministic algorithm called INTERACT (inner-gradient-descent-outer-tracked-gradient) that requires the sample complexity of $\mathcal{O}(n \epsilon^{-1})$ and communication complexity of $\mathcal{O}(\epsilon^{-1})$ to solve the bilevel optimization problem, where $n$ and $\epsilon > 0$ are the number of samples at each agent and the desired stationarity gap, respectively.

Bilevel Optimization Meta-Learning +1

Paper
Add Code

Adaptive Domain Generalization via Online Disagreement Minimization

no code implementations • 3 Aug 2022 • Xin Zhang, Ying-Cong Chen

Domain Generalization (DG) aims to safely transfer a model to unseen target domains by only relying on a set of source domains.

Domain Generalization

Paper
Add Code

NET-FLEET: Achieving Linear Convergence Speedup for Fully Decentralized Federated Learning with Heterogeneous Data

no code implementations • 17 Aug 2022 • Xin Zhang, Minghong Fang, Zhuqing Liu, Haibo Yang, Jia Liu, Zhengyuan Zhu

Moreover, whether or not the linear speedup for convergence is achievable under fully decentralized FL with data heterogeneity remains an open question.

Federated Learning Open-Ended Question Answering

Paper
Add Code

SYNTHESIS: A Semi-Asynchronous Path-Integrated Stochastic Gradient Method for Distributed Learning in Computing Clusters

no code implementations • 17 Aug 2022 • Zhuqing Liu, Xin Zhang, Jia Liu

To increase the training speed of distributed learning, recent years have witnessed a significant amount of interest in developing both synchronous and asynchronous distributed stochastic variance-reduced optimization methods.

Paper
Add Code

PLMCL: Partial-Label Momentum Curriculum Learning for Multi-Label Image Classification

no code implementations • 22 Aug 2022 • Rabab Abdelfattah, Xin Zhang, Zhenyao Wu, Xinyi Wu, XiaoFeng Wang, Song Wang

A special case is to annotate only one positive label in each training image.

Multi-Label Classification Multi-Label Image Classification +1

Paper
Add Code

Domain-Specific NER via Retrieving Correlated Samples

1 code implementation • COLING 2022 • Xin Zhang, Yong Jiang, Xiaobin Wang, Xuming Hu, Yueheng Sun, Pengjun Xie, Meishan Zhang

Successful Machine Learning based Named Entity Recognition models could fail on texts from some special domains, for instance, Chinese addresses and e-commerce titles, where requires adequate background knowledge.

Named Entity Recognition

Paper
Code

Swin-transformer-yolov5 For Real-time Wine Grape Bunch Detection

no code implementations • 30 Aug 2022 • Shenglian Lu, Xiaoyu Liu, Zixaun He, Wenbo Liu, Xin Zhang, Manoj Karkee

Results showed that the proposed Swin-T-YOLOv5 outperformed all other studied models for grape bunch detection, with up to 97% of mean Average Precision (mAP) and 0. 89 of F1-score when the weather was cloudy.

Paper
Add Code

A PDE approach for regret bounds under partial monitoring

no code implementations • 2 Sep 2022 • Erhan Bayraktar, Ibrahim Ekren, Xin Zhang

In this paper, we study a learning problem in which a forecaster only observes partial information.

Paper
Add Code

ReX: A Framework for Incorporating Temporal Information in Model-Agnostic Local Explanation Techniques

no code implementations • 8 Sep 2022 • Junhao Liu, Xin Zhang

To address this limitation, we propose ReX, a general framework for adapting various explanation techniques to models that process variable-length inputs, expanding explanation coverage to data points of different lengths.

Anomaly Detection Sentiment Analysis

Paper
Add Code

Graph Contrastive Learning with Personalized Augmentation

no code implementations • 14 Sep 2022 • Xin Zhang, Qiaoyu Tan, Xiao Huang, Bo Li

Thus, blindly augmenting all graphs without considering their individual characteristics may undermine the performance of GCL arts. To deal with this, we propose the first principled framework, termed as \textit{G}raph contrastive learning with \textit{P}ersonalized \textit{A}ugmentation (GPA), to advance conventional GCL by allowing each graph to choose its own suitable augmentation operations. In essence, GPA infers tailored augmentation strategies for each graph based on its topology and node attributes via a learnable augmentation selector, which is a plug-and-play module and can be effectively trained with downstream GCL models end-to-end.

Contrastive Learning Data Augmentation

Paper
Add Code

Passive Non-line-of-sight Imaging for Moving Targets with an Event Camera

no code implementations • 27 Sep 2022 • Conghe Wang, Yutong He, Xia Wang, Honghao Huang, Changda Yan, Xin Zhang, Hongwei Chen

Non-line-of-sight (NLOS) imaging is an emerging technique for detecting objects behind obstacles or around corners.

Paper
Add Code

SAGDA: Achieving $\mathcal{O}(ε^{-2})$ Communication Complexity in Federated Min-Max Learning

no code implementations • 2 Oct 2022 • Haibo Yang, Zhuqing Liu, Xin Zhang, Jia Liu

To lower the communication complexity of federated min-max learning, a natural approach is to utilize the idea of infrequent communications (through multiple local updates) same as in conventional federated learning.

Federated Learning

Paper
Add Code

FedMT: Federated Learning with Mixed-type Labels

no code implementations • 5 Oct 2022 • Qiong Zhang, Jing Peng, Xin Zhang, Aline Talhouk, Gang Niu, Xiaoxiao Li

In federated learning (FL), classifiers (e. g., deep networks) are trained on datasets from multiple data centers without exchanging data across them, which improves the sample efficiency.

Federated Learning Vocal Bursts Type Prediction

Paper
Add Code

G2NetPL: Generic Game-Theoretic Network for Partial-Label Image Classification

no code implementations • 20 Oct 2022 • Rabab Abdelfattah, Xin Zhang, Mostafa M. Fouda, XiaoFeng Wang, Song Wang

To effectively address partial-label classification, this paper proposes an end-to-end Generic Game-theoretic Network (G2NetPL) for partial-label learning, which can be applied to most partial-label settings, including a very challenging, but annotation-efficient case where only a subset of the training images are labeled, each with only one positive label, while the rest of the training images remain unlabeled.

Multi-Label Classification Multi-Label Image Classification +2

Paper
Add Code

Extending Phrase Grounding with Pronouns in Visual Dialogues

1 code implementation • 23 Oct 2022 • Panzhong Lu, Xin Zhang, Meishan Zhang, Min Zhang

First, we construct a dataset of phrase grounding with both noun phrases and pronouns to image regions.

Phrase Grounding

Paper
Code

Depth Monocular Estimation with Attention-based Encoder-Decoder Network from Single Image

no code implementations • 24 Oct 2022 • Xin Zhang, Rabab Abdelfattah, Yuqi Song, Samuel A. Dauchert, XiaoFeng Wang

Depth information is the foundation of perception, essential for autonomous driving, robotics, and other source-constrained applications.

Autonomous Driving SSIM

Paper
Add Code

An Effective Approach for Multi-label Classification with Missing Labels

no code implementations • 24 Oct 2022 • Xin Zhang, Rabab Abdelfattah, Yuqi Song, XiaoFeng Wang

Through comprehensive experiments on three large-scale multi-label image datasets, i. e. MS-COCO, NUS-WIDE, and Pascal VOC12, we show that our method can handle the imbalance between positive labels and negative labels, while still outperforming existing missing-label learning approaches in most cases, and in some cases even approaches with fully labeled datasets.

Classification Missing Labels +2

Paper
Add Code

Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech

no code implementations • 4 Nov 2022 • Xin Zhang, Iván Vallés-Pérez, Andreas Stolcke, Chengzhu Yu, Jasha Droppo, Olabanji Shonibare, Roberto Barra-Chicote, Venkatesh Ravichandran

By fine-tuning an ASR model on synthetic stuttered speech we are able to reduce word error by 5. 7% relative on stuttered utterances, with only minor (<0. 2% relative) degradation for fluent utterances.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Weakly Supervised Video Anomaly Detection Based on Cross-Batch Clustering Guidance

no code implementations • 16 Dec 2022 • Congqi Cao, Xin Zhang, Shizhou Zhang, Peng Wang, Yanning Zhang

To enhance the discriminative power of features, we propose a batch clustering based loss to encourage a clustering branch to generate distinct normal and abnormal clusters based on a batch of data.

Anomaly Detection Clustering +1

Paper
Add Code

Bring Your Own View: Graph Neural Networks for Link Prediction with Personalized Subgraph Selection

1 code implementation • 23 Dec 2022 • Qiaoyu Tan, Xin Zhang, Ninghao Liu, Daochen Zha, Li Li, Rui Chen, Soo-Hyun Choi, Xia Hu

To bridge the gap, we introduce a Personalized Subgraph Selector (PS2) as a plug-and-play framework to automatically, personally, and inductively identify optimal subgraphs for different edges when performing GNNLP.

Link Prediction

Paper
Code

Ordinal Label Distribution Learning

no code implementations • ICCV 2023 • Changsong Wen, Xin Zhang, Xingxu Yao, Jufeng Yang

Therefore, we propose a new paradigm, termed ordinal label distribution learning (OLDL).

Age Estimation

Paper
Add Code

How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection

1 code implementation • 18 Jan 2023 • Biyang Guo, Xin Zhang, Ziyuan Wang, Minqi Jiang, Jinran Nie, Yuxuan Ding, Jianwei Yue, Yupeng Wu

We call the collected dataset the Human ChatGPT Comparison Corpus (HC3).

1,190

Paper
Code

sMRI-PatchNet: A novel explainable patch-based deep learning network for Alzheimer's disease diagnosis and discriminative atrophy localisation with Structural MRI

no code implementations • 17 Feb 2023 • Xin Zhang, Liangxiu Han, Lianghao Han, Haoming Chen, Darren Dancey, Daoqiang Zhang

Specifically, it consists of two primary components: 1) A fast and efficient explainable patch selection mechanism for determining the most discriminative patches based on computing the SHapley Additive exPlanations (SHAP) contribution to a transfer learning model for AD diagnosis on massive medical data; and 2) A novel patch-based network for extracting deep features and AD classfication from the selected patches with position embeddings to retain position information, capable of capturing the global and local information of inter- and intra-patches.

Position Transfer Learning

Paper
Add Code

Zero-Shot Information Extraction via Chatting with ChatGPT

1 code implementation • 20 Feb 2023 • Xiang Wei, Xingyu Cui, Ning Cheng, Xiaobin Wang, Xin Zhang, Shen Huang, Pengjun Xie, Jinan Xu, Yufeng Chen, Meishan Zhang, Yong Jiang, Wenjuan Han

Zero-shot information extraction (IE) aims to build IE systems from the unannotated text.

Event Extraction named-entity-recognition +3

718

Paper
Code

Knowledge-infused Contrastive Learning for Urban Imagery-based Socioeconomic Prediction

1 code implementation • 25 Feb 2023 • Yu Liu, Xin Zhang, Jingtao Ding, Yanxin Xi, Yong Li

To address such issues, in this paper, we propose a Knowledge-infused Contrastive Learning (KnowCL) model for urban imagery-based socioeconomic prediction.

Contrastive Learning Representation Learning

Paper
Code

PRECISION: Decentralized Constrained Min-Max Learning with Low Communication and Sample Complexities

no code implementations • 5 Mar 2023 • Zhuqing Liu, Xin Zhang, Songtao Lu, Jia Liu

Decentralized min-max optimization problems with domain constraints underpins many important ML applications, including multi-agent ML fairness assurance, and policy evaluations in multi-agent reinforcement learning.

Fairness Multi-agent Reinforcement Learning

Paper
Add Code

Co-Occurrence Matters: Learning Action Relation for Temporal Action Localization

no code implementations • 15 Mar 2023 • Congqi Cao, Yizhe WANG, Yue Lu, Xin Zhang, Yanning Zhang

Existing works in this field mainly suffer from two weaknesses: (1) They often neglect the multi-label case and only focus on temporal modeling.

Relation Temporal Action Localization

Paper
Add Code

Machine Learning Automated Approach for Enormous Synchrotron X-Ray Diffraction Data Interpretation

no code implementations • 20 Mar 2023 • Xiaodong Zhao, YiXuan Luo, Juejing Liu, Wenjun Liu, Kevin M. Rosso, Xiaofeng Guo, Tong Geng, Ang Li, Xin Zhang

This study highlighted the importance of labeled experimental patterns on the training of DNN models to solve u-XRD mapping data from in-situ experiments involving liquid phase.

Paper
Add Code

Distribution-restrained Softmax Loss for the Model Robustness

no code implementations • 22 Mar 2023 • Hao Wang, Chen Li, JinZhe Jiang, Xin Zhang, YaQian Zhao, Weifeng Gong

Recently, the robustness of deep learning models has received widespread attention, and various methods for improving model robustness have been proposed, including adversarial training, model architecture modification, design of loss functions, certified defenses, and so on.

Paper
Add Code

A Heterogeneous Parallel Non-von Neumann Architecture System for Accurate and Efficient Machine Learning Molecular Dynamics

no code implementations • 26 Mar 2023 • Zhuoying Zhao, Ziling Tan, Pinghui Mo, Xiaonan Wang, Dan Zhao, Xin Zhang, Ming Tao, Jie Liu

This paper proposes a special-purpose system to achieve high-accuracy and high-efficiency machine learning (ML) molecular dynamics (MD) calculations.

Atomic Forces

Paper
Add Code

LMDA-Net:A lightweight multi-dimensional attention network for general EEG-based brain-computer interface paradigms and interpretability

2 code implementations • 29 Mar 2023 • Zhengqing Miao, Xin Zhang, Meirong Zhao, Dong Ming

By incorporating two novel attention modules designed specifically for EEG signals, the channel attention module and the depth attention module, LMDA-Net can effectively integrate features from multiple dimensions, resulting in improved classification performance across various BCI tasks.

EEG Motor Imagery

Paper
Code

D-Score: A White-Box Diagnosis Score for CNNs Based on Mutation Operators

no code implementations • 3 Apr 2023 • Xin Zhang, Yuqi Song, XiaoFeng Wang, Fei Zuo

However, concerns have been raised with respect to the trustworthiness of these models: The standard testing method evaluates the performance of a model on a test set, while low-quality and insufficient test sets can lead to unreliable evaluation results, which can have unforeseeable consequences.

Autonomous Driving Data Augmentation +1

Paper
Add Code

RFAConv: Innovating Spatial Attention and Standard Convolutional Operation

1 code implementation • 6 Apr 2023 • Xin Zhang, Chen Liu, Degang Yang, Tingting Song, Yichen Ye, Ke Li, Yingze Song

In this paper, we propose a new perspective on the effectiveness of spatial attention, which is that the spatial attention mechanism essentially solves the problem of convolutional kernel parameter sharing.

Classification Object Detection +1

116

Paper
Code

SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities

1 code implementation • 18 May 2023 • Dong Zhang, ShiMin Li, Xin Zhang, Jun Zhan, Pengyu Wang, Yaqian Zhou, Xipeng Qiu

Multi-modal large language models are regarded as a crucial step towards Artificial General Intelligence (AGI) and have garnered significant interest with the emergence of ChatGPT.

Language Modelling Large Language Model +2

889

Paper
Code

GDA: Generative Data Augmentation Techniques for Relation Extraction Tasks

no code implementations • 26 May 2023 • Xuming Hu, Aiwei Liu, Zeqi Tan, Xin Zhang, Chenwei Zhang, Irwin King, Philip S. Yu

These techniques neither preserve the semantic consistency of the original sentences when rule-based augmentations are adopted, nor preserve the syntax structure of sentences when expressing relations using seq2seq models, resulting in less diverse augmentations.

Data Augmentation Relation +1

Paper
Add Code

ELSA: Efficient Label Shift Adaptation through the Lens of Semiparametric Models

no code implementations • 30 May 2023 • Qinglong Tian, Xin Zhang, Jiwei Zhao

We study the domain adaptation problem with label shift in this work.

Computational Efficiency Domain Adaptation

Paper
Add Code

Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model

no code implementations • 13 Jun 2023 • Xin Zhang, Jiaxian Guo, Paul Yoo, Yutaka Matsuo, Yusuke Iwasawa

To guarantee the visual coherence of the generated or edited image, we introduce an inpainting and harmonizing module to guide the pre-trained diffusion model to seamlessly blend the inserted subject into the scene naturally.

Denoising Image Generation +1

Paper
Add Code

Deep learning radiomics for assessment of gastroesophageal varices in people with compensated advanced chronic liver disease

no code implementations • 13 Jun 2023 • Lan Wang, Ruiling He, Lili Zhao, Jia Wang, Zhengzi Geng, Tao Ren, Guo Zhang, Peng Zhang, Kaiqiang Tang, Chaofei Gao, Fei Chen, Liting Zhang, Yonghe Zhou, Xin Li, Fanbin He, Hui Huan, Wenjuan Wang, Yunxiao Liang, Juan Tang, Fang Ai, Tingyu Wang, Liyun Zheng, Zhongwei Zhao, Jiansong Ji, Wei Liu, Jiaojiao Xu, Bo Liu, Xuemei Wang, Yao Zhang, Qiong Yan, Muhan Lv, Xiaomei Chen, Shuhua Zhang, Yihua Wang, Yang Liu, Li Yin, Yanni Liu, Yanqing Huang, Yunfang Liu, Kun Wang, Meiqin Su, Li Bian, Ping An, Xin Zhang, Linxue Qian, Shao Li, Xiaolong Qi

Validation analysis revealed that the AUCs of DLRP were 0. 91 for GEV (95% CI 0. 90 to 0. 93, p < 0. 05) and 0. 88 for HRV (95% CI 0. 86 to 0. 89, p < 0. 01), which were significantly and robustly better than canonical risk indicators, including the value of LSM and SSM.

Paper
Add Code

Attention Hybrid Variational Net for Accelerated MRI Reconstruction

no code implementations • 21 Jun 2023 • Guoyao Shen, Boran Hao, Mengyu Li, Chad W. Farris, Ioannis Ch. Paschalidis, Stephan W. Anderson, Xin Zhang

However, the drawback of these structures is that they are not fully utilizing the information from both domains (k-space and image).

MRI Reconstruction

Paper
Add Code

SHISRCNet: Super-resolution And Classification Network For Low-resolution Breast Cancer Histopathology Image

1 code implementation • 25 Jun 2023 • Luyuan Xie, Cong Li, ZiRui Wang, Xin Zhang, Boyan Chen, Qingni Shen, Zhonghai Wu

CF module extracts and fuses the multi-scale features of SR images for classification.

Histopathological Image Classification Image Classification +1

Paper
Code

A generic self-supervised learning (SSL) framework for representation learning from spectra-spatial feature of unlabeled remote sensing imagery

no code implementations • 27 Jun 2023 • Xin Zhang, Liangxiu Han

The success of SSL is heavily dependent on a pre-designed pretext task, which introduces an inductive bias into the model from a large amount of unlabelled data.

Earth Observation Inductive Bias +4

Paper
Add Code

Early Autism Diagnosis based on Path Signature and Siamese Unsupervised Feature Compressor

no code implementations • 12 Jul 2023 • Zhuowen Yin, Xinyao Ding, Xin Zhang, Zhengwang Wu, Li Wang, Gang Li

Specifically, we propose a Siamese verification framework to extend the scarce data, and an unsupervised compressor to alleviate data imbalance by extracting key features.

Paper
Add Code

Joint Service Caching, Communication and Computing Resource Allocation in Collaborative MEC Systems: A DRL-based Two-timescale Approach

no code implementations • 19 Jul 2023 • Qianqian Liu, Haixia Zhang, Xin Zhang, Dongfeng Yuan

Meeting the strict Quality of Service (QoS) requirements of terminals has imposed a signiffcant challenge on Multiaccess Edge Computing (MEC) systems, due to the limited multidimensional resources.

Edge-computing

Paper
Add Code

PPN: Parallel Pointer-based Network for Key Information Extraction with Complex Layouts

no code implementations • 20 Jul 2023 • Kaiwen Wei, Jie Yao, Jingyuan Zhang, Yangyang Kang, Fubang Zhao, Yating Zhang, Changlong Sun, Xin Jin, Xin Zhang

Firstly, the layout of existing datasets is relatively fixed and limited in the number of semantic entity categories, creating a significant gap between these datasets and the complex real-world scenarios.

Key Information Extraction

Paper
Add Code

Collaborative Graph Neural Networks for Attributed Network Embedding

1 code implementation • 22 Jul 2023 • Qiaoyu Tan, Xin Zhang, Xiao Huang, Hao Chen, Jundong Li, Xia Hu

Graph neural networks (GNNs) have shown prominent performance on attributed network embedding.

Attribute Network Embedding

Paper
Code

Holistic Exploration on Universal Decompositional Semantic Parsing: Architecture, Data Augmentation, and LLM Paradigm

1 code implementation • 25 Jul 2023 • Hexuan Deng, Xin Zhang, Meishan Zhang, Xuebo Liu, Min Zhang

In this paper, we conduct a holistic exploration of the Universal Decompositional Semantic (UDS) Parsing.

Attribute Data Augmentation +1

Paper
Code

Evaluating Large Language Models for Radiology Natural Language Processing

1 code implementation • 25 Jul 2023 • Zhengliang Liu, Tianyang Zhong, Yiwei Li, Yutong Zhang, Yi Pan, Zihao Zhao, Peixin Dong, Chao Cao, Yuxiao Liu, Peng Shu, Yaonai Wei, Zihao Wu, Chong Ma, Jiaqi Wang, Sheng Wang, Mengyue Zhou, Zuowei Jiang, Chunlin Li, Jason Holmes, Shaochen Xu, Lu Zhang, Haixing Dai, Kai Zhang, Lin Zhao, Yuanhao Chen, Xu Liu, Peilong Wang, Pingkun Yan, Jun Liu, Bao Ge, Lichao Sun, Dajiang Zhu, Xiang Li, Wei Liu, Xiaoyan Cai, Xintao Hu, Xi Jiang, Shu Zhang, Xin Zhang, Tuo Zhang, Shijie Zhao, Quanzheng Li, Hongtu Zhu, Dinggang Shen, Tianming Liu

The rise of large language models (LLMs) has marked a pivotal shift in the field of natural language processing (NLP).

792

Paper
Code

Data-Driven Modeling with Experimental Augmentation for the Modulation Strategy of the Dual-Active-Bridge Converter

no code implementations • 30 Jul 2023 • Xinze Li, Josep Pou, Jiaxin Dong, Fanfan Lin, Changyun Wen, Suvajit Mukherjee, Xin Zhang

The D2EA approach is instantiated for the efficiency optimization of a hybrid modulation for neutral-point-clamped dual-active-bridge (NPC-DAB) converter.

Paper
Add Code

Towards Imbalanced Large Scale Multi-label Classification with Partially Annotated Labels

no code implementations • 31 Jul 2023 • Xin Zhang, Yuqi Song, Fei Zuo, XiaoFeng Wang

In this work, we address the issue of label imbalance and investigate how to train classifiers using partial labels in large labeling spaces.

Multi-Label Classification

Paper
Add Code

Artificial-Intelligence-Based Hybrid Extended Phase Shift Modulation for the Dual Active Bridge Converter with Full ZVS Range and Optimal Efficiency

no code implementations • 1 Aug 2023 • Xinze Li, Xin Zhang, Fanfan Lin, Changjiang Sun, Kezhi Mao

ZVS range and efficiency are two significant performance indicators for DAB converter.

Ensemble Learning

Paper
Add Code

Artificial-Intelligence-Based Triple Phase Shift Modulation for Dual Active Bridge Converter with Minimized Current Stress

no code implementations • 1 Aug 2023 • Xinze Li, Xin Zhang, Fanfan Lin, Changjiang Sun, Kezhi Mao

However, to minimize the current stress when the DAB converter is under TPS modulation, two difficulties exist in analysis process and realization process, respectively.

Paper
Add Code

Particle swarm optimization with state-based adaptive velocity limit strategy

no code implementations • 2 Aug 2023 • Xinze Li, Kezhi Mao, Fanfan Lin, Xin Zhang

Several adaptive VL strategies have been introduced with which the performance of PSO can be improved.

Paper
Add Code

Towards General Text Embeddings with Multi-stage Contrastive Learning

no code implementations • 7 Aug 2023 • Zehan Li, Xin Zhang, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang

We present GTE, a general-purpose text embedding model trained with multi-stage contrastive learning.

Contrastive Learning Unsupervised Pre-training

Paper
Add Code

Real-time Strawberry Detection Based on Improved YOLOv5s Architecture for Robotic Harvesting in open-field environment

no code implementations • 8 Aug 2023 • Zixuan He, Salik Ram Khanal, Xin Zhang, Manoj Karkee, Qin Zhang

This study proposed a YOLOv5-based custom object detection model to detect strawberries in an outdoor environment.

object-detection Object Detection

Paper
Add Code

MRI Field-transfer Reconstruction with Limited Data: Regularization by Neural Style Transfer

no code implementations • 21 Aug 2023 • Guoyao Shen, Yancheng Zhu, Hernan Jara, Sean B. Andersson, Chad W. Farris, Stephan Anderson, Xin Zhang

This enables RNST to reconstruct a high-quality image from a noisy low-quality image with different image styles and limited data.

Deblurring Denoising +3

Paper
Add Code

SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models

3 code implementations • 31 Aug 2023 • Xin Zhang, Dong Zhang, ShiMin Li, Yaqian Zhou, Xipeng Qiu

Therefore, we propose SpeechTokenizer, a unified speech tokenizer for speech large language models.

Language Modelling Quantization

273

Paper
Code

ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

no code implementations • 8 Oct 2023 • Tianyang Zhong, Wei Zhao, Yutong Zhang, Yi Pan, Peixin Dong, Zuowei Jiang, Xiaoyan Kui, Youlan Shang, Li Yang, Yaonai Wei, Longtao Yang, Hao Chen, Huan Zhao, Yuxiao Liu, Ning Zhu, Yiwei Li, Yisong Wang, Jiaqi Yao, Jiaqi Wang, Ying Zeng, Lei He, Chao Zheng, Zhixue Zhang, Ming Li, Zhengliang Liu, Haixing Dai, Zihao Wu, Lu Zhang, Shu Zhang, Xiaoyan Cai, Xintao Hu, Shijie Zhao, Xi Jiang, Xin Zhang, Xiang Li, Dajiang Zhu, Lei Guo, Dinggang Shen, Junwei Han, Tianming Liu, Jun Liu, Tuo Zhang

Radiology report generation, as a key step in medical image analysis, is critical to the quantitative analysis of clinically informed decision-making levels.

Decision Making Language Modelling +1

Paper
Add Code

Language Models are Universal Embedders

1 code implementation • 12 Oct 2023 • Xin Zhang, Zehan Li, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang

As such cases span from English to other natural or programming languages, from retrieval to classification and beyond, it is desirable to build a unified embedding model rather than dedicated ones for each scenario.

Code Search Language Modelling +2

Paper
Code

Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model

no code implementations • 16 Oct 2023 • Junpeng Tan, Xin Zhang, Yao Lv, Xiangmin Xu, Gang Li

Finally, the experimental results on real-world fetal brain MRI stacks demonstrate the state-of-the-art performance of our method.

3D Reconstruction Super-Resolution

Paper
Add Code

TBDLNet: a network for classifying multidrug-resistant and drug-sensitive tuberculosis

no code implementations • 27 Oct 2023 • Ziquan Zhu, Jing Tao, Shuihua Wang, Xin Zhang, Yudong Zhang

Five indexes are selected in this paper, which are accuracy, sensitivity, precision, F1-score, and specificity.

Specificity

Paper
Add Code

Map-assisted TDOA Localization Enhancement Based On CNN

no code implementations • 2 Nov 2023 • YiWen Chen, Tianqi Xiang, Xi Chen, Xin Zhang

For signal processing related to localization technologies, non line of sight (NLOS) multipaths have a significant impact on the localization error level.

Paper
Add Code

Reconfigurable Intelligent Surface & Edge -- An Introduction of an EM manipulation structure on obstacles' edge

no code implementations • 3 Nov 2023 • Tianqi Xiang, Zhiwei Jiang, Weijun Hong, Xin Zhang, Yuehong Gao

In this paper, Reconfigurable Intelligent Surface & Edge (RISE) is proposed to extend RIS' abilities of reflection and refraction over surfaces to diffraction around obstacles' edge for better adaptation to specific coverage scenarios.

Paper
Add Code

Image Recognition of Oil Leakage Area Based on Logical Semantic Discrimination

no code implementations • 3 Nov 2023 • Weiying Lin, Che Liu, Xin Zhang, Zhen Wei, Sizhe Li, Xun Ma

The process begins with histogram equalization to enhance the original image, followed by the use of Mask RCNN to identify the preliminary positions and outlines of oil tanks, the ground, and areas of potential oil contamination.

Paper
Add Code

Holistic Evaluation of GPT-4V for Biomedical Imaging

no code implementations • 10 Nov 2023 • Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, Jingyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang, Xinyu Wang, Xu Zhang, Lin Zhao, Yiheng Liu, Kai Zhang, Liheng Yan, Lichao Sun, Jun Liu, Ning Qiang, Bao Ge, Xiaoyan Cai, Shijie Zhao, Xintao Hu, Yixuan Yuan, Gang Li, Shu Zhang, Xin Zhang, Xi Jiang, Tuo Zhang, Dinggang Shen, Quanzheng Li, Wei Liu, Xiang Li, Dajiang Zhu, Tianming Liu

GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain.

Anatomy Image Captioning +1

Paper
Add Code

Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale

no code implementations • 14 Nov 2023 • Wei Wen, Kuang-Hung Liu, Igor Fedorov, Xin Zhang, Hang Yin, Weiwei Chu, Kaveh Hassani, Mengying Sun, Jiang Liu, Xu Wang, Lin Jiang, Yuxin Chen, Buyun Zhang, Xi Liu, Dehua Cheng, Zhengxing Chen, Guang Zhao, Fangqiu Han, Jiyan Yang, Yuchen Hao, Liang Xiong, Wen-Yen Chen

In industry system, such as ranking system in Meta, it is unclear whether NAS algorithms from the literature can outperform production baselines because of: (1) scale - Meta ranking systems serve billions of users, (2) strong baselines - the baselines are production models optimized by hundreds to thousands of world-class engineers for years since the rise of deep learning, (3) dynamic baselines - engineers may have established new and stronger baselines during NAS search, and (4) efficiency - the search pipeline must yield results quickly in alignment with the productionization life cycle.

Neural Architecture Search

Paper
Add Code

K-space Cold Diffusion: Learning to Reconstruct Accelerated MRI without Noise

no code implementations • 16 Nov 2023 • Guoyao Shen, Mengyu Li, Chad W. Farris, Stephan Anderson, Xin Zhang

In this paper, we propose a k-space cold diffusion model that performs image degradation and restoration in k-space without the need for Gaussian noise.

Image Generation MRI Reconstruction +1

Paper
Add Code

AKConv: Convolutional Kernel with Arbitrary Sampled Shapes and Arbitrary Number of Parameters

1 code implementation • 20 Nov 2023 • Xin Zhang, Yingze Song, Tingting Song, Degang Yang, Yichen Ye, Jie zhou, Liming Zhang

In response to the above questions, the Alterable Kernel Convolution (AKConv) is explored in this work, which gives the convolution kernel an arbitrary number of parameters and arbitrary sampled shapes to provide richer options for the trade-off between network overhead and performance.

object-detection Object Detection

Paper
Code

SiGeo: Sub-One-Shot NAS via Information Theory and Geometry of Loss Landscape

no code implementations • 22 Nov 2023 • Hua Zheng, Kuang-Hung Liu, Igor Fedorov, Xin Zhang, Wen-Yen Chen, Wei Wen

Neural Architecture Search (NAS) has become a widely used tool for automating neural network design.

Neural Architecture Search

Paper
Add Code

Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning

1 code implementation • 22 Nov 2023 • Xin Zhang, Jiawei Du, Yunsong Li, Weiying Xie, Joey Tianyi Zhou

Dataset pruning aims to construct a coreset capable of achieving performance comparable to the original, full dataset.

Classification

Paper
Code

CLIPC8: Face liveness detection algorithm based on image-text pairs and contrastive learning

2 code implementations • 29 Nov 2023 • Xu Liu, Shu Zhou, Yurong Song, Wenzhe Luo, Xin Zhang

To tackle this issue, we propose a face liveness detection method based on image-text pairs and contrastive learning, dividing liveness attack problems in the financial field into eight categories and using text information to describe the images of these eight types of attacks.

Contrastive Learning Face Recognition

Paper
Code

Automated interpretation of congenital heart disease from multi-view echocardiograms

no code implementations • 30 Nov 2023 • Jing Wang, Xiaofeng Liu, Fangyun Wang, Lin Zheng, Fengqiao Gao, Hanwen Zhang, Xin Zhang, Wanqing Xie, Binbin Wang

Our video-based model can diagnose with an accuracy of 93. 9\% (binary classification), and 92. 1\% (3-class classification) in a collected 2D video testing set, which does not need key-frame selection and view annotation in testing.

Binary Classification

Paper
Add Code

Physics Inspired Criterion for Pruning-Quantization Joint Learning

1 code implementation • 1 Dec 2023 • Weiying Xie, Xiaoyi Fan, Xin Zhang, Yunsong Li, Jie Lei, Leyuan Fang

Pruning-quantization joint learning always facilitates the deployment of deep neural networks (DNNs) on resource-constrained edge devices.

Image Classification Model Compression +1

Paper
Code

TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-Commerce

1 code implementation • 8 Dec 2023 • Tongxin Hu, Zhuang Li, Xin Jin, Lizhen Qu, Xin Zhang

Annually, e-commerce platforms incur substantial financial losses due to trademark infringements, making it crucial to identify and mitigate potential legal risks tied to merchant information registered to the platforms.

Legal Reasoning

Paper
Code

WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

no code implementations • 20 Dec 2023 • Zhaojian Yu, Xin Zhang, Ning Shang, Yangyu Huang, Can Xu, Yishujie Zhao, Wenxiang Hu, Qiufeng Yin

This paper thus offers a significant contribution to the field of instruction data generation and fine-tuning models, providing new insights and tools for enhancing performance in code-related tasks.

Code Generation

Paper
Add Code

HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping

no code implementations • 29 Dec 2023 • Xin Zhang, Jinheng Xie, Yuan Yuan, Michael Bi Mi, Robby T. Tan

Further, to ensure the distinguishability among various regions, we introduce a region-level contrastive clustering loss to pull closer similar regions across images.

Object Object Discovery +2

Paper
Add Code

Efficient Sparse Least Absolute Deviation Regression with Differential Privacy

no code implementations • 2 Jan 2024 • Weidong Liu, Xiaojun Mao, Xiaofei Zhang, Xin Zhang

To fast solve the non-smooth loss under a given privacy budget, we develop a Fast Robust And Privacy-Preserving Estimation (FRAPPE) algorithm for least absolute deviation regression.

Privacy Preserving regression

Paper
Add Code

Understanding LLMs: A Comprehensive Overview from Training to Inference

no code implementations • 4 Jan 2024 • Yiheng Liu, Hao He, Tianle Han, Xu Zhang, Mengyuan Liu, Jiaming Tian, Yutong Zhang, Jiaqi Wang, Xiaohui Gao, Tianyang Zhong, Yi Pan, Shaochen Xu, Zihao Wu, Zhengliang Liu, Xin Zhang, Shu Zhang, Xintao Hu, Tuo Zhang, Ning Qiang, Tianming Liu, Bao Ge

Low-cost training and deployment of LLMs represent the future development trend.

Language Modelling Large Language Model +2

Paper
Add Code

TRLS: A Time Series Representation Learning Framework via Spectrogram for Medical Signal Processing

no code implementations • 6 Jan 2024 • Luyuan Xie, Cong Li, Xin Zhang, Shengfang Zhai, Yuejian Fang, Qingni Shen, Zhonghai Wu

Representation learning frameworks in unlabeled time series have been proposed for medical signal processing.

Representation Learning Time Series

Paper
Add Code

SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems

1 code implementation • 8 Jan 2024 • Dong Zhang, Zhaowei Li, Pengyu Wang, Xin Zhang, Yaqian Zhou, Xipeng Qiu

In this paper, we propose SpeechAgents, a multi-modal LLM based multi-agent system designed for simulating human communication.

Language Modelling Large Language Model

Paper
Code

Weiss-Weinstein bound of frequency estimation error for very weak GNSS signals

1 code implementation • 11 Jan 2024 • Xin Zhang, Xingqun Zhan, Jihong Huang, Jiahui Liu, Yingchao Xiao

Tightness remains the center quest in all modern estimation bounds.

Paper
Code

UV-SAM: Adapting Segment Anything Model for Urban Village Identification

1 code implementation • 16 Jan 2024 • Xin Zhang, Yu Liu, Yuming Lin, Qingmin Liao, Yong Li

Urban villages, defined as informal residential areas in or around urban centers, are characterized by inadequate infrastructures and poor living conditions, closely related to the Sustainable Development Goals (SDGs) on poverty, adequate housing, and sustainable cities.

Image Classification Semantic Segmentation

Paper
Code

CLIP Model for Images to Textual Prompts Based on Top-k Neighbors

no code implementations • 18 Jan 2024 • Xin Zhang, YeMing Cai, Tianzhi Jia

Text-to-image synthesis, a subfield of multimodal generation, has gained significant attention in recent years.

Image Generation multimodal generation

Paper
Add Code

SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation

1 code implementation • 24 Jan 2024 • Dong Zhang, Xin Zhang, Jun Zhan, ShiMin Li, Yaqian Zhou, Xipeng Qiu

It comprises an autoregressive model based on LLM for semantic information modeling and a non-autoregressive model employing flow matching for perceptual information modeling.

Voice Conversion

889

Paper
Code

ReTaSA: A Nonparametric Functional Estimation Approach for Addressing Continuous Target Shift

no code implementations • 29 Jan 2024 • Hwanwoo Kim, Xin Zhang, Jiwei Zhao, Qinglong Tian

This work focuses on the target shift problem in a regression setting (Zhang et al., 2013; Nguyen et al., 2016).

regression

Paper
Add Code

Partially Recentralization Softmax Loss for Vision-Language Models Robustness

no code implementations • 6 Feb 2024 • Hao Wang, Xin Zhang, JinZhe Jiang, YaQian Zhao, Chen Li

However, it has been shown that multimodal NLP are vulnerable to adversarial attacks, where the outputs of a model can be dramatically changed by a perturbation to the input.

Adversarial Robustness

Paper
Add Code

ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization

1 code implementation • 14 Feb 2024 • Feifan Song, Yuxuan Fan, Xin Zhang, Peiyi Wang, Houfeng Wang

Large Language Models (LLMs) rely on Human Preference Alignment (HPA) to ensure the generation of safe content.

In-Context Learning

Paper
Code

Dynamic Patch-aware Enrichment Transformer for Occluded Person Re-Identification

no code implementations • 16 Feb 2024 • Xin Zhang, Keren Fu, Qijun Zhao

To facilitate the seamless integration of global classification features with the finely detailed local features selected by DPSM, we introduce a novel feature blending module (FBM).

Contrastive Learning Person Re-Identification

Paper
Add Code

Knowledge Graph Assisted Automatic Sports News Writing

no code implementations • 17 Feb 2024 • Yang Cao, Xinyi Chen, Xin Zhang, Siying Li

In this paper, we present a novel method for automatically generating sports news, which employs a unique algorithm that extracts pivotal moments from live text broadcasts and uses them to create an initial draft of the news.

Knowledge Graph Completion

Paper
Add Code

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

1 code implementation • 19 Feb 2024 • Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, Zhigeng Liu, Xin Zhang, Ruibin Yuan, Ge Zhang, Linyang Li, Hang Yan, Jie Fu, Tao Gui, Tianxiang Sun, Yugang Jiang, Xipeng Qiu

We introduce AnyGPT, an any-to-any multimodal language model that utilizes discrete representations for the unified processing of various modalities, including speech, text, images, and music.

Language Modelling Large Language Model

419

Paper
Code

A Literature Review of Literature Reviews in Pattern Analysis and Machine Intelligence

no code implementations • 20 Feb 2024 • Penghai Zhao, Xin Zhang, Ming-Ming Cheng, Jian Yang, Xiang Li

To improve efficiency, this paper aims to provide a thorough review of reviews in the PAMI field from diverse perspectives.

Language Modelling Large Language Model

Paper
Add Code

Explicit Motion Handling and Interactive Prompting for Video Camouflaged Object Detection

no code implementations • 4 Mar 2024 • Xin Zhang, Tao Xiao, GePeng Ji, Xuan Wu, Keren Fu, Qijun Zhao

The prompt fed to the motion stream is learned by supervising optical flow in a self-supervised manner.

Motion Estimation object-detection +3

Paper
Add Code

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models

1 code implementation • 5 Mar 2024 • Zheng Li, Xiang Li, Xinyi Fu, Xin Zhang, Weiqiang Wang, Shuo Chen, Jian Yang

To our best knowledge, we are the first to (1) perform unsupervised domain-specific prompt-driven knowledge distillation for CLIP, and (2) establish a practical pre-storing mechanism of text features as shared class vectors between teacher and student.

Ranked #1 on Prompt Engineering on Oxford-IIIT Pet Dataset

Knowledge Distillation Prompt Engineering +1

Paper
Code

ArgMed-Agents: Explainable Clinical Decision Reasoning with Large Language Models via Argumentation Schemes

no code implementations • 10 Mar 2024 • Shengxin Hong, Liang Xiao, Xin Zhang, Jianxia Chen

There are two main barriers to using large language models (LLMs) in clinical reasoning.

Paper
Add Code

Fine-grainedly Synthesize Streaming Data Based On Large Language Models With Graph Structure Understanding For Data Sparsity

no code implementations • 10 Mar 2024 • Xin Zhang, Linhai Zhang, Deyu Zhou, Guoqiang Xu

Due to the sparsity of user data, sentiment analysis on user reviews in e-commerce platforms often suffers from poor performance, especially when faced with extremely sparse user data or long-tail labels.

Attribute Sentiment Analysis

Paper
Add Code

Two-sided Acoustic Metascreen for Broadband and Individual Reflection and Transmission Control

no code implementations • 12 Mar 2024 • Ao Chen, Xin Zhang

Acoustic wave modulation plays a pivotal role in various applications, including sound-field reconstruction, wireless communication, and particle manipulation, among others.

Paper
Add Code

Accurate and Data-Efficient Micro-XRD Phase Identification Using Multi-Task Learning: Application to Hydrothermal Fluids

no code implementations • 15 Mar 2024 • Yanfei Li, Juejing Liu, Xiaodong Zhao, Wenjun Liu, Tong Geng, Ang Li, Xin Zhang

Traditional analysis of highly distorted micro-X-ray diffraction ({\mu}-XRD) patterns from hydrothermal fluid environments is a time-consuming process, often requiring substantial data preprocessing and labeled experimental data.

Binary Classification Multi-Task Learning

Paper
Add Code

PE-GPT: A Physics-Informed Interactive Large Language Model for Power Converter Modulation Design

no code implementations • 21 Mar 2024 • Fanfan Lin, Junhua Liu, Xinze Li, Shuai Zhao, Bohui Zhao, Hao Ma, Xin Zhang

This paper proposes PE-GPT, a custom-tailored large language model uniquely adapted for power converter modulation design.

In-Context Learning Language Modelling +1

Paper
Add Code

GCN-DevLSTM: Path Development for Skeleton-Based Action Recognition

1 code implementation • 22 Mar 2024 • Lei Jiang, Weixin Yang, Xin Zhang, Hao Ni

Skeleton-based action recognition (SAR) in videos is an important but challenging task in computer vision.

Action Recognition Dimensionality Reduction +1

Paper
Code

LLMs Instruct LLMs:An Extraction and Editing Method

no code implementations • 23 Mar 2024 • Xin Zhang, Tianjie Ju, Huijia Liang, Ying Fu, Qin Zhang

The interest in updating Large Language Models (LLMs) without retraining from scratch is substantial, yet it comes with some challenges. This is especially true for situations demanding complex reasoning with limited samples, a scenario we refer to as the Paucity-Constrained Complex Reasoning Adaptation for LLMs (PCRA-LLM). Traditional methods like Low-Rank Adaptation (LoRA) and Retrieval-Augmented Generation (RAG) are inadequate for this critical issue, particularly evident in our exploration of a specific medical context that epitomize the PCRA-LLM's distinct needs. To address the issue, we propose a Sequential Fusion method to incorporate knowledge from complex context into LLMs.

Knowledge Graphs Question Answering

Paper
Add Code

CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

no code implementations • 2 Apr 2024 • Xu Li, Ruiqi Sun, Jiameng Lv, Peng Jia, Nan Li, Chengliang Wei, Zou Hu, Xinzhong Er, Yun Chen, Zhang Ban, Yuedong Fang, Qi Guo, Dezi Liu, Guoliang Li, Lin Lin, Ming Li, Ran Li, Xiaobo Li, Yu Luo, Xianmin Meng, Jundan Nie, Zhaoxiang Qi, Yisheng Qiu, Li Shao, Hao Tian, Lei Wang, Wei Wang, Jingtian Xian, Youhua Xu, Tianmeng Zhang, Xin Zhang, Zhimin Zhou

To overcome these challenges, we have developed a framework based on a hierarchical visual Transformer with a sliding window technique to search for strong lensing systems within entire images.

Paper
Add Code

DSGNN: A Dual-View Supergrid-Aware Graph Neural Network for Regional Air Quality Estimation

no code implementations • 2 Apr 2024 • Xin Zhang, Ling Chen, Xing Tang, Hongyu Shi

To this end, we propose a Dual-view Supergrid-aware Graph Neural Network (DSGNN) for regional air quality estimation, which can model the spatial dependencies of distant grid regions from dual views (i. e., satellite-derived aerosol optical depth (AOD) and meteorology).

Paper
Add Code

SpeechAlign: Aligning Speech Generation to Human Preferences

2 code implementations • 8 Apr 2024 • Dong Zhang, Zhaowei Li, ShiMin Li, Xin Zhang, Pengyu Wang, Yaqian Zhou, Xipeng Qiu

However, the integration of human feedback to align speech outputs to human preferences is often neglected.

Language Modelling

889

Paper
Code

Sociolectal Analysis of Pretrained Language Models

no code implementations • EMNLP 2021 • Sheng Zhang, Xin Zhang, Weiming Zhang, Anders Søgaard

Using data from English cloze tests, in which subjects also self-reported their gender, age, education, and race, we examine performance differences of pretrained language models across demographic groups, defined by these (protected) attributes.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.