Search Results for author: Xin Chen

Found 205 papers, 72 papers with code

基于人物特征增强的拟人句要素抽取方法研究(Research on Element Extraction of Personified Sentences Based on Enhanced Characters)

no code implementations CCL 2021 Jing Li, Suge Wang, Xin Chen, Dian Wang

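“In appreciation-type questions in prose reading comprehension, the analysis of personified sentences is tested fairly often. Existing work identifies and extracts only the tenor elements of personified sentences, so element extraction is incomplete; in particular, when a sentence contains multiple tenors, the correspondence between the personifying words and each tenor must be determined. To address these problems, this paper proposes an element extraction method for personified sentences based on character feature enhancement. The method uses domain-specific features to enhance the vector representation of the sentence, and then uses a conditional random field model to identify the tenor and personifying-word elements in the sentence. On this basis, a self-attention mechanism is used to detect the relations between elements, and an element synchronization mechanism and a relation synchronization mechanism are used to exchange information and update the inputs to element recognition and relation detection. Comparative experiments on <tenor, personifying word> extraction on a self-built personification dataset show that the proposed model outperforms the other compared models.”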

Automatic Knowledge Graph Construction for Judicial Cases

no code implementations 15 Apr 2024 Jie zhou, Xin Chen, Hang Zhang, Zhe Li

Building on these results, we detail the automatic construction process of case knowledge graphs for judicial cases, enabling the assembly of knowledge graphs for hundreds of thousands of judgments.

graph construction Knowledge Graphs

Enhance Low-Carbon Power System Operation via Carbon-Aware Demand Response

no code implementations 8 Apr 2024 Xin Chen

As the electrification process advances, enormous power flexibility is becoming available on the demand side, which can be harnessed to facilitate power system decarbonization.

Scheduling

LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking

no code implementations 8 Apr 2024 Faren Yan, Peng Yu, Xin Chen

The use of LLMs for natural language processing has become a popular trend in the past two years, driven by their formidable capacity for context comprehension and learning, which has inspired a wave of research from academics and industry professionals.

Language Modelling Large Language Model +3

CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data

no code implementations 7 Apr 2024 Wei Fang, Yuxing Tang, Heng Guo, Mingze Yuan, Tony C. W. Mok, Ke Yan, Jiawen Yao, Xin Chen, Zaiyi Liu, Le Lu, Ling Zhang, Minfeng Xu

In the realm of medical 3D data, such as CT and MRI images, prevalent anisotropic resolution is characterized by high intra-slice but diminished inter-slice resolution.

Super-Resolution

GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU

1 code implementation 3 Apr 2024 Zhongming Yu, Genghan Zhang, Hanxian Huang, Xin Chen, Jishen Zhao

Yet, efficient tensor-centric frameworks for GNNs remain scarce due to unique challenges and limitations encountered when implementing segment reduction in GNN contexts.

GNSS Spoofing Detection by Crowdsourcing Double Differential Pseudorange Spatial Distribution

no code implementations 3 Apr 2024 Xin Chen, Kai Wang

It is widely known that spoofing is a major threat that adversely impacts the reliability and accuracy of GNSS applications.

MotionChain: Conversational Motion Controllers via Multimodal Prompts

1 code implementation 2 Apr 2024 Biao Jiang, Xin Chen, Chi Zhang, Fukun Yin, Zhuoyuan Li, Gang Yu, Jiayuan Fan

However, this proficiency remains largely unexplored in other multimodal generative models, particularly in human motion models.

Language Modelling

Improving Out-of-Vocabulary Handling in Recommendation Systems

no code implementations 27 Mar 2024 William Shiao, Mingxuan Ju, Zhichun Guo, Xin Chen, Evangelos Papalexakis, Tong Zhao, Neil Shah, Yozen Liu

This work focuses on a complementary problem: recommending new users and items unseen (out-of-vocabulary, or OOV) at training time.

Recommendation Systems

Enhanced Generative Recommendation via Content and Collaboration Integration

no code implementations 27 Mar 2024 Yidan Wang, Zhaochun Ren, Weiwei Sun, Jiyuan Yang, Zhixiang Liang, Xin Chen, Ruobing Xie, Su Yan, Xu Zhang, Pengjie Ren, Zhumin Chen, Xin Xin

However, existing generative recommendation approaches still encounter challenges in (i) effectively integrating user-item collaborative signals and item content information within a unified generative framework, and (ii) executing an efficient alignment between content information and collaborative signals.

Collaborative Filtering Language Modelling +1

InternLM2 Technical Report

1 code implementation 26 Mar 2024 Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, FuKai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Chao Xu, Ruiliang Xu, Hang Yan, Yirong Yan, Xiaogui Yang, Haochen Ye, Huaiyuan Ying, JIA YU, Jing Yu, Yuhang Zang, Chuyu Zhang, Li Zhang, Pan Zhang, Peng Zhang, Ruijie Zhang, Shuo Zhang, Songyang Zhang, Wenjian Zhang, Wenwei Zhang, Xingcheng Zhang, Xinyue Zhang, Hui Zhao, Qian Zhao, Xiaomeng Zhao, Fengzhe Zhou, Zaida Zhou, Jingming Zhuo, Yicheng Zou, Xipeng Qiu, Yu Qiao, Dahua Lin

The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI).

4k Long-Context Understanding

Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making

no code implementations 22 Mar 2024 Xin Chen, I-Hong Hou

This paper introduces a novel multi-armed bandits framework, termed Contextual Restless Bandits (CRB), for complex online decision-making.

Decision Making Multi-Armed Bandits

Generative Motion Stylization within Canonical Motion Space

no code implementations 18 Mar 2024 Jiaxu Zhang, Xin Chen, Gang Yu, Zhigang Tu

Our key insight is to embed motion style into a cross-modality latent space and perceive the cross-structure skeleton topologies, allowing for motion stylization within a canonical motion space.

Motion Synthesis

Learning to Maximize Mutual Information for Chain-of-Thought Distillation

no code implementations 5 Mar 2024 Xin Chen, Hanxian Huang, Yanjun Gao, Yi Wang, Jishen Zhao, Ke Ding

Knowledge distillation, the technique of transferring knowledge from large, complex models to smaller ones, marks a pivotal step towards efficient AI deployment.

Knowledge Distillation Language Modelling +1

Enhancing Power Prediction of Photovoltaic Systems: Leveraging Dynamic Physical Model for Irradiance-to-Power Conversion

1 code implementation 19 Feb 2024 Baojie Li, Xin Chen, Anubhav Jain

This dynamic model, periodically-updated (as short as daily), can closely capture the actual health status, enabling precise power estimation.

Digital Twin Mobility Profiling: A Spatio-Temporal Graph Learning Approach

1 code implementation 6 Feb 2024 Xin Chen, Mingliang Hou, Tao Tang, Achhardeep Kaur, Feng Xia

With the arrival of the big data era, mobility profiling has become a viable method of utilizing enormous amounts of mobility data to create an intelligent transportation system.

Graph Learning Management

Tensor Completion via Integer Optimization

no code implementations 6 Feb 2024 Xin Chen, Sukanya Kudva, Yongzheng Dai, Anil Aswani, Chen Chen

The main challenge with the tensor completion problem is a fundamental tension between computation power and the information-theoretic sample complexity rate.

Triple Disentangled Representation Learning for Multimodal Affective Analysis

no code implementations 29 Jan 2024 Ying Zhou, Xuefeng Liang, Han Chen, Yin Zhao, Xin Chen, Lida Yu

We revisit the disentanglement issue, and propose a novel triple disentanglement approach, TriDiRA, which disentangles the modality-invariant, effective modality-specific and ineffective modality-specific representations from input data.

Disentanglement

LocMoE: A Low-overhead MoE for Large Language Model Training

no code implementations 25 Jan 2024 Jing Li, Zhijie Sun, Xuan He, Li Zeng, Yi Lin, Entong Li, Binfan Zheng, Rongqian Zhao, Xin Chen

However, the performance of MoE is limited by load imbalance and high latency of All-To-All communication, along with relatively redundant computation owing to large expert capacity.

Language Modelling Large Language Model

$M^{2}$Fusion: Bayesian-based Multimodal Multi-level Fusion on Colorectal Cancer Microsatellite Instability Prediction

no code implementations 15 Jan 2024 Quan Liu, Jiawen Yao, Lisha Yao, Xin Chen, Jingren Zhou, Le Lu, Ling Zhang, Zaiyi Liu, Yuankai Huo

The contribution of the paper is three-fold: (1) $M^{2}$Fusion is the first pipeline of multi-level fusion on pathology WSI and 3D radiology CT images for MSI prediction; (2) CT images are integrated into multimodal fusion for CRC MSI prediction for the first time; (3) a feature-level fusion strategy is evaluated on both Transformer-based and CNN-based methods.

Representation Learning Weakly-supervised Learning +1

Plug-in Diffusion Model for Sequential Recommendation

1 code implementation 5 Jan 2024 Haokai Ma, Ruobing Xie, Lei Meng, Xin Chen, Xu Zhang, Leyu Lin, Zhanhui Kang

To address this issue, this paper presents a novel Plug-in Diffusion Model for Recommendation (PDRec) framework, which employs the diffusion model as a flexible plugin to jointly take full advantage of the diffusion-generating user preferences on all items.

Image Generation Model Optimization +1

Stochastic Gradient Descent for Additive Nonparametric Regression

no code implementations 1 Jan 2024 Xin Chen, Jason M. Klusowski

This paper introduces an iterative algorithm for training additive models that enjoys favorable memory storage and computational requirements.

Additive models regression

AppAgent: Multimodal Agents as Smartphone Users

no code implementations 21 Dec 2023 Chi Zhang, Zhao Yang, Jiaxuan Liu, Yucheng Han, Xin Chen, Zebiao Huang, Bin Fu, Gang Yu

Recent advancements in large language models (LLMs) have led to the creation of intelligent agents capable of performing complex tasks.

Navigate

Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models

1 code implementation 21 Dec 2023 Xianfang Zeng, Xin Chen, Zhongqi Qi, Wen Liu, Zibo Zhao, Zhibin Wang, Bin Fu, Yong liu, Gang Yu

This paper presents Paint3D, a novel coarse-to-fine generative framework that is capable of producing high-resolution, lighting-less, and diverse 2K UV texture maps for untextured 3D meshes conditioned on text or image inputs.

2k

DoDo-Code: a Deep Levenshtein Distance Embedding-based Code for IDS Channel and DNA Storage

no code implementations 20 Dec 2023 Alan J. X. Guo, Sihan Sun, Xiang Wei, Mengyi Wei, Xin Chen

In this paper, we propose an innovative approach that utilizes deep Levenshtein distance embedding to bypass these mathematical challenges.

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts

1 code implementation 17 Dec 2023 Mingsheng Li, Xin Chen, Chi Zhang, Sijin Chen, Hongyuan Zhu, Fukun Yin, Gang Yu, Tao Chen

Furthermore, we establish a new benchmark for assessing the performance of large models in understanding multi-modal 3D prompts.

Instruction Following

OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers

no code implementations 14 Dec 2023 Han Liang, Jiacheng Bao, Ruichi Zhang, Sihan Ren, Yuecheng Xu, Sibei Yang, Xin Chen, Jingyi Yu, Lan Xu

At the subsequent fine-tuning stage, we introduce motion ControlNet, which incorporates text prompts as conditioning information, through a trainable copy of the pre-trained model and the proposed novel Mixture-of-Controllers (MoC) block.

HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models

no code implementations 8 Dec 2023 Pei Lin, Sihang Xu, Hongdi Yang, Yiran Liu, Xin Chen, Jingya Wang, Jingyi Yu, Lan Xu

We further present a strong baseline method HandDiffuse for the controllable motion generation of interacting hands using various controllers.

Data Augmentation Temporal Sequences

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

1 code implementation 30 Nov 2023 Sijin Chen, Xin Chen, Chi Zhang, Mingsheng Li, Gang Yu, Hao Fei, Hongyuan Zhu, Jiayuan Fan, Tao Chen

However, developing LMMs that can comprehend, reason, and plan in complex and diverse 3D environments remains a challenging topic, especially considering the demand for understanding permutation-invariant point cloud 3D representations of the 3D scene.

3D dense captioning Dense Captioning +1

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model

1 code implementation 29 Nov 2023 Fukun Yin, Xin Chen, Chi Zhang, Biao Jiang, Zibo Zhao, Jiayuan Fan, Gang Yu, Taihao Li, Tao Chen

The advent of large language models, enabling flexibility through instruction-driven approaches, has revolutionized many traditional generative tasks, but large models for 3D data, particularly in comprehensively handling 3D shapes with other modalities, are still under-explored.

3D Shape Generation Language Modelling +1

ChartLlama: A Multimodal LLM for Chart Understanding and Generation

no code implementations 27 Nov 2023 Yucheng Han, Chi Zhang, Xin Chen, Xu Yang, Zhibin Wang, Gang Yu, Bin Fu, Hanwang Zhang

Next, we introduce ChartLlama, a multi-modal large language model that we've trained using our created dataset.

Language Modelling Large Language Model

VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations

no code implementations 23 Oct 2023 Yiying Yang, Wen Liu, Fukun Yin, Xin Chen, Gang Yu, Jiayuan Fan, Tao Chen

Recent advancements in implicit neural representations have contributed to high-fidelity surface reconstruction and photorealistic novel view synthesis.

Novel View Synthesis Quantization +1

TapMo: Shape-aware Motion Generation of Skeleton-free Characters

no code implementations 19 Oct 2023 Jiaxu Zhang, Shaoli Huang, Zhigang Tu, Xin Chen, Xiaohang Zhan, Gang Yu, Ying Shan

In this work, we present TapMo, a Text-driven Animation Pipeline for synthesizing Motion in a broad spectrum of skeleton-free 3D characters.

To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now

1 code implementation 18 Oct 2023 Yimeng Zhang, Jinghan Jia, Xin Chen, Aochuan Chen, Yihua Zhang, Jiancheng Liu, Ke Ding, Sijia Liu

Our results demonstrate the effectiveness and efficiency merits of UnlearnDiffAtk over the state-of-the-art adversarial prompt generation method and reveal the lack of robustness of current safety-driven unlearning techniques when applied to DMs.

Adversarial Robustness Benchmarking +1

Specializing Small Language Models towards Complex Style Transfer via Latent Attribute Pre-Training

1 code implementation 19 Sep 2023 Ruiqi Xu, Yongfeng Huang, Xin Chen, Lin Zhang

In this work, we introduce the concept of complex text style transfer tasks and construct complex text datasets based on two widely applicable scenarios.

Attribute Contrastive Learning +2

Error Reduction from Stacked Regressions

no code implementations 18 Sep 2023 Xin Chen, Jason M. Klusowski, Yan Shuo Tan

In this paper, we learn these weights analogously by minimizing an estimate of the population risk subject to a nonnegativity constraint.

Model Selection regression

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

1 code implementation 6 Sep 2023 Sijin Chen, Hongyuan Zhu, Mingsheng Li, Xin Chen, Peng Guo, Yinjie Lei, Gang Yu, Taihao Li, Tao Chen

Moreover, we argue that object localization and description generation require different levels of scene understanding, which could be challenging for a shared set of queries to capture.

3D dense captioning Caption Generation +4

Unveiling Causalities in SAR ATR: A Causal Interventional Approach for Limited Data

no code implementations 18 Aug 2023 Chenwei Wang, Xin Chen, You Qin, Siyi Luo, Yulin Huang, Jifang Pei, Jianyu Yang

Then, a feature discrimination approach with hybrid similarity measurement is introduced to measure and mitigate the structural and vector angle impacts of varying imaging conditions on the extracted features from SAR images.

Causal Inference Data Augmentation

Towards Carbon-Free Electricity: A Flow-Based Framework for Power Grid Carbon Accounting and Decarbonization

no code implementations 7 Aug 2023 Xin Chen, Hungpo Chao, Wenbo Shi, Na Li

This paper introduces a comprehensive framework aimed at advancing research and policy development in the realm of decarbonization within electric power systems.

Decision Making Fairness

Deep Reinforcement Learning-Based Battery Conditioning Hierarchical V2G Coordination for Multi-Stakeholder Benefits

no code implementations 1 Aug 2023 Yubao Zhang, Xin Chen, Yi Gu, Zhicheng Li, Wu Kai

On the grid side, load fluctuations and renewable energy consumption are considered, while on the EVA side, energy constraints and charging costs are considered.

Scheduling

Parse and Recall: Towards Accurate Lung Nodule Malignancy Prediction like Radiologists

no code implementations 20 Jul 2023 Jianpeng Zhang, Xianghua Ye, Jianfeng Zhang, Yuxing Tang, Minfeng Xu, Jianfei Guo, Xin Chen, Zaiyi Liu, Jingren Zhou, Le Lu, Ling Zhang

In this paper, we propose a radiologist-inspired method to simulate the diagnostic process of radiologists, which is composed of context parsing and prototype recalling modules.

Decision Making

Cluster-Induced Mask Transformers for Effective Opportunistic Gastric Cancer Screening on Non-contrast CT Scans

no code implementations 10 Jul 2023 Mingze Yuan, Yingda Xia, Xin Chen, Jiawen Yao, Junli Wang, Mingyan Qiu, Hexin Dong, Jingren Zhou, Bin Dong, Le Lu, Li Zhang, Zaiyi Liu, Ling Zhang

In our experiments, the proposed method achieves a sensitivity of 85.0% and specificity of 92.6% for detecting gastric tumors on a hold-out test set consisting of 100 patients with cancer and 148 normal.

Specificity

Learning from Heterogeneity: A Dynamic Learning Framework for Hypergraphs

no code implementations 7 Jul 2023 Tiehua Zhang, Yuze Liu, Zhishu Shen, Xingjun Ma, Xin Chen, Xiaowei Huang, Jun Yin, Jiong Jin

Graph neural network (GNN) has gained increasing popularity in recent years owing to its capability and flexibility in modeling complex graph structure data.

Graph Learning Link Prediction +1

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

1 code implementation NeurIPS 2023 Zibo Zhao, Wen Liu, Xin Chen, Xianfang Zeng, Rui Wang, Pei Cheng, Bin Fu, Tao Chen, Gang Yu, Shenghua Gao

We present a novel alignment-before-generation approach to tackle the challenging task of generating general 3D shapes based on 2D images or texts.

3D Shape Generation

MotionGPT: Human Motion as a Foreign Language

2 code implementations NeurIPS 2023 Biao Jiang, Xin Chen, Wen Liu, Jingyi Yu, Gang Yu, Tao Chen

Building upon this "motion vocabulary", we perform language modeling on both motion and text in a unified manner, treating human motion as a specific language.

Language Modelling Motion Captioning +2

Higher-order Motif-based Time Series Classification for Forced Oscillation Source Location in Power Grids

no code implementations 23 Jun 2023 Long Huo, Xin Chen

A MECF-based unsupervised learning approach is applied in locating the source of the forced oscillation (FO), a periodic disturbance that detrimentally impacts power grids.

Time Series Time Series Classification

Coupled Attention Networks for Multivariate Time Series Anomaly Detection

no code implementations 12 Jun 2023 Feng Xia, Xin Chen, Shuo Yu, Mingliang Hou, Mujie Liu, Linlin You

To address this issue, we propose a coupled attention-based neural network framework (CAN) for anomaly detection in multivariate time series data featuring dynamic variable relationships.

Anomaly Detection Graph Attention +4

Differentially private sliced inverse regression in the federated paradigm

no code implementations 10 Jun 2023 Shuaida He, Jiarui Zhang, Xin Chen

Sliced inverse regression (SIR), which includes linear discriminant analysis (LDA) as a special case, is a popular and powerful dimension reduction tool.

Dimensionality Reduction regression

Fast global convergence of gradient descent for low-rank matrix approximation

no code implementations 30 May 2023 Hengchao Chen, Xin Chen, Mohamad Elmasri, Qiang Sun

This paper investigates gradient descent for solving low-rank matrix approximation problems.

Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided Relation Alignment and Adaptation

no code implementations 18 May 2023 Yuan Zhou, Xin Chen, Yanrong Guo, Shijie Hao, Richang Hong, Qi Tian

Incremental few-shot semantic segmentation (IFSS) aims to incrementally extend a semantic segmentation model to novel classes according to only a few pixel-level annotated data, while preserving its segmentation capability on previously learned base categories.

Few-Shot Semantic Segmentation Incremental Learning +3

Collective Large-scale Wind Farm Multivariate Power Output Control Based on Hierarchical Communication Multi-Agent Proximal Policy Optimization

no code implementations 17 May 2023 Yubao Zhang, Xin Chen, Sumei Gong, Haojie Chen

Simulation results demonstrate that the proposed multivariate HCMAPPO can significantly increase wind farm power output compared to the traditional PID control, coordinated model-based predictive control, and multi-agent deep deterministic policy gradient algorithm.

An Object SLAM Framework for Association, Mapping, and High-Level Tasks

no code implementations 12 May 2023 Yanmin Wu, Yunzhou Zhang, Delong Zhu, Zhiqiang Deng, Wenkai Sun, Xin Chen, Jian Zhang

Taking into consideration the semantic invariance of objects, we convert the object map to a topological map to provide semantic descriptors to enable multi-map matching.

Decision Making Object +2

Semi-supervised Road Updating Network (SRUNet): A Deep Learning Method for Road Updating from Remote Sensing Imagery and Historical Vector Maps

no code implementations 28 Apr 2023 Xin Chen, Anzhu Yu, Qun Sun, Wenyue Guo, Qing Xu, Bowei Wen

However, obtaining bi-phase images for the same area is difficult, and complex post-processing methods are required to update the existing databases. To solve these problems, we proposed a road detection method based on semi-supervised learning (SRUNet) specifically for road-updating applications; in this approach, historical road information was fused with the latest images to directly obtain the latest state of the road. Considering that the texture of a road is complex, a multi-branch network named the Map Encoding Branch (MEB) was proposed for representation learning, where the Boundary Enhancement Module (BEM) was used to improve the accuracy of boundary prediction, and the Residual Refinement Module (RRM) was used to optimize the prediction results.

Representation Learning

Fulfilling Formal Specifications ASAP by Model-free Reinforcement Learning

no code implementations 25 Apr 2023 Mengyu Liu, Pengyuan Lu, Xin Chen, Fanxin Kong, Oleg Sokolsky, Insup Lee

We propose a model-free reinforcement learning solution, namely the ASAP-Phi framework, to encourage an agent to fulfill a formal specification ASAP.

reinforcement-learning

Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism

no code implementations 22 Apr 2023 Xin Chen, Hengheng Zhang, Xiaotao Gu, Kaifeng Bi, Lingxi Xie, Qi Tian

The Mixture of Experts (MoE) model has become an important choice for large language models because of its scalability, with sublinear computational complexity for training and inference.

Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?

no code implementations 20 Apr 2023 Haoyang Peng, Baopu Li, Bo Zhang, Xin Chen, Tao Chen, Hongyuan Zhu

Then, a novel multi-view prompt fusion module is developed to effectively fuse information from different views to bridge the gap between 3D point cloud data and 2D pre-trained models.

Autonomous Driving Classification +3

PEGA: Personality-Guided Preference Aggregator for Ephemeral Group Recommendation

no code implementations 18 Apr 2023 Guangze Ye, Wen Wu, Liye Shi, Wenxin Hu, Xin Chen, Liang He

The role of personality in our approach is twofold: (1) To estimate individual users' importance in a group and provide explainability; (2) to alleviate the data sparsity issue that occurred in ephemeral groups.

Triple Sequence Learning for Cross-domain Recommendation

no code implementations 11 Apr 2023 Haokai Ma, Ruobing Xie, Lei Meng, Xin Chen, Xu Zhang, Leyu Lin, Jie zhou

To address this issue, we present a novel framework, termed triple sequence learning for cross-domain recommendation (Tri-CDR), which jointly models the source, target, and mixed behavior sequences to highlight the global and target preference and precisely model the triple correlation in CDR.

Contrastive Learning

POLAR-Express: Efficient and Precise Formal Reachability Analysis of Neural-Network Controlled Systems

1 code implementation 31 Mar 2023 YiXuan Wang, Weichao Zhou, Jiameng Fan, Zhilu Wang, Jiajun Li, Xin Chen, Chao Huang, Wenchao Li, Qi Zhu

We also present a novel approach to propagate TMs more efficiently and precisely across ReLU activation functions.

Visual Prompt Multi-Modal Tracking

1 code implementation CVPR 2023 Jiawen Zhu, Simiao Lai, Xin Chen, Dong Wang, Huchuan Lu

To inherit the powerful representations of the foundation model, a natural modus operandi for multi-modal tracking is full fine-tuning on the RGB-based parameters.

Object Tracking Rgb-T Tracking

Transferable Deep Learning Power System Short-Term Voltage Stability Assessment with Physics-Informed Topological Feature Engineering

no code implementations 13 Mar 2023 Zijian Feng, Xin Chen, Zijian Lv, Peiyuan Sun, Kai Wu

In particular, the highest accuracy reaches 99.68% in evaluation, which demonstrates a good knowledge transfer ability of the proposed model for power grid topology change.

Feature Engineering Transfer Learning

Text-Visual Prompting for Efficient 2D Temporal Video Grounding

1 code implementation CVPR 2023 Yimeng Zhang, Xin Chen, Jinghan Jia, Sijia Liu, Ke Ding

In this paper, we study the problem of temporal video grounding (TVG), which aims to predict the starting/ending time points of moments described by a text sentence within a long untrimmed video.

Sentence Video Grounding +1

Disentangled Causal Embedding With Contrastive Learning For Recommender System

1 code implementation 7 Feb 2023 Weiqi Zhao, Dian Tang, Xin Chen, Dawei Lv, Daoli Ou, Biao Li, Peng Jiang, Kun Gai

Most previous studies neglect users' conformity and entangle interest with it, which may cause recommender systems to fail to provide satisfying results.

Contrastive Learning Recommendation Systems

Sketched Ridgeless Linear Regression: The Role of Downsampling

1 code implementation 2 Feb 2023 Xin Chen, Yicheng Zeng, Siyue Yang, Qiang Sun

We identify the optimal sketching size that minimizes out-of-sample prediction risks and demonstrate that the optimally sketched estimator exhibits stabler risk curves, eliminating the peaks of those for the full-sample estimator.

regression

End-to-End 3D Dense Captioning with Vote2Cap-DETR

1 code implementation CVPR 2023 Sijin Chen, Hongyuan Zhu, Xin Chen, Yinjie Lei, Tao Chen, Gang Yu

Compared with prior arts, our framework has several appealing advantages: 1) Without resorting to numerous hand-crafted components, our method is based on a full transformer encoder-decoder architecture with a learnable vote query driven object decoder, and a caption decoder that produces the dense captions in a set-prediction manner.

3D dense captioning Dense Captioning +1

Fan-Beam Binarization Difference Projection (FB-BDP): A Novel Local Object Descriptor for Fine-Grained Leaf Image Retrieval

no code implementations ICCV 2023 Xin Chen, Bin Wang, Yongsheng Gao

Fine-grained leaf image retrieval (FGLIR) aims to search for similar leaf images at the subspecies level, which involves very high interclass visual similarity and accordingly poses great challenges to leaf image description.

Binarization Image Retrieval +1

Executing your Commands via Motion Diffusion in Latent Space

1 code implementation CVPR 2023 Xin Chen, Biao Jiang, Wen Liu, Zilong Huang, Bin Fu, Tao Chen, Jingyi Yu, Gang Yu

We study a challenging task, conditional human motion generation, which produces plausible human motion sequences according to various conditional inputs, such as action classes or textual descriptors.

Motion Synthesis

An Improved End-to-End Multi-Target Tracking Method Based on Transformer Self-Attention

no code implementations 11 Nov 2022 Yong Hong, Deren Li, Shupei Luo, Xin Chen, Yi Yang, Mi Wang

This study proposes an improved end-to-end multi-target tracking algorithm that adapts to multi-view multi-scale scenes based on the self-attentive mechanism of the transformer's encoder-decoder structure.

Multiple Object Tracking

Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast

3 code implementations 3 Nov 2022 Kaifeng Bi, Lingxi Xie, Hengheng Zhang, Xin Chen, Xiaotao Gu, Qi Tian

In this paper, we present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast.

Towards Relation-centered Pooling and Convolution for Heterogeneous Graph Learning Networks

1 code implementation 31 Oct 2022 Tiehua Zhang, Yuze Liu, Yao Yao, Youhua Xia, Xin Chen, Xiaowei Huang, Jiong Jin

Heterogeneous graph neural network has unleashed great potential on graph representation learning and shown superior performance on downstream tasks such as node classification and clustering.

Graph Learning Graph Representation Learning +2

2D and 3D CT Radiomic Features Performance Comparison in Characterization of Gastric Cancer: A Multi-center Study

no code implementations 29 Oct 2022 Lingwei Meng, Di Dong, Xin Chen, Mengjie Fang, Rongpin Wang, Jing Li, Zaiyi Liu, Jie Tian

We comprehensively compared 2D and 3D radiomic features' representation and discrimination capacity regarding GC, via three tasks.

feature selection

Learning Variational Motion Prior for Video-based Motion Capture

no code implementations 27 Oct 2022 Xin Chen, Zhuo Su, Lingbo Yang, Pei Cheng, Lan Xu, Bin Fu, Gang Yu

To improve the generalization capacity of prior space, we propose a transformer-based variational autoencoder pretrained over marker-based 3D mocap data, with a novel style-mapping block to boost the generation quality.

Pose Estimation

DIICAN: Dual Time-scale State-Coupled Co-estimation of SOC, SOH and RUL for Lithium-Ion Batteries

no code implementations 20 Oct 2022 Ningbo Cai, Yuwen Qin, Xin Chen, Kai Wu

A state-coupled co-estimation method named Deep Inter and Intra-Cycle Attention Network (DIICAN) is proposed in this paper to estimate SOC, SOH, and RUL, which organizes battery measurement data into the intra-cycle and inter-cycle time scales.

Management

Motion-related Artefact Classification Using Patch-based Ensemble and Transfer Learning in Cardiac MRI

1 code implementation 14 Oct 2022 Ruizhe Li, Xin Chen

The final trained model was also evaluated on an independent test set by the CMRxMotion organisers, which achieved the classification accuracy of 72.5% and Cohen's Kappa of 0.6309 (ranked top 1 in this grand challenge).

Transfer Learning

ConvTransSeg: A Multi-resolution Convolution-Transformer Network for Medical Image Segmentation

no code implementations 13 Oct 2022 Zhendi Gong, Andrew P. French, Guoping Qiu, Xin Chen

We compared our method with many other state-of-the-art hybrid CNN and Transformer segmentation models on binary and multiple class image segmentation tasks using several public medical image datasets, including skin lesion, polyp, cell and brain tissue.

Image Segmentation Medical Image Segmentation +2

Transfer Deep Reinforcement Learning-based Large-scale V2G Continuous Charging Coordination with Renewable Energy Sources

no code implementations 13 Oct 2022 Yubao Zhang, Xin Chen, Yuchen Zhang

Due to the increasing popularity of electric vehicles (EVs) and the technological advancement of EV electronics, the vehicle-to-grid (V2G) technique and large-scale scheduling algorithms have been developed to achieve a high level of renewable energy and power grid stability.

Scheduling Transfer Learning

Two-Stream UNET Networks for Semantic Segmentation in Medical Images

no code implementations 27 Jul 2022 Xin Chen, Ke Ding

Recent advances of semantic image segmentation greatly benefit from deeper and larger Convolutional Neural Network (CNN) models.

Image Segmentation Medical Image Segmentation +3

Contrastive Deep Supervision

1 code implementation 12 Jul 2022 Linfeng Zhang, Xin Chen, Junbo Zhang, Runpei Dong, Kaisheng Ma

The success of deep learning is usually accompanied by the growth in neural network depth.

Contrastive Learning Fine-Grained Image Classification +3

SRRT: Search Region Regulation Tracking

no code implementations 10 Jul 2022 Jiawen Zhu, Xin Chen, Pengyu Zhang, Xinying Wang, Dong Wang, Wenda Zhao, Huchuan Lu

Trackers tend to lose the target object due to the limited search region or be interfered with by distractors due to the excessive search region.

Sequential Recommendation Model for Next Purchase Prediction

1 code implementation 6 Jul 2022 Xin Chen, Alex Reibman, Sanjay Arora

Timeliness and contextual accuracy of recommendations are increasingly important when delivering contemporary digital marketing experiences.

Marketing Sequential Recommendation

An Adaptive Federated Relevance Framework for Spatial Temporal Graph Learning

no code implementations 7 Jun 2022 Tiehua Zhang, Yuze Liu, Zhishu Shen, Rui Xu, Xin Chen, Xiaowei Huang, Xi Zheng

Spatial-temporal data contains rich information and has been widely studied in recent years due to the rapid development of relevant applications in many fields.

Federated Learning Graph Learning

Pruning-as-Search: Efficient Neural Architecture Search via Channel Pruning and Structural Reparameterization

1 code implementation 2 Jun 2022 Yanyu Li, Pu Zhao, Geng Yuan, Xue Lin, Yanzhi Wang, Xin Chen

By combining the structural reparameterization and PaS, we successfully searched out a new family of VGG-like and lightweight networks, which enable the flexibility of arbitrary width with respect to each layer instead of each stage.

Instance Segmentation Network Pruning +2

An Empirical Investigation of Representation Learning for Imitation

2 code implementations 16 May 2022 Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah

We propose a modular framework for constructing representation learning algorithms, then use our framework to evaluate the utility of representation learning for imitation across several environment suites.

Image Classification Imitation Learning +1

Hybrid CNN Based Attention with Category Prior for User Image Behavior Modeling

no code implementations 5 May 2022 Xin Chen, Qingtao Tang, Ke Hu, Yue Xu, Shihang Qiu, Jia Cheng, Jun Lei

In Meituan, one of the largest e-commerce platforms in China, an item is typically displayed with its image, and whether a user clicks the item or not is usually influenced by that image, which implies that users' image behaviors are helpful for understanding users' visual preferences and improving the accuracy of CTR prediction.

Click-Through Rate Prediction

Efficient Visual Tracking via Hierarchical Cross-Attention Transformer

1 code implementation 25 Mar 2022 Xin Chen, Ben Kang, Dong Wang, Dongdong Li, Huchuan Lu

Most state-of-the-art trackers are satisfied with the real-time speed on powerful GPUs.

Visual Tracking

High-Performance Transformer Tracking

1 code implementation 25 Mar 2022 Xin Chen, Bin Yan, Jiawen Zhu, Huchuan Lu, Xiang Ruan, Dong Wang

First, we present a transformer tracking (named TransT) method based on the Siamese-like feature extraction backbone, the designed attention-based fusion mechanism, and the classification and regression head.

Vocal Bursts Intensity Prediction

Multi-view Multi-behavior Contrastive Learning in Recommendation

1 code implementation 20 Mar 2022 Yiqing Wu, Ruobing Xie, Yongchun Zhu, Xiang Ao, Xin Chen, Xu Zhang, Fuzhen Zhuang, Leyu Lin, Qing He

We argue that MBR models should: (1) model the coarse-grained commonalities between different behaviors of a user, (2) consider both individual sequence view and global graph view in multi-behavior modeling, and (3) capture the fine-grained differences between multiple behaviors of a user.

Contrastive Learning

Spectral Graph Clustering for Intentional Islanding Operations in Resilient Hybrid Energy Systems

no code implementations 13 Mar 2022 Jiaxin Wu, Xin Chen, Sobhan Badakhshan, Jie Zhang, Pingfeng Wang

Establishing cleaner energy generation, and thereby improving the sustainability of the power system, is a crucial task in this century, and one of the key strategies being pursued is to shift the dependence on fossil fuel to renewable technologies such as wind, solar, and nuclear.

Clustering Graph Clustering +1

Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation

no code implementations CVPR 2022 Linfeng Zhang, Xin Chen, Xiaobing Tu, Pengfei Wan, Ning Xu, Kaisheng Ma

Instead of directly distilling the generated images of teachers, wavelet knowledge distillation first decomposes the images into different frequency bands with discrete wavelet transformation and then only distills the high frequency bands.

Image-to-Image Translation Knowledge Distillation +1

RestainNet: a self-supervised digital re-stainer for stain normalization

no code implementations 28 Feb 2022 Bingchao Zhao, Jiatai Lin, Changhong Liang, Zongjian Yi, Xin Chen, Bingbing Li, Weihao Qiu, Danyi Li, Li Liang, Chu Han, Zaiyi Liu

In this paper, we formulated stain normalization as a digital re-staining process and proposed a self-supervised learning model, which is called RestainNet.

Self-Supervised Learning

Remaining Useful Life Prediction Using Temporal Deep Degradation Network for Complex Machinery with Attention-based Feature Extraction

no code implementations 21 Feb 2022 Yuwen Qin, Ningbo Cai, Chen Gao, Yadong Zhang, Yonghong Cheng, Xin Chen

The degradation-related features extracted from the sensor streaming data with neural networks can dramatically improve the accuracy of the RUL prediction.

CenGCN: Centralized Convolutional Networks with Vertex Imbalance for Scale-Free Graphs

no code implementations 16 Feb 2022 Feng Xia, Lei Wang, Tao Tang, Xin Chen, Xiangjie Kong, Giles Oatley, Irwin King

In each non-output layer of the GCN, this framework uses a hub attention mechanism to assign new weights to connected non-hub vertices based on their common information with hub vertices.

Link Prediction

Attention-based Deep Neural Networks for Battery Discharge Capacity Forecasting

no code implementations 14 Feb 2022 Yadong Zhang, Chenye Zou, Xin Chen

The battery capacity in different cycles can be measured with the temporal patterns extracted from the streaming sensor data based on the attention mechanism.

Capacity Estimation Management

Fast Transient Stability Prediction Using Grid-informed Temporal and Topological Embedding Deep Neural Network

no code implementations 23 Jan 2022 Peiyuan Sun, Long Huo, Siyuan Liang, Xin Chen

Transient stability prediction is critically essential to the fast online assessment and maintaining the stable operation in power systems.

Time Series Time Series Analysis

AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees

no code implementations 20 Jan 2022 Rong Liang, Tiehua Zhang, Yujie Lu, Yuze Liu, Zhen Huang, Xin Chen

Specifically, we collect a sheer number of source codes (both Java and Python) from the Alipay code repository and incorporate both syntactic and semantic code knowledge into our model through the help of code parsers, in which AST information of the source codes can be interpreted and integrated.

Clone Detection Code Search +2

Resource allocation algorithm for MEC based on Deep Reinforcement Learning

no code implementations IEEE 2022 Yijie Wang, Xin Chen, Ying Chen, Shougang Du

In recent years, driven by the commercialization of the 6th Generation Communication Technology (6G), an increasing number of 6G devices connected to mobile networks produces computation-intensive tasks such as ultra-high-resolution video streaming, interactive virtual reality (VR) gaming, and augmented reality (AR).

Edge-computing reinforcement-learning

GPS: A Policy-driven Sampling Approach for Graph Representation Learning

no code implementations 29 Dec 2021 Tiehua Zhang, Yuze Liu, Xin Chen, Xiaowei Huang, Feng Zhu, Xi Zheng

Graph representation learning has drawn increasing attention in recent years, especially for learning the low dimensional embedding at both node and graph level for classification and recommendation tasks.

Graph Classification Graph Representation Learning

Radiomic biomarker extracted from PI-RADS 3 patients support more efficient and robust prostate cancer diagnosis: a multi-center study

no code implementations 23 Dec 2021 Longfei Li, Rui Yang, Xin Chen, Cheng Li, Hairong Zheng, Yusong Lin, Zaiyi Liu, Shanshan Wang

Prostate Imaging Reporting and Data System (PI-RADS) based on multi-parametric MRI classifies patients into 5 categories (PI-RADS 1-5) for routine clinical diagnosis guidance.

On the Bias-Variance-Cost Tradeoff of Stochastic Optimization

no code implementations NeurIPS 2021 Yifan Hu, Xin Chen, Niao He

We consider stochastic optimization when one only has access to biased stochastic oracles of the objective, and obtaining stochastic gradients with low biases comes at high costs.

Bilevel Optimization Stochastic Optimization

Cellular Network Radio Propagation Modeling with Deep Convolutional Neural Networks

no code implementations 5 Oct 2021 Xin Zhang, Xiujun Shu, Bingwen Zhang, Jie Ren, Lizhou Zhou, Xin Chen

Deterministic models, such as ray tracing based on physical laws of wave propagation, are more accurate and site specific.

EVOQUER: Enhancing Temporal Grounding with Video-Pivoted BackQuery Generation

no code implementations 10 Sep 2021 Yanjun Gao, Lulu Liu, Jason Wang, Xin Chen, Huayan Wang, Rui Zhang

Given a query and an untrimmed video, the temporal grounding model predicts the target interval, and the predicted video clip is fed into a video translation task by generating a simplified version of the input query.

Translation Video Grounding

Empirical Study of Named Entity Recognition Performance Using Distribution-aware Word Embedding

no code implementations 3 Sep 2021 Xin Chen, Qi Zhao, Xinyang Liu

And the result shows that the performance of NER will be improved if the word specificity is incorporated into existing NER methods.

named-entity-recognition Named Entity Recognition +2

Neural Free-Viewpoint Performance Rendering under Complex Human-object Interactions

no code implementations 1 Aug 2021 Guoxing Sun, Xin Chen, Yizhang Chen, Anqi Pang, Pei Lin, Yuheng Jiang, Lan Xu, Jingya Wang, Jingyi Yu

In this paper, we propose a neural human performance capture and rendering system to generate both high-quality geometry and photo-realistic texture of both human and objects under challenging interaction scenarios in arbitrary novel views, from only sparse RGB streams.

4D reconstruction Dynamic Reconstruction +5

Template-based Chatbot for Agriculture Related FAQs

no code implementations 27 Jul 2021 Daping Zhang, Xin Chen, Yujia Zhang, Shihan Qin

Agriculture is the fundamental industry of the society, which is the basis of food supply and an important source of employment and GDP increase.

Chatbot

Few-shot Neural Human Performance Rendering from Sparse RGBD Videos

no code implementations 14 Jul 2021 Anqi Pang, Xin Chen, Haimin Luo, Minye Wu, Jingyi Yu, Lan Xu

To fill this gap, in this paper we propose a few-shot neural human rendering approach (FNHR) from only sparse RGBD inputs, which exploits the temporal and spatial redundancy to generate photo-realistic free-view output of human activities.

Neural Rendering

POLAR: A Polynomial Arithmetic Framework for Verifying Neural-Network Controlled Systems

2 code implementations 25 Jun 2021 Chao Huang, Jiameng Fan, Zhilu Wang, YiXuan Wang, Weichao Zhou, Jiajun Li, Xin Chen, Wenchao Li, Qi Zhu

We present POLAR, a polynomial arithmetic-based framework for efficient bounded-time reachability analysis of neural-network controlled systems (NNCSs).

Interior point search for nonparametric image segmentation

no code implementations 11 Jun 2021 Sinan Onal, Xin Chen, Madagedara Maduka Balasooriya

Thus, our method offers robust automatic image segmentation that is simpler to use and less time-consuming than traditional snake models.

Boundary Detection Image Segmentation +1

TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search

2 code implementations CVPR 2021 Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li

While existing NAS methods mostly design architectures on a single task, algorithms that look beyond single-task search are surging to pursue a more efficient and universal solution across various tasks.

Neural Architecture Search Transfer Learning

SportsCap: Monocular 3D Human Motion Capture and Fine-grained Understanding in Challenging Sports Videos

1 code implementation 23 Apr 2021 Xin Chen, Anqi Pang, Wei Yang, Yuexin Ma, Lan Xu, Jingyi Yu

In this paper, we propose SportsCap -- the first approach for simultaneously capturing 3D human motions and understanding fine-grained actions from monocular challenging sports video input.

Action Assessment Attribute +1

Transformer Tracking

1 code implementation CVPR 2021 Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu

The correlation operation is a simple fusion manner to consider the similarity between the template and the search region.

Visual Object Tracking Visual Tracking

ChallenCap: Monocular 3D Capture of Challenging Human Performances using Multi-Modal References

2 code implementations CVPR 2021 Yannan He, Anqi Pang, Xin Chen, Han Liang, Minye Wu, Yuexin Ma, Lan Xu

We propose a hybrid motion inference stage with a generation network, which utilizes a temporal encoder-decoder to extract the motion details from the pair-wise sparse-view reference, as well as a motion discriminator to utilize the unpaired marker-based references to extract specific challenging motion characteristics in a data-driven manner.

Two-sided Dirichlet heat estimates of symmetric stable processes on horn-shaped regions

no code implementations 29 Jan 2021 Xin Chen, Panki Kim, Jian Wang

In this paper, we consider symmetric $\alpha$-stable processes on (unbounded) horn-shaped regions which are non-uniformly $C^{1, 1}$ near infinity.

Probability

Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future Challenges

no code implementations 27 Jan 2021 Xin Chen, Guannan Qu, Yujie Tang, Steven Low, Na Li

With large-scale integration of renewable generation and distributed energy resources, modern power systems are confronted with new operational challenges, such as growing complexity, increasing uncertainty, and aggravating volatility.

Decision Making energy management +2

Optimal Clustering in Anisotropic Gaussian Mixture Models

no code implementations 14 Jan 2021 Xin Chen, Anderson Y. Zhang

We study the clustering task under anisotropic Gaussian Mixture Models where the covariance matrices from different clusters are unknown and are not necessarily the identical matrix.

Clustering

Exploring Geometry-Aware Contrast and Clustering Harmonization for Self-Supervised 3D Object Detection

no code implementations ICCV 2021 Hanxue Liang, Chenhan Jiang, Dapeng Feng, Xin Chen, Hang Xu, Xiaodan Liang, Wei zhang, Zhenguo Li, Luc van Gool

Here we present a novel self-supervised 3D Object detection framework that seamlessly integrates the geometry-aware contrast and clustering harmonization to lift the unsupervised 3D representation learning, named GCC-3D.

3D Object Detection Clustering +4

TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search

2 code implementations 1 Jan 2021 Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li

While existing NAS methods mostly design architectures on one single task, algorithms that look beyond single-task search are surging to pursue a more efficient and universal solution across various tasks.

Neural Architecture Search Transfer Learning

Anomaly Detection in Time Series with Triadic Motif Fields and Application in Atrial Fibrillation ECG Classification

2 code implementations 9 Dec 2020 Yadong Zhang, Xin Chen

Considering the quasi-periodic characteristics of ECG signals, the dynamic features can be extracted from the TMF images with the transfer learning pre-trained convolutional neural network (CNN) models.

Anomaly Detection Atrial Fibrillation Detection +8

Object SLAM-Based Active Mapping and Robotic Grasping

1 code implementation 3 Dec 2020 Yanmin Wu, Yunzhou Zhang, Delong Zhu, Xin Chen, Sonya Coleman, Wenkai Sun, Xinggang Hu, Zhiqiang Deng

The framework is built on an object SLAM system integrated with a simultaneous multi-object pose estimation process that is optimized for robotic grasping.

Object Object SLAM +2

Graph Stochastic Neural Networks for Semi-supervised Learning

1 code implementation NeurIPS 2020 Haibo Wang, Chuan Zhou, Xin Chen, Jia Wu, Shirui Pan, Jilong Wang

Graph Neural Networks (GNNs) have achieved remarkable performance in the task of the semi-supervised node classification.

Classification General Classification +3

Attention Aware Cost Volume Pyramid Based Multi-view Stereo Network for 3D Reconstruction

1 code implementation 25 Nov 2020 Anzhu Yu, Wenyue Guo, Bing Liu, Xin Chen, Xin Wang, Xuefeng Cao, Bingchuan Jiang

This strategy estimates the depth map at coarsest level, while the depth maps at finer levels are considered as the upsampled depth map from previous level with pixel-wise depth residual.

3D Reconstruction

Multi-feature driven active contour segmentation model for infrared image with intensity inhomogeneity

no code implementations 25 Nov 2020 Qinyan Huang, Weiwen Zhou, Minjie Wan, Xin Chen, Qian Chen, Guohua Gu

Active contour model (ACM) is one of the most widely used image segmentation tools at present, but the existing methods only utilize the local or global single feature information of image to minimize the energy function, which is easy to cause false segmentations in IR images.

Image Segmentation Segmentation +1

Learning to Build User-tag Profile in Recommendation System (UTPM)

1 code implementation ACM International Conference on Information and Knowledge Management 2020 Su Yan, Xin Chen, Ran Huo, Xu Zhang, Leyu Lin

User profiling is one of the most important components in recommendation systems, where a user is profiled using demographic (e.g., gender, age, and location) and user behavior information (e.g., browsing and search history).

Multi-Label Classification Recommendation Systems +1

Online Learning and Distributed Control for Residential Demand Response

no code implementations 11 Oct 2020 Xin Chen, YingYing Li, Jun Shimada, Na Li

This paper studies the automated control method for regulating air conditioner (AC) loads in incentive-based residential demand response (DR).

Stochastic Optimization Thompson Sampling

Balancing Common Treatment and Epidemic Control in Medical Procurement during COVID-19: Transform-and-Divide Evolutionary Optimization

no code implementations 2 Aug 2020 Yu-Jun Zheng, Xin Chen, Tie-Er Gan, Min-Xia Zhang, Wei-Guo Sheng, Ling Wang

In this paper, we present an approach that first transforms the original high-dimensional, constrained multiobjective optimization problem to a low-dimensional, unconstrained multiobjective optimization problem, and then evaluates each solution to the transformed problem by solving a set of simple single-objective optimization subproblems, such that the problem can be efficiently solved by existing evolutionary multiobjective algorithms.

Evolutionary Algorithms Multiobjective Optimization

GOLD-NAS: Gradual, One-Level, Differentiable

1 code implementation 7 Jul 2020 Kaifeng Bi, Lingxi Xie, Xin Chen, Longhui Wei, Qi Tian

There has been a large literature of neural architecture search, but most existing work made use of heuristic rules that largely constrained the search flexibility.

Image Classification Neural Architecture Search

GCN-BMP: Investigating Graph Representation Learning for DDI Prediction Task

1 code implementation Methods 2020 Xin Chen, Xien Liu, Ji Wu

To alleviate this problem, we investigate the utilization of the end-to-end graph representation learning for the DDI prediction task.

Graph Representation Learning Inductive Bias

AutoSweep: Recovering 3D Editable Objects from a Single Photograph

1 code implementation 27 May 2020 Xin Chen, Yuwei Li, Xi Luo, Tianjia Shao, Jingyi Yu, Kun Zhou, Youyi Zheng

We base our work on the assumption that most human-made objects are constituted by parts and these parts can be well represented by generalized primitives.

3D Reconstruction Instance Segmentation +1

AnimeGAN: A Novel Lightweight GAN for Photo Animation

3 code implementations International Symposium on Intelligence Computation and Applications 2020 Jie Chen, Gang Liu, Xin Chen

The existing methods usually have some problems, among which significant problems mainly include: 1) the generated images have no obvious animated style textures; 2) the generated images lose the content of the original images; 3) the parameters of the network require the large memory capacity.

Generative Adversarial Network Style Transfer

A Novel Weighted Combination Method for Feature Selection using Fuzzy Sets

no code implementations 11 May 2020 Zixiao Shen, Xin Chen, Jonathan M. Garibaldi

In this paper, we propose a novel weighted combination feature selection method using bootstrap and fuzzy sets.

feature selection

Leveraging Two-Stage Adaptive Robust Optimization for Power Flexibility Aggregation

no code implementations 7 May 2020 Xin Chen, Na Li

This method is applicable to aggregate only the active (or reactive) power, and the joint active-reactive power domain.

Vocal Bursts Valence Prediction

DRU-net: An Efficient Deep Convolutional Neural Network for Medical Image Segmentation

1 code implementation 28 Apr 2020 Mina Jafari, Dorothee Auer, Susan Francis, Jonathan Garibaldi, Xin Chen

In comparison with ResNet-based, DenseNet-based and attention network (AttnNet) based methods within the same encoder-decoder network structure, our method achieves significantly higher segmentation accuracy with fewer number of model parameters than DenseNet and AttnNet.

Image Segmentation Lesion Segmentation +3

Fitting the Search Space of Weight-sharing NAS with Graph Convolutional Networks

no code implementations 17 Apr 2020 Xin Chen, Lingxi Xie, Jun Wu, Longhui Wei, Yuhui Xu, Qi Tian

We alleviate this issue by training a graph convolutional network to fit the performance of sampled sub-networks so that the impact of random errors becomes minimal.

Neural Architecture Search

A generic ensemble based deep convolutional neural network for semi-supervised medical image segmentation

1 code implementation 16 Apr 2020 Ruizhe Li, Dorothee Auer, Christian Wagner, Xin Chen

To address this problem, we propose a generic semi-supervised learning framework for image segmentation based on a deep convolutional neural network (DCNN).

Image Segmentation Lesion Segmentation +5

Triad State Space Construction for Chaotic Signal Classification with Deep Learning

no code implementations 26 Mar 2020 Yadong Zhang, Xin Chen

Inspired by the well-known permutation entropy (PE), an effective image encoding scheme for chaotic time series, Triad State Space Construction (TSSC), is proposed.

Classification General Classification +3

Circumventing Outliers of AutoAugment with Knowledge Distillation

1 code implementation ECCV 2020 Longhui Wei, An Xiao, Lingxi Xie, Xin Chen, Xiaopeng Zhang, Qi Tian

AutoAugment has been a powerful algorithm that improves the accuracy of many vision tasks, yet it is sensitive to the operator space as well as hyper-parameters, and an improper setting may degenerate network optimization.

Data Augmentation General Classification +2

Online Residential Demand Response via Contextual Multi-Armed Bandits

no code implementations 7 Mar 2020 Xin Chen, Yutong Nie, Na Li

Residential loads have great potential to enhance the efficiency and reliability of electricity systems via demand response (DR) programs.

Decision Making Multi-Armed Bandits +1

Biased Stochastic Gradient Descent for Conditional Stochastic Optimization

no code implementations 25 Feb 2020 Yifan Hu, Siqi Zhang, Xin Chen, Niao He

Conditional Stochastic Optimization (CSO) covers a variety of applications ranging from meta-learning and causal inference to invariant learning.

Causal Inference Meta-Learning +2

Motif Difference Field: A Simple and Effective Image Representation of Time Series for Classification

no code implementations 21 Jan 2020 Yadong Zhang, Xin Chen

Inspired by the convolutional neural network (CNN) classifier based on the image representations of time series, motif difference field (MDF) is proposed.

Clustering General Classification +3

Latency-Aware Differentiable Neural Architecture Search

1 code implementation 17 Jan 2020 Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Bowen Shi, Qi Tian, Hongkai Xiong

However, these methods suffer the difficulty in optimizing network, so that the searched network is often unfriendly to hardware.

Neural Architecture Search

Progressive DARTS: Bridging the Optimization Gap for NAS in the Wild

4 code implementations 23 Dec 2019 Xin Chen, Lingxi Xie, Jun Wu, Qi Tian

With the rapid development of neural architecture search (NAS), researchers found powerful network architectures for a wide range of vision tasks.

Neural Architecture Search

MM Algorithms for Distance Covariance based Sufficient Dimension Reduction and Sufficient Variable Selection

no code implementations 13 Dec 2019 Runxiong Wu, Xin Chen

Sufficient dimension reduction (SDR) using distance covariance (DCOV) was recently proposed as an approach to dimension-reduction problems.

Dimensionality Reduction Variable Selection

Anion charge-lattice volume dependent Li ion migration in compounds with the face-centered cubic anion frameworks

no code implementations 25 Oct 2019 Zhenming Xu, Xin Chen, Ronghan Chen, Xin Li, Hong Zhu

In this work, the face-centered cubic (fcc) anion frameworks were creatively constructed to study the effects of anion charge and lattice volume on the stability of lithium ion occupation and lithium ion migration.

Applied Physics

Stabilizing DARTS with Amended Gradient Estimation on Architectural Parameters

1 code implementation 25 Oct 2019 Kaifeng Bi, Changping Hu, Lingxi Xie, Xin Chen, Longhui Wei, Qi Tian

Our approach bridges the gap from two aspects, namely, amending the estimation on the architectural gradients, and unifying the hyper-parameter settings in the search and re-training stages.

Neural Architecture Search

Data Augmentation Revisited: Rethinking the Distribution Gap between Clean and Augmented Data

no code implementations 19 Sep 2019 Zhuoxun He, Lingxi Xie, Xin Chen, Ya zhang, Yan-Feng Wang, Qi Tian

Data augmentation has been widely applied as an effective methodology to improve generalization in particular when training deep neural networks.

Data Augmentation Image Classification +2

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

8 code implementations ICLR 2020 Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, Hongkai Xiong

Differentiable architecture search (DARTS) provided a fast solution in finding effective network architectures, but suffered from large memory and computing overheads in jointly training a super-network and searching for an optimal architecture.

Neural Architecture Search

Online Optimal Control with Linear Dynamics and Predictions: Algorithms and Regret Analysis

1 code implementation NeurIPS 2019 Ying-Ying Li, Xin Chen, Na Li

In addition, we provide a fundamental limit of the dynamic regret for any online algorithms by considering linear quadratic tracking problems.

Optimization and Control

ReachNN: Reachability Analysis of Neural-Network Controlled Systems

1 code implementation 25 Jun 2019 Chao Huang, Jiameng Fan, Wenchao Li, Xin Chen, Qi Zhu

In this work, we propose a new reachability analysis approach based on Bernstein polynomials that can verify neural-network controlled systems with a more general form of activation functions, i.e., as long as they ensure that the neural networks are Lipschitz continuous.

Sample Complexity of Sample Average Approximation for Conditional Stochastic Optimization

no code implementations 28 May 2019 Yifan Hu, Xin Chen, Niao He

In this paper, we study a class of stochastic optimization problems, referred to as Conditional Stochastic Optimization (CSO), in the form of $\min_{x \in \mathcal{X}} \mathbb{E}_{\xi} f_\xi\big(\mathbb{E}_{\eta|\xi}[g_\eta(x,\xi)]\big)$, which finds a wide spectrum of applications including portfolio selection, reinforcement learning, robust learning, causal inference and so on.

Causal Inference Stochastic Optimization

Progressive Differentiable Architecture Search: Bridging the Depth Gap between Search and Evaluation

4 code implementations ICCV 2019 Xin Chen, Lingxi Xie, Jun Wu, Qi Tian

Recently, differentiable search methods have made major progress in reducing the computational costs of neural architecture search.

Neural Architecture Search

Attention Distillation for Learning Video Representations

no code implementations 5 Apr 2019 Miao Liu, Xin Chen, Yun Zhang, Yin Li, James M. Rehg

To this end, we make use of attention modules that learn to highlight regions in the video and aggregate features for recognition.

Action Recognition Video Recognition

TightCap: 3D Human Shape Capture with Clothing Tightness Field

1 code implementation 4 Apr 2019 Xin Chen, Anqi Pang, Yang Wei, Lan Xui, Jingyi Yu

In this paper, we present TightCap, a data-driven scheme to capture both the human shape and dressed garments accurately with only a single 3D human scan, which enables numerous applications such as virtual try-on, biometrics and body evaluation.

Virtual Try-on

Singing voice conversion with non-parallel data

no code implementations 11 Mar 2019 Xin Chen, Wei Chu, Jinxi Guo, Ning Xu

F0 and aperiodic are obtained through the original singing voice, and used with acoustic features to reconstruct the target singing voice through a vocoder.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

BandNet: A Neural Network-based, Multi-Instrument Beatles-Style MIDI Music Composition Machine

no code implementations 18 Dec 2018 Yichao Zhou, Wei Chu, Sam Young, Xin Chen

In the learning stage, a sequence of stylistically uniform, multiple-channel music samples was modeled by a RNN.

Unsupervised Domain Adaptation using Generative Models and Self-ensembling

no code implementations 2 Dec 2018 Eman T. Hassan, Xin Chen, David Crandall

The results suggest that self-ensembling is better than simple data augmentation with the newly generated data and a single model trained this way can have the best performance across all different transfer tasks.

Data Augmentation Style Transfer +1

Beyond "How may I help you?": Assisting Customer Service Agents with Proactive Responses

no code implementations 26 Nov 2018 Mengting Wan, Xin Chen

We study the problem of providing recommended responses to customer service agents in live-chat dialogue systems.

2PFPCE: Two-Phase Filter Pruning Based on Conditional Entropy

no code implementations 6 Sep 2018 Chuhan Min, Aosen Wang, Yiran Chen, Wenyao Xu, Xin Chen

To overcome this challenge, we propose a novel filter-pruning framework, two-phase filter pruning based on conditional entropy, namely 2PFPCE, to compress the CNN models and reduce the inference time with marginal performance degradation.

Edge-computing Neural Network Compression +1

A Novel Co-design Peta-scale Heterogeneous Cluster for Deep Learning Training

no code implementations 7 Feb 2018 Xin Chen, Hua Zhou, Yuxiang Gao, Yu Zhu

Therefore, MiMatrix intrinsically solves the bandwidth bottleneck of the central node in the parameter server framework that is widely used in distributed DL tasks.

Scheduling

Fully-Coupled Two-Stream Spatiotemporal Networks for Extremely Low Resolution Action Recognition

no code implementations 11 Jan 2018 Mingze Xu, Aidean Sharghi, Xin Chen, David J. Crandall

A major emerging challenge is how to protect people's privacy as cameras and computer vision are increasingly integrated into our daily lives, including in smart devices inside homes.

Action Recognition Temporal Action Localization

Beyond saliency: understanding convolutional neural networks from saliency prediction on layer-wise relevance propagation

2 code implementations 22 Dec 2017 Heyi Li, Yunke Tian, Klaus Mueller, Xin Chen

In this paper, we propose a novel two-step understanding method, namely Salient Relevance (SR) map, which aims to shed light on how deep CNNs recognize images and learn features from areas, referred to as attention areas, therein.

Saliency Prediction

ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene

no code implementations 30 Nov 2017 Daitao Xing, Zichen Li, Xin Chen, Yi Fang

Arbitrary-oriented text detection in the wild is a very challenging task, due to the aspect ratio, scale, orientation, and illumination variations.

Text Detection

Sparse Photometric 3D Face Reconstruction Guided by Morphable Models

no code implementations CVPR 2018 Xuan Cao, Zhang Chen, Anpei Chen, Xin Chen, Cen Wang, Jingyi Yu

We present a novel 3D face reconstruction technique that leverages sparse photometric stereo (PS) and the latest advances in face registration/modeling from a single image.

3D Face Reconstruction Position +1

Deep Neural Network Capacity

no code implementations 16 Aug 2017 Aosen Wang, Hua Zhou, Wenyao Xu, Xin Chen

However, the capacity of deep neural network architectures is still a mystery to researchers.

Quantization valid
