Search Results for author: Yong liu

Found 345 papers, 134 papers with code

TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis

3 code implementations5 Oct 2022 Haixu Wu, Tengge Hu, Yong liu, Hang Zhou, Jianmin Wang, Mingsheng Long

TimesBlock can discover the multi-periodicity adaptively and extract the complex temporal variations from transformed 2D tensors by a parameter-efficient inception block.

Action Recognition Anomaly Detection +4

iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

4 code implementations10 Oct 2023 Yong liu, Tengge Hu, Haoran Zhang, Haixu Wu, Shiyu Wang, Lintao Ma, Mingsheng Long

These forecasters leverage Transformers to model the global dependencies over temporal tokens of time series, with each token formed by multiple variates of the same timestamp.

Time Series Time Series Forecasting

RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer

2 code implementations12 Apr 2023 Jiahao Wang, Songyang Zhang, Yong liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin

Extensive experiments and ablative analysis also demonstrate that the inductive bias of network architecture, can be incorporated into simple network structure with appropriate optimization strategy.

Inductive Bias

ActionCLIP: A New Paradigm for Video Action Recognition

2 code implementations17 Sep 2021 Mengmeng Wang, Jiazheng Xing, Yong liu

Moreover, to handle the deficiency of label texts and make use of tremendous web data, we propose a new paradigm based on this multimodal learning framework for action recognition, which we dub "pre-train, prompt and fine-tune".

Action Classification Action Recognition In Videos +4

Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation

2 code implementations ICCV 2021 Lina Liu, Xibin Song, Mengmeng Wang, Yong liu, Liangjun Zhang

Meanwhile, to guarantee that the day and night images contain the same information, the domain-separated network takes the day-time images and corresponding night-time images (generated by GAN) as input, and the private and invariant feature extractors are learned by orthogonality and similarity loss, where the domain gap can be alleviated, thus better depth maps can be expected.

Monocular Depth Estimation

Targetless Calibration of LiDAR-IMU System Based on Continuous-time Batch Estimation

2 code implementations29 Jul 2020 Jiajun Lv, Jinhong Xu, Kewei Hu, Yong liu, Xingxing Zuo

Sensor calibration is the fundamental block for a multi-sensor fusion system.

Robotics

How Can Recommender Systems Benefit from Large Language Models: A Survey

1 code implementation9 Jun 2023 Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang

In this paper, we conduct a comprehensive survey on this research direction from the perspective of the whole pipeline in real-world recommender systems.

Ethics Feature Engineering +5

Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models

1 code implementation21 Dec 2023 Xianfang Zeng, Xin Chen, Zhongqi Qi, Wen Liu, Zibo Zhao, Zhibin Wang, Bin Fu, Yong liu, Gang Yu

This paper presents Paint3D, a novel coarse-to-fine generative framework that is capable of producing high-resolution, lighting-less, and diverse 2K UV texture maps for untextured 3D meshes conditioned on text or image inputs.

2k

Parse Geometry from a Line: Monocular Depth Estimation with Partial Laser Observation

4 code implementations17 Oct 2016 Yiyi Liao, Lichao Huang, Yue Wang, Sarath Kodagoda, Yinan Yu, Yong liu

Many standard robotic platforms are equipped with at least a fixed 2D laser range finder and a monocular camera.

Depth Completion

Non-stationary Transformers: Exploring the Stationarity in Time Series Forecasting

1 code implementation28 May 2022 Yong liu, Haixu Wu, Jianmin Wang, Mingsheng Long

However, their performance can degenerate terribly on non-stationary real-world data in which the joint distribution changes over time.

Time Series Time Series Forecasting

CL-MAPF: Multi-Agent Path Finding for Car-Like Robots with Kinematic and Spatiotemporal Constraints

1 code implementation1 Nov 2020 Licheng Wen, Zhen Zhang, Zhe Chen, Xiangrui Zhao, Yong liu

In this paper, we give a mathematical formalization of Multi-Agent Path Finding for Car-Like robots (CL-MAPF) problem.

Robotics Multiagent Systems

Bootstrap Latent Representations for Multi-modal Recommendation

2 code implementations13 Jul 2022 Xin Zhou, HongYu Zhou, Yong liu, Zhiwei Zeng, Chunyan Miao, Pengwei Wang, Yuan You, Feijun Jiang

Besides the user-item interaction graph, existing state-of-the-art methods usually use auxiliary graphs (e. g., user-user or item-item relation graph) to augment the learned representations of users and/or items.

HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation

1 code implementation14 Dec 2020 Xiaoyang Lyu, Liang Liu, Mengmeng Wang, Xin Kong, Lina Liu, Yong liu, Xinxin Chen, Yi Yuan

To obtainmore accurate depth estimation in large gradient regions, itis necessary to obtain high-resolution features with spatialand semantic information.

Monocular Depth Estimation Self-Supervised Learning +2

Few-Shot Domain Adaptation with Polymorphic Transformers

1 code implementation10 Jul 2021 Shaohua Li, Xiuchao Sui, Jie Fu, Huazhu Fu, Xiangde Luo, Yangqin Feng, Xinxing Xu, Yong liu, Daniel Ting, Rick Siow Mong Goh

Thus, the chance of overfitting the annotations is greatly reduced, and the model can perform robustly on the target domain after being trained on a few annotated images.

Domain Adaptation Segmentation

LogME: Practical Assessment of Pre-trained Models for Transfer Learning

1 code implementation22 Feb 2021 Kaichao You, Yong liu, Jianmin Wang, Mingsheng Long

In pursuit of a practical assessment method, we propose to estimate the maximum value of label evidence given features extracted by pre-trained models.

Model Selection regression +2

Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model Hubs

1 code implementation20 Oct 2021 Kaichao You, Yong liu, Ziyang Zhang, Jianmin Wang, Michael I. Jordan, Mingsheng Long

(2) The best ranked PTM can either be fine-tuned and deployed if we have no preference for the model's architecture or the target PTM can be tuned by the top $K$ ranked PTMs via a Bayesian procedure that we propose.

SSC: Semantic Scan Context for Large-Scale Place Recognition

1 code implementation1 Jul 2021 Lin Li, Xin Kong, Xiangrui Zhao, Tianxin Huang, Yong liu

We also present a two-step global semantic ICP to obtain the 3D pose (x, y, yaw) used to align the point cloud to improve matching performance.

Translation Visual Place Recognition

Semantic Graph Based Place Recognition for 3D Point Clouds

1 code implementation26 Aug 2020 Xin Kong, Xuemeng Yang, Guangyao Zhai, Xiangrui Zhao, Xianfang Zeng, Mengmeng Wang, Yong liu, Wanlong Li, Feng Wen

First, we propose a novel semantic graph representation for the point cloud scenes by reserving the semantic and topological information of the raw point cloud.

Graph Matching Graph Similarity

SuperLine3D: Self-supervised Line Segmentation and Description for LiDAR Point Cloud

1 code implementation3 Aug 2022 Xiangrui Zhao, Sheng Yang, Tianxin Huang, Jun Chen, Teng Ma, Mingyang Li, Yong liu

To repetitively extract them as features and perform association between discrete LiDAR frames for registration, we propose the first learning-based feature segmentation and description model for 3D lines in LiDAR point cloud.

Point Cloud Registration Segmentation

AnatomyNet: Deep Learning for Fast and Fully Automated Whole-volume Segmentation of Head and Neck Anatomy

2 code implementations15 Aug 2018 Wentao Zhu, Yufang Huang, Liang Zeng, Xuming Chen, Yong liu, Zhen Qian, Nan Du, Wei Fan, Xiaohui Xie

Methods: Our deep learning model, called AnatomyNet, segments OARs from head and neck CT images in an end-to-end fashion, receiving whole-volume HaN CT images as input and generating masks of all OARs of interest in one shot.

3D Medical Imaging Segmentation Anatomy

Koopa: Learning Non-stationary Time Series Dynamics with Koopman Predictors

1 code implementation NeurIPS 2023 Yong liu, Chenyu Li, Jianmin Wang, Mingsheng Long

While previous models suffer from complicated series variations induced by changing temporal distribution, we tackle non-stationary time series with modern Koopman theory that fundamentally considers the underlying time-variant dynamics.

Time Series

Learning Quality-aware Dynamic Memory for Video Object Segmentation

1 code implementation16 Jul 2022 Yong liu, Ran Yu, Fei Yin, Xinyuan Zhao, Wei Zhao, Weihao Xia, Yujiu Yang

However, they mainly focus on better matching between the current frame and the memory frames without explicitly paying attention to the quality of the memory.

Ranked #11 on Semi-Supervised Video Object Segmentation on DAVIS 2016 (using extra training data)

Segmentation Semantic Segmentation +2

UniInst: Unique Representation for End-to-End Instance Segmentation

1 code implementation25 May 2022 Yimin Ou, Rui Yang, Lufan Ma, Yong liu, Jiangpeng Yan, Shang Xu, Chengjie Wang, Xiu Li

Existing instance segmentation methods have achieved impressive performance but still suffer from a common dilemma: redundant representations (e. g., multiple boxes, grids, and anchor points) are inferred for one instance, which leads to multiple duplicated predictions.

Instance Segmentation Re-Ranking +2

FReeNet: Multi-Identity Face Reenactment

1 code implementation CVPR 2020 Jiangning Zhang, Xianfang Zeng, Mengmeng Wang, Yusu Pan, Liang Liu, Yong liu, Yu Ding, Changjie Fan

This paper presents a novel multi-identity face reenactment framework, named FReeNet, to transfer facial expressions from an arbitrary source face to a target face with a shared model.

Face Reenactment

APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals

3 code implementations30 Apr 2020 Jiangning Zhang, Liang Liu, Zhu-Cun Xue, Yong liu

Audio-guided face reenactment aims at generating photorealistic faces using audio information while maintaining the same facial movement as when speaking to a real person.

Face Reenactment

APB2FaceV2: Real-Time Audio-Guided Multi-Face Reenactment

1 code implementation25 Oct 2020 Jiangning Zhang, Xianfang Zeng, Chao Xu, Jun Chen, Yong liu, Yunliang Jiang

Audio-guided face reenactment aims to generate a photorealistic face that has matched facial expression with the input audio.

Face Reenactment

Real3D-AD: A Dataset of Point Cloud Anomaly Detection

1 code implementation NeurIPS 2023 Jiaqi Liu, Guoyang Xie, Ruitao Chen, Xinpeng Li, Jinbao Wang, Yong liu, Chengjie Wang, Feng Zheng

High-precision point cloud anomaly detection is the gold standard for identifying the defects of advancing machining and precision manufacturing.

3D Anomaly Detection

IM-IAD: Industrial Image Anomaly Detection Benchmark in Manufacturing

2 code implementations31 Jan 2023 Guoyang Xie, Jinbao Wang, Jiaqi Liu, Jiayi Lyu, Yong liu, Chengjie Wang, Feng Zheng, Yaochu Jin

We realize that the lack of a uniform IM benchmark is hindering the development and usage of IAD methods in real-world applications.

Anomaly Detection Continual Learning +1

CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow

1 code implementation CVPR 2022 Xiuchao Sui, Shaohua Li, Xue Geng, Yan Wu, Xinxing Xu, Yong liu, Rick Goh, Hongyuan Zhu

This is mainly because the correlation volume, the basis of pixel matching, is computed as the dot product of the convolutional features of the two images.

Optical Flow Estimation

A Generalist FaceX via Learning Unified Facial Representation

1 code implementation31 Dec 2023 Yue Han, Jiangning Zhang, Junwei Zhu, Xiangtai Li, Yanhao Ge, Wei Li, Chengjie Wang, Yong liu, Xiaoming Liu, Ying Tai

This work presents FaceX framework, a novel facial generalist model capable of handling diverse facial tasks simultaneously.

Facial Editing

End-to-End Beam Retrieval for Multi-Hop Question Answering

2 code implementations17 Aug 2023 Jiahao Zhang, Haiyang Zhang, Dongmei Zhang, Yong liu, Shen Huang

This approach models the multi-hop retrieval process in an end-to-end manner by jointly optimizing an encoder and two classification heads across all hops.

Language Modelling Large Language Model +3

Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark

1 code implementation16 Apr 2024 Jiangning Zhang, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Zhucun Xue, Yong liu, Guansong Pang, DaCheng Tao

Moreover, current metrics such as AU-ROC have nearly reached saturation on simple datasets, which prevents a comprehensive evaluation of different methods.

Anomaly Detection object-detection +2

Searching Parameterized AP Loss for Object Detection

1 code implementation NeurIPS 2021 Chenxin Tao, Zizhang Li, Xizhou Zhu, Gao Huang, Yong liu, Jifeng Dai

In this paper, we propose Parameterized AP Loss, where parameterized functions are introduced to substitute the non-differentiable components in the AP calculation.

Object object-detection +1

Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning

1 code implementation ICCV 2023 Chun-Mei Feng, Kai Yu, Yong liu, Salman Khan, WangMeng Zuo

In this paper, we focus on a particular setting of learning adaptive prompts on the fly for each test sample from an unseen new domain, which is known as test-time prompt tuning (TPT).

Data Augmentation

Towards Reliable Medical Image Segmentation by utilizing Evidential Calibrated Uncertainty

3 code implementations1 Jan 2023 Ke Zou, Yidi Chen, Ling Huang, Xuedong Yuan, Xiaojing Shen, Meng Wang, Rick Siow Mong Goh, Yong liu, Huazhu Fu

DEviS not only enhances the calibration and robustness of baseline segmentation accuracy but also provides high-efficiency uncertainty estimation for reliable predictions.

Computational Efficiency Image Segmentation +3

A Survey of Visual Sensory Anomaly Detection

1 code implementation14 Feb 2022 Xi Jiang, Guoyang Xie, Jinbao Wang, Yong liu, Chengjie Wang, Feng Zheng, Yaochu Jin

In this survey, we are the first one to provide a comprehensive review of visual sensory AD and category into three levels according to the form of anomalies.

Anomaly Detection

Unsupervised Continual Anomaly Detection with Contrastively-learned Prompt

1 code implementation2 Jan 2024 Jiaqi Liu, Kai Wu, Qiang Nie, Ying Chen, Bin-Bin Gao, Yong liu, Jinbao Wang, Chengjie Wang, Feng Zheng

Unsupervised Anomaly Detection (UAD) with incremental training is crucial in industrial manufacturing, as unpredictable defects make obtaining sufficient labeled data infeasible.

continual anomaly detection Continual Learning +2

CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation

1 code implementation Findings (ACL) 2021 Chujie Zheng, Yong liu, Wei Chen, Yongcai Leng, Minlie Huang

However, existing methods for empathetic response generation usually either consider only one empathy factor or ignore the hierarchical relationships between different factors, leading to a weak ability of empathy modeling.

Empathetic Response Generation Open-Domain Dialog +1

Guide Local Feature Matching by Overlap Estimation

1 code implementation18 Feb 2022 Ying Chen, Dihe Huang, Shang Xu, Jianlin Liu, Yong liu

Local image feature matching under large appearance, viewpoint, and distance changes is challenging yet important.

Feature Correlation

Sentence-level Prompts Benefit Composed Image Retrieval

1 code implementation9 Oct 2023 Yang Bai, Xinxing Xu, Yong liu, Salman Khan, Fahad Khan, WangMeng Zuo, Rick Siow Mong Goh, Chun-Mei Feng

Composed image retrieval (CIR) is the task of retrieving specific images by using a query that involves both a reference image and a relative caption.

Attribute Composed Image Retrieval (CoIR) +2

Learning Federated Visual Prompt in Null Space for MRI Reconstruction

1 code implementation CVPR 2023 Chun-Mei Feng, Bangjun Li, Xinxing Xu, Yong liu, Huazhu Fu, WangMeng Zuo

Federated Magnetic Resonance Imaging (MRI) reconstruction enables multiple hospitals to collaborate distributedly without aggregating local data, thereby protecting patient privacy.

MRI Reconstruction

Towards Persona-Based Empathetic Conversational Models

1 code implementation EMNLP 2020 Peixiang Zhong, Chen Zhang, Hao Wang, Yong liu, Chunyan Miao

To this end, we propose a new task towards persona-based empathetic conversations and present the first empirical study on the impact of persona on empathetic responding.

Adaptive Assignment for Geometry Aware Local Feature Matching

1 code implementation CVPR 2023 Dihe Huang, Ying Chen, Shang Xu, Yong liu, Wenlong Wu, Yikang Ding, Chengjie Wang, Fan Tang

The detector-free feature matching approaches are currently attracting great attention thanks to their excellent performance.

Feature Correlation

Global Spectral Filter Memory Network for Video Object Segmentation

1 code implementation11 Oct 2022 Yong liu, Ran Yu, Jiahao Wang, Xinyuan Zhao, Yitong Wang, Yansong Tang, Yujiu Yang

Besides, we empirically find low frequency feature should be enhanced in encoder (backbone) while high frequency for decoder (segmentation head).

Attribute Object +4

NeRF-Loc: Visual Localization with Conditional Neural Radiance Field

1 code implementation17 Apr 2023 Jianlin Liu, Qiang Nie, Yong liu, Chengjie Wang

We propose a novel visual re-localization method based on direct matching between the implicit 3D descriptors and the 2D image with transformer.

Neural Rendering Visual Localization

Unsupervised Representation Learning for Time Series: A Review

1 code implementation3 Aug 2023 Qianwen Meng, Hangwei Qian, Yong liu, Yonghui Xu, Zhiqi Shen, Lizhen Cui

However, there is a lack of systematic analysis of unsupervised representation learning approaches for time series.

Contrastive Learning Representation Learning +1

Can Large Language Models Empower Molecular Property Prediction?

1 code implementation14 Jul 2023 Chen Qian, Huayi Tang, Zhirui Yang, Hong Liang, Yong liu

Molecular property prediction has gained significant attention due to its transformative potential in multiple scientific disciplines.

Molecular Property Prediction Property Prediction

SelfCF: A Simple Framework for Self-supervised Collaborative Filtering

2 code implementations7 Jul 2021 Xin Zhou, Aixin Sun, Yong liu, Jie Zhang, Chunyan Miao

Collaborative filtering (CF) is widely used to learn informative latent representations of users and items from observed interactions.

Collaborative Filtering Self-Supervised Learning

E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context

1 code implementation17 Jul 2022 Zizhang Li, Mengmeng Wang, Huaijin Pi, Kechun Xu, Jianbiao Mei, Yong liu

However, the redundant parameters within the network structure can cause a large model size when scaling up for desirable performance.

Video Reconstruction

Layer-refined Graph Convolutional Networks for Recommendation

1 code implementation22 Jul 2022 Xin Zhou, Donghui Lin, Yong liu, Chunyan Miao

Specifically, these models usually aggregate all layer embeddings for node updating and achieve their best recommendation performance within a few layers because of over-smoothing.

Extended Feature Pyramid Network for Small Object Detection

1 code implementation16 Mar 2020 Chunfang Deng, Mengmeng Wang, Liang Liu, Yong liu

Small object detection remains an unsolved challenge because it is hard to extract information of small objects with only a few pixels.

Object object-detection +1

Omni-frequency Channel-selection Representations for Unsupervised Anomaly Detection

1 code implementation1 Mar 2022 Yufei Liang, Jiangning Zhang, Shiwei Zhao, Runze Wu, Yong liu, Shuwen Pan

Density-based and classification-based methods have ruled unsupervised anomaly detection in recent years, while reconstruction-based methods are rarely mentioned for the poor reconstruction ability and low performance.

Unsupervised Anomaly Detection

MaxQ: Multi-Axis Query for N:M Sparsity Network

1 code implementation12 Dec 2023 Jingyang Xiang, Siqi Li, JunHao Chen, Zhuangzhi Chen, Tianxin Huang, Linpeng Peng, Yong liu

Meanwhile, a sparsity strategy that gradually increases the percentage of N:M weight blocks is applied, which allows the network to heal from the pruning-induced damage progressively.

Image Classification Instance Segmentation +3

TransVOS: Video Object Segmentation with Transformers

1 code implementation1 Jun 2021 Jianbiao Mei, Mengmeng Wang, Yeneng Lin, Yi Yuan, Yong liu

Recently, Space-Time Memory Network (STM) based methods have achieved state-of-the-art performance in semi-supervised video object segmentation (VOS).

Object One-shot visual object segmentation +3

EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm

1 code implementation19 Jun 2022 Jiangning Zhang, Xiangtai Li, Yabiao Wang, Chengjie Wang, Yibo Yang, Yong liu, DaCheng Tao

Motivated by biological evolution, this paper explains the rationality of Vision Transformer by analogy with the proven practical Evolutionary Algorithm (EA) and derives that both have consistent mathematical formulation.

Image Classification

SoftPatch: Unsupervised Anomaly Detection with Noisy Data

1 code implementation NeurIPS 2022 Xi Jiang, Ying Chen, Qiang Nie, Yong liu, Jianlin Liu, Bin-Bin Gao, Jun Liu, Chengjie Wang, Feng Zheng

Noise discriminators are utilized to generate outlier scores for patch-level noise elimination before coreset construction.

Unsupervised Anomaly Detection

Go Wider Instead of Deeper

1 code implementation25 Jul 2021 Fuzhao Xue, Ziji Shi, Futao Wei, Yuxuan Lou, Yong liu, Yang You

To achieve better performance with fewer trainable parameters, recent methods are proposed to go shallower by parameter sharing or model compressing along with the depth.

Image Classification

Towards Efficient and Scalable Sharpness-Aware Minimization

2 code implementations CVPR 2022 Yong liu, Siqi Mai, Xiangning Chen, Cho-Jui Hsieh, Yang You

Recently, Sharpness-Aware Minimization (SAM), which connects the geometry of the loss landscape and generalization, has demonstrated significant performance boosts on training large-scale models such as vision transformers.

SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation

1 code implementation NeurIPS 2023 Zhuoyan Luo, Yicheng Xiao, Yong liu, Shuyan Li, Yitong Wang, Yansong Tang, Xiu Li, Yujiu Yang

To address this issue, we propose Semantic-assisted Object Cluster (SOC), which aggregates video content and textual guidance for unified temporal modeling and cross-modal alignment.

Ranked #2 on Referring Expression Segmentation on A2D Sentences (using extra training data)

Object Referring Expression Segmentation +4

MHCCL: Masked Hierarchical Cluster-Wise Contrastive Learning for Multivariate Time Series

1 code implementation2 Dec 2022 Qianwen Meng, Hangwei Qian, Yong liu, Lizhen Cui, Yonghui Xu, Zhiqi Shen

Learning semantic-rich representations from raw unlabeled time series data is critical for downstream tasks such as classification and forecasting.

Clustering Contrastive Learning +3

SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion

1 code implementation27 Jun 2023 Jianbiao Mei, Yu Yang, Mengmeng Wang, Tianxin Huang, Xuemeng Yang, Yong liu

However, how to effectively exploit the relationships between the semantic context in semantic segmentation and geometric structure in scene completion remains under exploration.

Autonomous Driving Scene Understanding +1

Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds

1 code implementation23 Sep 2021 Xuemeng Yang, Hao Zou, Xin Kong, Tianxin Huang, Yong liu, Wanlong Li, Feng Wen, Hongbo Zhang

Specifically, the network takes a raw point cloud as input, and merges the features from the segmentation branch into the completion branch hierarchically to provide semantic information.

3D Semantic Scene Completion 3D Semantic Segmentation +3

Adaptive Multi-Modalities Fusion in Sequential Recommendation Systems

1 code implementation30 Aug 2023 Hengchang Hu, Wei Guo, Yong liu, Min-Yen Kan

We propose a graph-based approach (named MMSR) to fuse modality features in an adaptive order, enabling each modality to prioritize either its inherent sequential nature or its interplay with other modalities.

Sequential Recommendation

PANet: LiDAR Panoptic Segmentation with Sparse Instance Proposal and Aggregation

1 code implementation27 Jun 2023 Jianbiao Mei, Yu Yang, Mengmeng Wang, Xiaojun Hou, Laijian Li, Yong liu

Firstly, we propose a non-learning Sparse Instance Proposal (SIP) module with the ``sampling-shifting-grouping" scheme to directly group thing points into instances from the raw point cloud efficiently.

Autonomous Driving Instance Segmentation +2

Unfolding Once is Enough: A Deployment-Friendly Transformer Unit for Super-Resolution

1 code implementation5 Aug 2023 Yong liu, Hang Dong, Boyang Liang, Songwei Liu, Qingji Dong, Kai Chen, Fangmin Chen, Lean Fu, Fei Wang

Since the high resolution of intermediate features in SISR models increases memory and computational requirements, efficient SISR transformers are more favored.

Image Super-Resolution

SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading Acceleration

1 code implementation10 Oct 2023 Jingyang Xiang, Siqi Li, Jun Chen, Shipeng Bai, Yukai Ma, Guang Dai, Yong liu

To overcome them, this paper proposes a novel \emph{\textbf{S}oft \textbf{U}niform \textbf{B}lock \textbf{P}runing} (SUBP) approach to train a uniform 1$\times$N sparse structured network from scratch.

GPT-4V-AD: Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection

1 code implementation5 Nov 2023 Jiangning Zhang, Haoyang He, Xuhai Chen, Zhucun Xue, Yabiao Wang, Chengjie Wang, Lei Xie, Yong liu

Large Multimodal Model (LMM) GPT-4V(ision) endows GPT-4 with visual grounding capabilities, making it possible to handle certain tasks through the Visual Question Answering (VQA) paradigm.

Anomaly Detection Question Answering +3

Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models

1 code implementation29 Feb 2024 Chen Qian, Jie Zhang, Wei Yao, Dongrui Liu, Zhenfei Yin, Yu Qiao, Yong liu, Jing Shao

This research provides an initial exploration of trustworthiness modeling during LLM pre-training, seeking to unveil new insights and spur further developments in the field.

Fairness Mutual Information Estimation

Camera-based 3D Semantic Scene Completion with Sparse Guidance Network

1 code implementation10 Dec 2023 Jianbiao Mei, Yu Yang, Mengmeng Wang, Junyu Zhu, Xiangrui Zhao, Jongwon Ra, Laijian Li, Yong liu

Semantic scene completion (SSC) aims to predict the semantic occupancy of each voxel in the entire 3D scene from limited observations, which is an emerging and critical task for autonomous driving.

3D Semantic Scene Completion Autonomous Driving

TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On

1 code implementation1 Apr 2024 Jiazheng Xing, Chao Xu, Yijie Qian, Yang Liu, Guang Dai, Baigui Sun, Yong liu, Jingdong Wang

However, the clothing identity uncontrollability and training inefficiency of existing diffusion-based methods, which struggle to maintain the identity even with full parameter training, are significant limitations that hinder the widespread applications.

Virtual Try-on

Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model

1 code implementation NeurIPS 2021 Jiangning Zhang, Chao Xu, Jian Li, Wenzhou Chen, Yabiao Wang, Ying Tai, Shuo Chen, Chengjie Wang, Feiyue Huang, Yong liu

Inspired by biological evolution, we explain the rationality of Vision Transformer by analogy with the proven practical Evolutionary Algorithm (EA) and derive that both of them have consistent mathematical representation.

Image Retrieval Retrieval

Learning Prompt with Distribution-Based Feature Replay for Few-Shot Class-Incremental Learning

1 code implementation3 Jan 2024 Zitong Huang, Ze Chen, Zhixing Chen, Erjin Zhou, Xinxing Xu, Rick Siow Mong Goh, Yong liu, WangMeng Zuo, ChunMei Feng

When progressing to a new session, pseudo-features are sampled from old-class distributions combined with training images of the current session to optimize the prompt, thus enabling the model to learn new knowledge while retaining old knowledge.

Few-Shot Class-Incremental Learning Incremental Learning +1

Very Long Term Field of View Prediction for 360-degree Video Streaming

1 code implementation4 Feb 2019 Chenge Li, Weixi Zhang, Yong liu, Yao Wang

In this work, we treat the FoV prediction as a sequence learning problem, and propose to predict the target user's future FoV not only based on the user's own past FoV center trajectory but also other users' future FoV locations.

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation

1 code implementation3 Jan 2023 Yue Han, Jiangning Zhang, Zhucun Xue, Chao Xu, Xintian Shen, Yabiao Wang, Chengjie Wang, Yong liu, Xiangtai Li

In this work, we explore a simple yet unified solution for FSIS as well as its incremental variants, and introduce a new framework named Reference Twice (RefT) to fully explore the relationship between support/query features based on a Transformer-like framework.

Benchmarking Few-Shot Object Detection +3

RIDERS: Radar-Infrared Depth Estimation for Robust Sensing

1 code implementation3 Feb 2024 Han Li, Yukai Ma, Yuehao Huang, Yaqing Gu, Weihua Xu, Yong liu, Xingxing Zuo

Dense depth recovery is crucial in autonomous driving, serving as a foundational element for obstacle avoidance, 3D object detection, and local path planning.

3D Object Detection Autonomous Driving +3

AutoTimes: Autoregressive Time Series Forecasters via Large Language Models

1 code implementation4 Feb 2024 Yong liu, Guo Qin, Xiangdong Huang, Jianmin Wang, Mingsheng Long

Foundation models of time series have not been fully developed due to the limited availability of large-scale time series and the underexploration of scalable pre-training.

In-Context Learning Language Modelling +1

Joint Learning Content and Degradation Aware Feature for Blind Super-Resolution

1 code implementation29 Aug 2022 Yifeng Zhou, Chuming Lin, Donghao Luo, Yong liu, Ying Tai, Chengjie Wang, Mingang Chen

Although some Unsupervised Degradation Prediction (UDP) methods are proposed to bypass this problem, the \textit{inconsistency} between degradation embedding and SR feature is still challenging.

Blind Super-Resolution Image Super-Resolution +1

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning

1 code implementation19 Dec 2023 Yanqi Ge, Qiang Nie, Ye Huang, Yong liu, Chengjie Wang, Feng Zheng, Wen Li, Lixin Duan

By pulling the learned features to these semantic anchors, several advantages can be attained: 1) the intra-class compactness and naturally inter-class separability, 2) induced bias or errors from feature learning can be avoided, and 3) robustness to the long-tailed problem.

Disentanglement

Keyword-Guided Neural Conversational Model

1 code implementation15 Dec 2020 Peixiang Zhong, Yong liu, Hao Wang, Chunyan Miao

We study the problem of imposing conversational goals/keywords on open-domain conversational agents, where the agent is required to lead the conversation to a target keyword smoothly and fast.

Knowledge Graphs Retrieval +1

A Unified BEV Model for Joint Learning of 3D Local Features and Overlap Estimation

1 code implementation28 Feb 2023 Lin Li, Wendong Ding, Yongkun Wen, Yufei Liang, Yong liu, Guowei Wan

For overlap detection, a cross-attention module is applied for interacting contextual information of input point clouds, followed by a classification head to estimate the overlapping region.

Point Cloud Registration

Global Knowledge Calibration for Fast Open-Vocabulary Segmentation

1 code implementation ICCV 2023 Kunyang Han, Yong liu, Jun Hao Liew, Henghui Ding, Yunchao Wei, Jiajun Liu, Yitong Wang, Yansong Tang, Yujiu Yang, Jiashi Feng, Yao Zhao

Recent advancements in pre-trained vision-language models, such as CLIP, have enabled the segmentation of arbitrary concepts solely from textual inputs, a process commonly referred to as open-vocabulary semantic segmentation (OVS).

Knowledge Distillation Open Vocabulary Semantic Segmentation +4

1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation

1 code implementation1 Jan 2024 Zhuoyan Luo, Yicheng Xiao, Yong liu, Yitong Wang, Yansong Tang, Xiu Li, Yujiu Yang

The recent transformer-based models have dominated the Referring Video Object Segmentation (RVOS) task due to the superior performance.

Object Referring Video Object Segmentation +3

Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization

1 code implementation23 Feb 2024 Zirui Zhu, Yong liu, Zangwei Zheng, Huifeng Guo, Yang You

We explore the typical data characteristics and optimization statistics of CTR prediction, revealing a strong positive correlation between the top hessian eigenvalue and feature frequency.

Click-Through Rate Prediction

DLPAlign: A Deep Learning based Progressive Alignment Method for Multiple Protein Sequences

1 code implementation21 Nov 2020 Mengmeng Kuang, Yong liu, Lufei Gao

This paper proposed a novel and straightforward approach to improve the accuracy of progressive multiple protein sequence alignment method.

Decision Making Multiple Sequence Alignment +1

Timer: Transformers for Time Series Analysis at Scale

1 code implementation4 Feb 2024 Yong liu, Haoran Zhang, Chenyu Li, Xiangdong Huang, Jianmin Wang, Mingsheng Long

Continuous progresses have been achieved as the emergence of large language models, exhibiting unprecedented ability in few-shot generalization, scalability, and task generality, which is however absent in time series models.

Anomaly Detection Imputation +2

Robust photon-efficient imaging using a pixel-wise residual shrinkage network

2 code implementations5 Jan 2022 Gongxin Yao, YiWei Chen, Yong liu, Xiaomin Hu, Yu Pan

Single-photon light detection and ranging (LiDAR) has been widely applied to 3D imaging in challenging scenarios.

Depth Estimation

Inductive Graph Transformer for Delivery Time Estimation

1 code implementation5 Nov 2022 Xin Zhou, Jinglong Wang, Yong liu, Xingyu Wu, Zhiqi Shen, Cyril Leung

Providing accurate estimated time of package delivery on users' purchasing pages for e-commerce platforms is of great importance to their purchasing decisions and post-purchase experiences.

APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation

1 code implementation6 Nov 2023 Mingjia Yin, Hao Wang, Xiang Xu, Likang Wu, Sirui Zhao, Wei Guo, Yong liu, Ruiming Tang, Defu Lian, Enhong Chen

To this end, we propose a graph-driven framework, named Adaptive and Personalized Graph Learning for Sequential Recommendation (APGL4SR), that incorporates adaptive and personalized global collaborative information into sequential recommendation systems.

Graph Learning Multi-Task Learning +1

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering

1 code implementation19 Dec 2023 Chun-Mei Feng, Yang Bai, Tao Luo, Zhen Li, Salman Khan, WangMeng Zuo, Xinxing Xu, Rick Siow Mong Goh, Yong liu

By feeding the retrieved image and question to the VQA model, one can find the images inconsistent with relative caption when the answer by VQA is inconsistent with the answer in the QA pair.

Image Retrieval Question Answering +2

Variational Quantum Circuits for Quantum State Tomography

1 code implementation16 Dec 2019 Yong Liu, Dongyang Wang, Shichuan Xue, Anqi Huang, Xiang Fu, Xiaogang Qiang, Ping Xu, He-Liang Huang, Mingtang Deng, Chu Guo, Xuejun Yang, Junjie Wu

We demonstrate our method by performing numerical simulations for the tomography of the ground state of a one-dimensional quantum spin chain, using a variational quantum circuit simulator.

Quantum Machine Learning Quantum State Tomography

PD-GAN: Adversarial Learning for Personalized Diversity-Promoting Recommendation

1 code implementation IJCAI 2019 Qiong Wu, Yong liu, Chunyan Miao, Binqiang Zhao, Yin Zhao, Lu Guan

This paper proposes Personalized Diversity-promoting GAN (PD-GAN), a novel recommendation model to generate diverse, yet relevant recommendations.

Recommendation Systems

Fair Scratch Tickets: Finding Fair Sparse Networks Without Weight Training

1 code implementation CVPR 2023 Pengwei Tang, Wei Yao, Zhicong Li, Yong liu

We randomly initialize a dense neural network and find appropriate binary masks for the weights to obtain fair sparse subnetworks without any weight training.

Fairness

RadarCam-Depth: Radar-Camera Fusion for Depth Estimation with Learned Metric Scale

1 code implementation9 Jan 2024 Han Li, Yukai Ma, Yaqing Gu, Kewei Hu, Yong liu, Xingxing Zuo

To circumvent this issue, we learn to augment versatile and robust monocular depth prediction with the dense metric scale induced from sparse and noisy Radar data.

Depth Estimation Depth Prediction

Automated Spectral Kernel Learning

1 code implementation11 Sep 2019 Jian Li, Yong liu, Weiping Wang

The generalization performance of kernel methods is largely determined by the kernel, but common kernels are stationary thus input-independent and output-independent, that limits their applications on complicated tasks.

Automatic Data Augmentation by Learning the Deterministic Policy

1 code implementation18 Oct 2019 Yinghuan Shi, Tiexin Qin, Yong liu, Jiwen Lu, Yang Gao, Dinggang Shen

By introducing an unified optimization goal, DeepAugNet intends to combine the data augmentation and the deep model training in an end-to-end training manner which is realized by simultaneously training a hybrid architecture of dueling deep Q-learning algorithm and a surrogate deep model.

Data Augmentation Q-Learning

Semi-Supervised Learning for Visual Bird's Eye View Semantic Segmentation

1 code implementation28 Aug 2023 Junyu Zhu, Lina Liu, Yu Tang, Feng Wen, Wanlong Li, Yong liu

In this paper, we present a novel semi-supervised framework for visual BEV semantic segmentation to boost performance by exploiting unlabeled images during the training.

Autonomous Vehicles Bird's-Eye View Semantic Segmentation +2

A Learning Framework for n-bit Quantized Neural Networks toward FPGAs

1 code implementation6 Apr 2020 Jun Chen, Liang Liu, Yong liu, Xianfang Zeng

Furthermore, we also design a shift vector processing element (SVPE) array to replace all 16-bit multiplications with SHIFT operations in convolution operation on FPGAs.

Inductive Representation Learning in Temporal Networks via Mining Neighborhood and Community Influences

1 code implementation1 Oct 2021 Meng Liu, Yong liu

Therefore, we propose a new inductive network representation learning method called MNCI by mining neighborhood and community influences in temporal networks.

Link Prediction Node Classification +1

Ridgeless Regression with Random Features

1 code implementation1 May 2022 Jian Li, Yong liu, Yingying Zhang

Recent theoretical studies illustrated that kernel ridgeless regression can guarantee good generalization ability without an explicit regularization.

regression

Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning

1 code implementation14 Feb 2023 Shanqi Liu, Yujing Hu, Runze Wu, Dong Xing, Yu Xiong, Changjie Fan, Kun Kuang, Yong liu

We first illustrate that the proposed value decomposition can consider the complicated interactions among agents and is feasible to learn in large-scale scenarios.

Multi-agent Reinforcement Learning

Parameter-Efficient Conversational Recommender System as a Language Processing Task

1 code implementation25 Jan 2024 Mathieu Ravaut, Hao Zhang, Lu Xu, Aixin Sun, Yong liu

Conversational recommender systems (CRS) aim to recommend relevant items to users by eliciting user preference through natural language conversation.

Dialogue Generation Knowledge Graphs +2

Towards Sharp Analysis for Distributed Learning with Random Features

1 code implementation7 Jun 2019 Jian Li, Yong liu, Weiping Wang

In this paper, using refined proof techniques, we first extend the optimal rates for distributed learning with random features to the non-attainable case.

Deep Domain Adversarial Adaptation for Photon-efficient Imaging

2 code implementations7 Jan 2022 YiWei Chen, Gongxin Yao, Yong liu, Hongye Su, Xiaomin Hu, Yu Pan

Photon-efficient imaging with the single-photon light detection and ranging (LiDAR) captures the three-dimensional (3D) structure of a scene by only a few detected signal photons per pixel.

Domain Adaptation

ViG-UNet: Vision Graph Neural Networks for Medical Image Segmentation

1 code implementation8 Jun 2023 Juntao Jiang, Xiyu Chen, Guanzhong Tian, Yong liu

Deep neural networks have been widely used in medical image analysis and medical image segmentation is one of the most important tasks.

Image Segmentation Medical Image Segmentation +2

Feature Lenses: Plug-and-play Neural Modules for Transformation-Invariant Visual Representations

1 code implementation12 Apr 2020 Shaohua Li, Xiuchao Sui, Jie Fu, Yong liu, Rick Siow Mong Goh

To make CNNs more invariant to transformations, we propose "Feature Lenses", a set of ad-hoc modules that can be easily plugged into a trained model (referred to as the "host model").

Understanding Fairness Surrogate Functions in Algorithmic Fairness

1 code implementation17 Oct 2023 Wei Yao, Zhanke Zhou, Zhicong Li, Bo Han, Yong liu

To mitigate such bias while achieving comparable accuracy, a promising approach is to introduce surrogate functions of the concerned fairness definition and solve a constrained optimization problem.

Fairness

Nighttime Thermal Infrared Image Colorization with Feedback-based Object Appearance Learning

1 code implementation24 Oct 2023 Fu-Ya Luo, Shu-Lin Liu, Yi-Jun Cao, Kai-Fu Yang, Chang-Yong Xie, Yong liu, Yong-Jie Li

Extensive experiments illustrate that the proposed FoalGAN is not only effective for appearance learning of small objects, but also outperforms other image translation methods in terms of semantic preservation and edge consistency for the NTIR2DC task.

Colorization Generative Adversarial Network +2

Learnable Chamfer Distance for Point Cloud Reconstruction

1 code implementation27 Dec 2023 Tianxin Huang, Qingyao Liu, Xiangrui Zhao, Jun Chen, Yong liu

As point clouds are 3D signals with permutation invariance, most existing works train their reconstruction networks by measuring shape differences with the average point-to-point distance between point clouds matched with predefined rules.

Point cloud reconstruction

LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking

1 code implementation7 Mar 2024 Jialin Li, Qiang Nie, WeiFu Fu, Yuhuan Lin, Guangpin Tao, Yong liu, Chengjie Wang

Deep learning models, particularly those based on transformers, often employ numerous stacked structures, which possess identical architectures and perform similar functions.

Robust Visual SLAM with Point and Line Features

no code implementations23 Nov 2017 Xingxing Zuo, Xiaojia Xie, Yong liu, Guoquan Huang

In this paper, we develop a robust efficient visual SLAM system that utilizes heterogeneous point and line features.

Stereo Matching Stereo Matching Hand

Robust Cost-Sensitive Learning for Recommendation with Implicit Feedback

no code implementations3 Jul 2017 Peng Yang, Peilin Zhao, Xin Gao, Yong liu

Morever, the proposed algorithm can be scaled up to large-sized datasets after a relaxation.

Large Margin Object Tracking with Circulant Feature Maps

no code implementations CVPR 2017 Mengmeng Wang, Yong liu, Zeyi Huang

Structured output support vector machine (SVM) based tracking algorithms have shown favorable performance recently.

Object Object Tracking

Real-time 3D Human Tracking for Mobile Robots with Multisensors

no code implementations15 Mar 2017 Mengmeng Wang, Daobilige Su, Lei Shi, Yong liu, Jaime Valls Miro

An ultrasonic sensor array is employed to provide the range information from the target person to the robot and Gaussian Process Regression is used for partial location estimation (2-D).

Sensor Fusion Visual Tracking

RISAS: A Novel Rotation, Illumination, Scale Invariant Appearance and Shape Feature

no code implementations14 Mar 2016 Kanzhi Wu, Xiaoyang Li, Ravindra Ranasinghe, Gamini Dissanayake, Yong liu

This paper presents a novel appearance and shape feature, RISAS, which is robust to viewpoint, illumination, scale and rotation variations.

Robust Object Tracking with a Hierarchical Ensemble Framework

no code implementations23 Sep 2015 Mengmeng Wang, Yong liu

A discriminative model which accounts for the matching degree of local patches is adopted via a bottom ensemble layer, and a generative model which exploits holistic templates is used to search for the object through the middle ensemble layer as well as an adaptive Kalman filter.

Object Object Tracking

Simple and Efficient Learning using Privileged Information

no code implementations6 Apr 2016 Xinxing Xu, Joey Tianyi Zhou, IvorW. Tsang, Zheng Qin, Rick Siow Mong Goh, Yong liu

The Support Vector Machine using Privileged Information (SVM+) has been proposed to train a classifier to utilize the additional privileged information that is only available in the training phase but not available in the test phase.

Image Categorization

Understand Scene Categories by Objects: A Semantic Regularized Scene Classifier Using Convolutional Neural Networks

no code implementations22 Sep 2015 Yiyi Liao, Sarath Kodagoda, Yue Wang, Lei Shi, Yong liu

As scene images have larger diversity than the iconic object images, it is more challenging for deep learning methods to automatically learn features from scene images with less samples.

Classification General Classification +4

Noise Robust IOA/CAS Speech Separation and Recognition System For The Third 'CHIME' Challenge

no code implementations21 Sep 2015 Xiaofei Wang, Chao Wu, Pengyuan Zhang, Ziteng Wang, Yong liu, Xu Li, Qiang Fu, Yonghong Yan

This paper presents the contribution to the third 'CHiME' speech separation and recognition challenge including both front-end signal processing and back-end speech recognition.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Place classification with a graph regularized deep neural network model

no code implementations12 Jun 2015 Yiyi Liao, Sarath Kodagoda, Yue Wang, Lei Shi, Yong liu

Furthermore, results show that the features automatically learned from the raw input range data can achieve competitive results to the features constructed based on statistical and geometrical information.

Classification General Classification

Image Representation Learning Using Graph Regularized Auto-Encoders

no code implementations3 Dec 2013 Yiyi Liao, Yue Wang, Yong liu

We consider the problem of image representation for the tasks of unsupervised learning and semi-supervised learning.

Clustering Image Clustering +1

SL$^2$MF: Predicting Synthetic Lethality in Human Cancers via Logistic Matrix Factorization

no code implementations20 Oct 2018 Yong Liu, Min Wu, Chenghao Liu, Xiao-Li Li, Jie Zheng

Moreover, we also incorporate biological knowledge about genes from protein-protein interaction (PPI) data and Gene Ontology (GO).

Adaptive Re-ranking of Deep Feature for Person Re-identification

no code implementations21 Nov 2018 Yong Liu, Lin Shang, Andy Song

First, we propose a Deep Feature Fusion (DFF) method to exploit the diverse information embedded in a deep feature.

Person Re-Identification Re-Ranking +1

Dynamic Spatio-temporal Graph-based CNNs for Traffic Prediction

no code implementations5 Dec 2018 Ken Chen, Fei Chen, Baisheng Lai, Zhongming Jin, Yong liu, Kai Li, Long Wei, Pengfei Wang, Yandong Tang, Jianqiang Huang, Xian-Sheng Hua

To capture the graph dynamics, we use the graph prediction stream to predict the dynamic graph structures, and the predicted structures are fed into the flow prediction stream.

Traffic Prediction

Constructing Ontology-Based Cancer Treatment Decision Support System with Case-Based Reasoning

no code implementations5 Dec 2018 Ying Shen, Joël Colloc, Armelle Jacquet-Andrieu, Ziyi Guo, Yong liu

Disease Ontology (DO) that pertains to cancer's clinical stages and their corresponding information components is utilized to improve the reasoning ability of a decision support system (DSS).

General Classification

A Knowledge Graph Based Solution for Entity Discovery and Linking in Open-Domain Questions

no code implementations5 Dec 2018 Kai Lei, Bing Zhang, Yong liu, Yang Deng, Dongyu Zhang, Ying Shen

In Question Entity Discovery and Linking (QEDL) problem, traditional methods are challenged because multiple entities in one short question are difficult to be discovered entirely and the incomplete information in short text makes entity linking hard to implement.

Entity Linking Learning-To-Rank +5

Approach for Semi-automatic Construction of Anti-infective Drug Ontology Based on Entity Linking

no code implementations5 Dec 2018 Ying Shen, Yang Deng, Kaiqi Yuan, Li Liu, Yong liu

Experiments show that our selected features have achieved a precision rate of 86. 77%, a recall rate of 89. 03% and an F1 score of 87. 89%.

Entity Linking

Max-Diversity Distributed Learning: Theory and Algorithms

no code implementations19 Dec 2018 Yong Liu, Jian Li, Weiping Wang

We study the risk performance of distributed learning for the regularization empirical risk minimization with fast convergence rate, substantially improving the error analysis of the existing divide-and-conquer based distributed learning.

Learning Theory

Multi-Class Learning: From Theory to Algorithm

no code implementations NeurIPS 2018 Jian Li, Yong liu, Rong Yin, Hua Zhang, Lizhong Ding, Weiping Wang

In this paper, we study the generalization performance of multi-class classification and obtain a shaper data-dependent generalization error bound with fast convergence rate, substantially improving the state-of-art bounds in the existing data-dependent generalization analysis.

Classification General Classification +1

Efficient Cross-Validation for Semi-Supervised Learning

no code implementations13 Feb 2019 Yong Liu, Jian Li, Guangjun Wu, Lizhong Ding, Weiping Wang

In this paper, we provide a method to approximate the CV for manifold regularization based on a notion of robust statistics, called Bouligand influence function (BIF).

Model Selection

Multi-Scale Quasi-RNN for Next Item Recommendation

no code implementations26 Feb 2019 Chaoyue He, Yong liu, Qingyu Guo, Chunyan Miao

To this end, architectural inductive biases such as Markov-Chains, Recurrent models, Convolutional networks and many others have demonstrated reasonable success on this task.

Recommendation Systems

Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation

no code implementations19 Mar 2019 Yong Liu, Yinan Zhang, Qiong Wu, Chunyan Miao, Lizhen Cui, Binqiang Zhao, Yin Zhao, Lu Guan

Interactive recommendation that models the explicit interactions between users and the recommender system has attracted a lot of research attentions in recent years.

Recommendation Systems reinforcement-learning +1

An Evaluation of Transfer Learning for Classifying Sales Engagement Emails at Large Scale

no code implementations19 Apr 2019 Yong Liu, Pavel Dmitriev, Yifei HUANG, Andrew Brooks, Li Dong

Our results show that fine-tuning of the BERT model outperforms with as few as 300 labeled samples, but underperforms with fewer than 300 labeled samples, relative to all the feature-based approaches using different embeddings.

Language Modelling Transfer Learning

Recent Advances in Diversified Recommendation

no code implementations16 May 2019 Qiong Wu, Yong liu, Chunyan Miao, Yin Zhao, Lu Guan, Haihong Tang

With the rapid development of recommender systems, accuracy is no longer the only golden criterion for evaluating whether the recommendation results are satisfying or not.

Recommendation Systems

Audio2Face: Generating Speech/Face Animation from Single Audio with Attention-Based Bidirectional LSTM Networks

no code implementations27 May 2019 Guanzhong Tian, Yi Yuan, Yong liu

We propose an end to end deep learning approach for generating real-time facial animation from just audio.

PoseConvGRU: A Monocular Approach for Visual Ego-motion Estimation by Learning

no code implementations19 Jun 2019 Guangyao Zhai, Liang Liu, Linjian Zhang, Yong liu

The feature-encoding module encodes the short-term motion feature in an image pair, while the memory-propagating module captures the long-term motion feature in the consecutive image pairs.

Camera Calibration Motion Estimation +2

Bandit Learning for Diversified Interactive Recommendation

no code implementations1 Jul 2019 Yong Liu, Yingtai Xiao, Qiong Wu, Chunyan Miao, Juyong Zhang

Interactive recommender systems that enable the interactions between users and the recommender system have attracted increasing research attentions.

Bayesian Inference Recommendation Systems +1

Multi-Instance Multi-Scale CNN for Medical Image Classification

no code implementations4 Jul 2019 Shaohua Li, Yong liu, Xiuchao Sui, Cheng Chen, Gabriel Tjio, Daniel Shu Wei Ting, Rick Siow Mong Goh

Deep learning for medical image classification faces three major challenges: 1) the number of annotated medical images for training are usually small; 2) regions of interest (ROIs) are relatively small with unclear boundaries in the whole medical images, and may appear in arbitrary positions across the x, y (and also z in 3D images) dimensions.

General Classification Image Classification +2

Face-to-Parameter Translation for Game Character Auto-Creation

no code implementations ICCV 2019 Tianyang Shi, Yi Yuan, Changjie Fan, Zhengxia Zou, Zhenwei Shi, Yong liu

Character customization system is an important component in Role-Playing Games (RPGs), where players are allowed to edit the facial appearance of their in-game characters with their own preferences rather than using default templates.

Style Transfer Translation

From Few to More: Large-scale Dynamic Multiagent Curriculum Learning

no code implementations6 Sep 2019 Weixun Wang, Tianpei Yang, Yong liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao

In this paper, we design a novel Dynamic Multiagent Curriculum Learning (DyMA-CL) to solve large-scale problems by starting from learning on a multiagent scenario with a small size and progressively increasing the number of agents.

Pose Estimation for Ground Robots: On Manifold Representation, Integration, Re-Parameterization, and Optimization

no code implementations8 Sep 2019 Mingming Zhang, Xingxing Zuo, Yiming Chen, Yong liu, Mingyang Li

In this paper, we focus on motion estimation dedicated for non-holonomic ground robots, by probabilistically fusing measurements from the wheel odometer and exteroceptive sensors.

6D Pose Estimation Motion Estimation

Semi-supervised Vector-valued Learning: Improved Bounds and Algorithms

1 code implementation11 Sep 2019 Jian Li, Yong liu, Weiping Wang

Vector-valued learning, where the output space admits a vector-valued structure, is an important problem that covers a broad family of important domains, e. g. multi-task learning and transfer learning.

Multi-class Classification Multi-Label Learning +1

Weighted Distributed Differential Privacy ERM: Convex and Non-convex

no code implementations23 Oct 2019 Yilin Kang, Yong liu, Weiping Wang

By detailed theoretical analysis, we show that in distributed setting, the noise bound and the excess empirical risk bound can be improved by considering different weights held by multiple parties.

Two Generator Game: Learning to Sample via Linear Goodness-of-Fit Test

no code implementations NeurIPS 2019 Lizhong Ding, Mengyang Yu, Li Liu, Fan Zhu, Yong liu, Yu Li, Ling Shao

DEAN can be interpreted as a GOF game between two generative networks, where one explicit generative network learns an energy-based distribution that fits the real data, and the other implicit generative network is trained by minimizing a GOF test statistic between the energy-based distribution and the generated data, such that the underlying distribution of the generated data is close to the energy-based distribution.

Visual-Inertial Localization for Skid-Steering Robots with Kinematic Constraints

no code implementations13 Nov 2019 Xingxing Zuo, Mingming Zhang, Yiming Chen, Yong liu, Guoquan Huang, Mingyang Li

While visual localization or SLAM has witnessed great progress in past decades, when deploying it on a mobile robot in practice, few works have explicitly considered the kinematic (or dynamic) constraints of the real robotic system when designing state estimators.

Visual Localization

Multi-Agent Game Abstraction via Graph Attention Neural Network

no code implementations25 Nov 2019 Yong Liu, Weixun Wang, Yujing Hu, Jianye Hao, Xingguo Chen, Yang Gao

Traditional methods attempt to use pre-defined rules to capture the interaction relationship between agents.

Graph Attention Multi-agent Reinforcement Learning

Efficient Robotic Task Generalization Using Deep Model Fusion Reinforcement Learning

no code implementations11 Dec 2019 Tianying Wang, Hao Zhang, Wei Qi Toh, Hongyuan Zhu, Cheston Tan, Yan Wu, Yong liu, Wei Jing

The proposed method is able to efficiently generalize the previously learned task by model fusion to solve the environment adaptation problem.

reinforcement-learning Reinforcement Learning (RL)

RoboCoDraw: Robotic Avatar Drawing with GAN-based Style Transfer and Time-efficient Path Optimization

no code implementations11 Dec 2019 Tianying Wang, Wei Qi Toh, Hao Zhang, Xiuchao Sui, Shaohua Li, Yong liu, Wei Jing

The proposed RoboCoDraw system takes a real human face image as input, converts it to a stylized avatar, then draws it with a robotic arm.

Robotics Graphics

Data Heterogeneity Differential Privacy: From Theory to Algorithm

no code implementations20 Feb 2020 Yilin Kang, Jian Li, Yong liu, Weiping Wang

Traditionally, the random noise is equally injected when training with different data instances in the field of differential privacy (DP).

BIG-bench Machine Learning

Input Perturbation: A New Paradigm between Central and Local Differential Privacy

1 code implementation20 Feb 2020 Yilin Kang, Yong liu, Ben Niu, Xin-Yi Tong, Likun Zhang, Weiping Wang

By adding noise to the original training data and training with the `perturbed data', we achieve ($\epsilon$,$\delta$)-differential privacy on the final model, along with some kind of privacy on the original data.

Convolutional Spectral Kernel Learning

no code implementations28 Feb 2020 Jian Li, Yong liu, Weiping Wang

Recently, non-stationary spectral kernels have drawn much attention, owing to its powerful feature representation ability in revealing long-range correlations and input-dependent characteristics.

Theoretical Analysis of Divide-and-Conquer ERM: Beyond Square Loss and RKHS

no code implementations9 Mar 2020 Yong Liu, Lizhong Ding, Weiping Wang

However, the studies on learning theory for general loss functions and hypothesis spaces remain limited.

Learning Theory

Nearly Optimal Clustering Risk Bounds for Kernel K-Means

no code implementations9 Mar 2020 Yong Liu, Lizhong Ding, Weiping Wang

In this paper, we study the statistical properties of kernel $k$-means and obtain a nearly optimal excess clustering risk bound, substantially improving the state-of-art bounds in the existing clustering risk analyses.

Clustering

Propagating Asymptotic-Estimated Gradients for Low Bitwidth Quantized Neural Networks

no code implementations4 Mar 2020 Jun Chen, Yong liu, Hao Zhang, Shengnan Hou, Jian Yang

Meanwhile, we propose a M-bit Inputs and N-bit Weights Network (MINW-Net) trained by AQE, a quantized neural network with 1-3 bits weights and activations.

Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose

no code implementations29 Mar 2020 Xianfang Zeng, Yusu Pan, Mengmeng Wang, Jiangning Zhang, Yong liu

On the one hand, we adopt the deforming autoencoder to disentangle identity and pose representations.

Face Reenactment

Learning Hierarchical Review Graph Representations for Recommendation

no code implementations24 Apr 2020 Yong liu, Susen Yang, Yinan Zhang, Chunyan Miao, Zaiqing Nie, Juyong Zhang

Therefore, they may not be effective in capturing the global dependency between words, and tend to be easily biased by noise review information.

Graph Attention

Contextualized Graph Attention Network for Recommendation with Item Knowledge Graph

no code implementations24 Apr 2020 Susen Yang, Yong liu, Yonghui Xu, Chunyan Miao, Min Wu, Juyong Zhang

Graph neural networks (GNN) have recently been applied to exploit knowledge graph (KG) for recommendation.

Graph Attention

Hierarchical and Efficient Learning for Person Re-Identification

no code implementations18 May 2020 Jiangning Zhang, Liang Liu, Chao Xu, Yong liu

Recent works in the person re-identification task mainly focus on the model accuracy while ignore factors related to the efficiency, e. g. model size and latency, which are critical for practical application.

Person Re-Identification

Neural Architecture Optimization with Graph VAE

no code implementations18 Jun 2020 Jian Li, Yong liu, Jiankun Liu, Weiping Wang

The encoder and the decoder belong to a graph VAE, mapping architectures between continuous representations and network architectures.

Computational Efficiency Neural Architecture Search

Dive Deeper Into Box for Object Detection

no code implementations ECCV 2020 Ran Chen, Yong liu, Mengdan Zhang, Shu Liu, Bei Yu, Yu-Wing Tai

Anchor free methods have defined the new frontier in state-of-the-art object detection researches where accurate bounding box estimation is the key to the success of these methods.

Object object-detection +1

LIC-Fusion 2.0: LiDAR-Inertial-Camera Odometry with Sliding-Window Plane-Feature Tracking

no code implementations17 Aug 2020 Xingxing Zuo, Yulin Yang, Patrick Geneva, Jiajun Lv, Yong liu, Guoquan Huang, Marc Pollefeys

Only the tracked planar points belonging to the same plane will be used for plane initialization, which makes the plane extraction efficient and robust.

Robotics

LIC-Fusion: LiDAR-Inertial-Camera Odometry

no code implementations9 Sep 2019 Xingxing Zuo, Patrick Geneva, Woosik Lee, Yong liu, Guoquan Huang

This paper presents a tightly-coupled multi-sensor fusion algorithm termed LiDAR-inertial-camera fusion (LIC-Fusion), which efficiently fuses IMU measurements, sparse visual features, and extracted LiDAR points.

Robotics

Optimizing Quantized Neural Networks with Natural Gradient

no code implementations1 Jan 2021 Jun Chen, Hanwen Chen, Jiangning Zhang, Wenzhou Chen, Yong liu, Yunliang Jiang

Quantized Neural Networks (QNNs) have achieved an enormous step in improving computational efficiency, making it possible to deploy large models to mobile and miniaturized devices.

Computational Efficiency

Effective Distributed Learning with Random Features: Improved Bounds and Algorithms

no code implementations ICLR 2021 Yong liu, Jiankun Liu, Shuqiang Wang

In this paper, we study the statistical properties of distributed kernel ridge regression together with random features (DKRR-RF), and obtain optimal generalization bounds under the basic setting, which can substantially relax the restriction on the number of local machines in the existing state-of-art bounds.

Generalization Bounds

Fast Estimation for Privacy and Utility in Differentially Private Machine Learning

no code implementations1 Jan 2021 Yuzhe Li, Yong liu, Weipinng Wang, Bo Li, Nan Liu

In this paper, we deduce the influence of $\epsilon$ on utility private learning models through strict mathematical derivation, and propose a novel approximate approach for estimating the utility of any $\epsilon$ value.

BIG-bench Machine Learning

Collision-free Trajectory Planning for Autonomous Surface Vehicle

no code implementations20 May 2020 Licheng Wen, Jiaqing Yan, Xuemeng Yang, Yong liu, Yong Gu

We apply a numerical optimization method in the back-end to generate the trajectory.

Robotics

HILONet: Hierarchical Imitation Learning from Non-Aligned Observations

no code implementations5 Nov 2020 Shanqi Liu, Junjie Cao, Wenzhou Chen, Licheng Wen, Yong liu

In this work, we propose a new imitation learning approach called Hierarchical Imitation Learning from Observation(HILONet), which adopts a hierarchical structure to choose feasible sub-goals from demonstrated observations dynamically.

Imitation Learning Position

Fine Perceptive GANs for Brain MR Image Super-Resolution in Wavelet Domain

no code implementations9 Nov 2020 Senrong You, Yong liu, Baiying Lei, Shuqiang Wang

Specifically, FP-GANs firstly divides an MR image into low-frequency global approximation and high-frequency anatomical texture in wavelet domain.

Generative Adversarial Network Image Super-Resolution

FlowMOT: 3D Multi-Object Tracking by Scene Flow Association

no code implementations14 Dec 2020 Guangyao Zhai, Xin Kong, Jinhao Cui, Yong liu, Zhen Yang

Most end-to-end Multi-Object Tracking (MOT) methods face the problems of low accuracy and poor generalization ability.

3D Multi-Object Tracking motion prediction +1

FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Depth Completion

no code implementations15 Dec 2020 Lina Liu, Xibin Song, Xiaoyang Lyu, Junwei Diao, Mengmeng Wang, Yong liu, Liangjun Zhang

Then, a refined depth map is further obtained using a residual learning strategy in the coarse-to-fine stage with a coarse depth map and color image as input.

Depth Completion

Spatial Context-Aware Self-Attention Model For Multi-Organ Segmentation

no code implementations16 Dec 2020 Hao Tang, Xingwei Liu, Kun Han, Shanlin Sun, Narisu Bai, Xuming Chen, Huang Qian, Yong liu, Xiaohui Xie

State-of-the-art CNN segmentation models apply either 2D or 3D convolutions on input images, with pros and cons associated with each method: 2D convolution is fast, less memory-intensive but inadequate for extracting 3D contextual information from volumetric images, while the opposite is true for 3D convolution.

Image Segmentation Organ Segmentation +2

CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth

no code implementations18 Dec 2020 Xingxing Zuo, Nathaniel Merrill, Wei Li, Yong liu, Marc Pollefeys, Guoquan Huang

In this work, we present a lightweight, tightly-coupled deep depth network and visual-inertial odometry (VIO) system, which can provide accurate state estimates and dense depth maps of the immediate surroundings.

Depth Estimation Depth Prediction +1

Cannot find the paper you are looking for? You can Submit a new open access paper.