Search Results for author: Mingli Song

Found 146 papers, 92 papers with code

Collaboration by Competition: Self-coordinated Knowledge Amalgamation for Multi-talent Student Learning

no code implementations ECCV 2020 Sihui Luo, Wenwen Pan, Xinchao Wang, Dazhou Wang, Haihong Tang, Mingli Song

To this end, we propose a self-coordinate knowledge amalgamation network (SOKA-Net) for learning the multi-talent student model.

PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation

1 code implementation4 Dec 2024 Qihan Huang, Long Chan, Jinlong Liu, Wanggui He, Hao Jiang, Mingli Song, Jie Song

To tackle this problem, this work proposes PatchDPO that estimates the quality of image patches within each generated image and accordingly trains the model.

Image Generation Image Reconstruction +1

Deep Feature Response Discriminative Calibration

1 code implementation16 Nov 2024 Wenxiang Xu, Tian Qiu, Linyun Zhou, Zunlei Feng, Mingli Song, Huiqiong Wang

Based on the calibration values, we propose a plugin-based calibration module incorporated into a modified ResNet architecture, termed Response Calibration Networks (ResCNet).

Can Graph Neural Networks Expose Training Data Properties? An Efficient Risk Assessment Approach

1 code implementation6 Nov 2024 Hanyang Yuan, Jiarong Xu, Renhong Huang, Mingli Song, Chunping Wang, Yang Yang

Our method only requires training a small set of models on graphs, while generating a sufficient number of approximated shadow models for attacks.

Diversity Inference Attack

Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning

1 code implementation6 Nov 2024 Ping Li, Tao Wang, Xinkui Zhao, Xianghua Xu, Mingli Song

Meanwhile, the former employs the repetition penalized sampling to encourage the model to yield concise pseudo-labeled sentences with less repetition, and selects the most relevant sentences upon a pretrained video-text model.

Video Captioning

LG-CAV: Train Any Concept Activation Vector with Language Guidance

no code implementations14 Oct 2024 Qihan Huang, Jie Song, Mengqi Xue, Haofei Zhang, Bingde Hu, Huiqiong Wang, Hao Jiang, Xingen Wang, Mingli Song

To bridge the gap between vision-language model and the target model, we calculate the activation values of concept descriptions on a common pool of images (probe images) with vision-language model and utilize them as language guidance to train the LG-CAV.

Language Modelling

On the Evaluation Consistency of Attribution-based Explanations

no code implementations28 Jul 2024 Jiarui Duan, Haoling Li, Haofei Zhang, Hao Jiang, Mengqi Xue, Li Sun, Mingli Song, Jie Song

Our findings underscore the necessity for future research in this domain to conduct rigorous evaluations encompassing a broader range of models and datasets, and to reassess the assumptions underlying the empirical success of different attribution methods.

Benchmarking

Curriculum Negative Mining For Temporal Networks

1 code implementation24 Jul 2024 Ziyue Chen, Tongya Zheng, Mingli Song

When compared with static networks, temporal networks present two specific challenges for negative sampling: positive sparsity and positive shift.

Odyssey: Empowering Minecraft Agents with Open-World Skills

1 code implementation22 Jul 2024 Shunyu Liu, Yaoru Li, Kongcheng Zhang, Zhenyu Cui, Wenkai Fang, Yuxuan Zheng, Tongya Zheng, Mingli Song

In this work, we introduce Odyssey, a new framework that empowers Large Language Model (LLM)-based agents with open-world skills to explore the vast Minecraft world.

Language Modelling Large Language Model +2

Learning a Mini-batch Graph Transformer via Two-stage Interaction Augmentation

1 code implementation13 Jul 2024 Wenda Li, KaiXuan Chen, Shunyu Liu, Tongya Zheng, Wenjie Huang, Mingli Song

Mini-batch Graph Transformer (MGT), as an emerging graph learning model, has demonstrated significant advantages in semi-supervised node prediction tasks with improved computational efficiency and enhanced model robustness.

Computational Efficiency Graph Learning +1

Unveiling Global Interactive Patterns across Graphs: Towards Interpretable Graph Neural Networks

1 code implementation2 Jul 2024 Yuwen Wang, Shunyu Liu, Tongya Zheng, KaiXuan Chen, Mingli Song

Graph Neural Networks (GNNs) have emerged as a prominent framework for graph mining, leading to significant advances across various domains.

Clustering Graph Classification +2

Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks

1 code implementation25 Jun 2024 Feiyang Xu, Shunyu Liu, Yunpeng Qing, Yihe Zhou, Yuwen Wang, Mingli Song

In this paper, we propose a novel temporal prototype-aware learning method, abbreviated as TPA, to learn time-adaptive AVC under short-term training trajectories.

Multi-agent Reinforcement Learning

A Comprehensive Study of Structural Pruning for Vision Models

1 code implementation18 Jun 2024 Changhao Li, Haoling Li, Mengqi Xue, Gongfan Fang, Sheng Zhou, Zunlei Feng, Huiqiong Wang, Mingli Song, Jie Song

Structural pruning has emerged as a promising approach for producing more efficient models.

Improving Adversarial Robustness via Feature Pattern Consistency Constraint

no code implementations13 Jun 2024 Jiacong Hu, Jingwen Ye, Zunlei Feng, Jiazhen Yang, Shunyu Liu, Xiaotian Yu, Lingxiang Jia, Mingli Song

Recognizing that a correct prediction relies on the correctness of the latent feature's pattern, we introduce a novel and effective Feature Pattern Consistency Constraint (FPCC) method to reinforce the latent feature's capacity to maintain the correct feature pattern.

Adversarial Robustness feature selection

A Large-scale Universal Evaluation Benchmark For Face Forgery Detection

1 code implementation13 Jun 2024 Yijun Bei, Hengrui Lou, Jinsong Geng, Erteng Liu, Lechao Cheng, Jie Song, Mingli Song, Zunlei Feng

Consequently, various face forgery detection techniques have been proposed to identify such fake facial content.

Diversity Face Generation +1

A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges

no code implementations1 May 2024 ZhengZhao Feng, Rui Wang, Tianxing Wang, Mingli Song, Sai Wu, Shuibing He

From the analysis and evaluation results, we identify key challenges and offer principles for future research to enhance the design of models and frameworks in the dynamic GNNs field.

Simple Graph Condensation

1 code implementation22 Mar 2024 Zhenbang Xiao, Yu Wang, Shunyu Liu, Huiqiong Wang, Mingli Song, Tongya Zheng

The burdensome training costs on large-scale graphs have aroused significant interest in graph condensation, which involves tuning Graph Neural Networks (GNNs) on a small condensed graph for use on the large-scale original graph.

On the Concept Trustworthiness in Concept Bottleneck Models

1 code implementation21 Mar 2024 Qihan Huang, Jie Song, Jingwen Hu, Haofei Zhang, Yong Wang, Mingli Song

Concept Bottleneck Models (CBMs), which break down the reasoning process into the input-to-concept mapping and the concept-to-label prediction, have garnered significant attention due to their remarkable interpretability achieved by the interpretable concept bottleneck.

A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective

no code implementations12 Mar 2024 Yunpeng Qing, Shunyu Liu, Jingyuan Cong, KaiXuan Chen, Yihe Zhou, Mingli Song

Offline reinforcement learning endeavors to leverage offline datasets to craft effective agent policy without online interaction, which imposes proper conservative constraints with the support of behavior policies to tackle the out-of-distribution problem.

D4RL reinforcement-learning +1

COLA: Cross-city Mobility Transformer for Human Trajectory Simulation

1 code implementation4 Mar 2024 Yu Wang, Tongya Zheng, Yuxuan Liang, Shunyu Liu, Mingli Song

To address these challenges, we have tailored a Cross-city mObiLity trAnsformer (COLA) with a dedicated model-agnostic transfer framework by effectively transferring cross-city knowledge for human trajectory simulation.

CoLA Transfer Learning

Training-Free Pretrained Model Merging

1 code implementation CVPR 2024 Zhengqi Xu, Ke Yuan, Huiqiong Wang, Yong Wang, Mingli Song, Jie Song

Furthermore, the visualization of the merged model within the multi-task loss landscape reveals that MuDSC enables the merged model to reside in the overlapping segment, featuring a unified lower loss for each task.

Improved Regret for Bandit Convex Optimization with Delayed Feedback

no code implementations14 Feb 2024 Yuanyu Wan, Chang Yao, Mingli Song, Lijun Zhang

We investigate bandit convex optimization (BCO) with delayed feedback, where only the loss value of the action is revealed under an arbitrary delay.

Blocking

Nearly Optimal Regret for Decentralized Online Convex Optimization

no code implementations14 Feb 2024 Yuanyu Wan, Tong Wei, Mingli Song, Lijun Zhang

Previous studies have established $O(n^{5/4}\rho^{-1/2}\sqrt{T})$ and ${O}(n^{3/2}\rho^{-1}\log T)$ regret bounds for convex and strongly convex functions respectively, where $n$ is the number of local learners, $\rho<1$ is the spectral gap of the communication matrix, and $T$ is the time horizon.

Angle Robustness Unmanned Aerial Vehicle Navigation in GNSS-Denied Scenarios

no code implementations4 Feb 2024 Yuxin Wang, Zunlei Feng, Haofei Zhang, Yang Gao, Jie Lei, Li Sun, Mingli Song

Due to the inability to receive signals from the Global Navigation Satellite System (GNSS) in extreme conditions, achieving accurate and robust navigation for Unmanned Aerial Vehicles (UAVs) is a challenging task.

Disentangled Condensation for Large-scale Graphs

1 code implementation18 Jan 2024 Zhenbang Xiao, Shunyu Liu, Yu Wang, Tongya Zheng, Mingli Song

Graph condensation has emerged as an intriguing technique to save the expensive training costs of Graph Neural Networks (GNNs) by substituting a condensed small graph with the original graph.

Graph Learning Link Prediction +1

Powerformer: A Section-adaptive Transformer for Power Flow Adjustment

1 code implementation5 Jan 2024 KaiXuan Chen, Wei Luo, Shunyu Liu, Yaoquan Wei, Yihe Zhou, Yunpeng Qing, Quan Zhang, Jie Song, Mingli Song

In this paper, we present a novel transformer architecture tailored for learning robust power system state representations, which strives to optimize power dispatch for the power flow adjustment across different transmission sections.

Graph Neural Network

Progressive Feature Self-reinforcement for Weakly Supervised Semantic Segmentation

1 code implementation14 Dec 2023 Jingxuan He, Lechao Cheng, Chaowei Fang, Zunlei Feng, Tingting Mu, Mingli Song

Building upon this, we introduce a complementary self-enhancement method that constrains the semantic consistency between these confident regions and an augmented image with the same class labels.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning

no code implementations28 Nov 2023 Yaoquan Wei, Shunyu Liu, Jie Song, Tongya Zheng, KaiXuan Chen, Yong Wang, Mingli Song

Instead, we employ a proxy model to extract state features that are both discriminative (adaptive to the agent) and generally applicable (robust to agent noise).

Atari Games Deep Reinforcement Learning

Pair-wise Layer Attention with Spatial Masking for Video Prediction

1 code implementation19 Nov 2023 Ping Li, Chenhan Zhang, Zheng Yang, Xianghua Xu, Mingli Song

To this end, we present a Pair-wise Layer Attention with Spatial Masking (PLA-SM) framework for video prediction to capture the spatiotemporal dynamics, which reflect the motion trend.

Autonomous Driving Decoder +1

Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation

no code implementations22 Sep 2023 Ping Li, Junjie Chen, Li Yuan, Xianghua Xu, Mingli Song

To alleviate the expensive human labeling, semi-supervised semantic segmentation employs a few labeled images and an abundant of unlabeled images to predict the pixel-level label map with the same size.

Decoder Feature Importance +2

ModelGiF: Gradient Fields for Model Functional Distance

1 code implementation ICCV 2023 Jie Song, Zhengqi Xu, Sai Wu, Gang Chen, Mingli Song

The last decade has witnessed the success of deep learning and the surge of publicly released trained models, which necessitates the quantification of the model functional distance for various purposes.

Adversarial Erasing with Pruned Elements: Towards Better Graph Lottery Ticket

1 code implementation5 Aug 2023 Yuwen Wang, Shunyu Liu, KaiXuan Chen, Tongtian Zhu, Ji Qiao, Mengjie Shi, Yuanyu Wan, Mingli Song

Graph Lottery Ticket (GLT), a combination of core subgraph and sparse subnetwork, has been proposed to mitigate the computational cost of deep Graph Neural Networks (GNNs) on large input graphs while preserving original performance.

Graph Neural Networks-based Hybrid Framework For Predicting Particle Crushing Strength

1 code implementation26 Jul 2023 Tongya Zheng, Tianli Zhang, Qingzheng Guan, Wenjie Huang, Zunlei Feng, Mingli Song, Chun Chen

Therefore, we firstly generate a dataset with 45, 000 numerical simulations and 900 particle types to facilitate the research progress of machine learning for particle crushing.

Chemical Reaction Prediction

A Survey of Deep Learning in Sports Applications: Perception, Comprehension, and Decision

no code implementations7 Jul 2023 Zhonghan Zhao, Wenhao Chai, Shengyu Hao, Wenhao Hu, Guanhong Wang, Shidong Cao, Mingli Song, Jenq-Neng Hwang, Gaoang Wang

Deep learning has the potential to revolutionize sports performance, with applications ranging from perception and comprehension to decision.

Deep Learning Survey

Spatiotemporal-Augmented Graph Neural Networks for Human Mobility Simulation

1 code implementation15 Jun 2023 Yu Wang, Tongya Zheng, Shunyu Liu, Zunlei Feng, KaiXuan Chen, Yunzhi Hao, Mingli Song

The human mobility simulation task aims to generate human mobility trajectories given a small set of trajectory data, which have aroused much concern due to the scarcity and sparsity of human mobility data.

Curricular Subgoals for Inverse Reinforcement Learning

1 code implementation14 Jun 2023 Shunyu Liu, Yunpeng Qing, Shuqi Xu, Hongyan Wu, Jiangtao Zhang, Jingyuan Cong, Tianhao Chen, YunFu Liu, Mingli Song

Inverse Reinforcement Learning (IRL) aims to reconstruct the reward function from expert demonstrations to facilitate policy learning, and has demonstrated its remarkable success in imitation learning.

Autonomous Driving D4RL +3

Decentralized SGD and Average-direction SAM are Asymptotically Equivalent

1 code implementation5 Jun 2023 Tongtian Zhu, Fengxiang He, KaiXuan Chen, Mingli Song, DaCheng Tao

Decentralized stochastic gradient descent (D-SGD) allows collaborative learning on massive devices simultaneously without the control of a central server.

Message-passing selection: Towards interpretable GNNs for graph classification

no code implementations3 Jun 2023 Wenda Li, KaiXuan Chen, Shunyu Liu, Wenjie Huang, Haofei Zhang, Yingjie Tian, Yun Su, Mingli Song

In this paper, we strive to develop an interpretable GNNs' inference paradigm, termed MSInterpreter, which can serve as a plug-and-play scheme readily applicable to various GNNs' baselines.

Graph Classification

Improving Expressivity of GNNs with Subgraph-specific Factor Embedded Normalization

1 code implementation31 May 2023 KaiXuan Chen, Shunyu Liu, Tongtian Zhu, Tongya Zheng, Haofei Zhang, Zunlei Feng, Jingwen Ye, Mingli Song

Graph Neural Networks (GNNs) have emerged as a powerful category of learning architecture for handling graph-structured data.

Improved Projection-free Online Continuous Submodular Maximization

no code implementations29 May 2023 Yucheng Liao, Yuanyu Wan, Chang Yao, Mingli Song

We investigate the problem of online learning with monotone and continuous DR-submodular reward functions, which has received great attention recently.

Blocking

Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?

1 code implementation27 May 2023 Yihe Zhou, Shunyu Liu, Yunpeng Qing, KaiXuan Chen, Tongya Zheng, Yanhao Huang, Jie Song, Mingli Song

Despite the encouraging results achieved, CTDE makes an independence assumption on agent policies, which limits agents to adopt global cooperative information from each other during centralized training.

Multi-agent Reinforcement Learning reinforcement-learning +3

Non-stationary Online Convex Optimization with Arbitrary Delays

no code implementations20 May 2023 Yuanyu Wan, Chang Yao, Mingli Song, Lijun Zhang

Despite its simplicity, our novel analysis shows that the dynamic regret of DOGD can be automatically bounded by $O(\sqrt{\bar{d}T}(P_T+1))$ under mild assumptions, and $O(\sqrt{dT}(P_T+1))$ in the worst case, where $\bar{d}$ and $d$ denote the average and maximum delay respectively, $T$ is the time horizon, and $P_T$ is the path-length of comparators.

Transition Propagation Graph Neural Networks for Temporal Networks

1 code implementation15 Apr 2023 Tongya Zheng, Zunlei Feng, Tianli Zhang, Yunzhi Hao, Mingli Song, Xingen Wang, Xinyu Wang, Ji Zhao, Chun Chen

The proposed TIP-GNN focuses on the bilevel graph structure in temporal networks: besides the explicit interaction graph, a node's sequential interactions can also be constructed as a transition graph.

Graph Mining Link Prediction +1

Life Regression based Patch Slimming for Vision Transformers

no code implementations11 Apr 2023 Jiawei Chen, Lin Chen, Jiang Yang, Tianqi Shi, Lechao Cheng, Zunlei Feng, Mingli Song

In this study, we tackle the patch slimming problem from a different perspective by proposing a life regression module that determines the lifespan of each image patch in one go.

regression

ViT-Calibrator: Decision Stream Calibration for Vision Transformer

no code implementations10 Apr 2023 Lin Chen, Zhijie Jia, Tian Qiu, Lechao Cheng, Jie Lei, Zunlei Feng, Mingli Song

In this work, we propose a new paradigm dubbed Decision Stream Calibration that boosts the performance of general Vision Transformers.

Propheter: Prophetic Teacher Guided Long-Tailed Distribution Learning

1 code implementation9 Apr 2023 Wenxiang Xu, Yongcheng Jing, Linyun Zhou, Wenqi Huang, Lechao Cheng, Zunlei Feng, Mingli Song

This is specifically achieved by devising an elaborated ``prophetic'' teacher, termed as ``Propheter'', that aims to learn the potential class distributions.

Data Augmentation

Generalization Matters: Loss Minima Flattening via Parameter Hybridization for Efficient Online Knowledge Distillation

1 code implementation CVPR 2023 Tianli Zhang, Mengqi Xue, Jiangtao Zhang, Haofei Zhang, Yu Wang, Lechao Cheng, Jie Song, Mingli Song

Most existing online knowledge distillation(OKD) techniques typically require sophisticated modules to produce diverse knowledge for improving students' generalization ability.

Knowledge Distillation

Schema Inference for Interpretable Image Classification

1 code implementation12 Mar 2023 Haofei Zhang, Mengqi Xue, Xiaokang Liu, KaiXuan Chen, Jie Song, Mingli Song

In this paper, we study a novel inference paradigm, termed as schema inference, that learns to deductively infer the explainable predictions by rebuilding the prior deep neural network (DNN) forwarding scheme, guided by the prevalent philosophical cognitive concept of schema.

Classification Graph Matching +1

Model Doctor for Diagnosing and Treating Segmentation Error

1 code implementation17 Feb 2023 Zhijie Jia, Lin Chen, Kaiwen Hu, Lechao Cheng, Zunlei Feng, Mingli Song

Despite the remarkable progress in semantic segmentation tasks with the advancement of deep neural networks, existing U-shaped hierarchical typical segmentation networks still suffer from local misclassification of categories and inaccurate target boundaries.

Segmentation Semantic Segmentation

DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes

2 code implementations15 Feb 2023 Shenghao Hao, Peiyuan Liu, Yibing Zhan, Kaixun Jin, Zuozhu Liu, Mingli Song, Jenq-Neng Hwang, Gaoang Wang

Although cross-view multi-object tracking has received increased attention in recent years, existing datasets still have several issues, including 1) missing real-world scenarios, 2) lacking diverse scenes, 3) owning a limited number of tracks, 4) comprising only static cameras, and 5) lacking standard benchmarks, which hinder the investigation and comparison of cross-view tracking methods.

Multi-Object Tracking Object +2

Team DETR: Guide Queries as a Professional Team in Detection Transformers

1 code implementation14 Feb 2023 Tian Qiu, Linyun Zhou, Wenxiang Xu, Lechao Cheng, Zunlei Feng, Mingli Song

Recent proposed DETR variants have made tremendous progress in various scenarios due to their streamlined processes and remarkable performance.

Improved Dynamic Regret for Online Frank-Wolfe

no code implementations11 Feb 2023 Yuanyu Wan, Lijun Zhang, Mingli Song

In this way, we first show that the dynamic regret bound of OFW can be improved to $O(\sqrt{T(V_T+1)})$ for smooth functions.

DepGraph: Towards Any Structural Pruning

1 code implementation CVPR 2023 Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang

Structural pruning enables model acceleration by removing structurally-grouped parameters from neural networks.

Network Pruning Neural Network Compression

Recent advances in artificial intelligence for retrosynthesis

no code implementations14 Jan 2023 Zipeng Zhong, Jie Song, Zunlei Feng, Tiantao Liu, Lingxiang Jia, Shaolun Yao, Tingjun Hou, Mingli Song

Afterwards, we analyze these methods in terms of their mechanism and performance, and introduce popular evaluation metrics for them, in which we also provide a detailed comparison among representative methods on several public datasets.

Multi-step retrosynthesis Retrosynthesis

A Loopback Network for Explainable Microvascular Invasion Classification

no code implementations CVPR 2023 Shengxuming Zhang, Tianqi Shi, Yang Jiang, Xiuming Zhang, Jie Lei, Zunlei Feng, Mingli Song

The loopback between two branches enables the category label to supervise the cell locating branch to learn the locating ability for cancerous areas.

Binary Classification Classification +1

How To Prevent the Continuous Damage of Noises To Model Training?

no code implementations CVPR 2023 Xiaotian Yu, Yang Jiang, Tianqi Shi, Zunlei Feng, Yuexuan Wang, Mingli Song, Li Sun

To address this problem, the proposed GSS alleviates the damage by switching the current gradient direction of each sample to a new direction selected from a gradient direction pool, which contains all-class gradient directions with different probabilities.

Learning with noisy labels

Evaluation and Improvement of Interpretability for Self-Explainable Part-Prototype Networks

1 code implementation ICCV 2023 Qihan Huang, Mengqi Xue, Wenqi Huang, Haofei Zhang, Jie Song, Yongcheng Jing, Mingli Song

Part-prototype networks (e. g., ProtoPNet, ProtoTree, and ProtoPool) have attracted broad research interest for their intrinsic interpretability and comparable accuracy to non-interpretable counterparts.

SASFormer: Transformers for Sparsely Annotated Semantic Segmentation

1 code implementation5 Dec 2022 Hui Su, Yue Ye, Wei Hua, Lechao Cheng, Mingli Song

In this work, we propose a simple yet effective sparse annotated semantic segmentation framework based on segformer, dubbed SASFormer, that achieves remarkable performance.

Segmentation Semantic Segmentation +2

Conservative-Progressive Collaborative Learning for Semi-supervised Semantic Segmentation

1 code implementation30 Nov 2022 Siqi Fan, Fenghua Zhu, Zunlei Feng, Yisheng Lv, Mingli Song, Fei-Yue Wang

Pseudo supervision is regarded as the core idea in semi-supervised learning for semantic segmentation, and there is always a tradeoff between utilizing only the high-quality pseudo labels and leveraging all the pseudo labels.

Segmentation Semi-Supervised Semantic Segmentation

Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition

1 code implementation23 Nov 2022 Shunyu Liu, Yihe Zhou, Jie Song, Tongya Zheng, KaiXuan Chen, Tongtian Zhu, Zunlei Feng, Mingli Song

Value Decomposition (VD) aims to deduce the contributions of agents for decentralized policies in the presence of only global rewards, and has recently emerged as a powerful credit assignment paradigm for tackling cooperative Multi-Agent Reinforcement Learning (MARL) problems.

Contrastive Learning Diversity +1

A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges

1 code implementation12 Nov 2022 Yunpeng Qing, Shunyu Liu, Jie Song, Huiqiong Wang, Mingli Song

In this survey, we provide a comprehensive review of existing works on eXplainable RL (XRL) and introduce a new taxonomy where prior works are clearly categorized into model-explaining, reward-explaining, state-explaining, and task-explaining methods.

reinforcement-learning Reinforcement Learning +2

Attention Diversification for Domain Generalization

1 code implementation9 Oct 2022 Rang Meng, Xianfeng Li, WeiJie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Mingli Song, Di Xie, ShiLiang Pu

Under this guidance, a novel Attention Diversification framework is proposed, in which Intra-Model and Inter-Model Attention Diversification Regularization are collaborated to reassign appropriate attention to diverse task-related features.

Domain Generalization

A Survey of Neural Trees

1 code implementation7 Sep 2022 Haoling Li, Jie Song, Mengqi Xue, Haofei Zhang, Jingwen Ye, Lechao Cheng, Mingli Song

This survey aims to present a comprehensive review of NTs and attempts to identify how they enhance the model interpretability.

Survey

ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition

1 code implementation22 Aug 2022 Mengqi Xue, Qihan Huang, Haofei Zhang, Lechao Cheng, Jie Song, Minghui Wu, Mingli Song

The global prototypes are adopted to provide the global view of objects to guide local prototypes to concentrate on the foreground while eliminating the influence of the background.

Decision Making Explainable artificial intelligence +1

Re-Attention Transformer for Weakly Supervised Object Localization

1 code implementation3 Aug 2022 Hui Su, Yue Ye, Zhiwei Chen, Mingli Song, Lechao Cheng

Weakly supervised object localization is a challenging task which aims to localize objects with coarse annotations such as image categories.

Object Weakly-Supervised Object Localization

Federated Selective Aggregation for Knowledge Amalgamation

1 code implementation27 Jul 2022 Donglin Xie, Ruonan Yu, Gongfan Fang, Jie Song, Zunlei Feng, Xinchao Wang, Li Sun, Mingli Song

The goal of FedSA is to train a student model for a new task with the help of several decentralized teachers, whose pre-training tasks and data are different and agnostic.

Learning Graph Neural Networks for Image Style Transfer

no code implementations24 Jul 2022 Yongcheng Jing, Yining Mao, Yiding Yang, Yibing Zhan, Mingli Song, Xinchao Wang, DaCheng Tao

To this end, we develop an elaborated GNN model with content and style local patches as the graph vertices.

Image Stylization

Learning with Recoverable Forgetting

1 code implementation17 Jul 2022 Jingwen Ye, Yifang Fu, Jie Song, Xingyi Yang, Songhua Liu, Xin Jin, Mingli Song, Xinchao Wang

Life-long learning aims at learning a sequence of tasks without forgetting the previously acquired knowledge.

General Knowledge Transfer Learning

Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning

1 code implementation8 Jul 2022 Shunyu Liu, Jie Song, Yihe Zhou, Na Yu, KaiXuan Chen, Zunlei Feng, Mingli Song

In this work, we introduce a novel interactiOn Pattern disenTangling (OPT) method, to disentangle the entity interactions into interaction prototypes, each of which represents an underlying interaction pattern within a subgroup of the entities.

Diversity Multi-agent Reinforcement Learning +3

Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework

2 code implementations5 Jul 2022 Shunyu Liu, KaiXuan Chen, Na Yu, Jie Song, Zunlei Feng, Mingli Song

Despite the promising results achieved, state-of-the-art interactive reinforcement learning schemes rely on passively receiving supervision signals from advisor experts, in the form of either continuous monitoring or pre-defined rules, which inevitably result in a cumbersome and expensive learning process.

Topology-aware Generalization of Decentralized SGD

1 code implementation25 Jun 2022 Tongtian Zhu, Fengxiang He, Lan Zhang, Zhengyang Niu, Mingli Song, DaCheng Tao

Our theory indicates that the generalizability of D-SGD is positively correlated with the spectral gap, and can explain why consensus control in initial training phase can ensure better generalization.

Slimmable Domain Adaptation

1 code implementation CVPR 2022 Rang Meng, WeiJie Chen, Shicai Yang, Jie Song, Luojun Lin, Di Xie, ShiLiang Pu, Xinchao Wang, Mingli Song, Yueting Zhuang

In this paper, we introduce a simple framework, Slimmable Domain Adaptation, to improve cross-domain generalization with a weight-sharing model bank, from which models of different capacities can be sampled to accommodate different accuracy-efficiency trade-offs.

Domain Generalization Unsupervised Domain Adaptation

Label Matching Semi-Supervised Object Detection

3 code implementations CVPR 2022 Binbin Chen, WeiJie Chen, Shicai Yang, Yunyi Xuan, Jie Song, Di Xie, ShiLiang Pu, Mingli Song, Yueting Zhuang

To remedy this issue, we present a novel label assignment mechanism for self-training framework, namely proposal self-assignment, which injects the proposals from student into teacher and generates accurate pseudo labels to match each proposal in the student model accordingly.

Object object-detection +2

Recent Advances in Embedding Methods for Multi-Object Tracking: A Survey

no code implementations22 May 2022 Gaoang Wang, Mingli Song, Jenq-Neng Hwang

Multi-object tracking (MOT) aims to associate target objects across video frames in order to obtain entire moving trajectories.

Image Classification Multi-Object Tracking +3

Distribution-Aware Graph Representation Learning for Transient Stability Assessment of Power System

1 code implementation12 May 2022 KaiXuan Chen, Shunyu Liu, Na Yu, Rong Yan, Quan Zhang, Jie Song, Zunlei Feng, Mingli Song

As the topology of the power system is in the form of graph structure, graph neural network based representation learning is naturally suitable for learning the status of the power system.

Binary Classification Graph Neural Network +2

Comparison Knowledge Translation for Generalizable Image Classification

1 code implementation7 May 2022 Zunlei Feng, Tian Qiu, Sai Wu, Xiaotuan Jin, Zengliang He, Mingli Song, Huiqiong Wang

In this paper, we attempt to build a generalizable framework that emulates the humans' recognition mechanism in the image classification task, hoping to improve the classification performance on unseen categories with the support of annotations of other categories.

Classification Image Classification +1

Spot-adaptive Knowledge Distillation

2 code implementations5 May 2022 Jie Song, Ying Chen, Jingwen Ye, Mingli Song

Knowledge distillation (KD) has become a well established paradigm for compressing deep neural networks.

Knowledge Distillation

Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks

1 code implementation25 Mar 2022 Jiacong Hu, Jing Gao, Jingwen Ye, Yang Gao, Xingen Wang, Zunlei Feng, Mingli Song

With the rapid development of deep learning, the increasing complexity and scale of parameters make training a new model increasingly resource-intensive.

Incremental Learning Knowledge Distillation +2

Meta-attention for ViT-backed Continual Learning

1 code implementation CVPR 2022 Mengqi Xue, Haofei Zhang, Jie Song, Mingli Song

Continual learning is a longstanding research topic due to its crucial role in tackling continually arriving tasks.

Continual Learning

Root-aligned SMILES: A Tight Representation for Chemical Reaction Prediction

1 code implementation22 Mar 2022 Zipeng Zhong, Jie Song, Zunlei Feng, Tiantao Liu, Lingxiang Jia, Shaolun Yao, Min Wu, Tingjun Hou, Mingli Song

Chemical reaction prediction, involving forward synthesis and retrosynthesis prediction, is a fundamental problem in organic synthesis.

Chemical Reaction Prediction Retrosynthesis +1

Knowledge Amalgamation for Object Detection with Transformers

1 code implementation7 Mar 2022 Haofei Zhang, Feng Mao, Mengqi Xue, Gongfan Fang, Zunlei Feng, Jie Song, Mingli Song

Moreover, the transformer-based students excel in learning amalgamated knowledge, as they have mastered heterogeneous detection tasks rapidly and achieved superior or at least comparable performance to those of the teachers in their specializations.

Object object-detection +1

Imbalanced Sample Generation and Evaluation for Power System Transient Stability Using CTGAN

no code implementations16 Dec 2021 Gengshi Han, Shunyu Liu, KaiXuan Chen, Na Yu, Zunlei Feng, Mingli Song

This paper proposes a controllable sample generation framework based on Conditional Tabular Generative Adversarial Network (CTGAN) to generate specified transient stability samples.

Generative Adversarial Network

Up to 100$\times$ Faster Data-free Knowledge Distillation

2 code implementations12 Dec 2021 Gongfan Fang, Kanya Mo, Xinchao Wang, Jie Song, Shitao Bei, Haofei Zhang, Mingli Song

At the heart of our approach is a novel strategy to reuse the shared common features in training data so as to synthesize different data instances.

Data-free Knowledge Distillation

Bootstrapping ViTs: Towards Liberating Vision Transformers from Pre-training

1 code implementation CVPR 2022 Haofei Zhang, Jiarui Duan, Mengqi Xue, Jie Song, Li Sun, Mingli Song

Recently, vision Transformers (ViTs) are developing rapidly and starting to challenge the domination of convolutional neural networks (CNNs) in the realm of computer vision (CV).

A Survey of Deep Learning for Low-Shot Object Detection

no code implementations6 Dec 2021 Qihan Huang, Haofei Zhang, Mengqi Xue, Jie Song, Mingli Song

Although few-shot learning and zero-shot learning have been extensively explored in the field of image classification, it is indispensable to design new methods for object detection in the data-scarce scenario since object detection has an additional challenging localization task.

Deep Learning Few-Shot Learning +8

Safe Distillation Box

1 code implementation5 Dec 2021 Jingwen Ye, Yining Mao, Jie Song, Xinchao Wang, Cheng Jin, Mingli Song

In other words, all users may employ a model in SDB for inference, but only authorized users get access to KD from the model.

Knowledge Distillation

Learning Dynamic Preference Structure Embedding From Temporal Networks

1 code implementation23 Nov 2021 Tongya Zheng, Zunlei Feng, Yu Wang, Chengchao Shen, Mingli Song, Xingen Wang, Xinyu Wang, Chun Chen, Hao Xu

Our proposed Dynamic Preference Structure (DPS) framework consists of two stages: structure sampling and graph fusion.

Graph Sampling

Distribution Knowledge Embedding for Graph Pooling

1 code implementation29 Sep 2021 KaiXuan Chen, Jie Song, Shunyu Liu, Na Yu, Zunlei Feng, Gengshi Han, Mingli Song

A DKEPool network de facto disassembles representation learning into two stages, structure learning and distribution learning.

Representation Learning

Meta-Aggregator: Learning to Aggregate for 1-bit Graph Neural Networks

no code implementations ICCV 2021 Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, DaCheng Tao

In this paper, we study a novel meta aggregation scheme towards binarizing graph neural networks (GNNs).

Online Knowledge Distillation for Efficient Pose Estimation

1 code implementation ICCV 2021 Zheng Li, Jingwen Ye, Mingli Song, Ying Huang, Zhigeng Pan

However, existing pose distillation works rely on a heavy pre-trained estimator to perform knowledge transfer and require a complex two-stage learning procedure.

Knowledge Distillation Pose Estimation

Boundary Knowledge Translation based Reference Semantic Segmentation

no code implementations1 Aug 2021 Lechao Cheng, Zunlei Feng, Xinchao Wang, Ya Jie Liu, Jie Lei, Mingli Song

In this paper, we introduce a novel Reference semantic segmentation Network (Ref-Net) to conduct visual boundary knowledge translation.

Segmentation Semantic Segmentation +1

Visual Boundary Knowledge Translation for Foreground Segmentation

1 code implementation1 Aug 2021 Zunlei Feng, Lechao Cheng, Xinchao Wang, Xiang Wang, Yajie Liu, Xiangtong Du, Mingli Song

To this end, we propose a Translation Segmentation Network (Trans-Net), which comprises a segmentation network and two boundary discriminators.

Foreground Segmentation Image Segmentation +3

Edge-competing Pathological Liver Vessel Segmentation with Limited Labels

1 code implementation1 Aug 2021 Zunlei Feng, Zhonghua Wang, Xinchao Wang, Xiuming Zhang, Lechao Cheng, Jie Lei, Yuexuan Wang, Mingli Song

The diagnosis of MVI needs discovering the vessels that contain hepatocellular carcinoma cells and counting their number in each vessel, which depends heavily on experiences of the doctor, is largely subjective and time-consuming.

Segmentation whole slide images

Shape Controllable Virtual Try-on for Underwear Models

no code implementations28 Jul 2021 Xin Gao, Zhenjiang Liu, Zunlei Feng, Chengji Shen, Kairi Ou, Haihong Tang, Mingli Song

Existing 2D image-based virtual try-on methods aim to transfer a target clothing image onto a reference person, which has two main disadvantages: cannot control the size and length precisely; unable to accurately estimate the user's figure in the case of users wearing thick clothes, resulting in inaccurate dressing effect.

Graph Attention Virtual Try-on

Turning Frequency to Resolution: Video Super-Resolution via Event Cameras

no code implementations CVPR 2021 Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, DaCheng Tao

To this end, we propose an Event-based VSR framework (E-VSR), of which the key component is an asynchronous interpolation (EAI) module that reconstructs a high-frequency (HF) video stream with uniform and tiny pixel displacements between neighboring frames from an event stream.

Video Super-Resolution

Tree-Like Decision Distillation

no code implementations CVPR 2021 Jie Song, Haofei Zhang, Xinchao Wang, Mengqi Xue, Ying Chen, Li Sun, DaCheng Tao, Mingli Song

Knowledge distillation pursues a diminutive yet well-behaved student network by harnessing the knowledge learned by a cumbersome teacher model.

Decision Making Knowledge Distillation

Contrastive Model Inversion for Data-Free Knowledge Distillation

2 code implementations18 May 2021 Gongfan Fang, Jie Song, Xinchao Wang, Chengchao Shen, Xingen Wang, Mingli Song

In this paper, we propose Contrastive Model Inversion~(CMI), where the data diversity is explicitly modeled as an optimizable objective, to alleviate the mode collapse issue.

Contrastive Learning Data-free Knowledge Distillation +1

KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation

1 code implementation10 May 2021 Mengqi Xue, Jie Song, Xinchao Wang, Ying Chen, Xingen Wang, Mingli Song

Knowledge distillation (KD) has recently emerged as an efficacious scheme for learning compact deep neural networks (DNNs).

Knowledge Distillation Multi-class Classification

Training Generative Adversarial Networks in One Stage

1 code implementation CVPR 2021 Chengchao Shen, Youtan Yin, Xinchao Wang, Xubin Li, Jie Song, Mingli Song

Based on the adversarial losses of the generator and discriminator, we categorize GANs into two classes, Symmetric GANs and Asymmetric GANs, and introduce a novel gradient decomposition method to unify the two, allowing us to train both classes in one stage and hence alleviate the training effort.

Data-free Knowledge Distillation Image Generation

SPAGAN: Shortest Path Graph Attention Network

1 code implementation10 Jan 2021 Yiding Yang, Xinchao Wang, Mingli Song, Junsong Yuan, DaCheng Tao

SPAGAN therefore allows for a more informative and intact exploration of the graph structure and further {a} more effective aggregation of information from distant neighbors into the center node, as compared to node-based GCN methods.

Graph Attention

Self-Born Wiring for Neural Trees

no code implementations ICCV 2021 Ying Chen, Feng Mao, Jie Song, Xinchao Wang, Huiqiong Wang, Mingli Song

Neural trees aim at integrating deep neural networks and decision trees so as to bring the best of the two worlds, including representation learning from the former and faster inference from the latter.

Representation Learning

Progressive Network Grafting for Few-Shot Knowledge Distillation

2 code implementations9 Dec 2020 Chengchao Shen, Xinchao Wang, Youtan Yin, Jie Song, Sihui Luo, Mingli Song

In this paper, we investigate the practical few-shot knowledge distillation scenario, where we assume only a few samples without human annotations are available for each category.

Knowledge Distillation Model Compression +1

One-sample Guided Object Representation Disassembling

no code implementations NeurIPS 2020 Zunlei Feng, Yongming He, Xinchao Wang, Xin Gao, Jie Lei, Cheng Jin, Mingli Song

In this paper, we introduce the One-sample Guided Object Representation Disassembling (One-GORD) method, which only requires one annotated sample for each object category to learn disassembled object representation from unannotated images.

Data Augmentation Image Classification +1

DEAL: Difficulty-aware Active Learning for Semantic Segmentation

1 code implementation17 Oct 2020 Shuai Xie, Zunlei Feng, Ying Chen, Songtao Sun, Chao Ma, Mingli Song

To deal with this problem, we propose a semantic Difficulty-awarE Active Learning (DEAL) network composed of two branches: the common segmentation branch and the semantic difficulty branch.

Active Learning Segmentation +1

Learning Propagation Rules for Attribution Map Generation

no code implementations ECCV 2020 Yiding Yang, Jiayan Qiu, Mingli Song, DaCheng Tao, Xinchao Wang

Prior gradient-based attribution-map methods rely on handcrafted propagation rules for the non-linear/activation layers during the backward pass, so as to produce gradients of the input and then the attribution map.

Factorizable Graph Convolutional Networks

1 code implementation NeurIPS 2020 Yiding Yang, Zunlei Feng, Mingli Song, Xinchao Wang

In this paper, we introduce a novel graph convolutional network (GCN), termed as factorizable graph convolutional network(FactorGCN), that explicitly disentangles such intertwined relations encoded in a graph.

Graph Classification Graph Regression +1

Impression Space from Deep Template Network

no code implementations10 Jul 2020 Gongfan Fang, Xinchao Wang, Haofei Zhang, Jie Song, Mingli Song

This network is referred to as the {\emph{Template Network}} because its filters will be used as templates to reconstruct images from the impression.

Image Generation Translation

Disassembling Object Representations without Labels

no code implementations3 Apr 2020 Zunlei Feng, Xinchao Wang, Yongming He, Yike Yuan, Xin Gao, Mingli Song

In this paper, we study a new representation-learning task, which we termed as disassembling object representations.

General Classification Generative Adversarial Network +3

Distilling Knowledge from Graph Convolutional Networks

1 code implementation CVPR 2020 Yiding Yang, Jiayan Qiu, Mingli Song, DaCheng Tao, Xinchao Wang

To enable the knowledge transfer from the teacher GCN to the student, we propose a local structure preserving module that explicitly accounts for the topological semantics of the teacher.

Knowledge Distillation Transfer Learning

DEPARA: Deep Attribution Graph for Deep Knowledge Transferability

1 code implementation CVPR 2020 Jie Song, Yixin Chen, Jingwen Ye, Xinchao Wang, Chengchao Shen, Feng Mao, Mingli Song

In this paper, we propose the DEeP Attribution gRAph (DEPARA) to investigate the transferability of knowledge learned from PR-DNNs.

Model Selection Transfer Learning

Data-Free Adversarial Distillation

3 code implementations23 Dec 2019 Gongfan Fang, Jie Song, Chengchao Shen, Xinchao Wang, Da Chen, Mingli Song

Knowledge Distillation (KD) has made remarkable progress in the last few years and become a popular paradigm for model compression and knowledge transfer.

Knowledge Distillation Model Compression +2

Semantic Regularization: Improve Few-shot Image Classification by Reducing Meta Shift

no code implementations18 Dec 2019 Da Chen, Yong-Liang Yang, Zunlei Feng, Xiang Wu, Mingli Song, Wenbin Li, Yuan He, Hui Xue, Feng Mao

This strategy leads to severe meta shift issues across multiple tasks, meaning the learned prototypes or class descriptors are not stable as each task only involves their own support set.

Decoder Few-Shot Image Classification +2

Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers

1 code implementation26 Nov 2019 Ya Zhao, Rui Xu, Xinchao Wang, Peng Hou, Haihong Tang, Mingli Song

In this paper, we propose a new method, termed as Lip by Speech (LIBS), of which the goal is to strengthen lip reading by learning from speech recognizers.

Knowledge Distillation Lipreading +3

Dynamic Instance Normalization for Arbitrary Style Transfer

no code implementations16 Nov 2019 Yongcheng Jing, Xiao Liu, Yukang Ding, Xinchao Wang, Errui Ding, Mingli Song, Shilei Wen

Prior normalization methods rely on affine transformations to produce arbitrary image style transfers, of which the parameters are computed in a pre-defined way.

Style Transfer

Deep Model Transferability from Attribution Maps

2 code implementations NeurIPS 2019 Jie Song, Yixin Chen, Xinchao Wang, Chengchao Shen, Mingli Song

Exploring the transferability between heterogeneous tasks sheds light on their intrinsic interconnections, and consequently enables knowledge transfer from one task to another so as to reduce the training effort of the latter.

Transfer Learning

Customizing Student Networks From Heterogeneous Teachers via Adaptive Knowledge Amalgamation

2 code implementations ICCV 2019 Chengchao Shen, Mengqi Xue, Xinchao Wang, Jie Song, Li Sun, Mingli Song

To this end, we introduce a dual-step strategy that first extracts the task-specific knowledge from the heterogeneous teachers sharing the same sub-task, and then amalgamates the extracted knowledge to build the student network.

A Cascade Sequence-to-Sequence Model for Chinese Mandarin Lip Reading

no code implementations14 Aug 2019 Ya Zhao, Rui Xu, Mingli Song

When trained on CMLR dataset, the proposed CSSMCM surpasses the performance of state-of-the-art lip reading frameworks, which confirms the effectiveness of explicit modeling of tones for Chinese Mandarin lip reading.

Lipreading Lip Reading +1

Knowledge Amalgamation from Heterogeneous Networks by Common Feature Learning

2 code implementations24 Jun 2019 Sihui Luo, Xinchao Wang, Gongfan Fang, Yao Hu, Dapeng Tao, Mingli Song

An increasing number of well-trained deep networks have been released online by researchers and developers, enabling the community to reuse them in a plug-and-play way without accessing the training annotations.

Amalgamating Filtered Knowledge: Learning Task-customized Student from Multi-task Teachers

1 code implementation28 May 2019 Jingwen Ye, Xinchao Wang, Yixin Ji, Kairi Ou, Mingli Song

Many well-trained Convolutional Neural Network(CNN) models have now been released online by developers for the sake of effortless reproducing.

Amalgamating Knowledge towards Comprehensive Classification

1 code implementation7 Nov 2018 Chengchao Shen, Xinchao Wang, Jie Song, Li Sun, Mingli Song

We propose in this paper to study a new model-reusing task, which we term as \emph{knowledge amalgamation}.

Classification General Classification

Selective Zero-Shot Classification with Augmented Attributes

no code implementations ECCV 2018 Jie Song, Chengchao Shen, Jie Lei, An-Xiang Zeng, Kairi Ou, DaCheng Tao, Mingli Song

We propose a selective zero-shot classifier based on both the human defined and the automatically discovered residual attributes.

Attribute Classification +2

Interpretable Partitioned Embedding for Customized Fashion Outfit Composition

no code implementations13 Jun 2018 Zunlei Feng, Zhenyun Yu, Yezhou Yang, Yongcheng Jing, Junxiao Jiang, Mingli Song

In the supervised attributes module, multiple attributes labels are adopted to ensure that different parts of the overall embedding correspond to different attributes.

Attribute

Dual Swap Disentangling

1 code implementation NeurIPS 2018 Zunlei Feng, Xinchao Wang, Chenglong Ke, An-Xiang Zeng, DaCheng Tao, Mingli Song

To achieve disentangling using the labeled pairs, we follow a "encoding-swap-decoding" process, where we first swap the parts of their encodings corresponding to the shared attribute and then decode the obtained hybrid codes to reconstruct the original input pairs.

Attribute

Transductive Unbiased Embedding for Zero-Shot Learning

no code implementations CVPR 2018 Jie Song, Chengchao Shen, Yezhou Yang, Yang Liu, Mingli Song

Most existing Zero-Shot Learning (ZSL) methods have the strong bias problem, in which instances of unseen (target) classes tend to be categorized as one of the seen (source) classes.

Transductive Learning Zero-Shot Learning

Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields

1 code implementation ECCV 2018 Yongcheng Jing, Yang Liu, Yezhou Yang, Zunlei Feng, Yizhou Yu, DaCheng Tao, Mingli Song

In this paper, we present a stroke controllable style transfer network that can achieve continuous and spatial stroke size control.

Style Transfer

DeepSIC: Deep Semantic Image Compression

no code implementations29 Jan 2018 Sihui Luo, Yezhou Yang, Mingli Song

The same practice also enable the compressed code to carry the image semantic information during storage and transmission.

Benchmarking Image Compression +1

TripletGAN: Training Generative Model with Triplet Loss

no code implementations14 Nov 2017 Gongze Cao, Yezhou Yang, Jie Lei, Cheng Jin, Yang Liu, Mingli Song

As an effective way of metric learning, triplet loss has been widely used in many deep learning tasks, including face recognition and person-ReID, leading to many states of the arts.

Face Recognition General Classification +2

Neural Style Transfer: A Review

8 code implementations11 May 2017 Yongcheng Jing, Yezhou Yang, Zunlei Feng, Jingwen Ye, Yizhou Yu, Mingli Song

We first propose a taxonomy of current algorithms in the field of NST.

Style Transfer

Weakly Supervised Multiclass Video Segmentation

no code implementations CVPR 2014 Xiao Liu, DaCheng Tao, Mingli Song, Ying Ruan, Chun Chen, Jiajun Bu

In this paper, we present a novel nearest neighbor-based label transfer scheme for weakly supervised video segmentation.

Segmentation Semantic Similarity +5

Semi-Supervised Coupled Dictionary Learning for Person Re-identification

no code implementations CVPR 2014 Xiao Liu, Mingli Song, DaCheng Tao, Xingchen Zhou, Chun Chen, Jiajun Bu

In this paper, to bridge the human appearance variations across cameras, two coupled dictionaries that relate to the gallery and probe cameras are jointly learned in the training phase from both labeled and unlabeled images.

Dictionary Learning Person Re-Identification

Probabilistic Graphlet Cut: Exploiting Spatial Structure Cue for Weakly Supervised Image Segmentation

no code implementations CVPR 2013 Luming Zhang, Mingli Song, Zicheng Liu, Xiao Liu, Jiajun Bu, Chun Chen

Finally, we propose a novel image segmentation algorithm, called graphlet cut, that leverages the learned graphlet distribution in measuring the homogeneity of a set of spatially structured superpixels.

Image Segmentation Segmentation +2

Sparse Norm Filtering

no code implementations17 May 2013 Chengxi Ye, DaCheng Tao, Mingli Song, David W. Jacobs, Min Wu

Optimization-based filtering smoothes an image by minimizing a fidelity function and simultaneously preserves edges by exploiting a sparse norm penalty over gradients.

Colorization Deblurring +2

Spectral Graph Cut from a Filtering Point of View

no code implementations20 May 2012 Chengxi Ye, Yuxu Lin, Mingli Song, Chun Chen, David W. Jacobs

In this paper, we analyze image segmentation algorithms that are based on spectral graph theory, e. g., normalized cut, and show that there is a natural connection between spectural graph theory based image segmentationand and edge preserving filtering.

Image Segmentation Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.