no code implementations • ECCV 2020 • Sihui Luo, Wenwen Pan, Xinchao Wang, Dazhou Wang, Haihong Tang, Mingli Song
To this end, we propose a self-coordinate knowledge amalgamation network (SOKA-Net) for learning the multi-talent student model.
1 code implementation • 4 Dec 2024 • Qihan Huang, Long Chan, Jinlong Liu, Wanggui He, Hao Jiang, Mingli Song, Jie Song
To tackle this problem, this work proposes PatchDPO that estimates the quality of image patches within each generated image and accordingly trains the model.
1 code implementation • 16 Nov 2024 • Wenxiang Xu, Tian Qiu, Linyun Zhou, Zunlei Feng, Mingli Song, Huiqiong Wang
Based on the calibration values, we propose a plugin-based calibration module incorporated into a modified ResNet architecture, termed Response Calibration Networks (ResCNet).
1 code implementation • 6 Nov 2024 • Hanyang Yuan, Jiarong Xu, Renhong Huang, Mingli Song, Chunping Wang, Yang Yang
Our method only requires training a small set of models on graphs, while generating a sufficient number of approximated shadow models for attacks.
1 code implementation • 6 Nov 2024 • Ping Li, Tao Wang, Xinkui Zhao, Xianghua Xu, Mingli Song
Meanwhile, the former employs the repetition penalized sampling to encourage the model to yield concise pseudo-labeled sentences with less repetition, and selects the most relevant sentences upon a pretrained video-text model.
no code implementations • 14 Oct 2024 • Qihan Huang, Jie Song, Mengqi Xue, Haofei Zhang, Bingde Hu, Huiqiong Wang, Hao Jiang, Xingen Wang, Mingli Song
To bridge the gap between vision-language model and the target model, we calculate the activation values of concept descriptions on a common pool of images (probe images) with vision-language model and utilize them as language guidance to train the LG-CAV.
no code implementations • 28 Jul 2024 • Jiarui Duan, Haoling Li, Haofei Zhang, Hao Jiang, Mengqi Xue, Li Sun, Mingli Song, Jie Song
Our findings underscore the necessity for future research in this domain to conduct rigorous evaluations encompassing a broader range of models and datasets, and to reassess the assumptions underlying the empirical success of different attribution methods.
1 code implementation • 24 Jul 2024 • Ziyue Chen, Tongya Zheng, Mingli Song
When compared with static networks, temporal networks present two specific challenges for negative sampling: positive sparsity and positive shift.
1 code implementation • 22 Jul 2024 • Shunyu Liu, Yaoru Li, Kongcheng Zhang, Zhenyu Cui, Wenkai Fang, Yuxuan Zheng, Tongya Zheng, Mingli Song
In this work, we introduce Odyssey, a new framework that empowers Large Language Model (LLM)-based agents with open-world skills to explore the vast Minecraft world.
1 code implementation • 13 Jul 2024 • Wenda Li, KaiXuan Chen, Shunyu Liu, Tongya Zheng, Wenjie Huang, Mingli Song
Mini-batch Graph Transformer (MGT), as an emerging graph learning model, has demonstrated significant advantages in semi-supervised node prediction tasks with improved computational efficiency and enhanced model robustness.
1 code implementation • 2 Jul 2024 • Yuwen Wang, Shunyu Liu, Tongya Zheng, KaiXuan Chen, Mingli Song
Graph Neural Networks (GNNs) have emerged as a prominent framework for graph mining, leading to significant advances across various domains.
1 code implementation • 25 Jun 2024 • Feiyang Xu, Shunyu Liu, Yunpeng Qing, Yihe Zhou, Yuwen Wang, Mingli Song
In this paper, we propose a novel temporal prototype-aware learning method, abbreviated as TPA, to learn time-adaptive AVC under short-term training trajectories.
1 code implementation • 18 Jun 2024 • Changhao Li, Haoling Li, Mengqi Xue, Gongfan Fang, Sheng Zhou, Zunlei Feng, Huiqiong Wang, Mingli Song, Jie Song
Structural pruning has emerged as a promising approach for producing more efficient models.
no code implementations • 13 Jun 2024 • Jiacong Hu, Jingwen Ye, Zunlei Feng, Jiazhen Yang, Shunyu Liu, Xiaotian Yu, Lingxiang Jia, Mingli Song
Recognizing that a correct prediction relies on the correctness of the latent feature's pattern, we introduce a novel and effective Feature Pattern Consistency Constraint (FPCC) method to reinforce the latent feature's capacity to maintain the correct feature pattern.
1 code implementation • 13 Jun 2024 • Yijun Bei, Hengrui Lou, Jinsong Geng, Erteng Liu, Lechao Cheng, Jie Song, Mingli Song, Zunlei Feng
Consequently, various face forgery detection techniques have been proposed to identify such fake facial content.
no code implementations • 24 May 2024 • Shunyu Liu, Wei Luo, Yanzhen Zhou, KaiXuan Chen, Quan Zhang, Huating Xu, Qinglai Guo, Mingli Song
Transmission interface power flow adjustment is a critical measure to ensure the security and economy operation of power systems.
no code implementations • 1 May 2024 • ZhengZhao Feng, Rui Wang, Tianxing Wang, Mingli Song, Sai Wu, Shuibing He
From the analysis and evaluation results, we identify key challenges and offer principles for future research to enhance the design of models and frameworks in the dynamic GNNs field.
1 code implementation • 22 Mar 2024 • Zhenbang Xiao, Yu Wang, Shunyu Liu, Huiqiong Wang, Mingli Song, Tongya Zheng
The burdensome training costs on large-scale graphs have aroused significant interest in graph condensation, which involves tuning Graph Neural Networks (GNNs) on a small condensed graph for use on the large-scale original graph.
1 code implementation • 21 Mar 2024 • Qihan Huang, Jie Song, Jingwen Hu, Haofei Zhang, Yong Wang, Mingli Song
Concept Bottleneck Models (CBMs), which break down the reasoning process into the input-to-concept mapping and the concept-to-label prediction, have garnered significant attention due to their remarkable interpretability achieved by the interpretable concept bottleneck.
no code implementations • 12 Mar 2024 • Yunpeng Qing, Shunyu Liu, Jingyuan Cong, KaiXuan Chen, Yihe Zhou, Mingli Song
Offline reinforcement learning endeavors to leverage offline datasets to craft effective agent policy without online interaction, which imposes proper conservative constraints with the support of behavior policies to tackle the out-of-distribution problem.
1 code implementation • 4 Mar 2024 • Yu Wang, Tongya Zheng, Yuxuan Liang, Shunyu Liu, Mingli Song
To address these challenges, we have tailored a Cross-city mObiLity trAnsformer (COLA) with a dedicated model-agnostic transfer framework by effectively transferring cross-city knowledge for human trajectory simulation.
1 code implementation • CVPR 2024 • Zhengqi Xu, Ke Yuan, Huiqiong Wang, Yong Wang, Mingli Song, Jie Song
Furthermore, the visualization of the merged model within the multi-task loss landscape reveals that MuDSC enables the merged model to reside in the overlapping segment, featuring a unified lower loss for each task.
no code implementations • 14 Feb 2024 • Yuanyu Wan, Chang Yao, Mingli Song, Lijun Zhang
We investigate bandit convex optimization (BCO) with delayed feedback, where only the loss value of the action is revealed under an arbitrary delay.
no code implementations • 14 Feb 2024 • Yuanyu Wan, Tong Wei, Mingli Song, Lijun Zhang
Previous studies have established $O(n^{5/4}\rho^{-1/2}\sqrt{T})$ and ${O}(n^{3/2}\rho^{-1}\log T)$ regret bounds for convex and strongly convex functions respectively, where $n$ is the number of local learners, $\rho<1$ is the spectral gap of the communication matrix, and $T$ is the time horizon.
no code implementations • 4 Feb 2024 • Yuxin Wang, Zunlei Feng, Haofei Zhang, Yang Gao, Jie Lei, Li Sun, Mingli Song
Due to the inability to receive signals from the Global Navigation Satellite System (GNSS) in extreme conditions, achieving accurate and robust navigation for Unmanned Aerial Vehicles (UAVs) is a challenging task.
1 code implementation • 18 Jan 2024 • Zhenbang Xiao, Shunyu Liu, Yu Wang, Tongya Zheng, Mingli Song
Graph condensation has emerged as an intriguing technique to save the expensive training costs of Graph Neural Networks (GNNs) by substituting a condensed small graph with the original graph.
1 code implementation • 5 Jan 2024 • KaiXuan Chen, Wei Luo, Shunyu Liu, Yaoquan Wei, Yihe Zhou, Yunpeng Qing, Quan Zhang, Jie Song, Mingli Song
In this paper, we present a novel transformer architecture tailored for learning robust power system state representations, which strives to optimize power dispatch for the power flow adjustment across different transmission sections.
1 code implementation • 14 Dec 2023 • Jingxuan He, Lechao Cheng, Chaowei Fang, Zunlei Feng, Tingting Mu, Mingli Song
Building upon this, we introduce a complementary self-enhancement method that constrains the semantic consistency between these confident regions and an augmented image with the same class labels.
Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation
no code implementations • 28 Nov 2023 • Yaoquan Wei, Shunyu Liu, Jie Song, Tongya Zheng, KaiXuan Chen, Yong Wang, Mingli Song
Instead, we employ a proxy model to extract state features that are both discriminative (adaptive to the agent) and generally applicable (robust to agent noise).
1 code implementation • 19 Nov 2023 • Ping Li, Chenhan Zhang, Zheng Yang, Xianghua Xu, Mingli Song
To this end, we present a Pair-wise Layer Attention with Spatial Masking (PLA-SM) framework for video prediction to capture the spatiotemporal dynamics, which reflect the motion trend.
no code implementations • 22 Sep 2023 • Ping Li, Junjie Chen, Li Yuan, Xianghua Xu, Mingli Song
To alleviate the expensive human labeling, semi-supervised semantic segmentation employs a few labeled images and an abundant of unlabeled images to predict the pixel-level label map with the same size.
1 code implementation • ICCV 2023 • Jie Song, Zhengqi Xu, Sai Wu, Gang Chen, Mingli Song
The last decade has witnessed the success of deep learning and the surge of publicly released trained models, which necessitates the quantification of the model functional distance for various purposes.
1 code implementation • 5 Aug 2023 • Yuwen Wang, Shunyu Liu, KaiXuan Chen, Tongtian Zhu, Ji Qiao, Mengjie Shi, Yuanyu Wan, Mingli Song
Graph Lottery Ticket (GLT), a combination of core subgraph and sparse subnetwork, has been proposed to mitigate the computational cost of deep Graph Neural Networks (GNNs) on large input graphs while preserving original performance.
1 code implementation • 26 Jul 2023 • Tongya Zheng, Tianli Zhang, Qingzheng Guan, Wenjie Huang, Zunlei Feng, Mingli Song, Chun Chen
Therefore, we firstly generate a dataset with 45, 000 numerical simulations and 900 particle types to facilitate the research progress of machine learning for particle crushing.
no code implementations • 7 Jul 2023 • Zhonghan Zhao, Wenhao Chai, Shengyu Hao, Wenhao Hu, Guanhong Wang, Shidong Cao, Mingli Song, Jenq-Neng Hwang, Gaoang Wang
Deep learning has the potential to revolutionize sports performance, with applications ranging from perception and comprehension to decision.
no code implementations • 29 Jun 2023 • Zhenyu Zhang, Wenhao Chai, Zhongyu Jiang, Tian Ye, Mingli Song, Jenq-Neng Hwang, Gaoang Wang
Estimating 3D human poses only from a 2D human pose sequence is thoroughly explored in recent years.
1 code implementation • 15 Jun 2023 • Yu Wang, Tongya Zheng, Shunyu Liu, Zunlei Feng, KaiXuan Chen, Yunzhi Hao, Mingli Song
The human mobility simulation task aims to generate human mobility trajectories given a small set of trajectory data, which have aroused much concern due to the scarcity and sparsity of human mobility data.
1 code implementation • 14 Jun 2023 • Shunyu Liu, Yunpeng Qing, Shuqi Xu, Hongyan Wu, Jiangtao Zhang, Jingyuan Cong, Tianhao Chen, YunFu Liu, Mingli Song
Inverse Reinforcement Learning (IRL) aims to reconstruct the reward function from expert demonstrations to facilitate policy learning, and has demonstrated its remarkable success in imitation learning.
1 code implementation • 5 Jun 2023 • Tongtian Zhu, Fengxiang He, KaiXuan Chen, Mingli Song, DaCheng Tao
Decentralized stochastic gradient descent (D-SGD) allows collaborative learning on massive devices simultaneously without the control of a central server.
no code implementations • 3 Jun 2023 • Wenda Li, KaiXuan Chen, Shunyu Liu, Wenjie Huang, Haofei Zhang, Yingjie Tian, Yun Su, Mingli Song
In this paper, we strive to develop an interpretable GNNs' inference paradigm, termed MSInterpreter, which can serve as a plug-and-play scheme readily applicable to various GNNs' baselines.
1 code implementation • 31 May 2023 • KaiXuan Chen, Shunyu Liu, Tongtian Zhu, Tongya Zheng, Haofei Zhang, Zunlei Feng, Jingwen Ye, Mingli Song
Graph Neural Networks (GNNs) have emerged as a powerful category of learning architecture for handling graph-structured data.
no code implementations • 29 May 2023 • Yucheng Liao, Yuanyu Wan, Chang Yao, Mingli Song
We investigate the problem of online learning with monotone and continuous DR-submodular reward functions, which has received great attention recently.
1 code implementation • 27 May 2023 • Yihe Zhou, Shunyu Liu, Yunpeng Qing, KaiXuan Chen, Tongya Zheng, Yanhao Huang, Jie Song, Mingli Song
Despite the encouraging results achieved, CTDE makes an independence assumption on agent policies, which limits agents to adopt global cooperative information from each other during centralized training.
Multi-agent Reinforcement Learning reinforcement-learning +3
no code implementations • 20 May 2023 • Yuanyu Wan, Chang Yao, Mingli Song, Lijun Zhang
Despite its simplicity, our novel analysis shows that the dynamic regret of DOGD can be automatically bounded by $O(\sqrt{\bar{d}T}(P_T+1))$ under mild assumptions, and $O(\sqrt{dT}(P_T+1))$ in the worst case, where $\bar{d}$ and $d$ denote the average and maximum delay respectively, $T$ is the time horizon, and $P_T$ is the path-length of comparators.
1 code implementation • 15 Apr 2023 • Tongya Zheng, Xinchao Wang, Zunlei Feng, Jie Song, Yunzhi Hao, Mingli Song, Xingen Wang, Xinyu Wang, Chun Chen
The whole temporal neighborhood of nodes reveals the varying preferences of nodes.
1 code implementation • 15 Apr 2023 • Tongya Zheng, Zunlei Feng, Tianli Zhang, Yunzhi Hao, Mingli Song, Xingen Wang, Xinyu Wang, Ji Zhao, Chun Chen
The proposed TIP-GNN focuses on the bilevel graph structure in temporal networks: besides the explicit interaction graph, a node's sequential interactions can also be constructed as a transition graph.
no code implementations • 11 Apr 2023 • Jiawei Chen, Lin Chen, Jiang Yang, Tianqi Shi, Lechao Cheng, Zunlei Feng, Mingli Song
In this study, we tackle the patch slimming problem from a different perspective by proposing a life regression module that determines the lifespan of each image patch in one go.
no code implementations • 10 Apr 2023 • Lin Chen, Zhijie Jia, Tian Qiu, Lechao Cheng, Jie Lei, Zunlei Feng, Mingli Song
In this work, we propose a new paradigm dubbed Decision Stream Calibration that boosts the performance of general Vision Transformers.
1 code implementation • 9 Apr 2023 • Wenxiang Xu, Yongcheng Jing, Linyun Zhou, Wenqi Huang, Lechao Cheng, Zunlei Feng, Mingli Song
This is specifically achieved by devising an elaborated ``prophetic'' teacher, termed as ``Propheter'', that aims to learn the potential class distributions.
1 code implementation • CVPR 2023 • Tianli Zhang, Mengqi Xue, Jiangtao Zhang, Haofei Zhang, Yu Wang, Lechao Cheng, Jie Song, Mingli Song
Most existing online knowledge distillation(OKD) techniques typically require sophisticated modules to produce diverse knowledge for improving students' generalization ability.
1 code implementation • 12 Mar 2023 • Haofei Zhang, Mengqi Xue, Xiaokang Liu, KaiXuan Chen, Jie Song, Mingli Song
In this paper, we study a novel inference paradigm, termed as schema inference, that learns to deductively infer the explainable predictions by rebuilding the prior deep neural network (DNN) forwarding scheme, guided by the prevalent philosophical cognitive concept of schema.
1 code implementation • 17 Feb 2023 • Zhijie Jia, Lin Chen, Kaiwen Hu, Lechao Cheng, Zunlei Feng, Mingli Song
Despite the remarkable progress in semantic segmentation tasks with the advancement of deep neural networks, existing U-shaped hierarchical typical segmentation networks still suffer from local misclassification of categories and inaccurate target boundaries.
2 code implementations • 15 Feb 2023 • Shenghao Hao, Peiyuan Liu, Yibing Zhan, Kaixun Jin, Zuozhu Liu, Mingli Song, Jenq-Neng Hwang, Gaoang Wang
Although cross-view multi-object tracking has received increased attention in recent years, existing datasets still have several issues, including 1) missing real-world scenarios, 2) lacking diverse scenes, 3) owning a limited number of tracks, 4) comprising only static cameras, and 5) lacking standard benchmarks, which hinder the investigation and comparison of cross-view tracking methods.
1 code implementation • 14 Feb 2023 • Tian Qiu, Linyun Zhou, Wenxiang Xu, Lechao Cheng, Zunlei Feng, Mingli Song
Recent proposed DETR variants have made tremendous progress in various scenarios due to their streamlined processes and remarkable performance.
no code implementations • 11 Feb 2023 • Yuanyu Wan, Lijun Zhang, Mingli Song
In this way, we first show that the dynamic regret bound of OFW can be improved to $O(\sqrt{T(V_T+1)})$ for smooth functions.
1 code implementation • CVPR 2023 • Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang
Structural pruning enables model acceleration by removing structurally-grouped parameters from neural networks.
no code implementations • 14 Jan 2023 • Zipeng Zhong, Jie Song, Zunlei Feng, Tiantao Liu, Lingxiang Jia, Shaolun Yao, Tingjun Hou, Mingli Song
Afterwards, we analyze these methods in terms of their mechanism and performance, and introduce popular evaluation metrics for them, in which we also provide a detailed comparison among representative methods on several public datasets.
no code implementations • CVPR 2023 • Shengxuming Zhang, Tianqi Shi, Yang Jiang, Xiuming Zhang, Jie Lei, Zunlei Feng, Mingli Song
The loopback between two branches enables the category label to supervise the cell locating branch to learn the locating ability for cancerous areas.
no code implementations • CVPR 2023 • Xiaotian Yu, Yang Jiang, Tianqi Shi, Zunlei Feng, Yuexuan Wang, Mingli Song, Li Sun
To address this problem, the proposed GSS alleviates the damage by switching the current gradient direction of each sample to a new direction selected from a gradient direction pool, which contains all-class gradient directions with different probabilities.
1 code implementation • ICCV 2023 • Qihan Huang, Mengqi Xue, Wenqi Huang, Haofei Zhang, Jie Song, Yongcheng Jing, Mingli Song
Part-prototype networks (e. g., ProtoPNet, ProtoTree, and ProtoPool) have attracted broad research interest for their intrinsic interpretability and comparable accuracy to non-interpretable counterparts.
1 code implementation • 5 Dec 2022 • Hui Su, Yue Ye, Wei Hua, Lechao Cheng, Mingli Song
In this work, we propose a simple yet effective sparse annotated semantic segmentation framework based on segformer, dubbed SASFormer, that achieves remarkable performance.
1 code implementation • 30 Nov 2022 • Siqi Fan, Fenghua Zhu, Zunlei Feng, Yisheng Lv, Mingli Song, Fei-Yue Wang
Pseudo supervision is regarded as the core idea in semi-supervised learning for semantic segmentation, and there is always a tradeoff between utilizing only the high-quality pseudo labels and leveraging all the pseudo labels.
1 code implementation • 23 Nov 2022 • Shunyu Liu, Yihe Zhou, Jie Song, Tongya Zheng, KaiXuan Chen, Tongtian Zhu, Zunlei Feng, Mingli Song
Value Decomposition (VD) aims to deduce the contributions of agents for decentralized policies in the presence of only global rewards, and has recently emerged as a powerful credit assignment paradigm for tackling cooperative Multi-Agent Reinforcement Learning (MARL) problems.
1 code implementation • 12 Nov 2022 • Yunpeng Qing, Shunyu Liu, Jie Song, Huiqiong Wang, Mingli Song
In this survey, we provide a comprehensive review of existing works on eXplainable RL (XRL) and introduce a new taxonomy where prior works are clearly categorized into model-explaining, reward-explaining, state-explaining, and task-explaining methods.
1 code implementation • 9 Oct 2022 • Rang Meng, Xianfeng Li, WeiJie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Mingli Song, Di Xie, ShiLiang Pu
Under this guidance, a novel Attention Diversification framework is proposed, in which Intra-Model and Inter-Model Attention Diversification Regularization are collaborated to reassign appropriate attention to diverse task-related features.
1 code implementation • 7 Sep 2022 • Haoling Li, Jie Song, Mengqi Xue, Haofei Zhang, Jingwen Ye, Lechao Cheng, Mingli Song
This survey aims to present a comprehensive review of NTs and attempts to identify how they enhance the model interpretability.
1 code implementation • 22 Aug 2022 • Mengqi Xue, Qihan Huang, Haofei Zhang, Lechao Cheng, Jie Song, Minghui Wu, Mingli Song
The global prototypes are adopted to provide the global view of objects to guide local prototypes to concentrate on the foreground while eliminating the influence of the background.
1 code implementation • 3 Aug 2022 • Hui Su, Yue Ye, Zhiwei Chen, Mingli Song, Lechao Cheng
Weakly supervised object localization is a challenging task which aims to localize objects with coarse annotations such as image categories.
1 code implementation • 27 Jul 2022 • Donglin Xie, Ruonan Yu, Gongfan Fang, Jie Song, Zunlei Feng, Xinchao Wang, Li Sun, Mingli Song
The goal of FedSA is to train a student model for a new task with the help of several decentralized teachers, whose pre-training tasks and data are different and agnostic.
no code implementations • 24 Jul 2022 • Yongcheng Jing, Yining Mao, Yiding Yang, Yibing Zhan, Mingli Song, Xinchao Wang, DaCheng Tao
To this end, we develop an elaborated GNN model with content and style local patches as the graph vertices.
1 code implementation • 24 Jul 2022 • Gaoang Wang, Yibing Zhan, Xinchao Wang, Mingli Song, Klara Nahrstedt
Anomaly detection aims at identifying deviant samples from the normal data distribution.
1 code implementation • 17 Jul 2022 • Jingwen Ye, Yifang Fu, Jie Song, Xingyi Yang, Songhua Liu, Xin Jin, Mingli Song, Xinchao Wang
Life-long learning aims at learning a sequence of tasks without forgetting the previously acquired knowledge.
1 code implementation • 8 Jul 2022 • Shunyu Liu, Jie Song, Yihe Zhou, Na Yu, KaiXuan Chen, Zunlei Feng, Mingli Song
In this work, we introduce a novel interactiOn Pattern disenTangling (OPT) method, to disentangle the entity interactions into interaction prototypes, each of which represents an underlying interaction pattern within a subgroup of the entities.
2 code implementations • 5 Jul 2022 • Shunyu Liu, KaiXuan Chen, Na Yu, Jie Song, Zunlei Feng, Mingli Song
Despite the promising results achieved, state-of-the-art interactive reinforcement learning schemes rely on passively receiving supervision signals from advisor experts, in the form of either continuous monitoring or pre-defined rules, which inevitably result in a cumbersome and expensive learning process.
1 code implementation • 25 Jun 2022 • Tongtian Zhu, Fengxiang He, Lan Zhang, Zhengyang Niu, Mingli Song, DaCheng Tao
Our theory indicates that the generalizability of D-SGD is positively correlated with the spectral gap, and can explain why consensus control in initial training phase can ensure better generalization.
1 code implementation • CVPR 2022 • Rang Meng, WeiJie Chen, Shicai Yang, Jie Song, Luojun Lin, Di Xie, ShiLiang Pu, Xinchao Wang, Mingli Song, Yueting Zhuang
In this paper, we introduce a simple framework, Slimmable Domain Adaptation, to improve cross-domain generalization with a weight-sharing model bank, from which models of different capacities can be sampled to accommodate different accuracy-efficiency trade-offs.
3 code implementations • CVPR 2022 • Binbin Chen, WeiJie Chen, Shicai Yang, Yunyi Xuan, Jie Song, Di Xie, ShiLiang Pu, Mingli Song, Yueting Zhuang
To remedy this issue, we present a novel label assignment mechanism for self-training framework, namely proposal self-assignment, which injects the proposals from student into teacher and generates accurate pseudo labels to match each proposal in the student model accordingly.
no code implementations • 22 May 2022 • Gaoang Wang, Mingli Song, Jenq-Neng Hwang
Multi-object tracking (MOT) aims to associate target objects across video frames in order to obtain entire moving trajectories.
1 code implementation • 12 May 2022 • KaiXuan Chen, Shunyu Liu, Na Yu, Rong Yan, Quan Zhang, Jie Song, Zunlei Feng, Mingli Song
As the topology of the power system is in the form of graph structure, graph neural network based representation learning is naturally suitable for learning the status of the power system.
1 code implementation • 7 May 2022 • Zunlei Feng, Tian Qiu, Sai Wu, Xiaotuan Jin, Zengliang He, Mingli Song, Huiqiong Wang
In this paper, we attempt to build a generalizable framework that emulates the humans' recognition mechanism in the image classification task, hoping to improve the classification performance on unseen categories with the support of annotations of other categories.
2 code implementations • 5 May 2022 • Jie Song, Ying Chen, Jingwen Ye, Mingli Song
Knowledge distillation (KD) has become a well established paradigm for compressing deep neural networks.
1 code implementation • 25 Mar 2022 • Jiacong Hu, Jing Gao, Jingwen Ye, Yang Gao, Xingen Wang, Zunlei Feng, Mingli Song
With the rapid development of deep learning, the increasing complexity and scale of parameters make training a new model increasingly resource-intensive.
1 code implementation • CVPR 2022 • Mengqi Xue, Haofei Zhang, Jie Song, Mingli Song
Continual learning is a longstanding research topic due to its crucial role in tackling continually arriving tasks.
1 code implementation • 22 Mar 2022 • Zipeng Zhong, Jie Song, Zunlei Feng, Tiantao Liu, Lingxiang Jia, Shaolun Yao, Min Wu, Tingjun Hou, Mingli Song
Chemical reaction prediction, involving forward synthesis and retrosynthesis prediction, is a fundamental problem in organic synthesis.
1 code implementation • 7 Mar 2022 • Haofei Zhang, Feng Mao, Mengqi Xue, Gongfan Fang, Zunlei Feng, Jie Song, Mingli Song
Moreover, the transformer-based students excel in learning amalgamated knowledge, as they have mastered heterogeneous detection tasks rapidly and achieved superior or at least comparable performance to those of the teachers in their specializations.
no code implementations • 16 Dec 2021 • Gengshi Han, Shunyu Liu, KaiXuan Chen, Na Yu, Zunlei Feng, Mingli Song
This paper proposes a controllable sample generation framework based on Conditional Tabular Generative Adversarial Network (CTGAN) to generate specified transient stability samples.
2 code implementations • 12 Dec 2021 • Gongfan Fang, Kanya Mo, Xinchao Wang, Jie Song, Shitao Bei, Haofei Zhang, Mingli Song
At the heart of our approach is a novel strategy to reuse the shared common features in training data so as to synthesize different data instances.
1 code implementation • 9 Dec 2021 • Zunlei Feng, Jiacong Hu, Sai Wu, Xiaotian Yu, Jie Song, Mingli Song
The aggregate gradient strategy is a versatile module for mainstream CNN classifiers.
1 code implementation • CVPR 2022 • Haofei Zhang, Jiarui Duan, Mengqi Xue, Jie Song, Li Sun, Mingli Song
Recently, vision Transformers (ViTs) are developing rapidly and starting to challenge the domination of convolutional neural networks (CNNs) in the realm of computer vision (CV).
no code implementations • 6 Dec 2021 • Qihan Huang, Haofei Zhang, Mengqi Xue, Jie Song, Mingli Song
Although few-shot learning and zero-shot learning have been extensively explored in the field of image classification, it is indispensable to design new methods for object detection in the data-scarce scenario since object detection has an additional challenging localization task.
1 code implementation • 5 Dec 2021 • Jingwen Ye, Yining Mao, Jie Song, Xinchao Wang, Cheng Jin, Mingli Song
In other words, all users may employ a model in SDB for inference, but only authorized users get access to KD from the model.
1 code implementation • 23 Nov 2021 • Tongya Zheng, Zunlei Feng, Yu Wang, Chengchao Shen, Mingli Song, Xingen Wang, Xinyu Wang, Chun Chen, Hao Xu
Our proposed Dynamic Preference Structure (DPS) framework consists of two stages: structure sampling and graph fusion.
2 code implementations • NeurIPS 2021 • Gongfan Fang, Yifan Bao, Jie Song, Xinchao Wang, Donglin Xie, Chengchao Shen, Mingli Song
Knowledge distillation~(KD) aims to craft a compact student model that imitates the behavior of a pre-trained teacher in a target domain.
1 code implementation • 29 Sep 2021 • KaiXuan Chen, Jie Song, Shunyu Liu, Na Yu, Zunlei Feng, Gengshi Han, Mingli Song
A DKEPool network de facto disassembles representation learning into two stages, structure learning and distribution learning.
no code implementations • ICCV 2021 • Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, DaCheng Tao
In this paper, we study a novel meta aggregation scheme towards binarizing graph neural networks (GNNs).
1 code implementation • ICCV 2021 • Gaoang Wang, Renshu Gu, Zuozhu Liu, Weijie Hu, Mingli Song, Jenq-Neng Hwang
In this paper, we try to explore the significance of motion patterns for vehicle tracking without appearance information.
1 code implementation • ICCV 2021 • Zheng Li, Jingwen Ye, Mingli Song, Ying Huang, Zhigeng Pan
However, existing pose distillation works rely on a heavy pre-trained estimator to perform knowledge transfer and require a complex two-stage learning procedure.
no code implementations • 1 Aug 2021 • Lechao Cheng, Zunlei Feng, Xinchao Wang, Ya Jie Liu, Jie Lei, Mingli Song
In this paper, we introduce a novel Reference semantic segmentation Network (Ref-Net) to conduct visual boundary knowledge translation.
1 code implementation • 1 Aug 2021 • Zunlei Feng, Lechao Cheng, Xinchao Wang, Xiang Wang, Yajie Liu, Xiangtong Du, Mingli Song
To this end, we propose a Translation Segmentation Network (Trans-Net), which comprises a segmentation network and two boundary discriminators.
1 code implementation • 1 Aug 2021 • Zunlei Feng, Zhonghua Wang, Xinchao Wang, Xiuming Zhang, Lechao Cheng, Jie Lei, Yuexuan Wang, Mingli Song
The diagnosis of MVI needs discovering the vessels that contain hepatocellular carcinoma cells and counting their number in each vessel, which depends heavily on experiences of the doctor, is largely subjective and time-consuming.
no code implementations • 28 Jul 2021 • Xin Gao, Zhenjiang Liu, Zunlei Feng, Chengji Shen, Kairi Ou, Haihong Tang, Mingli Song
Existing 2D image-based virtual try-on methods aim to transfer a target clothing image onto a reference person, which has two main disadvantages: cannot control the size and length precisely; unable to accurately estimate the user's figure in the case of users wearing thick clothes, resulting in inaccurate dressing effect.
no code implementations • CVPR 2021 • Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, DaCheng Tao
To this end, we propose an Event-based VSR framework (E-VSR), of which the key component is an asynchronous interpolation (EAI) module that reconstructs a high-frequency (HF) video stream with uniform and tiny pixel displacements between neighboring frames from an event stream.
1 code implementation • CVPR 2021 • Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, DaCheng Tao
In this paper, we study a novel knowledge transfer task in the domain of graph neural networks (GNNs).
no code implementations • CVPR 2021 • Jie Song, Haofei Zhang, Xinchao Wang, Mengqi Xue, Ying Chen, Li Sun, DaCheng Tao, Mingli Song
Knowledge distillation pursues a diminutive yet well-behaved student network by harnessing the knowledge learned by a cumbersome teacher model.
2 code implementations • 18 May 2021 • Gongfan Fang, Jie Song, Xinchao Wang, Chengchao Shen, Xingen Wang, Mingli Song
In this paper, we propose Contrastive Model Inversion~(CMI), where the data diversity is explicitly modeled as an optimizable objective, to alleviate the mode collapse issue.
1 code implementation • 10 May 2021 • Mengqi Xue, Jie Song, Xinchao Wang, Ying Chen, Xingen Wang, Mingli Song
Knowledge distillation (KD) has recently emerged as an efficacious scheme for learning compact deep neural networks (DNNs).
1 code implementation • CVPR 2021 • Chengchao Shen, Youtan Yin, Xinchao Wang, Xubin Li, Jie Song, Mingli Song
Based on the adversarial losses of the generator and discriminator, we categorize GANs into two classes, Symmetric GANs and Asymmetric GANs, and introduce a novel gradient decomposition method to unify the two, allowing us to train both classes in one stage and hence alleviate the training effort.
1 code implementation • 10 Jan 2021 • Yiding Yang, Xinchao Wang, Mingli Song, Junsong Yuan, DaCheng Tao
SPAGAN therefore allows for a more informative and intact exploration of the graph structure and further {a} more effective aggregation of information from distant neighbors into the center node, as compared to node-based GCN methods.
no code implementations • ICCV 2021 • Zunlei Feng, Zhonghua Wang, Xinchao Wang, Yining Mao, Thomas Li, Jie Lei, Yuexuan Wang, Mingli Song
The existing two unsupervised methods are prone to failure on degenerated samples.
no code implementations • ICCV 2021 • Ying Chen, Feng Mao, Jie Song, Xinchao Wang, Huiqiong Wang, Mingli Song
Neural trees aim at integrating deep neural networks and decision trees so as to bring the best of the two worlds, including representation learning from the former and faster inference from the latter.
2 code implementations • 9 Dec 2020 • Chengchao Shen, Xinchao Wang, Youtan Yin, Jie Song, Sihui Luo, Mingli Song
In this paper, we investigate the practical few-shot knowledge distillation scenario, where we assume only a few samples without human annotations are available for each category.
no code implementations • NeurIPS 2020 • Zunlei Feng, Yongming He, Xinchao Wang, Xin Gao, Jie Lei, Cheng Jin, Mingli Song
In this paper, we introduce the One-sample Guided Object Representation Disassembling (One-GORD) method, which only requires one annotated sample for each object category to learn disassembled object representation from unannotated images.
1 code implementation • 17 Oct 2020 • Shuai Xie, Zunlei Feng, Ying Chen, Songtao Sun, Chao Ma, Mingli Song
To deal with this problem, we propose a semantic Difficulty-awarE Active Learning (DEAL) network composed of two branches: the common segmentation branch and the semantic difficulty branch.
no code implementations • ECCV 2020 • Yiding Yang, Jiayan Qiu, Mingli Song, DaCheng Tao, Xinchao Wang
Prior gradient-based attribution-map methods rely on handcrafted propagation rules for the non-linear/activation layers during the backward pass, so as to produce gradients of the input and then the attribution map.
1 code implementation • NeurIPS 2020 • Yiding Yang, Zunlei Feng, Mingli Song, Xinchao Wang
In this paper, we introduce a novel graph convolutional network (GCN), termed as factorizable graph convolutional network(FactorGCN), that explicitly disentangles such intertwined relations encoded in a graph.
Ranked #3 on Node Classification on PATTERN 100k
no code implementations • 20 Aug 2020 • Xinhui Song, Tianyang Shi, Zunlei Feng, Mingli Song, Jackie Lin, Chuan-Jie Lin, Changjie Fan, Yi Yuan
Facial action unit (AU) intensity is an index to describe all visually discernible facial movements.
no code implementations • 10 Jul 2020 • Gongfan Fang, Xinchao Wang, Haofei Zhang, Jie Song, Mingli Song
This network is referred to as the {\emph{Template Network}} because its filters will be used as templates to reconstruct images from the impression.
no code implementations • 3 Apr 2020 • Zunlei Feng, Xinchao Wang, Yongming He, Yike Yuan, Xin Gao, Mingli Song
In this paper, we study a new representation-learning task, which we termed as disassembling object representations.
1 code implementation • CVPR 2020 • Yiding Yang, Jiayan Qiu, Mingli Song, DaCheng Tao, Xinchao Wang
To enable the knowledge transfer from the teacher GCN to the student, we propose a local structure preserving module that explicitly accounts for the topological semantics of the teacher.
no code implementations • CVPR 2020 • Jingwen Ye, Yixin Ji, Xinchao Wang, Xin Gao, Mingli Song
Then a dual generator is trained by taking the output from the former generator as input.
1 code implementation • CVPR 2020 • Jie Song, Yixin Chen, Jingwen Ye, Xinchao Wang, Chengchao Shen, Feng Mao, Mingli Song
In this paper, we propose the DEeP Attribution gRAph (DEPARA) to investigate the transferability of knowledge learned from PR-DNNs.
3 code implementations • 23 Dec 2019 • Gongfan Fang, Jie Song, Chengchao Shen, Xinchao Wang, Da Chen, Mingli Song
Knowledge Distillation (KD) has made remarkable progress in the last few years and become a popular paradigm for model compression and knowledge transfer.
no code implementations • 18 Dec 2019 • Da Chen, Yong-Liang Yang, Zunlei Feng, Xiang Wu, Mingli Song, Wenbin Li, Yuan He, Hui Xue, Feng Mao
This strategy leads to severe meta shift issues across multiple tasks, meaning the learned prototypes or class descriptors are not stable as each task only involves their own support set.
1 code implementation • 26 Nov 2019 • Ya Zhao, Rui Xu, Xinchao Wang, Peng Hou, Haihong Tang, Mingli Song
In this paper, we propose a new method, termed as Lip by Speech (LIBS), of which the goal is to strengthen lip reading by learning from speech recognizers.
Ranked #2 on Lipreading on CMLR
no code implementations • 16 Nov 2019 • Yongcheng Jing, Xiao Liu, Yukang Ding, Xinchao Wang, Errui Ding, Mingli Song, Shilei Wen
Prior normalization methods rely on affine transformations to produce arbitrary image style transfers, of which the parameters are computed in a pre-defined way.
2 code implementations • NeurIPS 2019 • Jie Song, Yixin Chen, Xinchao Wang, Chengchao Shen, Mingli Song
Exploring the transferability between heterogeneous tasks sheds light on their intrinsic interconnections, and consequently enables knowledge transfer from one task to another so as to reduce the training effort of the latter.
2 code implementations • ICCV 2019 • Chengchao Shen, Mengqi Xue, Xinchao Wang, Jie Song, Li Sun, Mingli Song
To this end, we introduce a dual-step strategy that first extracts the task-specific knowledge from the heterogeneous teachers sharing the same sub-task, and then amalgamates the extracted knowledge to build the student network.
no code implementations • 14 Aug 2019 • Ya Zhao, Rui Xu, Mingli Song
When trained on CMLR dataset, the proposed CSSMCM surpasses the performance of state-of-the-art lip reading frameworks, which confirms the effectiveness of explicit modeling of tones for Chinese Mandarin lip reading.
Ranked #3 on Lipreading on CMLR
2 code implementations • 24 Jun 2019 • Sihui Luo, Xinchao Wang, Gongfan Fang, Yao Hu, Dapeng Tao, Mingli Song
An increasing number of well-trained deep networks have been released online by researchers and developers, enabling the community to reuse them in a plug-and-play way without accessing the training annotations.
1 code implementation • 28 May 2019 • Jingwen Ye, Xinchao Wang, Yixin Ji, Kairi Ou, Mingli Song
Many well-trained Convolutional Neural Network(CNN) models have now been released online by developers for the sake of effortless reproducing.
1 code implementation • CVPR 2019 • Jingwen Ye, Yixin Ji, Xinchao Wang, Kairi Ou, Dapeng Tao, Mingli Song
In this paper, we investigate a novel deep-model reusing task.
1 code implementation • 7 Nov 2018 • Chengchao Shen, Xinchao Wang, Jie Song, Li Sun, Mingli Song
We propose in this paper to study a new model-reusing task, which we term as \emph{knowledge amalgamation}.
no code implementations • ECCV 2018 • Jie Song, Chengchao Shen, Jie Lei, An-Xiang Zeng, Kairi Ou, DaCheng Tao, Mingli Song
We propose a selective zero-shot classifier based on both the human defined and the automatically discovered residual attributes.
no code implementations • 13 Jun 2018 • Zunlei Feng, Zhenyun Yu, Yezhou Yang, Yongcheng Jing, Junxiao Jiang, Mingli Song
In the supervised attributes module, multiple attributes labels are adopted to ensure that different parts of the overall embedding correspond to different attributes.
1 code implementation • NeurIPS 2018 • Zunlei Feng, Xinchao Wang, Chenglong Ke, An-Xiang Zeng, DaCheng Tao, Mingli Song
To achieve disentangling using the labeled pairs, we follow a "encoding-swap-decoding" process, where we first swap the parts of their encodings corresponding to the shared attribute and then decode the obtained hybrid codes to reconstruct the original input pairs.
no code implementations • CVPR 2018 • Jie Song, Chengchao Shen, Yezhou Yang, Yang Liu, Mingli Song
Most existing Zero-Shot Learning (ZSL) methods have the strong bias problem, in which instances of unseen (target) classes tend to be categorized as one of the seen (source) classes.
1 code implementation • ECCV 2018 • Yongcheng Jing, Yang Liu, Yezhou Yang, Zunlei Feng, Yizhou Yu, DaCheng Tao, Mingli Song
In this paper, we present a stroke controllable style transfer network that can achieve continuous and spatial stroke size control.
no code implementations • 29 Jan 2018 • Sihui Luo, Yezhou Yang, Mingli Song
The same practice also enable the compressed code to carry the image semantic information during storage and transmission.
no code implementations • 14 Nov 2017 • Gongze Cao, Yezhou Yang, Jie Lei, Cheng Jin, Yang Liu, Mingli Song
As an effective way of metric learning, triplet loss has been widely used in many deep learning tasks, including face recognition and person-ReID, leading to many states of the arts.
8 code implementations • 11 May 2017 • Yongcheng Jing, Yezhou Yang, Zunlei Feng, Jingwen Ye, Yizhou Yu, Mingli Song
We first propose a taxonomy of current algorithms in the field of NST.
no code implementations • CVPR 2014 • Xiao Liu, DaCheng Tao, Mingli Song, Ying Ruan, Chun Chen, Jiajun Bu
In this paper, we present a novel nearest neighbor-based label transfer scheme for weakly supervised video segmentation.
no code implementations • CVPR 2014 • Xiao Liu, Mingli Song, DaCheng Tao, Xingchen Zhou, Chun Chen, Jiajun Bu
In this paper, to bridge the human appearance variations across cameras, two coupled dictionaries that relate to the gallery and probe cameras are jointly learned in the training phase from both labeled and unlabeled images.
no code implementations • CVPR 2013 • Xiao Liu, Mingli Song, DaCheng Tao, Zicheng Liu, Luming Zhang, Chun Chen, Jiajun Bu
Node splitting is an important issue in Random Forest but robust splitting requires a large number of training samples.
no code implementations • CVPR 2013 • Luming Zhang, Mingli Song, Zicheng Liu, Xiao Liu, Jiajun Bu, Chun Chen
Finally, we propose a novel image segmentation algorithm, called graphlet cut, that leverages the learned graphlet distribution in measuring the homogeneity of a set of spatially structured superpixels.
no code implementations • 17 May 2013 • Chengxi Ye, DaCheng Tao, Mingli Song, David W. Jacobs, Min Wu
Optimization-based filtering smoothes an image by minimizing a fidelity function and simultaneously preserves edges by exploiting a sparse norm penalty over gradients.
no code implementations • 20 May 2012 • Chengxi Ye, Yuxu Lin, Mingli Song, Chun Chen, David W. Jacobs
In this paper, we analyze image segmentation algorithms that are based on spectral graph theory, e. g., normalized cut, and show that there is a natural connection between spectural graph theory based image segmentationand and edge preserving filtering.