Attribute-specific Control Units in StyleGAN for Fine-grained Image Manipulation

no code implementations 25 Nov 2021 Rui Wang, Jian Chen, Gang Yu, Li Sun, Changqian Yu, Changxin Gao, Nong Sang

Image manipulation with StyleGAN has attracted increasing attention in recent years. Recent works have achieved tremendous success in analyzing several semantic latent spaces to edit the attributes of the generated images. However, due to the limited semantic and spatial manipulation precision in these latent spaces, existing approaches fall short in fine-grained StyleGAN image manipulation, i.e., local attribute translation. To address this issue, we discover attribute-specific control units, which consist of multiple channels of feature maps and modulation styles.

Artificial Neural Network and Its Application Research Progress in Chemical Process

no code implementations 18 Oct 2021 Li Sun, Fei Liang, Wutai Cui

Most chemical processes, such as distillation, absorption, extraction, and catalytic reactions, are extremely complex processes that are affected by multiple factors.

Can contrastive learning avoid shortcut solutions?

1 code implementation NeurIPS 2021 Joshua Robinson, Li Sun, Ke Yu, Kayhan Batmanghelich, Stefanie Jegelka, Suvrit Sra

However, we observe that the contrastive loss does not always sufficiently guide which features are extracted, a behavior that can negatively impact the performance on downstream tasks via "shortcuts", i.e., by inadvertently suppressing important predictive features.

Contrastive Learning

Tree-Like Decision Distillation

no code implementations CVPR 2021 Jie Song, Haofei Zhang, Xinchao Wang, Mengqi Xue, Ying Chen, Li Sun, DaCheng Tao, Mingli Song

Knowledge distillation pursues a diminutive yet well-behaved student network by harnessing the knowledge learned by a cumbersome teacher model.

Decision Making, Knowledge Distillation

DG-Font: Deformable Generative Networks for Unsupervised Font Generation

1 code implementation CVPR 2021 Yangchen Xie, Xinyuan Chen, Li Sun, Yue Lu

Font generation is a challenging problem, especially for writing systems that consist of a large number of characters, and has attracted a lot of attention in recent years.

Font Generation, Image-to-Image Translation

Hyperbolic Variational Graph Neural Network for Modeling Dynamic Graphs

no code implementations 6 Apr 2021 Li Sun, Zhongbao Zhang, Jiawei Zhang, Feiyang Wang, Hao Peng, Sen Su, Philip S. Yu

To model the uncertainty, we devise a hyperbolic graph variational autoencoder built upon the proposed TGNN to generate stochastic node representations of hyperbolic normal distributions.

Introspective Visuomotor Control: Exploiting Uncertainty in Deep Visuomotor Control for Failure Recovery

no code implementations 22 Mar 2021 Chia-Man Hung, Li Sun, Yizhe Wu, Ioannis Havoutis, Ingmar Posner

To recover from high uncertainty cases, the robot monitors its uncertainty along a trajectory and explores possible actions in the state-action space to bring itself to a more certain state.

Imitation Learning

ID-Unet: Iterative Soft and Hard Deformation for View Synthesis

2 code implementations CVPR 2021 Mingyu Yin, Li Sun, Qingli Li

View synthesis is usually done by an autoencoder, in which the encoder maps a source view image into a latent content code, and the decoder transforms it into a target view image according to the condition.

Adaptive Random Bandwidth for Inference in CAViaR Models

no code implementations 2 Feb 2021 Alain Hecq, Li Sun

This paper investigates the size performance of Wald tests for CAViaR models (Engle and Manganelli, 2004).

Density Estimation

Context Matters: Graph-based Self-supervised Representation Learning for Medical Images

1 code implementation 11 Dec 2020 Li Sun, Ke Yu, Kayhan Batmanghelich

Experiments on large-scale Computer Tomography (CT) datasets of lung images show that our approach compares favorably to baseline methods that do not account for the context.

Representation Learning, Self-Supervised Learning

Content-based Analysis of the Cultural Differences between TikTok and Douyin

no code implementations 3 Nov 2020 Li Sun, Haoqi Zhang, Songyang Zhang, Jiebo Luo

Short-form video social media shifts away from the traditional media paradigm by telling the audience a dynamic story to attract their attention.

Object Detection

Hierarchical Amortized Training for Memory-efficient High Resolution 3D GAN

no code implementations 5 Aug 2020 Li Sun, Junxiang Chen, Yanwu Xu, Mingming Gong, Ke Yu, Kayhan Batmanghelich

During training, we adopt a hierarchical structure that simultaneously generates a low-resolution version of the image and a randomly selected sub-volume of the high-resolution image.

Data Augmentation, Domain Adaptation, +4

Progressive Multi-stage Feature Mix for Person Re-Identification

1 code implementation 17 Jul 2020 Yan Zhang, Binyu He, Li Sun

In this work, we propose a Progressive Multi-stage feature Mix network (PMM), which enables the model to find out the more precise and diverse features in a progressive manner.

Person Re-Identification

Learning Posterior and Prior for Uncertainty Modeling in Person Re-Identification

no code implementations 17 Jul 2020 Yan Zhang, Zhilin Zheng, Binyu He, Li Sun

This paper proposes to learn the sample posterior and the class prior distribution in the latent space, so that not only representative features but also the uncertainty can be built by the model.

Person Re-Identification

Localising Faster: Efficient and precise lidar-based robot localisation in large-scale environments

no code implementations 4 Mar 2020 Li Sun, Daniel Adolfsson, Martin Magnusson, Henrik Andreasson, Ingmar Posner, Tom Duckett

More importantly, the Gaussian method (i.e. deep probabilistic localisation) and non-Gaussian method (i.e. MCL) can be integrated naturally via importance sampling.

Improving End-to-End Object Tracking Using Relational Reasoning

no code implementations ICLR 2020 Fabian B. Fuchs, Adam R. Kosiorek, Li Sun, Oiwi Parker Jones, Ingmar Posner

Relational reasoning, the ability to model interactions and relations between objects, is valuable for robust multi-object tracking and pivotal for trajectory prediction.

Multi-Object Tracking, Relational Reasoning, +1

Disentangling the Spatial Structure and Style in Conditional VAE

no code implementations 29 Oct 2019 Ziye Zhang, Li Sun, Zhilin Zheng, Qingli Li

Depending on whether the label is related to the spatial structure, the output $z_s$ from the condition mapping network is used as either a style code or a spatial structure code.

Imagine That! Leveraging Emergent Affordances for 3D Tool Synthesis

no code implementations 30 Sep 2019 Yizhe Wu, Sudhanshu Kasewa, Oliver Groth, Sasha Salter, Li Sun, Oiwi Parker Jones, Ingmar Posner

In this paper we explore the richness of information captured by the latent space of a vision-based generative model.

Imagine That! Leveraging Emergent Affordances for Tool Synthesis in Reaching Tasks

no code implementations 25 Sep 2019 Yizhe Wu, Sudhanshu Kasewa, Oliver Groth, Sasha Salter, Li Sun, Oiwi Parker Jones, Ingmar Posner

In this paper we investigate an artificial agent's ability to perform task-focused tool synthesis via imagination.

Customizing Student Networks From Heterogeneous Teachers via Adaptive Knowledge Amalgamation

1 code implementation ICCV 2019 Chengchao Shen, Mengqi Xue, Xinchao Wang, Jie Song, Li Sun, Mingli Song

To this end, we introduce a dual-step strategy that first extracts the task-specific knowledge from the heterogeneous teachers sharing the same sub-task, and then amalgamates the extracted knowledge to build the student network.

Disentangling Latent Space for VAE by Label Relevant/Irrelevant Dimensions

1 code implementation CVPR 2019 Zhilin Zheng, Li Sun

But different from CVAE, we present a method for disentangling the latent space into the label relevant and irrelevant dimensions, $\bm{\mathrm{z}}_s$ and $\bm{\mathrm{z}}_u$, for a single input.

Variational Inference

Amalgamating Knowledge towards Comprehensive Classification

1 code implementation 7 Nov 2018 Chengchao Shen, Xinchao Wang, Jie Song, Li Sun, Mingli Song

We propose in this paper to study a new model-reusing task, which we term knowledge amalgamation.

Classification, General Classification

Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

This study assesses the state-of-the-art machine learning (ML) methods used for brain tumor image analysis in mpMRI scans, during the last seven instances of the International Brain Tumor Segmentation (BraTS) challenge, i.e., 2012-2018.

Brain Tumor Segmentation, Survival Prediction, +1

Exploring Correlations in Multiple Facial Attributes through Graph Attention Network

1 code implementation 22 Oct 2018 Yan Zhang, Li Sun

Estimating multiple attributes from a single facial image gives a comprehensive description of the high-level semantics of the face.

Graph Attention, Multi-Task Learning

Recurrent-OctoMap: Learning State-based Map Refinement for Long-Term Semantic Mapping with 3D-Lidar Data

no code implementations 2 Jul 2018 Li Sun, Zhi Yan, Anestis Zaganidis, Cheng Zhao, Tom Duckett

Most existing semantic mapping approaches focus on improving semantic understanding of single frames, rather than 3D refinement of semantic maps (i.e. fusing semantic observations).

Learning monocular visual odometry with dense 3D mapping from dense 3D flow

no code implementations 6 Mar 2018 Cheng Zhao, Li Sun, Pulak Purkait, Tom Duckett, Rustam Stolkin

Dense 2D flow and a depth image are generated from monocular images by sub-networks, which are then used by a 3D flow associated layer in the L-VO network to generate dense 3D flow.

Monocular Visual Odometry

3DOF Pedestrian Trajectory Prediction Learned from Long-Term Autonomous Mobile Robot Deployment Data

no code implementations 30 Sep 2017 Li Sun, Zhi Yan, Sergi Molina Mellado, Marc Hanheide, Tom Duckett

Our approach, T-Pose-LSTM (Temporal 3DOF-Pose Long-Short-Term Memory), is trained using long-term data from real-world robot deployments and aims to learn context-dependent (environment- and time-specific) human activities.

Human Detection, Pedestrian Trajectory Prediction, +1

Dense RGB-D semantic mapping with Pixel-Voxel neural network

no code implementations 30 Sep 2017 Cheng Zhao, Li Sun, Pulak Purkait, Rustam Stolkin

For intelligent robotics applications, extending 3D mapping to 3D semantic mapping enables robots not only to localize themselves with respect to the scene's geometrical features but also to simultaneously understand the higher-level meaning of the scene contexts.

3D Reconstruction, Scene Understanding, +1

Single-Shot Clothing Category Recognition in Free-Configurations with Application to Autonomous Clothes Sorting

no code implementations 22 Jul 2017 Li Sun, Gerardo Aragon-Camarasa, Simon Rogers, Rustam Stolkin, J. Paul Siebert

Our visual feature is robust to deformable shapes and our approach is able to recognise the category of unknown clothing in unconstrained and random configurations.

Weakly-supervised DCNN for RGB-D Object Recognition in Real-World Applications Which Lack Large-scale Annotated Training Data

1 code implementation 19 Mar 2017 Li Sun, Cheng Zhao, Rustam Stolkin

We also propose a novel way to pretrain a DCNN for the depth modality, by training on virtual depth images projected from CAD models.

Object Recognition

A fully end-to-end deep learning approach for real-time simultaneous 3D reconstruction and material recognition

no code implementations 14 Mar 2017 Cheng Zhao, Li Sun, Rustam Stolkin

We present the results of experiments in which we trained our system to perform real-time 3D semantic reconstruction for 23 different materials in a real-world application.

3D Reconstruction, Material Recognition

Robot Vision Architecture for Autonomous Clothes Manipulation

no code implementations 18 Oct 2016 Li Sun, Gerardo Aragon-Camarasa, Simon Rogers, J. Paul Siebert

The experimental results show that the proposed dual-arm flattening using a stereo vision system remarkably outperforms single-arm flattening and the widely cited Kinect-based sensing system for dexterous manipulation tasks.

