Search Results for author: Xin Jin

Found 146 papers, 57 papers with code

FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion

no code implementations 11 Jun 2024 Li-Wen Chang, Wenlei Bao, Qi Hou, Chengquan Jiang, Ningxin Zheng, Yinmin Zhong, Xuanrun Zhang, Zuquan Song, Ziheng Jiang, Haibin Lin, Xin Jin, Xin Liu

Overall, it can achieve up to 1.24x speedups for training over Megatron-LM on a cluster of 128 GPUs with various GPU generations and interconnects, and up to 1.66x and 1.30x speedups for prefill and decoding inference over vLLM on a cluster with 8 GPUs with various GPU generations and interconnects.

Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis

1 code implementation 10 Jun 2024 Xin Jin, Pengyi Jiao, Zheng-Peng Duan, Xingchao Yang, Chun-Le Guo, Bo Ren, Chongyi Li

Volumetric rendering based methods, like NeRF, excel in HDR view synthesis from RAW images, especially for nighttime scenes.

2k Novel View Synthesis +1

Deciphering Human Mobility: Inferring Semantics of Trajectories with Large Language Models

no code implementations 30 May 2024 Yuxiao Luo, Zhongcai Cao, Xin Jin, Kang Liu, Ling Yin

We adopt spatio-temporal attributes enhanced data formatting (STFormat) and design a context-inclusive prompt, enabling LLMs to more effectively interpret and infer the semantics of trajectory data.

StarLKNet: Star Mixup with Large Kernel Networks for Palm Vein Identification

no code implementations 21 May 2024 Xin Jin, Hongyu Zhu, Mounîm A. El Yacoubi, Hongchao Liao, Huafeng Qin, Yun Jiang

To enable CNNs to capture comprehensive feature representations from palm-vein images, we explored the effect of convolutional kernel size on the performance of palm-vein identification networks and designed LaKNet, a network leveraging large kernel convolution and gating mechanism.

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

no code implementations 22 Apr 2024 Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra, Xiyang Dai, Allie Del Giorno, Gustavo de Rosa, Matthew Dixon, Ronen Eldan, Victor Fragoso, Dan Iter, Mei Gao, Min Gao, Jianfeng Gao, Amit Garg, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Jamie Huynh, Mojan Javaheripi, Xin Jin, Piero Kauffmann, Nikos Karampatziakis, Dongwoo Kim, Mahoud Khademi, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Yunsheng Li, Chen Liang, Lars Liden, Ce Liu, Mengchen Liu, Weishung Liu, Eric Lin, Zeqi Lin, Chong Luo, Piyush Madan, Matt Mazzola, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Swadheen Shukla, Xia Song, Masahiro Tanaka, Andrea Tupini, Xin Wang, Lijuan Wang, Chunyu Wang, Yu Wang, Rachel Ward, Guanhua Wang, Philipp Witte, Haiping Wu, Michael Wyatt, Bin Xiao, Can Xu, Jiahang Xu, Weijian Xu, Sonali Yadav, Fan Yang, Jianwei Yang, ZiYi Yang, Yifan Yang, Donghan Yu, Lu Yuan, Chengruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou

We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone.

Language Modelling

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

no code implementations 18 Apr 2024 Chao Jin, Zili Zhang, Xuanlin Jiang, Fangyue Liu, Xin Liu, Xuanzhe Liu, Xin Jin

We implement RAGCache and evaluate it on vLLM, a state-of-the-art LLM inference system and Faiss, a state-of-the-art vector database.

Retrieval

LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism

no code implementations 15 Apr 2024 Bingyang Wu, Shengyu Liu, Yinmin Zhong, Peng Sun, Xuanzhe Liu, Xin Jin

The context window of large language models (LLMs) is rapidly increasing, leading to a huge variance in resource usage between different requests as well as between different phases of the same request.

DreamLIP: Language-Image Pre-training with Long Captions

1 code implementation 25 Mar 2024 Kecheng Zheng, Yifei Zhang, Wei Wu, Fan Lu, Shuailei Ma, Xin Jin, Wei Chen, Yujun Shen

Motivated by this, we propose to dynamically sample sub-captions from the text label to construct multiple positive pairs, and introduce a grouping loss to match the embeddings of each sub-caption with its corresponding local image patches in a self-supervised manner.

Contrastive Learning Language Modelling +4

Large Language Models for Forecasting and Anomaly Detection: A Systematic Literature Review

no code implementations 15 Feb 2024 Jing Su, Chufeng Jiang, Xin Jin, Yuxin Qiao, Tingsong Xiao, Hongda Ma, Rong Wei, Zhi Jing, Jiajun Xu, Junhong Lin

This systematic literature review comprehensively examines the application of Large Language Models (LLMs) in forecasting and anomaly detection, highlighting the current state of research, inherent challenges, and prospective future directions.

Anomaly Classification Anomaly Detection +3

An Order-Complexity Aesthetic Assessment Model for Aesthetic-aware Music Recommendation

no code implementations 13 Feb 2024 Xin Jin, Wu Zhou, Jingyu Wang, Duo Xu, Yongsen Zheng

In order to improve the quality of AI music generation and further guide computer music production, synthesis, recommendation and other tasks, we use Birkhoff's aesthetic measure to design an aesthetic model, objectively measuring the aesthetic beauty of music, and form a recommendation list according to the aesthetic feeling of music.

Music Generation Music Recommendation

Decentralized Zeno-Free Event-Triggered Control For Multiple Networks Subject to Stochastic Network Delays and Poisson Pulsing Attacks

no code implementations 26 Jan 2024 Dandan Zhang, Xin Jin, Hongye Su

By designing the decentralized time-regularized (Zeno-free) event-triggered strategies for the state-feedback control law, this paper considers the stochastic stabilization of a class of networked control systems, where two sources of randomness exist in multiple decentralized networks that operate asynchronously and independently: the communication channels are constrained by the stochastic network delays and also by Poisson pulsing denial-of-service (Pp-DoS) attacks.

A Survey of Resource-efficient LLM and Multimodal Foundation Models

1 code implementation 16 Jan 2024 Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, QiPeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu

Large foundation models, including large language models (LLMs), vision transformers (ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine learning lifecycle, from training to deployment.

EmMixformer: Mix transformer for eye movement recognition

no code implementations 10 Jan 2024 Huafeng Qin, Hongyu Zhu, Xin Jin, Qun Song, Mounim A. El-Yacoubi, Xinbo Gao

To this end, we propose a mixed block consisting of three modules, transformer, attention Long short-term memory (attention LSTM), and Fourier transformer.

Multi-Prompts Learning with Cross-Modal Alignment for Attribute-based Person Re-Identification

no code implementations 28 Dec 2023 Yajing Zhai, Yawen Zeng, Zhiyong Huang, Zheng Qin, Xin Jin, Da Cao

Thereby, this paper explores the potential of using the generated multiple person attributes as prompts in ReID tasks with off-the-shelf (large) models for more accurate retrieval results.

Attribute Person Re-Identification +1

Inter-X: Towards Versatile Human-Human Interaction Analysis

no code implementations CVPR 2024 Liang Xu, Xintao Lv, Yichao Yan, Xin Jin, Shuwen Wu, Congsheng Xu, Yifan Liu, Yizhou Zhou, Fengyun Rao, Xingdong Sheng, Yunhui Liu, Wenjun Zeng, Xiaokang Yang

We also equip Inter-X with versatile annotations of more than 34K fine-grained human part-level textual descriptions, semantic interaction categories, interaction order, and the relationship and personality of the subjects.

Graphene: Infrastructure Security Posture Analysis with AI-generated Attack Graphs

no code implementations 20 Dec 2023 Xin Jin, Charalampos Katsis, Fan Sang, Jiahao Sun, Elisa Bertino, Ramana Rao Kompella, Ashish Kundu

In this paper, we propose Graphene, an advanced system designed to provide a detailed analysis of the security posture of computing infrastructures.

Adversarial AutoMixup

2 code implementations 19 Dec 2023 Huafeng Qin, Xin Jin, Yun Jiang, Mounim A. El-Yacoubi, Xinbo Gao

In this paper, we propose AdAutomixup, an adversarial automatic mixup augmentation approach that generates challenging samples to train a robust classifier for image classification, by alternately optimizing the classifier and the mixup sample generator.

Classification Image Classification

Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models

1 code implementation 15 Dec 2023 Xin Jin, Jonathan Larson, Weiwei Yang, Zhiqiang Lin

Binary code summarization, while invaluable for understanding code semantics, is challenging due to its labor-intensive nature.

Benchmarking Code Summarization +2

DTA: Distribution Transform-based Attack for Query-Limited Scenario

no code implementations 12 Dec 2023 Renyang Liu, Wei Zhou, Xin Jin, Song Gao, Yuanyu Wang, Ruxin Wang

In generating adversarial examples, the conventional black-box attack methods rely on sufficient feedback from the to-be-attacked models by repeatedly querying until the attack is successful, which usually results in thousands of trials during an attack.

Hard-label Attack

TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-Commerce

1 code implementation 8 Dec 2023 Tongxin Hu, Zhuang Li, Xin Jin, Lizhen Qu, Xin Zhang

Annually, e-commerce platforms incur substantial financial losses due to trademark infringements, making it crucial to identify and mitigate potential legal risks tied to merchant information registered to the platforms.

Legal Reasoning

Predicting Scores of Various Aesthetic Attribute Sets by Learning from Overall Score Labels

no code implementations 6 Dec 2023 Heng Huang, Xin Jin, Yaqi Liu, Hao Lou, Chaoen Xiao, Shuai Cui, Xinning Li, Dongqing Zou

Then, we define an aesthetic attribute contribution to describe the role of aesthetic attributes throughout an image and use it with the attribute scores and the overall scores to train our F2S model.

Attribute

Breathing Life into Faces: Speech-driven 3D Facial Animation with Natural Head Pose and Detailed Shape

no code implementations 31 Oct 2023 Wei Zhao, Yijun Wang, Tianyu He, Lianying Yin, Jianxin Lin, Xin Jin

To augment the richness of 3D facial animation, we construct a new 3D dataset with detailed shapes and learn to synthesize facial details in line with speech content.

RLLTE: Long-Term Evolution Project of Reinforcement Learning

2 code implementations 28 Sep 2023 Mingqi Yuan, Zequn Zhang, Yang Xu, Shihao Luo, Bo Li, Xin Jin, Wenjun Zeng

We present RLLTE: a long-term evolution, extremely modular, and open-source framework for reinforcement learning (RL) research and application.

Language Modelling Large Language Model +2

Oobleck: Resilient Distributed Training of Large Models Using Pipeline Templates

1 code implementation 15 Sep 2023 Insu Jang, Zhenning Yang, Zhen Zhang, Xin Jin, Mosharaf Chowdhury

Oobleck enables resilient distributed training of large DNN models with guaranteed fault tolerance.

Generalized Lightness Adaptation with Channel Selective Normalization

1 code implementation ICCV 2023 Mingde Yao, Jie Huang, Xin Jin, Ruikang Xu, Shenglong Zhou, Man Zhou, Zhiwei Xiong

Existing methods typically work well on their trained lightness conditions but perform poorly in unknown ones due to their limited generalization ability.

Image Retouching inverse tone mapping +3

Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey

1 code implementation 18 Aug 2023 Xin Li, Yulin Ren, Xin Jin, Cuiling Lan, Xingrui Wang, Wenjun Zeng, Xinchao Wang, Zhibo Chen

Image restoration (IR) has been an indispensable and challenging task in the low-level vision field, which strives to improve the subjective quality of images distorted by various forms of degradation.

Deblurring Image Restoration +2

Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model

1 code implementation ICCV 2023 Xin Jin, Jia-Wen Xiao, Ling-Hao Han, Chunle Guo, Xialei Liu, Chongyi Li, Ming-Ming Cheng

However, these methods are impeded by several critical limitations: a) the explicit calibration process is both labor- and time-intensive, b) challenge exists in transferring denoisers across different camera models, and c) the disparity between synthetic and real noise is exacerbated by digital gain.

Image Denoising

PPN: Parallel Pointer-based Network for Key Information Extraction with Complex Layouts

no code implementations 20 Jul 2023 Kaiwen Wei, Jie Yao, Jingyuan Zhang, Yangyang Kang, Fubang Zhao, Yating Zhang, Changlong Sun, Xin Jin, Xin Zhang

Firstly, the layout of existing datasets is relatively fixed and limited in the number of semantic entity categories, creating a significant gap between these datasets and the complex real-world scenarios.

Key Information Extraction

One at a Time: Progressive Multi-step Volumetric Probability Learning for Reliable 3D Scene Perception

no code implementations 22 Jun 2023 Bohan Li, Yasheng Sun, Jingxin Dong, Zheng Zhu, Jinming Liu, Xin Jin, Wenjun Zeng

Numerous studies have investigated the pivotal role of reliable 3D volume representation in scene perception tasks, such as multi-view stereo (MVS) and semantic scene completion (SSC).

Depth Estimation Representation Learning

EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model

no code implementations 20 Jun 2023 Lianying Yin, Yijun Wang, Tianyu He, Jinming Liu, Wei Zhao, Bohan Li, Xin Jin, Jianxin Lin

In this paper, we present a novel framework (EMoG) to tackle the above challenges with denoising diffusion models: 1) To alleviate the one-to-many problem, we incorporate emotion clues to guide the generation process, making the generation much easier; 2) To model joint correlation, we propose to decompose the difficult gesture generation into two sub-problems: joint correlation modeling and temporal dynamics modeling.

Denoising Gesture Generation

Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning

no code implementations 24 May 2023 Qi Wang, Junming Yang, Yunbo Wang, Xin Jin, Wenjun Zeng, Xiaokang Yang

Training offline reinforcement learning (RL) models using visual inputs poses two significant challenges, i.e., the overfitting problem in representation learning and the overestimation bias for expected future rewards.

Offline RL Reinforcement Learning (RL) +2

Fast Distributed Inference Serving for Large Language Models

no code implementations 10 May 2023 Bingyang Wu, Yinmin Zhong, Zili Zhang, Gang Huang, Xuanzhe Liu, Xin Jin

Based on the new semi information-agnostic setting of LLM inference, the scheduler leverages the input length information to assign an appropriate initial queue for each arrival job to join.

Blocking Management +1

Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven Prompts

no code implementations 4 May 2023 Ruoyu Feng, Jinming Liu, Xin Jin, Xiaohan Pan, Heming Sun, Zhibo Chen

For ICM, developing a unified codec to reduce information redundancy while empowering the compressed features to support various vision tasks is very important, which inevitably faces two core challenges: 1) How should the compression strategy be adjusted based on the downstream tasks?

Semantically Structured Image Compression via Irregular Group-Based Decoupling

no code implementations ICCV 2023 Ruoyu Feng, Yixin Gao, Xin Jin, Runsen Feng, Zhibo Chen

Nevertheless, they divide the input image into multiple rectangular regions according to semantics and ignore avoiding information interaction among them, causing waste of bitrate and distorted reconstruction of region boundaries.

Image Compression

Learned Focused Plenoptic Image Compression with Microimage Preprocessing and Global Attention

1 code implementation 30 Apr 2023 Kedeng Tong, Xin Jin, Yuqing Yang, Chen Wang, Jinshi Kang, Fan Jiang

Also, it achieves 18.73% bitrate saving and generates perceptually pleasant reconstructions compared to the state-of-the-art end-to-end image compression methods, which benefits the applications of focused plenoptic cameras greatly.

Image Compression

Dynamic Video Frame Interpolation with integrated Difficulty Pre-Assessment

no code implementations 25 Apr 2023 Ban Chen, Xin Jin, Youxin Chen, Longhai Wu, Jie Chen, Jayoon Koo, Cheul-hee Hahm

Extensive experiments show that easy samples pass through fast models while difficult samples are inferred with heavy models, and our proposed pipeline can improve the accuracy-efficiency trade-off for VFI.

Video Frame Interpolation

An Order-Complexity Model for Aesthetic Quality Assessment of Homophony Music Performance

no code implementations 23 Apr 2023 Xin Jin, Wu Zhou, Jinyu Wang, Duo Xu, Yiqing Rong, Jialin Sun

In order to guide the generation task of AI music performance, and to improve the performance effect of human performers, this paper uses Birkhoff's aesthetic measure to propose a method of objective measurement of beauty.

NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation

1 code implementation ICCV 2023 Baao Xie, Bohan Li, Zequn Zhang, Junting Dong, Xin Jin, Jingyu Yang, Wenjun Zeng

They are complementary -- the outer navigation identifies global-view semantic directions, and the inner refinement is dedicated to fine-grained attributes.

Disentanglement

Inpaint Anything: Segment Anything Meets Image Inpainting

1 code implementation 13 Apr 2023 Tao Yu, Runseng Feng, Ruoyu Feng, Jinming Liu, Xin Jin, Wenjun Zeng, Zhibo Chen

We are also very willing to help everyone share and promote new projects based on our Inpaint Anything (IA).

Image Inpainting

[CLS] Token is All You Need for Zero-Shot Semantic Segmentation

no code implementations 13 Apr 2023 Letian Wu, Wenyao Zhang, Tengping Jiang, Wankou Yang, Xin Jin, Wenjun Zeng

Based on that, we build upon the CLIP model as a backbone which we extend with a One-Way [CLS] token navigation from text to the visual branch that enables zero-shot dense prediction, dubbed ClsCLIP.

Few-Shot Semantic Segmentation Language Modelling +4

Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion

1 code implementation 24 Mar 2023 Bohan Li, Yasheng Sun, Zhujin Liang, Dalong Du, Zhuanghui Zhang, XiaoFeng Wang, Yunnan Wang, Xin Jin, Wenjun Zeng

However, due to the inherent representation gap between stereo geometry and BEV features, it is non-trivial to bridge them for the dense prediction task of SSC.

3D Semantic Scene Completion Hallucination +2

Understand Legal Documents with Contextualized Large Language Models

no code implementations 21 Mar 2023 Xin Jin, Yuchen Wang

The growth of pending legal cases in populous countries, such as India, has become a major issue.

Sentence

Learning Distortion Invariant Representation for Image Restoration from A Causality Perspective

2 code implementations CVPR 2023 Xin Li, Bingchen Li, Xin Jin, Cuiling Lan, Zhibo Chen

In this paper, we are the first to propose a novel training strategy for image restoration from the causality perspective, to improve the generalization ability of DNNs for unknown degradations.

counterfactual Image Restoration +2

QVRF: A Quantization-error-aware Variable Rate Framework for Learned Image Compression

6 code implementations 10 Mar 2023 Kedeng Tong, Yaojun Wu, Yue Li, Kai Zhang, Li Zhang, Xin Jin

In this paper, we present a Quantization-error-aware Variable Rate Framework (QVRF) that utilizes a univariate quantization regulator a to achieve wide-range variable rates within a single model.

Image Compression Quantization

TBFormer: Two-Branch Transformer for Image Forgery Localization

1 code implementation 25 Feb 2023 Yaqi Liu, Binbin Lv, Xin Jin, Xiaoyu Chen, Xiaokun Zhang

In this paper, we propose a Transformer-style network with two feature extraction branches for image forgery localization, named Two-Branch Transformer (TBFormer).

Decoder Vocal Bursts Valence Prediction

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving

2 code implementations 22 Feb 2023 Zhuohan Li, Lianmin Zheng, Yinmin Zhong, Vincent Liu, Ying Sheng, Xin Jin, Yanping Huang, Zhifeng Chen, Hao Zhang, Joseph E. Gonzalez, Ion Stoica

Model parallelism is conventionally viewed as a method to scale a single large deep learning model beyond the memory limits of a single device.

Stable Attribute Group Editing for Reliable Few-shot Image Generation

1 code implementation 1 Feb 2023 Guanqi Ding, Xinzhe Han, Shuhui Wang, Xin Jin, Dandan Tu, Qingming Huang

SAGE makes use of all given few-shot images and estimates a class center embedding based on the category-relevant attribute dictionary.

Attribute Classification +1

Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning

1 code implementation 26 Jan 2023 Mingqi Yuan, Bo Li, Xin Jin, Wenjun Zeng

We present AIRS: Automatic Intrinsic Reward Shaping that intelligently and adaptively provides high-quality intrinsic rewards to enhance exploration in reinforcement learning (RL).

Benchmarking reinforcement-learning +1

An Order-Complexity Model for Aesthetic Quality Assessment of Symbolic Homophony Music Scores

no code implementations 14 Jan 2023 Xin Jin, Wu Zhou, Jinyu Wang, Duo Xu, Yiqing Rong, Shuai Cui

Computational aesthetics evaluation has made great achievements in the field of visual arts, but the research work on music still needs to be explored.

Music Generation

Discrete Point-wise Attack Is Not Enough: Generalized Manifold Adversarial Attack for Face Recognition

1 code implementation CVPR 2023 Qian Li, Yuxiao Hu, Ye Liu, Dongxiao Zhang, Xin Jin, Yuntian Chen

Classical adversarial attacks for Face Recognition (FR) models typically generate discrete examples for target identity with a single state image.

Adversarial Attack Data Augmentation +1

GAS-NeXt: Few-Shot Cross-Lingual Font Generator

1 code implementation 6 Dec 2022 Haoyang He, Xin Jin, Angela Chen

Generating new fonts is a time-consuming and labor-intensive task, especially in a language with a huge number of characters like Chinese.

Decoder Font Generation +1

Task Residual for Tuning Vision-Language Models

1 code implementation CVPR 2023 Tao Yu, Zhihe Lu, Xin Jin, Zhibo Chen, Xinchao Wang

Large-scale vision-language models (VLMs) pre-trained on billion-level data have learned general visual representations and broad visual concepts.

Transfer Learning

A Unified Pyramid Recurrent Network for Video Frame Interpolation

1 code implementation CVPR 2023 Xin Jin, Longhai Wu, Jie Chen, Youxin Chen, Jayoon Koo, Cheul-hee Hahm

Cast in a flexible pyramid framework, UPR-Net exploits lightweight recurrent modules for both bi-directional flow estimation and intermediate frame synthesis.

Optical Flow Estimation Video Frame Interpolation

Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning

no code implementations 19 Sep 2022 Mingqi Yuan, Bo Li, Xin Jin, Wenjun Zeng

Exploration is critical for deep reinforcement learning in complex environments with high-dimensional observations and sparse rewards.

Atari Games Benchmarking +3

Deliberated Domain Bridging for Domain Adaptive Semantic Segmentation

1 code implementation 16 Sep 2022 Lin Chen, Zhixiang Wei, Xin Jin, Huaian Chen, Miao Zheng, Kai Chen, Yi Jin

In this work, we resort to data mixing to establish a deliberated domain bridging (DDB) for DASS, through which the joint distributions of source and target domains are aligned and interact with each other in the intermediate space.

Knowledge Distillation Semantic Segmentation +3

Aesthetics Driven Autonomous Time-Lapse Photography Generation by Virtual and Real Robots

no code implementations 22 Aug 2022 Xiaobo Gao, Qi Kuang, Xin Jin, Bin Zhou, Boyan Dong, Xunyu Wang

Then we propose a time-lapse photography interface that lets users view and adjust parameters and use virtual robots to conduct virtual photography in a three-dimensional scene.

Hierarchical Compositional Representations for Few-shot Action Recognition

no code implementations 19 Aug 2022 Changzhen Li, Jie Zhang, Shuzhe Wu, Xin Jin, Shiguang Shan

Recently action recognition has received more and more attention for its comprehensive and practical applications in intelligent surveillance and human-computer interaction.

Few-Shot action recognition Few Shot Action Recognition

Underwater Ranker: Learn Which Is Better and How to Be Better

1 code implementation 14 Aug 2022 Chunle Guo, Ruiqi Wu, Xin Jin, Linghao Han, Zhi Chai, Weidong Zhang, Chongyi Li

To achieve that, we also contribute a dataset, URankerSet, containing sufficient results enhanced by different UIE algorithms and the corresponding perceptual rankings, to train our URanker.

Image Quality Assessment UIE

Aesthetic Visual Question Answering of Photographs

no code implementations 10 Aug 2022 Xin Jin, Wu Zhou, Xinghui Zhou, Shuai Cui, Le Zhang, Jianwen Lv, Shu Zhao

In this paper, we propose a new task of aesthetic language assessment: aesthetic visual question and answering (AVQA) of images.

Question Answering Sentiment Analysis +1

Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2

no code implementations 9 Aug 2022 Xinghui Zhou, Xin Jin, Jianwen Lv, Heng Huang, Ming Mao, Shuai Cui

In this paper, we propose aesthetic attribute assessment, which is aesthetic attribute captioning, i.e., assessing aesthetic attributes such as composition, lighting usage and color arrangement.

Attribute Image Captioning

Aesthetic Language Guidance Generation of Images Using Attribute Comparison

no code implementations 9 Aug 2022 Xin Jin, Qiang Deng, Jianwen Lv, Heng Huang, Hao Lou, Chaoen Xiao

The differences of the three attributes between the input images and the photography templates or the guidance images are described in natural language, which is aesthetic natural language guidance (ALG).

Attribute

Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics Driven Reinforcement Learning

no code implementations 9 Aug 2022 Xin Jin, Shu Zhao, Le Zhang, Xin Zhao, Qiang Deng, Chaoen Xiao

In recent years, image generation has made great strides in improving the quality of images, producing high-fidelity ones.

Attribute Face Generation +3

Learning with Recoverable Forgetting

1 code implementation 17 Jul 2022 Jingwen Ye, Yifang Fu, Jie Song, Xingyi Yang, Songhua Liu, Xin Jin, Mingli Song, Xinchao Wang

Life-long learning aims at learning a sequence of tasks without forgetting the previously acquired knowledge.

General Knowledge Transfer Learning

Image Coding for Machines with Omnipotent Feature Learning

no code implementations 5 Jul 2022 Ruoyu Feng, Xin Jin, Zongyu Guo, Runsen Feng, Yixin Gao, Tianyu He, Zhizheng Zhang, Simeng Sun, Zhibo Chen

Learning a kind of feature that is both general (for AI tasks) and compact (for compression) is pivotal for its success.

Self-Supervised Learning

Aesthetic Attribute Assessment of Images Numerically on Mixed Multi-attribute Datasets

3 code implementations 5 Jul 2022 Xin Jin, Xinning Li, Hao Lou, Chenyu Fan, Qiang Deng, Chaoen Xiao, Shuai Cui, Amit Kumar Singh

Besides, we propose an efficient method for image aesthetic attribute assessment on mixed multi-attribute datasets and construct a multitasking network architecture by using EfficientNet-B0 as the backbone network.

Attribute

A perspective on Attitude Control Issues and Techniques

no code implementations 30 Jun 2022 Dandan Zhang, Xin Jin, Hongye Su

This paper reviews the attitude control problems for rigid-body systems, starting from the attitude representation for rigid body kinematics.

Short Video Uprising: How #BlackLivesMatter Content on TikTok Challenges the Protest Paradigm

no code implementations 20 Jun 2022 Yanru Jiang, Xin Jin, Qinhao Deng

This study concludes that while short-form video platforms could potentially challenge the protest paradigm on the content creators' side, the audiences' preference as measured by social media visibility might still be moderately associated with the protest paradigm.

Descriptive

Edge Security: Challenges and Issues

no code implementations 14 Jun 2022 Xin Jin, Charalampos Katsis, Fan Sang, Jiahao Sun, Ashish Kundu, Ramana Kompella

Edge computing is a paradigm that shifts data processing services to the network edge, where data are generated.

Edge-computing

Reusing the Task-specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation

1 code implementation CVPR 2022 Lin Chen, Huaian Chen, Zhixiang Wei, Xin Jin, Xiao Tan, Yi Jin, Enhong Chen

Such NWD can be coupled with the classifier to serve as a discriminator satisfying the K-Lipschitz constraint without the requirements of additional weight clipping or gradient penalty strategy.

Unsupervised Domain Adaptation

Unsupervised Coherent Video Cartoonization with Perceptual Motion Consistency

1 code implementation 2 Apr 2022 Zhenhuan Liu, Liang Li, Huajie Jiang, Xin Jin, Dandan Tu, Shuhui Wang, Zheng-Jun Zha

Furthermore, we devise the spatio-temporal correlative map as a style-independent, global-aware regularization on the perceptual motion consistency.

Decoder Optical Flow Estimation +1

ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion Generation

no code implementations ICCV 2023 Liang Xu, Ziyang Song, Dongliang Wang, Jing Su, Zhicheng Fang, Chenjing Ding, Weihao Gan, Yichao Yan, Xin Jin, Xiaokang Yang, Wenjun Zeng, Wei Wu

We present a GAN-based Transformer for general action-conditioned 3D human motion generation, including not only single-person actions but also multi-person interactive actions.

Robust Event Triggering Control for Lateral Dynamics of Intelligent Vehicles with Designable Inter-event Times

no code implementations 14 Mar 2022 Xing Chu, Zhi Liu, Lei Mao, Xin Jin, Zhaoxia Peng, Guoguang Wen

In this brief, an improved event-triggered update mechanism (ETM) for the linear quadratic regulator is proposed to solve the lateral motion control problem of intelligent vehicles under bounded disturbances.

SADN: Learned Light Field Image Compression with Spatial-Angular Decorrelation

2 code implementations 22 Feb 2022 Kedeng Tong, Xin Jin, Chen Wang, Fan Jiang

Light field images have become one of the most promising media types for immersive video applications.

Image Compression

Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks

no code implementations 25 Jan 2022 Xin Jin, Ruoyu Feng, Simeng Sun, Runsen Feng, Tianyu He, Zhibo Chen

Traditional media coding schemes typically encode image/video into a semantic-unknown binary stream, which fails to directly support downstream intelligent tasks at the bitstream level.

Action Recognition Object +8

Pseudo-labelling and Meta Reweighting Learning for Image Aesthetic Quality Assessment

no code implementations 8 Jan 2022 Xin Jin, Hao Lou, Huang Heng, XiaoDong Li, Shuai Cui, Xiaokun Zhang, Xiqiao Li

In the tasks of image aesthetic quality evaluation, it is difficult to reach both the high score area and low score area due to the normal distribution of aesthetic datasets.

Binary Classification Classification +1

Unleashing Potential of Unsupervised Pre-Training With Intra-Identity Regularization for Person Re-Identification

no code implementations CVPR 2022 Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao

During the pre-training, we attempt to address two critical issues for learning fine-grained ReID features: (1) the augmentations in the CL pipeline may distort the discriminative clues in person images.

Contrastive Learning Person Re-Identification +2

Unleashing the Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification

1 code implementation 1 Dec 2021 Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao

During the pre-training, we attempt to address two critical issues for learning fine-grained ReID features: (1) the augmentations in the CL pipeline may distort the discriminative clues in person images.

Contrastive Learning Person Re-Identification +2

Confounder Identification-free Causal Visual Feature Learning

no code implementations 26 Nov 2021 Xin Li, Zhizheng Zhang, Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Xin Jin, Zhibo Chen

In this paper, we propose a novel Confounder Identification-free Causal Visual Feature Learning (CICF) method, which obviates the need for identifying confounders.

Domain Generalization Meta-Learning

A Close Look at Few-shot Real Image Super-resolution from the Distortion Relation Perspective

no code implementations 25 Nov 2021 Xin Li, Xin Jin, Jun Fu, Xiaoyuan Yu, Bei Tong, Zhibo Chen

Under this brand-new scenario, we propose Distortion Relation guided Transfer Learning (DRTL) for the few-shot RealSR by transferring the rich restoration knowledge from auxiliary distortions (i.e., synthetic distortions) to the target RealSR under the guidance of distortion relation.

Image Restoration Image Super-Resolution +4

Meta Clustering Learning for Large-scale Unsupervised Person Re-identification

no code implementations 19 Nov 2021 Xin Jin, Tianyu He, Xu Shen, Tongliang Liu, Xinchao Wang, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua

Unsupervised Person Re-identification (U-ReID) with pseudo labeling has recently reached competitive performance compared to fully-supervised ReID methods based on modern clustering algorithms.

Clustering Unsupervised Person Re-Identification

MC-LCR: Multi-modal contrastive classification by locally correlated representations for effective face forgery detection

no code implementations 7 Oct 2021 Gaojian Wang, Qian Jiang, Xin Jin, Wei Li, Xiaohui Cui

Moreover, we make a key observation that subtle forgery artifacts can be further exposed in the patch-wise phase and amplitude spectrum and exhibit different clues.

Unleash the Potential of Adaptation Models via Dynamic Domain Labels

no code implementations 29 Sep 2021 Xin Jin, Tianyu He, Xu Shen, Songhua Wu, Tongliang Liu, Xinchao Wang, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua

In this paper, we propose an embarrassingly simple yet highly effective adversarial domain adaptation (ADA) method for effectively training models for alignment.

Domain Adaptation Memorization

A HYPOTHESIS FOR THE COGNITIVE DIFFICULTY OF IMAGES

no code implementations 29 Sep 2021 Xu Cheng, Xin Wang, Haotian Xue, Zhengyang Liang, Xin Jin, Quanshi Zhang

This paper proposes a hypothesis to analyze the underlying reason for the cognitive difficulty of an image from two perspectives, i.e., a cognitive image usually makes a DNN strongly activated by cognitive concepts; discarding massive non-cognitive concepts may also help the DNN focus on cognitive concepts.

Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies

1 code implementation 14 Aug 2021 Xin Jin, Zhonglan Li, Ke Liu, Dongqing Zou, XiaoDong Li, Xingfan Zhu, Ziyin Zhou, Qilong Sun, Qingyu Liu

The classification sub-module classifies images according to era, nationality and garment type; the parsing sub-network supplies the semantics of person contours, clothing and background in the image to achieve more accurate colorization of clothes and persons and to prevent color overflow.

Classification Colorization +2

Can we imitate the principal investor's behavior to learn option price?

no code implementations 24 May 2021 Xin Jin

This paper presents a framework for imitating the principal investor's behavior for optimal pricing and hedging of options.

Decision Making Time Series +1

Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization

1 code implementation CVPR 2022 Xin Jin, Tianyu He, Kecheng Zheng, Zhiheng Yin, Xu Shen, Zhen Huang, Ruoyu Feng, Jianqiang Huang, Xian-Sheng Hua, Zhibo Chen

Specifically, we introduce gait recognition as an auxiliary task to drive the image ReID model to learn cloth-agnostic representations by leveraging personal unique and cloth-independent gait information. We name this framework GI-ReID.

Cloth-Changing Person Re-Identification Computational Efficiency +1

Re-energizing Domain Discriminator with Sample Relabeling for Adversarial Domain Adaptation

no code implementations ICCV 2021 Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen

Many unsupervised domain adaptation (UDA) methods exploit domain adversarial training to align the features to reduce domain gap, where a feature extractor is trained to fool a domain discriminator in order to have aligned feature distributions.

Unsupervised Domain Adaptation

Local Patch AutoAugment with Multi-Agent Collaboration

2 code implementations 20 Mar 2021 Shiqi Lin, Tao Yu, Ruoyu Feng, Xin Li, Xin Jin, Zhibo Chen

We formulate it as a multi-agent reinforcement learning (MARL) problem, where each agent learns an augmentation policy for each patch based on its content together with the semantics of the whole image.

Data Augmentation Fine-Grained Image Recognition +2

Dense Interaction Learning for Video-based Person Re-identification

no code implementations ICCV 2021 Tianyu He, Xin Jin, Xu Shen, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua

The CNN encoder is responsible for efficiently extracting discriminative spatial features while the DI decoder is designed to densely model spatial-temporal inherent interaction across frames.

Decoder Video-Based Person Re-Identification

Style Normalization and Restitution for Domain Generalization and Adaptation

1 code implementation 3 Jan 2021 Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen

In this paper, we design a novel Style Normalization and Restitution module (SNR) to simultaneously ensure both high generalization and discrimination capability of the networks.

Disentanglement Domain Generalization +4

Learned Block-based Hybrid Image Compression

no code implementations 17 Dec 2020 Yaojun Wu, Xin Li, Zhizheng Zhang, Xin Jin, Zhibo Chen

Recent works on learned image compression perform encoding and decoding processes in a full-resolution manner, resulting in two problems when deployed for practical applications.

Blocking Image Compression +2

Learning Omni-frequency Region-adaptive Representations for Real Image Super-Resolution

no code implementations 11 Dec 2020 Xin Li, Xin Jin, Tao Yu, Yingxue Pang, Simeng Sun, Zhizheng Zhang, Zhibo Chen

Traditional single image super-resolution (SISR) methods that focus on solving single and uniform degradation (i.e., bicubic down-sampling) typically suffer from poor performance when applied to real-world low-resolution (LR) images due to the complicated realistic degradations.

Image Super-Resolution

Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment

1 code implementation 4 Nov 2020 Qi Kuang, Xin Jin, Qinping Zhao, Bin Zhou

Our model can judge whether a UAV video was shot by professional photographers or amateurs together with the scene type classification.

Video Classification

A Deep Drift-Diffusion Model for Image Aesthetic Score Distribution Prediction

no code implementations 15 Oct 2020 Xin Jin, Xiqiao Li, Heng Huang, XiaoDong Li, Xinghui Zhou

In this paper, we propose a Deep Drift-Diffusion (DDD) model inspired by psychologists to predict aesthetic score distribution from images.

Binary Classification

FAN: Frequency Aggregation Network for Real Image Super-resolution

no code implementations 30 Sep 2020 Yingxue Pang, Xin Li, Xin Jin, Yaojun Wu, Jianzhao Liu, Sen Liu, Zhibo Chen

Specifically, we extract different frequencies of the LR image and pass them to a channel attention-grouped residual dense network (CA-GRDB) individually to output corresponding feature maps.

Image Super-Resolution SSIM

On Efficient Constructions of Checkpoints

no code implementations ICML 2020 Yu Chen, Zhenming Liu, Bin Ren, Xin Jin

Efficient construction of checkpoints/snapshots is a critical tool for training and diagnosing deep learning models.

Quantization

Hierarchical Context Embedding for Region-based Object Detection

no code implementations ECCV 2020 Zhao-Min Chen, Xin Jin, Borui Zhao, Xiu-Shen Wei, Yanwen Guo

To address this issue, we present a simple but effective Hierarchical Context Embedding (HCE) framework, which can be applied as a plug-and-play component, to facilitate the classification ability of a series of region-based detectors by mining contextual cues.

Object object-detection +1

Feature Alignment and Restoration for Domain Generalization and Adaptation

no code implementations 22 Jun 2020 Xin Jin, Cuiling Lan, Wen-Jun Zeng, Zhibo Chen

To ensure high discrimination, we propose a Feature Restoration (FR) operation to distill task-relevant features from the residual information and use them to compensate for the aligned features.

Disentanglement Domain Generalization +1

Is Network the Bottleneck of Distributed Training?

1 code implementation 17 Jun 2020 Zhen Zhang, Chaokun Chang, Haibin Lin, Yida Wang, Raman Arora, Xin Jin

As such, we advocate that the real challenge of distributed training is for the network community to develop high-performance network transport to fully utilize the network capacity and achieve linear scale-out.

Global Distance-distributions Separation for Unsupervised Person Re-identification

no code implementations ECCV 2020 Xin Jin, Cuiling Lan, Wen-Jun Zeng, Zhibo Chen

To address this problem, we introduce a global distance-distributions separation (GDS) constraint over the two distributions to encourage the clear separation of positive and negative samples from a global view.

Domain Adaptation POS +1

On Construction of the ASR-oriented Indian English Pronunciation Dictionary

no code implementations LREC 2020 Xian Huang, Xin Jin, Qike Li, Keliang Zhang

An Automatic Speech Recognition (ASR) system simply trained on British English (BE)/American English (AE) speech data and using the BE/AE pronunciation dictionary performs much worse when applied to IE.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Exploring Categorical Regularization for Domain Adaptive Object Detection

1 code implementation CVPR 2020 Chang-Dong Xu, Xing-Ran Zhao, Xin Jin, Xiu-Shen Wei

Specifically, by integrating an image-level multi-label classifier upon the detection backbone, we can obtain the sparse but crucial image regions corresponding to categorical information, thanks to the weak localization ability of the classification manner.

Domain Adaptation Object +2

Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification

no code implementations 15 Jan 2020 Xin Jin, Cuiling Lan, Wen-Jun Zeng, Zhibo Chen

To the best of our knowledge, we are the first to make use of multi-shots of an object in a teacher-student learning manner for effectively boosting the single image based re-id.

Knowledge Distillation Object

Region Normalization for Image Inpainting

1 code implementation 23 Nov 2019 Tao Yu, Zongyu Guo, Xin Jin, Shilin Wu, Zhibo Chen, Weiping Li, Zhizheng Zhang, Sen Liu

In this work, we show that the mean and variance shifts caused by full-spatial FN limit the image inpainting network training and we propose a spatial region-wise normalization named Region Normalization (RN) to overcome the limitation.

Image Inpainting

Aesthetic Attributes Assessment of Images

2 code implementations 11 Jul 2019 Xin Jin, Le Wu, Geng Zhao, Xiao-Dong Li, Xiaokun Zhang, Shiming Ge, Dongqing Zou, Bin Zhou, Xinghui Zhou

This is a new formulation of image aesthetic assessment, which predicts aesthetic attribute captions together with the aesthetic score of each attribute.

Attribute Image Captioning +1

Facial Makeup Transfer Combining Illumination Transfer

no code implementations 8 Jul 2019 Xin Jin, Rui Han, Ning Ning, Xiao-Dong Li, Xiaokun Zhang

To meet women's appearance needs, we present a novel virtual experience approach of facial makeup transfer, developed into Windows platform application software.

Facial Makeup Transfer

A Survey and Experimental Analysis of Distributed Subgraph Matching

1 code implementation 27 Jun 2019 Longbin Lai, Zhu Qing, Zhengyi Yang, Xin Jin, Zhengmin Lai, Ran Wang, Kongzhang Hao, Xuemin Lin, Lu Qin, Wenjie Zhang, Ying Zhang, Zhengping Qian, Jingren Zhou

We conduct extensive experiments for both unlabelled matching and labelled matching to analyze the performance of distributed subgraph matching under various settings, which is finally summarized as a practical guide.

Databases

Semantics-Aligned Representation Learning for Person Re-identification

1 code implementation 30 May 2019 Xin Jin, Cuiling Lan, Wen-Jun Zeng, Guoqiang Wei, Zhibo Chen

Specifically, we build a Semantics Aligning Network (SAN) which consists of a base network as encoder (SA-Enc) for re-ID, and a decoder (SA-Dec) for reconstructing/regressing the densely semantics aligned full texture image.

Decoder Person Re-Identification +2

Harmonia: Near-Linear Scalability for Replicated Storage with In-Network Conflict Detection

no code implementations 18 Apr 2019 Hang Zhu, Zhihao Bai, Jialin Li, Ellis Michael, Dan Ports, Ion Stoica, Xin Jin

Experimental results show that Harmonia improves the throughput of these protocols by up to 10X for a replication factor of 10, providing near-linear scalability up to the limit of our testbed.

Distributed, Parallel, and Cluster Computing

Relation-Aware Global Attention for Person Re-identification

1 code implementation CVPR 2020 Zhizheng Zhang, Cuiling Lan, Wen-Jun Zeng, Xin Jin, Zhibo Chen

For person re-identification (re-id), attention mechanisms have become attractive as they aim at strengthening discriminative features and suppressing irrelevant ones, which matches well the key of re-id, i.e., discriminative feature learning.

Clustering Image Classification +3

Neural Packet Classification

no code implementations 27 Feb 2019 Eric Liang, Hang Zhu, Xin Jin, Ion Stoica

First, many of the existing solutions are iteratively building a decision tree by splitting nodes in the tree.

Classification General Classification +2

Flash: Efficient Dynamic Routing for Offchain Networks

2 code implementations 14 Feb 2019 Peng Wang, Hong Xu, Xin Jin, Tao Wang

Mice payments are directly sent by looking up a routing table with a few precomputed paths to reduce probing overhead.

Networking and Internet Architecture

Unsupervised Single Image Deraining with Self-supervised Constraints

no code implementations 21 Nov 2018 Xin Jin, Zhibo Chen, Jianxin Lin, Zhikai Chen, Wei Zhou

Most existing single image deraining methods require learning supervised models from a large set of paired synthetic training data, which limits their generality, scalability and practicality in real-world multimedia applications.

Benchmarking Generative Adversarial Network +1

Unsupervised Learnable Sinogram Inpainting Network (SIN) for Limited Angle CT reconstruction

no code implementations 9 Nov 2018 Ji Zhao, Zhiqiang Chen, Li Zhang, Xin Jin

In this paper, we propose a sinogram inpainting network (SIN) to solve limited-angle CT reconstruction problem, which is a very challenging ill-posed issue and of great interest for several clinical applications.

Medical Physics Image and Video Processing

Learning for Video Compression

no code implementations 26 Apr 2018 Zhibo Chen, Tianyu He, Xin Jin, Feng Wu

One key challenge to learning-based video compression is that motion predictive coding, a very effective tool for video compression, can hardly be trained into a neural network.

Multimedia Image and Video Processing

Multi-level Chaotic Maps for 3D Textured Model Encryption

no code implementations 25 Sep 2017 Xin Jin, Shuyun Zhu, Le Wu, Geng Zhao, Xiao-Dong Li, Quan Zhou, Huimin Lu

In this work, a multi-level chaotic maps model for 3D textured model encryption is presented by observing the different contributions for recognizing cipher 3D models between vertices (point cloud), polygons and textures.

Predicting Aesthetic Score Distribution through Cumulative Jensen-Shannon Divergence

2 code implementations 23 Aug 2017 Xin Jin, Le Wu, Xiao-Dong Li, Siyu Chen, Siwei Peng, Jingying Chi, Shiming Ge, Chenggen Song, Geng Zhao

Thus, a novel CNN based on the Cumulative distribution with Jensen-Shannon divergence (CJS-CNN) is presented to predict the aesthetic score distribution of human ratings, with a new reliability-sensitive learning method based on the kurtosis of the score distribution, which eliminates the requirement of the original full data of human ratings (without normalization).

Single Reference Image based Scene Relighting via Material Guided Filtering

no code implementations 23 Aug 2017 Xin Jin, Yannan Li, Ningning Liu, Xiao-Dong Li, Xianggang Jiang, Chaoen Xiao, Shiming Ge

We propose a novel outdoor scene relighting method, which needs only a single reference image and is based on material constrained layer decomposition.

Image Relighting

Privacy Preserving Face Retrieval in the Cloud for Mobile Users

no code implementations 9 Aug 2017 Xin Jin, Shiming Ge, Chenggen Song

The experimental results reveal that our protocol can successfully retrieve the proper photos from the cloud server and protect the user photos and the face detector.

Privacy Preserving Retrieval

ILGNet: Inception Modules with Connected Local and Global Features for Efficient Image Aesthetic Quality Classification using Domain Adaptation

2 code implementations 7 Oct 2016 Xin Jin, Le Wu, Xiao-Dong Li, Xiaokun Zhang, Jingying Chi, Siwei Peng, Shiming Ge, Geng Zhao, Shuying Li

Thus, it is easy to use a pre-trained GoogLeNet for large-scale image classification problem and fine tune our connected layers on a large-scale database of aesthetic related images: AVA, i.e., domain adaptation.

Domain Adaptation General Classification +2

Face Alignment In-the-Wild: A Survey

no code implementations 15 Aug 2016 Xin Jin, Xiaoyang Tan

Over the last two decades, face alignment or localizing fiducial facial points has received increasing attention owing to its comprehensive applications in automatic face analysis.

Face Alignment Robust Face Alignment
