Search Results for author: Dazhong Shen

Found 14 papers, 7 papers with code

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

1 code implementation19 Apr 2024 Zhuofan Zong, Bingqi Ma, Dazhong Shen, Guanglu Song, Hao Shao, Dongzhi Jiang, Hongsheng Li, Yu Liu

Although some large-scale pretrained vision encoders such as vision encoders in CLIP and DINOv2 have brought promising performance, we found that there is still no single vision encoder that can dominate various image content understanding, e. g., the CLIP vision encoder leads to outstanding results on general image understanding but poor performance on document or chart content.

Towards Efficient Resume Understanding: A Multi-Granularity Multi-Modal Pre-Training Approach

no code implementations13 Apr 2024 Feihu Jiang, Chuan Qin, Jingshuai Zhang, Kaichun Yao, Xi Chen, Dazhong Shen, Chen Zhu, HengShu Zhu, Hui Xiong

In the contemporary era of widespread online recruitment, resume understanding has been widely acknowledged as a fundamental and crucial task, which aims to extract structured information from resume documents automatically.

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance

2 code implementations8 Apr 2024 Dazhong Shen, Guanglu Song, Zeyue Xue, Fu-Yun Wang, Yu Liu

Classifier-Free Guidance (CFG) has been widely used in text-to-image diffusion models, where the CFG scale is introduced to control the strength of text guidance on the whole image space.

Denoising Semantic Segmentation

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

2 code implementations4 Apr 2024 Dongzhi Jiang, Guanglu Song, Xiaoshi Wu, Renrui Zhang, Dazhong Shen, Zhuofan Zong, Yu Liu, Hongsheng Li

We further attribute this phenomenon to the diffusion model's insufficient condition utilization, which is caused by its training paradigm.

Attribute Image Captioning +1

AFDGCF: Adaptive Feature De-correlation Graph Collaborative Filtering for Recommendations

1 code implementation26 Mar 2024 Wei Wu, Chao Wang, Dazhong Shen, Chuan Qin, Liyi Chen, Hui Xiong

Collaborative filtering methods based on graph neural networks (GNNs) have witnessed significant success in recommender systems (RS), capitalizing on their ability to capture collaborative signals within intricate user-item relationships via message-passing mechanisms.

Collaborative Filtering Recommendation Systems

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

1 code implementation20 Mar 2024 Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang, Xiaoyu Shi, Dazhong Shen, Guanglu Song, Yu Liu, Hongsheng Li

We introduce MOTIA Mastering Video Outpainting Through Input-Specific Adaptation, a diffusion-based pipeline that leverages both the intrinsic data-specific patterns of the source video and the image/video generative prior for effective outpainting.

DGR: A General Graph Desmoothing Framework for Recommendation via Global and Local Perspectives

no code implementations7 Mar 2024 Leilei Ding, Dazhong Shen, Chao Wang, Tianfu Wang, Le Zhang, Yanyong Zhang

Graph Convolutional Networks (GCNs) have become pivotal in recommendation systems for learning user and item embeddings by leveraging the user-item interaction graph's node information and topology.

Recommendation Systems

ReliCD: A Reliable Cognitive Diagnosis Framework with Confidence Awareness

no code implementations29 Dec 2023 Yunfei Zhang, Chuan Qin, Dazhong Shen, Haiping Ma, Le Zhang, Xingyi Zhang, HengShu Zhu

To address this, in this paper, we propose a novel Reliable Cognitive Diagnosis(ReliCD) framework, which can quantify the confidence of the diagnosis feedback and is flexible for different cognitive diagnostic functions.

cognitive diagnosis

A Comprehensive Survey of Artificial Intelligence Techniques for Talent Analytics

no code implementations3 Jul 2023 Chuan Qin, Le Zhang, Rui Zha, Dazhong Shen, Qi Zhang, Ying Sun, Chen Zhu, HengShu Zhu, Hui Xiong

To this end, we present an up-to-date and comprehensive survey on AI technologies used for talent analytics in the field of human resource management.

Decision Making Management

Preference or Intent? Double Disentangled Collaborative Filtering

no code implementations18 May 2023 Chao Wang, HengShu Zhu, Dazhong Shen, Wei Wu, Hui Xiong

In this way, the low-rating items will be treated as positive samples for modeling intents while the negative samples for modeling preferences.

Collaborative Filtering Disentanglement +1

Topic Modeling Revisited: A Document Graph-based Neural Network Perspective

1 code implementation NeurIPS 2021 Dazhong Shen, Chuan Qin, Chao Wang, Zheng Dong, HengShu Zhu, Hui Xiong

To this end, in this paper, we revisit the task of topic modeling by transforming each document into a directed graph with word dependency as edges between word nodes, and develop a novel approach, namely Graph Neural Topic Model (GNTM).

Variational Inference

Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness

1 code implementation24 Oct 2021 Dazhong Shen, Chuan Qin, Chao Wang, HengShu Zhu, Enhong Chen, Hui Xiong

As one of the most popular generative models, Variational Autoencoder (VAE) approximates the posterior of latent variables based on amortized variational inference.

Variational Inference

A Machine Learning-enhanced Robust P-Phase Picker for Real-time Seismic Monitoring

no code implementations21 Nov 2019 Dazhong Shen, Qi Zhang, Tong Xu, HengShu Zhu, Wenjia Zhao, Zikai Yin, Peilun Zhou, Lihua Fang, Enhong Chen, Hui Xiong

To this end, in this paper, we present a machine learning-enhanced framework based on ensemble learning strategy, EL-Picker, for the automatic identification of seismic P-phase arrivals on continuous and massive waveforms.

BIG-bench Machine Learning Ensemble Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.