2D Matryoshka Sentence Embeddings

no code implementations 22 Feb 2024 Xianming Li, Zongxi Li, Jing Li, Haoran Xie, Qing Li

The experimental results demonstrate the effectiveness of our proposed model in dynamically supporting different embedding sizes and Transformer layers, allowing it to be highly adaptable to various scenarios.

Semantic Textual Similarity Sentence +3
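
The idea of supporting different embedding sizes and Transformer layers can be illustrated with a minimal sketch. It assumes that a smaller embedding is obtained by keeping the leading dimensions of a sentence vector and that a shallower embedding is read from an earlier layer. The helper names `sub_embedding` and `cosine` and the toy vectors are hypothetical; this is not the authors' implementation or training objective.

```python
import math

def sub_embedding(layer_embeddings, layer, dim):
    """Return the first `dim` values of the sentence vector at `layer`, L2-normalized."""
    vector = layer_embeddings[layer][:dim]
    norm = math.sqrt(sum(v * v for v in vector))
    if norm == 0.0:
        return list(vector)
    return [v / norm for v in vector]

def cosine(a, b):
    # Both inputs are unit length (or all zeros), so the dot product is the cosine.
    return sum(x * y for x, y in zip(a, b))

# Toy per-layer sentence vectors (hypothetical values: 2 layers, 4 dimensions).
sentence_a = [[3.0, 4.0, 0.0, 0.0], [1.0, 2.0, 2.0, 0.0]]
sentence_b = [[4.0, 3.0, 1.0, 0.0], [2.0, 1.0, 2.0, 1.0]]

# Cheap setting: first layer, first 2 dimensions.
small = cosine(sub_embedding(sentence_a, 0, 2), sub_embedding(sentence_b, 0, 2))
# Full setting: last layer, all 4 dimensions.
full = cosine(sub_embedding(sentence_a, 1, 4), sub_embedding(sentence_b, 1, 4))
print(small, full)
```

Re-normalizing after truncation keeps cosine similarity well defined at every size, so the same scoring code can serve any (layer, dimension) setting.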

Towards Causal Classification: A Comprehensive Study on Graph Neural Networks

no code implementations 27 Jan 2024 Simi Job, Xiaohui Tao, Taotao Cai, Lin Li, Haoran Xie, Jianming Yong

The exploration of Graph Neural Networks (GNNs) for processing graph-structured data has expanded, particularly their potential for causal analysis due to their universal approximation capabilities.

Graph Classification

Cross Initialization for Personalized Text-to-Image Generation

1 code implementation 26 Dec 2023 Lianyu Pang, Jian Yin, Haoran Xie, Qiping Wang, Qing Li, Xudong Mao

Additionally, a fast version of our method allows for capturing an input image in roughly 26 seconds, while surpassing the baseline methods in terms of both reconstruction and editability.

Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment

no code implementations 19 Dec 2023 Lingling Xu, Haoran Xie, Si-Zhao Joe Qin, Xiaohui Tao, Fu Lee Wang

The demands for fine-tuning PLMs, especially LLMs, have led to a surge in the development of PEFT methods, as depicted in Fig.


Cross-BERT for Point Cloud Pretraining

no code implementations 8 Dec 2023 Xin Li, Peng Li, Zeyong Wei, Zhe Zhu, Mingqiang Wei, Junhui Hou, Liangliang Nan, Jing Qin, Haoran Xie, Fu Lee Wang

By performing cross-modal interaction, Cross-BERT can smoothly reconstruct the masked tokens during pretraining, leading to notable performance enhancements for downstream tasks.

Self-Supervised Learning

Recognizing Conditional Causal Relationships about Emotions and Their Corresponding Conditions

no code implementations 28 Nov 2023 Xinhong Chen, Zongxi Li, YaoWei Wang, Haoran Xie, JianPing Wang, Qing Li

To highlight the context in such special causal relationships, we propose a new task to determine whether or not an input pair of emotion and cause has a valid causal relationship under different contexts and extract the specific context clauses that participate in the causal relationship.


Label Supervised LLaMA Finetuning

1 code implementation 2 Oct 2023 Zongxi Li, Xianming Li, Yuzhang Liu, Haoran Xie, Jing Li, Fu-lee Wang, Qing Li, Xiaoqin Zhong

We evaluate this approach with Label Supervised LLaMA (LS-LLaMA), which is based on LLaMA-2-7B, a relatively small-scale LLM, and can be finetuned on a single GeForce RTX4090 GPU.

named-entity-recognition Named Entity Recognition +7

PDRL: Multi-Agent based Reinforcement Learning for Predictive Monitoring

no code implementations 19 Sep 2023 Thanveer Shaik, Xiaohui Tao, Lin Li, Haoran Xie, U R Acharya, Raj Gururajan, Xujuan Zhou

The PDRL framework is able to learn the future states in traffic and weather forecasting, and its cumulative rewards increase gradually over the episodes.

reinforcement-learning Time Series +3

Graph-enabled Reinforcement Learning for Time Series Forecasting with Adaptive Intelligence

no code implementations 18 Sep 2023 Thanveer Shaik, Xiaohui Tao, Haoran Xie, Lin Li, Jianming Yong, Yuefeng Li

In this study, we propose a novel approach for predicting time-series data using GNN and monitoring with Reinforcement Learning (RL).

Bayesian Optimisation reinforcement-learning +5

TransFace: Calibrating Transformer Training for Face Recognition from a Data-Centric Perspective

1 code implementation ICCV 2023 Jun Dan, Yang Liu, Haoyu Xie, Jiankang Deng, Haoran Xie, Xuansong Xie, Baigui Sun

We investigate the reasons for this phenomenon and discover that the existing data augmentation approach and hard sample mining strategy are incompatible with ViT-based FR backbones, because they give no tailored consideration to preserving face structural information or to leveraging the information in each local token.

Data Augmentation Face Recognition

A Survey of Multimodal Information Fusion for Smart Healthcare: Mapping the Journey from Data to Wisdom

no code implementations 21 Jun 2023 Thanveer Shaik, Xiaohui Tao, Lin Li, Haoran Xie, Juan D. Velásquez

The components of the comprehensive survey presented in this paper form the foundation for more successful implementation of multimodal fusion in smart healthcare.

feature selection

AniFaceDrawing: Anime Portrait Exploration during Your Sketching

no code implementations 13 Jun 2023 Zhengyu Huang, Haoran Xie, Tsukasa Fukusato, Kazunori Miyata

In the second stage, we simulated the drawing process of the generated images without any additional data (labels) and trained the sketch encoder for incomplete progressive sketches to generate high-quality portrait images with feature alignment to the disentangled representations in the teacher encoder.

Conditional Image Generation Disentanglement

Recurrent Attention Networks for Long-text Modeling

1 code implementation 12 Jun 2023 Xianming Li, Zongxi Li, Xiaotian Luo, Haoran Xie, Xing Lee, Yingbin Zhao, Fu Lee Wang, Qing Li

Revisiting the self-attention mechanism and the recurrent structure, this paper proposes a novel long-document encoding model, Recurrent Attention Network (RAN), to enable the recurrent operation of self-attention.


Exploring the Landscape of Machine Unlearning: A Comprehensive Survey and Taxonomy

no code implementations 10 May 2023 Thanveer Shaik, Xiaohui Tao, Haoran Xie, Lin Li, Xiaofeng Zhu, Qing Li

Machine unlearning (MU) is gaining increasing attention due to the need to remove or modify predictions made by machine learning (ML) models.

Fairness Machine Unlearning +1

Search By Image: Deeply Exploring Beneficial Features for Beauty Product Retrieval

no code implementations 24 Mar 2023 Mingqiang Wei, Qian Sun, Haoran Xie, Dong Liang, Fu Lee Wang

Searching by image is popular yet still challenging due to the extensive interference arising from i) data variations (e.g., background, pose, visual angle, brightness) in real-world captured images and ii) similar images in the query dataset.


Sketch2Cloth: Sketch-based 3D Garment Generation with Unsigned Distance Fields

no code implementations 1 Mar 2023 Yi He, Haoran Xie, Kazunori Miyata

In this study, we propose Sketch2Cloth, a sketch-based 3D garment generation system using the unsigned distance fields from the user's sketch input.

Model Editing

DiffFaceSketch: High-Fidelity Face Image Synthesis with Sketch-Guided Latent Diffusion Model

1 code implementation 14 Feb 2023 Yichen Peng, Chunqi Zhao, Haoran Xie, Tsukasa Fukusato, Kazunori Miyata

We then introduce a Stochastic Region Abstraction (SRA), an approach to augment our dataset to improve the robustness of SGLDM to handle sketch input with arbitrary abstraction.

Image-to-Image Translation

Sentiment analysis and opinion mining on educational data: A survey

no code implementations 8 Feb 2023 Thanveer Shaik, Xiaohui Tao, Christopher Dann, Haoran Xie, Yan Li, Linda Galligan

In the education sector, opinion mining is used to listen to student opinions and enhance their learning-teaching practices pedagogically.

Decision Making Negation +4

RainDiffusion: When Unsupervised Learning Meets Diffusion Models for Real-world Image Deraining

no code implementations 23 Jan 2023 Mingqiang Wei, Yiyang Shen, Yongzhen Wang, Haoran Xie, Jing Qin, Fu Lee Wang

Before answering it, we observe two major obstacles of diffusion models in real-world image deraining: the need for paired training data and the limited utilization of multi-scale rain patterns.

Rain Removal Translation

AI enabled RPM for Mental Health Facility

no code implementations 20 Jan 2023 Thanveer Shaik, Xiaohui Tao, Niall Higgins, Haoran Xie, Raj Gururajan, Xujuan Zhou

To provide a therapeutic environment for both patients and staff, aggressive or agitated patients need to be monitored remotely, with their vital signs and physical activities tracked continuously.

Time Series Time Series Analysis

SpaceEditing: Integrating Human Knowledge into Deep Neural Networks via Interactive Latent Space Editing

no code implementations 8 Dec 2022 Jiafu Wei, Ding Xia, Haoran Xie, Chia-Ming Chang, Chuntao Li, Xi Yang

We propose an interactive editing method that allows humans to help deep neural networks (DNNs) learn a latent space more consistent with human knowledge, thereby improving classification accuracy on indistinguishable ambiguous data.

Dimensionality Reduction

SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation

1 code implementation 30 Nov 2022 Tianyu Zhang, Xusheng Du, Chia-Ming Chang, Xi Yang, Haoran Xie

However, it is difficult to draw a proper scene graph for image retrieval, image generation, and multi-modal applications.

Graph Generation Image Generation +5

ImLiDAR: Cross-Sensor Dynamic Message Propagation Network for 3D Object Detection

no code implementations 17 Nov 2022 Yiyang Shen, Rongwei Yu, Peng Wu, Haoran Xie, Lina Gong, Jing Qin, Mingqiang Wei

We propose ImLiDAR, a new 3OD paradigm to narrow the cross-sensor discrepancies by progressively fusing the multi-scale features of camera Images and LiDAR point clouds.

3D Object Detection object-detection

iSmallNet: Densely Nested Network with Label Decoupling for Infrared Small Target Detection

no code implementations 29 Oct 2022 Zhiheng Hu, Yongzhen Wang, Peng Li, Jie Qin, Haoran Xie, Mingqiang Wei

First, to maintain small targets in deep layers, we develop a multi-scale nested interaction module to explore a wide range of context information.

object-detection Small Object Detection

GeoGCN: Geometric Dual-domain Graph Convolution Network for Point Cloud Denoising

no code implementations 28 Oct 2022 Zhaowei Chen, Peng Li, Zeyong Wei, Honghua Chen, Haoran Xie, Mingqiang Wei, Fu Lee Wang

We propose GeoGCN, a novel geometric dual-domain graph convolution network for point cloud denoising (PCD).


TogetherNet: Bridging Image Restoration and Object Detection Together via Dynamic Enhancement Learning

1 code implementation 3 Sep 2022 Yongzhen Wang, Xuefeng Yan, Kaiwen Zhang, Lina Gong, Haoran Xie, Fu Lee Wang, Mingqiang Wei

Adverse weather conditions such as haze, rain, and snow often impair the quality of captured images, causing detection networks trained on normal images to generalize poorly in these scenarios.

Image Dehazing Image Restoration +3

Contrastive Semantic-Guided Image Smoothing Network

1 code implementation 2 Sep 2022 Jie Wang, Yongzhen Wang, Yidan Feng, Lina Gong, Xuefeng Yan, Haoran Xie, Fu Lee Wang, Mingqiang Wei

Image smoothing is a fundamental low-level vision task that aims to preserve salient structures of an image while removing insignificant details.

image smoothing Semantic Segmentation

PV-RCNN++: Semantical Point-Voxel Feature Interaction for 3D Object Detection

no code implementations 29 Aug 2022 Peng Wu, Lipeng Gu, Xuefeng Yan, Haoran Xie, Fu Lee Wang, Gary Cheng, Mingqiang Wei

Such a module will guide our PV-RCNN++ to integrate more object-related point-wise and voxel-wise features in the pivotal areas.

3D Object Detection Novel Object Detection +3

CSDN: Cross-modal Shape-transfer Dual-refinement Network for Point Cloud Completion

no code implementations 1 Aug 2022 Zhe Zhu, Liangliang Nan, Haoran Xie, Honghua Chen, Mingqiang Wei, Jun Wang, Jing Qin

The first module transfers the intrinsic shape characteristics from single images to guide the geometry generation of the missing regions of point clouds, in which we propose IPAdaIN to embed the global features of both the image and the partial point cloud into completion.

Point Cloud Completion

GeoSegNet: Point Cloud Semantic Segmentation via Geometric Encoder-Decoder Modeling

1 code implementation 14 Jul 2022 Chen Chen, Yisen Wang, Honghua Chen, Xuefeng Yan, Dayong Ren, Yanwen Guo, Haoran Xie, Fu Lee Wang, Mingqiang Wei

Semantic segmentation of point clouds, which aims to assign each point a semantic category, is critical to 3D scene understanding. Despite significant advances in recent years, most existing methods still suffer from either object-level misclassification or boundary-level ambiguity.

Object Segmentation +1

Dynamic Message Propagation Network for RGB-D Salient Object Detection

no code implementations 20 Jun 2022 Baian Chen, Zhilei Chen, Xiaowei Hu, Jun Xu, Haoran Xie, Mingqiang Wei, Jing Qin

This paper presents a novel deep neural network framework for RGB-D salient object detection by controlling the message passing between the RGB images and depth maps on the feature level and exploring the long-range semantic contexts and geometric information on both RGB and depth features to infer salient objects.

object-detection RGB-D Salient Object Detection +1

Efficient Human-in-the-loop System for Guiding DNNs Attention

1 code implementation 13 Jun 2022 Yi He, Xi Yang, Chia-Ming Chang, Haoran Xie, Takeo Igarashi

Attention guidance is an approach to addressing dataset bias in deep learning, where the model relies on incorrect features to make decisions.

Active Learning Image Classification

UCL-Dehaze: Towards Real-world Image Dehazing via Unsupervised Contrastive Learning

1 code implementation 4 May 2022 Yongzhen Wang, Xuefeng Yan, Fu Lee Wang, Haoran Xie, Wenhan Yang, Mingqiang Wei, Jing Qin

From a different perspective, this paper explores contrastive learning with an adversarial training effort to leverage unpaired real-world hazy and clean images, thus avoiding the gap between synthetic and real-world haze.

Contrastive Learning Image Dehazing

Semi-MoreGAN: A New Semi-supervised Generative Adversarial Network for Mixture of Rain Removal

1 code implementation 28 Apr 2022 Yiyang Shen, Yongzhen Wang, Mingqiang Wei, Honghua Chen, Haoran Xie, Gary Cheng, Fu Lee Wang

Rain is one of the most common weather conditions; it can completely degrade image quality and interfere with the performance of many computer vision tasks, especially under heavy rain.

Depth Estimation Depth Prediction +2

Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds

1 code implementation 23 Mar 2022 Haoran Zhou, Honghua Chen, Yingkui Zhang, Mingqiang Wei, Haoran Xie, Jun Wang, Tong Lu, Jing Qin, Xiao-Ping Zhang

Differently, our network is designed to refine the initial normal of each point by extracting additional information from multiple feature representations.

When A Conventional Filter Meets Deep Learning: Basis Composition Learning on Image Filters

1 code implementation 1 Mar 2022 Fu Lee Wang, Yidan Feng, Haoran Xie, Gary Cheng, Mingqiang Wei

Image filters are fast, lightweight and effective, which makes these conventional methods preferable as basic tools in vision tasks.

Denoising Rain Removal

Interactive 3D Character Modeling from 2D Orthogonal Drawings with Annotations

no code implementations 27 Jan 2022 Zhengyu Huang, Haoran Xie, Tsukasa Fukusato

We propose an interactive 3D character modeling approach from orthographic drawings (e.g., front and side views) based on 2D-space annotations.

Stroke Correspondence by Labeling Closed Areas

no code implementations 10 Aug 2021 Ryoma Miyauchi, Tsukasa Fukusato, Haoran Xie, Kazunori Miyata

First, the proposed system separates the closed areas in each keyframe and estimates the correspondences between closed areas by using the characteristics of shape, depth, and closed area connection.

Tree-Structured Topic Modeling with Nonparametric Neural Variational Inference

1 code implementation ACL 2021 Ziye Chen, Cheng Ding, Zusheng Zhang, Yanghui Rao, Haoran Xie

Topic modeling has been widely used for discovering the latent semantic structure of documents, but most existing methods learn topics with a flat structure.

Variational Inference

Learning Perceptual Manifold of Fonts

no code implementations 17 Jun 2021 Haoran Xie, Yuki Fujita, Kazunori Miyata

To solve the specific issue, we propose the perceptual manifold of fonts to visualize the perceptual adjustment in the latent space of a generative model of fonts.

Font Generation

Direction-aware Feature-level Frequency Decomposition for Single Image Deraining

no code implementations 15 Jun 2021 Sen Deng, Yidan Feng, Mingqiang Wei, Haoran Xie, Yiping Chen, Jonathan Li, Xiao-Ping Zhang, Jing Qin

Second, we further establish communication channels between low-frequency maps and high-frequency maps to interactively capture structures from high-frequency maps and add them back to low-frequency maps and, simultaneously, extract details from low-frequency maps and send them back to high-frequency maps, thereby removing rain streaks while preserving more delicate features in the input image.

Single Image Deraining

Image Deformation Estimation via Multi-Objective Optimization

no code implementations 8 Jun 2021 Takumi Nakane, Haoran Xie, Chao Zhang

Specifically, by partitioning the template image into several regions and measuring the similarity of each region independently, multiple objectives are built and deformation estimation can thus be realized by solving the MOP with off-the-shelf multi-objective evolutionary algorithms (MOEAs).

Evolutionary Algorithms

dualFace: Two-Stage Drawing Guidance for Freehand Portrait Sketching

1 code implementation 26 Apr 2021 Zhengyu Huang, Yichen Peng, Tomohiro Hibino, Chunqi Zhao, Haoran Xie, Tsukasa Fukusato, Kazunori Miyata

In the stage of local guidance, we synthesize detailed portrait images with a deep generative model from user-drawn contour lines, but use the synthesized results as detailed drawing guidance.

Sketch-based Normal Map Generation with Geometric Sampling

no code implementations 23 Apr 2021 Yi He, Haoran Xie, Chao Zhang, Xi Yang, Kazunori Miyata

This paper proposes a deep generative model for generating normal maps from user sketches with geometric sampling.

Generative Adversarial Network

Context Reinforced Neural Topic Modeling over Short Texts

1 code implementation 11 Aug 2020 Jiachun Feng, Zusheng Zhang, Cheng Ding, Yanghui Rao, Haoran Xie

As one of the prevalent topic mining tools, neural topic modeling has attracted a lot of interest for its high training efficiency and strong generalisation ability.

text-classification Topic Models +1

Handling Collocations in Hierarchical Latent Tree Analysis for Topic Modeling

no code implementations 10 Jul 2020 Leonard K. M. Poon, Nevin L. Zhang, Haoran Xie, Gary Cheng

Topic modeling has been one of the most active research areas in machine learning in recent years.

Neural Mixed Counting Models for Dispersed Topic Discovery

no code implementations ACL 2020 Jiemin Wu, Yanghui Rao, Zusheng Zhang, Haoran Xie, Qing Li, Fu Lee Wang, Ziye Chen

Mixed counting models that use the negative binomial distribution as the prior can well model over-dispersed and hierarchically dependent random variables; thus they have attracted much attention in mining dispersed document topics.

Variational Inference

MBA-RainGAN: Multi-branch Attention Generative Adversarial Network for Mixture of Rain Removal from Single Images

no code implementations 21 May 2020 Yiyang Shen, Yidan Feng, Sen Deng, Dong Liang, Jing Qin, Haoran Xie, Mingqiang Wei

We observe three intriguing phenomena: 1) rain is a mixture of raindrops, rain streaks and rainy haze; 2) the depth from the camera determines the degree of object visibility, where nearby and faraway objects are visually blocked by rain streaks and rainy haze, respectively; and 3) raindrops on the glass randomly affect object visibility across the whole image space.

Generative Adversarial Network Rain Removal

Incorporating Effective Global Information via Adaptive Gate Attention for Text Classification

no code implementations 22 Feb 2020 Xianming Li, Zongxi Li, Yingbin Zhao, Haoran Xie, Qing Li

The dominant text classification studies focus on training classifiers using textual instances only or introducing external knowledge (e.g., hand-crafted features and domain expert knowledge).

General Classification Sentence +2

DRD-Net: Detail-recovery Image Deraining via Context Aggregation Networks

1 code implementation 27 Aug 2019 Sen Deng, Mingqiang Wei, Jun Wang, Luming Liang, Haoran Xie, Meng Wang

We have validated our approach on four recognized datasets (three synthetic and one real-world).

Rain Removal

Siamese Network-Based Supervised Topic Modeling

no code implementations EMNLP 2018 Minghui Huang, Yanghui Rao, Yuwei Liu, Haoran Xie, Fu Lee Wang

Label-specific topics can be widely used for supporting personality psychology, aspect-level sentiment analysis, and cross-domain sentiment classification.

General Classification Sentiment Analysis +3

On the Effectiveness of Least Squares Generative Adversarial Networks

2 code implementations 18 Dec 2017 Xudong Mao, Qing Li, Haoran Xie, Raymond Y. K. Lau, Zhen Wang, Stephen Paul Smolley

To overcome such a problem, we propose in this paper the Least Squares Generative Adversarial Networks (LSGANs) which adopt the least squares loss for both the discriminator and the generator.
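
The least squares loss mentioned above can be sketched in a few lines. This is a minimal illustration in plain Python, assuming the common label coding of 1 for real samples and 0 for generated samples, and a generator target of 1; the function names and default labels are illustrative choices, not taken from the listing.

```python
def lsgan_discriminator_loss(d_real, d_fake, real_label=1.0, fake_label=0.0):
    """Least squares discriminator loss over lists of discriminator outputs."""
    # Penalize squared distance of outputs on real samples from the real label.
    real_term = sum((d - real_label) ** 2 for d in d_real) / len(d_real)
    # Penalize squared distance of outputs on generated samples from the fake label.
    fake_term = sum((d - fake_label) ** 2 for d in d_fake) / len(d_fake)
    return 0.5 * real_term + 0.5 * fake_term

def lsgan_generator_loss(d_fake, target_label=1.0):
    """Least squares generator loss: push outputs on generated samples toward the target."""
    return 0.5 * sum((d - target_label) ** 2 for d in d_fake) / len(d_fake)

# Hypothetical discriminator outputs for a batch of two real and two generated samples.
print(lsgan_discriminator_loss([0.9, 1.1], [0.2, -0.1]))
print(lsgan_generator_loss([0.2, -0.1]))
```

Unlike a sigmoid cross-entropy loss, the squared error keeps growing with the distance from the target label, so generated samples that are already classified as real but lie far from the target still contribute to the generator loss.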

AlignGAN: Learning to Align Cross-Domain Images with Conditional Generative Adversarial Networks

no code implementations 5 Jul 2017 Xudong Mao, Qing Li, Haoran Xie

Recently, several methods based on generative adversarial network (GAN) have been proposed for the task of aligning cross-domain images or learning a joint distribution of cross-domain images.

Generative Adversarial Network

A Network Framework for Noisy Label Aggregation in Social Media

no code implementations ACL 2017 Xueying Zhan, Yao-Wei Wang, Yanghui Rao, Haoran Xie, Qing Li, Fu Lee Wang, Tak-Lam Wong

This paper focuses on the task of noisy label aggregation in social media, where users with different social or cultural backgrounds may annotate documents with invalid or malicious tags.

Cultural Vocal Bursts Intensity Prediction Image Classification +2

Least Squares Generative Adversarial Networks

23 code implementations ICCV 2017 Xudong Mao, Qing Li, Haoran Xie, Raymond Y. K. Lau, Zhen Wang, Stephen Paul Smolley

To overcome such a problem, we propose in this paper the Least Squares Generative Adversarial Networks (LSGANs) which adopt the least squares loss function for the discriminator.

