Search Results for author: Xin Sun

Found 60 papers, 17 papers with code

Adjusting the Precision-Recall Trade-Off with Align-and-Predict Decoding for Grammatical Error Correction

1 code implementation ACL 2022 Xin Sun, Houfeng Wang

Modern writing assistance applications are always equipped with a Grammatical Error Correction (GEC) model to correct errors in user-entered sentences.

Grammatical Error Correction

A Knowledge-enhanced Pathology Vision-language Foundation Model for Cancer Diagnosis

1 code implementation17 Dec 2024 Xiao Zhou, Luoyi Sun, Dexuan He, Wenbin Guan, Ruifen Wang, LiFeng Wang, Xin Sun, Kun Sun, Ya zhang, Yanfeng Wang, Weidi Xie

To derive more nuanced image and text representations, we propose a novel knowledge-enhanced vision-language pre-training approach that integrates disease knowledge into the alignment within hierarchical semantic groups instead of unstructured image-text pairs.

Specificity whole slide images

Script-Strategy Aligned Generation: Aligning LLMs with Expert-Crafted Dialogue Scripts and Therapeutic Strategies for Psychotherapy

no code implementations11 Nov 2024 Xin Sun, Jan de Wit, Zhuying Li, Jiahuan Pei, Abdallah El Ali, Jos A. Bosch

Building on findings, we proposed ``Script-Strategy Aligned Generation (SSAG)'', a flexible alignment approach that reduces reliance on fully scripted content while enhancing LLMs' therapeutic adherence and controllability.

Chatbot

Pin-Tuning: Parameter-Efficient In-Context Tuning for Few-Shot Molecular Property Prediction

no code implementations2 Nov 2024 Qiang Liu, Shaozhen Liu, Xin Sun, Shu Wu, Liang Wang

We attribute this issue to the imbalance between the abundance of tunable parameters and the scarcity of labeled molecules, and the lack of contextual perceptiveness in the encoders.

Attribute Drug Discovery +2

MPT: A Large-scale Multi-Phytoplankton Tracking Benchmark

no code implementations22 Oct 2024 Yang Yu, Yuezun Li, Xin Sun, Junyu Dong

Phytoplankton are a crucial component of aquatic ecosystems, and effective monitoring of them can provide valuable insights into ocean environments and ecosystem changes.

Multi-Object Tracking

Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology

no code implementations31 Aug 2024 Weinan Dai, Yifeng Jiang, Yuanjing Liu, Jinkun Chen, Xin Sun, Jinglei Tao

To achieve this, we present a speech augmentation-based unsupervised learning method that utilizes the similarity between the bottleneck layer feature and the audio reconstructing information for auxiliary training.

Contrastive Learning Keyword Spotting

Rethinking the Alignment of Psychotherapy Dialogue Generation with Motivational Interviewing Strategies

no code implementations12 Aug 2024 Xin Sun, Xiao Tang, Abdallah El Ali, Zhuying Li, Pengjie Ren, Jan de Wit, Jiahuan Pei, Jos A. Bosch

Recent advancements in large language models (LLMs) have shown promise in generating psychotherapeutic dialogues, particularly in the context of motivational interviewing (MI).

Dialogue Generation

DIVE: Subgraph Disagreement for Graph Out-of-Distribution Generalization

no code implementations8 Aug 2024 Xin Sun, Qiang Liu, Shu Wu, Zilei Wang, Liang Wang

This paper addresses the challenge of out-of-distribution (OOD) generalization in graph machine learning, a field rapidly advancing yet grappling with the discrepancy between source and target data distributions.

Graph Classification Graph Learning +3

LRM-Zero: Training Large Reconstruction Models with Synthesized Data

1 code implementation13 Jun 2024 Desai Xie, Sai Bi, Zhixin Shu, Kai Zhang, Zexiang Xu, Yi Zhou, Sören Pirk, Arie Kaufman, Xin Sun, Hao Tan

We demonstrate that our LRM-Zero, trained with our fully synthesized Zeroverse, can achieve high visual quality in the reconstruction of real-world objects, competitive with models trained on Objaverse.

3D Reconstruction

OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine

no code implementations28 Feb 2024 Xiaosong Wang, Xiaofan Zhang, Guotai Wang, Junjun He, Zhongyu Li, Wentao Zhu, Yi Guo, Qi Dou, Xiaoxiao Li, Dequan Wang, Liang Hong, Qicheng Lao, Tong Ruan, Yukun Zhou, Yixue Li, Jie Zhao, Kang Li, Xin Sun, Lifeng Zhu, Shaoting Zhang

The emerging trend of advancing generalist artificial intelligence, such as GPTv4 and Gemini, has reshaped the landscape of research (academia and industry) in machine learning and many other research areas.

Transfer Learning

Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning

no code implementations CVPR 2024 Desai Xie, Jiahao Li, Hao Tan, Xin Sun, Zhixin Shu, Yi Zhou, Sai Bi, Sören Pirk, Arie E. Kaufman

To this end, we introduce Carve3D, an improved RLFT algorithm coupled with a novel Multi-view Reconstruction Consistency (MRC) metric, to enhance the consistency of multi-view diffusion models.

Language Modelling Large Language Model +1

RNA: Relightable Neural Assets

no code implementations14 Dec 2023 Krishna Mullia, Fujun Luan, Xin Sun, Miloš Hašan

We combine an MLP decoder with a feature grid.

OpenVoice: Versatile Instant Voice Cloning

1 code implementation3 Dec 2023 Zengyi Qin, Wenliang Zhao, Xumin Yu, Xin Sun

The voice styles are not directly copied from and constrained by the style of the reference speaker.

Voice Cloning

GSLB: The Graph Structure Learning Benchmark

1 code implementation NeurIPS 2023 ZHIXUN LI, Xin Sun, Yifan Luo, Yanqiao Zhu, Dingshuo Chen, Yingtao Luo, Xiangxin Zhou, Qiang Liu, Shu Wu, Liang Wang, Jeffrey Xu Yu

To fill this gap, we systematically analyze the performance of GSL in different scenarios and develop a comprehensive Graph Structure Learning Benchmark (GSLB) curated from 20 diverse graph datasets and 16 distinct GSL algorithms.

Graph structure learning

Impact of COVID-19 Lockdown Measures on Chinese Startups and Local Government Public Finance: Challenges and Policy Implications

no code implementations14 Aug 2023 Xin Sun

This paper aims to assess the impact of COVID-19 on the public finance of Chinese local governments, with a particular focus on the effect of lockdown measures on startups during the pandemic.

All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment

no code implementations7 Jul 2023 Chunhui Zhang, Xin Sun, Li Liu, Yiqian Yang, Qiong Liu, Xi Zhou, Yanfeng Wang

This approach achieves feature integration in a unified backbone, removing the need for carefully-designed fusion modules and resulting in a more effective and efficient VL tracking framework.

Accurate Airway Tree Segmentation in CT Scans via Anatomy-aware Multi-class Segmentation and Topology-guided Iterative Learning

no code implementations15 Jun 2023 Puyang Wang, Dazhou Guo, Dandan Zheng, Minghui Zhang, Haogang Yu, Xin Sun, Jia Ge, Yun Gu, Le Lu, Xianghua Ye, Dakai Jin

Intrathoracic airway segmentation in computed tomography (CT) is a prerequisite for various respiratory disease analyses such as chronic obstructive pulmonary disease (COPD), asthma and lung cancer.

Anatomy Computed Tomography (CT) +3

Uncertainty Calibration for Counterfactual Propensity Estimation in Recommendation

no code implementations23 Mar 2023 WenBo Hu, Xin Sun, Qiang Liu, Le Wu, Liang Wang

To address this, we evaluate the quality of propensity scores from the perspective of uncertainty calibration, proposing the use of expected calibration error (ECE) as a measure of propensity-score quality.

counterfactual Generalization Bounds +3

PixHt-Lab: Pixel Height Based Light Effect Generation for Image Compositing

no code implementations CVPR 2023 Yichen Sheng, Jianming Zhang, Julien Philip, Yannick Hold-Geoffroy, Xin Sun, He Zhang, Lu Ling, Bedrich Benes

To compensate for the lack of geometry in 2D Image compositing, recent deep learning-based approaches introduced a pixel height representation to generate soft shadows and reflections.

3D geometry

GAIT: Generating Aesthetic Indoor Tours with Deep Reinforcement Learning

no code implementations ICCV 2023 Desai Xie, Ping Hu, Xin Sun, Soren Pirk, Jianming Zhang, Radomir Mech, Arie E. Kaufman

Placing and orienting a camera to compose aesthetically meaningful shots of a scene is not only a key objective in real-world photography and cinematography but also for virtual content creation.

Deep Reinforcement Learning Mixed Reality +1

A Comprehensive Survey on Aerial Mobile Edge Computing: Challenges, State-of-the-Art, and Future Directions

no code implementations30 Aug 2022 Zhengyu Song, Xintong Qin, Yuanyuan Hao, Tianwei Hou, Jun Wang, Xin Sun

Driven by the visions of Internet of Things (IoT), there is an ever-increasing demand for computation resources of IoT users to support diverse applications.

Edge-computing Scheduling

Joint Optimization of Resource Allocation, Phase Shift and UAV Trajectory for Energy-Efficient RIS-Assisted UAV-Enabled MEC Systems

no code implementations30 Aug 2022 Xintong Qin, Zhengyu Song, Tianwei Hou, Wenjuan Yu, Jun Wang, Xin Sun

The unmanned aerial vehicle (UAV) enabled mobile edge computing (MEC) has been deemed a promising paradigm to provide ubiquitous communication and computing services for the Internet of Things (IoT).

Edge-computing

A Repulsive Force Unit for Garment Collision Handling in Neural Networks

no code implementations28 Jul 2022 Qingyang Tan, Yi Zhou, Tuanfeng Wang, Duygu Ceylan, Xin Sun, Dinesh Manocha

Despite recent success, deep learning-based methods for predicting 3D garment deformation under body motion suffer from interpenetration problems between the garment and the body.

You Need to Read Again: Multi-granularity Perception Network for Moment Retrieval in Videos

1 code implementation25 May 2022 Xin Sun, Xuan Wang, Jialin Gao, Qiong Liu, Xi Zhou

Moment retrieval in videos is a challenging task that aims to retrieve the most relevant video moment in an untrimmed video given a sentence description.

Moment Retrieval Reading Comprehension +2

Lossless Acceleration for Seq2seq Generation with Aggressive Decoding

2 code implementations20 May 2022 Tao Ge, Heming Xia, Xin Sun, Si-Qing Chen, Furu Wei

We study lossless acceleration for seq2seq generation with a novel decoding algorithm -- Aggressive Decoding.

Abstractive Text Summarization Grammatical Error Correction +4

RiCS: A 2D Self-Occlusion Map for Harmonizing Volumetric Objects

no code implementations14 May 2022 Yunseok Jang, Ruben Villegas, Jimei Yang, Duygu Ceylan, Xin Sun, Honglak Lee

We test the effectiveness of our representation on the human image harmonization task by predicting shading that is coherent with a given background image.

Image Harmonization

Relational Triple Extraction: One Step is Enough

no code implementations11 May 2022 Yu-Ming Shang, Heyan Huang, Xin Sun, Wei Wei, Xian-Ling Mao

Extracting relational triples from unstructured text is an essential task in natural language processing and knowledge graph construction.

graph construction Sentence

A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model

no code implementations26 Jan 2022 Xin Sun, Tao Ge, Shuming Ma, Jingjing Li, Furu Wei, Houfeng Wang

Synthetic data construction of Grammatical Error Correction (GEC) for non-English languages relies heavily on human-designed and language-specific rules, which produce limited error-corrected patterns.

Grammatical Error Correction Language Modeling +4

SurroundNet: Towards Effective Low-Light Image Enhancement

1 code implementation11 Oct 2021 Fei Zhou, Xin Sun, Junyu Dong, Haoran Zhao, Xiao Xiang Zhu

Although Convolution Neural Networks (CNNs) has made substantial progress in the low-light image enhancement task, one critical problem of CNNs is the paradox of model complexity and performance.

Low-Light Image Enhancement

M2IOSR: Maximal Mutual Information Open Set Recognition

no code implementations5 Aug 2021 Xin Sun, Henghui Ding, Chi Zhang, Guosheng Lin, Keck-Voon Ling

In this work, we aim to address the challenging task of open set recognition (OSR).

Open Set Learning

Single-image Full-body Human Relighting

no code implementations15 Jul 2021 Manuel Lagunas, Xin Sun, Jimei Yang, Ruben Villegas, Jianming Zhang, Zhixin Shu, Belen Masia, Diego Gutierrez

We present a single-image data-driven method to automatically relight images with full-body humans in them.

Image Reconstruction

Alternating Direction Method of Multiplier-Based Distributed Planning Model for Natural Gas, Electricity Network, and Regional Integrated Energy Systems

no code implementations29 Jun 2021 Ang Xuan, Yang Qiu, Yang Liu, Xin Sun

Simulation results illustrate that a distributed planning model is more sensitive to individual load differences, which is precisely the defect of the joint planning model.

Knowledge Distillation via Instance-level Sequence Learning

no code implementations21 Jun 2021 Haoran Zhao, Xin Sun, Junyu Dong, Zihe Dong, Qiong Li

Recently, distillation approaches are suggested to extract general knowledge from a teacher network to guide a student network.

General Knowledge Knowledge Distillation

Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding

1 code implementation ACL 2021 Xin Sun, Tao Ge, Furu Wei, Houfeng Wang

In this paper, we propose Shallow Aggressive Decoding (SAD) to improve the online inference efficiency of the Transformer for instantaneous Grammatical Error Correction (GEC).

Decoder Grammatical Error Correction

Network Embedding via Deep Prediction Model

no code implementations27 Apr 2021 Xin Sun, Zenghui Song, Yongbo Yu, Junyu Dong, Claudia Plant, Christian Boehm

This paper proposes a network embedding framework to capture the transfer behaviors on structured networks via deep prediction models.

Clustering Feature Engineering +2

Gaussian Dynamic Convolution for Efficient Single-Image Segmentation

no code implementations18 Apr 2021 Xin Sun, Changrui Chen, Xiaorui Wang, Junyu Dong, Huiyu Zhou, Sheng Chen

Furthermore, we also build a Gaussian dynamic pyramid Pooling to show its potential and generality in common semantic segmentation.

Image Segmentation Segmentation +1

Similarity Transfer for Knowledge Distillation

no code implementations18 Mar 2021 Haoran Zhao, Kun Gong, Xin Sun, Junyu Dong, Hui Yu

The proposed approach promotes the performance of student model as the virtual sample created by multiple images produces a similar probability distribution in the teacher and student networks.

Knowledge Distillation

Syntax-Aware Graph Attention Network for Aspect-Level Sentiment Classification

no code implementations COLING 2020 Lianzhe Huang, Xin Sun, Sujian Li, Linhao Zhang, Houfeng Wang

In this paper, we exploit syntactic awareness to the model by the graph attention network on the dependency tree structure and external pre-training knowledge by BERT language model, which helps to model the interaction between the context and aspect words better.

Classification Graph Attention +5

Open Set Recognition with Conditional Probabilistic Generative Models

no code implementations12 Aug 2020 Xin Sun, Chi Zhang, Guosheng Lin, Keck-Voon Ling

A typical challenge that hinders their real-world applications is that unknown samples may be fed into the system during the testing phase, but traditional deep neural networks will wrongly recognize these unknown samples as one of the known classes.

Open Set Learning

Learning Relation Ties with a Force-Directed Graph in Distant Supervised Relation Extraction

no code implementations21 Apr 2020 Yuming Shang, Heyan Huang, Xin Sun, Xian-Ling Mao

Then, we borrow the idea of Coulomb's Law from physics and introduce the concept of attractive force and repulsive force to this graph to learn correlation and mutual exclusion between relations.

Relation Relation Extraction

Conditional Gaussian Distribution Learning for Open Set Recognition

1 code implementation CVPR 2020 Xin Sun, Zhenning Yang, Chi Zhang, Guohao Peng, Keck-Voon Ling

A typical challenge is that unknown samples may be fed into the system during the testing phase and traditional deep neural networks will wrongly recognize the unknown sample as one of the known classes.

General Classification Open Set Learning

High-Order Paired-ASPP Networks for Semantic Segmenation

no code implementations18 Feb 2020 Yu Zhang, Xin Sun, Junyu Dong, Changrui Chen, Yue Shen

The network first introduces a High-Order Representation module to extract the contextual high-order information from all stages of the backbone.

Semantic Segmentation Vocal Bursts Intensity Prediction

Multi-level Similarity Learning for Low-Shot Recognition

no code implementations13 Dec 2019 Hongwei Xv, Xin Sun, Junyu Dong, Shu Zhang, Qiong Li

Low-shot learning indicates the ability to recognize unseen objects based on very limited labeled training samples, which simulates human visual intelligence.

MIMO Assisted Networks Relying on Intelligent Reflective Surfaces

no code implementations2 Oct 2019 Tianwei Hou, Yuanwei Liu, Zhengyu Song, Xin Sun, Yue Chen, Lajos Hanzo

The network's SE and EE are also derived.

Information Theory Signal Processing Information Theory

Few-shot Learning for Domain-specific Fine-grained Image Classification

no code implementations23 Jul 2019 Xin Sun, Hongwei Xv, Junyu Dong, Qiong Li, Changrui Chen

Learning to recognize novel visual categories from a few examples is a challenging task for machines in real-world industrial applications.

Classification Few-Shot Learning +2

Highlight Every Step: Knowledge Distillation via Collaborative Teaching

1 code implementation23 Jul 2019 Haoran Zhao, Xin Sun, Junyu Dong, Changrui Chen, Zihe Dong

Knowledge distillation aims to train a compact student network by transferring knowledge from a larger pre-trained teacher model.

Knowledge Distillation

A Procedural Texture Generation Framework Based on Semantic Descriptions

no code implementations13 Apr 2017 Junyu Dong, Li-Na Wang, Jun Liu, Xin Sun

Finally, given a set of semantic descriptions, the diverse properties of the samples in the semantic space can lead the framework to find an appropriate generation model that uses appropriate parameters to produce a desired texture.

Multi-Label Learning Texture Synthesis

Cannot find the paper you are looking for? You can Submit a new open access paper.