no code implementations • 20 Mar 2025 • Kristin Qi, Xinhan Di
Retinal Optical Coherence Tomography (OCT) segmentation is essential for diagnosing pathology.
no code implementations • 2 Jan 2025 • Shuangtao Li, Shuaihao Dong, Kexin Luan, Xinhan Di, Chaofan Ding
Large language models (LLMs) have demonstrated their remarkable capacity across a variety of tasks.
no code implementations • 23 Dec 2024 • Gongyu Chen, Haomin Zhang, Chaofan Ding, Zihao Chen, Xinhan Di
First, guidance of consistency on both context tokens and domain tokens of ALMs is set.
no code implementations • 23 Dec 2024 • Huchen Jiang, Yangyang Ma, Chaofan Ding, Kexin Luan, Xinhan Di
Our work leverages step-wise preference learning to enhance self-verification via reinforcement learning.
no code implementations • 13 Dec 2024 • Changqun Li, Chaofan Ding, Kexin Luan, Xinhan Di
Fine-tuning pre-trained large language models in a parameter-efficient manner is widely studied for its effectiveness and efficiency.
no code implementations • 12 Dec 2024 • Zihao Chen, Haomin Zhang, Xinhan Di, Haoyu Wang, Sizhe Shan, Junjie Zheng, Yunming Liang, Yihan Fan, Xinfa Zhu, Wenjie Tian, Yihua Wang, Chaofan Ding, Lei Xie
Generating sound effects for product-level videos, where only a small amount of labeled data is available for diverse scenes, requires the production of high-quality sounds in few-shot settings.
no code implementations • 7 Oct 2024 • Wenjing Gao, Yuanyuan Yang, Jianrui Wei, Xuntao Yin, Xinhan Di
The insufficient supervision limit the performance of the deep supervised models for brain disease diagnosis.
no code implementations • 2 Oct 2024 • Wenmo Qiu, Xinhan Di
There is a gap in the understanding of occluded objects in existing large-scale visual language multi-modal models.
no code implementations • 2 Oct 2024 • Shuxin Yang, Xinhan Di
There is a gap in the understanding of occluded objects in existing large-scale visual language multi-modal models.
no code implementations • 1 Oct 2024 • Shuting Zhao, Chenkang Du, Kristin Qi, Xinrong Chen, Xinhan Di
Adaptation methods are developed to adapt depth foundation models to endoscopic depth estimation recently.
no code implementations • 26 Sep 2024 • Huan Yang, Jiahui Chen, Chaofan Ding, Runhua Shi, Siyu Xiong, Qingqi Hong, Xiaoqi Mo, Xinhan Di
Gestures are pivotal in enhancing co-speech communication.
no code implementations • 1 Aug 2024 • Xinhan Di, Zihao Chen, Yunming Liang, Junjie Zheng, Yihua Wang, Chaofan Ding
Large-scale text-to-speech (TTS) models have made significant progress recently. However, they still fall short in the generation of Chinese dialectal speech.
1 code implementation • 15 Jan 2024 • Zhifeng Xie, Hao Li, Huiming Ding, Mengtian Li, Xinhan Di, Ying Cao
Fashion design is a challenging and complex process. Recent works on fashion generation and editing are all agnostic of the actual fashion design process, which limits their usage in practice. In this paper, we propose a novel hierarchical diffusion-based framework tailored for fashion design, coined as HieraFashDiff.
no code implementations • 19 Oct 2022 • Xinhan Di, Pengqian Yu
In this paper, we explore the furniture layout task as a Markov decision process (MDP) in virtual reality, which is solved by hierarchical reinforcement learning (HRL).
Hierarchical Reinforcement Learning
reinforcement-learning
+2
no code implementations • 21 Aug 2022 • Xinhan Di, Pengqian Yu
The first module is a lightweight feature attention module that extracts both local occlusion representation and global image patch representation in a coarse-to-fine manner.
1 code implementation • 18 Feb 2021 • Xinhan Di, Pengqian Yu
In the industrial interior design process, professional designers plan the furniture layout to achieve a satisfactory 3D design for selling.
Multi-agent Reinforcement Learning
reinforcement-learning
+2
1 code implementation • 19 Jan 2021 • Xinhan Di, Pengqian Yu
In the industrial interior design process, professional designers plan the size and position of furniture in a room to achieve a satisfactory design for selling.
1 code implementation • 15 Dec 2020 • Xinhan Di, Pengqian Yu, Danfeng Yang, Hong Zhu, Changyu Sun, YinDong Liu
We conduct our experiments on the proposed real-world interior layout dataset that contains $191208$ designs from the professional designers.
no code implementations • 15 Dec 2020 • Xinhan Di, Pengqian Yu, Danfeng Yang, Hong Zhu, Changyu Sun, YinDong Liu
In this paper, we propose a multiple-domain model for producing a custom-size furniture layout in the interior scene.
no code implementations • 4 Aug 2020 • Xinhan Di, Pengqian Yu, Hong Zhu, Lei Cai, Qiuyan Sheng, Changyu Sun
In this paper, we propose an assistive model that supports professional interior designers to produce industrial interior decoration solutions and to meet the personalized preferences of the property owners.
no code implementations • 24 Jun 2020 • Xinhan Di, Pengqian Yu, Hong Zhu, Lei Cai, Qiuyan Sheng, Changyu Sun
In this paper, we propose an adversarial model for producing furniture layout for interior scene synthesis when the interior room is rotated.
no code implementations • 30 Jan 2020 • Yuli Zhang, Yeyang He, Shaowen Zhu, Xinhan Di
Besides, an adversarial network with two discriminators is proposed to further improve the accuracy of the elements and to reduce the noise of the semantic segmentation.
2 code implementations • 21 May 2019 • Xinhan Di, Pengqian Yu, Rui Bu, Mingchao Sun
In order to reduce the loss, we extend the GNNs frameworks by exploring the aggregation and iteration scheme in the methodology of mutual information.
Ranked #1 on
Graph Classification
on Citeseer
1 code implementation • NeurIPS 2018 • Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, Baoquan Chen
We present a simple and general framework for feature learning from point cloud.
Ranked #2 on
Semantic Segmentation
on S3DIS Area5
(Number of params metric)
no code implementations • 2 Jul 2018 • Xinhan Di, Pengqian Yu, Meng Tian
In this paper, we extend the ambient module to the hidden space of the generator, and provide the uniqueness condition and the corresponding strategy for the ambient hidden generator in the adversarial training process.
no code implementations • 1 Jul 2018 • Xinhan Di, Pengqian Yu, Meng Tian
It has been demonstrated that deep neural networks are prone to noisy examples particular adversarial samples during inference process.
16 code implementations • NeurIPS 2018 • Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, Baoquan Chen
The proposed method is a generalization of typical CNNs to feature learning from point clouds, thus we call it PointCNN.
Ranked #1 on
3D Instance Segmentation
on S3DIS
(mIoU metric)
no code implementations • 17 Jan 2017 • Xinhan Di, Pengqian Yu
While recent deep neural networks have achieved promising results for 3D reconstruction from a single-view image, these rely on the availability of RGB textures in images and extra information as supervision.