Search Results for author: Rui Sun

Found 50 papers, 18 papers with code

JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images

1 code implementation19 Sep 2024 Zhecan Wang, Junzhang Liu, Chia-Wei Tang, Hani AlOmari, Anushka Sivakumar, Rui Sun, Wenhao Li, Md. Atabuzzaman, Hammad Ayyubi, Haoxuan You, Alvi Ishmam, Kai-Wei Chang, Shih-Fu Chang, Chris Thomas

In this paper, we release JourneyBench, a comprehensive human-annotated benchmark of generated images designed to assess the model's fine-grained multimodal reasoning abilities across five tasks: complementary multimodal chain of thought, multi-image VQA, imaginary image captioning, VQA with hallucination triggers, and fine-grained retrieval with sample-specific distractors.

Hallucination Image Captioning +3

Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation

no code implementations25 Aug 2024 Yuwen Pan, Rui Sun, Naisong Luo, Tianzhu Zhang, Yongdong Zhang

Semantic segmentation of night-time images holds significant importance in computer vision, particularly for applications like night environment perception in autonomous driving systems.

Autonomous Driving Segmentation +1

Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation

no code implementations25 Aug 2024 Zhaoyang Li, YuAn Wang, Wangkai Li, Rui Sun, Tianzhu Zhang

Point cloud few-shot semantic segmentation (PC-FSS) aims to segment targets of novel categories in a given query point cloud with only a few annotated support samples.

Diversity Few-Shot Semantic Segmentation +1

Efficient Knowledge Infusion via KG-LLM Alignment

no code implementations6 Jun 2024 Zhouyu Jiang, Ling Zhong, Mengshu Sun, Jun Xu, Rui Sun, Hui Cai, Shuhan Luo, Zhiqiang Zhang

To tackle the problem of domain-specific knowledge scarcity within large language models (LLMs), knowledge graph-retrievalaugmented method has been proven to be an effective and efficient technique for knowledge infusion.

Knowledge Graphs Question Answering

Rehearsal-free Federated Domain-incremental Learning

no code implementations22 May 2024 Rui Sun, Haoran Duan, Jiahua Dong, Varun Ojha, Tejal Shah, Rajiv Ranjan

A key feature of RefFiL is the generation of local fine-grained prompts by our domain adaptive prompt generator, which effectively learns from local domain knowledge while maintaining distinctive boundaries on a global scale.

Contrastive Learning Federated Learning +1

Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions

no code implementations18 May 2024 Junzhang Liu, Zhecan Wang, Hammad Ayyubi, Haoxuan You, Chris Thomas, Rui Sun, Shih-Fu Chang, Kai-Wei Chang

Despite the widespread adoption of Vision-Language Understanding (VLU) benchmarks such as VQA v2, OKVQA, A-OKVQA, GQA, VCR, SWAG, and VisualCOMET, our analysis reveals a pervasive issue affecting their integrity: these benchmarks contain samples where answers rely on assumptions unsupported by the provided context.

Visual Question Answering (VQA)

From Sora What We Can See: A Survey of Text-to-Video Generation

1 code implementation17 May 2024 Rui Sun, Yumin Zhang, Tejal Shah, Jiahao Sun, Shuoying Zhang, Wenqi Li, Haoran Duan, Bo Wei, Rajiv Ranjan

With impressive achievements made, artificial intelligence is on the path forward to artificial general intelligence.

Text-to-Video Generation Video Generation

Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization

no code implementations11 Apr 2024 Rui Sun, Li You, An-An Lu, Chen Sun, Xiqi Gao, Xiang-Gen Xia

In this paper, we investigate the precoder design for user-centric network (UCN) massive multiple-input multiple-output (mMIMO) downlink with matrix manifold optimization.

Computational Efficiency

Modeling Unified Semantic Discourse Structure for High-quality Headline Generation

no code implementations23 Mar 2024 Minghui Xu, Hao Fei, Fei Li, Shengqiong Wu, Rui Sun, Chong Teng, Donghong Ji

To consolidate the efficacy of S3 graphs, we further devise a hierarchical structure pruning mechanism to dynamically screen the redundant and nonessential nodes within the graph.

Abstract Meaning Representation Headline Generation +1

RankMatch: Exploring the Better Consistency Regularization for Semi-supervised Semantic Segmentation

no code implementations CVPR 2024 Huayu Mai, Rui Sun, Tianzhu Zhang, Feng Wu

In this paper we analyze the bottlenecks exist in contrastive learning-based methods and offer a fresh perspective on inter-pixel correlations to construct more safe and effective supervision signals which is in line with the nature of semantic segmentation.

Contrastive Learning Segmentation +1

Perception of Misalignment States for Sky Survey Telescopes with the Digital Twin and the Deep Neural Networks

no code implementations30 Nov 2023 Miao Zhang, Peng Jia, Zhengyang Li, Wennan Xiang, Jiameng Lv, Rui Sun

To address this, we need a method to obtain misalignment states, aiding in the reconstruction of accurate point spread functions for data processing methods or facilitating adjustments of optical components for improved image quality.

Astronomy

Reverse Chain: A Generic-Rule for LLMs to Master Multi-API Planning

1 code implementation6 Oct 2023 Yinger Zhang, Hui Cai, Xeirui Song, Yicheng Chen, Rui Sun, Jing Zheng

While enabling large language models to implement function calling (known as APIs) can greatly enhance the performance of Large Language Models (LLMs), function calling is still a challenging task due to the complicated relations between different APIs, especially in a context-learning setting without fine-tuning.

SafetyBench: Evaluating the Safety of Large Language Models

1 code implementation13 Sep 2023 Zhexin Zhang, Leqi Lei, Lindong Wu, Rui Sun, Yongkang Huang, Chong Long, Xiao Liu, Xuanyu Lei, Jie Tang, Minlie Huang

Notably, SafetyBench also incorporates both Chinese and English data, facilitating the evaluation in both languages.

Multiple-choice

Kernel-Based Tests for Likelihood-Free Hypothesis Testing

1 code implementation NeurIPS 2023 Patrik Róbert Gerber, Tianze Jiang, Yury Polyanskiy, Rui Sun

Given $n$ observations from two balanced classes, consider the task of labeling an additional $m$ inputs that are known to all belong to \emph{one} of the two classes.

Binary Classification Two-sample testing

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

1 code implementation3 Jul 2023 Rui Sun, Zhecan Wang, Haoxuan You, Noel Codella, Kai-Wei Chang, Shih-Fu Chang

However, we find visual and textual fine-grained information, e. g., keywords in the sentence and objects in the image, can be fairly informative for semantics understanding.

Image-text matching Sentence +2

TEC-Net: Vision Transformer Embrace Convolutional Neural Networks for Medical Image Segmentation

1 code implementation7 Jun 2023 Rui Sun, Tao Lei, Weichuan Zhang, Yong Wan, Yong Xia, Asoke K. Nandi

The hybrid architecture of convolution neural networks (CNN) and Transformer has been the most popular method for medical image segmentation.

Image Segmentation Medical Image Segmentation +2

IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models

2 code implementations24 May 2023 Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad A. Ayyubi, Kai-Wei Chang, Shih-Fu Chang

Specifically, IdealGPT utilizes an LLM to generate sub-questions, a VLM to provide corresponding sub-answers, and another LLM to reason to achieve the final answer.

Human-annotated label noise and their impact on ConvNets for remote sensing image scene classification

no code implementations20 May 2023 Longkang Peng, Tao Wei, Xuehong Chen, Xiaobei Chen, Rui Sun, Luoma Wan, Jin Chen, Xiaolin Zhu

However, the distribution of real-world human-annotated label noises on remote sensing images and their impact on ConvNets have not been investigated.

Scene Classification

Fair-CDA: Continuous and Directional Augmentation for Group Fairness

no code implementations1 Apr 2023 Rui Sun, Fengwei Zhou, Zhenhua Dong, Chuanlong Xie, Lanqing Hong, Jiawei Li, Rui Zhang, Zhen Li, Zhenguo Li

By adjusting the perturbation strength in the direction of the paths, our proposed augmentation is controllable and auditable.

Data Augmentation Disentanglement +1

An Error-Guided Correction Model for Chinese Spelling Error Correction

1 code implementation16 Jan 2023 Rui Sun, Xiuyu Wu, Yunfang Wu

By borrowing the powerful ability of BERT, we propose a novel zero-shot error detection method to do a preliminary detection, which guides our model to attend more on the probably wrong tokens in encoding and to avoid modifying the correct tokens in generating.

Chinese Spelling Error Correction Spelling Correction

Camouflaged Instance Segmentation via Explicit De-Camouflaging

no code implementations CVPR 2023 Naisong Luo, Yuwen Pan, Rui Sun, Tianzhu Zhang, Zhiwei Xiong, Feng Wu

To address these challenges, we propose a novel De-camouflaging Network (DCNet) including a pixel-level camouflage decoupling module and an instance-level camouflage suppression module.

Instance Segmentation Segmentation +1

Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation

no code implementations ICCV 2023 Rui Sun, YuAn Wang, Huayu Mai, Tianzhu Zhang, Feng Wu

In this work, we reconcile the inherent tension of spatial and temporal information to retrieve memory frame information along the object trajectory, and propose a novel and coherent Trajectory Memory Retrieval Network (TMRN) to equip with the trajectory information, including a spatial alignment module and a temporal aggregation module.

Retrieval Semantic Segmentation +2

Adaptive Template Transformer for Mitochondria Segmentation in Electron Microscopy Images

no code implementations ICCV 2023 Yuwen Pan, Naisong Luo, Rui Sun, Meng Meng, Tianzhu Zhang, Zhiwei Xiong, Yongdong Zhang

Mitochondria, as tiny structures within the cell, are of significant importance to study cell functions for biological and clinical analysis.

Rethinking the Correlation in Few-Shot Segmentation: A Buoys View

no code implementations CVPR 2023 YuAn Wang, Rui Sun, Tianzhu Zhang

In this work, we rethink how to mitigate the false matches from the perspective of representative reference features (referred to as buoys), and propose a novel adaptive buoys correlation (ABC) network to rectify direct pairwise pixel-level correlation, including a buoys mining module and an adaptive correlation module.

Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding

no code implementations14 Dec 2022 Haoxuan You, Rui Sun, Zhecan Wang, Kai-Wei Chang, Shih-Fu Chang

We present a new commonsense task, Human-centric Commonsense Grounding, that tests the models' ability to ground individuals given the context descriptions about what happened before, and their mental/physical states or intentions.

Semi-Supervised Crowd Counting from Unlabeled Data

no code implementations31 Aug 2021 Haoran Duan, Fan Wan, Rui Sun, Zeyu Wang, Varun Ojha, Yu Guan, Hubert P. H. Shum, Bingzhang Hu, Yang Long

Our method achieved competitive performance in semi-supervised learning approaches on these crowd counting datasets.

Crowd Counting

PNet -- A Deep Learning Based Photometry and Astrometry Bayesian Framework

no code implementations28 Jun 2021 Rui Sun, Peng Jia, Yongyang Sun, Zhimin Yang, Qiang Liu, Hongyan Wei

Time domain astronomy has emerged as a vibrant research field in recent years, focusing on celestial objects that exhibit variable magnitudes or positions.

Astronomy Deep Learning +2

Lesion-Aware Transformers for Diabetic Retinopathy Grading

no code implementations CVPR 2021 Rui Sun, Yihao Li, Tianzhu Zhang, Zhendong Mao, Feng Wu, Yongdong Zhang

First, to the best of our knowledge, this is the first work to formulate lesion discovery as a weakly supervised lesion localization problem via a transformer decoder.

Decoder Diabetic Retinopathy Grading +1

Blind Diagnosis for Millimeter-wave Large-scale Antenna Systems

no code implementations25 Jan 2021 Rui Sun, Weidong Wang, Li Chen, Guo Wei, Wenyi Zhang

Millimeter-wave (mmWave) communication systems rely on large-scale antenna arrays to combat large path-loss at mmWave band.

Diagnosis of Intelligent Reflecting Surface in Millimeter-wave Communication Systems

1 code implementation11 Jan 2021 Rui Sun, Weidong Wang, Li Chen, Guo Wei, Wenyi Zhang

In the second case where only partial CSI is available, we jointly exploit the sparsity of the millimeter-wave channel and the failure, and adopt compressed sparse and low-rank matrix recovery algorithm to decouple channel and failure.

EPI-based Oriented Relation Networks for Light Field Depth Estimation

1 code implementation9 Jul 2020 Kunyuan Li, Jun Zhang, Rui Sun, Xu-Dong Zhang, Jun Gao

Based on the observation that an oriented line and its neighboring pixels in an EPI share a similar linear structure, we propose an end-to-end fully convolutional network (FCN) to estimate the depth value of the intersection point on the horizontal and vertical EPIs.

Data Augmentation Depth Estimation +1

Risk Variance Penalization

no code implementations13 Jun 2020 Chuanlong Xie, Haotian Ye, Fei Chen, Yue Liu, Rui Sun, Zhenguo Li

The key of the out-of-distribution (OOD) generalization is to generalize invariance from training domains to target domains.

Online Learning and Optimization for Revenue Management Problems with Add-on Discounts

no code implementations2 May 2020 David Simchi-Levi, Rui Sun, Huanan Zhang

We show that our learning algorithm can converge to the optimal algorithm that has access to the true demand functions, and we prove that the convergence rate is tight up to a certain logarithmic term.

Management

Multi-view Point Cloud Registration with Adaptive Convergence Threshold and its Application on 3D Model Retrieval

no code implementations25 Nov 2018 Yaochen Li, Ying Liu, Rui Sun, Rui Guo, Li Zhu, Yong Qi

In this paper, we propose a framework to reconstruct the 3D models by the multi-view point cloud registration algorithm with adaptive convergence threshold, and subsequently apply it to 3D model retrieval.

Point Cloud Registration Retrieval

Study of sedimentation of non-cohesive particles via CFD-DEM simulations

2 code implementations5 Nov 2017 Shan-Lin Xu, Rui Sun, Yuan-Qiang Cai, Hong-Lei Sun

This paper employs the coupled computational fluid dynamics and discrete element method (CFD-DEM) to investigate the sedimentation process of non-cohesive particles, including the hindered settling stage and the deposition stage.

Fluid Dynamics Computational Physics Geophysics

CFD-DEM Simulations of Current-Induced Dune Formation and Morphological Evolution

5 code implementations25 Oct 2015 Rui Sun, Heng Xiao

In this work, current-induced sediment transport problems in a wide range of regimes are simulated, including 'flat bed in motion', `small dune', `vortex dune' and suspended transport.

Fluid Dynamics

Diffusion-Based Coarse Graining in Hybrid Continuum-Discrete Solvers: Theoretical Formulation and A Priori Tests

2 code implementations29 Aug 2014 Rui Sun, Heng Xiao

The numerical tests demonstrated that the proposed coarse graining procedure based on solving diffusion equations is theoretically sound, easy to implement and parallelize in general CFD solvers, and has improved mesh-convergence characteristics compared with existing coarse graining methods.

Computational Physics

Diffusion-Based Coarse Graining in Hybrid Continuum-Discrete Solvers: Applications in CFD-DEM

3 code implementations29 Aug 2014 Rui Sun, Heng Xiao

Moreover, we demonstrate that the overhead computational costs incurred by the proposed coarse-graining procedure are a small portion of the total costs in typical CFD-DEM simulations as long as the number of particles per cell is reasonably large, although admittedly the computational overhead of the coarse graining often exceeds that of the CFD solver.

Computational Physics Fluid Dynamics

Cannot find the paper you are looking for? You can Submit a new open access paper.