Search Results for author: Qi Sun

Found 56 papers, 20 papers with code

BudgetFusion: Perceptually-Guided Adaptive Diffusion Models

no code implementations 8 Dec 2024 Qinchan Li, Kenneth Chen, Changyue Su, Qi Sun

The observation motivates us to present BudgetFusion, a novel model that suggests the most perceptually efficient number of diffusion steps before a diffusion model starts to generate an image.

Denoising Text-to-Image Generation

Group-wise normalization in differential abundance analysis of microbiome samples

1 code implementation 23 Nov 2024 Dylan Clark-Boucher, Brent A Coull, Harrison T Reeder, Fenglei Wang, Qi Sun, Jacqueline R Starr, Kyu Ha Lee

A key challenge in differential abundance analysis of microbial samples is that the counts for each sample are compositional, resulting in biased comparisons of the absolute abundance across study groups.

Detect an Object At Once without Fine-tuning

no code implementations 4 Nov 2024 Junyu Hao, Jianheng Liu, Yongjia Zhao, Zuofan Chen, Qi Sun, Jinlong Chen, Jianguo Wei, Minghao Yang

When presented with one or a few photos of a previously unseen object, humans can instantly recognize it in different scenes.

Object

An Evolved Universal Transformer Memory

1 code implementation 17 Oct 2024 Edoardo Cetin, Qi Sun, Tianyu Zhao, Yujin Tang

Prior methods propose to offset the escalating costs of modern foundation models by dropping specific parts of their contexts with hand-designed rules, while attempting to preserve their original performance.

Low Latency Point Cloud Rendering with Learned Splatting

1 code implementation 24 Sep 2024 Yueyu Hu, Ran Gong, Qi Sun, Yao Wang

Because of the point sparsity and irregularity, high-quality rendering of point clouds is challenging and often requires complex computations to recover the continuous surface representation.

Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models

no code implementations 22 Sep 2024 Yew Ken Chia, Qi Sun, Lidong Bing, Soujanya Poria

Large multimodal models have demonstrated impressive problem-solving abilities in vision and language tasks, and have the potential to encode extensive world knowledge.

World Knowledge

Rocket Landing Control with Random Annealing Jump Start Reinforcement Learning

no code implementations 21 Jul 2024 YuXuan Jiang, Yujie Yang, Zhiqian Lan, Guojian Zhan, Shengbo Eben Li, Qi Sun, Jian Ma, Tianwen Yu, Changwu Zhang

Our approach, called Random Annealing Jump Start (RAJS), is tailored for real-world goal-oriented problems by leveraging prior feedback controllers as a guide policy to facilitate environmental exploration and policy learning in RL.

reinforcement-learning Reinforcement Learning +1

FabGPT: An Efficient Large Multimodal Model for Complex Wafer Defect Knowledge Queries

no code implementations 15 Jul 2024 Yuqi Jiang, Xudong Lu, Qian Jin, Qi Sun, Hanming Wu, Cheng Zhuo

Leveraging the power of LMMs, we introduce FabGPT, a customized IC fabrication large multimodal model for wafer defect knowledge query.

Defect Detection Question Answering

Transformer Layers as Painters

no code implementations 12 Jul 2024 Qi Sun, Marc Pickett, Aakash Kumar Nain, Llion Jones

We further show that some classes of problems have robustness to skipping layers, running the layers in an order different from how they were trained, or running the layers in parallel.

Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis

no code implementations 7 Jul 2024 Qi Sun, Hang Zhou, Wengang Zhou, Li Li, Houqiang Li

Synthesizing realistic 3D indoor scenes is a challenging task that traditionally relies on manual arrangement and annotation by expert designers.

Indoor Scene Synthesis Scene Generation

BVI-RLV: A Fully Registered Dataset and Benchmarks for Low-Light Video Enhancement

no code implementations 3 Jul 2024 Ruirui Lin, Nantheera Anantrasirichai, Guoxi Huang, Joanne Lin, Qi Sun, Alexandra Malyugina, David R Bull

Low-light videos often exhibit spatiotemporally incoherent noise, compromising visibility and performance in computer vision applications.

Mamba State Space Models +1

Image-GS: Content-Adaptive Image Representation via 2D Gaussians

no code implementations 2 Jul 2024 Yunxiang Zhang, Alexandr Kuznetsov, Akshay Jindal, Kenneth Chen, Anton Sochenov, Anton Kaplanyan, Qi Sun

Neural image representations have recently emerged as a promising technique for storing, streaming, and rendering visual data.

EG4D: Explicit Generation of 4D Object without Score Distillation

1 code implementation 28 May 2024 Qi Sun, Zhiyang Guo, Ziyu Wan, Jing Nathan Yan, Shengming Yin, Wengang Zhou, Jing Liao, Houqiang Li

In recent years, the increasing demand for dynamic 3D assets in design and gaming applications has given rise to powerful generative pipelines capable of synthesizing high-quality 4D objects.

Dynamic Reconstruction Video Generation

Defending Spiking Neural Networks against Adversarial Attacks through Image Purification

no code implementations 26 Apr 2024 Weiran Chen, Qi Sun, Qi Xu

Spiking Neural Networks (SNNs) aim to bridge the gap between neuroscience and machine learning by emulating the structure of the human nervous system.

Denoising

Configurable Learned Holography

no code implementations 24 Mar 2024 Yicheng Zhan, Liang Shi, Wojciech Matusik, Qi Sun, Kaan Akşit

Although the emerging learned approaches have enabled rapid and high-quality hologram generation, any alteration in display hardware still requires a retraining of the model.

Depth Estimation Knowledge Distillation

Evolutionary Optimization of Model Merging Recipes

1 code implementation 19 Mar 2024 Takuya Akiba, Makoto Shing, Yujin Tang, Qi Sun, David Ha

Surprisingly, our Japanese Math LLM achieved state-of-the-art performance on a variety of established Japanese LLM benchmarks, even surpassing models with significantly more parameters, despite not being explicitly trained for such tasks.

Evolutionary Algorithms Math

Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs

no code implementations 31 Jan 2024 Dingyi Dai, Yichi Zhang, Jiahao Zhang, Zhanqiu Hu, Yaohui Cai, Qi Sun, Zhiru Zhang

Quantization is a crucial technique for deploying deep learning models on resource-constrained devices, such as embedded FPGAs.

Deep Learning Quantization

Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet Extraction

1 code implementation 24 Jan 2024 Qi Sun, Kun Huang, Xiaocui Yang, Rong Tong, Kun Zhang, Soujanya Poria

In this paper, we propose a Zero-shot Document-level Relation Triplet Extraction (ZeroDocRTE) framework, called GenRDK, which generates labeled data by retrieving and denoising knowledge from LLMs.

Denoising Relation +3

Exploiting GPT-4 Vision for Zero-shot Point Cloud Understanding

no code implementations 15 Jan 2024 Qi Sun, Xiao Cui, Wengang Zhou, Houqiang Li

In this study, we tackle the challenge of classifying the object category in point clouds, which previous works like PointCLIP struggle to address due to the inherent limitations of the CLIP architecture.

Point Cloud Classification Robust classification +1

MM-BigBench: Evaluating Multimodal Models on Multimodal Content Comprehension Tasks

2 code implementations 13 Oct 2023 Xiaocui Yang, Wenfang Wu, Shi Feng, Ming Wang, Daling Wang, Yang Li, Qi Sun, Yifei Zhang, XiaoMing Fu, Soujanya Poria

Consequently, our work complements research on the performance of MLLMs in multimodal comprehension tasks, achieving a more comprehensive and holistic evaluation of MLLMs.

Multimodal Reasoning

Toward Optimized VR/AR Ergonomics: Modeling and Predicting User Neck Muscle Contraction

1 code implementation 28 Aug 2023 Yunxiang Zhang, Kenneth Chen, Qi Sun

Leveraging electromyography devices, we measure, model, and predict VR users' neck muscle contraction levels (MCL) while they move their heads to interact with the virtual environment.

S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields

1 code implementation ICCV 2023 Zeke Xie, Xindi Yang, Yujie Yang, Qi Sun, Yixiang Jiang, Haoran Wang, Yunfeng Cai, Mingming Sun

Recently, Neural Radiance Field (NeRF) has shown great success in rendering novel-view images of a given scene by learning an implicit representation with only posed RGB images.

Novel View Synthesis Surface Reconstruction

Uncertainty Guided Label Denoising for Document-level Distant Relation Extraction

1 code implementation 18 May 2023 Qi Sun, Kun Huang, Xiaocui Yang, Pengfei Hong, Kun Zhang, Soujanya Poria

Therefore, how to select effective pseudo labels to denoise DS data is still a challenge in document-level distant relation extraction.

Denoising Document-level Relation Extraction +1

AutoColor: Learned Light Power Control for Multi-Color Holograms

1 code implementation 2 May 2023 Yicheng Zhan, Koray Kavaklı, Hakan Urey, Qi Sun, Kaan Akşit

For this purpose, we establish the first multi-color hologram dataset using synthetic images and their depth information.

Monocular Depth Estimation

Underwater target detection based on improved YOLOv7

1 code implementation 14 Feb 2023 Kaiyue Liu, Qi Sun, Daming Sun, Mengduo Yang, Nizhuan Wang

Underwater target detection is a crucial aspect of ocean exploration.

Dirichlet-Neumann learning algorithm for solving elliptic interface problems

no code implementations 18 Jan 2023 Qi Sun, Xuejun Xu, Haotian Yi

Non-overlapping domain decomposition methods are natural for solving interface problems arising from various disciplines; however, the numerical simulation requires technical analysis and is often available only with the use of high-quality grids, thereby impeding their use in more complicated situations.

Force-Aware Interface via Electromyography for Natural VR/AR Interaction

no code implementations 3 Oct 2022 Yunxiang Zhang, Benjamin Liang, Boyuan Chen, Paul Torrens, S. Farokh Atashzar, Dahua Lin, Qi Sun

Closing the gap between real-world physicality and immersive virtual experience requires a closed interaction loop: applying user-exerted physical forces to the virtual environment and generating haptic sensations back to the users.

A Chit-Chats Enhanced Task-Oriented Dialogue Corpora for Fuse-Motive Conversation Systems

1 code implementation 12 May 2022 Changhong Yu, Chunhong Zhang, Qi Sun

The goal of building intelligent dialogue systems has largely been separately pursued under two motives: task-oriented dialogue (TOD) systems, and open-domain systems for chit-chat (CC).

Diversity

PCL: Proxy-Based Contrastive Learning for Domain Generalization

1 code implementation CVPR 2022 Xufeng Yao, Yang Bai, Xinyun Zhang, Yuechen Zhang, Qi Sun, Ran Chen, Ruiyu Li, Bei Yu

Domain generalization refers to the problem of training a model from a collection of different source domains that can directly generalize to the unseen target domains.

Contrastive Learning Domain Generalization

Layer-Parallel Training of Residual Networks with Auxiliary-Variable Networks

no code implementations 10 Dec 2021 Qi Sun, Hexin Dong, Zewei Chen, Jiacheng Sun, Zhenguo Li, Bin Dong

Gradient-based methods for the distributed training of residual networks (ResNets) typically require a forward pass of the input data, followed by back-propagating the error gradient to update model parameters, which becomes time-consuming as the network goes deeper.

Data Augmentation

Layer-Parallel Training of Residual Networks with Auxiliary Variables

no code implementations NeurIPS Workshop DLDE 2021 Qi Sun, Hexin Dong, Zewei Chen, Weizhen Dian, Jiacheng Sun, Yitong Sun, Zhenguo Li, Bin Dong

The backpropagation algorithm is indispensable for training modern residual networks (ResNets) and usually tends to be time-consuming due to its inherent algorithmic lockings.

Data Augmentation

Deep Learning on Monocular Object Pose Detection and Tracking: A Comprehensive Overview

no code implementations 29 May 2021 Zhaoxin Fan, Yazhi Zhu, Yulin He, Qi Sun, Hongyan Liu, Jun He

Therefore, this study presents a comprehensive review of recent progress in object pose detection and tracking that belongs to the deep learning technical route.

Autonomous Driving Deep Learning +2

FoV-NeRF: Foveated Neural Radiance Fields for Virtual Reality

1 code implementation 30 Mar 2021 Nianchen Deng, Zhenyi He, Jiannan Ye, Budmonde Duinkharjav, Praneeth Chakravarthula, Xubo Yang, Qi Sun

To tackle these problems toward six-degrees-of-freedom, egocentric, and stereo NeRF in VR, we present the first gaze-contingent 3D neural representation and view synthesis method.

Distributed Computing Neural Rendering

Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving Intelligence

2 code implementations 18 Mar 2021 Yang Guan, Yangang Ren, Qi Sun, Shengbo Eben Li, Haitong Ma, Jingliang Duan, Yifan Dai, Bo Cheng

In this paper, we present an interpretable and computationally efficient framework called integrated decision and control (IDC) for automated vehicles, which decomposes the driving task into static path planning and dynamic optimal tracking that are structured hierarchically.

Autonomous Driving Model-based Reinforcement Learning +2

Decision-Making under On-Ramp merge Scenarios by Distributional Soft Actor-Critic Algorithm

no code implementations 8 Mar 2021 Yiting Kong, Yang Guan, Jingliang Duan, Shengbo Eben Li, Qi Sun, Bingbing Nie

In this paper, we propose an RL-based end-to-end decision-making method under a framework of offline training and online correction, called the Shielded Distributional Soft Actor-critic (SDSAC).

Decision Making

Recurrent Model Predictive Control

no code implementations 23 Feb 2021 Zhengyu Liu, Jingliang Duan, Wenxuan Wang, Shengbo Eben Li, Yuming Yin, Ziyu Lin, Qi Sun, Bo Cheng

This paper proposes an off-line algorithm, called Recurrent Model Predictive Control (RMPC), to solve general nonlinear finite-horizon optimal control problems.

Model Predictive Control

Steadily Learn to Drive with Virtual Memory

no code implementations 16 Feb 2021 Yuhang Zhang, Yao Mu, Yujie Yang, Yang Guan, Shengbo Eben Li, Qi Sun, Jianyu Chen

Reinforcement learning has shown great potential in developing high-level autonomous driving.

Autonomous Driving

A Practical Layer-Parallel Training Algorithm for Residual Networks

no code implementations 3 Sep 2020 Qi Sun, Hexin Dong, Zewei Chen, Weizhen Dian, Jiacheng Sun, Yitong Sun, Zhenguo Li, Bin Dong

Gradient-based algorithms for training ResNets typically require a forward pass of the input data, followed by back-propagating the objective gradient to update parameters, which is time-consuming for deep ResNets.

Data Augmentation

Mixed Reinforcement Learning with Additive Stochastic Uncertainty

no code implementations 28 Feb 2020 Yao Mu, Shengbo Eben Li, Chang Liu, Qi Sun, Bingbing Nie, Bo Cheng, Baiyu Peng

This paper presents a mixed reinforcement learning (mixed RL) algorithm that simultaneously uses dual representations of environmental dynamics to search for the optimal policy, with the purpose of improving both learning accuracy and training speed.

reinforcement-learning Reinforcement Learning +1

Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic

no code implementations 13 Feb 2020 Yangang Ren, Jingliang Duan, Shengbo Eben Li, Yang Guan, Qi Sun

In this paper, we introduce the minimax formulation and distributional framework to improve the generalization ability of RL algorithms and develop the Minimax Distributional Soft Actor-Critic (Minimax DSAC) algorithm.

Autonomous Driving Decision Making +4

Direct and indirect reinforcement learning

no code implementations 23 Dec 2019 Yang Guan, Shengbo Eben Li, Jingliang Duan, Jie Li, Yangang Ren, Qi Sun, Bo Cheng

Reinforcement learning (RL) algorithms have been successfully applied to a range of challenging sequential decision making and control tasks.

Decision Making reinforcement-learning +3

Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints

no code implementations 26 Nov 2019 Jingliang Duan, Zhengyu Liu, Shengbo Eben Li, Qi Sun, Zhenzhong Jia, Bo Cheng

CADP linearizes the constrained optimization problem locally into a quadratically constrained linear programming problem, and then obtains the optimal update of the policy network by solving its dual problem.

DiffTaichi: Differentiable Programming for Physical Simulation

3 code implementations ICLR 2020 Yuanming Hu, Luke Anderson, Tzu-Mao Li, Qi Sun, Nathan Carr, Jonathan Ragan-Kelley, Frédo Durand

We present DiffTaichi, a new differentiable programming language tailored for building high-performance differentiable physical simulators.

Physical Simulations

Learning to Reconstruct 3D Manhattan Wireframes from a Single Image

2 code implementations ICCV 2019 Yichao Zhou, Haozhi Qi, Yuexiang Zhai, Qi Sun, Zhili Chen, Li-Yi Wei, Yi Ma

In this paper, we propose a method to obtain a compact and accurate 3D wireframe representation from a single image by effectively exploiting global structural regularities.

Stochastic Training of Residual Networks: a Differential Equation Viewpoint

no code implementations 1 Dec 2018 Qi Sun, Yunzhe Tao, Qiang Du

During the last few years, significant attention has been paid to the stochastic training of artificial neural networks, which is known as an effective regularization approach that helps improve the generalization capability of trained models.

Image Classification

Nonlocal Neural Networks, Nonlocal Diffusion and Nonlocal Modeling

no code implementations NeurIPS 2018 Yunzhe Tao, Qi Sun, Qiang Du, Wei Liu

Nonlocal neural networks have been proposed and shown to be effective in several computer vision tasks, where the nonlocal operations can directly capture long-range dependencies in the feature space.

CSReader at SemEval-2018 Task 11: Multiple Choice Question Answering as Textual Entailment

no code implementations SEMEVAL 2018 Zhengping Jiang, Qi Sun

In this document we present an end-to-end machine reading comprehension system that solves multiple choice questions with a textual entailment perspective.

Common Sense Reasoning Language Modelling +5

Deep Reinforcement Learning for Resource Management in Network Slicing

no code implementations 17 May 2018 Rongpeng Li, Zhifeng Zhao, Qi Sun, Chi-Lin I, Chenyang Yang, Xianfu Chen, MinJian Zhao, Honggang Zhang

Network slicing is born as an emerging business to operators, by allowing them to sell the customized slices to various tenants at different prices.

Deep Reinforcement Learning Management +2
