Search Results for author: Xinyu Huang

Found 33 papers, 12 papers with code

TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes

no code implementations · 3 Apr 2024 · Cheng Zhao, Su Sun, Ruoyu Wang, Yuliang Guo, Jun-Jun Wan, Zhou Huang, Xinyu Huang, Yingjie Victor Chen, Liu Ren

Most 3D Gaussian Splatting (3D-GS) based methods for urban scenes initialize 3D Gaussians directly with 3D LiDAR points, which not only underutilizes LiDAR data capabilities but also overlooks the potential advantages of fusing LiDAR with camera data.

3D Reconstruction, Autonomous Driving

Behind the Veil: Enhanced Indoor 3D Scene Reconstruction with Occluded Surfaces Completion

no code implementations · 3 Apr 2024 · Su Sun, Cheng Zhao, Yuliang Guo, Ruoyu Wang, Xinyu Huang, Yingjie Victor Chen, Liu Ren

The 3D Inpainter with abstract representation at coarse levels is trained offline using various scenes to complete occluded surfaces.

3D Reconstruction, 3D Scene Reconstruction

Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks

1 code implementation · 25 Jan 2024 · Tianhe Ren, Shilong Liu, Ailing Zeng, Jing Lin, Kunchang Li, He Cao, Jiayu Chen, Xinyu Huang, Yukang Chen, Feng Yan, Zhaoyang Zeng, Hao Zhang, Feng Li, Jie Yang, Hongyang Li, Qing Jiang, Lei Zhang

We introduce Grounded SAM, which uses Grounding DINO as an open-set object detector to combine with the segment anything model (SAM).

Segmentation

Digital Twin-Based User-Centric Edge Continual Learning in Integrated Sensing and Communication

no code implementations · 20 Nov 2023 · Shisheng Hu, Jie Gao, Xinyu Huang, Mushu Li, Kaige Qu, Conghao Zhou, Xuemin Shen

A DT of the ISAC device is constructed to predict the impact of potential decisions on the long-term computation cost of the server, based on which the decisions are made with closed-form formulas.

Continual Learning, Edge-computing

Open-Set Image Tagging with Multi-Grained Text Supervision

2 code implementations · 23 Oct 2023 · Xinyu Huang, Yi-Jie Huang, Youcai Zhang, Weiwei Tian, Rui Feng, Yuejie Zhang, Yanchun Xie, Yaqian Li, Lei Zhang

Specifically, for predefined commonly used tag categories, RAM++ showcases 10.2 mAP and 15.4 mAP enhancements over CLIP on OpenImages and ImageNet.

Human-Object Interaction Detection, Open Set Learning +1

Digital Twin-Assisted Resource Demand Prediction for Multicast Short Video Streaming

no code implementations · 9 Jun 2023 · Xinyu Huang, Wen Wu, Xuemin Sherman Shen

In this paper, we propose a digital twin (DT)-assisted resource demand prediction scheme to enhance prediction accuracy for multicast short video streaming.

Recognize Anything: A Strong Image Tagging Model

2 code implementations · 6 Jun 2023 · Youcai Zhang, Xinyu Huang, Jinyu Ma, Zhaoyang Li, Zhaochuan Luo, Yanchun Xie, Yuzhuo Qin, Tong Luo, Yaqian Li, Shilong Liu, Yandong Guo, Lei Zhang

We are releasing the RAM at https://recognize-anything.github.io/ to foster the advancements of large models in computer vision.

Semantic Parsing

Tag2Text: Guiding Vision-Language Model via Image Tagging

2 code implementations · 10 Mar 2023 · Xinyu Huang, Youcai Zhang, Jinyu Ma, Weiwei Tian, Rui Feng, Yuejie Zhang, Yaqian Li, Yandong Guo, Lei Zhang

This paper presents Tag2Text, a vision language pre-training (VLP) framework, which introduces image tagging into vision-language models to guide the learning of visual-linguistic features.

Language Modelling, TAG

Molecular Communication for Quorum Sensing Inspired Cooperative Drug Delivery

no code implementations · 15 Feb 2023 · Yuting Fang, Stuart T. Johnston, Matt Faria, Xinyu Huang, Andrew W. Eckford, Jamie Evans

Our results show that the activation probability at the B-NM increases as this B-NM is located closer to the center of the B-NM population and the aggregate absorption rate of the drug molecules non-linearly increases as the population density increases.

Digital Twin-Assisted Collaborative Transcoding for Better User Satisfaction in Live Streaming

no code implementations · 13 Nov 2022 · Xinyu Huang, Mushu Li, Wen Wu, Conghao Zhou, Xuemin Sherman Shen

Particularly, two DTs are constructed for emulating the cloud-edge collaborative transcoding process by analyzing spatial-temporal information of individual videos and transcoding configurations of transcoding queues, respectively.

A Comparative Study of Gastric Histopathology Sub-size Image Classification: from Linear Regression to Visual Transformer

no code implementations · 25 May 2022 · Weiming Hu, HaoYuan Chen, Wanli Liu, Xiaoyan Li, Hongzan Sun, Xinyu Huang, Marcin Grzegorzek, Chen Li

Ensemble learning is a way to improve the accuracy of algorithms, and finding multiple learning models with complementarity types is the basis of ensemble learning.

BIG-bench Machine Learning, Ensemble Learning +2

Personalized QoE Enhancement for Adaptive Video Streaming: A Digital Twin-Assisted Scheme

no code implementations · 9 May 2022 · Xinyu Huang, Conghao Zhou, Wen Wu, Mushu Li, Huaqing Wu, Xuemin Shen

In this paper, we present a digital twin (DT)-assisted adaptive video streaming scheme to enhance personalized quality-of-experience (PQoE).

Management

Application of Transfer Learning and Ensemble Learning in Image-level Classification for Breast Histopathology

no code implementations · 18 Apr 2022 · Yuchao Zheng, Chen Li, Xiaomin Zhou, HaoYuan Chen, Hao Xu, Yixin Li, Haiqing Zhang, Xiaoyan Li, Hongzan Sun, Xinyu Huang, Marcin Grzegorzek

Method: This paper proposes a deep ensemble model based on image-level labels for the binary classification of benign and malignant lesions of breast histopathological images.

Binary Classification, Classification +4

A State-of-the-art Survey of U-Net in Microscopic Image Analysis: from Simple Usage to Structure Mortification

no code implementations · 14 Feb 2022 · Jian Wu, Wanli Liu, Chen Li, Tao Jiang, Islam Mohammad Shariful, Hongzan Sun, Xiaoqi Li, Xintong Li, Xinyu Huang, Marcin Grzegorzek

Image analysis technology is used to solve the inadvertences of artificial traditional methods in disease, wastewater treatment, and environmental change monitoring analysis, and convolutional neural networks (CNN) play an important role in microscopic image analysis.

Image Segmentation, Segmentation +1

QoE-driven Secure Video Transmission in Cloud-edge Collaborative Networks

no code implementations · 5 Jan 2021 · Tantan Zhao, Lijun He, Xinyu Huang, Fan Li

In this paper, by considering the interaction between video encoding and edge caching, we investigate the quality of experience (QoE)-driven cross-layer optimization of secure video transmission over the wireless backhaul link in cloud-edge collaborative networks.

Multimedia

Revenue and Energy Efficiency-Driven Delay Constrained Computing Task Offloading and Resource Allocation in a Vehicular Edge Computing Network: A Deep Reinforcement Learning Approach

no code implementations · 16 Oct 2020 · Xinyu Huang, Lijun He, Xing Chen, Liejun Wang, Fan Li

In this paper, we propose a joint task type and vehicle speed-aware task offloading and resource allocation strategy to decrease the vehicle's energy cost for executing tasks and increase the revenue of the vehicle for processing tasks within the delay constraint.

Edge-computing

AADS: Augmented Autonomous Driving Simulation using Data-driven Algorithms

1 code implementation · 23 Jan 2019 · Wei Li, Chengwei Pan, Rong Zhang, Jiaping Ren, Yuexin Ma, Jin Fang, Feilong Yan, Qichuan Geng, Xinyu Huang, Huajun Gong, Weiwei Xu, Guoping Wang, Dinesh Manocha, Ruigang Yang

Our augmented approach combines the flexibility in a virtual environment (e.g., vehicle movements) with the richness of the real world to allow effective simulation of anywhere on earth.

Autonomous Driving

Part-level Car Parsing and Reconstruction from Single Street View

no code implementations · 27 Nov 2018 · Qichuan Geng, Hong Zhang, Xinyu Huang, Sen Wang, Feixiang Lu, Xinjing Cheng, Zhong Zhou, Ruigang Yang

As it is labor-intensive to annotate semantic parts on real street views, we propose a specific approach to implicitly transfer part features from synthesized images to real street views.

Car Pose Estimation, Domain Adaptation +1

RealPoint3D: Point Cloud Generation from a Single Image with Complex Background

1 code implementation · 8 Sep 2018 · Yan Xia, Yang Zhang, Dingfu Zhou, Xinyu Huang, Cheng Wang, Ruigang Yang

Then, the image together with the retrieved shape model is fed into the proposed network to generate the fine-grained 3D point cloud.

3D Generation, Point Cloud Generation

A Network Structure to Explicitly Reduce Confusion Errors in Semantic Segmentation

no code implementations · 1 Aug 2018 · Qichuan Geng, Xinyu Huang, Zhong Zhou, Ruigang Yang

Confusing classes that are ubiquitous in the real world often degrade performance for many vision-related applications like object detection, classification, and segmentation.

Image Segmentation, object-detection +3

The ApolloScape Open Dataset for Autonomous Driving and its Application

2 code implementations · 16 Mar 2018 · Xinyu Huang, Peng Wang, Xinjing Cheng, Dingfu Zhou, Qichuan Geng, Ruigang Yang

In this paper, we provide a sensor fusion scheme integrating camera videos, consumer-grade motion sensors (GPS/IMU), and a 3D semantic map in order to achieve robust self-localization and semantic segmentation for autonomous driving.

Autonomous Driving, Instance Segmentation +3

Mask-off: Synthesizing Face Images in the Presence of Head-mounted Displays

no code implementations · 26 Oct 2016 · Yajie Zhao, Qingguo Xu, Xinyu Huang, Ruigang Yang

The main purpose of this paper is to synthesize realistic face images without occlusions based on the images captured by these cameras.

Colorization, Face Alignment +1
