Search Results for author: Hongbo Zhang

Found 34 papers, 16 papers with code

Geo-ConvGRU: Geographically Masked Convolutional Gated Recurrent Unit for Bird-Eye View Segmentation

no code implementations28 Dec 2024 Guanglei Yang, Yongqiang Zhang, Wanlong Li, Yu Tang, Weize Shang, Feng Wen, Hongbo Zhang, Mingli Ding

Convolutional Neural Networks (CNNs) have significantly impacted various computer vision tasks, however, they inherently struggle to model long-range dependencies explicitly due to the localized nature of convolution operations.

CycleResearcher: Improving Automated Research via Automated Review

no code implementations28 Oct 2024 Yixuan Weng, Minjun Zhu, Guangsheng Bao, Hongbo Zhang, Jindong Wang, Yue Zhang, Linyi Yang

In research, the papers generated by the CycleResearcher model achieved a score of 5. 36 in simulated peer reviews, surpassing the preprint level of 5. 24 from human experts and approaching the accepted paper level of 5. 69.

scientific discovery

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

no code implementations26 Sep 2024 Jing He, Haodong Li, Wei Yin, Yixun Liang, Leheng Li, Kaiqiang Zhou, Hongbo Zhang, Bingbing Liu, Ying-Cong Chen

In this paper, we provide a systemic analysis of the diffusion formulation for the dense prediction, focusing on both quality and efficiency.

3D Reconstruction Denoising +3

LLM-based MOFs Synthesis Condition Extraction using Few-Shot Demonstrations

no code implementations6 Aug 2024 Lei Shi, Zhimeng Liu, Yi Yang, Weize Wu, Yuyang Zhang, Hongbo Zhang, Jing Lin, Siyu Wu, Zihan Chen, Ruiming Li, Nan Wang, Zipeng Liu, Huobin Tan, Hongyi Gao, Yue Zhang, Ge Wang

The extraction of Metal-Organic Frameworks (MOFs) synthesis conditions from literature text has been challenging but crucial for the logical design of new MOFs with desirable functionality.

Few-Shot Learning In-Context Learning +2

Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

no code implementations16 Jul 2024 Jianhao Li, Tianyu Sun, Zhongdao Wang, Enze Xie, Bailan Feng, Hongbo Zhang, Ze Yuan, Ke Xu, Jiaheng Liu, Ping Luo

Unlike previous arts, our auto-labeler predicts 3D shapes instead of bounding boxes and does not require training on a specific dataset.

Autonomous Driving

An Analysis on Quantizing Diffusion Transformers

no code implementations16 Jun 2024 Yuewei Yang, Jialiang Wang, Xiaoliang Dai, Peizhao Zhang, Hongbo Zhang

Prior works address PTQ of DMs on UNet structures have addressed the challenges in calibrating parameters for both activations and weights via moderate optimization.

Conditional Image Generation Denoising +1

Large Language Models Meet Text-Centric Multimodal Sentiment Analysis: A Survey

no code implementations12 Jun 2024 Hao Yang, Yanyan Zhao, Yang Wu, Shilong Wang, Tian Zheng, Hongbo Zhang, Zongyang Ma, Wanxiang Che, Bing Qin

Compared to traditional sentiment analysis, which only considers text, multimodal sentiment analysis needs to consider emotional signals from multimodal sources simultaneously and is therefore more consistent with the way how humans process sentiment in real-world scenarios.

Multimodal Sentiment Analysis

AutoSurvey: Large Language Models Can Automatically Write Surveys

1 code implementation10 Jun 2024 Yidong Wang, Qi Guo, Wenjin Yao, Hongbo Zhang, Xin Zhang, Zhen Wu, Meishan Zhang, Xinyu Dai, Min Zhang, Qingsong Wen, Wei Ye, Shikun Zhang, Yue Zhang

This paper introduces AutoSurvey, a speedy and well-organized methodology for automating the creation of comprehensive literature surveys in rapidly evolving fields like artificial intelligence.

Retrieval Survey

An Organic Weed Control Prototype using Directed Energy and Deep Learning

no code implementations31 May 2024 Deng Cao, Hongbo Zhang, Rajveer Dhillon

In this work, a directed energy weed control robot prototype specifically designed for organic farms is proposed.

Deep Learning

How Likely Do LLMs with CoT Mimic Human Reasoning?

1 code implementation25 Feb 2024 Guangsheng Bao, Hongbo Zhang, Cunxiang Wang, Linyi Yang, Yue Zhang

Chain-of-thought emerges as a promising technique for eliciting reasoning capabilities from Large Language Models (LLMs).

In-Context Learning

Self-supervised Event-based Monocular Depth Estimation using Cross-modal Consistency

no code implementations14 Jan 2024 Junyu Zhu, Lina Liu, Bofeng Jiang, Feng Wen, Hongbo Zhang, Wanlong Li, Yong liu

In this paper, to lower the annotation cost, we propose a self-supervised event-based monocular depth estimation framework named EMoDepth.

Depth Prediction Monocular Depth Estimation

Efficient Quantization Strategies for Latent Diffusion Models

no code implementations9 Dec 2023 Yuewei Yang, Xiaoliang Dai, Jialiang Wang, Peizhao Zhang, Hongbo Zhang

By treating the quantization discrepancy as relative noise and identifying sensitive part(s) of a model, we propose an efficient quantization approach encompassing both global and local strategies.

Quantization Text-to-Image Generation

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

1 code implementation16 Nov 2023 Junying Chen, Xidong Wang, Ke Ji, Anningzhe Gao, Feng Jiang, Shunian Chen, Hongbo Zhang, Dingjie Song, Wenya Xie, Chuyi Kong, Jianquan Li, Xiang Wan, Haizhou Li, Benyou Wang

We validate the new protocol in the domains where proprietary LLMs like ChatGPT perform relatively poorly, such as Traditional Chinese Medicine.

Domain Adaptation Language Modeling +1

An Early Evaluation of GPT-4V(ision)

1 code implementation25 Oct 2023 Yang Wu, Shilong Wang, Hao Yang, Tian Zheng, Hongbo Zhang, Yanyan Zhao, Bing Qin

In this paper, we evaluate different abilities of GPT-4V including visual understanding, language understanding, visual puzzle solving, and understanding of other modalities such as depth, thermal, video, and audio.

Math

Multi-step prediction of chlorophyll concentration based on Adaptive Graph-Temporal Convolutional Network with Series Decomposition

no code implementations13 Sep 2023 Ying Chen, Xiao Li, Hongbo Zhang, Wenyang Song, Chongxuan Xv

The adaptive graph convolution learns the relationship between different water quality parameters, updates the state information of each parameter, and improves the learning ability of the update relationship between nodes.

Decision Making

Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation

1 code implementation28 Jun 2023 Chen Tang, Hongbo Zhang, Tyler Loakman, Chenghua Lin, Frank Guerin

Further analysis also shows that our representation learning framework can fill the semantic gap by coagulating representations of both text and graph knowledge.

Dialogue Generation Graph Attention +3

HuatuoGPT, towards Taming Language Model to Be a Doctor

2 code implementations24 May 2023 Hongbo Zhang, Junying Chen, Feng Jiang, Fei Yu, Zhihong Chen, Jianquan Li, Guiming Chen, Xiangbo Wu, Zhiyi Zhang, Qingying Xiao, Xiang Wan, Benyou Wang, Haizhou Li

Experimental results demonstrate that HuatuoGPT achieves state-of-the-art results in performing medical consultation among open-source LLMs in GPT-4 evaluation, human evaluation, and medical benchmark datasets.

Language Modeling Language Modelling +1

Injecting Knowledge into Biomedical Pre-trained Models via Polymorphism and Synonymous Substitution

1 code implementation24 May 2023 Hongbo Zhang, Xiang Wan, Benyou Wang

This gives us a hint that relational knowledge might not be redundant to the stored knowledge of PLMs, but rather be complementary.

Natural Language Reasoning, A Survey

1 code implementation26 Mar 2023 Fei Yu, Hongbo Zhang, Prayag Tiwari, Benyou Wang

This survey paper proposes a clearer view of natural language reasoning in the field of Natural Language Processing (NLP), both conceptually and practically.

Logical Reasoning Mathematical Reasoning +5

FG-Depth: Flow-Guided Unsupervised Monocular Depth Estimation

no code implementations20 Jan 2023 Junyu Zhu, Lina Liu, Yong liu, Wanlong Li, Feng Wen, Hongbo Zhang

The great potential of unsupervised monocular depth estimation has been demonstrated by many works due to low annotation cost and impressive accuracy comparable to supervised methods.

Image Reconstruction Monocular Depth Estimation +2

Terminology-aware Medical Dialogue Generation

1 code implementation27 Oct 2022 Chen Tang, Hongbo Zhang, Tyler Loakman, Chenghua Lin, Frank Guerin

In this paper, we propose a novel framework to improve medical dialogue generation by considering features centered on domain-specific terminology.

Dialogue Generation

GenLoco: Generalized Locomotion Controllers for Quadrupedal Robots

1 code implementation12 Sep 2022 Gilbert Feng, Hongbo Zhang, Zhongyu Li, Xue Bin Peng, Bhuvan Basireddy, Linzhu Yue, Zhitao Song, Lizhi Yang, Yunhui Liu, Koushil Sreenath, Sergey Levine

In this work, we introduce a framework for training generalized locomotion (GenLoco) controllers for quadrupedal robots.

Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds

1 code implementation23 Sep 2021 Xuemeng Yang, Hao Zou, Xin Kong, Tianxin Huang, Yong liu, Wanlong Li, Feng Wen, Hongbo Zhang

Specifically, the network takes a raw point cloud as input, and merges the features from the segmentation branch into the completion branch hierarchically to provide semantic information.

3D Semantic Scene Completion 3D Semantic Segmentation +4

SA-LOAM: Semantic-aided LiDAR SLAM with Loop Closure

no code implementations22 Jun 2021 Lin Li, Xin Kong, Xiangrui Zhao, Wanlong Li, Feng Wen, Hongbo Zhang, Yong liu

LiDAR-based SLAM system is admittedly more accurate and stable than others, while its loop closure detection is still an open issue.

3D Semantic Segmentation Loop Closure Detection

Uncertainty-aware INVASE: Enhanced Breast Cancer Diagnosis Feature Selection

1 code implementation4 May 2021 Jia-Xing Zhong, Hongbo Zhang

In this paper, we present an uncertainty-aware INVASE to quantify predictive confidence of healthcare problem.

feature selection Uncertainty Quantification

Open-set Intersection Intention Prediction for Autonomous Driving

no code implementations27 Feb 2021 Fei Li, Xiangxu Li, Jun Luo, Shiwei Fan, Hongbo Zhang

We capture map-centric features that correspond to intersection structures under a spatial-temporal graph representation, and use two MAAMs (mutually auxiliary attention module) that cover respectively lane-level and exitlevel intentions to predict a target that best matches intersection elements in map-centric feature space.

Autonomous Driving

Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets

no code implementations19 May 2020 Cong Fei, Bin Wang, Yuzheng Zhuang, Zongzhang Zhang, Jianye Hao, Hongbo Zhang, Xuewu Ji, Wulong Liu

Generative adversarial imitation learning (GAIL) has shown promising results by taking advantage of generative adversarial nets, especially in the field of robot learning.

Autonomous Vehicles Data Augmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.