Search Results for author: Kang Wang

Found 24 papers, 8 papers with code

Comprehensive Performance Evaluation of YOLOv11, YOLOv10, YOLOv9, YOLOv8 and YOLOv5 on Object Detection of Power Equipment

no code implementations28 Nov 2024 Zijian He, Kang Wang, Tian Fang, Lei Su, Rui Chen, Xihong Fei

With the rapid development of global industrial production, the demand for reliability in power equipment has been continuously increasing.

object-detection Object Detection

X-Recon: Learning-based Patient-specific High-Resolution CT Reconstruction from Orthogonal X-Ray Images

1 code implementation22 Jul 2024 Yunpeng Wang, Kang Wang, Yaoyao Zhuo, Weiya Shi, Fei Shan, Lei Liu

Rapid and accurate diagnosis of pneumothorax, utilizing chest X-ray and computed tomography (CT), is crucial for assisted diagnosis.

Computed Tomography (CT) CT Reconstruction +2

Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography

1 code implementation28 May 2024 Jie Liu, Yixiao Zhang, Kang Wang, Mehmet Can Yavuz, Xiaoxi Chen, Yixuan Yuan, Haoliang Li, Yang Yang, Alan Yuille, Yucheng Tang, Zongwei Zhou

However, these AI models often struggle with flexibility for partially annotated datasets and extensibility for new classes due to limitations in the one-hot encoding, architectural design, and learning scheme.

Computational Efficiency Computed Tomography (CT) +1

Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems

1 code implementation23 Apr 2024 Qihuang Zhong, Kang Wang, Ziyang Xu, Juhua Liu, Liang Ding, Bo Du

To this end, we propose a simple-yet-effective method, namely Deeply Understanding the Problems (DUP), to improve the LLMs' math problem-solving ability by addressing semantic misunderstanding errors.

 Ranked #1 on Math Word Problem Solving on SVAMP (Accuracy metric)

Arithmetic Reasoning GSM8K +2

Towards Multi-agent Reinforcement Learning based Traffic Signal Control through Spatio-temporal Hypergraphs

no code implementations17 Apr 2024 Kang Wang, Zhishu Shen, Zhen Lei, Tiehua Zhang

Traffic signal control systems (TSCSs) are integral to intelligent traffic management, fostering efficient vehicle flow.

Edge-computing Management +2

Nonparametric End-to-End Probabilistic Forecasting of Distributed Generation Outputs Considering Missing Data Imputation

no code implementations31 Mar 2024 Minghui Chen, Zichao Meng, Yanping Liu, Longbo Luo, Ye Guo, Kang Wang

In this paper, we introduce a nonparametric end-to-end method for probabilistic forecasting of distributed renewable generation outputs while including missing data imputation.

Imputation Missing Values

Efficient Polyp Segmentation Via Integrity Learning

no code implementations15 Sep 2023 Ziqiang Chen, Kang Wang, Yun Liu

This paper introduces the integrity concept in polyp segmentation at both macro and micro levels, aiming to alleviate integrity deficiency.

Boundary Detection Computational Efficiency +1

Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition

no code implementations9 Jun 2023 Xianzhao Chen, Yist Y. Lin, Kang Wang, Yi He, Zejun Ma

In this paper, we improve the frame-level classifier for word timings in E2E system by introducing label priors in connectionist temporal classification (CTC) loss, which is adopted from prior works, and combining low-level Mel-scale filter banks with high-level ASR encoder output as input feature.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

A Cross-Scale Hierarchical Transformer with Correspondence-Augmented Attention for inferring Bird's-Eye-View Semantic Segmentation

no code implementations7 Apr 2023 Naiyu Fang, Lemiao Qiu, Shuyou Zhang, Zili Wang, Kerui Hu, Kang Wang

To save the computation increase caused by this hierarchical framework, we exploit the cross-scale Transformer to learn feature relationships in a reversed-aligning way, and leverage the residual connection of BEV features to facilitate information transmission between scales.

Autonomous Driving Bird's-Eye View Semantic Segmentation +2

Less is more: a new machine-learning methodology for spatiotemporal systems

no code implementations Communications in Theoretical Physics Commnu. 2022 Sihan Feng, Kang Wang, Fuming Wang, Yong Zhang and Hong Zhao

Machine learning provides a way to use only portions of the variables of a spatiotemporal system to predict its subsequent evolution and consequently avoids the curse of dimensionality.

Time Series Weather Forecasting

Reinforcement Learning with Evolutionary Trajectory Generator: A General Approach for Quadrupedal Locomotion

1 code implementation14 Sep 2021 Haojie Shi, Bo Zhou, Hongsheng Zeng, Fan Wang, Yueqiang Dong, Jiangyong Li, Kang Wang, Hao Tian, Max Q. -H. Meng

However, due to the complex nonlinear dynamics in quadrupedal robots and reward sparsity, it is still difficult for RL to learn effective gaits from scratch, especially in challenging tasks such as walking over the balance beam.

reinforcement-learning Reinforcement Learning +1

Bayesian Eye Tracking

no code implementations25 Jun 2021 Qiang Ji, Kang Wang

Model-based eye tracking, however, is susceptible to eye feature detection errors, in particular for eye tracking in the wild.

Bayesian Inference Gaze Estimation

SHD360: A Benchmark Dataset for Salient Human Detection in 360° Videos

1 code implementation24 May 2021 Yi Zhang, Lu Zhang, Kang Wang, Wassim Hamidouche, Olivier Deforges

Salient human detection (SHD) in dynamic 360{\deg} immersive videos is of great importance for various applications such as robotics, inter-human and human-object interaction in augmented reality.

Human Detection Human-Object Interaction Detection +3

Towards Accurate RGB-D Saliency Detection with Complementary Attention and Adaptive Integration

no code implementations8 Feb 2021 Hong-Bo Bi, Zi-Qi Liu, Kang Wang, Bo Dong, Geng Chen, Ji-Quan Ma

In this paper, we propose Complementary Attention and Adaptive Integration Network (CAAI-Net), a novel RGB-D saliency detection model that integrates complementary attention based feature concentration and adaptive cross-modal feature fusion into a unified framework for accurate saliency detection.

Saliency Detection

Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech

no code implementations19 May 2020 Wenjie Li, Benlai Tang, Xiang Yin, Yushi Zhao, Wei Li, Kang Wang, Hao Huang, Yuxuan Wang, Zejun Ma

Accent conversion (AC) transforms a non-native speaker's accent into a native accent while maintaining the speaker's voice timbre.

Text to Speech

Neuro-Inspired Eye Tracking With Eye Movement Dynamics

no code implementations CVPR 2019 Kang Wang, Hui Su, Qiang Ji

In particular, we propose a novel Dynamic Gaze Transition Network (DGTN) to capture the underlying eye movement dynamics and serve as the topdown gaze prior.

Gaze Estimation

Generalizing Eye Tracking With Bayesian Adversarial Learning

no code implementations CVPR 2019 Kang Wang, Rui Zhao, Hui Su, Qiang Ji

Next, we extend the point-estimation based deterministic model to a Bayesian framework so that gaze estimation can be performed using all parameters instead of only one set of parameters.

Bayesian Inference Gaze Estimation

Real Time Eye Gaze Tracking With 3D Deformable Eye-Face Model

no code implementations ICCV 2017 Kang Wang, Qiang Ji

The key idea is to leverage on the proposed 3D eye-face model, from which we can estimate 3D eye gaze from observed 2D facial landmarks.

Face Model Gaze Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.