Search Results for author: Bin Huang

Found 15 papers, 7 papers with code

GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing

2 code implementations • 9 Mar 2024 • Hao Lu, Xuesong Niu, Jiyao Wang, Yin Wang, Qingyong Hu, Jiaqi Tang, Yuting Zhang, Kaishen Yuan, Bin Huang, Zitong Yu, Dengbo He, Shuiguang Deng, Hao Chen, Yingcong Chen, Shiguang Shan

In conclusion, this paper provides valuable insights into the potential applications and challenges of MLLMs in human-centric computing.

Emotion Recognition • Facial Action Unit Detection • +4


Financial Time-Series Forecasting: Towards Synergizing Performance And Interpretability Within a Hybrid Machine Learning Approach

no code implementations • 31 Dec 2023 • Shun Liu, Kexin Wu, Chufeng Jiang, Bin Huang, Danqing Ma

In the realm of cryptocurrency, the prediction of Bitcoin prices has garnered substantial attention due to its potential impact on financial markets and investment strategies.

Time Series • Time Series Forecasting


VTimeLLM: Empower LLM to Grasp Video Moments

1 code implementation • 30 Nov 2023 • Bin Huang, Xin Wang, Hong Chen, Zihan Song, Wenwu Zhu

Large language models (LLMs) have shown remarkable text understanding capabilities, which have been extended as Video LLMs to handle video data for comprehending visual details.

Ranked #1 on Video-based Generative Performance Benchmarking (Detail Orientation) on VideoInstruct

Dense Video Captioning • Video-based Generative Performance Benchmarking (Consistency) • +5



Stage-by-stage Wavelet Optimization Refinement Diffusion Model for Sparse-View CT Reconstruction

1 code implementation • 30 Aug 2023 • Kai Xu, Shiyu Lu, Bin Huang, Weiwen Wu, Qiegen Liu

Diffusion models have emerged as potential tools to tackle the challenge of sparse-view CT reconstruction, displaying superior performance compared to conventional methods.


Mask Hierarchical Features For Self-Supervised Learning

no code implementations • 1 Apr 2023 • Fenggang Liu, Yangguang Li, Feng Liang, Jilan Xu, Bin Huang, Jing Shao

We mask a portion of the patches in the representation space and then use the sparse visible patches to reconstruct a high-semantic image representation.

object-detection • Object Detection • +1


Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

1 code implementation • 29 Jan 2023 • Yangguang Li, Bin Huang, Zeren Chen, Yufeng Cui, Feng Liang, Mingzhu Shen, Fenggang Liu, Enze Xie, Lu Sheng, Wanli Ouyang, Jing Shao

Our Fast-BEV consists of five parts. We propose (1) a lightweight, deployment-friendly view transformation that quickly transfers 2D image features to 3D voxel space, (2) a multi-scale image encoder that leverages multi-scale information for better performance, (3) an efficient BEV encoder that is specifically designed to speed up on-vehicle inference.

Data Augmentation



Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

1 code implementation • 19 Jan 2023 • Bin Huang, Yangguang Li, Enze Xie, Feng Liang, Luya Wang, Mingzhu Shen, Fenggang Liu, Tianqi Wang, Ping Luo, Jing Shao

Recently, pure camera-based Bird's-Eye-View (BEV) perception has removed the need for expensive LiDAR sensors, making it a feasible solution for economical autonomous driving.

Autonomous Driving • Data Augmentation



Parallel Reasoning Network for Human-Object Interaction Detection

no code implementations • 9 Jan 2023 • Huan Peng, Fenggang Liu, Yangguang Li, Bin Huang, Jing Shao, Nong Sang, Changxin Gao

Human-Object Interaction (HOI) detection aims to learn how humans interact with surrounding objects.

Human-Object Interaction Detection • Object • +1


One Sample Diffusion Model in Projection Domain for Low-Dose CT Imaging

2 code implementations • 7 Dec 2022 • Bin Huang, Liu Zhang, Shiyu Lu, Boyu Lin, Weiwen Wu, Qiegen Liu

Therefore, we propose a fully unsupervised one sample diffusion model (OSDM) in the projection domain for low-dose CT reconstruction.

Computed Tomography (CT)


Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report

no code implementations • 17 May 2021 • Andrey Ignatov, Kim Byeoung-su, Radu Timofte, Angeline Pouget, Fenglong Song, Cheng Li, Shuai Xiao, Zhongqian Fu, Matteo Maggioni, Yibin Huang, Shen Cheng, Xin Lu, Yifeng Zhou, Liangyu Chen, Donghao Liu, Xiangyu Zhang, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Bin Huang, Tianbao Zhou, Shuai Liu, Lei Lei, Chaoyu Feng, Liguang Huang, Zhikun Lei, Feifei Chen

A detailed description of all models developed in the challenge is provided in this paper.

Image Denoising


GraphTheta: A Distributed Graph Neural Network Learning System With Flexible Training Strategy

1 code implementation • 21 Apr 2021 • Yongchao Liu, Houyi Li, Guowei Zhang, Xintan Zeng, Yongyong Li, Bin Huang, Peng Zhang, Zhao Li, Xiaowei Zhu, Changhua He, WenGuang Chen

Herein, we present GraphTheta, the first distributed and scalable graph learning system built upon vertex-centric distributed graph processing with neural network operators implemented as user-defined functions.

Graph Learning


Global Strong Solutions to the Compressible Magnetohydrodynamic Equations with Slip Boundary Conditions in 3D Bounded Domains

no code implementations • 15 Feb 2021 • Yazhou Chen, Bin Huang, Xiaoding Shi

We deal with the barotropic compressible magnetohydrodynamic equations in a three-dimensional (3D) bounded domain with a slip boundary condition and vacuum.

Analysis of PDEs


Computational prediction of RNA tertiary structures using machine learning methods

no code implementations • 3 Sep 2020 • Bin Huang, Yuanyang Du, Shuai Zhang, Wenfei Li, Jun Wang, Jian Zhang

RNAs play crucial and versatile roles in biological processes.

BIG-bench Machine Learning


Decoding the mechanisms underlying cell-fate decision-making during stem cell differentiation by Random Circuit Perturbation

no code implementations • 31 Mar 2020 • Bin Huang, Mingyang Lu, Madeline Galbraith, Herbert Levine, Jose N. Onuchic, Dongya Jia

These gene states can be robustly predicted by the stemness GRN but not by randomized versions of the stemness GRN.

Decision Making


FoxNet: A Multi-face Alignment Method

no code implementations • 22 Apr 2019 • Yuxiang Wu, Zehua Cheng, Bin Huang, Yiming Chen, Xinghui Zhu, Weiyang Wang

Multi-face alignment aims to identify the geometric structures of multiple faces in an image, and its performance is essential for many practical tasks, such as face recognition, face tracking, and face animation.

Clustering • Face Alignment • +1

