Search Results for author: Jiahao Zhang

Found 27 papers, 6 papers with code

Manual-PA: Learning 3D Part Assembly from Instruction Diagrams

no code implementations27 Nov 2024 Jiahao Zhang, Anoop Cherian, Cristian Rodriguez, Weijian Deng, Stephen Gould

Assembling furniture amounts to solving the discrete-continuous optimization task of selecting the furniture parts to assemble and estimating their connecting poses in a physically realistic manner.

Contrastive Learning

An Energy-Based Self-Adaptive Learning Rate for Stochastic Gradient Descent: Enhancing Unconstrained Optimization with VAV method

no code implementations10 Nov 2024 Jiahao Zhang, Christian Moya, Guang Lin

Optimizing the learning rate remains a critical challenge in machine learning, essential for achieving model stability and efficient convergence.

Scheduling

EnJa: Ensemble Jailbreak on Large Language Models

no code implementations7 Aug 2024 Jiahao Zhang, Zilong Wang, Ruofan Wang, Xingjun Ma, Yu-Gang Jiang

As Large Language Models (LLMs) are increasingly being deployed in safety-critical applications, their vulnerability to potential jailbreaks -- malicious prompts that can disable the safety mechanism of LLMs -- has attracted growing research attention.

Safety Alignment

DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models

no code implementations4 Aug 2024 Bowen Wang, Jiuyang Chang, Yiming Qian, Guoxin Chen, JunHao Chen, Zhouqiang Jiang, Jiahao Zhang, Yuta Nakashima, Hajime Nagahara

Large language models (LLMs) have recently showcased remarkable capabilities, spanning a wide range of tasks and applications, including those in the medical domain.

Question Answering

Explainable Image Recognition via Enhanced Slot-attention Based Classifier

no code implementations8 Jul 2024 Bowen Wang, Liangzhi Li, Jiahao Zhang, Yuta Nakashima, Hajime Nagahara

A novel loss function specifically for ESCOUTER is designed to fine-tune the model's behavior, enabling it to toggle between positive and negative explanations.

Explainable artificial intelligence Explainable Artificial Intelligence (XAI)

Graph Unlearning with Efficient Partial Retraining

no code implementations12 Mar 2024 Jiahao Zhang, Lin Wang, Shijie Wang, Wenqi Fan

Graph Neural Networks (GNNs) have achieved remarkable success in various real-world applications.

Linear-Time Graph Neural Networks for Scalable Recommendations

1 code implementation21 Feb 2024 Jiahao Zhang, Rui Xue, Wenqi Fan, Xin Xu, Qing Li, Jian Pei, Xiaorui Liu

In this paper, we propose a Linear-Time Graph Neural Network (LTGNN) to scale up GNN-based recommender systems to achieve comparable scalability as classic MF approaches while maintaining GNNs' powerful expressiveness for superior prediction accuracy.

Graph Neural Network

Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs

no code implementations31 Jan 2024 Dingyi Dai, Yichi Zhang, Jiahao Zhang, Zhanqiu Hu, Yaohui Cai, Qi Sun, Zhiru Zhang

Quantization is a crucial technique for deploying deep learning models on resource-constrained devices, such as embedded FPGAs.

Deep Learning Quantization

Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference

no code implementations23 Dec 2023 Hongzheng Chen, Jiahao Zhang, Yixiao Du, Shaojie Xiang, Zichao Yue, Niansong Zhang, Yaohui Cai, Zhiru Zhang

Experimental results demonstrate our approach can achieve up to 13. 4x speedup when compared to previous FPGA-based accelerators for the BERT model.

Language Modelling Large Language Model

Learning Thresholds with Latent Values and Censored Feedback

no code implementations7 Dec 2023 Jiahao Zhang, Tao Lin, Weiqiang Zheng, Zhe Feng, Yifeng Teng, Xiaotie Deng

In this paper, we investigate a problem of actively learning threshold in latent space, where the unknown reward $g(\gamma, v)$ depends on the proposed threshold $\gamma$ and latent value $v$ and it can be $only$ achieved if the threshold is lower than or equal to the unknown latent value.

Instruct Me More! Random Prompting for Visual In-Context Learning

1 code implementation7 Nov 2023 Jiahao Zhang, Bowen Wang, Liangzhi Li, Yuta Nakashima, Hajime Nagahara

Our findings suggest that InMeMo offers a versatile and efficient way to enhance the performance of visual ICL with lightweight training.

Foreground Segmentation In-Context Learning +2

An Element-wise RSAV Algorithm for Unconstrained Optimization Problems

no code implementations7 Sep 2023 Shiheng Zhang, Jiahao Zhang, Jie Shen, Guang Lin

We present a novel optimization algorithm, element-wise relaxed scalar auxiliary variable (E-RSAV), that satisfies an unconditional energy dissipation law and exhibits improved alignment between the modified and the original energy.

End-to-End Beam Retrieval for Multi-Hop Question Answering

3 code implementations17 Aug 2023 Jiahao Zhang, Haiyang Zhang, Dongmei Zhang, Yong liu, Shen Huang

This approach models the multi-hop retrieval process in an end-to-end manner by jointly optimizing an encoder and two classification heads across all hops.

Language Modelling Large Language Model +3

Energy-Dissipative Evolutionary Deep Operator Neural Networks

no code implementations9 Jun 2023 Jiahao Zhang, Shiheng Zhang, Jie Shen, Guang Lin

For an objective operator G, the Branch net encodes different input functions u at the same number of sensors, and the Trunk net evaluates the output function at any location.

Operator learning

GoferBot: A Visual Guided Human-Robot Collaborative Assembly System

no code implementations18 Apr 2023 Zheyu Zhuang, Yizhak Ben-Shabat, Jiahao Zhang, Stephen Gould, Robert Mahony

It is composed of a visual servoing module that reaches and grasps assembly parts in an unstructured multi-instance and dynamic environment, an action recognition module that performs human action prediction for implicit communication, and a visual handover module that uses the perceptual understanding of human behaviour to produce an intuitive and efficient collaborative assembly experience.

Action Recognition

Aligning Step-by-Step Instructional Diagrams to Video Demonstrations

1 code implementation CVPR 2023 Jiahao Zhang, Anoop Cherian, Yanbin Liu, Yizhak Ben-Shabat, Cristian Rodriguez, Stephen Gould

In this paper, we consider a novel setting where such an alignment is between (i) instruction steps that are depicted as assembly diagrams (commonly seen in Ikea assembly manuals) and (ii) video segments from in-the-wild videos; these videos comprising an enactment of the assembly actions in the real world.

Contrastive Learning Image Retrieval +2

SPIDR: SDF-based Neural Point Fields for Illumination and Deformation

no code implementations15 Oct 2022 Ruofan Liang, Jiahao Zhang, Haoda Li, Chen Yang, Yushi Guan, Nandita Vijaykumar

To enable more accurate illumination updates after deformation, we use the shadow mapping technique to approximate the light visibility updates caused by geometry editing.

3D Reconstruction Lighting Estimation +2

RMFGP: Rotated Multi-fidelity Gaussian process with Dimension Reduction for High-dimensional Uncertainty Quantification

no code implementations11 Apr 2022 Jiahao Zhang, Shiqi Zhang, Guang Lin

This paper proposes a new dimension reduction framework based on rotated multi-fidelity Gaussian process regression and a Bayesian active learning scheme when the available precise observations are insufficient.

Active Learning Dimensionality Reduction +2

EvoGAN: An Evolutionary Computation Assisted GAN

1 code implementation22 Oct 2021 Feng Liu, HanYang Wang, Jiahao Zhang, Ziwang Fu, Aimin Zhou, Jiayin Qi, Zhibin Li

Quantitative and Qualitative results are presented on several compound expressions, and the experimental results demonstrate the feasibility and the potential of EvoGAN.

Image Generation

Analyzing Green View Index and Green View Index best path using Google Street View and deep learning

1 code implementation26 Apr 2021 Jiahao Zhang, Anqi Hu

As an important part of urban landscape research, analyzing and studying street-level greenery can increase the understanding of a city's greenery, contributing to better urban living environment planning and design.

Optical Character Recognition (OCR) Semantic Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.