Search Results for author: Zihan Zhao

Found 22 papers, 12 papers with code

Learning Symmetry-Independent Jet Representations via Jet-Based Joint Embedding Predictive Architecture

no code implementations • 5 Dec 2024 • Subash Katel, Haoyang Li, Zihan Zhao, Raghav Kansal, Farouk Mokhtar, Javier Duarte

In high energy physics, self-supervised learning (SSL) methods have the potential to aid in the creation of machine learning models without the need for labeled datasets for a variety of tasks, including those related to jets -- narrow sprays of particles produced by quarks and gluons in high energy particle collisions.

Jet Tagging Self-Supervised Learning

SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models

1 code implementation • 9 Nov 2024 • Jinghan He, Haiyun Guo, Kuan Zhu, Zihan Zhao, Ming Tang, Jinqiao Wang

In this work, we first explore and emphasize the importance of attention weights in knowledge retention, and then propose a SElective attEntion-guided Knowledge Retention method (SEEKR) for data-efficient replay-based continual learning of large language models (LLMs).

Continual Learning

SciDFM: A Large Language Model with Mixture-of-Experts for Science

no code implementations • 27 Sep 2024 • Liangtai Sun, Danyu Luo, Da Ma, Zihan Zhao, Baocai Chen, Zhennan Shen, Su Zhu, Lu Chen, Xin Chen, Kai Yu

We further analyze the expert layers and show that the results of expert selection vary with data from different disciplines.

Language Modeling Language Modelling +2

ChemDFM-X: Towards Large Multimodal Model for Chemistry

no code implementations • 20 Sep 2024 • Zihan Zhao, Bo Chen, Jingpiao Li, Lu Chen, Liyang Wen, Pengyu Wang, Zichen Zhu, Danyang Zhang, Ziping Wan, Yansi Li, Zhongyang Dai, Xin Chen, Kai Yu

Rapid developments of AI tools are expected to offer unprecedented assistance to the research of natural science including chemistry.

model

Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

1 code implementation • 28 Feb 2024 • Hongshen Xu, Lu Chen, Zihan Zhao, Da Ma, Ruisheng Cao, Zichen Zhu, Kai Yu

Additionally, we propose several pre-training tasks to model the interaction among text, structure, and image modalities effectively.

document understanding Information Retrieval +1

ChemDFM: A Large Language Foundation Model for Chemistry

1 code implementation • 26 Jan 2024 • Zihan Zhao, Da Ma, Lu Chen, Liangtai Sun, Zihao Li, Yi Xia, Bo Chen, Hongshen Xu, Zichen Zhu, Su Zhu, Shuai Fan, Guodong Shen, Kai Yu, Xin Chen

In its utmost form, such a generalist AI chemist could be referred to as Chemical General Intelligence.

Form model

LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language Models

1 code implementation • 20 Aug 2023 • Zihan Zhao, Yiyang Jiang, Heyang Liu, Yanfeng Wang, Yu Wang

While Large Language Models (LLMs) have demonstrated commendable performance across a myriad of domains and tasks, existing LLMs still exhibit a palpable deficit in handling multimodal functionalities, especially for the Spoken Question Answering (SQA) task which necessitates precise alignment and deep interaction between speech and text features.

Multiple-choice Question Answering

Large Language Models Are Semi-Parametric Reinforcement Learning Agents

1 code implementation • NeurIPS 2023 • Danyang Zhang, Lu Chen, Situo Zhang, Hongshen Xu, Zihan Zhao, Kai Yu

By equipping the LLM with a long-term experience memory, REMEMBERER can exploit experiences from past episodes even for different task goals, which gives it an advantage over an LLM-based agent that relies on fixed exemplars or a transient working memory.

Language Modeling Language Modelling +3

Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction

2 code implementations • 14 May 2023 • Danyang Zhang, Zhennan Shen, Rui Xie, Situo Zhang, Tianbao Xie, Zihan Zhao, Siyuan Chen, Lu Chen, Hongshen Xu, Ruisheng Cao, Kai Yu

The Graphical User Interface (GUI) is pivotal for human interaction with the digital world, enabling efficient device control and the completion of complex tasks.

Language Modelling

Knowledge-aware Bayesian Co-attention for Multimodal Emotion Recognition

no code implementations • 20 Feb 2023 • Zihan Zhao, Yu Wang, Yanfeng Wang

Multimodal emotion recognition is a challenging research area that aims to fuse different modalities to predict human emotion.

Multimodal Emotion Recognition

Review for AI-based Open-Circuit Faults Diagnosis Methods in Power Electronics Converters

no code implementations • 26 Sep 2022 • Chuang Liu, Lei Kou, Guowei Cai, Zihan Zhao, Zhe Zhang

Power electronics converters have been widely used in aerospace systems, DC transmission, distributed energy, smart grids, and so forth, and their reliability has been a hot topic in academia and industry.

Fault Diagnosis

Multi-level Fusion of Wav2vec 2.0 and BERT for Multimodal Emotion Recognition

1 code implementation • 11 Jul 2022 • Zihan Zhao, Yanfeng Wang, Yu Wang

The research and applications of multimodal emotion recognition have become increasingly popular recently.

Multimodal Emotion Recognition Transfer Learning

An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models

no code implementations • 14 Oct 2020 • Zihan Zhao, Yuncong Liu, Lu Chen, Qi Liu, Rao Ma, Kai Yu

Recently, pre-trained language models like BERT have shown promising performance on multiple natural language processing tasks.

Clustering Quantization

From Pixel to Patch: Synthesize Context-aware Features for Zero-shot Semantic Segmentation

1 code implementation • 25 Sep 2020 • Zhangxuan Gu, Siyuan Zhou, Li Niu, Zihan Zhao, Liqing Zhang

Thus, we focus on zero-shot semantic segmentation, which aims to segment unseen objects with only category-level semantic representations provided for unseen categories.

Image Classification Segmentation +3
