Search Results for author: Sihang Li

Found 17 papers, 12 papers with code

Uni-SMART: Universal Science Multimodal Analysis and Research Transformer

no code implementations • 15 Mar 2024 • Hengxing Cai, Xiaochen Cai, Shuwen Yang, Jiankun Wang, Lin Yao, Zhifeng Gao, Junhan Chang, Sihang Li, Mingjun Xu, Changxin Wang, Hongshuai Wang, Yongge Li, Mujie Lin, Yaqi Li, Yuqi Yin, Linfeng Zhang, Guolin Ke

Scientific literature often includes a wide range of multimodal elements, such as molecular structure, tables, and charts, which are hard for text-focused LLMs to understand and analyze.

SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis

no code implementations • 4 Mar 2024 • Hengxing Cai, Xiaochen Cai, Junhan Chang, Sihang Li, Lin Yao, Changxin Wang, Zhifeng Gao, Hongshuai Wang, Yongge Li, Mujie Lin, Shuwen Yang, Jiankun Wang, Yuqi Yin, Yaqi Li, Linfeng Zhang, Guolin Ke

Recent breakthroughs in Large Language Models (LLMs) have revolutionized natural language understanding and generation, igniting a surge of interest in leveraging these technologies in the field of scientific literature analysis.

Benchmarking, Memorization, +1

MolTC: Towards Molecular Relational Modeling In Language Models

1 code implementation • 6 Feb 2024 • Junfeng Fang, Shuai Zhang, Chang Wu, Zhengyi Yang, Zhiyuan Liu, Sihang Li, Kun Wang, Wenjie Du, Xiang Wang

Molecular Relational Learning (MRL), aiming to understand interactions between molecular pairs, plays a pivotal role in advancing biochemical research.

Relational Reasoning

Towards 3D Molecule-Text Interpretation in Language Models

1 code implementation • 25 Jan 2024 • Sihang Li, Zhiyuan Liu, Yanchen Luo, Xiang Wang, Xiangnan He, Kenji Kawaguchi, Tat-Seng Chua, Qi Tian

Through 3D molecule-text alignment and 3D molecule-centric instruction tuning, 3D-MoLM establishes an integration of 3D molecular encoder and LM.

Instruction Following, Language Modelling, +3

LLaRA: Large Language-Recommendation Assistant

1 code implementation • 5 Dec 2023 • Jiayi Liao, Sihang Li, Zhengyi Yang, Jiancan Wu, Yancheng Yuan, Xiang Wang, Xiangnan He

Treating the "sequential behaviors of users" as a distinct modality beyond texts, we employ a projector to align the traditional recommender's ID embeddings with the LLM's input space.

Language Modelling, Sequential Recommendation, +1

BianQue: Balancing the Questioning and Suggestion Ability of Health LLMs with Multi-turn Health Conversations Polished by ChatGPT

1 code implementation • 24 Oct 2023 • YiRong Chen, Zhenyu Wang, Xiaofen Xing, huimin zheng, Zhipei Xu, Kai Fang, Junhong Wang, Sihang Li, Jieling Wu, Qi Liu, Xiangmin Xu

Large language models (LLMs) have performed well in providing general and extensive health suggestions in single-turn conversations, exemplified by systems such as ChatGPT, ChatGLM, ChatDoctor, DoctorGLM, etc.

MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter

1 code implementation • 19 Oct 2023 • Zhiyuan Liu, Sihang Li, Yanchen Luo, Hao Fei, Yixin Cao, Kenji Kawaguchi, Xiang Wang, Tat-Seng Chua

MolCA enables an LM (e.g., Galactica) to understand both text- and graph-based molecular contents via the cross-modal projector.

Ranked #4 on Molecule Captioning on ChEBI-20

Contrastive Learning, IUPAC Name Prediction, +5

Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Semantic Segmentation

no code implementations • 7 Oct 2023 • Jingyi Pan, Sihang Li, Yucheng Chen, Jinjing Zhu, Lin Wang

Moreover, semantic segmentation models trained on daytime datasets often face difficulties in generalizing effectively to nighttime conditions.

Autonomous Driving, Contrastive Learning, +4

National Origin Discrimination in Deep-learning-powered Automated Resume Screening

no code implementations • 13 Jul 2023 • Sihang Li, Kuangzheng Li, Haibing Lu

Many companies and organizations have started to use some form of AI-enabled automated tools to assist in their hiring process, e.g., screening resumes, interviewing candidates, and performance evaluation.

Fairness

SSCBench: Monocular 3D Semantic Scene Completion Benchmark in Street Views

1 code implementation • 15 Jun 2023 • Yiming Li, Sihang Li, Xinhao Liu, Moonjun Gong, Kenan Li, Nuo Chen, Zijun Wang, Zhiheng Li, Tao Jiang, Fisher Yu, Yue Wang, Hang Zhao, Zhiding Yu, Chen Feng

Monocular scene understanding is a foundational component of autonomous systems.

3D Semantic Scene Completion, 3D Semantic Scene Completion from a single 2D image

SGDViT: Saliency-Guided Dynamic Vision Transformer for UAV Tracking

1 code implementation • 8 Mar 2023 • Liangliang Yao, Changhong Fu, Sihang Li, Guangze Zheng, Junjie Ye

The proposed method designs a new task-specific object saliency mining network to refine the cross-correlation operation and effectively discriminate foreground and background information.

Object Tracking

Continuity-Aware Latent Interframe Information Mining for Reliable UAV Tracking

1 code implementation • 8 Mar 2023 • Changhong Fu, Mutian Cai, Sihang Li, Kunhan Lu, Haobo Zuo, Chongjun Liu

To address the above issues, this work proposes a novel framework with continuity-aware latent interframe information mining for reliable UAV tracking, i.e., ClimRT.

Autonomous Navigation

Direct 3D information fusion for depth of field enhancement in optical-resolution photoacoustic microscopy

no code implementations • 24 Nov 2022 • Xianlin Song, Sihang Li, Zhuangzhuang Wang

In this work, a 3D information fusion algorithm based on 3D stationary wavelet transform and joint weighted evaluation optimization is proposed to fuse multi-focus photoacoustic data to achieve large-volumetric and high-resolution 3D imaging.

HighlightNet: Highlighting Low-Light Potential Features for Real-Time UAV Tracking

1 code implementation • 14 Aug 2022 • Changhong Fu, Haolin Dong, Junjie Ye, Guangze Zheng, Sihang Li, Jilin Zhao

Pixel-level range mask is introduced to make HighlightNet more focused on the enhancement of the tracking object and regions without light sources.

Image Enhancement

Local Perception-Aware Transformer for Aerial Tracking

1 code implementation • 1 Aug 2022 • Changhong Fu, Weiyu Peng, Sihang Li, Junjie Ye, Ziang Cao

Specifically, with local-modeling to global-search mechanism, the proposed tracker replaces the global encoder by a novel local-recognition encoder.

Inductive Bias, Visual Object Tracking

Let Invariant Rationale Discovery Inspire Graph Contrastive Learning

1 code implementation • 16 Jun 2022 • Sihang Li, Xiang Wang, An Zhang, Yingxin Wu, Xiangnan He, Tat-Seng Chua

Specifically, without supervision signals, RGCL uses a rationale generator to reveal salient features about graph instance-discrimination as the rationale, and then creates rationale-aware views for contrastive learning.

Contrastive Learning

Ad2Attack: Adaptive Adversarial Attack on Real-Time UAV Tracking

1 code implementation • 3 Mar 2022 • Changhong Fu, Sihang Li, Xinnan Yuan, Junjie Ye, Ziang Cao, Fangqiang Ding

Therefore, to help increase awareness of the potential risk and the robustness of UAV tracking, this work proposes a novel adaptive adversarial attack approach, i.e., Ad$^2$Attack, against UAV object tracking.

Adversarial Attack, Object Tracking, +2
