Search Results for author: Yuwei Zhang

Found 36 papers, 12 papers with code

Attention Reveals More Than Tokens: Training-Free Long-Context Reasoning with Attention-guided Retrieval

no code implementations 12 Mar 2025 Yuwei Zhang, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang

Interestingly, we observe that the internal attention weights from the generated CoT tokens can effectively ground implicit facts, even when these facts are not explicitly recalled.

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

1 code implementation 10 Mar 2025 Lixue Gong, Xiaoxia Hou, Fanshi Li, Liang Li, Xiaochen Lian, Fei Liu, Liyang Liu, Wei Liu, Wei Lu, Yichun Shi, Shiqi Sun, Yu Tian, Zhi Tian, Peng Wang, Xun Wang, Ye Wang, Guofeng Wu, Jie Wu, Xin Xia, Xuefeng Xiao, Linjie Yang, Zhonghua Zhai, Xinyu Zhang, Qi Zhang, Yuwei Zhang, Shijia Zhao, Jianchao Yang, Weilin Huang

To address these limitations, we present Seedream 2.0, a native Chinese-English bilingual image generation foundation model that excels across diverse dimensions, which adeptly manages text prompt in both Chinese and English, supporting bilingual image generation and text rendering.

Image Generation Instruction Following +1

Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning

no code implementations 10 Mar 2025 Xintong Li, Jalend Bantupalli, Ria Dharmani, Yuwei Zhang, Jingbo Shang

There has been a surge in the use of large language model (LLM) conversational agents to generate responses based on long-term history from multiple sessions.

Retrieval

Object Detection for Medical Image Analysis: Insights from the RT-DETR Model

no code implementations 27 Jan 2025 Weijie He, Yuwei Zhang, Ting Xu, Tai An, Yingbin Liang, Bo Zhang

Deep learning has emerged as a transformative approach for solving complex pattern recognition and object detection challenges.

Diabetic Retinopathy Detection Medical Image Analysis +2

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

1 code implementation 14 Oct 2024 Di Wu, Hongwei Wang, Wenhao Yu, Yuwei Zhang, Kai-Wei Chang, Dong Yu

Recent large language model (LLM)-driven chat assistant systems have integrated memory components to track user-assistant chat histories, enabling more accurate and personalized responses.

Benchmarking Large Language Model +1

Jump Diffusion-Informed Neural Networks with Transfer Learning for Accurate American Option Pricing under Data Scarcity

no code implementations 26 Sep 2024 Qiguo Sun, Hanyue Huang, XiBei Yang, Yuwei Zhang

Moreover, the prevalent use of the Black-Scholes formula in hybrid models fails to accurately capture the discontinuity in the price process, limiting model performance, especially under scarce data conditions.

Bayesian Optimization Data Augmentation +1

Non-stationary BERT: Exploring Augmented IMU Data For Robust Human Activity Recognition

no code implementations 25 Sep 2024 Ning Sun, YuFei Wang, Yuwei Zhang, Jixiang Wan, Shenyue Wang, Ping Liu, Xudong Zhang

Human Activity Recognition (HAR) has gained great attention from researchers due to the popularity of mobile devices and the need to observe users' daily activity data for better human-computer interaction.

Data Augmentation Human Activity Recognition

Attention Mechanism and Context Modeling System for Text Mining Machine Translation

no code implementations 8 Aug 2024 Yuwei Zhang, Junming Huang, Sitong Liu, Zexi Chen, Zizheng Li

This paper advances a novel architectural schema anchored upon the Transformer paradigm and innovatively amalgamates the K-means categorization algorithm to augment the contextual apprehension capabilities of the schema.

Machine Translation Translation

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

no code implementations 11 Jul 2024 Zilong Wang, Zifeng Wang, Long Le, Huaixiu Steven Zheng, Swaroop Mishra, Vincent Perot, Yuwei Zhang, Anush Mattapalli, Ankur Taly, Jingbo Shang, Chen-Yu Lee, Tomas Pfister

Retrieval augmented generation (RAG) combines the generative abilities of large language models (LLMs) with external knowledge sources to provide more accurate and up-to-date responses.

ARC RAG +2

Towards Open Respiratory Acoustic Foundation Models: Pretraining and Benchmarking

1 code implementation 23 Jun 2024 Yuwei Zhang, Tong Xia, Jing Han, Yu Wu, Georgios Rizos, Yang Liu, Mohammed Mosuily, Jagmohan Chauhan, Cecilia Mascolo

Our pretrained models demonstrate superior performance (against existing acoustic models pretrained with general audio on 16 out of 19 tasks) and generalizability (to unseen datasets and new respiratory audio modalities).

Benchmarking

Monocular Localization with Semantics Map for Autonomous Vehicles

no code implementations 6 Jun 2024 Jixiang Wan, Xudong Zhang, Shuzhou Dong, Yuwei Zhang, Yuchen Yang, Ruoxi Wu, Ye Jiang, Jijunnan Li, Jinquan Lin, Ming Yang

To balance efficiency and accuracy, we propose a novel lightweight visual semantic localization algorithm that employs stable semantic features instead of low-level texture features.

Autonomous Driving Computational Efficiency +1

Tool-Planner: Task Planning with Clusters across Multiple Tools

1 code implementation 6 Jun 2024 Yanming Liu, Xinyue Peng, Jiannan Cao, Shi Bo, Yuwei Zhang, Xuhong Zhang, Sheng Cheng, Xun Wang, Jianwei Yin, Tianyu Du

Experiments show that our approach demonstrates a high pass and win rate across different datasets and optimizes the planning scheme for tool learning in models such as GPT-4 and Claude 3, showcasing the potential of our method.

Language Modelling Large Language Model +1

Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals

no code implementations 22 Apr 2024 Qingyang Wu, Ying Xu, Tingsong Xiao, Yunze Xiao, Yitong Li, Tianyang Wang, Yichi Zhang, Shanghai Zhong, Yuwei Zhang, Wei Lu, Yifan Yang

This study conducts a comprehensive review and analysis of the existing literature on the attitudes of LLMs towards the 17 SDGs, emphasizing the comparison between their attitudes and support for each goal and those of humans.

Decision Making

Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders

no code implementations 7 Mar 2024 Yuwei Zhang, Siffi Singh, Sailik Sengupta, Igor Shalyminov, Hang Su, Hwanjun Song, Saab Mansour

The triplet task gauges the model's understanding of two semantic concepts paramount in real-world conversational systems: negation and implicature.

Clustering intent-classification +3

A Closed-loop Brain-Machine Interface SoC Featuring a 0.2$μ$J/class Multiplexer Based Neural Network

no code implementations 7 Jan 2024 Chao Zhang, Yongxiang Guo, Dawid Sheng, Zhixiong Ma, Chao Sun, Yuwei Zhang, Wenxin Zhao, Fenyan Zhang, Tongfei Wang, Xing Sheng, Milin Zhang

This work presents the first fabricated electrophysiology-optogenetic closed-loop bidirectional brain-machine interface (CL-BBMI) system-on-chip (SoC) with electrical neural signal recording, on-chip sleep staging and optogenetic stimulation.

Sleep Staging

Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation

1 code implementation 14 Jul 2023 Letian Peng, Yuwei Zhang, Jingbo Shang

Prompting large language models (LLMs) for data augmentation has recently become a common practice in few-shot NLP tasks.

Aspect-Based Sentiment Analysis Attribute +9

ClusterLLM: Large Language Models as a Guide for Text Clustering

1 code implementation 24 May 2023 Yuwei Zhang, Zihan Wang, Jingbo Shang

First, we prompt ChatGPT for insights on clustering perspective by constructing hard triplet questions <does A better correspond to B than C>, where A, B and C are similar data points that belong to different clusters according to a small embedder.

Clustering Language Modelling +3

Toward Unsupervised Realistic Visual Question Answering

no code implementations ICCV 2023 Yuwei Zhang, Chih-Hui Ho, Nuno Vasconcelos

To resolve the first drawback, we propose a new testing dataset, RGQA, which combines AQs from an existing VQA dataset with around 29K human-annotated UQs.

Question Answering Visual Question Answering

Measurement of carbon finance level and exploration of its influencing factors

no code implementations 1 Jun 2022 Peng Zhang, Yuwei Zhang, Nuo Xu

Faced with increasingly severe environmental problems, carbon trading markets and related financial activities aiming at limiting carbon dioxide emissions are booming.

New Intent Discovery with Pre-training and Contrastive Learning

1 code implementation ACL 2022 Yuwei Zhang, Haode Zhang, Li-Ming Zhan, Xiao-Ming Wu, Albert Y. S. Lam

Existing approaches typically rely on a large amount of labeled utterances and employ pseudo-labeling methods for representation learning and clustering, which are label-intensive, inefficient, and inaccurate.

Clustering Contrastive Learning +3

Low-Dose CT Denoising Using a Structure-Preserving Kernel Prediction Network

no code implementations 31 May 2021 Lu Xu, Yuwei Zhang, Ying Liu, Daoye Wang, Mu Zhou, Jimmy Ren, Jingwei Wei, Zhaoxiang Ye

Low-dose CT has been a key diagnostic imaging modality to reduce the potential risk of radiation overdose to patient health.

Denoising Diagnostic

KuraNet: Systems of Coupled Oscillators that Learn to Synchronize

1 code implementation 6 May 2021 Matthew Ricci, Minju Jung, Yuwei Zhang, Mathieu Chalvidal, Aneri Soni, Thomas Serre

Here, we present a single approach to both of these problems in the form of "KuraNet", a deep-learning-based system of coupled oscillators that can learn to synchronize across a distribution of disordered network conditions.

Texture and Shape Biased Two-Stream Networks for Clothing Classification and Attribute Recognition

no code implementations CVPR 2020 Yuwei Zhang, Peng Zhang, Chun Yuan, Zhi Wang

We also find that texture features have an impelling effect on these tasks and that the pre-trained ImageNet model has good performance in extracting texture features.

Attribute Classification +1

Dataset of Segmented Nuclei in Hematoxylin and Eosin Stained Histopathology Images of 10 Cancer Types

1 code implementation 18 Feb 2020 Le Hou, Rajarsi Gupta, John S. Van Arnam, Yuwei Zhang, Kaustubh Sivalenka, Dimitris Samaras, Tahsin M. Kurc, Joel H. Saltz

To address this, we developed an analysis pipeline that segments nuclei in whole slide tissue images from multiple cancer types with a quality control process.

Segmentation
