no code implementations • 26 May 2025 • Jerry Yao-Chieh Hu, Xiwen Zhang, Maojiang Su, Zhao Song, Han Liu
We study the computational limits of learning $k$-bit Boolean functions (specifically, $\mathrm{AND}$, $\mathrm{OR}$, and their noisy variants), using a minimalist single-head softmax-attention mechanism, where $k=\Theta(d)$ relevant bits are selected from $d$ inputs.
1 code implementation • 24 Mar 2025 • Junteng Liu, Weihao Zeng, Xiwen Zhang, Yijun Wang, Zifei Shan, Junxian He
Chart understanding requires models to effectively analyze and reason about numerical data, textual elements, and complex visual components.
no code implementations • 10 Mar 2025 • Chaoran E, Chenghan Chen, Yuyang Shi, Haiyun Wang, Peixin Hua, Xiwen Zhang
(2)The pulsation time characteristic points predicted by the LSTM-Transformer Model shows a maximum prediction error of 1. 78ms, which is significantly lower than other methods.
no code implementations • 23 Dec 2024 • Wei Liu, Junlong Li, Xiwen Zhang, Fan Zhou, Yu Cheng, Junxian He
Our analysis leads to a set of best practices for each factor, aimed at optimizing multimodal reasoning.
1 code implementation • 7 Nov 2024 • Rongjie Yi, Xiang Li, Weikai Xie, Zhenyan Lu, Chenghua Wang, Ao Zhou, Shangguang Wang, Xiwen Zhang, Mengwei Xu
The interest in developing small language models (SLM) for on-device deployment is fast growing.
1 code implementation • 24 Sep 2024 • Zhenyan Lu, Xiang Li, Dongqi Cai, Rongjie Yi, Fangming Liu, Xiwen Zhang, Nicholas D. Lane, Mengwei Xu
Small language models (SLMs), despite their widespread adoption in modern smart devices, have received significantly less academic attention compared to their large language model (LLM) counterparts, which are predominantly deployed in data centers and cloud environments.
1 code implementation • 18 Jun 2024 • Yuxuan Tong, Xiwen Zhang, Rui Wang, Ruidong Wu, Junxian He
Solving mathematical problems requires advanced reasoning abilities and presents notable challenges for large language models.
Ranked #4 on
Natural Questions
on TheoremQA
(using extra training data)
2 code implementations • 12 Sep 2023 • Xingchao Liu, Xiwen Zhang, Jianzhu Ma, Jian Peng, Qiang Liu
Leveraging our new pipeline, we create, to the best of our knowledge, the first one-step diffusion-based text-to-image generator with SD-level image quality, achieving an FID (Frechet Inception Distance) of $23. 3$ on MS COCO 2017-5k, surpassing the previous state-of-the-art technique, progressive distillation, by a significant margin ($37. 2$ $\rightarrow$ $23. 3$ in FID).
1 code implementation • 2 Mar 2022 • Shenggan Cheng, Xuanlei Zhao, Guangyang Lu, Jiarui Fang, Zhongming Yu, Tian Zheng, Ruidong Wu, Xiwen Zhang, Jian Peng, Yang You
In this work, we present FastFold, an efficient implementation of AlphaFold for both training and inference.
1 code implementation • 22 Mar 2020 • Abu Shafin Mohammad Mahdee Jameel, Ahmed P. Mohamed, Xiwen Zhang, Aly El Gamal
We demonstrate a first example for employing deep learning in predicting frame errors for a Collaborative Intelligent Radio Network (CIRN) using a dataset collected during participation in the final scrimmages of the DARPA SC2 challenge.
no code implementations • 26 Dec 2019 • Xingchen Wang, Shengtai Ju, Xiwen Zhang, Sharan Ramjee, Aly El Gamal
We study efficient deep learning training algorithms that process received wireless signals, if a test Signal to Noise Ratio (SNR) estimate is available.
no code implementations • 2 Sep 2019 • Mengwei Xu, Xiwen Zhang, Yunxin Liu, Gang Huang, Xuanzhe Liu, Felix Xiaozhu Lin
Elf is a runtime for an energy-constrained camera to continuously summarize video scenes as approximate object counts.
Databases
1 code implementation • 16 May 2019 • Xiwen Zhang, Tolunay Seyfi, Shengtai Ju, Sharan Ramjee, Aly El Gamal, Yonina C. Eldar
We study the problem of interference source identification, through the lens of recognizing one of 15 different channels that belong to 3 different wireless technologies: Bluetooth, Zigbee, and WiFi.