no code implementations • 9 Dec 2024 • Lan Li, Liri Fang, Vetle I. Torvik
Additionally, we propose a data cleaning benchmark to evaluate the capability of LLM agents to automatically generate workflows that address data cleaning purposes of varying difficulty levels.
1 code implementation • 30 Sep 2024 • Lan Li, Liri Fang, Yiren Liu, Vetle I. Torvik, Bertram Ludaescher
To address the T-Qs, T-KAER is designed to improve transparency by documenting the entity resolution processes in log files.
no code implementations • 21 May 2024 • Xin-Chun Li, Lan Li, De-Chuan Zhan
The loss landscape of deep neural networks (DNNs) is commonly considered complex and wildly fluctuated.
no code implementations • 21 May 2024 • Xin-Chun Li, Jin-Lin Tang, Bo Zhang, Lan Li, De-Chuan Zhan
Exploring the loss landscape offers insights into the inherent principles of deep neural networks (DNNs).
no code implementations • 20 Feb 2024 • Lan Li, Jinpeng Lv
We demonstrate a semantic search framework that achieves a comprehensive semantic understanding of the entire notebook's contents, enabling it to effectively handle various types of user queries.
no code implementations • 28 Jan 2024 • Sharib Ali, Yamid Espinel, Yueming Jin, Peng Liu, Bianca Güttner, Xukun Zhang, Lihua Zhang, Tom Dowrick, Matthew J. Clarkson, Shiting Xiao, Yifan Wu, Yijun Yang, Lei Zhu, Dai Sun, Lan Li, Micha Pfeiffer, Shahid Farid, Lena Maier-Hein, Emmanuel Buc, Adrien Bartoli
A total of 6 teams from 4 countries participated, whose proposed methods were evaluated on 16 images and two preoperative 3D models from two patients.
no code implementations • 27 Dec 2023 • Lan Li, Bowen Tao, Lu Han, De-Chuan Zhan, Han-Jia Ye
Differing from traditional semi-supervised learning, class-imbalanced semi-supervised learning presents two distinct challenges: (1) The imbalanced distribution of training samples leads to model bias towards certain classes, and (2) the distribution of unlabeled samples is unknown and potentially distinct from that of labeled samples, which further contributes to class bias in the pseudo-labels during training.
no code implementations • 15 Dec 2023 • Bowen Tao, Lan Li, Xin-Chun Li, De-Chuan Zhan
For each pseudo-labeled sample, we select positive and negative samples from labeled data instead of unlabeled data to compute contrastive loss.
no code implementations • 12 Jan 2023 • Liri Fang, Lan Li, Yiren Liu, Vetle I. Torvik, Bertram Ludäscher
Entity resolution has been an essential and well-studied task in data cleaning research for decades.
no code implementations • 25 Jul 2022 • Jingqun Tang, Wenming Qian, Luchuan Song, Xiena Dong, Lan Li, Xiang Bai
Text detection and recognition are essential components of a modern OCR system.
no code implementations • 26 Jul 2021 • Xin-Chun Li, Lan Li, De-Chuan Zhan, Yunfeng Shao, Bingshuai Li, Shaoming Song
Automatically mining sentiment tendency contained in natural language is a fundamental research to some artificial intelligent applications, where solutions alternate with challenges.
1 code implementation • 18 Jun 2020 • Hyoungwook Nam, Seung Byum Seo, Vikram Sharma Mailthody, Noor Michael, Lan Li
The model inductively generalizes on a variety of algorithmic tasks where state-of-the-art Transformer models fail to do so.