1 code implementation • 2 Dec 2024 • Xueyang Li, Han Xiao, Weixiang Weng, Xiaowei Xu, Yiyu Shi
Radiologists usually rely on a series of multi-phase contrast-enhanced computed tomography (CECT) scans done during follow-up visits to perform early detection of the potential CRLM.
no code implementations • 16 Nov 2024 • Xudong Lu, Yinghao Chen, Cheng Chen, Hui Tan, Boheng Chen, Yina Xie, Rui Hu, Guanxin Tan, Renshou Wu, Yan Hu, Yi Zeng, Lei Wu, Liuyang Bian, Zhaoxiong Wang, Long Liu, Yanzhou Yang, Han Xiao, Aojun Zhou, Yafei Wen, Xiaoxin Chen, Shuai Ren, Hongsheng Li
To be specific, we redesign the dynamic resolution scheme adopted by mainstream MLLMs and implement system optimization for hardware-aware deployment to optimize model inference on mobile phones.
1 code implementation • 10 Oct 2024 • Guankun Wang, Han Xiao, Huxin Gao, Renrui Zhang, Long Bai, Xiaoxiao Yang, Zhen Li, Hongsheng Li, Hongliang Ren
In this paper, we design a hierarchical decomposition of ESD motion granularity and introduce a multi-level surgical motion dataset (CoPESD) for training LVLMs as the robotic Co-Pilot of Endoscopic Submucosal Dissection.
no code implementations • 16 Sep 2024 • Saba Sturua, Isabelle Mohr, Mohammad Kalim Akram, Michael Günther, Bo Wang, Markus Krimmel, Feng Wang, Georgios Mastrapas, Andreas Koukounas, Nan Wang, Han Xiao
We introduce jina-embeddings-v3, a novel text embedding model with 570 million parameters that achieves state-of-the-art performance on multilingual data and long-context retrieval tasks and supports context lengths of up to 8192 tokens.
1 code implementation • 7 Sep 2024 • Michael Günther, Isabelle Mohr, Daniel James Williams, Bo Wang, Han Xiao
Many use cases require retrieving smaller portions of text, and dense vector-based retrieval systems often perform better with shorter text segments, as the semantics are less likely to be over-compressed in the embeddings.
no code implementations • 29 Aug 2024 • Rohan Jha, Bo Wang, Michael Günther, Georgios Mastrapas, Saba Sturua, Isabelle Mohr, Andreas Koukounas, Mohammad Kalim Akram, Nan Wang, Han Xiao
Multi-vector dense models, such as ColBERT, have proven highly effective in information retrieval.
no code implementations • 3 Jul 2024 • Yuxiang Chai, Siyuan Huang, Yazhe Niu, Han Xiao, Liang Liu, Dingyu Zhang, Peng Gao, Shuai Ren, Hongsheng Li
To advance research on AI agents in mobile scenarios, we introduce the Android Multi-annotation EXpo (AMEX), a comprehensive, large-scale dataset designed for generalist mobile GUI-control agents.
no code implementations • 27 Jun 2024 • Han Xiao, Wenqiang Tian, Shi Jin, Wendong Liu, Jia Shen, Zhihua Shi, Zhi Zhang
In this paper, an interference cancellation based neural receiver for superimposed pilot (SIP) in multi-layer transmission is proposed, where the data and pilot are non-orthogonally superimposed in the same time-frequency resource.
1 code implementation • 5 Jun 2024 • Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng Gao
Lumina-T2X is a nascent family of Flow-based Large Diffusion Transformers that establishes a unified framework for transforming noise into various modalities, such as images and videos, conditioned on text instructions.
1 code implementation • 5 Jun 2024 • Martino Ciaperoni, Han Xiao, Aristides Gionis
Today, as increasingly complex predictive models are developed, simple rule sets remain a crucial tool to obtain interpretable predictions and drive high-stakes decision making.
no code implementations • 30 May 2024 • Andreas Koukounas, Georgios Mastrapas, Michael Günther, Bo Wang, Scott Martens, Isabelle Mohr, Saba Sturua, Mohammad Kalim Akram, Joan Fontanals Martínez, Saahil Ognawala, Susana Guzman, Maximilian Werk, Nan Wang, Han Xiao
Contrastive Language-Image Pretraining (CLIP) is widely used to train models to align images and texts in a common embedding space by mapping them to fixed-sized vectors.
3 code implementations • CVPR 2024 • Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao
To reduce the reliance on large-scale datasets, recent works in 3D segmentation resort to few-shot learning.
1 code implementation • 25 Mar 2024 • Hao Shao, Shengju Qian, Han Xiao, Guanglu Song, Zhuofan Zong, Letian Wang, Yu Liu, Hongsheng Li
To address these challenges, we collect and introduce the large-scale Visual CoT dataset comprising 438k question-answer pairs, annotated with intermediate bounding boxes highlighting key regions essential for answering the questions.
no code implementations • 6 Mar 2024 • Xiaoyan Hu, Pengle Wen, Han Xiao, Wenjie Wang, Kai-Kit Wong
By leveraging the SWIPT technique, the UAV can simultaneously transmit energy and the computing results during the downlink period.
no code implementations • 3 Mar 2024 • Adiba Orzikulova, Han Xiao, Zhipeng Li, Yukang Yan, Yuntao Wang, Yuanchun Shi, Marzyeh Ghassemi, Sung-Ju Lee, Anind K Dey, Xuhai "Orson" Xu
Participants preferred the adaptive interventions and rated the system highly on intervention time accuracy, effectiveness, and level of trust.
no code implementations • 26 Feb 2024 • Anchun Gui, Jian Li, Yong Dai, Nan Du, Han Xiao
Meanwhile, we propose a novel tool sampling strategy to enhance the generalizability of LLMs over unseen tools.
no code implementations • 26 Feb 2024 • Isabelle Mohr, Markus Krimmel, Saba Sturua, Mohammad Kalim Akram, Andreas Koukounas, Michael Günther, Georgios Mastrapas, Vinit Ravishankar, Joan Fontanals Martínez, Feng Wang, Qi Liu, Ziniu Yu, Jie Fu, Saahil Ognawala, Susana Guzman, Bo Wang, Maximilian Werk, Nan Wang, Han Xiao
We introduce a novel suite of state-of-the-art bilingual text embedding models that are designed to support English and another target language.
no code implementations • 30 Jan 2024 • Tiannan Wang, Jiamin Chen, Qingrui Jia, Shuai Wang, Ruoyu Fang, Huilin Wang, Zhaowei Gao, Chunzhao Xie, Chuou Xu, Jihong Dai, Yibin Liu, Jialong Wu, Shengwei Ding, Long Li, Zhiwei Huang, Xinle Deng, Teng Yu, Gangan Ma, Han Xiao, Zixin Chen, Danjun Xiang, Yunxia Wang, Yuanyuan Zhu, Yi Xiao, Jing Wang, Yiru Wang, Siran Ding, Jiayang Huang, Jiayi Xu, Yilihamu Tayier, Zhenyu Hu, Yuan Gao, Chengfeng Zheng, Yueshu Ye, Yihang Li, Lei Wan, Xinyue Jiang, Yujie Wang, Siyu Cheng, Zhule Song, Xiangru Tang, Xiaohua Xu, Ningyu Zhang, Huajun Chen, Yuchen Eleanor Jiang, Wangchunshu Zhou
Weaver is pre-trained on a carefully selected corpus that focuses on improving the writing capabilities of large language models.
1 code implementation • 13 Nov 2023 • Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao
We present SPHINX, a versatile multi-modal large language model (MLLM) with a joint mixing of model weights, tuning tasks, and visual embeddings.
Ranked #2 on Visual Question Answering on BenchLMM (using extra training data)
2 code implementations • 30 Oct 2023 • Michael Günther, Jackmin Ong, Isabelle Mohr, Alaeddine Abdessalem, Tanguy Abel, Mohammad Kalim Akram, Susana Guzman, Georgios Mastrapas, Saba Sturua, Bo Wang, Maximilian Werk, Nan Wang, Han Xiao
Text embedding models have emerged as powerful tools for transforming sentences into fixed-sized feature vectors that encapsulate semantic information.
no code implementations • 24 Oct 2023 • Han Xiao, Wenqiang Tian, Wendong Liu, Jiajia Guo, Zhi Zhang, Shi Jin, Zhihua Shi, Li Guo, Jia Shen
In this article, a knowledge-driven meta-learning approach is proposed, where the DL model initialized by the meta model obtained from the meta-training phase is able to achieve rapid convergence when facing a new scenario during the target retraining phase.
2 code implementations • 7 Sep 2023 • Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao
During training, we adopt a learnable bind network to align the embedding space between LLaMA and ImageBind's image encoder.
no code implementations • 20 Jul 2023 • Michael Günther, Louis Milliken, Jonathan Geuter, Georgios Mastrapas, Bo Wang, Han Xiao
Jina Embeddings constitutes a set of high-performance sentence embedding models adept at translating textual inputs into numerical representations, capturing the semantics of the text.
no code implementations • 17 May 2023 • Anchun Gui, Jinqiang Ye, Han Xiao
However, with the growth of model scale and the rising number of downstream tasks, this paradigm inevitably runs into challenges in computation cost and memory footprint.
no code implementations • 8 May 2023 • Anchun Gui, Han Xiao
To fully leverage the advantages of large-scale pre-trained language models (PLMs) on downstream tasks, it has become a ubiquitous adaptation paradigm to fine-tune all parameters of PLMs.
1 code implementation • 13 Apr 2023 • Ziwei Wang, Jiwen Lu, Han Xiao, Shengyu Liu, Jie Zhou
On the contrary, we obtain the optimal efficient networks by directly optimizing the compression policy with an accurate performance predictor, where ultrafast automated model compression for various computational cost constraints is achieved without complex compression policy search and evaluation.
no code implementations • 31 Jan 2023 • Han Xiao, Wenqiang Tian, Wendong Liu, Zhi Zhang, Zhihua Shi, Li Guo, Jia Shen
Recently, deep learning (DL) has been introduced to enhance CSI feedback in massive MIMO applications, where the massive collected training data and lengthy training time are costly and impractical for realistic deployment.
1 code implementation • ICCV 2023 • Han Xiao, Wenzhao Zheng, Zheng Zhu, Jie Zhou, Jiwen Lu
Data mixing strategies (e.g., CutMix) have shown the ability to greatly improve the performance of convolutional neural networks (CNNs).
1 code implementation • 4 Oct 2022 • Martino Ciaperoni, Han Xiao, Aristides Gionis
Multi-label classification is becoming increasingly ubiquitous, but not much attention has been paid to interpretability.
1 code implementation • CVPR 2022 • Han Xiao, Ziwei Wang, Zheng Zhu, Jie Zhou, Jiwen Lu
Differentiable architecture search (DARTS) acquires the optimal architectures by optimizing the architecture parameters with gradient descent, which significantly reduces the search cost.
no code implementations • 16 Jun 2022 • Han Xiao, Zhiqin Wang, Dexin Li, Wenqiang Tian, Xiaofeng Liu, Wendong Liu, Shi Jin, Jia Shen, Zhi Zhang, Ning Yang
This paper is based on the background of the 2nd Wireless Communication Artificial Intelligence (AI) Competition (WAIC), which is hosted by the IMT-2020(5G) Promotion Group 5G+AI Work Group, where the framework of the eigenvector-based channel state information (CSI) feedback problem is first provided.
no code implementations • 1 Jun 2022 • Huang Xiao, Battista Biggio, Blaine Nelson, Han Xiao, Claudia Eckert, Fabio Roli
Machine learning algorithms are increasingly being applied in security-related tasks such as spam and malware detection, although their security properties against deliberate attacks have not yet been widely understood.
1 code implementation • ICCV 2021 • Ziwei Wang, Han Xiao, Jiwen Lu, Jie Zhou
On the contrary, our GMPQ searches for a mixed-quantization policy that can be generalized to large-scale datasets with only a small amount of data, so that the search cost is significantly reduced without performance degradation.
no code implementations • 12 Jun 2021 • Han Xiao, Zhiqin Wang, Wenqiang Tian, Xiaofeng Liu, Wendong Liu, Shi Jin, Jia Shen, Zhi Zhang, Ning Yang
In this paper, we give a systematic description of the 1st Wireless Communication Artificial Intelligence (AI) Competition (WAIC) which is hosted by IMT-2020(5G) Promotion Group 5G+AI Work Group.
no code implementations • 20 Feb 2020 • Hanshu Cai, Yiwen Gao, Shuting Sun, Na Li, Fuze Tian, Han Xiao, Jianxiu Li, Zhengwu Yang, Xiaowei Li, Qinglin Zhao, Zhenyu Liu, Zhijun Yao, Minqiang Yang, Hong Peng, Jing Zhu, Xiaowei Zhang, Guoping Gao, Fang Zheng, Rui Li, Zhihua Guo, Rong Ma, Jing Yang, Lan Zhang, Xiping Hu, Yumin Li, Bin Hu
The EEG dataset includes not only data collected using a traditional 128-electrode mounted elastic cap, but also data from a novel wearable 3-electrode EEG collector for pervasive applications.
1 code implementation • 26 Jan 2020 • Han Xiao, Bruno Ordozgoiti, Aristides Gionis
In this paper we formulate the problem of finding local polarized communities in signed graphs as a locally-biased eigen-problem.
no code implementations • 6 Dec 2019 • Chencheng Cai, Rong Chen, Han Xiao
As an effective dimension reduction tool, singular value decomposition is often used to analyze high dimensional matrices, which are traditionally assumed to have a low rank matrix approximation.
no code implementations • 5 Dec 2019 • Chencheng Cai, Rong Chen, Han Xiao
Specifically, we propose to approximate a given matrix by the sum of a few Kronecker products of matrices, which we refer to as the Kronecker product approximation (KoPA).
no code implementations • 26 Nov 2019 • Chencheng Cai, Rong Chen, Han Xiao
A matrix completion problem is to recover the missing entries in a partially observed matrix.
no code implementations • 24 Apr 2019 • Yuhu Guo, Han Xiao, Yidong Chen, Xiaodong Shi
As an instance of an event-based camera, the Dynamic and Active-pixel Vision Sensor (DAVIS) combines a standard camera and an event-based camera.
3 code implementations • 17 Apr 2019 • Sujay Khandagale, Han Xiao, Rohit Babbar
In this paper, we develop a suite of algorithms, called Bonsai, which generalizes the notion of label representation in XMC, and partitions the labels in the representation space to learn shallow trees.
1 code implementation • 6 Sep 2018 • Han Xiao, Feng Wang, Jian-Feng Yan, Jingyao Zheng
The task of question answering or question generation aims to infer an answer or a question when given the counterpart based on context.
no code implementations • 9 Jan 2018 • Han Xiao
Traditionally, most complex intelligence architectures are extremely non-convex and therefore cannot be handled well by convex optimization.
no code implementations • 5 Jan 2018 • Han Xiao
However, to construct powerful intelligence systems with various methods, we propose the intelligence graph (iGraph for short), which is composed of both neural and probabilistic graphs, under the framework of forward-backward propagation.
no code implementations • 16 Dec 2017 • Han Xiao
Though traditional algorithms could be embedded into neural architectures with the proposed principle of \cite{xiao2017hungarian}, the variables that only occur in the condition of branch could not be updated as a special case.
no code implementations • 7 Dec 2017 • Han Xiao, Yidong Chen, Xiaodong Shi
However, lacking logic flow (e.g., if, for, while), traditional algorithms (e.g., the Hungarian algorithm, A* search, decision tree algorithms) could not be embedded into this paradigm, which limits the theories and applications.
37 code implementations • 25 Aug 2017 • Han Xiao, Kashif Rasul, Roland Vollgraf
We present Fashion-MNIST, a new dataset comprising 28x28 grayscale images of 70,000 fashion products from 10 categories, with 7,000 images per category.
no code implementations • 21 Feb 2017 • Han Xiao, Lian Meng
Recommendation systems are a common need in daily life, and matrix completion is a widely adopted technique for this task.
no code implementations • 27 Aug 2016 • Han Xiao, Minlie Huang, Xiaoyan Zhu
Since both aspects and categories are semantics-relevant, the collection of categories in each aspect is treated as the semantic representation of this triple.
no code implementations • 17 Apr 2016 • Han Xiao, Minlie Huang, Xiaoyan Zhu
To this end, this paper proposes a semantic representation method for knowledge graphs (KSR), which imposes a two-level hierarchical generative process that globally extracts many aspects and then locally assigns a specific category in each aspect for every triple.
no code implementations • 15 Dec 2015 • Han Xiao, Minlie Huang, Xiaoyan Zhu
Knowledge graph embedding aims at offering a numerical knowledge representation paradigm by transforming the entities and relations into a continuous vector space.
no code implementations • 18 Sep 2015 • Han Xiao, Minlie Huang, Yu Hao, Xiaoyan Zhu
Knowledge representation is a major topic in AI, and many studies attempt to represent entities and relations of a knowledge base in a continuous vector space.
no code implementations • 18 Sep 2015 • Han Xiao, Minlie Huang, Yu Hao, Xiaoyan Zhu
Recently, knowledge graph embedding, which projects symbolic entities and relations into a continuous vector space, has become a new, hot topic in artificial intelligence.
no code implementations • 11 Jun 2015 • Han Xiao, Xiaoyan Zhu
The margin-based principle was proposed long ago, and it has been proven that this principle can reduce structural risk and improve performance in both theoretical and practical aspects.
no code implementations • 11 Jun 2015 • Han Xiao, Xiaoyan Zhu
The entropy-based principle is the principle with which we can estimate an unknown distribution under some limited conditions.