Search Results for author: Han Xiao

Found 56 papers, 19 papers with code

MPBD-LSTM: A Predictive Model for Colorectal Liver Metastases Using Time Series Multi-phase Contrast-Enhanced CT Scans

1 code implementation2 Dec 2024 Xueyang Li, Han Xiao, Weixiang Weng, Xiaowei Xu, Yiyu Shi

Radiologists usually rely on a series of multi-phase contrast-enhanced computed tomography (CECT) scans done during follow-up visits to perform early detection of the potential CRLM.

Time Series

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

no code implementations16 Nov 2024 Xudong Lu, Yinghao Chen, Cheng Chen, Hui Tan, Boheng Chen, Yina Xie, Rui Hu, Guanxin Tan, Renshou Wu, Yan Hu, Yi Zeng, Lei Wu, Liuyang Bian, Zhaoxiong Wang, Long Liu, Yanzhou Yang, Han Xiao, Aojun Zhou, Yafei Wen, Xiaoxin Chen, Shuai Ren, Hongsheng Li

To be specific, we redesign the dynamic resolution scheme adopted by mainstream MLLMs and implement system optimization for hardware-aware deployment to optimize model inference on mobile phones.

Quantization

CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection

1 code implementation10 Oct 2024 Guankun Wang, Han Xiao, Huxin Gao, Renrui Zhang, Long Bai, Xiaoxiao Yang, Zhen Li, Hongsheng Li, Hongliang Ren

In this paper, we design a hierarchical decomposition of ESD motion granularity and introduce a multi-level surgical motion dataset (CoPESD) for training LVLMs as the robotic \textbf{Co}-\textbf{P}ilot of \textbf{E}ndoscopic \textbf{S}ubmucosal \textbf{D}issection.

Instruction Following

jina-embeddings-v3: Multilingual Embeddings With Task LoRA

no code implementations16 Sep 2024 Saba Sturua, Isabelle Mohr, Mohammad Kalim Akram, Michael Günther, Bo wang, Markus Krimmel, Feng Wang, Georgios Mastrapas, Andreas Koukounas, Nan Wang, Han Xiao

We introduce jina-embeddings-v3, a novel text embedding model with 570 million parameters, achieves state-of-the-art performance on multilingual data and long-context retrieval tasks, supporting context lengths of up to 8192 tokens.

Representation Learning Retrieval +1

Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models

1 code implementation7 Sep 2024 Michael Günther, Isabelle Mohr, Daniel James Williams, Bo wang, Han Xiao

Many use cases require retrieving smaller portions of text, and dense vector-based retrieval systems often perform better with shorter text segments, as the semantics are less likely to be over-compressed in the embeddings.

Chunking Retrieval

AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents

no code implementations3 Jul 2024 Yuxiang Chai, Siyuan Huang, Yazhe Niu, Han Xiao, Liang Liu, Dingyu Zhang, Peng Gao, Shuai Ren, Hongsheng Li

To advance research on AI agents in mobile scenarios, we introduce the Android Multi-annotation EXpo (AMEX), a comprehensive, large-scale dataset designed for generalist mobile GUI-control agents.

Interference Cancellation Based Neural Receiver for Superimposed Pilot in Multi-Layer Transmission

no code implementations27 Jun 2024 Han Xiao, Wenqiang Tian, Shi Jin, Wendong Liu, Jia Shen, Zhihua Shi, Zhi Zhang

In this paper, an interference cancellation based neural receiver for superimposed pilot (SIP) in multi-layer transmission is proposed, where the data and pilot are non-orthogonally superimposed in the same time-frequency resource.

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT

1 code implementation5 Jun 2024 Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng Gao

Lumina-T2X is a nascent family of Flow-based Large Diffusion Transformers that establishes a unified framework for transforming noise into various modalities, such as images and videos, conditioned on text instructions.

Point Cloud Generation Text-to-Image Generation

Efficient Exploration of the Rashomon Set of Rule Set Models

1 code implementation5 Jun 2024 Martino Ciaperoni, Han Xiao, Aristides Gionis

Today, as increasingly complex predictive models are developed, simple rule sets remain a crucial tool to obtain interpretable predictions and drive high-stakes decision making.

Decision Making Efficient Exploration +1

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

1 code implementation25 Mar 2024 Hao Shao, Shengju Qian, Han Xiao, Guanglu Song, Zhuofan Zong, Letian Wang, Yu Liu, Hongsheng Li

To address these challenges, we collect and introduce the large-scale Visual CoT dataset comprising 438k question-answer pairs, annotated with intermediate bounding boxes highlighting key regions essential for answering the questions.

Visual Question Answering (VQA)

Maximizing Energy Charging for UAV-assisted MEC Systems with SWIPT

no code implementations6 Mar 2024 Xiaoyan Hu, Pengle Wen, Han Xiao, Wenjie Wang, Kai-Kit Wong

By leveraging the SWIPT technique, the UAV can simultaneously transmit energy and the computing results during the downlink period.

Edge-computing Scheduling

Time2Stop: Adaptive and Explainable Human-AI Loop for Smartphone Overuse Intervention

no code implementations3 Mar 2024 Adiba Orzikulova, Han Xiao, Zhipeng Li, Yukang Yan, Yuntao Wang, Yuanchun Shi, Marzyeh Ghassemi, Sung-Ju Lee, Anind K Dey, Xuhai "Orson" Xu

Participants preferred the adaptive interventions and rated the system highly on intervention time accuracy, effectiveness, and level of trust.

Knowledge-driven Meta-learning for CSI Feedback

no code implementations24 Oct 2023 Han Xiao, Wenqiang Tian, Wendong Liu, Jiajia Guo, Zhi Zhang, Shi Jin, Zhihua Shi, Li Guo, Jia Shen

In this article, a knowledge-driven meta-learning approach is proposed, where the DL model initialized by the meta model obtained from meta training phase is able to achieve rapid convergence when facing a new scenario during target retraining phase.

Meta-Learning

Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models

no code implementations20 Jul 2023 Michael Günther, Louis Milliken, Jonathan Geuter, Georgios Mastrapas, Bo wang, Han Xiao

Jina Embeddings constitutes a set of high-performance sentence embedding models adept at translating textual inputs into numerical representations, capturing the semantics of the text.

Negation Retrieval +6

G-Adapter: Towards Structure-Aware Parameter-Efficient Transfer Learning for Graph Transformer Networks

no code implementations17 May 2023 Anchun Gui, Jinqiang Ye, Han Xiao

However, with the growth of model scale and the rising number of downstream tasks, this paradigm inevitably meets the challenges in terms of computation consumption and memory footprint issues.

Inductive Bias parameter-efficient fine-tuning +1

HiFi: High-Information Attention Heads Hold for Parameter-Efficient Model Adaptation

no code implementations8 May 2023 Anchun Gui, Han Xiao

To fully leverage the advantages of large-scale pre-trained language models (PLMs) on downstream tasks, it has become a ubiquitous adaptation paradigm to fine-tune the entire parameters of PLMs.

parameter-efficient fine-tuning Vocal Bursts Intensity Prediction

Learning Accurate Performance Predictors for Ultrafast Automated Model Compression

1 code implementation13 Apr 2023 Ziwei Wang, Jiwen Lu, Han Xiao, Shengyu Liu, Jie zhou

On the contrary, we obtain the optimal efficient networks by directly optimizing the compression policy with an accurate performance predictor, where the ultrafast automated model compression for various computational cost constraint is achieved without complex compression policy search and evaluation.

Image Classification Model Compression +3

A Knowledge-Driven Meta-Learning Method for CSI Feedback

no code implementations31 Jan 2023 Han Xiao, Wenqiang Tian, Wendong Liu, Zhi Zhang, Zhihua Shi, Li Guo, Jia Shen

Recently, deep learning (DL) has been introduced to enhance CSI feedback in massive MIMO application, where the massive collected training data and lengthy training time are costly and impractical for realistic deployment.

Meta-Learning

Token-Label Alignment for Vision Transformers

1 code implementation ICCV 2023 Han Xiao, Wenzhao Zheng, Zheng Zhu, Jie zhou, Jiwen Lu

Data mixing strategies (e. g., CutMix) have shown the ability to greatly improve the performance of convolutional neural networks (CNNs).

Image Classification Semantic Segmentation +1

Concise and interpretable multi-label rule sets

1 code implementation4 Oct 2022 Martino Ciaperoni, Han Xiao, Aristides Gionis

Multi-label classification is becoming increasingly ubiquitous, but not much attention has been paid to interpretability.

Diversity Multi-Label Classification

Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search

1 code implementation CVPR 2022 Han Xiao, Ziwei Wang, Zheng Zhu, Jie zhou, Jiwen Lu

Differentiable architecture search (DARTS) acquires the optimal architectures by optimizing the architecture parameters with gradient descent, which significantly reduces the search cost.

Neural Architecture Search

AI Enlightens Wireless Communication: A Transformer Backbone for CSI Feedback

no code implementations16 Jun 2022 Han Xiao, Zhiqin Wang, Dexin Li, Wenqiang Tian, Xiaofeng Liu, Wendong Liu, Shi Jin, Jia Shen, Zhi Zhang, Ning Yang

This paper is based on the background of the 2nd Wireless Communication Artificial Intelligence (AI) Competition (WAIC) which is hosted by IMT-2020(5G) Promotion Group 5G+AIWork Group, where the framework of the eigenvector-based channel state information (CSI) feedback problem is firstly provided.

Data Augmentation

Support Vector Machines under Adversarial Label Contamination

no code implementations1 Jun 2022 Huang Xiao, Battista Biggio, Blaine Nelson, Han Xiao, Claudia Eckert, Fabio Roli

Machine learning algorithms are increasingly being applied in security-related tasks such as spam and malware detection, although their security properties against deliberate attacks have not yet been widely understood.

Active Learning BIG-bench Machine Learning +1

Generalizable Mixed-Precision Quantization via Attribution Rank Preservation

1 code implementation ICCV 2021 Ziwei Wang, Han Xiao, Jiwen Lu, Jie zhou

On the contrary, our GMPQ searches the mixed-quantization policy that can be generalized to largescale datasets with only a small amount of data, so that the search cost is significantly reduced without performance degradation.

Quantization

AI Enlightens Wireless Communication: Analyses, Solutions and Opportunities on CSI Feedback

no code implementations12 Jun 2021 Han Xiao, Zhiqin Wang, Wenqiang Tian, Xiaofeng Liu, Wendong Liu, Shi Jin, Jia Shen, Zhi Zhang, Ning Yang

In this paper, we give a systematic description of the 1st Wireless Communication Artificial Intelligence (AI) Competition (WAIC) which is hosted by IMT-2020(5G) Promotion Group 5G+AI Work Group.

Quantization

MODMA dataset: a Multi-modal Open Dataset for Mental-disorder Analysis

no code implementations20 Feb 2020 Hanshu Cai, Yiwen Gao, Shuting Sun, Na Li, Fuze Tian, Han Xiao, Jianxiu Li, Zhengwu Yang, Xiaowei Li, Qinglin Zhao, Zhenyu Liu, Zhijun Yao, Minqiang Yang, Hong Peng, Jing Zhu, Xiaowei Zhang, Guoping Gao, Fang Zheng, Rui Li, Zhihua Guo, Rong Ma, Jing Yang, Lan Zhang, Xiping Hu, Yumin Li, Bin Hu

The EEG dataset includes not only data collected using traditional 128-electrodes mounted elastic cap, but also a novel wearable 3-electrode EEG collector for pervasive applications.

EEG

Searching for polarization in signed graphs: a local spectral approach

1 code implementation26 Jan 2020 Han Xiao, Bruno Ordozgoiti, Aristides Gionis

In this paper we formulate the problem of finding local polarized communities in signed graphs as a locally-biased eigen-problem.

Hybrid Kronecker Product Decomposition and Approximation

no code implementations6 Dec 2019 Chencheng Cai, Rong Chen, Han Xiao

As an effective dimension reduction tool, singular value decomposition is often used to analyze high dimensional matrices, which are traditionally assumed to have a low rank matrix approximation.

Dimensionality Reduction

KoPA: Automated Kronecker Product Approximation

no code implementations5 Dec 2019 Chencheng Cai, Rong Chen, Han Xiao

Specifically, we propose to approximate a given matrix by the sum of a few Kronecker products of matrices, which we refer to as the Kronecker product approximation (KoPA).

Denoising

Matrix Completion using Kronecker Product Approximation

no code implementations26 Nov 2019 Chencheng Cai, Rong Chen, Han Xiao

A matrix completion problem is to recover the missing entries in a partially observed matrix.

Collaborative Filtering Denoising +1

Asynchronous "Events" are Better For Motion Estimation

no code implementations24 Apr 2019 Yuhu Guo, Han Xiao, Yidong Chen, Xiaodong Shi

As an instance of event-based camera, Dynamic and Active-pixel Vision Sensor (DAVIS) combines a standard camera and an event-based camera.

Motion Estimation

Bonsai -- Diverse and Shallow Trees for Extreme Multi-label Classification

3 code implementations17 Apr 2019 Sujay Khandagale, Han Xiao, Rohit Babbar

In this paper, we develop a suite of algorithms, called Bonsai, which generalizes the notion of label representation in XMC, and partitions the labels in the representation space to learn shallow trees.

Classification Extreme Multi-Label Classification +2

Dual Ask-Answer Network for Machine Reading Comprehension

1 code implementation6 Sep 2018 Han Xiao, Feng Wang, Jian-Feng Yan, Jingyao Zheng

The task of question answering or question generation aims to infer an answer or a question when given the counterpart based on context.

Machine Reading Comprehension Question Answering +2

Convexification of Neural Graph

no code implementations9 Jan 2018 Han Xiao

Traditionally, most complex intelligence architectures are extremely non-convex, which could not be well performed by convex optimization.

Intelligence Graph

no code implementations5 Jan 2018 Han Xiao

However, to construct powerful intelligence systems with various methods, we propose the intelligence graph (short as \textbf{\textit{iGraph}}), which is composed by both of neural and probabilistic graph, under the framework of forward-backward propagation.

Diversity

NDT: Neual Decision Tree Towards Fully Functioned Neural Graph

no code implementations16 Dec 2017 Han Xiao

Though traditional algorithms could be embedded into neural architectures with the proposed principle of \cite{xiao2017hungarian}, the variables that only occur in the condition of branch could not be updated as a special case.

Hungarian Layer: Logics Empowered Neural Architecture

no code implementations7 Dec 2017 Han Xiao, Yidong Chen, Xiaodong Shi

However, lacking of logic flow (e. g. \textit{if, for, while}), traditional algorithms (e. g. \textit{Hungarian algorithm, A$^*$ searching, decision tress algorithm}) could not be embedded into this paradigm, which limits the theories and applications.

Sentence

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

37 code implementations25 Aug 2017 Han Xiao, Kashif Rasul, Roland Vollgraf

We present Fashion-MNIST, a new dataset comprising of 28x28 grayscale images of 70, 000 fashion products from 10 categories, with 7, 000 images per category.

Benchmarking BIG-bench Machine Learning

SAR: Semantic Analysis for Recommendation

no code implementations21 Feb 2017 Han Xiao, Lian Meng

Recommendation system is a common demand in daily life and matrix completion is a widely adopted technique for this task.

Matrix Completion

KSR: A Semantic Representation of Knowledge Graph within a Novel Unsupervised Paradigm

no code implementations27 Aug 2016 Han Xiao, Minlie Huang, Xiaoyan Zhu

Since both aspects and categories are semantics-relevant, the collection of categories in each aspect is treated as the semantic representation of this triple.

Entity Retrieval Knowledge Graph Embedding +2

SSP: Semantic Space Projection for Knowledge Graph Embedding with Text Descriptions

no code implementations17 Apr 2016 Han Xiao, Minlie Huang, Xiaoyan Zhu

To this end, this paper proposes a semantic representation method for knowledge graph \textbf{(KSR)}, which imposes a two-level hierarchical generative process that globally extracts many aspects and then locally assigns a specific category in each aspect for every triple.

Knowledge Graph Embedding Question Answering

From One Point to A Manifold: Knowledge Graph Embedding For Precise Link Prediction

no code implementations15 Dec 2015 Han Xiao, Minlie Huang, Xiaoyan Zhu

Knowledge graph embedding aims at offering a numerical knowledge representation paradigm by transforming the entities and relations into continuous vector space.

Knowledge Graph Embedding Link Prediction +1

TransA: An Adaptive Approach for Knowledge Graph Embedding

no code implementations18 Sep 2015 Han Xiao, Minlie Huang, Yu Hao, Xiaoyan Zhu

Knowledge representation is a major topic in AI, and many studies attempt to represent entities and relations of knowledge base in a continuous vector space.

Knowledge Graph Embedding Metric Learning +1

TransG : A Generative Mixture Model for Knowledge Graph Embedding

no code implementations18 Sep 2015 Han Xiao, Minlie Huang, Yu Hao, Xiaoyan Zhu

Recently, knowledge graph embedding, which projects symbolic entities and relations into continuous vector space, has become a new, hot topic in artificial intelligence.

Knowledge Graph Embedding Relation

Margin-Based Feed-Forward Neural Network Classifiers

no code implementations11 Jun 2015 Han Xiao, Xiaoyan Zhu

Margin-Based Principle has been proposed for a long time, it has been proved that this principle could reduce the structural risk and improve the performance in both theoretical and practical aspects.

Max-Entropy Feed-Forward Clustering Neural Network

no code implementations11 Jun 2015 Han Xiao, Xiaoyan Zhu

Entropy-Based Principle is the principle with which we could estimate the unknown distribution under some limited conditions.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.