no code implementations • Findings (EMNLP) 2021 • Shuxian Bi, Chaozhuo Li, Xiao Han, Zheng Liu, Xing Xie, Haizhen Huang, Zengxuan Wen
As the fundamental basis of sponsored search, relevance modeling has attracted increasing attention due to the tremendous practical value.
1 code implementation • ECCV 2020 • Liyi Chen, Weiwei Wu, Chenchen Fu, Xiao Han, Yuntao Zhang
Weakly supervised semantic segmentation with image-level labels has attracted a lot of attention recently because these labels are already available in most datasets.
no code implementations • EMNLP (LaTeCHCLfL, CLFL, LaTeCH) 2021 • Wenxiu Xie, John Lee, Fangqiong Zhan, Xiao Han, Chi-Yin Chow
In Chinese, the derivation may be marked either with the standard adverbial marker DI, or the non-standard marker DE.
1 code implementation • 11 Dec 2024 • Zijian Zhou, Shikun Liu, Xiao Han, Haozhe Liu, Kam Woh Ng, Tian Xie, Yuren Cong, Hang Li, Mengmeng Xu, Juan-Manuel Pérez-Rúa, Aditya Patel, Tao Xiang, Miaojing Shi, Sen He
Additionally, we show that our loss is model-agnostic and can be used to improve the performance of other diffusion models.
1 code implementation • 4 Dec 2024 • Yongcheng Li, Lingcong Cai, Ying Lu, Xianghua Fu, Xiao Han, Ma Li, Wenxing Lai, Xiangzhong Zhang, Xiaomao Fan
In real-world scenarios, blood cell image datasets often present the issues of domain shift and data imbalance, posing challenges for accurate blood cell identification.
no code implementations • 26 Nov 2024 • Yuncong Yang, Xiao Han, Yidong Chai, Reza Ebrahimi, Rouzbeh Behnia, Balaji Padmanabhan
Recent privacy regulations (e.g., GDPR) grant data subjects the "Right to Be Forgotten" (RTBF) and mandate companies to fulfill data erasure requests from data subjects.
1 code implementation • 23 Nov 2024 • Haochen Zhao, Xiangru Tang, Ziran Yang, Xiao Han, Xuanzhi Feng, Yueqing Fan, Senhao Cheng, Di Jin, Yilun Zhao, Arman Cohan, Mark Gerstein
To address this issue in the field of chemistry, we introduce ChemSafetyBench, a benchmark designed to evaluate the accuracy and safety of LLM responses.
no code implementations • 15 Nov 2024 • Yanhao Sun, Runze Tian, Xiao Han, Xinyao Liu, Yan Zhang, Kai Xu
With the emergence of large-scale Text-to-Image (T2I) models and implicit 3D representations like Neural Radiance Fields (NeRF), many text-driven generative editing methods based on NeRF have appeared.
no code implementations • 8 Nov 2024 • Yuze He, Yanning Zhou, Wang Zhao, Zhongkai Wu, Kaiwen Xiao, Wei Yang, Yong-Jin Liu, Xiao Han
We present StdGEN, an innovative pipeline for generating semantically decomposed high-quality 3D characters from single images, enabling broad applications in virtual reality, gaming, and filmmaking, etc.
no code implementations • 26 Oct 2024 • Haozhe Liu, Shikun Liu, Zijian Zhou, Mengmeng Xu, Yanping Xie, Xiao Han, Juan C. Pérez, Ding Liu, Kumara Kahatapitiya, Menglin Jia, Jui-Chieh Wu, Sen He, Tao Xiang, Jürgen Schmidhuber, Juan-Manuel Pérez-Rúa
We introduce MarDini, a new family of video diffusion models that integrate the advantages of masked auto-regression (MAR) into a unified diffusion model (DM) framework.
1 code implementation • 21 Aug 2024 • Xiao Han, Xinfeng Zhang, Yiling Wu, Zhenduo Zhang, Zhe Wu
To this end, we introduce the Kolmogorov-Arnold Network (KAN) into time series forecasting research, which has better mathematical properties and interpretability.
no code implementations • 21 Aug 2024 • Xiao Han, Chen Zhu, Xiangyu Zhao, HengShu Zhu
Visual geo-localization demands in-depth knowledge and advanced reasoning skills to associate images with precise real-world geographic locations.
no code implementations • 19 Aug 2024 • Xiao Han, Zijian Zhang, Xiangyu Zhao, Yuanshao Zhu, Guojiang Shen, Xiangjie Kong, Xuetao Wei, Liqiang Nie, Jieping Ye
As urban residents demand higher travel quality, vehicle dispatch has become a critical component of online ride-hailing services.
no code implementations • 15 Aug 2024 • Xiao Han, Yiming Ren, Yichen Yao, Yujing Sun, Yuexin Ma
In this paper, we propose LiDAR-HMP, the first single-LiDAR-based 3D human motion prediction approach, which receives the raw LiDAR point cloud as input and forecasts future 3D human poses directly.
no code implementations • 13 Jul 2024 • Yiming Ren, Xiao Han, Yichen Yao, Xiaoxiao Long, Yujing Sun, Yuexin Ma
LiDAR-based human motion capture has garnered significant interest in recent years for its practicability in large-scale and unconstrained environments.
no code implementations • 9 Jul 2024 • Nan He, Weichen Xiong, Hanwen Liu, Yi Liao, Lei Ding, Kai Zhang, Guohua Tang, Xiao Han, Wei Yang
The effectiveness of large language models (LLMs) is often hindered by duplicated data in their extensive pre-training datasets.
1 code implementation • 2 Jul 2024 • Fei Shen, Hu Ye, Sibo Liu, Jun Zhang, Cong Wang, Xiao Han, Wei Yang
Moreover, RCDMs can generate consistent stories with a single forward inference compared to autoregressive models.
1 code implementation • 24 Jun 2024 • Xiao Han, Chen Zhu, Xiao Hu, Chuan Qin, Xiangyu Zhao, HengShu Zhu
To address this issue, we propose a novel session-based framework, BISTRO, to timely model user preference through fusion learning of semantic and behavioral information.
no code implementations • 4 Jun 2024 • Cong Wang, Kuan Tian, Jun Zhang, Yonghang Guan, Feng Luo, Fei Shen, Zhiwei Jiang, Qing Gu, Xiao Han, Wei Yang
In our work on portrait video generation, we identified audio signals as particularly weak, often overshadowed by stronger signals such as facial pose and reference image.
1 code implementation • 28 May 2024 • Bin Wang, Linke Ouyang, Fan Wu, Wenchang Ning, Xiao Han, Zhiyuan Zhao, Jiahui Peng, Yiying Jiang, Dahua Lin, Conghui He
In the era of artificial intelligence, the diversity of data modalities and annotation formats often renders data unusable directly, requiring understanding and format conversion before it can be used by researchers or developers with different needs.
1 code implementation • 27 May 2024 • Cong Wang, Kuan Tian, Yonghang Guan, Jun Zhang, Zhiwei Jiang, Fei Shen, Xiao Han, Qing Gu, Wei Yang
In this paper, we propose a novel ensembling method, Adaptive Feature Aggregation (AFA), which dynamically adjusts the contributions of multiple models at the feature level according to various states (i.e., prompts, initial noises, denoising steps, and spatial locations), thereby keeping the advantages of multiple diffusion models, while suppressing their disadvantages.
1 code implementation • 23 May 2024 • Pengyue Jia, Yiding Liu, Xiaopeng Li, Yuhao Wang, Yantong Du, Xiao Han, Xuetao Wei, Shuaiqiang Wang, Dawei Yin, Xiangyu Zhao
Worldwide geolocalization aims to locate the precise location at the coordinate level of photos taken anywhere on the Earth.
1 code implementation • 15 Apr 2024 • Bin Wang, Fei Deng, Peifan Jiang, Shuang Wang, Xiao Han, Zhixuan Zhang
Low-dose computed tomography (LDCT) has become the technology of choice for diagnostic medical imaging, given its lower radiation dose compared to standard CT, despite increasing image noise and potentially affecting diagnostic accuracy.
no code implementations • CVPR 2024 • Yiteng Xu, Kecheng Ye, Xiao Han, Yiming Ren, Xinge Zhu, Yuexin Ma
Human-centric Point Cloud Video Understanding (PVU) is an emerging field focused on extracting and interpreting human-related features from sequences of human point clouds, further advancing downstream human-centric tasks and applications.
no code implementations • CVPR 2024 • Yiming Ren, Xiao Han, Chengfeng Zhao, Jingya Wang, Lan Xu, Jingyi Yu, Yuexin Ma
For human-centric large-scale scenes, fine-grained modeling for 3D human global pose and shape is significant for scene understanding and can benefit many real-world applications.
1 code implementation • 31 Jan 2024 • Yucheng Wu, Leye Wang, Xiao Han, Han-Jia Ye
However, such stochastic augmentations may severely damage the intrinsic properties of a graph and deteriorate the following representation learning process.
2 code implementations • 2 Jan 2024 • Jie Zhu, Leye Wang, Xiao Han, Anmin Liu, Tao Xie
To mitigate this issue, AI software compression plays a crucial role, which aims to compress model size while keeping high performance.
no code implementations • 13 Nov 2023 • Chen Cheng, Xiao Han, Xin Tong, Yusheng Wu, Yiqing Xing
Opinions are influenced by neighbors, with varying degrees of emphasis based on their connections.
1 code implementation • 18 Oct 2023 • Feng Luo, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang
To alleviate the huge computational cost required by pixel-based diffusion SR, latent-based methods utilize a feature encoder to transform the image and then implement the SR image generation in a compact latent space.
no code implementations • 16 Oct 2023 • Kuan Tian, Yonghang Guan, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang
Due to the absence of autoregressive modeling and optical flow alignment, we can design an extremely minimalist framework that can greatly benefit computational efficiency.
1 code implementation • 10 Oct 2023 • Fei Shen, Hu Ye, Jun Zhang, Cong Wang, Xiao Han, Wei Yang
Specifically, in the first stage, we design a simple prior conditional diffusion model that predicts the global features of the target image by mining the global alignment relationship between pose coordinates and image appearance.
no code implementations • 28 Sep 2023 • Xiao Han, Lu Zhang, Yongkai Wu, Shuhan Yuan
Anomaly detection in multivariate time series has received extensive study due to the wide spectrum of applications.
1 code implementation • 25 Sep 2023 • Xiao Han, Shuhan Yuan, Mohamed Trabelsi
However, there is a gap between language modeling and anomaly detection as the objective of training a sequential model via a language modeling loss is not directly related to anomaly detection.
no code implementations • 20 Sep 2023 • Kuan Tian, Yonghang Guan, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang
First, to solve the problem of inconsistency of codec caused by the uncertainty of floating point calculations across platforms, we design a calibration transmitting system to guarantee the consistent quantization of entropy parameters between the encoding and decoding stages.
2 code implementations • 24 Aug 2023 • Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He
A practical solution to this problem would be to utilize the available multimodal large language models (MLLMs) to generate instruction data for vision-language tasks.
1 code implementation • 15 Aug 2023 • Yue Lv, Jinxi Xiang, Jun Zhang, Wenming Yang, Xiao Han, Wei Yang
We thus introduce a dynamic gating network on top of the low-rank adaptation method, in order to decide which decoder layer should employ adaptation.
4 code implementations • 13 Aug 2023 • Hu Ye, Jun Zhang, Sibo Liu, Xiao Han, Wei Yang
Despite the simplicity of our method, an IP-Adapter with only 22M parameters can achieve comparable or even better performance to a fully fine-tuned image prompt model.
Ranked #2 on Personalized Image Generation on DreamBooth
1 code implementation • 10 Jul 2023 • Jiate Liu, Yiqin Zhu, Kaiwen Xiao, Qiang Fu, Xiao Han, Wei Yang, Deheng Ye
The goal of program synthesis, or code generation, is to generate executable code based on given descriptions.
1 code implementation • CVPR 2023 • Xiao Han, Xiatian Zhu, Licheng Yu, Li Zhang, Yi-Zhe Song, Tao Xiang
In the fashion domain, there exists a variety of vision-and-language (V+L) tasks, including cross-modal retrieval, text-guided image retrieval, multi-modal classification, and image captioning.
1 code implementation • 4 Mar 2023 • Xiao Han, Lu Zhang, Yongkai Wu, Shuhan Yuan
Ensuring fairness in anomaly detection models has received much attention recently as many anomaly detection applications involve human beings.
1 code implementation • 24 Feb 2023 • Yi Ren, Xiao Han, Xu Zhao, Shenzheng Zhang, Yan Zhang
Therefore, the ranking stage is still essential for most applications to provide a high-quality candidate set for the re-ranking stage.
no code implementations • 13 Feb 2023 • Fei Kong, Xiyue Wang, Jinxi Xiang, Sen yang, Xinran Wang, Meng Yue, Jun Zhang, Junhan Zhao, Xiao Han, Yuhan Dong, Biyue Zhu, Fang Wang, Yueping Liu
We assessed the effectiveness of FACL in cancer diagnosis and Gleason grading tasks using 19,461 whole-slide images of prostate cancer from multiple centers.
no code implementations • 11 Feb 2023 • Ruiqing Ding, Fangjie Rong, Xiao Han, Leye Wang
In this paper, for a common disease in ICU patients, sepsis, we propose a novel cross-center collaborative learning framework guided by medical knowledge, SofaNet, to achieve early recognition of this disease.
no code implementations • 11 Feb 2023 • Chung-ju Huang, Leye Wang, Xiao Han
In order to improve the information-sharing capability and innovation of various healthcare-related institutions, and then to establish a next-generation open medical collaboration network, we propose a unified framework for vertical federated knowledge transfer mechanism (VFedTrans) based on a novel cross-hospital representation distillation component.
no code implementations • 12 Jan 2023 • Siteng Chen, Xiyue Wang, Jun Zhang, Liren Jiang, Ning Zhang, Feng Gao, Wei Yang, Jinxi Xiang, Sen yang, Junhua Zheng, Xiao Han
The OSrisk for the prediction of 5-year survival status achieved AUC of 0.784 (0.746-0.819) in the TCGA cohort, which was further verified in the independent General cohort and the CPTAC cohort, with AUC of 0.774 (0.723-0.820) and 0.702 (0.632-0.765), respectively.
no code implementations • ICCV 2023 • Xiao Han, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang
Controllable person image synthesis aims at rendering a source image based on user-specified changes in body pose or appearance.
no code implementations • CVPR 2023 • Zhe Qu, Xingyu Li, Xiao Han, Rui Duan, Chengchao Shen, Lixing Chen
Intuitively, these poor clients may come from biased universal information shared with others.
no code implementations • 10 Dec 2022 • Ruiqing Ding, Xiao Han, Leye Wang
We propose KnowledgeDA, a unified domain language model development service to enhance the task-specific training procedure with domain knowledge graphs.
1 code implementation • 8 Dec 2022 • Xiao Han, Lu Zhang, Yongkai Wu, Shuhan Yuan
After that, we further propose an anomaly mitigation approach that aims to recommend mitigation actions on abnormal features to revert the abnormal outcomes such that the counterfactuals guided by the causal mechanism are normal.
1 code implementation • 4 Dec 2022 • Boxuan Zhao, Jun Zhang, Deheng Ye, Jian Cao, Xiao Han, Qiang Fu, Wei Yang
Most of the existing methods rely on a multiple instance learning framework that requires densely sampling local patches at high magnification.
no code implementations • 2 Dec 2022 • Tianyu Qiu, Amir Jahangiri, Xiao Han, Dmitry Lesovoy, Tatiana Agback, Peter Agback, Adnane Achour, Xiaobo Qu, Vladislav Orekhov
Nuclear magnetic resonance (NMR) spectroscopy has become a formidable tool for biochemistry and medicine.
no code implementations • 29 Nov 2022 • Jie Fu, Zhili Chen, Xiao Han
The heterogeneity and convergence of training parameters were simply not considered.
no code implementations • 22 Nov 2022 • Xiao Han, Yiming Ren, Peishan Cong, Yujing Sun, Jingya Wang, Lan Xu, Yuexin Ma
Human gait recognition is crucial in multimedia, enabling identification through walking patterns without direct interaction, enhancing the integration across various media forms in real-world applications like smart homes, healthcare and non-intrusive security.
1 code implementation • 18 Oct 2022 • Zhoujin Tian, Chaozhuo Li, Shuo Ren, Zhiqiang Zuo, Zengxuan Wen, Xinyue Hu, Xiao Han, Haizhen Huang, Denvy Deng, Qi Zhang, Xing Xie
Bilingual lexicon induction induces the word translations by aligning independently trained word embeddings in two languages.
1 code implementation • 13 Sep 2022 • Sen yang, Tao Shen, Yuqi Fang, Xiyue Wang, Jun Zhang, Wei Yang, Junzhou Huang, Xiao Han
The high-content image-based assay is commonly leveraged for identifying the phenotypic impact of genetic perturbations in the field of biology.
2 code implementations • 11 Aug 2022 • Jie Zhu, Leye Wang, Xiao Han
By simulating the attack mechanism as the safety test, SafeCompress can automatically compress a big model to a small one following the dynamic sparse training paradigm.
no code implementations • 4 Aug 2022 • Amir Jahangiri, Xiao Han, Dmitry Lesovoy, Tatiana Agback, Peter Agback, Adnane Achour, Vladislav Orekhov
A new deep neural network based on the WaveNet architecture (WNN) is presented, which is designed to grasp specific patterns in the NMR spectra.
1 code implementation • 1 Aug 2022 • Xiao Han, Kam Woh Ng, Sauradip Nag, Zhiyu Qu
Large-scale weakly supervised product retrieval is a practically useful yet computationally challenging problem.
1 code implementation • 17 Jul 2022 • Xiao Han, Licheng Yu, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang
We thus propose a Multi-View Contrastive Learning task for pulling closer the visual representation of one image to the compositional multimodal representation of another image+text.
no code implementations • 28 May 2022 • Xiao Han, Leye Wang, Junjie Wu, Yuncong Yang
Basically, we propose to perturb the original network by adding or removing links, and expect the embedding generated on the perturbed network can leak little information about private links but hold high utility for various downstream tasks.
no code implementations • 7 Apr 2022 • Siteng Chen, Jinxi Xiang, Xiyue Wang, Jun Zhang, Sen yang, Junzhou Huang, Wei Yang, Junhua Zheng, Xiao Han
The MC-TMB algorithm also exhibited good generalization on the external validation cohort with an AUC of 0.732 (0.683-0.761), and better performance when compared to other methods.
1 code implementation • 6 Apr 2022 • Xiao Han, Sen He, Li Zhang, Yi-Zhe Song, Tao Xiang
In this paper, we propose a Unified Interactive Garment Retrieval (UIGR) framework to unify TGR and VCR.
1 code implementation • CVPR 2022 • Yonghang Guan, Jun Zhang, Kuan Tian, Sen yang, Pei Dong, Jinxi Xiang, Wei Yang, Junzhou Huang, Yuyao Zhang, Xiao Han
In this paper, we propose a hierarchical global-to-local clustering strategy to build a Node-Aligned GCN (NAGCN) to represent WSI with rich local structural information as well as global distribution.
no code implementations • 15 Dec 2021 • Xiao Han, Leye Wang, Junjie Wu, Xiao Fang
In response, we propose FedValue, a privacy-preserving, task-specific but model-free data valuation method for VFL, which consists of a data valuation metric and a federated computation method.
no code implementations • 15 Dec 2021 • Xiao Han, Yuncong Yang, Junjie Wu
We then design a novel hybrid protection mechanism called HyObscure, to cross-iteratively optimize the generalization and obfuscation operations for maximum privacy protection under a certain utility guarantee.
1 code implementation • 4 Dec 2021 • Renzhen Wang, De Cai, Kaiwen Xiao, Xixi Jia, Xiao Han, Deyu Meng
Existing methods commonly address hierarchical classification by decoupling it into a series of multi-class classification tasks.
no code implementations • 23 Nov 2021 • Xin Zhang, Zixuan Liu, Kaiwen Xiao, Tian Shen, Junzhou Huang, Wei Yang, Dimitris Samaras, Xiao Han
Labels are costly and sometimes unreliable.
Ranked #5 on Image Classification on mini WebVision 1.0
1 code implementation • 20 Oct 2021 • Xiao Han, Sen He, Li Zhang, Tao Xiang
Firstly, to fully utilize the existing small-scale benchmarking datasets for more discriminative feature learning, we introduce a cross-modal momentum contrastive learning framework to enrich the training data for a given mini-batch.
Ranked #12 on Text based Person Retrieval on CUHK-PEDES (using extra training data)
1 code implementation • 9 Dec 2020 • Biyang Guo, Songqiao Han, Xiao Han, Hailiang Huang, Ting Lu
LCM can learn label confusion to capture semantic overlap among labels by calculating the similarity between instances and labels during training and generate a better label distribution to replace the original one-hot label vector, thus improving the final classification performance.
no code implementations • 6 Nov 2020 • Leye Wang, Han Yu, Xiao Han
In particular, we first propose a federated crowdsensing framework, which analyzes the privacy concerns of each crowdsensing stage (i.e., task creation, task assignment, task execution, and data aggregation) and discusses how federated learning techniques may take effect.
no code implementations • 15 Sep 2020 • Jun Zhang, Kuan Tian, Pei Dong, Haocheng Shen, Kezhou Yan, Jianhua Yao, Junzhou Huang, Xiao Han
Recently, artificial intelligence (AI) has been used in the diagnosis of various diseases to improve diagnostic accuracy and reliability, but the interpretation of diagnosis results is still an open problem.
no code implementations • 27 Jul 2020 • Cailian Deng, Xuming Fang, Xiao Han, Xianbin Wang, Li Yan, Rong He, Yan Long, Yuchen Guo
Due to the related stringent requirements, supporting these applications over wireless local area network (WLAN) is far beyond the capabilities of the new WLAN standard -- IEEE 802.11ax.
1 code implementation • EMNLP (sdp) 2020 • Edwin Zhang, Nikhil Gupta, Raphael Tang, Xiao Han, Ronak Pradeep, Kuang Lu, Yue Zhang, Rodrigo Nogueira, Kyunghyun Cho, Hui Fang, Jimmy Lin
We present Covidex, a search engine that exploits the latest neural ranking models to provide information access to the COVID-19 Open Research Dataset curated by the Allen Institute for AI.
no code implementations • 19 Dec 2019 • Xiao Han, ZiHao Wang, Enmei Tu, Gunnam Suryanarayana, Jie Yang
Deep learning demands a huge amount of well-labeled data to train the network parameters.
no code implementations • 19 Oct 2019 • Xiao Han, Ruiqing Ding, Leye Wang, Hailiang Huang
Credit investigation is critical for financial services.
no code implementations • 3 Oct 2019 • Jianqing Fan, Yingying Fan, Xiao Han, Jinchi Lv
Both tests are of the Hotelling-type statistics based on the rows of empirical eigenvectors or their ratios, whose asymptotic covariance matrices are very challenging to derive and estimate.
no code implementations • 12 Jul 2018 • Xiao Han, Chen Yidong, Shi Xiaodong
In one stage, the user and item are represented from multiple perspectives and in each perspective, the representations of user and item put attentions to each other.
no code implementations • 24 Apr 2017 • Xiao Han
Liver lesion segmentation is an important step for liver cancer diagnosis, treatment planning and treatment evaluation.
1 code implementation • journal 2017 • Xiao Han
Applying a trained model to generate a complete sCT volume for each new patient MR image only took 9 s, which was much faster than the atlas‐based approach.