Search Results for author: Zheng Liu

Found 140 papers, 69 papers with code

Matching-oriented Embedding Quantization For Ad-hoc Retrieval

1 code implementation EMNLP 2021 Shitao Xiao, Zheng Liu, Yingxia Shao, Defu Lian, Xing Xie

In this work, we propose the Matching-oriented Product Quantization (MoPQ), where a novel objective Multinoulli Contrastive Loss (MCL) is formulated.

Quantization Retrieval

Leveraging Bidding Graphs for Advertiser-Aware Relevance Modeling in Sponsored Search

no code implementations Findings (EMNLP) 2021 Shuxian Bi, Chaozhuo Li, Xiao Han, Zheng Liu, Xing Xie, Haizhen Huang, Zengxuan Wen

As the fundamental basis of sponsored search, relevance modeling has attracted increasing attention due to the tremendous practical value.

Marketing

Towards Generalizeable Semantic Product Search by Text Similarity Pre-training on Search Click Logs

no code implementations ECNLP (ACL) 2022 Zheng Liu, Wei zhang, Yan Chen, Weiyi Sun, Tianchuan Du, Benjamin Schroeder

Recently, semantic search has been successfully applied to E-commerce product search and the learned semantic space for query and product encoding are expected to generalize well to unseen queries or products.

text similarity

STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?

no code implementations31 Mar 2025 Yun Li, Yiming Zhang, Tao Lin, Xiangrui Liu, Wenxiao Cai, Zheng Liu, Bo Zhao

The use of Multimodal Large Language Models (MLLMs) as an end-to-end solution for Embodied AI and Autonomous Driving has become a prevailing trend.

Autonomous Driving

Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models

1 code implementation27 Mar 2025 Haoxiang Sun, Yingqian Min, Zhipeng Chen, Wayne Xin Zhao, Zheng Liu, Zhongyuan Wang, Lei Fang, Ji-Rong Wen

In recent years, the rapid development of large reasoning models has resulted in the saturation of existing benchmarks for evaluating mathematical reasoning, highlighting the urgent need for more challenging and rigorous evaluation frameworks.

Math Mathematical Reasoning

Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding

no code implementations24 Mar 2025 Xiangrui Liu, Yan Shu, Zheng Liu, Ao Li, Yang Tian, Bo Zhao

Despite advanced token compression techniques, existing multimodal large language models (MLLMs) still struggle with hour-long video understanding.

8k Self-Supervised Learning +1

Memory-enhanced Retrieval Augmentation for Long Video Understanding

no code implementations12 Mar 2025 Huaying Yuan, Zheng Liu, Minhao Qin, Hongjin Qian, Y Shu, Zhicheng Dou, Ji-Rong Wen

Retrieval-augmented generation (RAG) shows strong potential in addressing long-video understanding (LVU) tasks.

RAG Retrieval +1

HeGMN: Heterogeneous Graph Matching Network for Learning Graph Similarity

1 code implementation11 Mar 2025 Shilong Sang, Ke-Jia Chen, Zheng Liu

Graph similarity learning (GSL), also referred to as graph matching in many scenarios, is a fundamental problem in computer vision, pattern recognition, and graph learning.

Graph Learning Graph Matching +1

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

1 code implementation6 Mar 2025 Zhipeng Chen, Yingqian Min, Beichen Zhang, Jie Chen, Jinhao Jiang, Daixuan Cheng, Wayne Xin Zhao, Zheng Liu, Xu Miao, Yang Lu, Lei Fang, Zhongyuan Wang, Ji-Rong Wen

This approach achieves a remarkable accuracy of 86. 67% with greedy search on AIME 2024, underscoring its effectiveness in enhancing model capabilities.

Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation

no code implementations24 Feb 2025 Wenyuan Wu, Zheng Liu, Yong Chen, Chao Su, Dezhong Peng, Xu Wang

In recent years, the rapid development of deep neural networks has brought increased attention to the security and robustness of these models.

Adversarial Attack Diversity +1

MMTEB: Massive Multilingual Text Embedding Benchmark

1 code implementation19 Feb 2025 Kenneth Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, Jay Gala, Wissam Siblini, Dominik Krzemiński, Genta Indra Winata, Saba Sturua, Saiteja Utpala, Mathieu Ciancone, Marion Schaeffer, Gabriel Sequeira, Diganta Misra, Shreeya Dhakal, Jonathan Rystrøm, Roman Solomatin, Ömer Çağatan, Akash Kundu, Martin Bernstorff, Shitao Xiao, Akshita Sukhlecha, Bhavish Pahwa, Rafał Poświata, Kranthi Kiran GV, Shawon Ashraf, Daniel Auras, Björn Plüster, Jan Philipp Harries, Loïc Magne, Isabelle Mohr, Mariya Hendriksen, Dawei Zhu, Hippolyte Gisserot-Boukhlef, Tom Aarsen, Jan Kostkan, Konrad Wojtasik, Taemin Lee, Marek Šuppa, Crystina Zhang, Roberta Rocca, Mohammed Hamdy, Andrianos Michail, John Yang, Manuel Faysse, Aleksei Vatolin, Nandan Thakur, Manan Dey, Dipam Vasani, Pranjal Chitale, Simone Tedeschi, Nguyen Tai, Artem Snegirev, Michael Günther, Mengzhou Xia, Weijia Shi, Xing Han Lù, Jordan Clive, Gayatri Krishnakumar, Anna Maksimova, Silvan Wehrli, Maria Tikhonova, Henil Panchal, Aleksandr Abramov, Malte Ostendorff, Zheng Liu, Simon Clematide, Lester James Miranda, Alena Fenogenova, Guangyu Song, Ruqiya Bin Safi, Wen-Ding Li, Alessia Borghini, Federico Cassano, Hongjin Su, Jimmy Lin, Howard Yen, Lasse Hansen, Sara Hooker, Chenghao Xiao, Vaibhav Adlakha, Orion Weller, Siva Reddy, Niklas Muennighoff

MMTEB includes a diverse set of challenging, novel tasks such as instruction following, long-document retrieval, and code retrieval, representing the largest multilingual collection of evaluation tasks for embedding models to date.

Instruction Following Retrieval

HawkBench: Investigating Resilience of RAG Methods on Stratified Information-Seeking Tasks

no code implementations19 Feb 2025 Hongjin Qian, Zheng Liu, Chao GAO, Yankai Wang, Defu Lian, Zhicheng Dou

In real-world information-seeking scenarios, users have dynamic and diverse needs, requiring RAG systems to demonstrate adaptable resilience.

RAG

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos

no code implementations18 Feb 2025 Huaying Yuan, Jian Ni, Yueze Wang, Junjie Zhou, Zhengyang Liang, Zheng Liu, Zhao Cao, Zhicheng Dou, Ji-Rong Wen

In this work, we present MomentSeeker, a comprehensive benchmark to evaluate retrieval models' performance in handling general long-video moment retrieval (LVMR) tasks.

Moment Retrieval RAG +2

Does RAG Really Perform Bad For Long-Context Processing?

no code implementations17 Feb 2025 Kun Luo, Zheng Liu, Peitian Zhang, Hongjin Qian, Jun Zhao, Kang Liu

The efficient processing of long context poses a serious challenge for large language models (LLMs).

RAG Retrieval

Reinforced Information Retrieval

no code implementations17 Feb 2025 Chaofan Li, Zheng Liu, Jianlyv Chen, Defu Lian, Yingxia Shao

Specifically, the generator is reinforced to generate query augmentations that enhance the retriever's performance, while the retriever is trained to better discriminate the relevant documents identified by the generator.

Domain Adaptation Information Retrieval +1

O1 Embedder: Let Retrievers Think Before Action

no code implementations11 Feb 2025 Ruin Yan, Zheng Liu, Defu Lian

Inspired by this progress, we aim to develop similar capabilities for retrieval models, which hold great promise for tackling critical challenges in the field, including multi-task retrieval, zero-shot retrieval, and tasks requiring intensive reasoning of complex relationships.

Contrastive Learning Math +1

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

2 code implementations3 Jan 2025 Yifan Du, Zikang Liu, YiFan Li, Wayne Xin Zhao, Yuqi Huo, Bingning Wang, WeiPeng Chen, Zheng Liu, Zhongyuan Wang, Ji-Rong Wen

Moreover, it seems that such textual reasoning data can be even more effective than visual reasoning data in eliciting the slow-thinking capacities of MLLMs.

Language Modeling Language Modelling +1

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

1 code implementation19 Dec 2024 Junjie Zhou, Zheng Liu, Ze Liu, Shitao Xiao, Yueze Wang, Bo Zhao, Chen Jason Zhang, Defu Lian, Yongping Xiong

Despite the rapidly growing demand for multimodal retrieval, progress in this field remains severely constrained by a lack of training data.

Image Retrieval Retrieval +1

Boosting Long-Context Management via Query-Guided Activation Refilling

no code implementations17 Dec 2024 Hongjin Qian, Zheng Liu, Peitian Zhang, Zhicheng Dou, Defu Lian

ACRE constructs a Bi-layer KV Cache for long contexts, where the layer-1 (L1) cache compactly captures global information, and the layer-2 (L2) cache provides detailed and localized information.

Management

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

3 code implementations12 Dec 2024 Yingqian Min, Zhipeng Chen, Jinhao Jiang, Jie Chen, Jia Deng, Yiwen Hu, Yiru Tang, Jiapeng Wang, Xiaoxue Cheng, Huatong Song, Wayne Xin Zhao, Zheng Liu, Zhongyuan Wang, Ji-Rong Wen

We introduce an ``imitate, explore, and self-improve'' framework, denoted as \textbf{STILL-2}, as our primary technical approach to train the reasoning model.

Physics-informed Machine Learning for Battery Pack Thermal Management

no code implementations15 Nov 2024 Zheng Liu, Yuan Jiang, Yumeng Li, Pingfeng Wang

Battery thermal management systems can effectively control the temperature of batteries; therefore, the performance and safety can be ensured.

Management Physics-informed machine learning

AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant

1 code implementation11 Nov 2024 Yujia Zhou, Zheng Liu, Zhicheng Dou

The emergence of Large Language Models (LLMs) has significantly advanced natural language processing, but these models often generate factually incorrect information, known as "hallucination".

Decision Making Hallucination +3

SEGMN: A Structure-Enhanced Graph Matching Network for Graph Similarity Learning

no code implementations6 Nov 2024 Wenjun Wang, Jiacheng Lu, KeJia Chen, Zheng Liu, Shilong Sang

Equipped with a dual embedding learning module and a structure perception matching module, SEGMN achieves structure enhancement in both embedding learning and cross-graph matching.

Graph Matching Graph Similarity

AutoMIR: Effective Zero-Shot Medical Information Retrieval without Relevance Labels

1 code implementation26 Oct 2024 Lei LI, Xiangxu Zhang, Xiao Zhou, Zheng Liu

By benchmarking ten models on CMIRB, we establish a rigorous standard for evaluating medical information retrieval systems.

Benchmarking Information Retrieval +2

Elephant in the Room: Unveiling the Impact of Reward Model Quality in Alignment

no code implementations26 Sep 2024 Yan Liu, Xiaoyuan Yi, Xiaokang Chen, Jing Yao, Jingwei Yi, Daoguang Zan, Zheng Liu, Xing Xie, Tsung-Yi Ho

Despite the vital role reward models play in alignment, previous works have consistently overlooked their performance and used off-the-shelf reward models arbitrarily without verification, rendering the reward model ``\emph{an elephant in the room}''.

Making Text Embedders Few-Shot Learners

1 code implementation24 Sep 2024 Chaofan Li, Minghao Qin, Shitao Xiao, Jianlyu Chen, Kun Luo, Yingxia Shao, Defu Lian, Zheng Liu

To this end, we introduce a novel model bge-en-icl, which employs few-shot examples to produce high-quality text embeddings.

Decoder In-Context Learning

Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer

no code implementations21 Sep 2024 Zheng Liu, Jinchao Zhu, Nannan Li, Gao Huang

As the performances of linear classifiers are influenced by the relationship among samples, we employ graph regularization to improve the representations fed into the classifiers at early exits.

Transfer Learning

Trustworthiness in Retrieval-Augmented Generation Systems: A Survey

1 code implementation16 Sep 2024 Yujia Zhou, Yan Liu, Xiaoxi Li, Jiajie Jin, Hongjin Qian, Zheng Liu, Chaozhuo Li, Zhicheng Dou, Tsung-Yi Ho, Philip S. Yu

Retrieval-Augmented Generation (RAG) has quickly grown into a pivotal paradigm in the development of Large Language Models (LLMs).

Fairness Hallucination +3

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

1 code implementation9 Sep 2024 Hongjin Qian, Peitian Zhang, Zheng Liu, Kelong Mao, Zhicheng Dou

Retrieval-Augmented Generation (RAG) leverages retrieval tools to access external databases, thereby enhancing the generation quality of large language models (LLMs) through optimized context.

Memorization Question Answering +2

FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry

1 code implementation26 Aug 2024 Chunran Zheng, Wei Xu, Zuhao Zou, Tong Hua, Chongjian Yuan, Dongjiao He, Bingyang Zhou, Zheng Liu, Jiarong Lin, Fangcheng Zhu, Yunfan Ren, Rong Wang, Fanle Meng, Fu Zhang

The fusion of both visual and LiDAR measurements is based on a single unified voxel map where the LiDAR module constructs the geometric structure for registering new LiDAR scans and the visual module attaches image patches to the LiDAR points.

NeRF Visual Odometry

Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment

no code implementations22 Aug 2024 Kun Luo, Minghao Qin, Zheng Liu, Shitao Xiao, Jun Zhao, Kang Liu

In this work, we conduct a comprehensive empirical study on a wide range of retrieval tasks, including in domain accuracy, data efficiency, zero shot generalization, lengthy retrieval, instruction based retrieval, and multi task learning.

Multi-Task Learning Retrieval +1

SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models

1 code implementation30 Jul 2024 Zheng Liu, Hao Liang, Xijie Huang, Wentao Xiong, Qinhan Yu, Linzhuang Sun, Chong Chen, Conghui He, Bin Cui, Wentao Zhang

Crucially, our method's reliance on purely generated data ensures the preservation of privacy, achieving SoTA performance with just 100k data points (only 18% of the official dataset size).

Caption Generation Question Answering

Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly

1 code implementation15 Jun 2024 Yexin Liu, Zhengyang Liang, Yueze Wang, Xianfeng Wu, Feilong Tang, Muyang He, Jian Li, Zheng Liu, Harry Yang, SerNam Lim, Bo Zhao

To this end, we manually construct a benchmark with 12 categories and design evaluation metrics that assess the degree of error in MLLM responses even when the visual content is seemingly understood.

VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval

1 code implementation6 Jun 2024 Junjie Zhou, Zheng Liu, Shitao Xiao, Bo Zhao, Yongping Xiong

Thirdly, we introduce a multi-stage training algorithm, which first aligns the visual token embedding with the text encoder using massive weakly labeled data, and then develops multi-modal representation capability using the generated composed image-text data.

Image Retrieval Retrieval

MLVU: Benchmarking Multi-task Long Video Understanding

3 code implementations6 Jun 2024 Junjie Zhou, Yan Shu, Bo Zhao, Boya Wu, Zhengyang Liang, Shitao Xiao, Minghao Qin, Xi Yang, Yongping Xiong, Bo Zhang, Tiejun Huang, Zheng Liu

To address the above problems, we propose a new benchmark called MLVU (Multi-task Long Video Understanding Benchmark) for the comprehensive and in-depth evaluation of LVU.

Benchmarking Video Understanding

Compressing Lengthy Context With UltraGist

1 code implementation26 May 2024 Peitian Zhang, Zheng Liu, Shitao Xiao, Ninglu Shao, Qiwei Ye, Zhicheng Dou

Compressing lengthy context is a critical but technically challenging problem.

Few-Shot Learning

Are Long-LLMs A Necessity For Long-Context Tasks?

no code implementations24 May 2024 Hongjin Qian, Zheng Liu, Peitian Zhang, Kelong Mao, Yujia Zhou, Xu Chen, Zhicheng Dou

The learning and deployment of long-LLMs remains a challenging problem despite recent progresses.

Extending Llama-3's Context Ten-Fold Overnight

1 code implementation30 Apr 2024 Peitian Zhang, Ninglu Shao, Zheng Liu, Shitao Xiao, Hongjin Qian, Qiwei Ye, Zhicheng Dou

We extend the context length of Llama-3-8B-Instruct from 8K to 80K via QLoRA fine-tuning.

8k Retrieval

Understanding Privacy Risks of Embeddings Induced by Large Language Models

no code implementations25 Apr 2024 Zhihao Zhu, Ninglu Shao, Defu Lian, Chenwang Wu, Zheng Liu, Yi Yang, Enhong Chen

Large language models (LLMs) show early signs of artificial general intelligence but struggle with hallucinations.

Retrieval

FineFake: A Knowledge-Enriched Dataset for Fine-Grained Multi-Domain Fake News Detection

1 code implementation30 Mar 2024 Ziyi Zhou, XiaoMing Zhang, Litian Zhang, Jiacheng Liu, Senzhang Wang, Zheng Liu, Xi Zhang, Chaozhuo Li, Philip S. Yu

Existing benchmarks for fake news detection have significantly contributed to the advancement of models in assessing the authenticity of news content.

Domain Adaptation Fake News Detection

Negating Negatives: Alignment with Human Negative Samples via Distributional Dispreference Optimization

1 code implementation6 Mar 2024 Shitong Duan, Xiaoyuan Yi, Peng Zhang, Yan Liu, Zheng Liu, Tun Lu, Xing Xie, Ning Gu

Large language models (LLMs) have revolutionized the role of AI, yet pose potential social risks.

Extensible Embedding: A Flexible Multipler For LLM's Context Length

no code implementations18 Feb 2024 Ninglu Shao, Shitao Xiao, Zheng Liu, Peitian Zhang

2) Strong sample efficiency of training, which enables the embedding model to be learned in a cost-effective way.

Language Modeling Language Modelling

BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models

no code implementations18 Feb 2024 Kun Luo, Zheng Liu, Shitao Xiao, Kang Liu

In this work, we proposeExtensible Embedding, which realizes high-quality extension of LLM's context with strong flexibility and cost-effectiveness.

Chunking Language Modeling +2

Metacognitive Retrieval-Augmented Large Language Models

1 code implementation18 Feb 2024 Yujia Zhou, Zheng Liu, Jiajie Jin, Jian-Yun Nie, Zhicheng Dou

Drawing from cognitive psychology, metacognition allows an entity to self-reflect and critically evaluate its cognitive processes.

Response Generation Retrieval

Grounding Language Model with Chunking-Free In-Context Retrieval

no code implementations15 Feb 2024 Hongjin Qian, Zheng Liu, Kelong Mao, Yujia Zhou, Zhicheng Dou

These strategies not only improve the efficiency of the retrieval process but also ensure that the fidelity of the generated grounding text evidence is maintained.

Chunking Language Modeling +4

BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation

3 code implementations5 Feb 2024 Jianlv Chen, Shitao Xiao, Peitian Zhang, Kun Luo, Defu Lian, Zheng Liu

It can simultaneously perform the three common retrieval functionalities of embedding model: dense retrieval, multi-vector retrieval, and sparse retrieval, which provides a unified model foundation for real-world IR applications.

Retrieval Self-Knowledge Distillation

Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization

1 code implementation15 Jan 2024 Ninglu Shao, Shitao Xiao, Zheng Liu, Peitian Zhang

Extensible Tokenization stands as a midware in between of the tokenized context and the LLM, which transforms the raw token embeddings into the extensible embeddings.

Few-Shot Learning Language Modeling +1

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

1 code implementation12 Jan 2024 Yutao Zhu, Peitian Zhang, Chenghao Zhang, Yifei Chen, Binyu Xie, Zheng Liu, Ji-Rong Wen, Zhicheng Dou

Despite this, their application to information retrieval (IR) tasks is still challenging due to the infrequent occurrence of many IR-specific concepts in natural language.

Diversity document understanding +3

Long Context Compression with Activation Beacon

1 code implementation7 Jan 2024 Peitian Zhang, Zheng Liu, Shitao Xiao, Ninglu Shao, Qiwei Ye, Zhicheng Dou

In this paper, we propose Activation Beacon, a plug-in module for transformer-based LLMs that targets effective, efficient, and flexible compression of long contexts.

4k document understanding +2

Making Large Language Models A Better Foundation For Dense Retrieval

1 code implementation24 Dec 2023 Chaofan Li, Zheng Liu, Shitao Xiao, Yingxia Shao

LLaRA consists of two pretext tasks: EBAE (Embedding-Based Auto-Encoding) and EBAR (Embedding-Based Auto-Regression), where the text embeddings from LLM are used to reconstruct the tokens for the input sentence and predict the tokens for the next sentence, respectively.

Retrieval Sentence +1

LM-Cocktail: Resilient Tuning of Language Models via Model Merging

1 code implementation22 Nov 2023 Shitao Xiao, Zheng Liu, Peitian Zhang, Xingrun Xing

Despite simplicity, LM-Cocktail is surprisingly effective: the resulted model is able to achieve a strong empirical performance in the whole scope of general tasks while preserving a superior capacity in its targeted domain.

Language Modeling Language Modelling +1

A Diffusion Model Based Quality Enhancement Method for HEVC Compressed Video

no code implementations15 Nov 2023 Zheng Liu, Honggang Qi

Most of the existing methods need to train corresponding models for compressed videos with different quantization parameters to improve the quality of compressed videos.

Decoder Quantization

Retrieve Anything To Augment Large Language Models

1 code implementation11 Oct 2023 Peitian Zhang, Shitao Xiao, Zheng Liu, Zhicheng Dou, Jian-Yun Nie

On the other hand, the task-specific retrievers lack the required versatility, hindering their performance across the diverse retrieval augmentation scenarios.

Knowledge Distillation Retrieval

C-Pack: Packed Resources For General Chinese Embeddings

2 code implementations14 Sep 2023 Shitao Xiao, Zheng Liu, Peitian Zhang, Niklas Muennighoff, Defu Lian, Jian-Yun Nie

Along with our resources on general Chinese embedding, we release our data and models for English text embeddings.

A Consumer-tier based Visual-Brain Machine Interface for Augmented Reality Glasses Interactions

no code implementations29 Aug 2023 Yuying Jiang, Fan Bai, ZiCheng Zhang, Xiaochen Ye, Zheng Liu, Zhiping Shi, Jianwei Yao, Xiaojun Liu, Fangkun Zhu, Junling Li Qian Guo, Xiaoan Wang, Junwen Luo

Here we develop a consumer-tier Visual-Brain Machine Inteface(V-BMI) system specialized for Augmented Reality(AR) glasses interactions.

A Counterfactual Fair Model for Longitudinal Electronic Health Records via Deconfounder

no code implementations22 Aug 2023 Zheng Liu, Xiaohan Li, Philip Yu

The fairness issue of clinical data modeling, especially on Electronic Health Records (EHRs), is of utmost importance due to EHR's complex latent structure and potential selection bias.

counterfactual Fairness +1

Large Language Models for Information Retrieval: A Survey

1 code implementation14 Aug 2023 Yutao Zhu, Huaying Yuan, Shuting Wang, Jiongnan Liu, Wenhan Liu, Chenlong Deng, Haonan Chen, Zheng Liu, Zhicheng Dou, Ji-Rong Wen

This evolution requires a combination of both traditional methods (such as term-based sparse retrieval methods with rapid response) and modern neural architectures (such as language models with powerful language understanding capacity).

Information Retrieval Question Answering +3

Multimodal Object Detection by Channel Switching and Spatial Attention

no code implementations Conference on Computer Vision and Pattern Recognition (CVPR) 2023 Yue Cao, Junchi Bin, Jozsef Hamari, Erik Blasch, Zheng Liu

Multimodal object detection has attracted great attention in recent years since the information specific to different modalities can complement each other and effectively improve the accuracy and stability of the detection model.

Multispectral Object Detection object-detection +2

Generative Retrieval via Term Set Generation

1 code implementation23 May 2023 Peitian Zhang, Zheng Liu, Yujia Zhou, Zhicheng Dou, Fangchao Liu, Zhao Cao

On top of the term-set DocID, we propose a permutation-invariant decoding algorithm, with which the term set can be generated in any permutation yet will always lead to the corresponding document.

Information Retrieval Natural Questions +1

RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models

1 code implementation4 May 2023 Shitao Xiao, Zheng Liu, Yingxia Shao, Zhao Cao

It is designed to improve the quality of semantic representation where all contextualized embeddings of the pre-trained model can be leveraged.

Information Retrieval Open-Domain Question Answering +2

An Efficient Plane Extraction Approach for Bundle Adjustment on LiDAR Point clouds

no code implementations29 Apr 2023 Zheng Liu, Fu Zhang

However, the accuracy and speed of LiDAR bundle adjustment depend on the quality of plane extraction, which provides point association for LiDAR BA.

Constructing Tree-based Index for Efficient and Effective Dense Retrieval

1 code implementation24 Apr 2023 Haitao Li, Qingyao Ai, Jingtao Zhan, Jiaxin Mao, Yiqun Liu, Zheng Liu, Zhao Cao

Unfortunately, while ANN can improve the efficiency of DR models, it usually comes with a significant price on retrieval performance.

Contrastive Learning Retrieval

Transformer-based models and hardware acceleration analysis in autonomous driving: A survey

no code implementations21 Apr 2023 Juan Zhong, Zheng Liu, Xi Chen

The paper also highlights the challenges, trends, and current insights in Transformer-based models, addressing their hardware deployment and acceleration issues within the context of long-term autonomous driving applications.

Autonomous Driving Decision Making +3

WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus

1 code implementation10 Apr 2023 Hongjing Qian, Yutao Zhu, Zhicheng Dou, Haoqi Gu, Xinyu Zhang, Zheng Liu, Ruofei Lai, Zhao Cao, Jian-Yun Nie, Ji-Rong Wen

In this paper, we introduce a new NLP task -- generating short factual articles with references for queries by mining supporting evidence from the Web.

Retrieval Text Generation

Structural Imbalance Aware Graph Augmentation Learning

no code implementations24 Mar 2023 Zulong Liu, Kejia-Chen, Zheng Liu

Firstly, a Pagerank-based sampling strategy is designed to identify hub nodes and tail nodes in the graph.

Graph Classification Link Prediction +1

Document Image Shadow Removal Guided by Color-Aware Background

1 code implementation CVPR 2023 Ling Zhang, Yinghao He, Qing Zhang, Zheng Liu, Xiaolong Zhang, Chunxia Xiao

In this paper, we present a color-aware background extraction network (CBENet) for extracting a spatially varying background image that accurately depicts the background colors of the document.

Decoder Document Shadow Removal +1

CDSM: Cascaded Deep Semantic Matching on Textual Graphs Leveraging Ad-hoc Neighbor Selection

1 code implementation30 Nov 2022 Jing Yao, Zheng Liu, Junhan Yang, Zhicheng Dou, Xing Xie, Ji-Rong Wen

In the first stage, a lightweight CNN-based ad-hod neighbor selector is deployed to filter useful neighbors for the matching task with a small computation cost.

Mitigating Frequency Bias in Next-Basket Recommendation via Deconfounders

no code implementations16 Nov 2022 Xiaohan Li, Zheng Liu, Luyi Ma, Kaushiki Nag, Stephen Guo, Philip Yu, Kannan Achan

Considering the influence of historical purchases on users' future interests, the user and item representations can be viewed as unobserved confounders in the causal diagram.

Causal Inference Fairness +2

RetroMAE v2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models

1 code implementation16 Nov 2022 Shitao Xiao, Zheng Liu

DupMAE, which targets on improving the semantic representation capacity for the contextualized embeddings of both [CLS] and ordinary tokens.

 Ranked #1 on Information Retrieval on MS MARCO (MRR@10 metric)

Dimensionality Reduction Information Retrieval +4

Mitigating Health Disparities in EHR via Deconfounder

no code implementations28 Oct 2022 Zheng Liu, Xiaohan Li, Philip Yu

First, these methods usually mean a trade-off between the model's performance and fairness.

Attribute Decision Making +1

M3FGM:a node masking and multi-granularity message passing-based federated graph model for spatial-temporal data prediction

no code implementations27 Oct 2022 Yuxing Tian, Zheng Liu, Yanwen Qu, Song Li, Jiachi Luo

This paper proposes a new GNN-oriented split federated learning method, named node {\bfseries M}asking and {\bfseries M}ulti-granularity {\bfseries M}essage passing-based Federated Graph Model (M$^3$FGM) for the above issues.

Federated Learning

Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval

1 code implementation11 Oct 2022 Peitian Zhang, Zheng Liu, Shitao Xiao, Zhicheng Dou, Jing Yao

Based on comprehensive experiments on popular retrieval benchmarks, we verify that clusters and terms indeed complement each other, enabling HI$^2$ to achieve lossless retrieval quality with competitive efficiency across various index settings.

Knowledge Distillation Quantization +1

PCDNF: Revisiting Learning-based Point Cloud Denoising via Joint Normal Filtering

no code implementations2 Sep 2022 Zheng Liu, Yaowu Zhao, Sijing Zhan, Yuanyuan Liu, Renjie Chen, Ying He

Motivated by the essential interplay between point cloud denoising and normal filtering, we revisit point cloud denoising from a multitask perspective, and propose an end-to-end network, named PCDNF, to denoise point clouds via joint normal filtering.

Denoising

A Neural Corpus Indexer for Document Retrieval

1 code implementation6 Jun 2022 Yujing Wang, Yingyan Hou, Haonan Wang, Ziming Miao, Shibin Wu, Hao Sun, Qi Chen, Yuqing Xia, Chengmin Chi, Guoshuai Zhao, Zheng Liu, Xing Xie, Hao Allen Sun, Weiwei Deng, Qi Zhang, Mao Yang

To this end, we propose Neural Corpus Indexer (NCI), a sequence-to-sequence network that generates relevant document identifiers directly for a designated query.

Decoder Retrieval +1

Slim-neck by GSConv: A lightweight-design for real-time detector architectures

1 code implementation6 Jun 2022 Hulin Li, Jun Li, Hanbing Wei, Zheng Liu, Zhenfei Zhan, Qiliang Ren

Furthermore, we provide a design suggestion based on the GSConv, Slim-Neck (SNs), to achieve a higher computational cost-effectiveness of the real-time detectors.

Autonomous Vehicles Edge-computing +2

RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder

1 code implementation24 May 2022 Shitao Xiao, Zheng Liu, Yingxia Shao, Zhao Cao

The sentence embedding is generated from the encoder's masked input; then, the original sentence is recovered based on the sentence embedding and the decoder's masked input via masked language modeling.

Decoder Information Retrieval +8

A Novel Underwater Image Enhancement and Improved Underwater Biological Detection Pipeline

no code implementations20 May 2022 Zheng Liu, Yaoming Zhuang, Pengrun Jia, Chengdong Wu, Hongli Xu ang Zhanlin Liu

For aquaculture resource evaluation and ecological environment monitoring, automatic detection and identification of marine organisms is critical.

Image Enhancement Object +2

Towards Generalizable Semantic Product Search by Text Similarity Pre-training on Search Click Logs

no code implementations11 Apr 2022 Zheng Liu, Wei zhang, Yan Chen, Weiyi Sun, Tianchuan Du, Benjamin Schroeder

Recently, semantic search has been successfully applied to e-commerce product search and the learned semantic space(s) for query and product encoding are expected to generalize to unseen queries or products.

text similarity

A Mutually Reinforced Framework for Pretrained Sentence Embeddings

no code implementations28 Feb 2022 Junhan Yang, Zheng Liu, Shitao Xiao, Jianxun Lian, Lijun Wu, Defu Lian, Guangzhong Sun, Xing Xie

Instead of relying on annotation heuristics defined by humans, it leverages the sentence representation model itself and realizes the following iterative self-supervision process: on one hand, the improvement of sentence representation may contribute to the quality of data annotation; on the other hand, more effective data annotation helps to generate high-quality positive samples, which will further improve the current sentence representation model.

Contrastive Learning Sentence +1

Reinforcement Routing on Proximity Graph for Efficient Recommendation

no code implementations23 Jan 2022 Chao Feng, Defu Lian, Xiting Wang, Zheng Liu, Xing Xie, Enhong Chen

Instead of searching the nearest neighbor for the query, we search the item with maximum inner product with query on the proximity graph.

Imitation Learning Recommendation Systems

Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval

2 code implementations14 Jan 2022 Shitao Xiao, Zheng Liu, Weihao Han, Jianjin Zhang, Yingxia Shao, Defu Lian, Chaozhuo Li, Hao Sun, Denvy Deng, Liangjie Zhang, Qi Zhang, Xing Xie

In this work, we tackle this problem with Bi-Granular Document Representation, where the lightweight sparse embeddings are indexed and standby in memory for coarse-grained candidate search, and the heavyweight dense embeddings are hosted in disk for fine-grained post verification.

Quantization Retrieval

Pre-training Recommender Systems via Reinforced Attentive Multi-relational Graph Neural Network

no code implementations28 Nov 2021 Xiaohan Li, Zhiwei Liu, Stephen Guo, Zheng Liu, Hao Peng, Philip S. Yu, Kannan Achan

In this paper, we propose a novel Reinforced Attentive Multi-relational Graph Neural Network (RAM-GNN) to the pre-train user and item embeddings on the user and item graph prior to the recommendation step.

Graph Neural Network

Certifiable Artificial Intelligence Through Data Fusion

no code implementations3 Nov 2021 Erik Blasch, Junchi Bin, Zheng Liu

This paper reviews and proposes concerns in adopting, fielding, and maintaining artificial intelligence (AI) systems.

Object Recognition

GraphFormers: GNN-nested Transformers for Representation Learning on Textual Graph

1 code implementation NeurIPS 2021 Junhan Yang, Zheng Liu, Shitao Xiao, Chaozhuo Li, Defu Lian, Sanjay Agrawal, Amit Singh, Guangzhong Sun, Xing Xie

The representation learning on textual graph is to generate low-dimensional embeddings for the nodes based on the individual textual features and the neighbourhood information.

Language Modelling Recommendation Systems +1

Locality Constrained Analysis Dictionary Learning via K-SVD Algorithm

no code implementations29 Apr 2021 Kun Jiang, Zhaoli Liu, Zheng Liu, Qindong Sun

With the learned analysis dictionary, test samples can be transformed into a sparse subspace for classification efficiently.

Classification Dictionary Learning +2

AdsGNN: Behavior-Graph Augmented Relevance Modeling in Sponsored Search

1 code implementation25 Apr 2021 Chaozhuo Li, Bochen Pang, Yuming Liu, Hao Sun, Zheng Liu, Xing Xie, Tianqi Yang, Yanling Cui, Liangjie Zhang, Qi Zhang

Our motivation lies in incorporating the tremendous amount of unsupervised user behavior data from the historical search logs as the complementary graph to facilitate relevance modeling.

Marketing

Hybrid Encoder: Towards Efficient and Precise Native AdsRecommendation via Hybrid Transformer Encoding Networks

no code implementations22 Apr 2021 Junhan Yang, Zheng Liu, Bowen Jin, Jianxun Lian, Defu Lian, Akshay Soni, Eun Yong Kang, Yajun Wang, Guangzhong Sun, Xing Xie

For the sake of efficient recommendation, conventional methods would generate user and advertisement embeddings independently with a siamese transformer encoder, such that approximate nearest neighbour search (ANN) can be leveraged.

Retrieval

Matching-oriented Product Quantization For Ad-hoc Retrieval

2 code implementations16 Apr 2021 Shitao Xiao, Zheng Liu, Yingxia Shao, Defu Lian, Xing Xie

In this work, we propose the Matching-oriented Product Quantization (MoPQ), where a novel objective Multinoulli Contrastive Loss (MCL) is formulated.

Quantization Retrieval

Multi-Interest-Aware User Modeling for Large-Scale Sequential Recommendations

1 code implementation18 Feb 2021 Jianxun Lian, Iyad Batal, Zheng Liu, Akshay Soni, Eun Yong Kang, Yajun Wang, Xing Xie

User states in different channels are updated by an \emph{erase-and-add} paradigm with interest- and instance-level attention.

Recommendation Systems

Training Large-Scale News Recommenders with Pretrained Language Models in the Loop

1 code implementation18 Feb 2021 Shitao Xiao, Zheng Liu, Yingxia Shao, Tao Di, Xing Xie

Secondly, it improves the data efficiency of the training workflow, where non-informative data can be eliminated from encoding.

News Recommendation Recommendation Systems

Robust W-GAN-Based Estimation Under Wasserstein Contamination

no code implementations20 Jan 2021 Zheng Liu, Po-Ling Loh

Robust estimation is an important problem in statistics which aims at providing a reasonable estimator when the data-generating distribution lies within an appropriately defined ball around an uncontaminated distribution.

regression

Dynamic Graph Collaborative Filtering

1 code implementation8 Jan 2021 Xiaohan Li, Mengqi Zhang, Shu Wu, Zheng Liu, Liang Wang, Philip S. Yu

Here we propose Dynamic Graph Collaborative Filtering (DGCF), a novel framework leveraging dynamic graphs to capture collaborative and sequential relations of both items and users at the same time.

Collaborative Filtering Recommendation Systems

Van der Waals Heterostructure Pt$_{2}$HgSe$_{3}$/CrI$_3$ for Topological Valleytronics

no code implementations24 Dec 2020 Zheng Liu, Yulei Han, Yafei Ren, Qian Niu, Zhenhua Qiao

We identify a valley-polarized Chern insulator in the van der Waals heterostructure, Pt$_{2}$HgSe$_{3}$/CrI$_3$, for potential applications with interplay between electric, magnetic, optical, and mechanical effects.

Band Gap Materials Science

Shuffle and Learn: Minimizing Mutual Information for Unsupervised Hashing

1 code implementation20 Nov 2020 Fangrui Liu, Zheng Liu

Proof on $\epsilon$-Convergence of joint probability with approximated derivatives is provided to guarantee the preciseness on update applied on the mutual information.

Image Retrieval Person Re-Identification +1

Sampling-Decomposable Generative Adversarial Recommender

no code implementations NeurIPS 2020 Binbin Jin, Defu Lian, Zheng Liu, Qi Liu, Jianhui Ma, Xing Xie, Enhong Chen

The GAN-style recommenders (i. e., IRGAN) addresses the challenge by learning a generator and a discriminator adversarially, such that the generator produces increasingly difficult samples for the discriminator to accelerate optimizing the discrimination objective.

BALM: Bundle Adjustment for Lidar Mapping

1 code implementation16 Oct 2020 Zheng Liu, Fu Zhang

We propose a framework for bundle adjustment (BA) on sparse lidar points and incorporate it to a lidar odometry and mapping (LOAM) to lower the drift.

Robotics

All at Once: Temporally Adaptive Multi-Frame Interpolation with Advanced Motion Modeling

no code implementations ECCV 2020 Zhixiang Chi, Rasoul Mohammadi Nasiri, Zheng Liu, Juwei Lu, Jin Tang, Konstantinos N. Plataniotis

Recent advances in high refresh rate displays as well as the increased interest in high rate of slow motion and frame up-conversion fuel the demand for efficient and cost-effective multi-frame video interpolation solutions.

All

Fine-grained Interest Matching for Neural News Recommendation

no code implementations ACL 2020 Heyuan Wang, Fangzhao Wu, Zheng Liu, Xing Xie

Existing studies generally represent each user as a single vector and then match the candidate news vector, which may lose fine-grained information for recommendation.

News Recommendation

Lightrec: A memory and search-efficient recommender system

1 code implementation International World Wide Web Conference 2020 Defu Lian, Haoyu Wang, Zheng Liu, Jianxun Lian, Enhong Chen, Xing Xie

On top of such a structure, LightRec will have an item represented as additive composition of B codewords, which are optimally selected from each of the codebooks.

Recommendation Systems

Machine learning driven synthesis of few-layered WTe2

no code implementations10 Oct 2019 Manzhang Xu, Bijun Tang, Yuhao Lu, Chao Zhu, Lu Zheng, Jingyu Zhang, Nannan Han, Yuxi Guo, Jun Di, Pin Song, Yongmin He, Lixing Kang, Zhiyong Zhang, Wu Zhao, Cuntai Guan, Xuewen Wang, Zheng Liu

Reducing the lateral scale of two-dimensional (2D) materials to one-dimensional (1D) has attracted substantial research interest not only to achieve competitive electronic device applications but also for the exploration of fundamental physical properties.

BIG-bench Machine Learning

OD-GCN: Object Detection Boosted by Knowledge GCN

no code implementations6 Aug 2019 Zheng Liu, Zidong Jiang, Wei Feng, Hui Feng

Classical CNN based object detection methods only extract the objects' image features, but do not consider the high-level relationship among objects in context.

Object object-detection +1

Robustifying deep networks for image segmentation

no code implementations1 Aug 2019 Zheng Liu, Jinnian Zhang, Varun Jog, Po-Ling Loh, Alan B McMillan

Materials and Methods: In this retrospective study, the accuracy of brain tumor segmentation was studied in subjects with low- and high-grade gliomas.

Brain Tumor Segmentation Data Augmentation +3

Neural News Recommendation with Long- and Short-term User Representations

1 code implementation ACL 2019 Mingxiao An, Fangzhao Wu, Chuhan Wu, Kun Zhang, Zheng Liu, Xing Xie

In this paper, we propose a neural news recommendation approach which can learn both long- and short-term user representations.

News Recommendation

A Symmetric Encoder-Decoder with Residual Block for Infrared and Visible Image Fusion

no code implementations27 May 2019 Lihua Jian, Xiaomin Yang, Zheng Liu, Gwanggil Jeon, Mingliang Gao, David Chisholm

For the fusion stage, first, the trained model is utilized to extract the intermediate features and compensation features of two source images.

Decoder Infrared And Visible Image Fusion

Machine learning-guided synthesis of advanced inorganic materials

1 code implementation10 May 2019 Bijun Tang, Yuhao Lu, Jiadong Zhou, Han Wang, Prafful Golani, Manzhang Xu, Quan Xu, Cuntai Guan, Zheng Liu

Synthesis of advanced inorganic materials with minimum number of trials is of paramount importance towards the acceleration of inorganic materials development.

BIG-bench Machine Learning

Feedback Network for Image Super-Resolution

4 code implementations CVPR 2019 Zhen Li, Jinglei Yang, Zheng Liu, Xiaomin Yang, Gwanggil Jeon, Wei Wu

In this paper, we propose an image super-resolution feedback network (SRFBN) to refine low-level representations with high-level information.

Image Super-Resolution

Towards end-to-end pulsed eddy current classification and regression with CNN

no code implementations22 Feb 2019 Xin Fu, Chengkai Zhang, Xiang Peng, Lihua Jian, Zheng Liu

Pulsed eddy current (PEC) is an effective electromagnetic non-destructive inspection (NDI) technique for metal materials, which has already been widely adopted in detecting cracking and corrosion in some multi-layer structures.

General Classification regression

IR2VI: Enhanced Night Environmental Perception by Unsupervised Thermal Image Translation

no code implementations25 Jun 2018 Shuo Liu, Vijay John, Erik Blasch, Zheng Liu, Ying Huang

Context enhancement is critical for night vision (NV) applications, especially for the dark night situation without any artificial lights.

Translation

Efficient Traffic-Sign Recognition with Scale-aware CNN

no code implementations31 May 2018 Yuchen Yang, Shuo Liu, Wei Ma, Qiuyuan Wang, Zheng Liu

The paper presents a Traffic Sign Recognition (TSR) system, which can fast and accurately recognize traffic signs of different sizes in images.

General Classification Traffic Sign Recognition

Adversarial Binary Coding for Efficient Person Re-identification

no code implementations29 Mar 2018 Zheng Liu, Jie Qin, Annan Li, Yunhong Wang, Luc van Gool

Specifically, instead of learning explicit projections or adding fully-connected mapping layers, the proposed Adversarial Binary Coding (ABC) framework guides the extraction of binary codes implicitly and effectively.

Person Re-Identification Triplet

Multi-Channel CNN-based Object Detection for Enhanced Situation Awareness

no code implementations30 Nov 2017 Shuo Liu, Zheng Liu

In this study, we propose a novel detection algorithm for military objects by fusing multi-channel CNNs.

Computational Efficiency Object +3

Feature Map Pooling for Cross-View Gait Recognition Based on Silhouette Sequence Images

no code implementations26 Nov 2017 Qiang Chen, Yunhong Wang, Zheng Liu, Qingjie Liu, Di Huang

In this paper, we develop a novel convolutional neural network based approach to extract and aggregate useful information from gait silhouette sequence images instead of simply representing the gait process by averaging silhouette images.

Gait Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.