Search Results for author: Qian Liu

Found 156 papers, 70 papers with code

"What Do You Mean by That?" A Parser-Independent Interactive Approach for Enhancing Text-to-SQL

no code implementations EMNLP 2020 Yuntao Li, Bei Chen, Qian Liu, Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang

In Natural Language Interfaces to Databases systems, the text-to-SQL technique allows users to query databases by using natural language questions.

Text to SQL Text-To-SQL

CGIM: A Cycle Guided Interactive Learning Model for Consistency Identification in Task-oriented Dialogue

1 code implementation COLING 2022 Libo Qin, Qiguang Chen, Tianbao Xie, Qian Liu, Shijue Huang, Wanxiang Che, Zhou Yu

Consistency identification in task-oriented dialog (CI-ToD) usually consists of three subtasks, aiming to identify inconsistency between current system response and current user response, dialog history and the corresponding knowledge base.

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

no code implementations 29 May 2025 Mingzhe Du, Luu Anh Tuan, Yue Liu, Yuhao QING, Dong Huang, Xinyi He, Qian Liu, Zejun Ma, See-Kiong Ng

Large Language Models (LLMs) generate functionally correct solutions but often fall short in code efficiency, a critical bottleneck for real-world deployment.

reinforcement-learning Reinforcement Learning +1

Sparse-to-Dense: A Free Lunch for Lossless Acceleration of Video Understanding in LLMs

no code implementations 25 May 2025 Xuan Zhang, Cunxiao Du, Sicheng Yu, Jiawei Wu, Fengzhuo Zhang, Wei Gao, Qian Liu

It maintains model performance while enabling a seamless transition from a standard Video-LLM to a sparse Video-LLM with minimal code modifications.

Video Understanding

SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development

no code implementations 22 May 2025 Yaxin Du, Yuzhu Cai, Yifan Zhou, Cheng Wang, Yu Qian, Xianghe Pang, Qian Liu, Yue Hu, Siheng Chen

We therefore introduce SWE-Dev, the first large-scale dataset (with 14,000 training and 500 test samples) designed to evaluate and train autonomous coding systems on real-world feature development tasks.

Bug fixing Chatbot +2

General-Reasoner: Advancing LLM Reasoning Across All Domains

1 code implementation 20 May 2025 Xueguang Ma, Qian Liu, Dongfu Jiang, Ge Zhang, Zejun Ma, Wenhu Chen

Reinforcement learning (RL) has recently demonstrated strong potential in enhancing the reasoning capabilities of large language models (LLMs).

All Math +3

Impact of Insufficient CP on Sensing Performance in OFDM-ISAC Systems

no code implementations 2 May 2025 Peishi Li, Rang Liu, Qian Liu, Ming Li

These expressions quantify the effects of CP length, symbol constellation, and inter-target interference (ITI) on the mainlobe and sidelobe levels.

Integrated sensing and communication ISAC

Sensing-Oriented Adaptive Resource Allocation Designs for OFDM-ISAC Systems

no code implementations 9 Apr 2025 Peishi Li, Ming Li, Rang Liu, Qian Liu, A. Lee Swindlehurst

In this paper, we propose adaptive resource allocation strategies for OFDM-ISAC systems to achieve optimal trade-offs between diverse sensing requirements and communication quality-of-service (QoS).

Integrated sensing and communication ISAC

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

1 code implementation 24 Mar 2025 Weihao Zeng, Yuzhen Huang, Qian Liu, Wei Liu, Keqing He, Zejun Ma, Junxian He

DeepSeek-R1 has shown that long chain-of-thought (CoT) reasoning can naturally emerge through a simple reinforcement learning (RL) framework with rule-based rewards, where the training may directly start from the base models, a paradigm referred to as zero RL training.

Instruction Following Math +1

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

1 code implementation 19 Mar 2025 Tongyao Zhu, Qian Liu, Haonan Wang, Shiqi Chen, Xiangming Gu, Tianyu Pang, Min-Yen Kan

Recent advancements in LLM pretraining have featured ever-expanding context windows to process longer sequences.

8k Scheduling

Water Quality Data Imputation via A Fast Latent Factorization of Tensors with PID-based Optimizer

no code implementations 10 Mar 2025 Qian Liu, Lan Wang, Bing Yang, Hao Wu

Water quality data can provide substantial decision support for water resources utilization and pollution prevention.

Imputation Missing Values

Distributed Distortion-Aware Beamforming Designs for Cell-Free mMIMO Systems

no code implementations 5 Mar 2025 Mengzhen Liu, Ming Li, Rang Liu, Qian Liu

Cell-free massive multi-input multi-output (CF-mMIMO) systems have emerged as a promising paradigm for next-generation wireless communications, offering enhanced spectral efficiency and coverage through distributed antenna arrays.

Tri-timescale Beamforming Design for Tri-hybrid Architectures with Reconfigurable Antennas

no code implementations 5 Mar 2025 Mengzhen Liu, Ming Li, Rang Liu, Qian Liu

Reconfigurable antennas possess the capability to dynamically adjust their fundamental operating characteristics, thereby enhancing system adaptability and performance.

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

1 code implementation 2 Mar 2025 Kashun Shum, Yuzhen Huang, Hongjian Zou, Ding Qi, Yixuan Liao, Xiaoxin Chen, Qian Liu, Junxian He

Through comprehensive experiments with 1B and 3B parameter models, we demonstrate that models trained on 30B tokens selected with PreSelect surpass the performance of a vanilla baseline trained on 300B tokens, achieving a 10x reduction in compute requirements.

Tutorial Proposal: Speculative Decoding for Efficient LLM Inference

no code implementations 1 Mar 2025 Heming Xia, Cunxiao Du, Yongqi Li, Qian Liu, Wenjie Li

This tutorial presents a comprehensive introduction to Speculative Decoding (SD), an advanced technique for LLM inference acceleration that has garnered significant research interest in recent years.

Characteristics Analysis of Autonomous Vehicle Pre-crash Scenarios

no code implementations 28 Feb 2025 Yixuan Li, Xuesong Wang, Tianyi Wang, Qian Liu

In this paper, we analyzed the latest California AV collision reports and used the newly revised pre-crash scenario typology to identify pre-crash scenarios.

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

no code implementations 20 Feb 2025 M-A-P Team, Xinrun Du, Yifan Yao, Kaijing Ma, Bingli Wang, Tianyu Zheng, King Zhu, Minghao Liu, Yiming Liang, Xiaolong Jin, Zhenlin Wei, Chujie Zheng, Kaixin Deng, Shawn Gavin, Shian Jia, Sichao Jiang, Yiyan Liao, Rui Li, Qinrui Li, Sirun Li, Yizhi Li, Yunwen Li, David Ma, Yuansheng Ni, Haoran Que, Qiyao Wang, Zhoufutu Wen, Siwei Wu, Tyshawn Hsing, Ming Xu, Zhenzhu Yang, Zekun Moore Wang, Junting Zhou, Yuelin Bai, Xingyuan Bu, Chenglin Cai, Liang Chen, Yifan Chen, Chengtuo Cheng, Tianhao Cheng, Keyi Ding, Siming Huang, Yun Huang, Yaoru Li, Yizhe Li, Zhaoqun Li, Tianhao Liang, Chengdong Lin, Hongquan Lin, Yinghao Ma, Tianyang Pang, Zhongyuan Peng, Zifan Peng, Qige Qi, Shi Qiu, Xingwei Qu, Shanghaoran Quan, Yizhou Tan, Zili Wang, Chenqing Wang, Hao Wang, Yiya Wang, YuBo Wang, Jiajun Xu, Kexin Yang, Ruibin Yuan, Yuanhao Yue, Tianyang Zhan, Chun Zhang, Jinyang Zhang, Xiyue Zhang, Xingjian Zhang, Yue Zhang, Yongchi Zhao, Xiangyu Zheng, Chenghua Zhong, Yang Gao, Zhoujun Li, Dayiheng Liu, Qian Liu, Tianyu Liu, Shiwen Ni, Junran Peng, Yujia Qin, Wenbo Su, Guoyin Wang, Shi Wang, Jian Yang, Min Yang, Meng Cao, Xiang Yue, Zhaoxiang Zhang, Wangchunshu Zhou, Jiaheng Liu, Qunshu Lin, Wenhao Huang, Ge Zhang

To address this gap, we present SuperGPQA, a comprehensive benchmark that evaluates graduate-level knowledge and reasoning capabilities across 285 disciplines.

Collaborative Filtering

ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease Diagnosis

1 code implementation 16 Feb 2025 Xu Wang, Jiaju Kang, Puyu Han, Yubao Zhao, Qian Liu, Liwenfei He, Lingqiong Zhang, Lingyun Dai, Yongcheng Wang, Jie Tao

We present ECG-Expert-QA, a comprehensive multimodal dataset for evaluating diagnostic capabilities in electrocardiogram (ECG) interpretation.

Diagnostic Rhythm

Improving Your Model Ranking on Chatbot Arena by Vote Rigging

1 code implementation 29 Jan 2025 Rui Min, Tianyu Pang, Chao Du, Qian Liu, Minhao Cheng, Min Lin

We first introduce a straightforward target-only rigging strategy that focuses on new battles involving $m_{t}$, identifying it via watermarking or a binary classifier, and exclusively voting for $m_{t}$ wins.

Chatbot

Target Detection in OFDM-ISAC Systems: A Multipath Exploitation Approach

no code implementations 14 Jan 2025 Xiaohan Lv, Rang Liu, Ming Li, Qian Liu

This paper investigates the potential of multipath exploitation for enhancing target detection in orthogonal frequency division multiplexing (OFDM)-based integrated sensing and communication (ISAC) systems.

Diversity Integrated sensing and communication +1

Target Detection in ISAC Systems with Active RISs: A Multi-Perspective Observation Approach

no code implementations 11 Jan 2025 Shoushuo Zhang, Rang Liu, Ming Li, Wei Wang, Qian Liu

Integrated sensing and communication (ISAC) has emerged as a transformative technology for 6G networks, enabling the seamless integration of communication and sensing functionalities.

Integrated sensing and communication ISAC

Two Birds with One Stone: Improving Rumor Detection by Addressing the Unfairness Issue

no code implementations 30 Dec 2024 Junyi Chen, Mengjia Wu, Qian Liu, Ying Ding, Yi Zhang

The degraded performance and group unfairness caused by confounding sensitive attributes in rumor detection remain relatively unexplored.

Attribute Fairness

Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework

no code implementations 22 Dec 2024 Jundong Xu, Hao Fei, Meng Luo, Qian Liu, Liangming Pan, William Yang Wang, Preslav Nakov, Mong-Li Lee, Wynne Hsu

In the context of large language models (LLMs), current advanced reasoning methods have made impressive strides in various reasoning tasks.

Logical Reasoning

RAG for Effective Supply Chain Security Questionnaire Automation

no code implementations 18 Dec 2024 Zaynab Batool Reza, Abdul Rafay Syed, Omer Iqbal, Ethel Mensah, Qian Liu, Maxx Richard Rahman, Wolfgang Maass

In an era where digital security is crucial, efficient processing of security-related inquiries through supply chain security questionnaires is imperative.

Management RAG +2

SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages

1 code implementation 2 Dec 2024 Jia Guo, Longxu Dou, Guangtao Zeng, Stanley Kok, Wei Lu, Qian Liu

In this paper, we introduce SailCompass, a reproducible and robust evaluation benchmark for assessing Large Language Models (LLMs) on Southeast Asian Languages (SEA).

Multiple-choice

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

1 code implementation 20 Nov 2024 Haonan Wang, Qian Liu, Chao Du, Tongyao Zhu, Cunxiao Du, Kenji Kawaguchi, Tianyu Pang

To address this, we develop AnchorAttention, a plug-and-play attention method that alleviates numerical issues caused by BFloat16, improves long-context capabilities, and speeds up training.

Computational Efficiency Position

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

no code implementations 12 Nov 2024 Fangyu Lei, Jixuan Chen, Yuxiao Ye, Ruisheng Cao, Dongchan Shin, Hongjin Su, Zhaoqing Suo, Hongcheng Gao, Wenjing Hu, Pengcheng Yin, Victor Zhong, Caiming Xiong, Ruoxi Sun, Qian Liu, Sida Wang, Tao Yu

Real-world enterprise text-to-SQL workflows often involve complex cloud or local data across various database systems, multiple SQL queries in various dialects, and diverse operations from data transformation to analytics.

Code Generation Text to SQL +1

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

no code implementations 7 Nov 2024 Siming Huang, Tianhao Cheng, J. K. Liu, Jiaran Hao, Liuyihan Song, Yang Xu, J. Yang, Jiaheng Liu, Chenchen Zhang, Linzheng Chai, Ruifeng Yuan, Zhaoxiang Zhang, Jie Fu, Qian Liu, Ge Zhang, Zili Wang, Yuan Qi, Yinghui Xu, Wei Chu

To address the gap, we introduce OpenCoder, a top-tier code LLM that not only achieves performance comparable to leading models but also serves as an "open cookbook" for the research community.

Code Generation

Scaling up Masked Diffusion Models on Text

1 code implementation 24 Oct 2024 Shen Nie, Fengqi Zhu, Chao Du, Tianyu Pang, Qian Liu, Guangtao Zeng, Min Lin, Chongxuan Li

Masked diffusion models (MDMs) have shown promise in language modeling, yet their scalability and effectiveness in core language tasks, such as text generation and language understanding, remain underexplored.

GSM8K Language Modeling +3

DataTales: A Benchmark for Real-World Intelligent Data Narration

1 code implementation 23 Oct 2024 Yajing Yang, Qian Liu, Min-Yen Kan

We introduce DataTales, a novel benchmark designed to assess the proficiency of language models in data narration, a task crucial for transforming complex tabular data into accessible narratives.

Joint Space-Time Adaptive Processing and Beamforming Design for Cell-Free ISAC Systems

no code implementations 18 Oct 2024 Rang Liu, Ming Li, Qian Liu

In this paper, we explore cooperative sensing and communication within cell-free integrated sensing and communication (ISAC) systems.

Integrated sensing and communication ISAC

CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization

no code implementations 16 Oct 2024 Yixi Ding, Jiaying Wu, Tongyao Zhu, Yanxia Qin, Qian Liu, Min-Yen Kan

To broaden the dissemination of scientific knowledge to diverse audiences, scientific document summarization must simultaneously control multiple attributes such as length and empirical focus.

Document Summarization Scientific Document Summarization

DOA Estimation-Oriented Joint Array Partitioning and Beamforming Designs for ISAC Systems

no code implementations 16 Oct 2024 Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst

Integrated sensing and communication has been identified as an enabling technology for forthcoming wireless networks.

Integrated sensing and communication ISAC

When Attention Sink Emerges in Language Models: An Empirical View

1 code implementation 14 Oct 2024 Xiangming Gu, Tianyu Pang, Chao Du, Qian Liu, Fengzhuo Zhang, Cunxiao Du, Ye Wang, Min Lin

In this work, we first demonstrate that attention sinks exist universally in LMs with various inputs, even in small models.

Quantization

LucidGrasp: Robotic Framework for Autonomous Manipulation of Laboratory Equipment with Different Degrees of Transparency via 6D Pose Estimation

no code implementations 10 Oct 2024 Maria Makarova, Daria Trinitatova, Qian Liu, Dzmitry Tsetserukou

The proposed robotic framework can be applied for laboratory automation, since it allows solving the problem of performing non-trivial manipulation tasks with the analysis of object poses of varying degrees of transparency and liquid levels, requiring high accuracy and repeatability.

6D Pose Estimation

Compressing high-resolution data through latent representation encoding for downscaling large-scale AI weather forecast model

no code implementations 10 Oct 2024 Qian Liu, Bing Gong, Xiaoran Zhuang, Xiaohui Zhong, Zhiming Kang, Hao Li

The rapid advancement of artificial intelligence (AI) in weather research has been driven by the ability to learn from large, high-dimensional datasets.

Image Compression

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

1 code implementation 9 Oct 2024 Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang, Min Lin

Achieving high win rates on these benchmarks can significantly boost the promotional impact of newly released language models.

Self-supervised Spatio-Temporal Graph Mask-Passing Attention Network for Perceptual Importance Prediction of Multi-point Tactility

no code implementations 4 Oct 2024 Dazhong He, Qian Liu

While visual and auditory information are prevalent in modern multimedia systems, haptic interaction, e.g., tactile and kinesthetic interaction, provides a unique form of human perception.

Graph Neural Network Self-Supervised Learning

Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation

1 code implementation 25 Sep 2024 Yulin Wang, Honglin Xiong, Kaicong Sun, Shuwei Bai, Ling Dai, Zhongxiang Ding, Jiameng Liu, Qian Wang, Qian Liu, Dinggang Shen

Here, we present TUMSyn, a Text-guided Universal MR image Synthesis generalist model, which can flexibly generate brain MR images with demanded imaging metadata from routinely acquired scans guided by text prompts.

Contrastive Learning Image Generation

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

1 code implementation 25 Sep 2024 Fan Zhou, Zengzhi Wang, Qian Liu, Junlong Li, PengFei Liu

Large language model pre-training has traditionally relied on human experts to craft heuristics for improving the corpora quality, resulting in numerous rules developed to date.

Large Language Model

Active Reconfigurable Intelligent Surface Empowered Synthetic Aperture Radar Imaging

no code implementations 18 Sep 2024 Yifan Sun, Rang Liu, Zhiping Lu, Honghao Luo, Ming Li, Qian Liu

In this paper, we first present a range-Doppler (RD) imaging algorithm to obtain imaging results for the proposed ARIS-empowered SAR system.

Dynamic Hybrid Beamforming Designs for ELAA Near-Field Communications

no code implementations 5 Sep 2024 Mengzhen Liu, Ming Li, Rang Liu, Qian Liu

Extremely large-scale antenna array (ELAA) is a key candidate technology for the sixth generation (6G) mobile networks.

PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis

no code implementations 18 Aug 2024 Meng Luo, Hao Fei, Bobo Li, Shengqiong Wu, Qian Liu, Soujanya Poria, Erik Cambria, Mong-Li Lee, Wynne Hsu

While existing Aspect-based Sentiment Analysis (ABSA) has received extensive effort and advancement, there are still gaps in defining a more holistic research target seamlessly integrating multimodality, conversation context, fine-granularity, and also covering the changing sentiment dynamics as well as cognitive causal rationales.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +3

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

1 code implementation 18 Jul 2024 Chaofan Tao, Qian Liu, Longxu Dou, Niklas Muennighoff, Zhongwei Wan, Ping Luo, Min Lin, Ngai Wong

We investigate how vocabulary size impacts LLM scaling laws by training models ranging from 33M to 3B parameters on up to 500B characters with various vocabulary configurations.

ARC

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

1 code implementation 15 Jul 2024 Ruisheng Cao, Fangyu Lei, Haoyuan Wu, Jixuan Chen, Yeqiao Fu, Hongcheng Gao, Xinzhuang Xiong, Hanchong Zhang, Yuchen Mao, Wenjing Hu, Tianbao Xie, Hongshen Xu, Danyang Zhang, Sida Wang, Ruoxi Sun, Pengcheng Yin, Caiming Xiong, Ansong Ni, Qian Liu, Victor Zhong, Lu Chen, Kai Yu, Tao Yu

These tasks, derived from real-world use cases, evaluate the ability of a multimodal agent to perform data-related tasks by writing code and managing the GUI in enterprise data software systems.

Code Generation

RegMix: Data Mixture as Regression for Language Model Pre-training

1 code implementation 1 Jul 2024 Qian Liu, Xiaosen Zheng, Niklas Muennighoff, Guangtao Zeng, Longxu Dou, Tianyu Pang, Jing Jiang, Min Lin

RegMix trains many small models on diverse data mixtures, uses regression to predict performance of unseen mixtures, and applies the best predicted mixture to train a large-scale model with orders of magnitude more compute.

Common Sense Reasoning Language Modeling +3
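
The three steps named in the RegMix snippet above (train small models on varied mixtures, regress performance on mixture weights, pick the best predicted mixture) can be sketched as follows. This is only an illustration under stated assumptions: the listing does not say which regression model is used, so plain least squares is an assumption here, and the "small model training" step is replaced by a synthetic loss function. All names (`fit_least_squares`, `random_mixture`, `true_coef`) are hypothetical.

```python
import random

def solve_linear_system(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]

def fit_least_squares(mixtures, losses):
    """Fit loss ~ sum_j coef[j] * weight[j] via the normal equations."""
    d = len(mixtures[0])
    xtx = [[sum(x[i] * x[j] for x in mixtures) for j in range(d)] for i in range(d)]
    xty = [sum(x[i] * y for x, y in zip(mixtures, losses)) for i in range(d)]
    return solve_linear_system(xtx, xty)

def predict(coef, mixture):
    """Predicted loss of a mixture under the fitted linear model."""
    return sum(c * w for c, w in zip(coef, mixture))

def random_mixture(rng, d):
    """Sample positive mixture weights over d domains that sum to 1."""
    raw = [rng.random() + 1e-9 for _ in range(d)]
    total = sum(raw)
    return [r / total for r in raw]

rng = random.Random(0)
true_coef = [3.0, 2.0, 2.5]  # synthetic per-domain effect, made up for this demo

# Step 1: stand-in for "train a small model on each mixture and measure its loss".
train_mixtures = [random_mixture(rng, 3) for _ in range(32)]
train_losses = [predict(true_coef, m) + rng.gauss(0, 0.01) for m in train_mixtures]

# Step 2: regress the observed loss on the mixture weights.
coef = fit_least_squares(train_mixtures, train_losses)

# Step 3: score unseen candidate mixtures and keep the lowest predicted loss.
candidates = [random_mixture(rng, 3) for _ in range(1000)]
best = min(candidates, key=lambda m: predict(coef, m))
print("fitted coefficients:", coef)
print("best predicted mixture:", best)
```

No intercept is fitted because the weights sum to 1, so a constant term would be collinear with them. In a real setting the selected mixture would then be used to train the large model, which this sketch does not attempt.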

MIMO-OFDM ISAC Waveform Design for Range-Doppler Sidelobe Suppression

no code implementations 25 Jun 2024 Peishi Li, Ming Li, Rang Liu, Qian Liu, A. Lee Swindlehurst

In addition, the proposed waveform design achieves target detection and estimation performance close to that achievable by waveforms designed only for radar, which demonstrates the superiority of the proposed SLP-based ISAC approach.

Integrated sensing and communication ISAC

Bootstrapping Language Models with DPO Implicit Rewards

1 code implementation 14 Jun 2024 Changyu Chen, Zichen Liu, Chao Du, Tianyu Pang, Qian Liu, Arunesh Sinha, Pradeep Varakantham, Min Lin

In this work, we make a novel observation that this implicit reward model can by itself be used in a bootstrapping fashion to further align the LLM.

Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

1 code implementation 13 Jun 2024 Xuan Zhang, Chao Du, Tianyu Pang, Qian Liu, Wei Gao, Min Lin

The recent development of chain-of-thought (CoT) decoding has enabled large language models (LLMs) to generate explicit logical reasoning paths for complex problem-solving.

Arithmetic Reasoning Fact Verification +2

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses

1 code implementation 3 Jun 2024 Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang, Min Lin

In addition, we conduct comprehensive and elaborate (e.g., making sure to use correct system prompts) evaluations against other aligned LLMs and advanced defenses, where our method consistently achieves nearly 100% ASRs.

Multipath Exploitation for Fluctuating Target Detection in RIS-Assisted ISAC Systems

no code implementations 2 Jun 2024 Shoushuo Zhang, Zichao Xiao, Rang Liu, Ming Li, Wei Wang, Qian Liu

Integrated sensing and communication (ISAC) systems are typically deployed in multipath environments, which are usually regarded as challenging for wireless communications.

Diversity Integrated sensing and communication +1

Faithful Logical Reasoning via Symbolic Chain-of-Thought

1 code implementation 28 May 2024 Jundong Xu, Hao Fei, Liangming Pan, Qian Liu, Mong-Li Lee, Wynne Hsu

Technically, building upon an LLM, SymbCoT 1) first translates the natural language context into the symbolic format, and then 2) derives a step-by-step plan to solve the problem with symbolic logical rules, 3) followed by a verifier to check the translation and reasoning chain.

Logical Reasoning

MANTIS: Interleaved Multi-Image Instruction Tuning

1 code implementation 2 May 2024 Dongfu Jiang, Xuan He, Huaye Zeng, Cong Wei, Max Ku, Qian Liu, Wenhu Chen

We further evaluate Mantis on single-image benchmarks and demonstrate that Mantis also maintains a strong single-image performance on par with CogVLM and Emu2.

ISQA: Informative Factuality Feedback for Scientific Summarization

1 code implementation 20 Apr 2024 Zekai Li, Yanxia Qin, Qian Liu, Min-Yen Kan

We propose Iterative Factuality Refining on Informative Scientific Question-Answering (ISQA) feedback (code is available at https://github.com/lizekai-richard/isqa), a method following human learning theories that employs model-generated feedback consisting of both positive and negative information.

Question Answering

Sailor: Open Language Models for South-East Asia

3 code implementations 4 Apr 2024 Longxu Dou, Qian Liu, Guangtao Zeng, Jia Guo, Jiahui Zhou, Wei Lu, Min Lin

We present Sailor, a family of open language models ranging from 0.5B to 7B parameters, tailored for South-East Asian (SEA) languages.

Language Modeling Language Modelling +2

Beyond Memorization: The Challenge of Random Memory Access in Language Models

1 code implementation 12 Mar 2024 Tongyao Zhu, Qian Liu, Liang Pang, Zhengbao Jiang, Min-Yen Kan, Min Lin

Through carefully-designed synthetic tasks, covering the scenarios of full recitation, selective recitation and grounded question answering, we reveal that LMs manage to sequentially access their memory while encountering challenges in randomly accessing memorized content.

Memorization Open-Domain Question Answering

Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning

1 code implementation 21 Feb 2024 Zhaorui Yang, Tianyu Pang, Haozhe Feng, Han Wang, Wei Chen, Minfeng Zhu, Qian Liu

The surge in Large Language Models (LLMs) has revolutionized natural language processing, but fine-tuning them for specific tasks often encounters challenges in balancing performance and preserving general instruction-following abilities.

Instruction Following Language Modeling +2

EVOR: Evolving Retrieval for Code Generation

1 code implementation 19 Feb 2024 Hongjin Su, Shuyang Jiang, Yuhang Lai, Haoyuan Wu, Boao Shi, Che Liu, Qian Liu, Tao Yu

Recently, retrieval-augmented generation (RAG) has been successfully applied to code generation.

Code Generation RAG +2

Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One

no code implementations 19 Feb 2024 Tianlin Li, XiaoYu Zhang, Chao Du, Tianyu Pang, Qian Liu, Qing Guo, Chao Shen, Yang Liu

Building on this insight and observation, we develop FairThinking, a pipeline designed to automatically generate roles that enable LLMs to articulate diverse perspectives for fair expressions.

Fairness Language Modeling +2

Purifying Large Language Models by Ensembling a Small Language Model

no code implementations 19 Feb 2024 Tianlin Li, Qian Liu, Tianyu Pang, Chao Du, Qing Guo, Yang Liu, Min Lin

The emerging success of large language models (LLMs) heavily relies on collecting abundant training data from external (untrusted) sources.

Data Poisoning Language Modeling +2

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

1 code implementation 13 Feb 2024 Xiangming Gu, Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Ye Wang, Jing Jiang, Min Lin

A multimodal large language model (MLLM) agent can receive instructions, capture images, retrieve histories from memory, and decide which tools to use.

Language Modelling Large Language Model +2

Test-Time Backdoor Attacks on Multimodal Large Language Models

1 code implementation 13 Feb 2024 Dong Lu, Tianyu Pang, Chao Du, Qian Liu, Xianjun Yang, Min Lin

Backdoor attacks are commonly executed by contaminating training data, such that a trigger can activate predetermined harmful effects during the test phase.

Backdoor Attack

Mercury: A Code Efficiency Benchmark for Code Large Language Models

1 code implementation 12 Feb 2024 Mingzhe Du, Anh Tuan Luu, Bin Ji, Qian Liu, See-Kiong Ng

Based on the distribution, we introduce a new metric Beyond, which computes a runtime-percentile-weighted Pass score to reflect functional correctness and code efficiency simultaneously.

Code Generation Computational Efficiency

Ocassionally Secure: A Comparative Analysis of Code Generation Assistants

no code implementations 1 Feb 2024 Ran Elgedawy, John Sadik, Senjuti Dutta, Anuj Gautam, Konstantinos Georgiou, Farzin Gholamrezae, Fujiao Ji, Kyungchan Lim, Qian Liu, Scott Ruoti

Large Language Models (LLMs) are being increasingly utilized in various applications, with code generation being a notable example.

Code Generation

End-to-End Learning for SLP-Based ISAC Systems

no code implementations 11 Jan 2024 Yixian Zheng, Rang Liu, Ming Li, Qian Liu

Integrated sensing and communication (ISAC) is an encouraging wireless technology which can simultaneously perform both radar and communication functionalities by sharing the same transmit waveform, spectral resource, and hardware platform.

Integrated sensing and communication ISAC

A Practical Beamforming Design for Active RIS-assisted MU-MISO Systems

no code implementations 8 Jan 2024 Yun Yang, Zhiping Lu, Ming Li, Rang Liu, Qian Liu

Motivated by this fact, in this paper we first investigate the amplification principle of typical active RIS and propose a more accurate amplification model based on amplifier hardware characteristics.

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

2 code implementations 1 Jan 2024 Terry Yue Zhuo, Armel Zebaze, Nitchakarn Suppattarachai, Leandro von Werra, Harm de Vries, Qian Liu, Niklas Muennighoff

Through investigations across 5 tasks and 8 different datasets encompassing both code comprehension and code generation tasks, we find that FFT generally leads to the best downstream performance across all scales, and PEFT methods differ significantly in their efficacy based on the model scale.

Code Generation parameter-efficient fine-tuning

Joint Sensing and Communication Optimization in Target-Mounted STARS-Assisted Vehicular Networks: A MADRL Approach

no code implementations 17 Nov 2023 Haocheng Zhang, Rang Liu, Ming Li, Wei Wang, Qian Liu

Extensive experimental results confirm the effectiveness of our proposed MADRL framework in improving both sensing and communication performance through the utilization of target-mounted STARS.

Beam Prediction Decision Making +3

S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models

2 code implementations 23 Oct 2023 Fangyu Lei, Qian Liu, Yiming Huang, Shizhu He, Jun Zhao, Kang Liu

The rapid development of Large Language Models (LLMs) has led to great strides in model capabilities like long-context understanding and reasoning.

Long-Context Understanding

OpenAgents: An Open Platform for Language Agents in the Wild

2 code implementations 16 Oct 2023 Tianbao Xie, Fan Zhou, Zhoujun Cheng, Peng Shi, Luoxuan Weng, Yitao Liu, Toh Jing Hua, Junning Zhao, Qian Liu, Che Liu, Leo Z. Liu, Yiheng Xu, Hongjin Su, Dongchan Shin, Caiming Xiong, Tao Yu

Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs).

2D Object Detection

Lemur: Harmonizing Natural Language and Code for Language Agents

1 code implementation 10 Oct 2023 Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu

We introduce Lemur and Lemur-Chat, openly accessible language models optimized for both natural language and coding capabilities to serve as the backbone of versatile language agents.

Cramer-Rao Bound Optimization for Active RIS-Empowered ISAC Systems

no code implementations 17 Sep 2023 Qi Zhu, Ming Li, Rang Liu, Qian Liu

Integrated sensing and communication (ISAC), which simultaneously performs sensing and communication functions within a shared frequency band and hardware platform, has emerged as a promising technology for future wireless systems.

Integrated sensing and communication ISAC +1

OctoPack: Instruction Tuning Code Large Language Models

3 code implementations 14 Aug 2023 Niklas Muennighoff, Qian Liu, Armel Zebaze, Qinkai Zheng, Binyuan Hui, Terry Yue Zhuo, Swayam Singh, Xiangru Tang, Leandro von Werra, Shayne Longpre

We benchmark CommitPack against other natural and synthetic code instructions (xP3x, Self-Instruct, OASST) on the 16B parameter StarCoder model, and achieve state-of-the-art performance among models not trained on OpenAI outputs, on the HumanEval Python benchmark (46.2% pass@1).

Code Generation Code Repair +1

A Novel Joint Angle-Range-Velocity Estimation Method for MIMO-OFDM ISAC Systems

no code implementations 7 Aug 2023 Zichao Xiao, Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst

The proposed joint estimation algorithm can achieve larger signal-to-noise-ratio (SNR) processing gains and higher resolution by fully exploiting the echo signals and jointly estimating the angle-range-velocity information.

Integrated sensing and communication ISAC

SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning

2 code implementations 3 Aug 2023 Keyu Duan, Qian Liu, Tat-Seng Chua, Shuicheng Yan, Wei Tsang Ooi, Qizhe Xie, Junxian He

More recently, with the rapid development of language models (LMs), researchers have focused on leveraging LMs to facilitate the learning of TGs, either by jointly training them in a computationally intensive framework (merging the two stages), or designing complex self-supervised training tasks for feature extraction (enhancing the first stage).

Feature Engineering Graph Learning +4

Fast algorithms for k-submodular maximization subject to a matroid constraint

no code implementations 26 Jul 2023 Shuxian Niu, Qian Liu, Yang Zhou, Min Li

In this paper, we apply a Threshold-Decreasing Algorithm to maximize $k$-submodular functions under a matroid constraint, which reduces the query complexity of the algorithm compared to the greedy algorithm with little loss in approximation ratio.

LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

2 code implementations 25 Jul 2023 Chengsong Huang, Qian Liu, Bill Yuchen Lin, Tianyu Pang, Chao Du, Min Lin

This paper investigates LoRA composability for cross-task generalization and introduces LoraHub, a simple framework devised for the purposive assembly of LoRA modules trained on diverse given tasks, with the objective of achieving adaptable performance on unseen tasks.

In-Context Learning

Automotive Object Detection via Learning Sparse Events by Spiking Neurons

no code implementations 24 Jul 2023 Hu Zhang, Yanchen Li, Luziwei Leng, Kaiwei Che, Qian Liu, Qinghai Guo, Jianxing Liao, Ran Cheng

Traditional object detection techniques that utilize Artificial Neural Networks (ANNs) face challenges due to the sparse and asynchronous nature of the events these sensors capture.

Event-based vision object-detection +1

Low-Range-Sidelobe Waveform Design for MIMO-OFDM ISAC Systems

no code implementations 30 May 2023 Peishi Li, Zichao Xiao, Ming Li, Rang Liu, Qian Liu

Integrated sensing and communication (ISAC) is a promising technology in future wireless systems owing to its efficient hardware and spectrum utilization.

Integrated sensing and communication ISAC

SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables

1 code implementation 22 May 2023 Xinyuan Lu, Liangming Pan, Qian Liu, Preslav Nakov, Min-Yen Kan

Current scientific fact-checking benchmarks exhibit several shortcomings, such as biases arising from crowd-sourced claims and an over-reliance on text-based evidence.

Claim Verification Fact Checking

Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination

1 code implementation 20 May 2023 Hao Fei, Qian Liu, Meishan Zhang, Min Zhang, Tat-Seng Chua

In this work, we investigate a more realistic unsupervised multimodal machine translation (UMMT) setup, inference-time image-free UMMT, where the model is trained with source-text image pairs, and tested with only source-text inputs.

Hallucination Multimodal Machine Translation +1

Reasoning Implicit Sentiment with Chain-of-Thought Prompting

2 code implementations 18 May 2023 Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li, Tat-Seng Chua

While sentiment analysis systems try to determine the sentiment polarities of given targets based on the key opinion expressions in input texts, in implicit sentiment analysis (ISA) the opinion cues come in an implicit and obscure manner.

Common Sense Reasoning Sentiment Analysis

Active Retrieval Augmented Generation

2 code implementations 11 May 2023 Zhengbao Jiang, Frank F. Xu, Luyu Gao, Zhiqing Sun, Qian Liu, Jane Dwivedi-Yu, Yiming Yang, Jamie Callan, Graham Neubig

In this work, we provide a generalized view of active retrieval augmented generation, methods that actively decide when and what to retrieve across the course of the generation.

Retrieval Retrieval-augmented Generation +1

StarCoder: may the source be with you!

4 code implementations 9 May 2023 Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, Benjamin Lipkin, Muhtasham Oblokulov, Zhiruo Wang, Rudra Murthy, Jason Stillerman, Siva Sankalp Patel, Dmitry Abulkhanov, Marco Zocca, Manan Dey, Zhihan Zhang, Nour Fahmy, Urvashi Bhattacharyya, Wenhao Yu, Swayam Singh, Sasha Luccioni, Paulo Villegas, Maxim Kunakov, Fedor Zhdanov, Manuel Romero, Tony Lee, Nadav Timor, Jennifer Ding, Claire Schlesinger, Hailey Schoelkopf, Jan Ebert, Tri Dao, Mayank Mishra, Alex Gu, Jennifer Robinson, Carolyn Jane Anderson, Brendan Dolan-Gavitt, Danish Contractor, Siva Reddy, Daniel Fried, Dzmitry Bahdanau, Yacine Jernite, Carlos Muñoz Ferrandis, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries

The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention.

8k Code Generation +1

From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning

1 code implementation 17 Apr 2023 Qian Liu, Fan Zhou, Zhengbao Jiang, Longxu Dou, Min Lin

Empirical results on various benchmarks validate that the integration of SQL execution leads to significant improvements in zero-shot scenarios, particularly in table reasoning.

MMLU Zero-shot Generalization

RIS-Aided Integrated Sensing and Communication: Joint Beamforming and Reflection Design

no code implementations 22 Feb 2023 Honghao Luo, Rang Liu, Ming Li, Qian Liu

Integrated sensing and communication (ISAC) has been envisioned as a promising technique to alleviate the spectrum congestion problem.

Integrated sensing and communication ISAC

Joint Transceiver Beamforming and Reflecting Design for Active RIS-Aided ISAC Systems

no code implementations 21 Feb 2023 Qi Zhu, Ming Li, Rang Liu, Qian Liu

Integrated sensing and communication (ISAC) is recognized as a promising technology with great potential in saving hardware and spectrum resources, since it simultaneously realizes radar detection and user communication functions in the fully-shared platform.

Integrated sensing and communication ISAC

SNR/CRB-Constrained Joint Beamforming and Reflection Designs for RIS-ISAC Systems

1 code implementation 26 Jan 2023 Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst

Two optimization problems are formulated for maximizing the achievable sum-rate of the multi-user communications under an SNR constraint for target detection or a CRB constraint for parameter estimation, the transmit power budget, and the unit-modulus constraint of the RIS reflection coefficients.

Integrated sensing and communication ISAC +1

Joint Secure Transmit Beamforming Designs for Integrated Sensing and Communication Systems

no code implementations 1 Dec 2022 Jinjin Chu, Rang Liu, Ming Li, Yang Liu, Qian Liu

Integrated sensing and communication (ISAC), which allows individual radar and communication systems to share the same spectrum bands, is an emerging and promising technique for alleviating spectrum congestion problems.

Integrated sensing and communication ISAC

OpenFE: Automated Feature Generation with Expert-level Performance

2 code implementations 22 Nov 2022 Tianping Zhang, Zheyu Zhang, Zhiyuan Fan, Haoyan Luo, Fengyuan Liu, Qian Liu, Wei Cao, Jian Li

In the two competitions, features generated by OpenFE with a simple baseline model can beat 99.3% and 99.6% of data science teams, respectively.

Feature Importance

Learning on Large-scale Text-attributed Graphs via Variational Inference

2 code implementations 26 Oct 2022 Jianan Zhao, Meng Qu, Chaozhuo Li, Hao Yan, Qian Liu, Rui Li, Xing Xie, Jian Tang

In this paper, we propose an efficient and effective solution to learning on large text-attributed graphs by fusing graph structure and language learning with a variational Expectation-Maximization (EM) framework, called GLEM.

Variational Inference

End-to-End Learning for Symbol-Level Precoding and Detection with Adaptive Modulation

no code implementations 25 Oct 2022 Rang Liu, Zhu Bo, Ming Li, Qian Liu

To overcome the performance bottleneck of these approaches, in this letter we propose an end-to-end learning based approach to jointly optimize the modulation orders, the transmit precoding and the receive detection for an SLP communication system.

Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQA

1 code implementation 11 Oct 2022 JunJie Huang, Wanjun Zhong, Qian Liu, Ming Gong, Daxin Jiang, Nan Duan

However, training an effective dense table-text retriever is difficult due to the challenges of table-text discrepancy and data sparsity problem.

Open-Domain Question Answering Representation Learning +1

Joint Beamforming Designs for Active Reconfigurable Intelligent Surface: A Sub-Connected Array Architecture

no code implementations 5 Oct 2022 Qi Zhu, Ming Li, Rang Liu, Yang Liu, Qian Liu

Affected by the "double fading" effect, however, a conventional passive RIS cannot bring considerable performance improvement when users are not close enough to the RIS.

Optimization for Reflection and Transmission Dual-Functional Active RIS-Assisted Systems

no code implementations 5 Sep 2022 Yanan Ma, Ming Li, Yang Liu, Qingqing Wu, Qian Liu

The reconfigurable intelligent surface (RIS) has been deemed one of the potential components of future wireless communication systems because it can adaptively manipulate the wireless propagation environment with low-cost passive devices.

Joint Beamforming Design for Intelligent Omni Surface Assisted Wireless Communication Systems

no code implementations 1 Sep 2022 Wenhao Cai, Ming Li, Yang Liu, Qingqing Wu, Qian Liu

The intelligent reflecting surface (IRS) has been widely considered one of the key enabling techniques for future wireless communication networks owing to its ability to dynamically control the phase shift of reflected electromagnetic (EM) waves to construct a favorable propagation environment.

Non-Cooperative Resource Management for Intelligent Reflecting Surface Aided Networks

no code implementations 1 Sep 2022 Wenhao Cai, Ming Li, Qian Liu

Intelligent reflecting surface (IRS) has emerged as a promising and revolutionizing technology for future wireless networks.

Management

On Grounded Planning for Embodied Tasks with Language Models

no code implementations 29 Aug 2022 Bill Yuchen Lin, Chengsong Huang, Qian Liu, Wenda Gu, Sam Sommerer, Xiang Ren

Language models (LMs) have demonstrated their capability in possessing commonsense knowledge of the physical world, a crucial aspect of performing tasks in everyday life.

Partially Distributed Beamforming Design for RIS-Aided Cell-Free Networks

no code implementations 10 Aug 2022 Pengfei Ni, Ming Li, Rang Liu, Qian Liu

Cell-free networks are regarded as a promising technology to meet higher rate requirements for beyond fifth-generation (5G) communications.

User Association and Hybrid Beamforming Designs for Cooperative mmWave MIMO Systems

no code implementations 10 Aug 2022 Pengfei Ni, Rang Liu, Ming Li, Qian Liu

In an effort to further exploit multiple-antenna diversities, we also consider the dynamic subarray architecture and propose a novel antenna design algorithm for the analog beamforming design.

Joint Beamforming Design for RIS-Assisted Integrated Sensing and Communication Systems

no code implementations 3 Aug 2022 Honghao Luo, Rang Liu, Ming Li, Yang Liu, Qian Liu

Integrated sensing and communication (ISAC) has been envisioned as a promising technology to tackle the spectrum congestion problem for future networks.

Integrated sensing and communication ISAC

Integrated Sensing and Communication with Reconfigurable Intelligent Surfaces: Opportunities, Applications, and Future Directions

no code implementations 17 Jun 2022 Rang Liu, Ming Li, Honghao Luo, Qian Liu, A. Lee Swindlehurst

Integrated sensing and communication (ISAC) is emerging as a key enabler to address the growing spectrum congestion problem and satisfy increasing demands for ubiquitous sensing and communication.

Integrated sensing and communication ISAC

IRS-assisted Multi-cell Multi-band Systems: Practical Reflection Model and Joint Beamforming Design

no code implementations 13 Apr 2022 Wenhao Cai, Rang Liu, Ming Li, Yang Liu, Qingqing Wu, Qian Liu

The intelligent reflecting surface (IRS) has been regarded as a promising and revolutionary technology for future wireless communication systems owing to its capability of tailoring the signal propagation environment in an energy/spectrum/hardware-efficient manner.

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models

no code implementations 7 Mar 2022 Shengnan An, Yifei Li, Zeqi Lin, Qian Liu, Bei Chen, Qiang Fu, Weizhu Chen, Nanning Zheng, Jian-Guang Lou

This motivates us to propose input-tuning, which fine-tunes both the continuous prompts and the input representations, leading to a more effective way to adapt unfamiliar inputs to frozen PLMs.

Language Modeling Language Modelling +2

Reasoning Like Program Executors

1 code implementation 27 Jan 2022 Xinyu Pi, Qian Liu, Bei Chen, Morteza Ziyadi, Zeqi Lin, Qiang Fu, Yan Gao, Jian-Guang Lou, Weizhu Chen

Reasoning over natural language is a long-standing goal for the research community.

Ranked #2 on Question Answering on DROP Test (using extra training data)

Logical Reasoning Math +1

Reasoning over Hybrid Chain for Table-and-Text Open Domain QA

no code implementations 15 Jan 2022 Wanjun Zhong, JunJie Huang, Qian Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan

CARP utilizes a hybrid chain to model the explicit intermediate reasoning process across table and text for question answering.

Open-Domain Question Answering

Image Edge Restoring Filter

no code implementations 27 Dec 2021 Qian Liu, Yongpeng Li, Zhihang Wang

In computer vision, image processing and computer graphics, image smoothing is a basic and important task, and a smoothing filter is expected to have good edge-preserving properties.

Image Denoising Image Enhancement +1

Joint Transmit Waveform and Passive Beamforming Design for RIS-Aided DFRC Systems

no code implementations 16 Dec 2021 Rang Liu, Ming Li, Yang Liu, Qingqing Wu, Qian Liu

Reconfigurable intelligent surface (RIS) is a promising technology for 6G networks owing to its superior ability to enhance the capacity and coverage of wireless communications by smartly creating a favorable propagation environment.

Topological and Algebraic Structures of Atanassov's Intuitionistic Fuzzy-Values Space

no code implementations 17 Nov 2021 Xinxing Wu, Tao Wang, Qian Liu, Peide Liu, Guanrong Chen, Xu Zhang

By introducing a new operator for IFVs via the linear order based on a score function and an accuracy function, we show that such an operator is a strong negation on IFVs.

Negation

Concept-Aware Denoising Graph Neural Network for Micro-Video Recommendation

no code implementations 28 Sep 2021 Yiyu Liu, Qian Liu, Yu Tian, Changping Wang, Yanan Niu, Yang Song, Chenliang Li

In this paper, we propose a novel concept-aware denoising graph neural network (named CONDE) for micro-video recommendation.

Denoising Graph Neural Network

Awakening Latent Grounding from Pretrained Language Models for Semantic Parsing

1 code implementation Findings (ACL) 2021 Qian Liu, Dejian Yang, Jiahui Zhang, Jiaqi Guo, Bin Zhou, Jian-Guang Lou

In recent years, pretrained language models (PLMs) have achieved success on several downstream tasks, showing their power in modeling language.

Text to SQL Text-To-SQL

Dual-Functional Radar-Communication Waveform Design: A Symbol-Level Precoding Approach

no code implementations 11 Aug 2021 Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst

In this paper, we consider multi-input multi-output (MIMO) DFRC systems and focus on transmit beamforming designs to provide both radar sensing and multi-user communications.

Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database Context-Dependent Text-to-SQL

no code implementations ACL 2021 Jiaqi Guo, Ziliang Si, Yu Wang, Qian Liu, Ming Fan, Jian-Guang Lou, Zijiang Yang, Ting Liu

However, we identify two biases in existing datasets for XDTS: (1) a high proportion of context-independent questions and (2) a high proportion of easy SQL queries.

Text to SQL Text-To-SQL

Reflection and Relay Dual-Functional RIS Assisted MU-MISO Systems

no code implementations 24 Jul 2021 Yanan Ma, Rang Liu, Yang Liu, Ming Li, Qian Liu

Reconfigurable intelligent surfaces (RISs) have been deemed one of the potential components of future wireless communication systems because they can adaptively manipulate the wireless propagation environment with low-cost passive devices.

TAPEX: Table Pre-training via Learning a Neural SQL Executor

4 code implementations ICLR 2022 Qian Liu, Bei Chen, Jiaqi Guo, Morteza Ziyadi, Zeqi Lin, Weizhu Chen, Jian-Guang Lou

TAPEX addresses the data scarcity challenge via guiding the language model to mimic a SQL executor on the diverse, large-scale and high-quality synthetic corpus.

Ranked #1 on Semantic Parsing on WikiSQL (Denotation accuracy (test) metric)

Language Modeling Language Modelling +2

BS-RIS-User Association and Beamforming Designs for RIS-aided Cellular Networks

no code implementations 27 Jun 2021 Sifan Liu, Pengfei Ni, Rang Liu, Yang Liu, Ming Li, Qian Liu

During the dynamic access process, an iterative algorithm is proposed to alternately obtain the active and passive beamforming.

Simulation on the Transparency of Electrons and Ion Back Flow for a Time Projection Chamber based on Staggered Multiple THGEMs

no code implementations 16 Feb 2021 Mengzhi Wu, Qian Liu, Ping Li, Shi Chen, Binlong Wang, Wenhan Shen, Shiping Chen, Yangheng Zheng, Yigang Xie, Jin Li

The ion back flow (IBF) and the transparency of electrons are two essential indicators of a TPC, which affect the energy resolution and counting rate respectively.

Instrumentation and Detectors High Energy Physics - Experiment

Intelligent reflecting surface assisted multi-cell multi-band wireless networks

no code implementations 5 Jan 2021 Wenhao Cai, Rang Liu, Yang Liu, Ming Li, Qian Liu

Therefore, the practical phase shift model, which can describe the difference of IRS phase shift responses for the signals with different frequencies, should be utilized in the IRS optimization for wideband and multi-band systems.

Channel Estimation for Practical IRS-Assisted OFDM Systems

no code implementations 25 Dec 2020 Wanning Yang, Hongyu Li, Ming Li, Yang Liu, Qian Liu

Different from the prior works which assume that IRS has an ideal reflection model, we perform channel estimation by considering amplitude-phase shift-frequency relationship for the response of practical IRS.

"What Do You Mean by That?" A Parser-Independent Interactive Approach for Enhancing Text-to-SQL

1 code implementation 9 Nov 2020 Yuntao Li, Bei Chen, Qian Liu, Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang

In Natural Language Interfaces to Databases systems, the text-to-SQL technique allows users to query databases by using natural language questions.

Text to SQL Text-To-SQL

Intelligent Reflecting Surface based Passive Information Transmission: A Symbol-Level Precoding Approach

no code implementations 29 Jul 2020 Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst, Qingqing Wu

Intelligent reflecting surfaces (IRS) have been proposed as a revolutionary technology owing to their capability of adaptively reconfiguring the propagation environment in a cost-effective and hardware-efficient fashion.

Intelligent reflecting surface enhanced wideband MIMO-OFDM communications: From practical model to reflection optimization

no code implementations 26 Jul 2020 Hongyu Li, Wenhao Cai, Yang Liu, Ming Li, Qian Liu, Qingqing Wu

Simulation results demonstrate that the proposed algorithm can offer significant average sum-rate enhancement compared to that achieved using the ideal IRS reflection model, which confirms the importance of the use of the practical model for the design of wideband systems.

Compositional Generalization by Learning Analytical Expressions

1 code implementation NeurIPS 2020 Qian Liu, Shengnan An, Jian-Guang Lou, Bei Chen, Zeqi Lin, Yan Gao, Bin Zhou, Nanning Zheng, Dongmei Zhang

Compositional generalization is a basic and essential intellective capability of human beings, which allows us to recombine known parts readily.

Hierarchical Reinforcement Learning

Practical Modeling and Beamforming for Intelligent Reflecting Surface Aided Wideband Systems

no code implementations 2 Jun 2020 Wenhao Cai, Hongyu Li, Ming Li, Qian Liu

In this letter, we aim to investigate the phase-amplitude-frequency relationship of the reflected signals and propose a practical model of reflection coefficient for an IRS-aided wideband system.

A Pairwise Probe for Understanding BERT Fine-Tuning on Machine Reading Comprehension

no code implementations 2 Jun 2020 Jie Cai, Zhengzhou Zhu, Ping Nie, Qian Liu

In this paper, inspired by the observation that most probing tasks involve identifying matched pairs of phrases (e.g., coreference requires matching an entity and a pronoun), we propose a pairwise probe to understand BERT fine-tuning on the machine reading comprehension (MRC) task.

Boundary Detection coreference-resolution +1

You Impress Me: Dialogue Generation via Mutual Persona Perception

1 code implementation ACL 2020 Qian Liu, Yihong Chen, Bei Chen, Jian-Guang Lou, Zixuan Chen, Bin Zhou, Dongmei Zhang

Despite the continuing efforts to improve the engagingness and consistency of chit-chat dialogue systems, the majority of current work simply focuses on mimicking human-like responses, leaving understudied the aspects of modeling understanding between interlocutors.

Ranked #2 on Dialogue Generation on Persona-Chat (using extra training data)

Dialogue Generation

CTM: Collaborative Temporal Modeling for Action Recognition

no code implementations 8 Feb 2020 Qian Liu, Tao Wang, Jie Liu, Yang Guan, Qi Bu, Longfei Yang

In order to learn powerful features of videos, we propose a Collaborative Temporal Modeling (CTM) block (Figure 1) to learn temporal information for action recognition.

Action Recognition Video Understanding

iqiyi Submission to ActivityNet Challenge 2019 Kinetics-700 challenge: Hierarchical Group-wise Attention

no code implementations 7 Feb 2020 Qian Liu, Dongyang Cai, Jie Liu, Nan Ding, Tao Wang

The standard non-local (NL) module is effective in aggregating frame-level features on the task of video classification but presents low parameter efficiency and high computational cost.

General Classification Video Classification

How Far are We from Effective Context Modeling? An Exploratory Study on Semantic Parsing in Context

1 code implementation 3 Feb 2020 Qian Liu, Bei Chen, Jiaqi Guo, Jian-Guang Lou, Bin Zhou, Dongmei Zhang

Semantic parsing in context has recently received considerable attention; it is challenging because of the complex contextual phenomena involved.

Semantic Parsing

A Split-and-Recombine Approach for Follow-up Query Analysis

1 code implementation IJCNLP 2019 Qian Liu, Bei Chen, Haoyan Liu, Lei Fang, Jian-Guang Lou, Bin Zhou, Dongmei Zhang

To leverage the advances in context-independent semantic parsing, we propose to perform follow-up query analysis, aiming to restate context-dependent natural language queries with contextual information.

Natural Language Queries Semantic Parsing

Symbol-Level Precoding Design for Intelligent Reflecting Surface Assisted Multi-user MIMO Systems

no code implementations 3 Sep 2019 Rang Liu, Hongyu Li, Ming Li, Qian Liu

In this paper we investigate the problem of precoder design for a low-resolution IRS-based transmitter to implement multi-user MISO/MIMO wireless communications.

Quantization

FANDA: A Novel Approach to Perform Follow-up Query Analysis

1 code implementation 24 Jan 2019 Qian Liu, Bei Chen, Jian-Guang Lou, Ge Jin, Dongmei Zhang

Natural Language Interfaces to Databases (NLIDB) allow users to search databases using natural language instead of SQL-like query languages.

Noisy Softplus: an activation function that enables SNNs to be trained as ANNs

no code implementations 31 Mar 2017 Qian Liu, Yunhua Chen, Steve Furber

We extend the previously proposed activation function, Noisy Softplus, to fit the training of layered spiking neural networks (SNNs).

General Classification

Biclustering Via Sparse Clustering

no code implementations 11 Jul 2014 Qian Liu, Guanhua Chen, Michael R. Kosorok, Eric Bair

This framework can be used to identify biclusters that differ with respect to the means of the features, the variance of the features, or more general differences.

Clustering
