Search Results for author: Jie Huang

Found 81 papers, 34 papers with code

Video-based Sign Language Recognition without Temporal Segmentation

no code implementations30 Jan 2018 Jie Huang, Wengang Zhou, Qilin Zhang, Houqiang Li, Weiping Li

Worse still, isolated SLR methods typically require strenuous labeling of each word separately in a sentence, severely limiting the amount of attainable training data.

Segmentation Sentence +1

Hyper-Path-Based Representation Learning for Hyper-Networks

1 code implementation24 Aug 2019 Jie Huang, Xin Liu, Yangqiu Song

Then a carefully designed algorithm, Hyper-gram, utilizes these random walks to capture both pairwise relationships and tuplewise relationships in the whole hyper-networks.

Link Prediction Representation Learning

A Big Data Enabled Channel Model for 5G Wireless Communication Systems

no code implementations28 Feb 2020 Jie Huang, Cheng-Xiang Wang, Lu Bai, Jian Sun, Yang Yang, Jie Li, Olav Tirkkonen, Ming-Tuo Zhou

This paper investigates various applications of big data analytics, especially machine learning algorithms in wireless communications and channel modeling.

BIG-bench Machine Learning

Multi-Frequency Multi-Scenario Millimeter Wave MIMO Channel Measurements and Modeling for B5G Wireless Communication Systems

no code implementations28 Jul 2020 Jie Huang, Cheng-Xiang Wang, Hengtai Chang, Jian Sun, Xiqi Gao

Millimeter wave (mmWave) bands have been utilized for the fifth generation (5G) communication systems and will no doubt continue to be deployed for beyond 5G (B5G).

6G Oriented Wireless Communication Channel Characteristics Analysis and Modeling

no code implementations28 Jul 2020 Cheng-Xiang Wang, Jie Huang, Haiming Wang, Xiqi Gao, Xiaohu You, Yang Hao

Based on the vision on the 6G wireless communication network, i. e., global coverage, all spectrums and all applications, we comprehensively survey 6G related wireless channel measurements, channel characteristics, and channel models for all frequency bands and all scenarios.

A 3D Non-Stationary Channel Model for 6G Wireless Systems Employing Intelligent Reflecting Surface

no code implementations3 Dec 2020 Yingzhuo Sun, Cheng-Xiang Wang, Jie Huang, Jun Wang

The evolution of clusters on the linear array and planar array is also considered in the proposed model.

A General 3D Space-Time-Frequency Non-Stationary THz Channel Model for 6G Ultra-Massive MIMO Wireless Communication Systems

no code implementations20 Apr 2021 Jun Wang, Cheng-Xiang Wang, Jie Huang, Haiming Wang, Xiqi Gao

The proposed THz channel model is very general having the capability to capture different channel characteristics in multiple THz application scenarios such as indoor scenarios, device-to-device (D2D) communications, ultra-massive multiple-input multiple-output (MIMO) communications, and long traveling paths of users.

A Multi-Size Neural Network with Attention Mechanism for Answer Selection

no code implementations24 Apr 2021 Jie Huang

The experimental results show that (1) multi-size neural network (MSNN) is a more useful method to capture abstract features on different levels of granularities than single/multi-layer CNNs; (2) the attention mechanism (AM) is a better strategy to derive more informative representations; (3) AM-MSNN is a better architecture for the answer selection task for the moment.

Answer Selection Sentence

A 3D Non-Stationary Channel Model for 6G Wireless Systems Employing Intelligent Reflecting Surfaces with Practical Phase Shifts

no code implementations25 Apr 2021 Yingzhuo Sun, Cheng-Xiang Wang, Jie Huang, Jun Wang

In this paper, a three-dimensional (3D) geometry based stochastic model (GBSM) for a massive multiple-input multiple-output (MIMO) communication system employing practical discrete intelligent reflecting surface (IRS) is proposed.

Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach

1 code implementation ACL 2021 Jie Huang, Kevin Chen-Chuan Chang, JinJun Xiong, Wen-mei Hwu

To support a fine-grained domain without relying on a matching corpus for supervision, we develop hierarchical core-fringe learning, which learns core and fringe terms jointly in a semi-supervised manner contextualized in the hierarchy of the domain.

Space-Time Distillation for Video Super-Resolution

no code implementations CVPR 2021 Zeyu Xiao, Xueyang Fu, Jie Huang, Zhen Cheng, Zhiwei Xiong

In this paper, we aim to improve the performance of compact VSR networks without changing their original architectures, through a knowledge distillation approach that transfers knowledge from a complicated VSR network to a compact one.

Knowledge Distillation Video Super-Resolution

Three-dimensional instantaneous orbit map for rotor-bearing system based on a novel multivariate complex variational mode decomposition algorithm

no code implementations29 Jul 2021 Xiaolong Cui, Jie Huang, Chaoshun Li, Yujie Zhao

It can simultaneously extract the forward and backward components of multiple bearing sections and realize non-stationary complex signal decomposition of multiple bearing sections of the rotor.

A Novel 3D Non-Stationary GBSM for 6G THz Ultra-Massive MIMO Wireless Systems

no code implementations14 Aug 2021 Jun Wang, Cheng-Xiang Wang, Jie Huang, Haiming Wang, Xiqi Gao, Xiaohu You, Yang Hao

Terahertz (THz) communication is now being considered as one of possible technologies for the sixth generation (6G) wireless communication systems.

Multi-Frequency Wireless Channel Measurements and Characteristics Analysis in Indoor Corridor Scenarios

no code implementations14 Aug 2021 ZiHao Zhou, Li Zhang, Xinyue Chen, Cheng-Xiang Wang, Jie Huang

In this paper, we conduct wireless channel measurements in indoor corridor scenarios at 2. 4, 5 and 6 GHz bands with bandwidth of 320 MHz.

Open Relation Modeling: Learning to Define Relations between Entities

1 code implementation Findings (ACL) 2022 Jie Huang, Kevin Chen-Chuan Chang, JinJun Xiong, Wen-mei Hwu

Relations between entities can be represented by different instances, e. g., a sentence containing both entities or a fact in a Knowledge Graph (KG).

Open Relation Modeling Relation +1

Understanding Jargon: Combining Extraction and Generation for Definition Modeling

1 code implementation14 Nov 2021 Jie Huang, Hanyin Shao, Kevin Chen-Chuan Chang, JinJun Xiong, Wen-mei Hwu

From the composition of this phrase, machines may guess twin prime is a certain kind of prime, but it is still difficult to deduce exactly what twin stands for without additional knowledge.

Text Generation

Exposure Normalization and Compensation for Multiple-Exposure Correction

no code implementations CVPR 2022 Jie Huang, Yajing Liu, Xueyang Fu, Man Zhou, Yang Wang, Feng Zhao, Zhiwei Xiong

However, the procedures of correcting underexposure and overexposure to normal exposures are much different from each other, leading to large discrepancies for the network in correcting multiple exposures, thus resulting in poor performance.

Image Enhancement

Bijective Mapping Network for Shadow Removal

2 code implementations CVPR 2022 Yurui Zhu, Jie Huang, Xueyang Fu, Feng Zhao, Qibin Sun, Zheng-Jun Zha

Shadow removal, which aims to restore the background in the shadow regions, is challenging due to the highly ill-posed nature.

Shadow Removal

Mutual Information-Driven Pan-Sharpening

no code implementations CVPR 2022 Man Zhou, Keyu Yan, Jie Huang, Zihe Yang, Xueyang Fu, Feng Zhao

Despite the remarkable progress, existing state-of-the-art Pan-sharpening methods don't explicitly enforce the complementary information learning between two modalities of PAN and MS images.

CD-GAN: a robust fusion-based generative adversarial network for unsupervised remote sensing change detection with heterogeneous sensors

no code implementations2 Mar 2022 Jin-Ju Wang, Nicolas Dobigeon, Marie Chabert, Ding-Cheng Wang, Ting-Zhu Huang, Jie Huang

In the context of Earth observation, change detection boils down to comparing images acquired at different times by sensors of possibly different spatial and/or spectral resolutions or different modalities (e. g., optical or radar).

Change Detection Earth Observation +1

Domain Representative Keywords Selection: A Probabilistic Approach

1 code implementation Findings (ACL) 2022 Pritom Saha Akash, Jie Huang, Kevin Chen-Chuan Chang, Yunyao Li, Lucian Popa, ChengXiang Zhai

We propose a probabilistic approach to select a subset of a \textit{target domain representative keywords} from a candidate set, contrasting with a context domain.

A Novel 3D Non-Stationary Channel Model for 6G Indoor Visible Light Communication Systems

no code implementations6 Apr 2022 Xiuming Zhu, Cheng-Xiang Wang, Jie Huang, Ming Chen, Harald Haas

The visible light communication (VLC) technology has attracted much attention in the research of the sixth generation (6G) communication systems.

An unsupervised approach for semantic place annotation of trajectories based on the prior probability

no code implementations20 Apr 2022 Junyi Cheng, Xianfeng Zhang, Peng Luo, Jie Huang, Jianfeng Huang

The Bayesian Criterion is specifically employed to decompose the spatiotemporal probability of the candidate place into spatial probability, duration probability, and visiting time probability.

DEER: Descriptive Knowledge Graph for Explaining Entity Relationships

1 code implementation21 May 2022 Jie Huang, Kerui Zhu, Kevin Chen-Chuan Chang, JinJun Xiong, Wen-mei Hwu

Experiments demonstrate that our system can extract and generate high-quality relation descriptions for explaining entity relationships.

BIG-bench Machine Learning Descriptive +4

Reconfigurable intelligent surfaces: Channel characterization and modeling

no code implementations6 Jun 2022 Jie Huang, Cheng-Xiang Wang, Yingzhuo Sun, Rui Feng, Jialing Huang, Bolun Guo, Zhimeng Zhong, Tie Jun Cui

Reconfigurable intelligent surfaces (RISs) are two dimensional (2D) metasurfaces which can intelligently manipulate electromagnetic waves by low-cost near passive reflecting elements.

Source-Free Domain Adaptation for Real-world Image Dehazing

no code implementations14 Jul 2022 Hu Yu, Jie Huang, Yajing Liu, Qi Zhu, Man Zhou, Feng Zhao

Although certain Domain Adaptation (DA) dehazing methods have been presented, they inevitably require access to the source dataset to reduce the gap between the source synthetic and target real domains.

Image Dehazing Source-Free Domain Adaptation +1

An SBR Based Ray Tracing Channel Modeling Method for THz and Massive MIMO Communications

no code implementations22 Aug 2022 Yuanzhe Wang, Hao Cao, Yifan Jin, Zizhe Zhou, Yinghua Wang, Jialing Huang, Yuxiao Li, Jie Huang, Cheng-Xiang Wang

Terahertz (THz) communication and the application of massive multiple-input multiple-output (MIMO) technology have been proved significant for the sixth generation (6G) communication systems, and have gained global interests.

An Improved Equiangular Division Algorithm for SBR based Ray Tracing Channel Modeling

no code implementations22 Aug 2022 Yuyang Zhou, Yinghua Wang, Yuxiao Li, Jialing Huang, Jie Huang, Cheng-Xiang Wang

With the proposed iterative precise algorithm, error of angle of departure (AOD) and angle of arrival (AOA) is below 0. 01 degree.

A Weighted Random Forest Based PositioningAlgorithm for 6G Indoor Communications

no code implementations22 Aug 2022 Yang Wu, Yinghua Wang, Jie Huang, Cheng-Xiang Wang, Chen Huang

Due to the indoor none-line-of-sight (NLoS) propagation and multi-access interference (MAI), it is a great challenge to achieve centimeter-level positioning accuracy in indoor scenarios.

CNSNet: A Cleanness-Navigated-Shadow Network for Shadow Removal

no code implementations6 Sep 2022 Qianhao Yu, Naishan Zheng, Jie Huang, Feng Zhao

The key to shadow removal is recovering the contents of the shadow regions with the guidance of the non-shadow regions.

Long-range modeling Shadow Removal

Deep Fourier Up-Sampling

1 code implementation11 Oct 2022 Man Zhou, Hu Yu, Jie Huang, Feng Zhao, Jinwei Gu, Chen Change Loy, Deyu Meng, Chongyi Li

Existing convolutional neural networks widely adopt spatial down-/up-sampling for multi-scale modeling.

Image Dehazing Image Segmentation +4

Can Language Models Be Specific? How?

1 code implementation11 Oct 2022 Jie Huang, Kevin Chen-Chuan Chang, JinJun Xiong, Wen-mei Hwu

We hope this work can bring to awareness the notion of specificity of language models and encourage the research community to further explore this important but understudied problem.

Language Modelling Specificity

Panchromatic and Multispectral Image Fusion via Alternating Reverse Filtering Network

no code implementations15 Oct 2022 Keyu Yan, Man Zhou, Jie Huang, Feng Zhao, Chengjun Xie, Chongyi Li, Danfeng Hong

Panchromatic (PAN) and multi-spectral (MS) image fusion, named Pan-sharpening, refers to super-resolve the low-resolution (LR) multi-spectral (MS) images in the spatial domain to generate the expected high-resolution (HR) MS images, conditioning on the corresponding high-resolution PAN images.

Coordinated Topic Modeling

no code implementations16 Oct 2022 Pritom Saha Akash, Jie Huang, Kevin Chen-Chuan Chang

It then uses the axes to model a corpus for easily understandable representation.

VER: Unifying Verbalizing Entities and Relations

1 code implementation20 Nov 2022 Jie Huang, Kevin Chen-Chuan Chang

To know the relationship between two entities, humans tend to create a sentence to connect them.

Sentence

DimonGen: Diversified Generative Commonsense Reasoning for Explaining Concept Relationships

1 code implementation20 Dec 2022 Chenzhengyi Liu, Jie Huang, Kerui Zhu, Kevin Chen-Chuan Chang

MoREE consists of a mixture of retrievers model that retrieves diverse context sentences related to the given concepts, and a mixture of generators model that generates diverse sentences based on the retrieved contexts.

Retrieval

Towards Reasoning in Large Language Models: A Survey

1 code implementation20 Dec 2022 Jie Huang, Kevin Chen-Chuan Chang

Reasoning is a fundamental aspect of human intelligence that plays a crucial role in activities such as problem solving, decision making, and critical thinking.

Decision Making

Ingredient-Oriented Multi-Degradation Learning for Image Restoration

1 code implementation CVPR 2023 Jinghao Zhang, Jie Huang, Mingde Yao, Zizheng Yang, Hu Yu, Man Zhou, Feng Zhao

Learning to leverage the relationship among diverse image restoration tasks is quite beneficial for unraveling the intrinsic ingredients behind the degradation.

Image Restoration

Learning Sample Relationship for Exposure Correction

no code implementations CVPR 2023 Jie Huang, Feng Zhao, Man Zhou, Jie Xiao, Naishan Zheng, Kaiwen Zheng, Zhiwei Xiong

Exposure correction task aims to correct the underexposure and its adverse overexposure images to the normal exposure in a single network.

Task 2

Unlocking Masked Autoencoders as Loss Function for Image and Video Restoration

no code implementations29 Mar 2023 Man Zhou, Naishan Zheng, Jie Huang, Chunle Guo, Chongyi Li

We investigate the efficacy of our belief from three perspectives: 1) from task-customized MAE to native MAE, 2) from image task to video task, and 3) from transformer structure to convolution neural network structure.

Image Denoising Image Enhancement +4

Random Weights Networks Work as Loss Prior Constraint for Image Restoration

no code implementations29 Mar 2023 Man Zhou, Naishan Zheng, Jie Huang, Xiangyu Rui, Chunle Guo, Deyu Meng, Chongyi Li, Jinwei Gu

In this paper, orthogonal to the existing data and model studies, we instead resort our efforts to investigate the potential of loss function in a new perspective and present our belief ``Random Weights Networks can Be Acted as Loss Prior Constraint for Image Restoration''.

Image Restoration Image Super-Resolution +1

Region-Aware Portrait Retouching with Sparse Interactive Guidance

1 code implementation8 Apr 2023 Huimin Zeng, Jie Huang, Jiacheng Li, Zhiwei Xiong

Specifically, we propose a region-aware retouching framework with two branches: an automatic branch and an interactive branch.

Multi-step Jailbreaking Privacy Attacks on ChatGPT

1 code implementation11 Apr 2023 Haoran Li, Dadi Guo, Wei Fan, Mingshi Xu, Jie Huang, Fanpu Meng, Yangqiu Song

With the rapid progress of large language models (LLMs), many downstream NLP tasks can be well solved given appropriate prompts.

Why Does ChatGPT Fall Short in Providing Truthful Answers?

no code implementations20 Apr 2023 Shen Zheng, Jie Huang, Kevin Chen-Chuan Chang

To better understand the model's particular weaknesses in providing truthful answers, we embark an in-depth exploration of open-domain question answering.

Memorization Open-Domain Question Answering +1

CCGen: Explainable Complementary Concept Generation in E-Commerce

no code implementations19 May 2023 Jie Huang, Yifan Gao, Zheng Li, Jingfeng Yang, Yangqiu Song, Chao Zhang, Zining Zhu, Haoming Jiang, Kevin Chen-Chuan Chang, Bing Yin

We propose and study Complementary Concept Generation (CCGen): given a concept of interest, e. g., "Digital Cameras", generating a list of complementary concepts, e. g., 1) Camera Lenses 2) Batteries 3) Camera Cases 4) Memory Cards 5) Battery Chargers.

Quantifying Association Capabilities of Large Language Models and Its Implications on Privacy Leakage

1 code implementation22 May 2023 Hanyin Shao, Jie Huang, Shen Zheng, Kevin Chen-Chuan Chang

The advancement of large language models (LLMs) brings notable improvements across various applications, while simultaneously raising concerns about potential private data exposure.

Mastering the ABCDs of Complex Questions: Answer-Based Claim Decomposition for Fine-grained Self-Evaluation

no code implementations24 May 2023 Nishant Balepur, Jie Huang, Samraj Moorjani, Hari Sundaram, Kevin Chen-Chuan Chang

When answering complex questions, large language models (LLMs) may produce answers that do not satisfy all criteria of the question.

Citation: A Key to Building Responsible and Accountable Large Language Models

no code implementations5 Jul 2023 Jie Huang, Kevin Chen-Chuan Chang

Despite the complexity of implementing such a citation mechanism, along with the potential pitfalls, we advocate for its development.

Position

Unsupervised Image Denoising in Real-World Scenarios via Self-Collaboration Parallel Generative Adversarial Branches

1 code implementation ICCV 2023 Xin Lin, Chao Ren, Xiao Liu, Jie Huang, Yinjie Lei

Although unsupervised approaches based on generative adversarial networks offer a promising solution for denoising without paired datasets, they are difficult in surpassing the performance limitations of conventional GAN-based unsupervised frameworks without significantly modifying existing structures or increasing the computational complexity of denoisers.

Image Denoising

High-quality Image Dehazing with Diffusion Model

1 code implementation23 Aug 2023 Hu Yu, Jie Huang, Kaiwen Zheng, Feng Zhao

The latter stage exploits the strong generation ability of DDPM to compensate for the haze-induced huge information loss, by working in conjunction with the physical modelling.

Denoising Image Dehazing

Generalized Lightness Adaptation with Channel Selective Normalization

1 code implementation ICCV 2023 Mingde Yao, Jie Huang, Xin Jin, Ruikang Xu, Shenglong Zhou, Man Zhou, Zhiwei Xiong

Existing methods typically work well on their trained lightness conditions but perform poorly in unknown ones due to their limited generalization ability.

Image Retouching inverse tone mapping +3

Situated Natural Language Explanations

no code implementations27 Aug 2023 Zining Zhu, Haoming Jiang, Jingfeng Yang, Sreyashi Nag, Chao Zhang, Jie Huang, Yifan Gao, Frank Rudzicz, Bing Yin

Situated NLE provides a perspective and facilitates further research on the generation and evaluation of explanations.

Prompt Engineering

Bearing-based Formation with Disturbance Rejection

no code implementations29 Aug 2023 Haoshu Cheng, Jie Huang

For the case of the known frequencies, we employ the canonical internal model to solve the problem, and, for the case of the unknown frequencies, we combine the canonical internal model and {some} distributed adaptive control technique to deal with the problem.

Continuous Control

Learned Image Reasoning Prior Penetrates Deep Unfolding Network for Panchromatic and Multi-Spectral Image Fusion

no code implementations ICCV 2023 Man Zhou, Jie Huang, Naishan Zheng, Chongyi Li

Such designs penetrate the image reasoning prior into deep unfolding networks while improving its interpretability and representation capability.

Empowering Low-Light Image Enhancer through Customized Learnable Priors

1 code implementation ICCV 2023 Naishan Zheng, Man Zhou, Yanmeng Dong, Xiangyu Rui, Jie Huang, Chongyi Li, Feng Zhao

In this work, we propose a paradigm for low-light image enhancement that explores the potential of customized learnable priors to improve the transparency of the deep unfolding paradigm.

Low-Light Image Enhancement

Large Language Models Cannot Self-Correct Reasoning Yet

1 code implementation3 Oct 2023 Jie Huang, Xinyun Chen, Swaroop Mishra, Huaixiu Steven Zheng, Adams Wei Yu, Xinying Song, Denny Zhou

Large Language Models (LLMs) have emerged as a groundbreaking technology with their unparalleled text generation capabilities across various applications.

Text Generation

Debias the Training of Diffusion Models

no code implementations12 Oct 2023 Hu Yu, Li Shen, Jie Huang, Man Zhou, Hongsheng Li, Feng Zhao

Diffusion models have demonstrated compelling generation quality by optimizing the variational lower bound through a simple denoising score matching loss.

Denoising

Configuration Validation with Large Language Models

1 code implementation15 Oct 2023 Xinyu Lian, Yinfang Chen, Runxiang Cheng, Jie Huang, Parth Thakkar, Minjia Zhang, Tianyin Xu

We present a first analysis on the feasibility and effectiveness of using LLMs for configuration validation.

Code Generation Few-Shot Learning +3

Descriptive Knowledge Graph in Biomedical Domain

no code implementations18 Oct 2023 Kerui Zhu, Jie Huang, Kevin Chen-Chuan Chang

We present a novel system that automatically extracts and generates informative and descriptive sentences from the biomedical corpus and facilitates the efficient search for relational knowledge.

Descriptive

Neural Degradation Representation Learning for All-In-One Image Restoration

1 code implementation19 Oct 2023 Mingde Yao, Ruikang Xu, Yuanshen Guan, Jie Huang, Zhiwei Xiong

To this end, we propose to learn a neural degradation representation (NDR) that captures the underlying characteristics of various degradations.

Image Restoration Representation Learning

Ask To The Point: Open-Domain Entity-Centric Question Generation

1 code implementation21 Oct 2023 Yuxiang Liu, Jie Huang, Kevin Chen-Chuan Chang

We introduce a new task called *entity-centric question generation* (ECQG), motivated by real-world applications such as topic-specific learning, assisted reading, and fact-checking.

Fact Checking Question Generation +1

Text Fact Transfer

1 code implementation23 Oct 2023 Nishant Balepur, Jie Huang, Kevin Chen-Chuan Chang

Text style transfer is a prominent task that aims to control the style of text without inherently changing its factual content.

Question Answering Question Generation +4

Let the Pretrained Language Models "Imagine" for Short Texts Topic Modeling

no code implementations24 Oct 2023 Pritom Saha Akash, Jie Huang, Kevin Chen-Chuan Chang

Besides, we provide a simple solution extending a neural topic model to reduce the effect of noisy out-of-topics text generation from PLMs.

Text Generation Topic Models

A Novel 3D Non-stationary Localization-assisted ISAC Channel Model

no code implementations28 Nov 2023 Runruo Yang, Yang Wu, Jie Huang, Cheng-Xiang Wang

Integrated sensing and communication (ISAC) has attracted wide attention as an emerging application scenario for the sixth generation (6G) wireless communication system.

A WINNER+ Based 3-D Non-Stationary Wideband MIMO Channel Model

no code implementations1 Dec 2023 Ji Bian, Jian Sun, Cheng-Xiang Wang, Rui Feng, Jie Huang, Yang Yang, Minggao Zhang

In this paper, a three-dimensional (3-D) non-stationary wideband multiple-input multiple-output (MIMO) channel model based on the WINNER+ channel model is proposed.

Decoupling Degradation and Content Processing for Adverse Weather Image Restoration

no code implementations8 Dec 2023 Xi Wang, Xueyang Fu, Peng-Tao Jiang, Jie Huang, Mi Zhou, Bo Li, Zheng-Jun Zha

The former facilitates channel-dependent degradation removal operation, allowing the network to tailor responses to various adverse weather types; the latter, by integrating Fourier's global properties into channel-independent content features, enhances network capacity for consistent global content reconstruction.

Image Restoration

Cascade Speculative Drafting for Even Faster LLM Inference

1 code implementation18 Dec 2023 Ziyi Chen, Xiaocong Yang, Jiacheng Lin, Chenkai Sun, Kevin Chen-Chuan Chang, Jie Huang

Introduced to enhance the efficiency of large language model (LLM) inference, speculative decoding operates by having a smaller model generate a draft.

Language Modelling Large Language Model

Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents

1 code implementation4 Mar 2024 YiFan Song, Da Yin, Xiang Yue, Jie Huang, Sujian Li, Bill Yuchen Lin

This iterative cycle of exploration and training fosters continued improvement in the agents.

Contrastive Learning

Long-form factuality in large language models

2 code implementations27 Mar 2024 Jerry Wei, Chengrun Yang, Xinying Song, Yifeng Lu, Nathan Hu, Jie Huang, Dustin Tran, Daiyi Peng, Ruibo Liu, Da Huang, Cosmo Du, Quoc V. Le

Empirically, we demonstrate that LLM agents can outperform crowdsourced human annotators - on a set of ~16k individual facts, SAFE agrees with crowdsourced human annotators 72% of the time, and on a random subset of 100 disagreement cases, SAFE wins 76% of the time.

16k

Cannot find the paper you are looking for? You can Submit a new open access paper.